The End of Hardware Book - Augmented Reality

Augmented Reality is more than Virtual Reality

Towards Wearable,
High-Performance
Augmented Reality Displays

All rights reserved. Patents pending¹ on approaches presented in this paper.


Note: Supplemental materials, the Blender model used for the simulations, and a set of video tutorials can be obtained (for non-profit use only) from the website of the book "Displays - Fundamentals and Applications, 2nd edition" (see "material" page), or just here from this page.

Abstract

A novel configuration for an augmented reality (AR) display and optics is introduced, consisting of a curved, transparent display and a curved mirror. The image is seen through the back side of the transparent display, whose image is reflected and focused towards the user’s eye by the mirror.
It is demonstrated that this configuration delivers not only a very large field of view and high resolution but also, contrary to what one would normally expect, a very large eyebox.

It is shown that this approach will allow for high performance AR displays hardly any bigger or heavier than a pair of normal, light-weight, state-of-the-art prescription glasses, and that they can be worn using the same convenient type of frame, due to their light weight and large eyebox.

First in this paper, a more conventional design is analyzed, using only a curved display and one mirror in an off-axis configuration. An ideal optical design for this, using ellipsoid shapes, is presented. It is shown that the off-axis configuration by itself prevents any better performance. By introducing a transparent display, a coaxial (“on-axis”) configuration with spherical displays and mirrors is enabled, with vastly better performance and a virtually unlimited field of view (FOV).

Furthermore, an iterative synthesis method for mirror shapes is shown, and applied to the synthesis of mirrors for coaxial configurations with planar or cylindrical displays, also yielding results better than with most conventional approaches. The designs are analyzed by high-precision 3D raytracing, which allows simulating not only their optical performance but also their physical appearance.

Introduction

Current augmented reality (AR) displays usually are bulky and need special fixtures to the head, not only because of their weight but mainly because of their small eyeboxes (the range in which the eye can move without losing parts of the field of view (FOV), resolution, or even the entire optical contact). Also the FOVs are normally quite narrow anyway. Smaller, eyeglass-like models so far were data displays, with even smaller FOVs.

The most promising approach up until now has been waveguide displays. Simple ‘glass slabs’ generating images seem quite elegant at first glance. The Bragg or holographic optics involved, however, are direction and wavelength selective, need to be up to 2 mm thick to achieve a high resolution, and usually three separate layers are needed, one for each base color. While the additional weight is not very big, said constraints are limiting the FOV (40° at most for waveguides), and even more so the eyebox, a problem also affecting other current developments with holographic optics.

A common assumption so far has been (see, e.g., [6]), that with near-eye displays, providing an image to a user by means of conventional optical elements will either result in a very small field of view or will require very complex and bulky assemblies.

The concept presented here, in turn, provides for compact, light-weight AR glasses hardly bigger and not heavier than the most advanced prescription eyeglasses, and this with a very large FOV and a large eyebox, making them just as wearable and acceptable for everybody. The approach requires transparent and, in most of the variants, flexible displays, but these are achievable, and corresponding examples have been presented for years.

Fig. 1 shows a pair of state-of-the-art prescription eyeglasses with high-tech plastics lenses and a titanium frame. The entire assembly weighs just 12 g.

Fig. 2 compares this to one of the novel AR display assemblies proposed, including an integrated prescription lens. It is almost as compact and may even be lighter than the eyeglasses, and it has a large eyebox, so the same, convenient type of frame can be used.
Even this very simple, flat display assembly offers a FOV of 40° with good resolution and a lot more beyond that. Adding even only a slight display curvature, moreover, would improve FOV and peripheral resolution figures significantly. The ideal optics assembly uses spherical display and mirror shapes, yielding virtually unlimited FOV and resolution, while maintaining the large eyebox. The best configuration may lie somewhere in between, depending on application requirements.

Fig. 3 shows a photographic rendering of the flat and spherical display variants.

Fig. 1: An example of conventional state-of-the-art prescription glasses. Total weight: 11g.



Fig. 2: Left: the flat display variant proposed, including an optical prescription lens.
Right: the state-of-the-art prescription eyeglasses assembly of Fig.1 (lens diameter is 50 mm).

Fig. 3: Left: flat display and dome mirror, exactly the size of the glasses of Fig. 1.
Right: spherical displays and mirrors, yielding the ultimate performance.

Resolution Requirements for AR Glasses

At this point, we may take a look at the requirements for display resolution. Fig. 4 shows an average person’s eye resolution as it decreases from the center of view. At the very center, persons with good eyesight resolve up to 1 arcmin. This is the target figure for perfect displays. Please note that in practice, lower resolutions may also be perceived as crisp, if detail contrast is high. Up to 2 arcmin. of resolution, an image may therefore be perceived as perfectly crisp. Moreover, the maximum resolution needs to be delivered only for the very point the eye is currently looking at. From there, it can be allowed to decline according to Fig. 4. The optical design only has to provide that the crisp area follows the eye rotation.

For the outer ranges of very wide fields of view, even lower requirements apply, as the human eye can only rotate up to a certain limit. The rotation limit is approx. ±45° horizontal and vertical. Hence, outside of an approx. ±45° range (90° FOV), we may allow for a decreasing display resolution or a degrading optical quality, or both. This is far out of the FOV range of most current AR displays, but for the solutions presented here it may become an aspect to consider.

Fig. 4: Eye resolution vs. angle from center [1].

A look at conventional designs

Near-eye displays have some functional requirements different from other optics designs. A very important fact results from the limited crisp perception area of the human eye, in conjunction with the large angular range required for a versatile near-eye display (NED). A crisp image of a scene can only be acquired cognitively in the brain, by relentless eye movements from detail to detail.

To illustrate the consequences for a near-eye display, we regard a simple, single-mirror design as shown in Fig. 5. When a user points his eye towards any particular area on the display, only a tiny fraction of the mirror is involved in the image formation for that particular point. This also differs substantially from classical optics designs in that the real exit pupil of the collimated system is in the center of the eyeball, not in front of the eye pupil.

In a collimated configuration, i.e. with the virtual image at infinity, the mirror section involved has a diameter of 2 mm for the maximum eye resolution of 1 arcmin. This is simply the pupil size corresponding to this resolution, projected onto the mirror. A smaller pupil causes lower resolution due to beam shaping constraints, a larger pupil also tends to lower resolution, because of aberration. Any of the small mirror elements effective for different angles can therefore be optimized for the particular incident beam angle on the mirror, and the focus distance to the corresponding point on the display.
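
The lower bound on the pupil size can be cross-checked against the diffraction limit of a circular aperture. The following is a minimal sketch, assuming the Rayleigh criterion and green light of 550 nm; both are illustrative assumptions and not values taken from this paper. It covers only the diffraction side of the argument, not the aberration that grows with larger pupils.

```python
import math

def diffraction_limit_arcmin(pupil_mm, wavelength_nm=550.0):
    """Rayleigh angular resolution (in arcmin) of a circular aperture.

    Assumptions: Rayleigh criterion (1.22 * lambda / D) and a default
    wavelength of 550 nm, chosen for illustration only.
    """
    theta_rad = 1.22 * (wavelength_nm * 1e-9) / (pupil_mm * 1e-3)
    return math.degrees(theta_rad) * 60.0

# Compare a few pupil (beam) diameters.
for pupil_mm in (1.0, 2.0, 4.0):
    limit = diffraction_limit_arcmin(pupil_mm)
    print(f"{pupil_mm:.0f} mm pupil: {limit:.2f} arcmin")
```

Under these assumptions, a 2 mm beam comes out close to the 1 arcmin target, while a 1 mm beam is limited to about twice that angle.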

The ideal shape for any such limited area on the mirror would be a properly chosen section of a properly chosen paraboloid. Then we should get an ideally crisp focus point. Alas, the curvature of the entire mirror is not simply the sum of small paraboloids, nor is it a paraboloid itself. We have to find a compromise between the local and the general shape.

The general shape of the mirror would have to be optimized to have all focus points on the display plane, while retaining the best possible approximation to the aforementioned paraboloidal shape for any single pupil sized region on the mirror. The concept may work a lot better with a display having a convex shape itself, as has already been indicated in [1] and [8].

Fig. 5: Principle of point formation for NED

An early exploration, concerning the performance of single mirror approaches as in Fig. 5, has been carried out in [2]. It included designs with a naked flat display and with a flat display with a thin correction lens. The angular resolution achieved was about 1.5 arcmin for a design including a lens. For the design without a lens, an explicit number was not given but from the contrast curves it appears to be about 3 arcmin. The fields of view (FOV) provided were about ±10° (20° total).

This is about what could so far be expected with planar displays. Some modest improvement may be gained if image distortion is taken care of by pre-compensation in the display image, which has meanwhile become a common and efficient method with electronic cameras.
In this paper, we therefore ignore any kind of image distortion, in favor of maximum resolution and field of view. Also note that the displays may need a dynamic distortion compensation, taking care of the eyeball rotation and the different lens positions resulting.

In military display helmets, large stacks of about a dozen lenses are typically used to prepare the display image. Such stacks are usually placed at the side of the head, and the size and weight of the assembly render it impossible for most non-professional applications (see [3], for an example; the approach expands the principle with two very large ellipsoidal mirrors and a lot of lenses, yielding an extremely wide field of view, but at the expense of an even bigger assembly).

Introducing curved displays

The first new design we regard will be the off-axis configuration as in Fig. 6. A problem here is that a parabolic or spherical mirror would show a severe astigmatism when used at an angle. A way to avoid this is an ellipsoidal mirror. Moreover, regarding the above statement “the real exit pupil of the collimated system is in the center of the eyeball, not in front of the eye pupil”, we take the eye center as one focal point of the mirror ellipsoid. Then, all rays from there would converge at the other focal point (Fig. 6).

This is not yet exactly what we want, however, because while the eye center is a location where all display rays approximately converge, any particular ray bundle coming from the eye pupil has to be assumed parallel, not converging (eye accommodation at infinity).
If we regard the pupil sized area of the ellipsoid mirror involved, it will approximately behave like a simple concave mirror or a lens with focus lengths f₁ and f₂ in the directions of the ellipsoid foci F₁ and F₂. If we move one of the foci (the one on the eye side) to infinity, the other one gets closer, according to the lens formula f = 1/(1/f₁ + 1/f₂).
The calculation returns that all focus points on the display side now come to lie on a smaller surface inscribed in the ellipsoid. The surface obtained this way is almost exactly an ellipsoid itself, around the same focus points as the mirror ellipsoid. With different values for the position of F₂ and the mirror radius, we have four discrete parameters for further optimization.
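
The focus calculation described above can be sketched in two dimensions as follows. This is an illustration under stated assumptions, not the calculation actually used for the designs: the eye center is placed at focus F₁ = (-c, 0), the second focus at F₂ = (+c, 0), and the semi-major axis a = 50 mm and c = 30 mm are hypothetical values.

```python
import math

def display_point(a, c, t):
    """Focus point of a parallel beam hitting the mirror ellipse at parameter t.

    Assumed layout (hypothetical): foci F1 = (-c, 0) at the eye center and
    F2 = (+c, 0), semi-major axis a, all lengths in mm.
    Returns ((qx, qy), f1, f2, f).
    """
    b = math.sqrt(a * a - c * c)                 # semi-minor axis
    px, py = a * math.cos(t), b * math.sin(t)    # point on the mirror
    f1 = math.hypot(px + c, py)                  # distance mirror point -> F1
    f2 = math.hypot(px - c, py)                  # distance mirror point -> F2
    f = 1.0 / (1.0 / f1 + 1.0 / f2)              # lens formula from the text
    # Move from the mirror point towards F2 by the shortened focus distance f.
    qx = px + f * (c - px) / f2
    qy = py + f * (0.0 - py) / f2
    return (qx, qy), f1, f2, f

# Sample a few mirror points and list the resulting display points.
for deg in (60, 90, 120):
    (qx, qy), f1, f2, f = display_point(50.0, 30.0, math.radians(deg))
    print(f"t={deg:3d} deg  f1={f1:5.1f}  f2={f2:5.1f}  f={f:5.1f}  "
          f"display point=({qx:5.1f}, {qy:5.1f})")
```

Since f is always smaller than f₂, every display point lies between the mirror and F₂, i.e. inside the mirror ellipse, in line with the inscribed surface described above.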

Fig. 6: Left: an ellipsoidal mirror yields the best point formation for off-center configurations.
Right: if the beam is parallel on one side, a focus of length f₂ results on the other side

Although this appears to be a geometrically elegant result, we cannot simply expect it to be perfect, because the simple focus distance calculation is correct for the respective center rays at any eye rotation only, hence the peripheral rays do not perfectly converge in the focus point.
The results obtained, however, are already far better than examples known from planar displays, proving the viability of the concept of curved displays.

If the shapes obtained here could be considerably improved by a further optimization, the general synthesis algorithm proposed below in this paper could be a tool to explore this. On the other hand, the ellipsoids shown may also be quite ideal, as we can see them as derived from the ideal spherical design we will show below, by a simple coordinate transform. Hence at least, the shorter the ellipsoid, the closer it should be to an ideal shape.

Aberration with the off-axis configuration

It is commonly assumed that a pupil-sized lens or mirror with a focus length equal to that of the eye will cause about the same spherical aberration as the eye itself, and that the system resolution should therefore not turn out lower than that of the eye.
One might expect this to hold for off-center designs with ellipsoidal mirrors as well. The ellipsoidal design however results in a considerable radius and hence focus length change (aberration) from one end to the other of the mirror surface element involved in the point formation (r₁ to r₂, Fig. 7). We will see the results of this in the optical simulation.

Fig. 7: Even though only a small area a is involved in the formation of an image point,
the mirror radius changes, from r₁ to r₂ in this example.

Optical Simulation

We will now regard the optical quality of the approach. As stated before, we do not consider image distortion, because it can be compensated electronically. What has to be evaluated, is the amount of aberration and astigmatism introduced by the mirror elements when being used for a focus length different from the genuine ellipsoid focus.

The designs were evaluated by precision 3D raytracing, using Blender and the Cycles rendering engine. Blender is a full-featured open source 3D modeling, rendering and animation environment. It also allows for an evaluation of the optical performance, at least of mirror designs. While it cannot replace a specialized optical design software in all detail, the resolution and optical appearance of the designs presented here can be explored almost perfectly. CUDA acceleration allows for very fast, almost real-time rendering with current graphics hardware.

Calculated freeform shapes were imported into Blender as vertex lines, by a CSV reader plugin. The curves imported then were expanded to 3D shapes by rotating (spinning) them and deleting double vertices when done. Exporting to STL format and re-importing optimizes the surface for rendering. Another very important step is to set 'smooth' rendering for the surface faces.
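
The spin and merge steps can be illustrated outside Blender in plain Python. The sketch below is not the Blender workflow itself and uses no Blender API; the function name `revolve_profile`, the profile values and the rounding tolerance are hypothetical choices for illustration.

```python
import math

def revolve_profile(profile, segments=64, digits=9):
    """Spin a 2D profile [(radius, z), ...] around the z axis.

    Returns (vertices, triangles) with coincident vertices merged,
    e.g. the ring that collapses onto the axis where radius is 0.
    """
    index_of = {}      # rounded coordinates -> merged vertex index
    vertices = []
    ring_ids = []      # ring_ids[i][j]: vertex index of profile point i at step j
    for r, z in profile:
        ring = []
        for j in range(segments):
            a = 2.0 * math.pi * j / segments
            v = (r * math.cos(a), r * math.sin(a), z)
            key = tuple(round(x, digits) for x in v)
            if key not in index_of:
                index_of[key] = len(vertices)
                vertices.append(v)
            ring.append(index_of[key])
        ring_ids.append(ring)
    triangles = []
    for lower, upper in zip(ring_ids, ring_ids[1:]):
        for j in range(segments):
            k = (j + 1) % segments
            for tri in ((lower[j], lower[k], upper[k]),
                        (lower[j], upper[k], upper[j])):
                if len(set(tri)) == 3:   # skip triangles collapsed by merging
                    triangles.append(tri)
    return vertices, triangles

# Hypothetical profile: a 40 mm sphere cap from the axis out to 30 degrees.
profile = [(40.0 * math.sin(math.radians(t)), 40.0 * math.cos(math.radians(t)))
           for t in (0, 10, 20, 30)]
vertices, triangles = revolve_profile(profile, segments=8)
print(len(vertices), "vertices,", len(triangles), "triangles")
```

A profile that starts on the rotation axis produces one coincident vertex per angular step at the pole; merging them leaves a single pole vertex and a fan of triangles around it.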

With these techniques, Blender's camera model delivers accurate optical renderings of displayed image detail to resolutions of 1 arcmin or even below. A ‘camera’ attached to the eye pupil position makes it possible to explore the virtual images in all aspects. We should however note that raytracing simulations generally cannot cover purely wave-optical effects, like blurring by very small camera apertures. But this is not a problem here as long as we keep it in mind.

Displays are simulated by luminescent surfaces, with image or test pattern textures. The nodes surface model of the Cycles engine also allows for unidirectional emission, as well as back side transparency. A special advantage of full 3D modeling is the capability to include detailed anatomic face shapes, which makes it possible to evaluate the visual appearance of the designs and to detect any physical conflicts with the mirrors or displays.

Simulation results - ellipsoidal display

Fig. 8: Ellipsoid design of a free-form NED, here cropped to the useful area for maximum vertical FOV. The displays are showing a full area photo.

Fig. 9: Resolution evaluation. Top left: center. Top right: 20° upward. Bottom left: 20° left. Bottom right: 20° down. The overlaid rectangles are 1 arcmin x 1 arcmin.

Fig. 8 shows a complete 3D modeling of the ellipsoidal design, including mirror simulation by ray tracing, and an anatomically correct head model. The glasses and display edges are but roughly cut (real glasses would have some nicely designed edge shapes).
The field of view of the ellipsoidal design can be very large in the horizontal, more than 140° total, with a binocular center range of 80°. The vertical range, although limited to about 45° by the display edge getting into sight, is still competitive, regarding existing designs.

Fig. 9 shows images of a crosshair grid test pattern, with grid lines originally less than 1 arcmin thick.

As expected, the display is almost free of astigmatism and the center of view is the location of best relative focus for any eye rotation angle. We see that the optical resolution partly falls short of the desired 1 arcmin. The vertical blur effect may be attributed to the already mentioned focus change along the vertical extension of the effective mirror area, due to the off-axis configuration. For the horizontal, we see a degradation from about one arcmin at the top to 2 arcmin (full contrast) at the bottom. This may indicate some space for further optimization (we should remember that the ellipse calculation was done by a simple focus law application, and the 2D result has simply been rotated to obtain the 3D shape).

The static peripheral resolution turns out to be more problematic. The simulation yields values only half as good as the peripheral resolution of the eye, or even less, especially if the eye is rotated up or down.

In conclusion, both the center and peripheral resolution of the ellipsoidal off-axis design still do not fully match the human eye resolution. While the results achieved are better than with planar displays and may still be perceived as reasonably crisp in some applications, we have to think about ways of improvement. This could be approached by an adaptation of the general mirror construction method proposed later in this paper.

Dynamic focusing according to eye position would add another degree of freedom for the display and mirror shapes. Yet even with all these approaches, the margins for improvement may be limited.

Introducing coaxial (“on-axis”) displays

We have seen that the off-center optics design is problematic in terms of optical quality, size and aesthetic appeal. Avoiding the slanted beam angles should be of advantage.

Consider a display that is transparent from its rear side and emits to the front side only. What appears strange at first glance can indeed be achieved by inserting transparent areas into a single-sided display. These can either be gaps in an otherwise non-transparent display structure, or areas of fully transparent driver circuitry. Only the pixel areas have to be occlusive, to cancel out emissions to the rear. Fully transparent notebook displays were demonstrated several years ago. A single-sided emitting variety was demonstrated by Toshiba as early as 2013 (dubbed Transmissive Display).

Fig. 10: Left: a spherical mirror yields perfect point formation at its center.
Right: if the incident beam is parallel, a focus of length r/2 results.

Consequently, with such a display, we can abandon the off-axis approach. We may use eye-concentric spheres for display and mirror (equivalent to ellipsoids with zero focus point distance). A closer look reveals that there is about one ideal size for these spheres: 4 cm diameter for the display and 8 cm for the mirror. The focus length of the sphere requires this ratio of 2:1 (Fig. 10), while the distance between left and right eye limits the mirror size, because mirrors too big would collide in the middle (Fig. 11). The display sphere sizes, on the other hand, should be sufficiently bigger than the eyeball, to achieve a comfortable distance towards it and the face.

Fig. 11: Photorealistic rendering of spherical glasses. The mirrors are half reflective. The displays in the left image are showing a very thin test grid pattern (with lines <1 arcmin wide).

A complete display design of this type would use only properly chosen sections of the spheres, according to physical limitations by the face shape.
The 3D modeling (Fig. 11) shows that with an ample clearance towards face and eyes, almost the entire visual field can still be covered. It also shows the device to look better and be more compact than the ellipsoidal one considered before. Smaller shapes for the mirrors and displays are possible, and should still deliver a sufficient FOV in many applications. A mechanical frame construction will have to be approached separately, and also the integration of further components (e.g., eye trackers).

An irritating property of this approach could be the brightness of the display: in order to produce an equivalent visual brightness, it needs to be as bright as, or brighter than the environment. This could occlude the eyes of the user and look quite awkward. In augmented reality applications, however, only a few virtual objects will usually be present, and they should normally not be in the direct view ahead, as this would obstruct the sight on the real world in a hazardous way.

A possible measure reducing this effect would be limiting the r, g and b colors emitted and reflected to narrow bands (e.g., using quantum dots and dichroic mirrors), which would increase the effectiveness and the visual transparency of the mirror and greatly reduce any light transmission to the outside.

The regular micro patterns of the display might perhaps lead to diffraction effects, mainly with shallow incident angles. If this would constitute a problem in some cases, introducing some irregularities (dithering) to the display patterns should help.

The gap structure of the transparent display would need to be very fine for our application, according to the targeted resolution of 1 arcmin. For the 2 cm display radius conceived, 1 arcmin means a pixel raster of 6 µm. Here we should perhaps start from an entirely transparent design, insert larger gaps between the pixels, and then cover the pixels up from the back side (Fig. 9). This could allow for a high transparency (e.g., 75%) as shown in Fig. 12.
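
The pixel raster figure follows directly from the display radius and the target angle. The short sketch below reproduces it; the second function additionally assumes square opaque pixel shields on a square raster, which is an illustrative simplification and not a layout specified in this paper.

```python
import math

def pixel_pitch_um(display_radius_mm, resolution_arcmin=1.0):
    """Arc length (in micrometers) subtended by the given angle on the display sphere."""
    angle_rad = math.radians(resolution_arcmin / 60.0)
    return display_radius_mm * angle_rad * 1000.0

def opaque_side_um(pitch_um, transparency):
    """Side length of a square opaque pixel shield for a given area transparency.

    Assumption: one square shield per square raster cell.
    """
    return pitch_um * math.sqrt(1.0 - transparency)

pitch = pixel_pitch_um(20.0)             # 2 cm display radius, 1 arcmin
shield = opaque_side_um(pitch, 0.75)     # 75% transparency as in the example
print(f"pixel raster: {pitch:.2f} um, opaque pixel side: {shield:.2f} um")
```

With these inputs the raster comes out just under 6 µm, and under the square-shield assumption a 75% transparency leaves about half the raster width for each opaque pixel.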

Fig. 12: Illustration of a transparent display matrix on a spherical glass (left, with greatly enlarged pixels). Matrix and layer detail and maximum incident beam angle occurring (right).

Fig. 13: Assessment of the max. possible incident angles on the spherical display

Fig. 13 shows the maximum incident angle from the eye towards the display, which should be taken into account for the layer design. With typical OLED emitting layers only 100...500 nm thin (Fig. 12), this should not be a problem. Typical OLED pixels would themselves be transparent and emit to both sides, and the shields would be implemented as mirrors.

Fig. 14: Display view of a real photo image, right eye,
showing an FOV area of 55°x80°, eyeball center camera.

Fig. 14 shows the scene impression resulting for the viewer by subsequently rotating his eye towards every image part regarded. This is achieved by placing the camera in the center of the eyeball.

An image taken with a camera at pupil position (yielding the impression for the user looking straight forward) looks almost identical, but with a slightly lower resolution at the edges. Fig. 15 shows the corresponding results. This peripheral resolution, however, is 2...6 times better than that of the eye itself (cf. Fig. 4). Hence, this display type can indeed achieve 100% resolution and a very large FOV. Indeed, we see that the 1 arcmin target for the center of view is even surpassed for usual contrast transfer figures. This holds for any eye rotation.

In conclusion, the optical quality of the spherical on-axis configuration proves to be very high, which does not surprise as the only limiting factor here is the spherical aberration of the mirror areas involved.
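
The order of magnitude of this spherical aberration can be estimated with elementary geometry. The sketch below is a rough geometric estimate, not a replacement for the raytracing results: it assumes a collimated beam centered on a mirror radius, a display point at the paraxial focus r/2 (Fig. 10), and it evaluates only the marginal ray of the beam.

```python
import math

def marginal_blur_arcmin(mirror_radius_mm, pupil_mm):
    """Angular miss (arcmin) of the marginal ray of a parallel beam.

    A ray at height h from the beam axis meets the spherical mirror at an
    incidence angle theta with sin(theta) = h / R and crosses the axis at
    R / (2 cos(theta)) from the sphere center, slightly beyond the
    paraxial focus at R / 2.
    """
    h = pupil_mm / 2.0
    theta = math.asin(h / mirror_radius_mm)
    paraxial = mirror_radius_mm / 2.0
    marginal = mirror_radius_mm / (2.0 * math.cos(theta))
    longitudinal = marginal - paraxial                  # longitudinal aberration
    transverse = longitudinal * math.tan(2.0 * theta)   # miss at the paraxial focus
    return math.degrees(transverse / paraxial) * 60.0   # small-angle conversion

# 8 cm mirror diameter (R = 40 mm) with 2 mm and 4 mm beams.
for pupil_mm in (2.0, 4.0):
    print(f"{pupil_mm:.0f} mm beam: {marginal_blur_arcmin(40.0, pupil_mm):.3f} arcmin")
```

For the 40 mm mirror radius and a 2 mm beam, this estimate stays well below 1 arcmin, which is consistent with the simulation result that the target resolution is surpassed.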

Fig. 15: Beam spread of the coaxial spherical display at center (left) and with the eye camera turned right 20° around the pupil position, assessing the eye peripheral resolution (right). Squares of 1arcmin x 1arcmin are overlaid for comparison. The grid lines displayed are originally <1arcmin wide.

We should note that an effect generally occurring with the on-axis display configuration is a faint reflection from the user's eye and skin appearing in the mirror. That image, however, is very blurred and enlarged, with the pupil always following the center of view. Given the dark color of the pupil, contrast values should not be greatly affected. Moreover, monochromatic light sources (quantum dot enhanced phosphors, e.g.) and narrow-banded mirrors would efficiently suppress this effect.

The Eyebox

The eyebox size tells us where the pupil can move without losing visual contact or causing vignetting of the image. In most cases, eye rotation alone can cause such effects. Not so with the on-axis concept, regardless of whether flat or curved displays are used. There is only a resolution degradation towards the edge of the FOV with flat displays, something not falling under the usual eyebox definition and discussed under FOV in this paper.

What remains to regard are effects by shifting the entire glasses frame. This also has only gradual effects, more on focus and only a little on the FOV.

As a conventional eyeglasses frame is sufficient for the concept, we will discuss what can happen just with this. The glasses are mainly held in position by the nosepads. This means the left-right position won’t change, but the frame can slide a bit down the nose, as anybody wearing eyeglasses well knows. Only this kind of displacement must be considered here.

The simulation model makes this easy to assess. Results indicate that for a retention of close to optimum crispness, the ellipsoidal assembly will tolerate only about 2 mm, while the on-axis spherical assembly can easily tolerate 5 mm or more (in one direction, which indicates the total eyebox is rather about 10 mm).

Remarkably, there is hardly any vignetting, even in an utterly extreme case, when the display is shifted so far as to partly leave the sight area, because the virtual image originates from the mirror.

There also is no loss of sight contact (an effect commonly known from eyepieces of microscopes and binoculars). All that actually happens is a reduction of resolution that looks like an astigmatism but can almost entirely be compensated for by the dynamic focus function that is necessary anyway, i.e. slightly moving the display vs. the mirror.

With said 5 mm displacement, e.g. 3 mm forward and 4 mm down, resolution will only drop to about 1.5 arcmin in the center and 2 arcmin in some part of the outer areas, which is hardly perceivable in practical use. The display shift required for the compensation is static and just 0.2 mm forward and up. This may be done automatically on the basis of eye tracker or other position information.

A 10 mm displacement will result in a noticeable blur, up to 10 arcmin, but that’s just all. With the flat display variant, a lateral 5 mm displacement can be compensated entirely.

We may conclude that with the on-axis concept in all variants, a straightly limited eyebox doesn’t exist, and any displacement results only in a gradual focus change.

In this context it may be interesting that in [6], a display concept was proposed that at first glance appears to have similarities to the spherical design shown here. There, however, an adaptive multi-cell holographic mirror was used, and was assumed to be absolutely necessary. The main problem might have been that a very small eyebox was assumed for any static mirror. Indeed, the large eyebox revealed by the simulations presented here came as a big surprise.

A synthesizing method for mirrors for arbitrary display shapes

Fig. 16: Algorithm for incremental mirror synthesis

We will now suggest a straightforward synthesis procedure for mirrors for arbitrary display shapes. It can be used to synthesize entire mirrors, but also to calculate alternative shapes for the edges of otherwise constructed mirrors (such as spherical or ellipsoidal ones).

The underlying idea starts from the fact that the effective beam width for point formation is small (pupil size). Fig. 16 shows the pupil size greatly extended for better illustration. The active mirror section could be approximated, at its edges, by two little planar mirrors (m₁, m₂). The optical effect of said section, focusing a beam of light, can completely be modeled by the two mirrors, reflecting the edge rays of the beam.

The curve between these mirrors will always be very well approximated by a circle sector (with center point f₁), provided the section is chosen to be small enough. The angles of m₁, m₂ that we choose to start with, are known. The point where they meet on the display (image point p₁), is then easily constructed.

Now we rotate the eye by one pupil or beam width. The mirror m₂, now at the other edge of the beam, gives us the location of a next image point on the display, p₂, and this, in turn, dictates the angle of a next mirror element, m₃, at the other side of the pupil beam. The spatial position of that next element (its distance from the eye) can perfectly be determined using the sphere sector shape we expect between the mirrors (with center at f₂).

Iterating this method, we get a number of elements approximating a curve, which should turn out to be the ideal mirror shape for that display.
Repeating the entire procedure for eye rotation from the center upwards, we complete the vertical center line of the mirror.
Note that the iteration process may use arbitrarily small pupil sizes to improve its resolution and precision.

If we try to combine iteration steps like this in the horizontal and vertical, we will most certainly get conflicting values and will have to figure out how to combine them for a suitable result.
For a rotation symmetric mirror, however, we may use the algorithm simply from center to edge, defining a complete mirror shape in a most simple way. This of course has the disadvantage of neglecting the lateral curvature and introducing more astigmatism. We will see this in the following example. Yet, the shape found there nevertheless turned out to be as optimal as possible.
We should note that the principle of the algorithm can also be used to calculate a display shape for a given mirror shape.
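
The incremental procedure can be sketched in two dimensions as follows. This is a simplified reading of the method, not the implementation used for the designs in this paper. Assumptions: the eye rotation center is the origin, the optical axis is +x, the two edge rays of each beam are taken as parallel rays through two adjacent mirror points (small-pupil limit), and the circular arc between adjacent facets is enforced by letting the chord bisect the two facet tangents. All function names and the numerical values in the usage lines are hypothetical. Points and directions are stored as complex numbers.

```python
import cmath
import math

def unit(v):
    return v / abs(v)

def dot(a, b):
    return (a.conjugate() * b).real

def cross(a, b):
    return (a.conjugate() * b).imag

def reflect(d, n):
    """Mirror reflection of unit direction d at a facet with unit normal n."""
    return d - 2.0 * dot(d, n) * n

def sphere_display(radius):
    """Display shape: circle of the given radius around the eye center."""
    def hit(origin, direction):
        b = dot(origin, direction)
        c = abs(origin) ** 2 - radius ** 2
        return origin + (-b - math.sqrt(b * b - c)) * direction  # nearer hit
    return hit

def flat_display(x_plane):
    """Display shape: plane x = x_plane in front of the eye."""
    def hit(origin, direction):
        return origin + (x_plane - origin.real) / direction.real * direction
    return hit

def synthesize_mirror(hit_display, vertex_distance, phi_max, dphi):
    """Return mirror points from the axis up to eye rotation phi_max (radians)."""
    p_mirror = complex(vertex_distance, 0.0)   # mirror vertex on the axis
    normal = complex(-1.0, 0.0)                # vertex facet faces the eye
    points = [p_mirror]
    for k in range(int(round(phi_max / dphi))):
        beam = cmath.rect(1.0, (k + 0.5) * dphi)    # direction of beam k
        # Edge ray at the known facet fixes the image point on the display.
        image = hit_display(p_mirror, reflect(beam, normal))
        sight = cmath.rect(1.0, (k + 1) * dphi)     # line carrying the next facet
        new_normal = normal
        for _ in range(5):                          # fixed-point refinement
            chord = -1j * normal - 1j * new_normal  # sum of both facet tangents
            s = -cross(p_mirror, sight) / cross(chord, sight)
            p_new = p_mirror + s * chord
            # The other edge ray must reflect into the same image point.
            new_normal = unit(unit(image - p_new) - beam)
        p_mirror, normal = p_new, new_normal
        points.append(p_mirror)
    return points

# Check against the known concentric case: 20 mm display sphere, 40 mm mirror.
sphere = synthesize_mirror(sphere_display(20.0), 40.0, math.radians(40), math.radians(1))
print("max deviation from a 40 mm sphere (mm):", max(abs(abs(p) - 40.0) for p in sphere))

# Hypothetical flat-display case: display plane at 25 mm, mirror vertex at 45 mm.
dome = synthesize_mirror(flat_display(25.0), 45.0, math.radians(35), math.radians(1))
print("dome edge point (mm):", dome[-1])
```

The concentric spherical case serves as a plausibility check, since the result should stay close to a sphere of twice the display radius. For other display shapes, only `hit_display` needs to be exchanged.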

Mirror synthesis example: Dome mirror and flat display

As a proof of concept, we show a two-dimensional implementation of the mirror synthesis method. It can already be used in practical designs, involving coaxial and rotation symmetric components, such as dome mirrors in conjunction with transparent planar displays.

Fig.17: principle of the planar display - dome mirror design.

The progressive curvature necessary in the radial directions, needed to keep focus on the planar display, in this case is not accompanied by a similar curvature in the circular directions. This causes growing astigmatism towards the edge.

One may think that a three-dimensional mirror synthesis could avoid this, but it won't really work: we would need stronger z curvature to reduce astigmatism, but this can only be achieved by more progressive xy curvature, again requiring even stronger z curvature. This proves to be a never-ending iteration, rendering the problem unsolvable. Hence, the shape we got is the best possible.
It should be noted that in [5], a similar configuration was postulated, with the assumption that an ideal mirror could be calculated. As we could show, this is not the case. The only way of improving this configuration is a curved display.

Fig. 18: Synthesized freeform mirror example with flat transmissive displays.

Fig. 18 shows the assembly, with two circular flat transparent displays of 29 mm diameter and dome mirrors of 50 mm diameter. This design is neither bigger nor heavier than a simple conventional pair of plastic eyeglasses. It allows for an FOV of up to 70° in all directions, but astigmatism remains acceptable only up to about 40°.

Fig. 19: Dome display, center resolution (left). Astigmatism at 20° eye rotation (middle), compensated by focus change (right). Overlaid rectangles are 1 arcmin x 1 arcmin.

Fig. 19 shows that the center resolution of the design is almost as good as with the coaxial sphere design. At 20° lateral eye rotation, the lateral resolution is still at its optimum, proving that the mirror synthesis algorithm works correctly (this is exactly what it was intended to deliver). The astigmatism, however, is strong, and as argued, this cannot be successfully addressed by changing the mirror shape. What can be done is to mitigate the astigmatism by adjusting the local focus for equal y and z blur.
There are four ways:

1. Leaving it to eye accommodation, requiring accommodation to, e.g., 1.25 m.

2. Bending the display (this needs only about ¼ mm for a significant improvement).

3. Using a mirror with a longer focal length (makes the assembly a bit more bulky).

4. Using a dynamic focus feature (moving the display back and forth).

When only method 1 is used, the beam spread is about 6 arcmin (x and y) at 20° eye rotation (edge of a 40° FOV). At 30% contrast, the usual definition for resolution figures, this implies an edge resolution of about 4 arcmin (Fig. 20), a value resembling the center resolution of some simpler AR and VR displays (where it is quite well accepted). One should also note that many current AR displays have only about ±10° total FOV.

Fig. 20: Circular beam spread of the planar display with dome mirror, over angle from center
(Blender simulation results), and estimated resolution figures.

Fig. 21: Image with the flat display (right eye), recorded with a virtual camera
in the eyeball center, simulating centered view to any image part.
FOV: the image is 70°x50°, the highlighted area is 40°.

As shown in Fig. 21, the total FOV even of the fashion cut flat display assembly is comparatively huge, and the resolution within a 40° FOV looks good, even though the astigmatism has not been mitigated.

Simply bending - a cylindrical display design

We may also use the same xy shape obtained, to combine the spherical coaxial principle for the horizontal with a straight shape in the vertical, resulting in a cylindrical display and a barrel shaped mirror (Fig. 22). This allows for a virtually unlimited horizontal field of view, and bending a display in only one direction is far easier than producing an entirely convex one. The astigmatism at the top and bottom is only up to 50% larger than with the planar display approach.

Fig. 22: Cylinder display and barrel shaped mirror (with only raw cut edges).

Fig. 23: A 50°x90° area from the FOV of the cylindrical display configuration,
taken with the eyeball center camera. Again, astigmatism mitigation has not been applied.

Considering a Hybrid Approach

Summarizing the above results, the best practical solution is likely to be something in between the variants analyzed:

The spherical concept has a small eye relief and conflicts a bit with the physiognomy at its edges. However, it also yields maximum resolution even beyond 80° FOV, where this is not necessary at all (as shown in Fig. 4).

The flat display variant has more eye relief, but resolution degrades toward the edges. However, even a small display bending improves things a lot (as we have seen, about ¼ mm can already correct astigmatism). Moreover, more eye distance and a bigger mirror also improve edge resolution. Just as with the spherical variant, however, we can’t make the mirrors arbitrarily large, not least because the left and right mirrors would get into each other’s way. Here it may help to make the design slightly elliptical.

As horizontal FOV usually is more important than the vertical, choosing different horizontal and vertical curvatures, as in the cylindrical variant, could also be useful.

From all this follows that a slightly curved display with ample eye distance, combining the best of all variants discussed, should be the best practical solution.

Dynamic focus

It is often argued that for a true AR experience, the display must deliver a holographic experience, letting virtual objects appear at the right optical distance, for a correct match between vergence and accommodation. But a quick, adaptive focus adjustment for any object currently being looked at, may also yield an almost holographic experience, because eye resolution is only high in a small center area, and a user will hardly notice the virtual distance of objects outside of it.

Dynamic focus adjustment should be an essential part of a display glasses design. It not only allows for a semi-holographic experience, but also for a user adaptation if prescription glasses are not integrated and, as stated above, the feature will also improve the usable eyebox of the system. For the on-axis designs shown, this requires just a very small variation of the display-mirror distance. With f₀ being the mirror focal length for infinity, and f₁ the desired object focus distance, the corresponding display displacement is
d_d = 1/(1/f₀ + 1/f₁) - f₀.

For e.g. f₁ = 400 mm, f₀ = 20 mm, we get d_d = -0.95 mm, i.e., the display moves about 0.95 mm toward the mirror.
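As a quick numerical check, the displacement formula can be evaluated for a few focus distances. This is just a sketch; the function name and the list of distances are chosen here for illustration, with f₀ = 20 mm taken from the example above.

```python
def display_displacement(f0_mm, f1_mm):
    """Display displacement d_d = 1/(1/f0 + 1/f1) - f0, in mm.

    f0_mm: mirror focal length for infinity focus.
    f1_mm: desired object focus distance.
    A negative result means the display moves toward the mirror.
    """
    return 1.0 / (1.0 / f0_mm + 1.0 / f1_mm) - f0_mm

# Example distances (illustrative values, not from the paper except 400 mm).
for f1 in (250.0, 400.0, 1000.0, 2000.0):
    d = display_displacement(20.0, f1)
    print(f"f1 = {f1:6.0f} mm  ->  d_d = {d:+.2f} mm")
```

For f₁ = 400 mm this reproduces a magnitude of about 0.95 mm, and the displacement shrinks toward zero as the focus distance approaches infinity.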

Integrating Prescription Glasses

Prescription lenses are best placed behind the display, as already shown in Fig. 2. With the designs presented here, their outer curvature should ideally be similar to the display’s inner curvature, which should be manageable.

Alternative Mirror Concepts

Quite obviously, the mirror in all above concepts could also be a Bragg or holographic mirror. In this case, the effective mirror shape can deviate from the shape of its carrying glass sheet. Hence, the mirror could become smaller and flatter, and also the display could have more eye clearance, which would normally require a relatively big mirror glass. But in turn, a holographic mirror would be considerably thicker and the direction and wavelength selectivity might eradicate the advantages.

Eye Tracking

The optics discussed have the interesting property of also delivering a picture of the user’s retina right on the displays. By integrating an array of light receptors into the display, we would get a camera recording the pattern and its position, hence the eye position. A problem here is how to get enough light into the eye to obtain a bright enough picture of the retina. Even strong infrared lighting would have difficulties in daylight situations. An alternative might be to obtain just a blurred picture of the pupil and iris. With some image processing, even this might deliver an accurate enough pupil position.

Mask Displays

The usefulness of AR displays could be greatly enhanced by a means of shielding certain sections of the FOV against the environment, in order to display virtual objects without unwanted “transparency”. The concept and practical aspects have been discussed in [1] and [4], where it has been shown that simple masks, even close to the eye, can provide for very detailed, and surprisingly crisp object coverage. Alas, a convincing technology for such a “mask display” is still to be found.

The on-axis NED presented here may require a curved mask display, while waveguides would get along with a flat, likely simpler one. The masking, however, works better the more distant the mask is from the eye. A quite promising technology could be electrochromic polymers, where considerable progress has been reported recently (see [7], just for an example). Polymers may allow for curved displays, and integrating one with the mirror would be a nearly ideal solution.

Power consumption

The light power we need follows straightforwardly from the fact that the brightness of the virtual image area on the display mirror would have to match the brightness of the surrounding scene. Bright sunlight has about 1 kW/m², most of it visible light, and the corresponding brightness is about 100,000 lux. With an all-white environment and a mirror area of e.g. 10 cm², this would mean 1 W.

Given an efficiency of about 30% for LED sources, and some losses in the driver circuit, the end result would be about 5 W. But this will by far not be necessary in practice: a brightness of 100,000 lux would result in snow blindness. Normal environments aren’t all white, but rather only about 20% reflecting on average, like a standard gray card used in photography. Also, the display picture would cover only part of the entire area, as otherwise it would be an immersive experience rather than AR, and normally one would wear sunglasses or use a mask display, so that the image could also be less bright than the environment. So we may easily get down to an average power requirement of about 0.1 W. An entire 10 h day would take 1 Wh. A lithium battery for this weighs just 5 g. Indoors we may get away with 1/100 of that.
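The chain of estimates above can be written down as a small calculation. This is only a sketch: the driver efficiency, the image coverage, the sunglasses or mask transmission and the battery energy density are not stated in the text; they are assumptions picked here so that the results land near the quoted figures of about 5 W, 0.1 W, 1 Wh and 5 g.

```python
def display_power_budget(
    irradiance_w_m2=1000.0,    # bright sunlight, from the text
    mirror_area_cm2=10.0,      # example mirror area, from the text
    led_efficiency=0.30,       # from the text
    driver_efficiency=0.70,    # ASSUMPTION: "some losses in the driver"
    scene_reflectance=0.20,    # gray-card average, from the text
    image_coverage=0.30,       # ASSUMPTION: share of the area showing content
    shade_transmission=0.30,   # ASSUMPTION: sunglasses or mask display
    hours_per_day=10.0,        # from the text
    battery_wh_per_kg=200.0,   # ASSUMPTION: typical lithium cell
):
    """Return the rough power and battery figures as a dict."""
    # Light power needed to match an all-white sunlit scene on the mirror.
    optical_w = irradiance_w_m2 * mirror_area_cm2 * 1e-4
    # Electrical power for that worst case.
    worst_case_w = optical_w / (led_efficiency * driver_efficiency)
    # Realistic average after the three reductions named in the text.
    average_w = (worst_case_w * scene_reflectance
                 * image_coverage * shade_transmission)
    energy_wh = average_w * hours_per_day
    battery_g = energy_wh / battery_wh_per_kg * 1000.0
    return {
        "optical_w": optical_w,
        "worst_case_w": worst_case_w,
        "average_w": average_w,
        "energy_wh": energy_wh,
        "battery_g": battery_g,
    }

for name, value in display_power_budget().items():
    print(f"{name:>13}: {value:.2f}")
```

Changing the assumed factors shifts the result proportionally, so the 0.1 W figure should be read as an order-of-magnitude estimate.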

Hence, if heavy processing is outsourced to a separate mobile phone that delivers just the images to the AR glasses wirelessly, battery weight should not be an issue. The entire assembly may then weigh e.g. 30 g, even with driver circuitry, focus actuators, cameras, ear pads, etc. Many glasses actually weigh more than that, and people wear them without complaining.

Summary

We have seen that high resolution near-eye displays with only one mirror can be constructed if the display itself is used as a functional element in the optical design, by giving it a curved surface.

The ideal shape for display and mirror in the usual off-axis configuration turned out to be ellipsoidal, with the center of the eye as one of the ellipsoid’s focal points.

Yet it turned out that the off-axis approach suffers from the curvature changes even within the small areas contributing to the formation of single image points.

The problem was solved by changing the off-axis configuration into a coaxial (or: on-axis) one. This requires a transparent display, an unusual but possible technology.

The ellipses hence became spheres, and the resolution and FOV achievable with this are virtually unlimited. But moreover, and quite surprisingly, the eyebox also turned out to be very large, in spite of the big magnification factor of the assembly. In fact, it does not even exist in the classical sense. What mostly occurs with displacements is just a slight focus change.

Finally it turned out that even with a flat display, the on-axis approach yields a very large FOV, 40° of it with good resolution and a lot more on top, and also with a large eyebox.

A straightforward synthesis algorithm for the ideal mirror shape was presented and verified.

Using elements from the approaches presented, a wide variety of display and mirror shapes are conceivable, optimized for various aspects of performance and convenience, all sharing the principal advantages of the on-axis concept, i.e., high resolution and large FOV and eyebox.

Compared to the currently often favored waveguide displays, the form factor of this approach is bigger, but the functional elements can be made arbitrarily thin and therefore light, the FOV is a lot larger and the eyebox is also very large. This allows the use of a simple eyeglasses frame instead of the usual helmet-like constructions. Eye clearance is smaller than with most waveguide devices, but wearing them together with prescription glasses is difficult with both approaches anyway. Nevertheless, prescription lenses can easily be integrated into the configuration.

The approaches presented have been evaluated by 3D raytracing with freely available open source software (Blender), which proved to be fully capable of simulating optical resolution as well as physiological and aesthetic design aspects. This allows for a more comprehensive evaluation than with classical optics design software.
The Blender model used for the simulations and a set of video tutorials can be obtained (for non-profit use only) from the website of [4] (http://www.displaysbook.info, "materials" page).

References

[1] The end of Hardware; Rolf R. Hainich, Booksurge 2006, 3rd Ed. 2009. ISBN 9781439236024
[2] O. Cakmakci, S. Vo, S. Vogl, R. Spindelbalker, A. Ferscha, J. P. Rolland: Optical Free-Form Surfaces in Off-Axis Head-Worn Display Design. 7th IEEE and ACM International Symposium on Mixed and Augmented reality, ISMAR, Cambridge, 2008.
[3] Jianming Yang, Weiqi Liu, Weizhen Lv, Daliang Zhang, Fei He, Zhonglun Wei, Yusi Kang: Method of achieving a wide field-of-view head-mounted display with small distortion. OPTICS LETTERS 203 / Vol. 38, No. 12 / 2013
[4] Displays - Fundamentals and Applications, Rolf R. Hainich and Oliver Bimber, Taylor & Francis, 2nd edition 2016. ISBN: 9780367658175
[5] EP0580261A1, Combined display and viewing system; Masakazu Kawada 1993
[6] US6407724B2, Method of and Apparatus for Viewing an Image; Waldern et al. 2002
[7] Xu, T. et al. High-contrast and fast electrochromic switching enabled by plasmonics. Nat. Commun. 7:10479 doi: 10.1038/ncomms10479 (2016).

[8] Rolf R. Hainich: Near Eye Displays - a Look into the Christmas Ball. Keynote Lecture. 7th IEEE and ACM International Symposium on Mixed and Augmented reality, ISMAR, Cambridge, 2008. https://dl.acm.org/doi/10.1109/ISMAR.2008.4637310

¹All rights preserved. Patents pending or granted, e.g.: DE102014013320B4; GB2537193B, US11867903B2.

Resources download

You are welcome to download the complete Blender model with several display/mirror designs included, additional details on the calculus used, as well as a complete set of tutorial videos.
You will also receive links to download the Displays book, first edition, as well as the code samples from the first edition's appendix, and even a complete set of all images in Displays - Fundamentals and Applications, 2nd Edition!

If you think this material is of any value to you, please consider making a donation to the Captain Paul Watson Foundation.

Copyright © 2012-2024 Rolf R. Hainich; all materials on this website are copyrighted.

Disclaimer: All proprietary names and product names mentioned are trademarks or registered trademarks of their respective owners. We do not imply that any of the technologies or ideas described or mentioned herein are free of patent or other rights of ourselves or others. We do also not take any responsibility or guarantee for the correctness or legal status of any information in this book or this website or any documents or links mentioned herein and do not encourage or recommend any use of it. You may use the information presented herein at your own risk and responsibility only. To the best of our knowledge and belief no trademark or copyright infringement exists in these materials. In the fiction part of the book, the sketches, and anything printed in special typefaces, names, companies, cities, and countries are used fictitiously for the purpose of illustrating examples, and any resemblance to actual persons, living or dead, organizations, business establishments, events, or locales is entirely coincidental. If you have any questions or objections, please contact us immediately. "We" in all above terms comprises the publisher as well as the author. If you intend to use any of the ideas mentioned in the book or this website, please do your own research and patent research and contact the author.



Fig. 8: Ellipsoid design of a free-form NED, here cropped to the useful area for maximum vertical FOV. The displays are showing a full area photo.

Fig. 9: Resolution evaluation. Top left: center. Top right: 20° upward. Bottom left: 20° left. Bottom right: 20° down. The overlaid rectangles are 1 arcmin x 1 arcmin.