Preprints & Reprints Abstract Keywords Introduction Unfortunately, all spectacles based 3-D systems require the wearing of dark or coloured spectacles, which dramatically reduce eye contact. Dark spectacles can even make the wearer look doubtful or sinister. Since eye contact with the person at the remote location is one of the key advantages offered by video-telephony and video-conferencing, spectacles based 3-D imaging approaches are not appropriate for these applications. We have developed a prototype spectacles free system which demonstrates the impact of 3-D in a video-telephone application. This demonstration has been assembled using commercially available components with a liquid crystal display and a lensed sheet (a lenticular sheet ) overlay. The views from two remote cameras, equivalent to a left and right eye view, are displayed. Spectacles based systems are ideal for applications where eye contact is not necessary. We have developed the concept of a remote 'surrogate head' linked back to a 3-D system, such as a spectacles based display or an immersive headset. The surrogate head has two cameras which collect views of the remote scene. The views are then transmitted over the telecommunications network to the user. This concept could be particularly useful in remote working applications where the visual image, coupled with the depth information, is important. 3-d video-telephony. Previous work. The availability of flat panel displays with a suitable pixel arrangement.
The choice of view determining screen for prototyping. For the video-telephone application, our initial design of lenticular sheet built in an element of defocus on each lens so that the dark stripes would be blurred at the viewing plane. The resulting pattern has been defocused as required but, unfortunately, a darkening is still observable as the observer moves his head [6,7].
Recent developments. The long term solution for both of these issues is to use better displays and intelligent camera systems to monitor the user's position. In the short term we have used a head tracking system that monitors the user's position and provides feedback to move the views and to ensure that the correct views are displayed. This allows the user to move more freely in front of the video-telephone and gives an enhanced visual effect. For our first demonstration we have used an infra-red headtracking system to locate the observer's head position laterally with respect to the screen. Initially we thought that the best way to overcome the 'limited viewing zone' problem was to move the lenticular sheet with respect to the LCD panel. At most, for the panel we are using, this movement would be of the order of +/- 2mm and this could be achieved relatively easily with piezo-electric transducers or miniature stepper motors. The position of the two views would then change to follow the user. After some preliminary tests we found that a better way to move the views was to rotate the complete LCD and lenticular sheet assembly using a stepper motor so that the correct views are always shown.
We have found that users seated in front of a video-telephone do not normally move more than 250mm either side of a central position, or faster than 1.35 metres per second. This corresponds to a 0.384 radian rotation of the panel, at an angular velocity of 0.002 radians per second and is well within the limits of a standard stepper motor. As it is likely that users would try to find the limits of the system, some filtering of head movements is required, so that the motor cannot be overdriven. A commercially available infrared headtracker which connects to the serial port of a computer is used to locate the head position of the user. Software interprets this signal and controls the stepper motor: this is driven from the parallel port. Initial tests of the system revealed that a greater torque was needed from the motor for it to move the panel in the required manner. Consequently, a 5:1 ratio gearbox was fitted. Unfortunately, as further experimentation showed, this meant that the motor was now too slow to track fast head movements. This is not a problem in normal operation and will be improved by fitting a better motor in the near future. Future work.
In the short term we would like to improve the performance of our system by using an image processing based, head tracking approach that offers less user intrusion and a greater flexibility. As we move to the future we expect that with such a tracking capability and with better flat panel displays we will only need to flip adjacent views to always give the correct, orthoscopic view to the user. More sophisticated image processing techniques could then be used to produce more realistic images by increasing the number of views available to the user; a number of intermediate views between those from the source cameras could be generated (eg [8]). These could be displayed as the user moves his head. As has been reported by a number of workers, image quality in both high and low resolution images is enhanced significantly with the addition of depth information. If the second and subsequent views can be efficiently coded and image quality is much enhanced compared with 2-D, then there is a real prospect for 3-D at low data rates for video-telephony. Surrogate head system. There are two scenarios for developments to this system. The first is for a field support system in which a distant operator can lend support to a technician performing a complex task. Twin cameras, perhaps mounted on the field technician's head, relay images back over the ISDN to a control room where the scene is viewed using a spectacles based stereoscopic display system. Niche market applications already exist for such systems and these have been shown to operate over the ISDN [6]. The main applications appear to be in 'remote handling', in medical imaging field or for tele-support systems such as Camnet"[9]. These are generally not enhanced presence video-telephony, or video-conferencing systems, because of the loss of eye-contact. However, they do provide depth information to a distant operator and offer valuable extra cues for difficult handling or conceptual tasks. The second scenario is for a system which requires the user to wear an immersive display linked to two remote cameras. The idea is to provide a stereoscopic pair of views to the observer which depend on the direction in which he is looking. This is determined by an independent headtracking system. In first experiments, windows on the field of view of two fixed cameras were moved in response to the observers head motion. This allows the user to look around a remote scene - even though the remote cameras remain fixed. Electronic zoom, pan and tilt are all possible under user control. The movement of a motorised camera platform following the motion of the user's head (and eyes) can increase the sense of presence further. Initial work in this area has shown that high resolution immersive displays are required before a good sense of presence can be established. With improvements in the enabling technologies and in the quality of the long distant links there are good prospects for providing real, remote immersive experiences. Planned enhancements of the system. Additionally, collaborative remote experiments with the University of Strathclyde, in Glasgow, who have developed an 'anthropomorphic head', are planned. This head has the same number of degrees of freedom as the human head and can be used in place of fixed cameras. The intention is to interact with the 'anthropomorphic head' across the network - over a distance of some 700km. This work is described more fully by Fryer et al [10]. Further applications will be described at the conference. Conclusions Since there is a great deal of similarity between adjacent views in a stereoscopic system, the prospects of coding the signals efficiently for transmission over a telecommunications network, seem good. This could significantly enhance the quality of a video-telephony interaction at only a small bandwidth overhead. Prospects for immersive or screen based stereoscopic display systems operating over long distances using the ISDN are improving year on year as the enabling technologies improve in quality and drop in price. Acknowledgements. References 2. Burner R: 'Autostereoscopic lenticular systems.' Proc IEE Colloquium on Stereoscopic Television. October 1992. London. 3. Sheat D E; Chamberlin G R; Gentry P; Leggatt J S; McCartney D J; '3-D Imaging Systems for Telecommunications Applications.' Proc.SPIE. Vol 1669. p186. Electronic Imaging Systems and Applications Meeting, San Jose.USA.1992. 4. McCartney D J; Chamberlin G R; Leggatt J S; Sheat D E; Seal C H: '3-D Electronic still imaging'. Ist Int Festival of 3-D Images. Paris. September 1991. p296-302. 5. McCartney D J. Sheat D E. Chamberlin G R. Seal C H. 'Auto-stereoscopic 3-D imaging systems for telecommunications applications.' IEE Meeting on Stereoscopic Television, London. October 1992. 6. McCartney D J. Chamberlin G R. Sheat D E. Gentry P: 'Telecommunications Opportunities For 3-D Imaging Systems.' Proceedings of the 4th European Workshop on 3-D Television. Rome. October 20-21st 1993. p65-69. 7. Chamberlin G R. Sheat D E. McCartney D J: 3D imaging for video-telephony. Proc First international sysmposium on Three Dimensional Image communication Technologies. Tokyo. December 1993. 8. Liu J: Skerjanc R: 'Construction of intermediate pictures for a multiview 3-D system.' Stereoscopic Displays and Applications. SPIE Vol 1669. (1992) .pp10-19. 9. P.Cochrane, D Heatley, K Fisher, K Cameron and R Taylor-Hendry: 'CAMNET, the First Telepresence System', Interlink 2000 Towards the Future of Communications, pp 38-41, August 1992. 10. Fryer R et al: 'An experiment in three dimensional telepresence.' Proc Conf on Three dimensional Imaging. The Royal Society of Arts, London. February 2nd & 3rd 1995. |