Future Work - VALUE SIGNIFICANCE METHOD Overall

NoodleNav GMap

TASK DESCRIPTION Task

P- VALUE SIGNIFICANCE METHOD Overall

8. Future Work

There are several areas into which this research could expand. The two main areas for expansion are in the enhancement of the head tracking tool and in the evaluation and test design. The test design has many options for enhancement. An automated way to calculate the accuracy could enhance the accuracy data and might provide some more specific insights on the different ways a user may make a mistake and correction with the different input methods. The idea for another interesting study that could be performed came from one of the users who mentioned that they felt their video game experience helped them perform better with both the head tracking and the mouse. Additionally, one user was a retired fighter pilot who had experience with missile targeting systems that used head gaze for targeting; this user mentioned that the head tracking felt natural. It would be interesting to design an experiment in which head tracking was used to navigate a video game or flight simulator space and compare it to the other methods of input common in that domain. Additionally, repeating the experiment with users who had similar levels of experience with the mouse and the head tracking would be interesting, to see if the head tracking would outperform the mouse or at least be comparable. The hard part of doing that would be finding users with so little experience with the mouse, since it is far and away the dominant method of input for computing. Finally, a simple

improvement to the test design would be to create a standard questionnaire for users to fill out after completion of the various tasks as this would be helpful in making the qualitative data easier to compare across users.

Some improvements to the tool itself would be worthwhile towards enhancing performance compared to the other input methods. Since the focus of this research was to evaluate the concept of head tracking as a method of panning spatial data, the

algorithm chosen for the actual tracking was a compromise of acceptable performance and easier implementation. Improving the performance of the head tracking, both in different lighting conditions and in response time would be helpful. Additionally, creating a more automated way of training the system, perhaps one that can handle users in different starting positions would go a long way toward enhancing the user perception of the system. Adding in other dimensions of control, such as zooming with a blink, could provide a path for the tool to become more useful and practical for everyday use. Finally, it is hard for the user to get the panning via head tracking to stop exactly where they want it to, so the tracking and response to small movements could use some refinement. Some combination of these enhancements would most likely improve the results of this research.

Extending the tool to explore how well head tracking would work in a 3D

environment, particularly in light of the demonstrated ease of use in the 2D environment, would also be a worthwhile endeavor.

9. Conclusion

This research set out to examine and evaluate the performance of head tracking as an input method for panning 2D spatial information. This is an important step toward removing the need for a user to occupy their hands with navigating through information, either to enable disabled users or to enhance the capabilities of a typical user by allowing them to use their hands for other tasks while navigation is completed with natural

motions like using head turn for control.

This evaluation was completed in two parts. The first part was to create the head tracking application. This application was created using Adobe Flex, Actionscript, and the Google Maps API. It was created such that it is operating system independent and has a low cost to use, only requiring an internet connection, a standard webcam, and the Adobe Flash player. The second part consisted of 20 users completing five tasks of increasing complexity with three input methods: mouse, laptop touchpad, and the head tracking application. Each task was timed and an accuracy rank was assigned to each input method for each task. The results proved to be statistically significant using a one- way, within-subject ANOVA and revealed that the head-tracking was slightly behind the mouse in performance, but significantly ahead of the touchpad.

This paper concludes that, as a method of input, head tracking provides intuitive and precise control for panning two-dimensional spatial data. With further application refinement and user practice, head tracking may ultimately outperform the mouse in navigating spatial information.

10. References

[1] Agrawal, Pyush, Ingmar Rauschert, Keerati Inochanon, Levent Bolelli, Sven Fuhrmann, Isaac Brewer, Guoray Cai, Alan MacEachren, and Rajeev Sharma. “Multimodal Interface Platform for Geographical Information Systems (GeoMIP) in Crisis Management.” In Proceedings of the 6th International Conference on Multimodal Interfaces. New York, NY, USA: ACM Press, 2004, 339-340, DOI= http://doi.acm.org/10.1145/1027933.1027997.

[2] ArcGIS from ESRI. Retrieved on November 16, 2009, from http://www.esri.com/software/arcgis/.

[3] Ashdown, Mark, Kenji Oka, and Yoichi Sato. “Combining Head Tracking and Mouse Input for a GUI on Multiple Monitors.” In CHI ’05 Extended Abstracts on Human Factors in Computing Systems. New York, NY, USA: ACM Press, 2005, 1188-1191, DOI=http://doi.acm.org/10.1145/1056808.1056873.

[4] Atlas Gloves. Retrieved on November 16, 2009, from http://atlasgloves.org/.

[5] Bolelli, Levent. “Multimodal Response Generation in GIS.” In Proceedings of the 6th International Conference on Multimodal Interfaces. New York, NY, USA: ACM Press, 2004, 355-355, DOI=http://doi.acm.org/10.1145/1027933.1028012.

[6] Cabral, Marcio C., Carlos H. Morimoto, and Marcelo K. Zuffo. “On the Usability of Gesture Interfaces in Virtual Reality Environments.” In Proceedings of the 2005 Latin American Conference on Human-Computer Interaction. New York, NY, USA: ACM Press, 2005, 100-108, DOI=

http://doi.acm.org/10.1145/1111360.1111370.

[7] Darrell, Trevor, Konrad Tollmar, Frank Bentley, Neal Checka, Louis-Phillipe Morency, Ali Rahimi, and Alice Oh. “Face-Responsive Interfaces: From Direct Manipulation to Perceptive Presence.” In Proceedings of the 4th international conference on Ubiquitous Computing. New York, NY, USA: ACM Press, 2002, 135-151.

[8] Andrés del Valle, Ana C. and Jean-Luc Dugelay. “Online Face Analysis: Coupling Head Pose-Tracking with Face Expression Analysis.” In Proceedings of the Tenth ACM International Conference on Multimedia. New York, NY, USA: ACM Press, 2002, 414-415, DOI=http://doi.acm.org/10.1145/641007.641093.

[9] Entropia Universe. Retrieved on November 17, 2009, from http://www.entropiauniverse.com/.

[10] ezANOVA Statistics Application. Retrieved on October 14, 2009, from http://www.sph.sc.edu/comd/rorden/ezanova/index.html.

[11] Fisher, R. B. and Oliver, P. “Multi-variate Cross-Correlation and Image Matching.” In Proceedings of British Machine Vision Conference, 1995,

623–632.

[12] Francios, Alexandre R.J., “Real-Time Multi-Resolution Blob Tracking.”

IRIS Technical Report IRIS-04-422, University of Southern California,

Los Angeles, April 2004.

[13] Freeman, William. T. and Craig D. Weissman. “Television Control by Hand Gestures.” In Proceedings of the International Workshop on Automatic Face- and Gesture-Recognition. Zurich, Switzerland: IEEE, 1995, 179-183.

[14] GeoMedia GIS Application from Intergraph. Retrieved on November 12, 2009, from http://www.intergraph.com/.

[15] Google Earth 3D Mapping Application. Retrieved on November 11, 2009, from http://earth.google.com.

[16] Google Earth Developer Blog. Retrieved on November 19, 2009, from http://google-latlong.blogspot.com/2008/02/truly-global.html.

[17] Google Maps 2D Mapping Application. Retrieved on November 19, 2009, from http://maps.google.com.

[18] Gravetter, Frederick. J. and Larry B. Wallnau, Statistics for the Behavioral Sciences. 6th Edition. Thomson-Wadsworth Publishing, Belmont, California, United States, 2004, p.124, 244, 432, 650. Print. ISBN 0-534-62203-8. [19] Gunes, Hatice, Massimo Piccardi, and Tony Jan. "Face and Body Gesture

Recognition for a Vision-Based Multimodal Analyzer." In Proceedings of the Pan-Sydney area workshop on Visual information processing. ACM, 2004, 19-28. [20] Hansen, Thomas R., Eva Eriksson, and Andreas Lykke-Olesen. "Use your head:

exploring face tracking for mobile interaction." CHI '06: CHI '06 extended abstracts on Human factors in computing systems. New York, NY, USA: ACM Press, 2006, 845-850.

[21] Hinckley, Ken, Y. Pausch, John C. Goble, and Neal F. Kassell. “A Survey of Design Issues in Spatial Input.” In Proceedings of the 7th annual ACM symposium on User interface software and technology. New York, NY, USA: ACM Press, 1994, 213-222.

[22] Hitwise Weblog. “Google Maps Surpasses Mapquest”. Retrieved on November 19, 2009 from

http://weblogs.hitwise.com/heather-dougherty/2009/04/ google_maps_surpasses_mapquest.html.

[23] Huang, X. and Y. Lin. “A vision-based hybrid method for facial expression recognition.” In Proceedings of the 1st international conference on Ambient media and systems. New York, NY, USA: ACM Press, 2008, Article 4. [24] Kaneva Online Virtual World. Retrieved on November 17, 2009 from

http://www.kaneva.com

[25] Kjeldsen, Rick. “Head Gestures for Computer Control.” In Proceedings of the IEEE ICCV Workshop on Recognition, Analysis, and Tracking of Faces and Gestures in Real-Time Systems. New York, NY, USA: ACM Press, 2001, 61-68. [26] Kjeldsen, Rick and Jacob Hartman. “Design issues for vision-based computer

interaction systems.” In Proceedings of the 2001 workshop on Perceptive user interfaces. New York, NY, USA: ACM Press, 2001, 1-8, DOI=

http://doi.acm.org/10.1145/971478.971511.

[27] Kjeldsen, Rick, Anthony Levas, and Claudio Pinhanez. "Dynamically

reconfigurable vision-based user interfaces." In Machine Vision and Applications,

Vol.16, December 2004, 6-12, DOI=10.1007/s00138-004-0145-6.

[28] Kohler, Markus. “Special Topics of Gesture Recognition Applied in Intelligent Home Environments.” In Proceedings of the International Gesture Workshop on Gesture and Sign Language in Human-Computer Interaction. London, UK: Springer-Verlag, 1997, 285-296.

[29] Krum, David M., Olugbenga Omoteso, William Ribarsky, Thad Starner, and Larry F. Hodges. “Evaluation of a Multimodal Interface for 3D Terrain

Visualization.” In Proceedings of the conference on Visualization '02. New York, NY, USA: ACM Press, 2002, 411-418.

[30] Lin, Yuan-Pin, Yi-Ping Chao, Chung-Chih Lin, and Jyh-Horng Chen. "Webcam Mouse Using Face and Eye Tracking in Various Illumination Environments." In Proceedings of the 27th Annual International Conference of the Engineering in Medicine and Biology Society. IEEE-EMBS. 2005, 3738-3741.

[31] Loewenich, Frank and Frederic Maire. “Hands-free mouse-pointer manipulation using motion-tracking and speech recognition.” In Proceedings of the 19th Australasian conference on Computer-Human Interaction: Entertaining User Interfaces, Vol. 251. 2007, 295-302. DOI=

http://doi.acm.org/10.1145/1324892.1324955.

[32] MapInfo from Pitney Bowes. Retrieved on November 13, 2009 from

http://www.pbinsight.com/products/location-intelligence/applications/mapping- analytical/mapinfo-professional/.

[33] Merdes, Matthias, Jochen Häu, and Matthias Jöst. "'SlidingMap': introducing and evaluating a new modality for map interaction." ICMI '04: Proceedings of the 6th international conference on Multimodal interfaces. New York, NY, USA: ACM, 2004, 325-326. DOI=http://doi.acm.org/10.1145/1027933.1027989.

[34] Microsoft Virtual Earth 2D Mapping Application, now known as Bing Maps. Retrieved on November 17, 2009 from http://www.bing/com/maps.

[35] Morency, Louis-Philippe and Trevor Darrell “Head gesture recognition in intelligent interfaces: the role of context in improving recognition.” In

Proceedings of the 11th International Conference on Intelligent User Interfaces. New York, NY, USA: ACM Press, 2006, 32-38, DOI=

http://doi.acm.org/10.1145/1111449.1111464.

[36] Morimoto, Carlos H., Yaser Yacoob, and Larry Davis. "Recognition of Head Gestures Using Hidden Markov Models." In Proceedings of ICPR. IEEE, 1996, 461-465.

[37] NASA World Wind 3D Virtual Globe Application. Retrieved on November 18, 2009 from http://worldwind.arc.nasa.gov/.

[38] Nummiaro, Katja, Esther K. Meier, and Luc J. Van Gool. "Object Tracking with an Adaptive Color-Based Particle Filter." Proceedings of the 24th DAGM Symposium on Pattern Recognition. London, UK: Springer-Verlag, 2002, 353-360.

[39] Oviatt, Sharon, Rachel Coulston, and Rebecca Lunsford. "When do we interact multimodally?: cognitive load and multimodal communication patterns." In Proceedings of the 6th International Conference on Multimodal Interfaces. New York, NY, USA: ACM Press, 2004, 129-136.

[40] Panerai, F., S. Hanneton, J. Droulez, and V. Cornilleau-Pérès. "A 6-dof device to measure head movements in active vision experiments: Geometric modeling and metric accuracy." In Journal of Neuroscience Methods 90 (1999): 97-106.

[41] Rauschert, Ingmar, Pyush Agrawal, Rajeev Sharma, Sven Fuhrmann, Isaac Brewer, and Alan Maceachren. "Designing a human-centered, multimodal GIS interface to support emergency management." GIS '02: Proceedings of the 10th ACM international symposium on Advances in geographic information systems. New York, NY, USA: ACM Press, 2002, 119-124.

[42] Rauschert, Ingmar. “Adaptive multimodal recognition of voluntary and

involuntary gestures of people with motor disabilities.” In Proceedings of the 6th International Conference on Multimodal Interfaces. New York, NY, USA: ACM Press, 2004, 356-356, DOI=http://doi.acm.org/10.1145/1027933.1028013. [43] Rigoll, Gerhard, Andreas Kosmala, and Stefan Eickeler. "High Performance Real-

Time Gesture Recognition Using Hidden Markov Models." In Proceedings of the International Gesture Workshop on Gesture and Sign Language in Human-

Computer Interaction.New York, NY, USA: ACM Press, 1998, 69-80.

[44] Roweis, S. and Z. Ghahramani. "A unifying review of linear gaussian models." Neural Computation 11 (February 1999): 305-345.

[45] Sato, Yoichi, Makiko Saito, and Hideki Koik. “Real-Time Input of 3D Pose and Gestures of a User’s Hand and Its Applications for HCI.” In Proceedings of the Virtual Reality 2001 Conference. New York, NY, USA: ACM Press, 2001, 79-79. [46] Schöning, Johannes, Brent Hecht, Martin Raubal, Antonio Krüger, Meredith

Marsh, and Michael Rohs. "Improving interaction with virtual globes through thinking: helping users ask "why?"." IUI '08: Proceedings of the 13th

International Conference on Intelligent User Interfaces. New York, NY, USA: ACM Press, 2008, 129-138.

[47] Second Life Virtual World. Retrieved on November 19, 2009 from http://secondlife.com.

[48] Sparacino, Flavia, Christopher Wren, Ali Azarbayejani, and Alex Pentland. “Browsing 3-D spaces with 3-D vision: body-driven navigation through the Internet city.” In Proceedings of the First International Symposium on 3D Data Processing, Visualization, and Transmission. IEEE, 2002, 224-231.

[49] Sturman, David J. and David Zeltzer. "A Survey of Glove-based Input." IEEE Computer Graphics and Applications 14 (1994): 30-39. DOI=10.1109/38.250916. [50] Valenti, Roberto, Alejandro Jaimes, and Nicu Sebe. “Facial Expression

Recognition as a Creative Interface.” In Proceedings of the 13th International Conference on Intelligent User Interfaces. New York, NY, USA: ACM Press, 2008, 433-434.

[51] Wei, Xiaozhou, Lijun Yin, Zhiwei Zhu, and Qiang Ji. “Avatar-mediated face tracking and lip reading for human computer interaction.” In Proceedings of the 12th Annual ACM International Conference on Multimedia. New York, NY, USA: ACM Press, 2004, 500-503, DOI=http://doi.acm.org/10.1145/1027527.1027648. [52] Wii Sports by Nintendo. Retrieved on November 18, 2009 from

http://us.wii.com/wiisports/.

[53] XMind Mind Map Software by XMind Ltd. Retrieved on November 19, 2009 from http://www.xmind.net.

[54] Yahoo Maps by Mapquest. Retrieved on November 18, 2009 from http://maps.yahoo.com.

[55] Zhang, Xiaoqin, Weiming Hu, Zixiang Zhao, Yan-guo Wnat, Xi Li, and Qingdi Wei. “SVD based Kalman particle filter for robust visual tracking.” In

Proceedings of the 19th International Conference on Pattern Recognition. IEEE, 2008, 1-4.

In document Evaluating Head Gestures for Panning 2-D Spatial Information (Page 74-82)