Summary and Future Work - Evaluating and Extending Trajectory Features for Activity Recognition

5.6 Summary and Future Work

Fig. 5.19 The qualitative results of the proposed algorithm on our dataset containing multiple actions in a video clip. (a) Selected frames from two videos containing temporally distinguished actions. (b) Selected frames from two videos containing spatially distinguished actions

In this chapter, we have presented the co-recognition approach to images and videos that detects, matches, and segments multiple sets of identical objects or actions in an unsupervised way. The basic idea is to grow initial matches into reliable object correspondences in a multi-layer match-growing framework and analyze their relations by their matching regions or volumes. Unlike other unsupervised segmentation or object discovery methods, it effectively considers both geometry and appearance to discover the detailed dense matches and segmentation from complex images or videos. We have shown its robust performance on a variety of unsupervised vision applications, such as unsupervised object detection and segmentation [8,9], image retrieval, symmetry analysis [6], action detection [51], and 3D reconstruction.
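To make the idea of growing initial matches concrete, the following is a minimal single-layer sketch in Python. It is an illustration under simplifying assumptions and not the multi-layer framework of this chapter: features are 2D points with a scalar descriptor, the geometric model is a local translation, and the function name `grow_matches` and the parameters `radius`, `max_geo_err`, and `max_app_dist` are hypothetical.

```python
import math
from collections import deque


def grow_matches(pos_a, pos_b, desc_a, desc_b, seeds,
                 radius=1.5, max_geo_err=0.5, max_app_dist=0.2):
    """Grow seed matches (i, j) into a larger set of correspondences.

    Toy model (an assumption, not the chapter's method): a neighbouring
    feature pair is accepted when it follows the same local translation as
    the match it grows from (geometry) and when its scalar descriptors are
    similar (appearance).
    """
    matched_a = {i for i, _ in seeds}
    matched_b = {j for _, j in seeds}
    accepted = list(seeds)
    queue = deque(seeds)
    while queue:
        i, j = queue.popleft()
        # Local translation implied by the match we are expanding from.
        dx = pos_b[j][0] - pos_a[i][0]
        dy = pos_b[j][1] - pos_a[i][1]
        for i2, p in enumerate(pos_a):
            # Only unmatched features near the current match may be added.
            if i2 in matched_a or math.dist(p, pos_a[i]) > radius:
                continue
            predicted = (p[0] + dx, p[1] + dy)
            best = None
            for j2, q in enumerate(pos_b):
                if j2 in matched_b:
                    continue
                geo_err = math.dist(q, predicted)         # geometric check
                app_dist = abs(desc_a[i2] - desc_b[j2])   # appearance check
                if geo_err <= max_geo_err and app_dist <= max_app_dist:
                    if best is None or geo_err < best[0]:
                        best = (geo_err, j2)
            if best is not None:
                j2 = best[1]
                matched_a.add(i2)
                matched_b.add(j2)
                accepted.append((i2, j2))
                queue.append((i2, j2))  # newly accepted matches grow further
    return accepted


# Three features shifted by (5, 5) plus one unrelated feature in each image.
pos_a = [(0, 0), (1, 0), (2, 0), (10, 10)]
pos_b = [(5, 5), (6, 5), (7, 5), (0, 0)]
desc_a = [0.1, 0.2, 0.3, 0.9]
desc_b = [0.1, 0.2, 0.3, 0.1]
grown = grow_matches(pos_a, pos_b, desc_a, desc_b, seeds=[(0, 0)])
```

Starting from the single seed, the sketch reaches the third feature only through the second one, which is the sense in which matches are grown rather than found independently. Requiring both the geometric and the appearance test to pass mirrors the point made above that both cues are considered together.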

While the proposed approach applies to a wide range of tasks and gives impressive results on complex scenes and videos, the current method still has some limitations. As already noted in Sect. 5.2, co-recognition detects an object correspondence on the condition that the objects appearing in the given images or videos are geometrically and photometrically distinctive. This condition, however, is not strictly satisfied in typical data.

whole-part relations, as well as object interactions. In the future, pursuing this direction, we plan to improve the co-recognition approach for more complex scene understanding based on the mutual relations of various objects.

References

1. Alexe B, Deselaers T, Ferrari V (2010) What is an object? In: IEEE conference on computer vision and pattern recognition

2. Belongie S, Malik J, Puzicha J (2002) Shape matching and object recognition using shape contexts. IEEE Trans Pattern Anal Mach Intell 24(4):509–522

3. Blank M, Gorelick L, Shechtman E, Irani M, Basri R (2005) Actions as space-time shapes. In: IEEE international conference on computer vision

4. Boiman O, Irani M (2006) Similarity by composition. In: Neural information processing systems

5. Cho M, Lee KM (2007) Partially occluded object-specific segmentation in view-based recognition. In: IEEE conference on computer vision and pattern recognition

6. Cho M, Lee KM (2009) Bilateral symmetry detection and segmentation via symmetry-growing. In: British machine vision conference

7. Cho M, Lee KM (2009) Feature correspondence and deformable object matching via agglomerative correspondence clustering. In: IEEE international conference on computer vision

8. Cho M, Shin YM, Lee KM (2008) Co-recognition of image pairs by data-driven Monte Carlo image exploration. In: European conference on computer vision

9. Cho M, Shin YM, Lee KM (2010) Unsupervised detection and segmentation of identical objects. In: IEEE conference on computer vision and pattern recognition

10. Cho M, Shin YM, Lee KM (2011) Object correspondence networks for unsupervised recognition of identical objects. In: Emerging topics in computer vision and its applications, vol 1, p 313

11. Cornelius H, Perďoch M, Matas J, Loy G (2007) Efficient symmetry detection using local affine frames. In: SCIA, pp 152–161

12. Efros AA, Berg AC, Mori G, Malik J (2003) Recognizing action at a distance. In: IEEE international conference on computer vision

13. Faugeras O (1993) Three-dimensional computer vision: a geometric viewpoint. MIT Press, Cambridge

14. Ferrari V, Tuytelaars T, Gool L (2006) Simultaneous object recognition and segmentation from single or multiple model views. Int J Comput Vis 67(2):159–188

15. Filipovych R, Ribeiro E (2008) Learning human motion models from unsegmented videos. In: IEEE conference on computer vision and pattern recognition

16. Furukawa Y, Ponce J (2007) Accurate, dense, and robust multi-view stereopsis. In: IEEE conference on computer vision and pattern recognition

17. Hartley R, Zisserman A (2004) Multiple view geometry in computer vision. Cambridge University Press, Cambridge

18. Hou X, Zhang L (2007) Saliency detection: a spectral residual approach. In: IEEE conference on computer vision and pattern recognition

19. Itti L, Koch C, Niebur E (1998) A model of saliency-based visual attention for rapid scene analysis. IEEE Trans Pattern Anal Mach Intell 20:1254–1259

20. Jain AK, Dubes RC (1998) Algorithms for clustering data. Prentice Hall, New York

21. Jhuang H, Serre T, Wolf L, Poggio T (2007) A biologically inspired system for action recognition. In: IEEE international conference on computer vision

22. Kannala J, Rahtu E, Brandt S, Heikkila J (2008) Object recognition and segmentation by non-rigid quasi-dense matching. In: IEEE conference on computer vision and pattern recognition

23. Karlinsky L, Dinerstein M, Levi D, Ullman S (2008) Unsupervised classification and part localization by consistency amplification. In: European conference on computer vision

24. Ke Y, Sukthankar R, Hebert M (2007) Event detection in crowded videos. In: IEEE international conference on computer vision

25. Keller Y, Shkolnisky Y (2004) An algebraic approach to symmetry detection. In: IEEE international conference on pattern recognition

26. Kim TH, Lee KM, Lee SU (2010) Nonparametric higher-order learning for interactive segmentation. In: IEEE conference on computer vision and pattern recognition

27. Laptev I (2005) On space-time interest points. Int J Comput Vis 64(2/3):107–123

28. Laptev I, Lindeberg T (2003) Space-time interest points. In: IEEE international conference on computer vision

29. Laptev I, Marszałek M, Schmid C, Rozenfeld B (2008) Learning realistic human actions from movies. In: IEEE conference on computer vision and pattern recognition

30. Lempitsky V, Kohli P, Rother C, Sharp T (2009) Image segmentation with a bounding box prior. In: IEEE international conference on computer vision

31. Lhuillier M, Quan L (2002) Match propagation for image-based modeling and rendering. IEEE Trans Pattern Anal Mach Intell 24(8):1140–1146

32. Li Z, Liu J, Chen S, Tang X (2007) Noise robust spectral clustering. In: IEEE international conference on computer vision

33. Liu Y, Hays JH, Xu YQ, Shum HY (2005) Digital papercutting. Technical sketch, SIGGRAPH

34. Lowe DG (1999) Object recognition from local scale-invariant features. In: IEEE international conference on computer vision

35. Loy G, Eklundh JO (2006) Detecting symmetry and symmetric constellations of features. In: European conference on computer vision, pp II-508–II-521

36. Lucas BD, Kanade T (1981) An iterative image registration technique with an application to stereo vision. In: DARPA image understanding workshop

37. Marola G (1989) On the detection of the axes of symmetry of symmetric and almost symmetric planar images. IEEE Trans Pattern Anal Mach Intell 11:104–108

38. Martin D, Fowlkes C, Malik J (2004) Learning to detect natural image boundaries using local brightness, color, and texture cues. IEEE Trans Pattern Anal Mach Intell 26(5):530–549

39. Matas J, Chum O, Urban M, Pajdla T (2002) Robust wide baseline stereo from maximally stable extremal regions. In: British machine vision conference

40. Mikolajczyk K, Schmid C (2002) An affine invariant interest point detector. In: European conference on computer vision

41. Mikolajczyk K, Schmid C (2004) Scale and affine invariant interest point detectors. Int J Comput Vis 60(1):63–86

42. Mikolajczyk K, Schmid C (2005) A performance evaluation of local descriptors. IEEE Trans Pattern Anal Mach Intell 27(10):1615–1630

43. Niebles JC, Fei-Fei L (2007) A hierarchical model of shape and appearance for human action classification. In: IEEE conference on computer vision and pattern recognition

44. Niebles JC, Wang H, Fei-Fei L (2006) Unsupervised learning of human action categories using spatial-temporal words. In: British machine vision conference

45. Obdržálek S, Matas J (2002) Object recognition using local affine frames on distinguished regions. In: British machine vision conference

computer vision and pattern recognition

51. Shin YM, Cho M, Lee KM (2010) Co-recognition of actions in video pairs. In: International conference on pattern recognition

52. Simon I, Seitz SM (2007) A probabilistic model for object recognition, segmentation, and non-rigid correspondence. In: IEEE conference on computer vision and pattern recognition

53. Sivic J, Russell BC, Efros AA, Zisserman A, Freeman WT (2005) Discovering object categories in image collections. In: IEEE international conference on computer vision

54. Steele KL, Egbert PK (2005) Correspondence expansion for wide baseline stereo. In: IEEE conference on computer vision and pattern recognition

55. Todorovic S, Ahuja N (2007) Unsupervised category modeling, recognition, and segmentation in images. IEEE Trans Pattern Anal Mach Intell 30(12):2158–2174

56. Toshev A, Shi J, Daniilidis K (2007) Image matching via saliency region correspondences. In: IEEE conference on computer vision and pattern recognition

57. Tuytelaars T, Mikolajczyk K (2008) Local invariant feature detectors: a survey. Found Trends Comput Graph Vis 3(3):177–280

58. Vedaldi A, Soatto S (2006) Local features, all grown up. In: IEEE conference on computer vision and pattern recognition

59. Yuan J, Wu Y (2007) Spatial random partition for common visual pattern discovery. In: IEEE international conference on computer vision, pp 1–8

In document Advanced Topics in Computer Vision (Page 147-152)