Computer vision and artificial intelligence (original) (raw)

Abstract

One of the monolithic goals of computer vision is to automatically interpret general digital images of arbitrary scenes. This goal has produced a vast array of research over the last 35 years, yet a solution to this general problem still remains out of reach. A reason for this is that the problem of visual perception is typically under-constrained. Information like absolute scale and depth is lost when the scene is projected onto an image plane. In fact, there are an infinite number of scenes that can produce the exact same image, which makes direct computation of scene geometry from a single image impossible. The difficulty of this ``traditional goal'' of computer vision has caused the field to focus on smaller, more constrained pieces of the problem. The hope is that when the pieces are put back together, a successful scene interpreter will have been created. Digital filtering, motion analysis , image registration, segmentation, and model matching schemes are all examples of areas where progress has been made in the field. Other research has focused on the general problem through the use of knowledge and context. The use of external knowledge both about the world and about the current visual task reduces the number of plausible scene interpretations and may make the problem solvable. This approach is referred to as knowledge-based vision. Work in the area of knowledge-based vision incorporates methods from the field of AI in order to focus on the influence of context on scene understanding, the role of high level knowledge, and appropriate knowledge representations for visual tasks. The importance of computer vision to the field of AI is fairly obvious: intelligent agents need to acquire knowledge of the world through a set of sensors. What is not so obvious is the importance that AI has to the field of computer vision. Indeed, I believe that the study of perception and intelligence are necessarily intertwined. This article will look at the role that knowledge plays in computer vision and how the use of reasoning, context, and knowledge in visual tasks reduces the complexity of the general problem. The importance of context and knowledge in vision has been pointed out by psychologists many times, and these ideas have driven computer vision studies as well. Take the example in Figure 1.

Loading...

Loading Preview

Sorry, preview is currently unavailable. You can download the paper by clicking the button above.

References (9)

  1. Bloch, Isabelle. 1995. Information Combination Operators for Data Fusion: A Comparative Review with Classification. IEEE Transactions on Systems, Man, and Cybernetics.
  2. Draper, Bruce. 1993. Learning Object Recognition Strategies. PhD Dissertation. CMPSCI TR93- 50. University of Massachusetts at Amherst.
  3. Jaynes, Christopher, Frank Stolle, Howard Schultz, Robert Collins, Allen Hanson, and Edward Riseman. 1996. Three-Dimensional Grouping and Information Fusion for Site Modeling from Aerial Images. In Proceedings, DARPA Image Understanding Workshop, Palm Springs, CA.
  4. Pearl, J. 1988. Probabilistic Reasoning in Intelligent Systems: networks of plausible inference, Morgan-Kaufmann Publishers.
  5. Strat, Tomas and Martin Fischler. 1995. The Role of Context in Computer Vision. In Proceedings of the International Conference on Computer Vision, Workshop on Context-Based Vision (June 19), Boston.
  6. Venkateswar V. and R. Chellappa. 1991. A Hierarchical Approach to Detection of Buildings in Aerial Images. Technical Report CS-TR-2720. Center for Automation Research, of Maryland. August.
  7. Lin, Chungan and Ramakant Nevatia. 1996. Buildings Detection and Description from Monocular Aerial Images. In Proceedings, DARPA Image Understanding Workshop, Palm Springs, CA.
  8. McKeown, David, G. Edward Bulwinkle, and Steven Cochran. 1996. Research in the Automatex Analysis of Remotely Sensed Imagery. In Proceedings, DARPA Image Understanding Workshop, Palm Springs, CA.
  9. Collins, Robert, Allen Hanson, Edward Riseman, Christopher Jaynes, Frank Stolle, Xiaoguang Wang, and Yong-Qing Cheng. 1996. UMass Progress in 3D Building Model Acquisition. In Proceedings, DARPA Image Understanding Workshop, Palm Springs, CA.