Hostname: page-component-77c89778f8-rkxrd Total loading time: 0 Render date: 2024-07-17T11:25:00.209Z Has data issue: false hasContentIssue false

Application-oriented robotic vision – a review

Published online by Cambridge University Press:  09 March 2009

R. A. Jarvis
Affiliation:
Department of Computer Science, Australian National UniversityG.P.O. Box 4 Canberra A.C.T. 2600 (Australia)

Summary

Computer Vision is essentially concerned with emulating the process of seeing, naturally manifested in various higher biological systems,1–4 on a computational apparatus, and is consequently part of the Artificial Intelligence field within the sub-category of Machine perception. Seeing has to do with making sense of image data acquired through an optical system and subsequently dealt with at increasing levels of abstraction and association with known facts about the world. The spectrum of interest in Computer Vision ranges from attempting to answer basic questions concerning the functionality of biological vision systems, particularly human, at one end, all the way to enhancing the reliability, speed and cost effectiveness of specific industrial operations, particularly component inspection and vision driven robotic manipulation. The main bulk of interest is in the middle, where the quest for generality pushes interest towards biological vision systems with their demonstrated effectiveness in a wide range of environments, some hostile, whilst the need for economic viability and timeliness in relation to particular application pushes interest towards finding workable algorithms which function reliably at high speed on affordable apparatus.

This paper is addressed, in somewhat tutorial style, at clarifying, by examples of work in the area, the issues surrounding application oriented robotic vision systems, their assumptions, strengths, weaknesses and degree of generality, and at the same time putting them in the context of the overall field of Computer Vision. In addition, the paper points to directions of development which promise to provide powerful industrial vision tools at an acceptable price.

Type
Article
Copyright
Copyright © Cambridge University Press 1984

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

1.Hubel, D.H. and Wiesel, T.N., “Receptive Fields and Functional Architecture of Monkey Striate CortexJ. Physiol. (Lond.) 195, 215243 (1968) Presents some of the pioneering studies of the behaviour of cells in the visual cortex.CrossRefGoogle ScholarPubMed
2.Gregory, R.L., The Intelligent Eye (McGraw-Hill, New York, 1970).Google Scholar
3.Gibson, J.J., The Senses Considered as Perceptual Systems (Houghton-Mifflin, Boston, 1966).Google Scholar
4.Richter, K. and Ullman, S., “A Model for the Spatiotemporal Organization of X and Y-type Ganglion Cells in the Primate Retina” MIT AI Memo 573.Google Scholar
5.Muerle, J.L. and Allen, D.C., “Experimental Evaluation of Techniques for Automatic Segmentation of Objects in a Complex Scene” In: Pictorial Pattern Recognition (Thompson Book Company) (edited by Cheng, , Ledley, , Pollack, & Rosenfeld, ) (Thompson Book Co., Washington, D.C., 1968).Google Scholar
6.Brice, C.R. and Fennema, C.L., “Scene Analysis using RegionsArtificial Intelligence 1, 205226 (1970).CrossRefGoogle Scholar
7.Jarvis, R.A., “Region Based Image Segmentation Using Shared Near Neighbor Clustering” Proc. Int'l. Conf. on Cyber. & Soc. (09 19–21, 1977, Washington, D.C.).Google Scholar
8.Yakimovsky, Y. and Feldman, J.A., “A Semantics-Based Decision Theoretic Region Analyzer” Proc. 3rd Joint Int'l. Conf. on Artificial Intell. (08 1973, Stanford Univ.) pp. 580588.Google Scholar
9.Horowitz, S.L. and Pavlidis, T., “Picture Segmentation by a Directed Split-and-Merge Procedure” Proc. 2nd Int'l. Joint Conf. on Pat. Recog. (08 13–15, 1974, Copenhagen) pp. 424433.Google Scholar
10.Tomita, F., Yachida, M. and Tsuji, S., “Detection of Homogeneous Regions by Structural Analysis” Proc. IJCAI-3 (Stanford, CA, 08 1973) pp. 564571.Google Scholar
11.Jarvis, R.A., “Image Segmentation by Interactively Combining Line, Region and Semantic Structure” Proc. Comp. Graph., Pat. Recog. & Data Struc. Conf. (05 14–16, 1975, Beverley Hills, L.A., CA) pp. 278288.Google Scholar
12.Tenenbaum, J.M. and Weyl, S., “A Region-Analysis Subsystem for Interactive Scene Analysis” Proc. 4th Int'l Joint Conf. on A.I., (1975) p. 682.Google Scholar
13.Rosenfeld, A. and Thurston, M.Edge and Curve Detection for Visual Scene AnalysisIEEE Trans. on Comp. C-20, No. 5, pp 567569 (05 1971).Google Scholar
14.Roberts, L., “Machine Perception of Three-Dimensional Solids” Lincoln Laboratory Technical Report N-15 (MIT, 05 1963).Google Scholar
15.Yoda, H., Motoike, J. and Ejiri, M., “Direction Coding Method and Its Application to Scene Analysis” Proc. 4th Int'l Joint Conf. on A.I. (1975) p. 620.Google Scholar
16.Rosenfeld, A. and Thurston, M., “Edge and Curve Detection: Further ExperimentsIEEE Trans. on Comp. C-21, No. 7, 677715 (07 1972).CrossRefGoogle Scholar
17.Hueckel, M., “A Local Visual Operator which Recognizes Edges and LinesJournal of the ACM 20, No. 4, 634647 (10 1973).CrossRefGoogle Scholar
18.Hueckel, M., “An Operator which Locates Edges in Digitized PicturesJournal of the ACM 18, No. 1, 113125 (01 1971).CrossRefGoogle Scholar
19.Shirai, Y., “Edge Finding, Segmentation of Edges and Recognition of Complex Objects” Proc. 4th Int'l Joint Conf. on A.I. (1975) p. 674.Google Scholar
20.Duda, R.O. and Hart, P.E., Pattern Classification and Scene Analysis (Wiley, New York, 1973).Google Scholar
21.Nagy, G., “State of the Art in Pattern RecognitionProc. IEEE 56, No. 5, 836863 (05 1968).CrossRefGoogle Scholar
22.Patrick, E.A., Fundamentals of Pattern Recognition (Prentice-Hall, N.J., 1972).Google Scholar
23.Watanabe, S. (Editor), Methodologies of Pattern Recognition (Academic Press, New York, London, 1969).Google Scholar
24.Watanabe, S. (Editor), Frontiers of Pattern Recognition (Academic Press, New York, London, 1972).Google Scholar
25.Kailath, T., “A General Likelihood - Ratio Formula for Random Signals in Gaussian NoiseIEEE Trans. Information Theory IT-15, 350361 (05 1980).CrossRefGoogle Scholar
26.Unger, S.H., “Pattern Detection and Recognition” Proc. IRE (10 1959) pp. 17371752.CrossRefGoogle Scholar
27.Agin, G.J. & Duda, R.O., “SRI Vision Research for Advanced Industrial Application” Proc. 2nd USA-Japan Computer Conference (1975) pp. 113117.Google Scholar
28.Cover, T.M. and Hart, P.E., “Nearest Neighbor Pattern ClassificationIEEE Trans. Info. Theory IT-13, 2127 (01 1967).CrossRefGoogle Scholar
29.Patrick, E.A. and Fischer, F.P. III, “A Generalized K- Nearest Neighbor Rule”, Information and Control 16, 128152 (1970).CrossRefGoogle Scholar
30.Fu, K-S., Syntactic Methods in Methods in Pattern Recognition (Academic Press, New York, London, 1974).Google Scholar
31.Thomson, M.G., “Finite Fuzzy Automata, Regular Fuzzy Languages and Pattern RecognitionPattern Recognition 5, 383390 (1973).CrossRefGoogle Scholar
32.Zadeh, L.A., Fu, K-S., Tanaka, K. and Shimura, M., (Editors), Fuzzy Sets and Their Applications to Cognitive and Decision Processes (Academic Press, New York, San Francisco, London, 1975).Google Scholar
33.Jarvis, R.A. and Patrick, E.A., “Clustering Using a Similarity Measure Based on Shared Near-NeighborsIEEE Trans. Computers 22, 1025, (11 1973).CrossRefGoogle Scholar
34.Zahn, C.T., “Graph-Theoretical Methods for Detecting and Describing Gestalt ClustersIEEE Trans. Computers C-20, 6886 (01 1971).CrossRefGoogle Scholar
35.Anderberg, M.R., Cluster Analysis for Applications (Academic Press, NY, 1973).Google Scholar
36.Ball, G.H. and Hall, D.J., “ISODATA, An Iterative Method of Multivariate Analysis and Pattern Classification”, presented at the Inst. Commun. Conf., Philadelphia, PA (1966).Google Scholar
37.Hartigan, J.A., Clustering Algorithms (John Wiley and Sons, Chichester 1975), Chapter 11, pp. 191214.Google Scholar
38.Diday, E. and Simon, J.C., “Clustering Analysis” In: Digital Pattern Recognition (Edited by Fu, K-S.) (Springer-Verlag, Berlin, 1976).Google Scholar
39.Sacerdoti, E.D., “The Nonlinear Nature of Plans” SRI, Artificial Intelligence Group, Technical Note 101 (01, 1975).Google Scholar
40.Nilsson, N.J., “Mechanisms for Goal Seeking, Planning and Reasoning” SRI, Artificial Intelligence Group, Technical Note 130 (05 1976).Google Scholar
41.Lozano-Perez, L. and Wesley, M.A., “An Algorithm for Planning Collision-Free Paths Among Polyhedral ObstaclesCommun. ACM 22, No. 10, 560570 (10, 1979).CrossRefGoogle Scholar
42.Udupa, S.M., “Collision Detection and Avoidance in Computer Controlled Manipulators” Proc. 5th Int'l Joint Conf. on A.I. (08 22–25, M.I.T., 1977) pp. 737748.Google Scholar
43.Gleason, G.J. and Algin, G.J., “A Modular Vision System for Sensor-Controlled Manipulation and Inspection” Proc. of 9th International Symposium on Industrial Robots, Washington, D.C. (03, 1979) pp. 5770.Google Scholar
44.Kinnucan, P., “How Smart Robots are Becoming SmarterHigh Technology, 1, No. 1, 42–40 (09/10, 1981).Google Scholar
45.Reinhold, A.G. and Vanderbrug, G., “Robot Vision for Industry: The Autovision SystemRobotics Age 2, No. 3, 2228 (Fall, 1980).Google Scholar
46.Rosenfeld, A. and Pfaltz, J.L., “Sequential Operations in Digital Picture ProcessingJ. ACM 13, No. 4, 471494 (10, 1966).Google Scholar
47.Holland, S.W., Rossol, L. & Ward, M.R., “Consight-I: A Vision-Controlled Robot System for Transferring Parts from Belt Conveyors” In: Computer Vision and Sensor-Based Robotics (edited by Dodd, G.G. and Rossol, L.), (Plenum Press, 1979) pp. 81100.CrossRefGoogle Scholar
48.Bolles, R.C. and Cain, R.A., “Recognising and Locating Partially Visible Objects: The Local-Feature-Focus Method”, Intern. J. Robot. Res. 1, No. 3, 5782 (Fall, 1982).CrossRefGoogle Scholar
49.Gonzalez, R.C. and Wintz, P., Digital Image Processing (Addison-Wesley, London, 1977).Google Scholar
50.Kaneff, S. (Editor), Picture Language Machines (Academic Press, London, New York, 1970).Google Scholar
51.Haralick, R.M. and Simon, J.C. (Editors), Issues in Digital Image Processing (Sijthoff and Noordhoff, 1980).CrossRefGoogle Scholar
52.Pratt, W.K., Digital Image Processing (John Wiley and Sons, Chichester, 1978).Google Scholar
53.Rosenfeld, A. (Editor), Image Modelling (Academic Press, New York, 1981).Google Scholar
54.Stucki, P. (Editor), Advances in Digital Image Processing: Theory, Application, Implementation (Plenum Press, New York, 1979).CrossRefGoogle Scholar
55.Nake, F. and Rosenfeld, A. (Editors), Graphic Languages (North-Holland, Amsterdam, 1972).Google Scholar
56.Grasselli, A. (Editor), Automatic Interpretation and Classification of Images (Academic Press, New York, 1969).Google Scholar
57.Rosenfeld, A., Picture Processing by Computer (Academic Press, New York, 1969).CrossRefGoogle Scholar
58.Rosenfeld, A.and Kak, A.C., Digital Picture Processing (Academic Press, New York, 1976).Google Scholar
59.Hall, E.L., Computer Image Processing and Recognition, (Academic Press, New York, 1979).Google Scholar
60.Klinger, A., Fu, K-S. and Kunii, T.L. (Editors), Data Structures, Computer Graphics and Pattern Recognition (Academic Press, New York, 1977).Google Scholar
61.Huang, T.S. (Editor), Image Sequence Analysis (Springer-Verlag, Berlin, 1981).CrossRefGoogle Scholar
62.Aggarwal, J.K., Duda, R.O. and Rosenfeld, A. (Editors), Computer Methods in Image Analysis (IEEE Press, New York, 1977).Google Scholar
63.Perkins, W.A., “A Model-Based Vision System for Industrial PartsIEEE Trans on Computers C27, No. 2, 126143 (02, 1978).CrossRefGoogle Scholar
64.Kelly, R., Birk, J., Dessimoz, J., Martins, H. and Tella, R., “Acquiring Connecting Rod Castings Using a Robot with Vision and Sensors” Proc. 1st Int'l Conf. on Robot Vision and Sensory Controls, Stratford-upon-Avon, U.K. (04 1–3, 1981) pp. 169178.Google Scholar
65.Guzman, A., “Computer Recognition of Three-Dimensional Objects in a Visual Scene” Ph.D. Thesis, MAC-TR-59, Project MAC, M.I.T. Mass (1968).Google Scholar
66.Waltz, D., “Generating Semantic Descriptions from Drawings of Scenes with Shadows” Ph.D. Thesis, M.I.T., Mass. (1972);Google Scholar
also Ch. 2 of The Psychology of Computer Vision (Winston, P.H., Ed.) (McGraw-Hill, New York, 1975).Google Scholar
67.Winston, P.H., “Learning Structural Descriptions from Examples”, Ch. 5 of The Psychology of Computer Vision (Winston, P.H., Ed.) (McGraw-Hill, New York, 1975).Google Scholar
68.Duda, R.O., “Some Current Techniques for Scene Analysis” Stanford Research Centre, A.I. Centre, Technical Note 46 (10 1970).Google Scholar
69.Nilsson, N.J., Problem-Solving Methods in Artificial Intelligence, (McGraw-Hill, New York, 1971).Google Scholar
70.Jarvis, R.A., “A Perspective on Rangefinding Techniques for Computer VisionIEEE Trans. on Pattern Analysis and Machine Intelligence PAMI-5, No. 2, 122139 (01 1983).CrossRefGoogle Scholar
71.Popplestone, R.J., Brown, C.M., Ambler, A.P. and Crawford, G.F., “Forming Models of Plane-and-Cylinder Faceted Bodies from Light Stripes” Proc. 4th Int'l. Joint Conf. A.I. (1975) pp. 664668.Google Scholar
72.Crawford, G.F., “The Stripe Finder Hardware” (Dep. Artificial Intell., Univ. Edinburgh, 1974).Google Scholar
73.Rocker, F. and Kiessling, A., “Methods for Analysing Three Dimensional Scenes” Proc. 4th Int'l Joint Conf. A.I. (1975) pp. 669673.Google Scholar
74.Will, P.M. and Pennington, N.S., “Grid Coding: A Preprocessing Technique for Robot and Machine Vision” Proc. 2nd Int'l Joint Conf. A.I. (09 1971) pp. 6668.CrossRefGoogle Scholar
75.Rosenberg, D., Levine, M.D. and Zucker, S.W., “Computing Relative Depth Relationships from Occlusion Cues” Proc. 4th Int'l Joint Conf. Pattern Recognition, Kyoto, Japan (11, 7–10, 1978) pp. 765769.Google Scholar
76.Marr, D. and Poggio, T., “Cooperative Computation of Stereo Disparity” Artificial Intelligence Memo 364, MIT A.I. Lab., Cambridge, Mass. (06 1976).CrossRefGoogle Scholar
77.Bajcsy, R. and Lieberman, L., “Texture Gradient as a Depth CueComputer Graphics and Image Processing 5, 5267 (1976).CrossRefGoogle Scholar
78.Jarvis, R.A., “Focus Optimisation Criteria for Computer Image ProcessingThe Microscope 24, 162179 (2nd., Quarter, 1976).Google Scholar
79.Nitzan, D., Brain, A.E. and Duda, R.O., “The Measurement and Use of Registered Reflectance and Range Data in Scene AnalysisProc. IEEE, 65, No. 2, 206220 (02, 1977).CrossRefGoogle Scholar
80.Lewis, R.A. and Johnston, A.R., “A Scanning Laser Rangefinder for a Robotic Vehicle”, Proc. 5th Int'l Joint Conf. A.I. (1977) pp. 762768.Google Scholar
81.Will, P.M. and Pennington, K.S., “Grid Coding: A Preprocessing Technique for Robot and Machine Vision”, Proc. 2nd Joint Int'l Conf. A.I. (09 1971) pp. 6670.CrossRefGoogle Scholar
82.Jarvis, R.A., “A Laser Time-of-Flight Range Scanner for Robotic Vision”, Dept. of Computer Science, Australian National University, Technical Report TR-CS–81–10. Also to appear in Trans IEEE on PAMI.Google Scholar
83.Shirai, Y. and Suwa, M., “Recognition of Polyhedrons with a Range Finder” Proc. 2nd Int'l Joint Conf. A.I. London (09 1971) pp. 8087.Google Scholar
84.Agin, G.J. and Binford, T.O., “Computer Description of Curved Objects” Proc. Int'l Joint Conf. A.I., Stanford Univ. (08 20–23, 1973) pp. 629640.Google Scholar
85.Yakimovsky, Y. and Cunningham, R., “A System for Extracting Three-Dimensional Measurements from a Stereo Pair of TV CamerasComput. Graphics Image Processing 7, 195210 (1978).CrossRefGoogle Scholar
86.Moravec, H.P., “Visual Mapping by a Robot Rover” Proc. 6th Int'l Joint Conf. A.I. (1979) pp. 598620.Google Scholar
87.Baker, H.H., “Edge Based Stereo Correlation” Proc. ARPA Image Understanding Workshop Univ. Maryland (04, 1980).Google Scholar
88.Julesz, B., “Binocular Depth Perception Without Familiarity CuesScience 145, pp. 356362 (1964).Google Scholar
89.Marr, D. and Hildreth, E.C., “Theory of Edge DetectionProc. R. Soc. London B 207, 187217 (1980).Google ScholarPubMed
90.Nishihara, H.K. and Larson, N.G., “Towards Real-Time Implementation of the Marr-Poggio Stereo Matcher” Proc. Image Understanding Workshop (edited by Lee, Baumann) (1981).Google Scholar
91.Jarvis, R.A., “Expedient Range Enhanced 3-D Robot Colour VisionRobotica 1, 2531 (1983).CrossRefGoogle Scholar
92.Marr, D., “Representing Visual Information - A Computational Approach” Computer Vision Systems (Hanson, A.R. and Riseman, E.M., Editors) (Academic Press, New York, 1978) pp. 6180.Google Scholar
93.Page, C.J. and Hassan, H., “Non-Contact Inspection of Complex Components Using a Rangefinder Vision System” Proc. 1st Int'l Conf. on Robot Vision and Sensory Controls, Stratford-Upon-Avon, U.K. (1981) pp. 245254.Google Scholar
94.Jarvis, R.A., “Robotic Vision Using 3D Space Cube Solid Modelling Derived from Multiple Image Projections”, submitted to 7th Australian Computer Science Conference, Adelaide (02 1984) pp. 18–1 to 18–11.Google Scholar
95.Loughlin, C. and Hudson, E., “Eye in Hand Robot Vision”, Proc. 2nd Int'l Conf. on Robot Vision and Sensory Controls, Stuttgart, Germany (11 24, 1982) pp. 263270.Google Scholar
96.Agin, G.J., “Servoing with Visual Feedback”, S.R.I., Artificial Intelligence Centre, Technical Note 149 (07 1977).Google Scholar
97.High Technology 3, No. 4, 37 (04 1983).Google Scholar