Owen Carmichael : Research : Object Recognition In Images

Home / News

Affiliations / Contact

Research

Medical Image Analysis

Object Recognition From Images

Object Recognition From Range Data

3D Modeling From Range Data

Super-Resolution

Image Retrieval

 

Papers

Teaching

Community Resources

Reading Groups

Leisure

 

 

My thesis work addressed the problem of recognizing so-called ÒwiryÓ objects. Wiry objects are distinguished by a prevalence of very thin, elongated, stick-like components; examples include tables, chairs, bicycles, and desk lamps. They are difficult to recognize because their shapes are complex and they tend to lack distinctive color or texture characteristics.  Recognizing them in images is important in a number of problem areas because they are relatively common.   Our approach takes as input a set of training images of a target object in typical background environments.  Binary edges extracted from the training images are labeled as belonging to the target object or the clutter.  Here is an example training image: edges on the ladder are marked in green, and edges in the background are marked in red.  Click on the image for a full-scale version.

Using these training images, we train a cascade of classifiers to discriminate object edges from background edges by automatically selecting informative shape cues.  Below is an example result of our algorithm applied to a test image: the test image (top left), detected edges shown in green (top right), edges classified as ÒladderÓ edges by our algorithm (bottom left), and post-processing to cluster all the ladder edges into a final box answer (bottom right).

In my thesis there are more results on the ladder in different environments (a lab, cubicle, warehouse, apartment, conference room, and classroom, respectively).  There are also results on recognizing a chair, a push cart, and a stool.  Finally, the thesis presents an extension of this technique so that edge detection and recognition are automatically optimized as a single, unified process.

Currently, Honda Corporation is evaluating this technique to see if they want to incorporate it into the vision systems on their humanoid robots, shown above.

 

Thesis: O. Carmichael .  Discriminative Techniques For The Recognition Of Complex-Shaped Objects.  PhD Thesis, The Robotics Institute, Carnegie Mellon University. September 18, 2003.  Technical Report CMU-RI-TR-03-34. [pdf, 7.4 MB]

  Papers: O. Carmichael, M. Hebert, Shape-based Recognition Of Wiry Objects, Proceedings CVPR 2003. [pdf,921 K]. O. Carmichael, M. Hebert, Object Recognition by a Cascade of Edge Probes, Proceedings of the British Machine Vision Conference, September 2002. [paper, pdf,481 K] [slides, html].

Web Site: WORD: Wiry Object Recognition Database.  All the images I used in my thesis (10000 or so), with ground truth.

 

---

 

Prior to my thesis, I worked on methods for parts-based recognition of objects with rich visual texture. I proposed the use of convolutional image filters, called Discriminant Filters, which are tuned so that their outputs discriminate between images of parts of the object, and images of parts of the background.  We tune the filters based on example images of a target object and clutter, where individual object parts have been labeled.  Here are two training images of a mug.

Here is an example result in which Discriminant Filters, together with a standard classifier, detect the presence of parts of the mug in a cluttered scene.  So that the display wouldnÕt get too cluttered, we only searched for parts 5, 6, ,7 ,8, 11, and 13.  ÒCÓ means Òclutter.Ó

 

Paper: Discriminant Filters For Object Recognition, CMU Technical Report, [pdf, 370 K].

 

Recently, a group of researchers at CMU used Discriminant Filters as part of a system to discriminate between tissue types in images of vocal chords.

 

Their paper: Allin, S., Galeotti, J., Dailey, S. and Stetten, G. Enhanced Snake-Based Segmentation of Vocal Folds. IEEE International Symposium on Biomedical Imaging, Washington D.C., 2004. [pdf]