55:148 Digital Image Processing
55:247 Image Analysis and Understanding

Chapter 6, Part I
Shape representation and description: Region identification

Shape representation and description

Defining the shape of an object can prove to be very difficult. Shape is usually represented verbally or in figures.
There is no generally accepted methodology of shape description. Further, it is not known what in shape is important.

Current approaches have both positive and negative attributes; computer graphics or mathematics use effective shape representations which are unusable in shape recognition and vice versa.
In spite of this, it is possible to find features common to most shape description approaches.

Common shape description methods can be characterized from different points of view

Input representation form: Object description can be based on boundaries or on more complex knowledge of whole regions
Object reconstruction ability: That is, whether an object's shape can or cannot be reconstructed from the description.
Incomplete shape recognition ability: That is, to what extent an object's shape can be recognized from the description if objects are occluded and only partial shape information is available.
Local/global description character: Global descriptors can only be used if complete object data are available for analysis. Local descriptors describe local object properties using partial information about the objects. Thus, local descriptors can be used for description of occluded objects.
Mathematical and heuristic techniques: A typical mathematical technique is shape description based on the Fourier transform. A representative heuristic method may be elongatedness.
Statistical or syntactic object description.
A robustness of description to translation, rotation, and scale transformations: Shape description properties in different resolutions.

Sensitivity to scale is even more serious if a shape description is derived, because shape may change substantially with image resolution.
Therefore, shape has been studied in multiple resolutions which again causes difficulties with matching corresponding shape representations from different resolutions.
Moreover, the conventional shape descriptions change discontinuously.
A scale-space approach aims to obtain continuous shape descriptions if the resolution changes continuously.

In many tasks, it is important to represent classes of shapes properly, e.g. shape classes of apples, oranges, pears, bananas, etc.
The shape classes should represent the generic shapes of the objects belonging to the same classes well. Obviously, shape classes should emphasize shape differences among classes while the influence of shape variations within classes should not be reflected in the class description.

Despite the fact that we are dealing with two-dimensional shape and its description, our world is three-dimensional and the same objects, if seen from different angles (or changing position/orientation in space), may form very different 2D projections.
The ideal case would be to have a universal shape descriptor capable of overcoming these changes -- to design projection-invariant descriptors.

Consider an object with planar faces and imagine how many very different 2D shapes may result from a given face if the position and 3D orientation of this simple object changes with respect to an observer. In some special cases, like circles which transform to ellipses, or planar polygons, projectively invariant features (invariants} can be found.

Object occlusion is another hard problem in shape recognition. However, the situation is easier here (if pure occlusion is considered, not combined with orientation variations yielding changes in 2D projections as discussed above), since visible parts of objects may be used for description.
Here, the shape descriptor choice must be based on its ability to describe local object properties -- if the descriptor only gives a global object description, such a description is useless if only a part of an object is visible. If a local descriptor is applied, this information may be used to compare the visible part of the object to all objects which may appear in the image. Clearly, if object occlusion occurs, the local or global character of the shape descriptor must be considered first.

Region identification

Region identification is necessary for region description. One of the many methods for region identification is to label each region (or each boundary) with a unique (integer) number; such identification is called labeling or coloring, also connected component labeling.
Goal of segmentation was to achieve complete segmentation, now, the regions must be labeled.

Label collision is a very common occurrence -- examples of image shapes experiencing this are U-shaped objects, mirrored E objects, etc.
The equivalence table is a list of all label pairs present in an image; all equivalent labels are replaced by a unique label in the second step.
The algorithm is basically the same in 4-connectivity and 8-connectivity, the only difference being in the neighborhood mask shape.

Last Modified: February 4, 1997