A segmentation method segments selected objects from other objects within a video image frame. The objects may be of arbitrary configuration and preferably represent distinct image features in a display image. An interior outline is formed within the interior of the object perimeter. The interior outline is expanded automatically to form an exterior outline. Pixels between the interior outline and the exterior outline are classified according to predefined attributes as to whether they are within object interior, thereby to identify automatically a conformed outline that is fitted to the object perimeter.
In an original image, an area whose brightness is smaller than a predetermined value is detected as an ink line area, and a neighboring area of the ink line area, which surrounds the ink line area, is detected as a neighboring area. Furthermore, with respect to an area other than the ink line area and the neighboring area in the original image, an outline portion of an image is detected as an outline area. A color is applied to the ink line area and the outline area, and a different color is applied to an area other than the above two types of area.
In an original image, an area whose brightness is smaller than a predetermined value is detected as an ink line area, and a neighboring area of the ink line area, which surrounds the ink line area, is detected as a neighboring area. Furthermore, with respect to an area other than the ink line area and the neighboring area in the original image, an outline portion of an image is detected as an outline area. A color is applied to the ink line area and the outline area, and a different color is applied to an area other than the above two types of area.
A method for image processing including segmenting document images into text and continuous tone regions is provided. The method uses a token shape comparison and resultant data (e.g. shape recurrence) to determine which high-contrast regions of the image correspond to text or other symbols. The identified text is then separated from the image and processing ensues.
A video compression method and apparatus uses an active decoder. The corresponding encoder can produce an encoded bitstream with a greatly reduced overhead by encoding a reference frame based on the structural information inherent to the image (e.g., image segmentation, geometry, color, and/or brightness), and then predicting other frames relative to the structural information. Typically, the description of a predicted frame would include kinetic information (e.g., segment motion data and/or associated residues representing information in previously occluded areas and/or inexact matches and appearance of new information, and portion of the segment evolution that is not captured by motion per se, etc.). Because the decoder is capable of independently determining the structural information (and relationships thereamong) underlying the predicted frame, such information need not be explicitly transmitted to the decoder. Rather, the encoder need only send information that the encoder knows the decoder cannot determine on its own.
A method and apparatus for representing motion in a sequence of digitized images derives a dense motion vector field and vector quantizes the motion vector field.