WikiPatents - Community Patent Review
Create Free Account  |  License or Sell Your Patent  |  WikiPatents Marketplace  |  WikiPatents Blog
Username:  Password:  
    
Advanced Search
Scalable digital video decompressor    
United States Patent5300949   
Link to this pagehttp://www.wikipatents.com/5300949.html
Inventor(s)Rodriquez; Arturo A. (Belmont, CA); Pietras; Mark A. (Boynton Beach, FL); Hancock; Steven M. (Boca Raton, FL); Kantner, Jr.; Robert F. (Boca Raton, FL); Rutherfoord; Charles T. (Delray Beach, FL); Wilson; Leslie R. (Boca Raton, FL)
AbstractDecompression of video segments from a sequence of differential frames is done by selected scaling of frame resolution and color depth. A frame header indicates the computational complexity of decompression of a frame, allowing selection of scales for output resolution and color depth. Decompression proceeds by retrieving a frame from the compressed video stream in elementary units. An elementary unit is characterized by types, including an unchanged type, a homogeneous type, a pattern type and a predetermined pattern type. For a retrieved elementary unit of the unchanged type, an output pointer to a display buffer is moved by an elementary unit scaled by the output resolution scale. For a retrieved elementary unit of the homogeneous type, a color retrieved from the compressed video stream is applied to an area in the display buffer corresponding to an elementary unit scaled by the output resolution scale. For a retrieved elementary unit of the predetermined pattern type, a pattern from a table of patterns is retrieved using an index from the compressed video segment as an address. Two colors for the pattern are also retrieved and applied to the pattern.
   














 Title Information Submit all comments and votes
 
Patent Text Patent PDF Print Page Summary File History
Plain text PDF images Print Summary File History
Inventor     Rodriquez; Arturo A. (Belmont, CA); Pietras; Mark A. (Boynton Beach, FL); Hancock; Steven M. (Boca Raton, FL); Kantner, Jr.; Robert F. (Boca Raton, FL); Rutherfoord; Charles T. (Delray Beach, FL); Wilson; Leslie R. (Boca Raton, FL)
Owner/Assignee     International Business Machines Corporation (Armonk, NY)
Patent assignment
All assignments
Publication Date     April 5, 1994
Application Number     07/965,129
PAIR File History     Application Data   Transaction History
Image File Wrapper   Patent Term   Fees
Litigation
Filing Date     October 22, 1992
US Classification     345/555 345/549 382/233
Int'l Classification     G09G 001/06
Examiner     Weldon; Ulysses
Assistant Examiner     Chow; Doon
Attorney/Law Firm     O'Malley; Paul W. Dillon; Andrew J. ,
Address
Parent Case    
Priority Data    
USPTO Field of Search     340/731 340/799 340/798 340/723 382/56 382/41 358/133 358/426 358/261.1 358/261.2 358/261.3 358/427 345/202 345/112 345/127 345/185
Patent Tags     scalable digital video decompressor
   
Enter a comma (,) or semicolon (;) between multiple tag words/phrases.
Describe this patent:
 Amusing   
 Clever   
 Complex   
 Efficient   
 Historic   
 Important   
 Innovative   
 Interesting   
 Practical   
 Simple   
[no votes]
Patent WIKI

Share information and news about this patent, including information and news about the technology, inventors, company, ligation and licensing.

 References Submit all comments and votes
 
*references marked with an asterisk below are user-added references
 U.S. References
 
Add a new US reference:  
ReferenceRelevancyCommentsReferenceRelevancyComments
5119442
Brown
382/235
Jun,1992

[0 after 0 votes]
5081450
Lucas

Jan,1992

[0 after 0 votes]
5046119
Hoffert
382/166
Sep,1991

[0 after 0 votes]
4124871
Morrin, II
358/444
Nov,1978

[0 after 0 votes]
 Foreign References
 Other References
 Market Review Submit all comments and votes
   
Market Size
Estimate the gross annual revenues of the relevant market sector:
> $10B
$5B - $10B
$2B - $5B
$500M - $2B
$100M - $500M
$10M - $100M
$1M - $10M
$500K - $1M
$100K - $500K
< $100K
[No votes]
$0
 
$0   $2.5B   $5B   $7.5B   $10B
Market Share
Estimate the percentage of the relevant market sector this invention will capture:
75% - 100%
50% - 74.99%
25% - 49.99%
10 - 24.99%
5 - 9.99%
2 - 4.99%
1 - 1.99%
< 1%
[No votes]
0.0%
 
0%   25%   50%   75%   100%
Reasonable Royalty
What percentage of gross sales should the inventor or assignee be paid?
75% - 100%
50% - 74.99%
25% - 49.99%
10 - 24.99%
5 - 9.99%
2 - 4.99%
1 - 1.99%
< 1%
[No votes]
0.0%
 
0%   25%   50%   75%   100%
Public's "Guesstimation" of Royalty Value
Market SizeN/A[No votes]
xMarket ShareN/A[No votes]
xReasonable RoyaltyN/A[No votes]

N/A

License Availablity
If you are NOT the owner or assignee, answer here:
Yes, license is available for purchase

No, license is not currently available



[No votes]
License Availablity
If you ARE the owner or assignee, answer here:
Yes, license is available for purchase

No, license is not currently available



[No votes]
Competitive Advantage
Does this invention have a significant competitive advantage over similar technologies?
Yes

No



[No votes]
Most helpful competitive advantage comment
[No comments]

Commercial Alternatives
Are there viable commercial alternatives for this invention?
Yes

No



[No votes]
Most helpful commercial alternative comment
[No comments]

 Technical Review Submit all comments and votes
 Claims Submit all comments and votes
 


What is claimed is:

1. A method of decompressing a compressed video segment including a sequence of frames with selected scaling of frame resolution and color depth for playback on a playback platform, the method comprising the steps of:

generating an output resolution scale for a frame;

retrieving the frame from the compressed video stream in elementary units;

determining a display type for each retrieved elementary unit, the display types including an unchanged type, a homogeneous type, and a pattern type;

for a retrieved elementary unit of the unchanged type, moving an output pointer to a display buffer by an elementary unit scaled by the output resolution scale;

for a retrieved elementary unit of the homogeneous type, applying a color retrieved from the compressed video stream to an area in the display buffer scaled by the output resolution scale; and

for a retrieved elementary unit of the pattern type, retrieving a pattern from the compressed video stream and applying the pattern to an area in the display buffer scaled by the output resolution scale.

2. A method as set forth in claim 1, wherein the display types further includes a predetermined pattern type, the method further comprises the step of:

for a retrieved elementary unit of the predetermined pattern type, retrieving a pattern from a table of patterns using an index retrieved from the compressed video segment and applying the pattern to an area in the display buffer scaled by the output resolution scale.

3. A method as set forth in claim 2, wherein a predetermined pattern and a pattern are each mapped by a binary bit map, and further comprising:

for a retrieved elementary unit of the pattern type or the predetermined pattern type, applying first and second colors retrieved from the compressed video segment to the off and on values, respectively, of the binary bit map; and

setting a color depth for colors upon retrieval from the compressed video segment.

4. A method as set forth in claim 3, wherein the step of determining a display type for each retrieved elementary unit comprises:

retrieving from the compressed video segment an elementary unit group block header wherein the block header is divided into a sequence of segments, each segment corresponding to an elementary unit for the frame and each segment having a value corresponding to a display type.

5. A method as set forth in claim 4, and further comprising:

determining an order for display of the elementary units of an elementary unit group from the sequence of the segments in a block header.

6. A method as set forth in claim 5, wherein the order is in the direction of rows of a frame.

7. A method as set forth in claim 5, wherein the order is for quadrants of a rectangular group of elementary units.

8. A method as set forth in claim 7, wherein the display types for an elementary unit further include a subsampled type, and further comprising:

responsive to determination that an elementary unit is of the subsampled type, recovering four colors for the elementary unit and applying one each of the colors to each of four quadrants of the elementary unit.

9. A method as set forth in claim 8, and further comprising:

upon detection of codes for both the pattern or predetermined pattern type and the subsampled type in a block header, applying subsequent video information retrieved from the compressed video segment to group of elementary units covered in a block header as a group.

10. A method as set forth in claim 3, wherein the step of determining a display type for each retrieved elementary unit includes:

retrieving a first color information block for an elementary unit from the compressed video segment;

determining from a least significant bit for a color or luminance value in the first color information block whether the elementary unit is of the homogeneous type;

in response to determination that the elementary unit is not of the homogeneous type, retrieving a second color information block for the elementary unit; and

determining from a least significant bit for a color or luminance value in the second color information block whether the elementary unit is the pattern or predetermined pattern type.

11. A method as set forth in claim 10, and further comprising:

responsive to determination that an elementary unit is of the predetermined pattern type, retrieving a pattern using an index recovered from the compressed video segment into a table of patterns;

responsive to determination that an elementary unit is of the pattern type, retrieving a pattern from the compressed video segment; and

applying colors defined by the first and second color information blocks to the pattern.

12. A method as set forth in claim 11, wherein the compressed video segment includes a bit difference map indicating the locations of changed elementary units.

13. A method as set forth in claim 4, and further comprising:

identifying an index value retrieved from the compressed video segment as one of a plurality of escape codes, wherein one set of escape code values indicate runs of unchanged elementary units and another set of escape code values indicate runs of homogeneous elementary units.

14. A data processing system for decompressing a compressed video segment including a sequence of frames with selected scaling of frame resolution and color depth for playback on a playback platform, the data processing system comprising:

a display buffer;

means for supplying a compressed video segment including video information blocks associated with non-overlapping areas of a frame;

means for generating an output resolution scale for the frame;

means for retrieving the frame by elementary units;

means for determining a display type for each retrieved elementary unit, the display types including an unchanged type, a homogeneous type and a pattern type;

means responsive to retrieval of an elementary unit of the unchanged type for moving an output pointer to a display buffer by an elementary unit scaled by the output resolution scale;

means responsive to retrieval of an elementary unit of the homogeneous type for applying a color retrieved from the compressed video stream to an area in a frame stored in the display buffer scaled by the output resolution scale; and

means responsive to retrieval of an elementary unit of the pattern type for retrieving a pattern from the compressed video stream and for applying the pattern to an area in the display buffer scaled by the output resolution scale.

15. A data processing system as set forth in claim 14, and further comprising:

a predetermined pattern type display type; and

means responsive to retrieval of an elementary unit of the predetermined pattern type for retrieving a pattern from a table of patterns using an index retrieved from the compressed video segment and for applying the pattern to an area in the display buffer scaled by the output resolution scale.

16. A data processing system as set forth in claim 15, wherein a predetermined pattern and a pattern are each mapped by a binary bit map, and further comprising:

means responsive to retrieval of an elementary unit of the pattern type, or the predetermined pattern type, for applying first and second colors retrieved from the compressed video segment to the off and on values, respectively, of the binary bit map; and

means for setting a color depth for colors upon retrieval from the compressed video segment.

17. A data processing system as set forth in claim 16, wherein the means for determining a display type for each retrieved elementary unit comprises:

means for retrieving from the compressed video segment an elementary unit group block header wherein the block header is divided into a sequence of segments, each segment corresponding to an elementary unit for the frame and each segment having a value corresponding to a display type.

18. A data processing system as set forth in claim 17, and further comprising:

means for determining an ordering for display of the elementary units of an elementary unit group from the sequence of the segments in a block header.

19. A data processing system as set forth in claim 18, wherein the ordering is in the direction of rows of a frame.

20. A data processing system as set forth in claim 18, wherein the ordering is for quadrants of a rectangular group of elementary units.

21. A data processing system as set forth in claim 20, wherein the display types for an elementary unit further include a subsampled type, and further comprising:

means responsive to determination that an elementary unit is of the subsampled type for recovering four colors for the elementary unit and applying one each of the colors to each of four quadrants of the elementary unit.

22. A data processing system as set forth in claim 21, and further comprising:

means responsive to detection of codes for both the pattern or predetermined pattern type and the subsampled type in a block header for applying subsequent video information retrieved from the compressed video segment to group of elementary units covered in a block header as a group.

23. A data processing system as set forth in claim 14, wherein the means for determining a display type comprises an optimized routine for each possible value which can be assumed by the elementary unit group block header.

24. A data processing system as set forth in claim 16, wherein the means for determining a display type for each retrieved elementary unit includes:

means for retrieving a first color information block for an elementary unit from the compressed video segment;

means for determining from a least significant bit for a color or luminance value in the first color information block whether the elementary unit is of the homogeneous type;

means responsive to determination that the elementary unit is not of the homogeneous type for retrieving a second color information block for the elementary unit; and

means for determining from a least significant bit for a color or luminance value in the second color information block whether the elementary unit is the pattern or predetermined pattern type.

25. A data processing system as set forth in claim 24, and further comprising:

means responsive to determination that an elementary unit is of the predetermined pattern type for retrieving a pattern using an index recovered from the compressed video segment into a table of patterns;

means responsive to determination that an elementary unit is of the pattern type for retrieving a pattern from the compressed video segment; and

means for applying colors defined by the first and second color information blocks to the pattern.

26. A data processing system as set forth in claim 25, wherein the compressed video segment includes a bit difference map indicating the locations of changed elementary units.

27. A data processing system as set forth in claim 17, and further comprising:

means for identifying an index value retrieved from the compressed video segment as one of a plurality of escape codes, wherein one set of escape code values indicate runs of unchanged elementary units and another set of escape code values indicate runs of homogeneous elementary units.

28. A compressed video segment frame for supporting decompression on a plurality of platforms with differing calculating capacities by providing selectable scaling of output color and spatial resolution, the compressed video segment frame comprising:

a compressed frame;

a frame header containing the number of calculations required to decompress the compressed frame at each of a plurality of output color and spatial resolutions;

a plurality of block headers identifying block of elementary regions of the compressed frame and position of the elementary regions within the compressed frame;

sections of the block headers identifying by use of selected values, the elementary regions as to one of a plurality of types of elementary region, including unchanged, pattern, and homogeneous; and

color blocks for pattern, predetermined pattern and homogeneous type elementary units.

29. A compressed video segment frame for supporting decompression with selectable scaling of output color and spatial resolution as set forth in claim 28, wherein certain combinations of the selected values for the sections of the block headers identify a group of elementary regions for as to one of a plurality of types as a whole.

30. A compressed video segment frame for supporting decompression with selectable scaling of output color and spatial resolution as set forth in claim 28, and further comprising:

a selected value for sections of the block headers identifying an elementary unit as being of a predetermined pattern, the compressed video segment; and

an index block into a table of patterns.

31. A compressed video segment frame for decompression on a plurality of platforms with differing calculating capacities by providing selectable scaling of output color and spatial resolution, the compressed video segment frame comprising:

a compressed frame;

a frame header containing the number of calculations required to decompress the compressed frame at each of a plurality of output color and spatial resolutions;

at least a first color block for each of a plurality of elementary regions of the compressed frame;

a selected bit location in the first color block for designating an elementary region as homogeneous;

a second color block for some of the plurality of elementary regions of the compressed frame; and

a selected bit location in the second color block for designating an elementary region as a pattern or, in combination with the selected bit location in the first color block, as being of another type.

32. A compressed video segment frame for supporting decompression with selectable scaling of output color and spatial resolution as set forth in claim 31, and further comprising:

a bit difference map following the frame header indicating the locations of changed elementary regions in the compressed frame.
 Description Submit all comments and votes
 


BACKGROUND OF THE INVENTION

1. Technical Field

The invention relates to the decompression for playback of video segments on a data processing system. More particularly, the invention relates to a system and a method of decompressing video data while enabling scaling of frame resolution and color depth for playback of a video segment by the playback platform.

2. Description of the Related Art

A video signal comprises a sequence of frames, which when displayed at a given minimum frame rate (e.g., 15 to 30 frames per second in a personal computer), simulate the appearance of motion to a human observer. In a personal computer system, each frame of the video image comprises a matrix of picture elements or "pixels." A common image matrix has 320 columns by 240 rows of pixels. A pixel is the minimum unit of the picture which may be assigned a luminance intensity, and in color video, a color. Depending upon the data format used, as many as three bytes of data can be used to define visual information for a pixel. A pixel by pixel color description of all pixels for an entire frame can require over two hundred thousand bytes of data. Spatial resolution of an image is increased by increases in the number of pixels.

To display a video segment, if such full frames were replaced at a frame rate of 30 frames per second, a computer could be required to recover from storage and write to video memory as many as 27 million bytes of data each second. Few contemporary mass data storage devices have both the bandwidth required to pass such quantities of data or the storage capacity to hold more than a few minutes worth of digital video information directly stored. As used here, bandwidth means the volume of data per unit time which can be recovered from an auxiliary storage device. Data compression is used to accommodate auxiliary storage devices in the storage and recovery of video segments for playback in real time and to reduce traffic on the system bus.

Data compression allows an image or video segment to be transmitted and stored in substantially fewer bytes of data than required for full frame reproduction. Data compression can be based on eliminating redundant information from frame to frame in a digitized video segment (temporal compression), or by eliminating redundant information from pixel to pixel in individual frames (spatial compression). In addition, compression may exploit superior human perception of luminance intensity detail over color detail by averaging color over a block of pixels while preserving luminance detail.

Frame differencing compression methods exploit the temporal redundancy that exists between digital video frames from the same scene recorded moments apart in time. This reduces the required data needed to encode each frame. Two successive frames from a sequence of digital motion video frames are compared region by region. The comparison process determines whether two corresponding regions are the same or different. The size and location of each region, and the nature of the comparison are outside the scope of this invention.

Before temporal redundancy can exist, one frame necessarily represents a point in time after another frame. If the field of view of the frames is unchanged, then the regions from a frame at period N do not need to be encoded and stored if the regions in a frame at period N-1 are already known. When change has occurred, the changed regions of the later frame must be encoded and stored. When each region of two frames have been compared, and changed regions of the later period encoded and stored, the process moves to the next pair of frames. During playback, the decompression process adds the stored information for each period to the current state of the display memory using a process that is the logical reverse of the encoding process. This is called conditional replenishment.

When there is very little temporal redundancy in a digital motion video the method fails. However, in a motion video sequence of a flower growing, shot at 30 frames per second, frames contain a great deal of temporal redundancy and compress well using frame differencing. Similarly a sequence recorded through a moving camera will contain little redundancy and not compress well, assuming motion compensation algorithms are not employed.

While compression makes it possible to store and reproduce video segments on personal computers, the quantities of data involved and the computational load imposed on the system central processor still tax the capacity of many contemporary personal computers, particularly low end machines based on the Intel 8086/88 family of microprocessors. Large capacity machines designed for multitasking of applications and having advanced video adaptors have an easier time handling video segments, unless two or more video segments are required to be simultaneously reproduced.

A way of providing portability of decompressed data between machines of different capacity is to introduce resolution and color depth scalability. Resolution scalability allows the playback platform to change the number of pixels in an output image. A common display resolution is a matrix of 320.times.240 pixels. Other resolutions include 640.times.480 pixels and 160.times.120 pixels. Color depth scaling is used to reduce (or increase) the number of shades of color displayed. Use of such scaling would be enhanced were portable processes available to recognize and decode compressed video segments set up to support such scaling by the playback platform itself.

Three methods have been employed to support resolution and color depth scaling. All three techniques are targeted for use in transmission channel applications, where compressed video information can be transmitted and reconstructed progressively.

One technique involves use of image hierarchies. Each frame of a video segment is compressed at a plurality of spatial resolutions. Each level represents a different level of compression obtained by subsampling of 2 by 2 pixel regions of the next higher level of resolution. The resolution of the base level is the same as the raw data frame. Frame resolution scaling is obtained by selecting a particular level of resolution at the decompression platform. Time differential compression between frames is still done, but by selecting low resolution levels, low powered computers can decompress the stream.

A second technique known in the art is called bit-plane scalability. Here each frame in the video segment is compressed by encoding the bit-planes of the color information independently. Each bit-plane has the same spatial resolution as the original image frame. Each compressed video frame is organized from the most-significant bit (MSB) planes to the least-significant bit (LSB) planes. Color scalability is obtained by only decompressing and displaying the compressed higher order bit-planes of each frame.

A third technique known in the art is called subband coding. In subband coding, an image is decomposed into different frequency bands, downsampling the spatial resolution of each produced band, and compressing each frequency subband with a suitable compression technique independently (e.g. vector quantization). Scalability is obtained by only decompressing, upsampling and displaying the compresssed low-passed frequency bands in lower end machines, and progressively decoding higher and higher-passed frequency bands in progressively more powerful machines.

SUMMARY OF THE INVENTION

The present invention provides a system and a method for decompressing video segments including a sequence of difference frames. Selected scaling of frame resolution and color depth upon playback on a playback platform is simplified by the process. A frame header may indicate the computational complexity of decompression of a frame at a plurality of spatial resolutions and color depths. This allows selection by the decompressor of a scale for output resolution and another scale for color depth. Decompression proceeds by retrieving a frame from the compressed video stream in elementary units. Elementary units relate to non-overlapping rectangular areas of the frame to be decompressed. An elementary unit is characterized by type, including an unchanged type, a homogeneous type, a pattern type and a predetermined pattern type. For a retrieved elementary unit of the unchanged type, an output pointer to a display buffer is moved by an elementary unit scaled by the output resolution scale. For a retrieved elementary unit of the homogeneous type, a color retrieved from the compressed video stream is applied to an area in the display buffer corresponding to an elementary unit scaled by the output resolution scale.

For a retrieved elementary unit of the predetermined pattern type, a pattern from a table of patterns is retrieved using an index from the compressed video segment. Two colors for the pattern are also retrieved and applied to the pattern. The result is written to an area in the display buffer corresponding to an elementary unit scaled by the output resolution scale. For a retrieved elementary unit of the pattern type, a pattern is retrieved from the compressed video stream. Two colors are retrieved and applied to the pattern and the result is written to an area in the display buffer corresponding to an elementary unit scaled by the output resolution scale.

BRIEF DESCRIPTION OF THE DRAWINGS

The novel features believed characteristic of the invention are set forth in the appended claims. The invention itself however, as well as a preferred mode of use, further objects and advantages thereof, will best be understood by reference to the following detailed description of an illustrative embodiment when read in conjunction with the accompanying drawings, wherein:

FIG. 1 is a pictorial view of a personal computer;

FIG. 2 is a block diagram of a data processing system for reproduction of video segments;

FIG. 3 is a schematic illustration of a protocol for a compressed video segment;

FIGS. 4A and 4B are a logical flow chart of a process for decompressing video data in accordance with a first embodiment of the invention;

FIGS. 5A, 5B and 5C are a logical flow chart of a process for decompressing video data in accordance with a second embodiment of the invention;

FIGS. 6A and 6B are a schematic illustration of derivation of a difference bit map for a frame of video data;

FIGS. 7A and 7B are a schematic illustration of a protocol for a compressed video segment in accordance with a third embodiment of the invention;

FIG. 8 is a logical flow chart of a process for decompressing video data compressed in accordance with the protocol of FIG. 7;

FIG. 9 is a continuation of the logical flow chart of FIG. 8; and

FIG. 10 is a continuation of the logical flow chart of FIG. 8.

DETAILED DESCRIPTION OF THE INVENTION

With reference now to the figures and in particular with reference to FIG. 1, there is depicted a pictorial representation of a personal computer system 10 which may be utilized in accordance with the method of the present invention. Personal computer system 10 includes a computer 12, preferably provided by utilizing an IBM Personal System 2 or similar system. Personal computer system 10 generally includes a video display 14 and a keyboard 16 connected to the computer by cable 18. Video display device 14 and keyboard 16 are utilized to allow user input to computer 12 and to provide user perceivable messages such as video segments 20.

FIG. 2 is a block diagram of personal computer system 10. Computer system 10 is based on a system bus 22 on which data is passed between components of computer system 10. Among components connected to system bus 22 are a central processing unit (CPU) 24, which may be based on an Intel 8086/88 or more powerful microprocessor. CPU 24 executes programs stored in system memory 26 and manipulates data stored in system memory 26. A video segment may be stored in a compressed form on a compact disc-read only memory (CD-ROM) 28 which is accessed by CPU 24 through a device controller 30 connected to system bus 22. Depending upon the capability of the computer system, frames of a video segment stored on CD-ROM 28 may be transferred to system memory 26 for decompression by CPU 24 or to video buffer 36 in a display adaptor 32, where the data can be decompressed by a display adaptor controller 34. Depending on the system, video buffer 36 may or may not be available. Low capacity systems lacking such a video buffer require retrieval and decompression of a video segment through CPU 24 and system memory 26. Thus video data may be displayed on display 14 from system memory 26 or out of video buffer 36.

FIG. 3 illustrates a protocol for a compressed video data stream 38 which is decoded by the first and second embodiments of the present invention. During compression, each frame of a raw video sequence is analyzed by decomposing the frame into a plurality of non-overlapping, contiguous rectangular regions termed elementary units. Each elementary unit is a matrix of X columns by Y rows of pixels. Typically an elementary unit is a four by four matrix of pixels from a frame. The decompression platform will treat each frame as a matrix of such elementary units. According to the power of the computer, i.e. the number of instructions per second available to decompress a compressed video segment, the computer selects the appropriate scaling factor for which to reconstruct a video frame from the elementary units of the compressed video clip. The decompressor may require that elementary units be scaled to a smaller size in less powerful computers or may allow a larger size for more powerful computers. Generally, the same scaling factor is used to decompress all the frames in the video segment. However, each frame or set of consecutive frames in the video segment can contain information stating the number of operations required to decompress the frame or set of frames at different resolutions and/or color depths. This information will be available in a frame header 40. When this information is available, the decompressor may elect to adaptively change the scaling factors to provide higher video quality for some frames. The frame header may also include information relating to the size of the frame and the number of bytes in the compressed information blocks which follow a frame header.

If a video segment was compressed at 320.times.240 pixels and display at 640.times.480 pixels or any other level is desired, the appropriate parameters for scaling are computed once during decompression of a series of frames. The parameter of an elementary unit are modified once for a frame or segment.

In one embodiment, each elementary unit corresponds to one of four types: unchanged, homogeneous, pattern, or predetermined pattern. The type of each elementary unit is specified in the compressed video stream. All embodiments of the invention allow these four types of elementary units, and may add others. The playback platform is optimized for appropriate scaling to eliminate or minimize the computations required to adjust the size of each elementary unit during real-time playback.

The various embodiments of the invention specify position of elementary units in one of two ways. In the protocol of stream 38, elementary units are organized as quadruples which occur either: (1) sequentially in the X direction along a plurality of rows or (2) as quadrant sections of a two by two matrix of elementary units. Position in the final output display is then determined by position in stream 38. Following a frame header 40 is an exemplary byte header 42 which defines a quadruple elementary unit group. Byte header 42 includes four dyads 44, 46, 48 and 50. Each dyad can be set to one of four values ranging from 0 to 3. Each value corresponds to one of the four elementary unit types allowed by the protocol. As illustrated, dyad 44 is set to 0, dyad 46 is set to 1, dyad 48 is set to 2 and dyad 50 is set to 3, as indicated by binary representations of those numbers under the dyad. Position of a dyad in the sequence is linked to the position of the elementary unit in the eventual display field by a convention. An unchanged elementary unit requires no video data in stream 38 and accordingly none is supplied. Thus no blocks of data in stream 38 correspond to dyad 44. Scalability for an unchanged elementary unit is obtained by skipping the scaled elementary unit. Where stream 38 is used for an intraframe no dyads will have the value 0. An intraframe is a frame which has been spatially compressed rather than temporarily compressed. An intraframe would typically occur upon a change of scene.

Dyad 46 has the value 1, which indicates it corresponds to a homogeneous elementary unit. A homogeneous elementary unit is an elementary unit characterized by a single color value. Scalability of a homogeneous elementary unit is obtained by displaying the specified color over the scaled size of the elementary unit in the output display frame. Because a color must be specified for dyad 46, a color block 52 from stream 38 contains the color for dyad 46. Color may be coded either as an RGB8, an RGB16, an RGB24 format, or in a YUV16 or YUV24 format. Dyad 48 corresponds to the third elementary unit of a quadruple and has been set to the value 2 which corresponds to occurrence of a predetermined pattern elementary unit. The playback platform, upon occurrence of the value 2 in a dyad, will recover the next block from stream 38, which will be an index block 54 into a table of predetermined patterns 56 which resides in memory accessible to the decompressor. The table of patterns comprises a plurality of bit maps. The bit map in each pattern represents a scaled version of the elementary unit used during compression. The bit map has a pattern of ones and zeros which correspond to a first and second colors defined for the pattern. The playback platform will recover those colors from a color block 58 and a color block 60 and apply them respectively to the bits valued 1 and 0 in the pattern recovered from table 56. Scaling for predetermined pattern elementary units is accomplished by the playback platform invoking a version of the table of patterns that satisfies the condition that the size of each binary pattern equals the size of the scale of the size of the elementary unit in the reconstructed frame. Where an elementary unit is scaled down into a single pixel (i.e. a one by one region) the decompressor averages the two colors specified for the pattern.

A fourth dyad 50 is set to 3, which indicates that the elementary unit has a pattern block in the compressed stream 38. The playback platform recovers a pattern block 62 from the stream and two color blocks 64 and 66 as output values for a scaled output elementary units. A binary pattern of pattern block 62 is ordered in a raster scan of the elementary unit.

Pattern elementary units represent the greatest computational load for a decompressor to scale. To enable a wider spectrum of scalability, use of pattern elementary units in a compressed video segment is omitted or restricted. Where the number of pattern elementary units is restricted, scalability for pattern elementary units is obtained by scaling the binary pattern by a scale factor to satisfy the conditions that: (1) the average color in an elementary unit is preserved as closely as possible without modifying the specified color values; and (2) the direction of the gradient of the binary pattern prior to scaling is preserved as closely as possible after scaling. These conditions essentially dictate that the proportion of zeros and ones over the pattern be preserved as closely as possible and that the relative location of zeros and ones over the pattern be preserved as closely as possible. The gradient direction is defined as the angle specified by the arc tangent of the change, from bottom to top, of the binary values in the pattern over the change, from left to right, of the binary values in the pattern. The gradient direction is computed for vertical change as the sum of the binary values in the top half of the pattern minus the sum of the binary values in the bottom half. Similarly, the horizontal change can be calculated as the sum of the binary values in the left half subtracted from the right half. Scaling of color depth is obtained by storing color values as 16 bit color values rather than as 8 bit palettized values. If the display adaptor of the computer playback platform requires a different 16 bit color format, the conversion is performed using lookup tables. If the display adaptor of the computer playback platform requires an 8 bit index to a palette, it may be done with a set of precomputed lookup tables.

If the table of patterns 56 is limited in number, certain indices to the table may be designated as escape codes. Subsequent to a byte header 68, occurrence of an escape code 70 in the location of an index block is indicated. In some formats, the byte header could also have an 8 bit value which signifies an escape code. Such escape codes can indicate a number of things including, run lengths of elementary units which are unchanged or homogeneous. Subsequent to the escape code block a supplemental run length block 72 may be provided.

A decoding process in accordance with a first embodiment of the invention is set forth in FIG. 4A. The process is entered with execution of step 74 where it is determined if a frame requires decompression. If no frame is present, the process is exited by the NO branch. If a frame for decompression is present from the stream, step 76 is executed to retrieve header information for the frame. As noted above such header information may indicate the number of instruction steps required to decompress the frame and thereby allow a playback platform to determine what scaling factor will be used. Next, at step 78 a byte header for a group of contiguous elementary units is retrieved. Each byte header contains four dyads, each one of which corresponds to one of the elementary units of the contiguous group. In step 80 the next (or first) dyad is compared with the value reserved for homogeneous elementary units. If a homogeneous value is indicated, the YES branch is followed to step 82 where a color block is retrieved from the compressed data stream. The block may consist of 1, 2 or 3 bytes of data depending on the color format. In step 84, the color is scaled to the color depth for the decompression process. One crude way of scaling color would be to lop off the least significant bits of each color specification. In step 86, the scaled color information is written over the predetermined scaled size of the elementary unit to the output video frame. Next, at step 88 it is determined if the dyad last examined was the last dyad in a byte header. If not the process returns to step 80. If it was the last dyad, the process is advanced to step 90 where it is determined if the end of the frame has been reached. If the end of the frame has not been reached, the process is returned to step 78. If the end of the frame has been reached, the process is returned to step 74 to determine if another frame requires decompression.

If a dyad is set for a value other than that reserved for homogeneous units, the NO branch from step 80 would have been followed to step 83. At step 83 a dyad value is compared to the value reserved for predetermined patterns. If a predetermined pattern is indicated the YES branch is followed from step 83 to step 85, where an index byte is retrieved from the compressed video data. In step 87 the value of the index byte is compared with codes reserved for escape codes. If no escape code is present, step 89 is executed to use the value of the index byte as an address into a pattern table that contains the scaled patterns. Next, two subsequent color blocks are retrieved from the compressed stream and scaled to the color depth in use. In step 92 the colors are applied to the pattern retrieved from the pattern table and the result is stored to the video frame. Again processing advances to step 88, and if necessary step 90, to determine if a last dyad in a byte header has been checked and if an end of frame condition has been reached.

Returning to step 87, if an escape code was present the YES branch is followed to step 94, which is a call to an escape code processing routine, described below with reference to FIG. 4B. After processing, the process is returned to step 94 for execution of steps 88 and 90.

If a dyad value matches neither that for homogeneous elementary units or that for preset patterns, the NO branches are followed from steps 80 and 83 to step 96. At step 96 it is determined if the dyad value matches the value reserved that set for patterns. If it does, the pattern is retrieved from a pattern block in the data stream which is then scaled for the resolution of the output video frame. Processing then continues with execution of steps 91 and 92 as before. If the value in the dyad does not match that used to indicate patterns, the NO branch from step 96 is followed to step 100. This would occur if the dyad value indicated occurrence of an unchanged elementary unit. In step 100 the output pointer to the video frame is moved by the scale of elementary units in the output video frame. In other words, the scaled elementary unit is skipped so whatever values are in the video frame at the corresponding locations are left unchanged. Processing then is advanced to steps 88 and 90 to determine if the dyad was the last dyad in a byte header and if an end of frame condition has been encountered.

Also notable is that the sequence of steps 80, 83, 96 and 100 can be ordered from the most probable elementary unit type to the least probable elementary unit type to optimize the decompressor. For instance, FIG. 4A can alternatively check first if the dyad is set for unchanged or elementary unit when a high level of temporal redundancy is expected.

FIG. 4B illustrates the subprocess corresponding to the special processing of block 94 for escape codes. The process is entered at step 102 where it is determined if the code indicates a run of unchanged elementary units in the frame. If YES, the output pointer to the video buffer will be moved by the corresponding number of elementary units. Selected escape codes indicate that the following byte in the compressed stream continues the count of the number of elementary units in the run length for homogeneous or unchanged elementary units. The decompressor expects a single color value for the run length of homogeneous elementary units to occur after the number of elementary units in the run length has been determined. If the escape code value did not match those reserved for designating a run of unchanged elementary units, the NO branch from step 102 is followed to step 106 where it is determined if a run length of homogeneous elementary units has been designated. The YES branch from step 106 to step 108 results in retrieval of a color and in scaling of the color to a depth determined by the playback platform. In step 114 the color is written over the scaled size of the elementary unit to the video frame in the video buffer and processing is complet