WikiPatents - Community Patent Review
Create Free Account  |  License or Sell Your Patent  |  WikiPatents Marketplace  |  WikiPatents Blog
Username:  Password:  
    
Advanced Search
Apparatus and method for audio data compression and expansion with reduced block floating overhead    
United States Patent5375189   
Link to this pagehttp://www.wikipatents.com/5375189.html
Inventor(s)Tsutsui; Kyoya (Kanagawa, JP)
AbstractA method for compressing a digital audio input signal to provide a recording signal in which the input signal is divided into frames comprising plural samples. Each frame is transformed into a block of spectral coefficients, which are divided into plural bands. For each band, block floating is applied to the spectral coefficients therein; a block floating coefficient is generated; the spectral coefficients are quantized with an adaptive number of bits to provide quantized spectral coefficients; and a word length is generated. Finally, there is added to the recording signal a block of data derived from the block of spectral coefficients and consisting of the quantized spectral coefficients, a main word length and a main block floating coefficient for each band, and a reserve word length at least for each lower frequency band. In a variation, zero bits are allocated to quantize the spectral coefficients in bands higher in frequency than a highest usable band. The block of data consists of the quantized spectral coefficients, at least one word length, and at least one block floating coefficient, for each band up to the highest usable band; and data indicating the number of bands up to the highest usable band.
   














 Title Information Submit all comments and votes
 
Patent Text Patent PDF Print Page Summary File History
Plain text PDF images Print Summary File History
Inventor     Tsutsui; Kyoya (Kanagawa, JP)
Owner/Assignee     Sony Corporation (JP)
Patent assignment
All assignments
Publication Date     December 20, 1994
Application Number     07/953,740
PAIR File History     Application Data   Transaction History
Image File Wrapper   Patent Term   Fees
Litigation
Filing Date     September 29, 1992
US Classification     704/229 704/205
Int'l Classification     G10L 005/00
Examiner     MacDonald; Allen R.
Assistant Examiner     Sartori; M.
Attorney/Law Firm     Limbach & Limbach
Address
Parent Case    
Priority Data     Sep 30, 1991[JP]3-276168 Sep 30, 1991[JP]3-276169
USPTO Field of Search     395/2.38 381/29
Patent Tags     audio data compression expansion with reduced block floating overhead
   
Enter a comma (,) or semicolon (;) between multiple tag words/phrases.
Describe this patent:
 Amusing   
 Clever   
 Complex   
 Efficient   
 Historic   
 Important   
 Innovative   
 Interesting   
 Practical   
 Simple   
[no votes]
Patent WIKI

Share information and news about this patent, including information and news about the technology, inventors, company, ligation and licensing.

 References Submit all comments and votes
 
*references marked with an asterisk below are user-added references
 U.S. References
 
Add a new US reference:  
ReferenceRelevancyCommentsReferenceRelevancyComments
5185800
Mahieux
704/500
Feb,1993

[0 after 0 votes]
5040217
Brandenburg
704/200.1
Aug,1991

[0 after 0 votes]
 Foreign References
 Other References
 Market Review Submit all comments and votes
   
Market Size
Estimate the gross annual revenues of the relevant market sector:
> $10B
$5B - $10B
$2B - $5B
$500M - $2B
$100M - $500M
$10M - $100M
$1M - $10M
$500K - $1M
$100K - $500K
< $100K
[No votes]
$0
 
$0   $2.5B   $5B   $7.5B   $10B
Market Share
Estimate the percentage of the relevant market sector this invention will capture:
75% - 100%
50% - 74.99%
25% - 49.99%
10 - 24.99%
5 - 9.99%
2 - 4.99%
1 - 1.99%
< 1%
[No votes]
0.0%
 
0%   25%   50%   75%   100%
Reasonable Royalty
What percentage of gross sales should the inventor or assignee be paid?
75% - 100%
50% - 74.99%
25% - 49.99%
10 - 24.99%
5 - 9.99%
2 - 4.99%
1 - 1.99%
< 1%
[No votes]
0.0%
 
0%   25%   50%   75%   100%
Public's "Guesstimation" of Royalty Value
Market SizeN/A[No votes]
xMarket ShareN/A[No votes]
xReasonable RoyaltyN/A[No votes]

N/A

License Availablity
If you are NOT the owner or assignee, answer here:
Yes, license is available for purchase

No, license is not currently available



[No votes]
License Availablity
If you ARE the owner or assignee, answer here:
Yes, license is available for purchase

No, license is not currently available



[No votes]
Competitive Advantage
Does this invention have a significant competitive advantage over similar technologies?
Yes

No



[No votes]
Most helpful competitive advantage comment
[No comments]

Commercial Alternatives
Are there viable commercial alternatives for this invention?
Yes

No



[No votes]
Most helpful commercial alternative comment
[No comments]

 Technical Review Submit all comments and votes
 Claims Submit all comments and votes
 


I claim:

1. A method for compressing a digital audio input signal to provide a recording signal, the method comprising the steps of:

dividing the input signal into frames comprising plural samples;

transforming each frame of plural samples into a block of spectral coefficients and dividing the block of spectral coefficients into plural bands, the plural bands including lower frequency bands, and a lowest frequency band;

applying block floating to the spectral coefficients in each band and generating a block floating coefficient for each band;

quantizing the spectral coefficients in each band with an adaptive number of bits to provide quantized spectral coefficients in each band, and generating a word length for each band;

adding a block of data derived from the block of spectral coefficients to the recording signal, the block of data derived from the block of spectral coefficients consisting of:

the quantized spectral coefficients,

a main word length for each band,

a main block floating coefficient for each band, and

a reserve word length at least for each of the lower frequency bands.

2. The method for compressing a digital audio input signal of claim 1, wherein, the step of quantizing the spectral coefficients in each band with an adaptive number of bits includes quantizing the spectral coefficients in each band using an additional number of bits, the additional number of bits being a number of bits equivalent to a sum of a first difference and a second difference,

the first difference being a difference between a number of bits required to provide a reserve word length for each band and a number of bits required to provide a reserve word length at least for each of the lower frequency bands, and

the second difference is a difference between a number of bits required to provide a reserve block floating coefficient for each band and a number of bits required to provide no reserve block floating coefficients.

3. The method for compressing a digital audio input signal of claim 1, wherein, in the step of adding a block of data derived from the block of spectral coefficients to the recording signal, the block of data derived from the block of spectral coefficients additionally consists of a reserve block floating coefficient for each of the lower frequency bands.

4. The method for compressing a digital audio input signal of claim 3, wherein

the step of quantizing the spectral coefficients in each band with an adaptive number of bits includes quantizing the spectral coefficients in each band using an additional number of bits, the additional number of bits being a number of bits equivalent to a sum of a first difference and a second difference,

the first difference being a difference between a number of bits required to provide a reserve word length for each band and a number of bits required to provide a reserve word length for each of the lower frequency bands, and

the second difference is a difference between a number of bits required to provide a reserve block floating coefficient for each band and a number of bits required to provide a reserve block floating coefficient for each of the lower frequency bands.

5. The method for compressing a digital audio input signal of claim 1, wherein the step of adding a block of data derived from the block of spectral coefficients to the recording signal includes the step of arranging the quantized spectral coefficients sequentially in the block of data derived from the block of spectral coefficients, beginning with the quantized spectral coefficients in the lowest frequency band.

6. The method for compressing a digital audio input signal of claim 5, wherein, the step of quantizing the spectral coefficients in each band with an adaptive number of bits includes quantizing the spectral coefficients in each band using an additional number of bits, the additional number of bits being a number of bits equivalent to a sum of a first difference and a second difference,

the first difference being a difference between a number of bits required to provide a reserve word length for each band and a number of bits required to provide a reserve word length for each of the lower frequency bands, and

the second difference is a difference between a number of bits required to provide a reserve block floating coefficient for each band and a number of bits required to provide no reserve block floating coefficients.

7. The method for compressing a digital audio input signal of claim 5, wherein, in the step of adding a block of data derived from the spectral coefficients to the recording signal, the block of data derived from the block of spectral coefficients consists of:

the quantized spectral coefficients,

a main word length for each band,

a main block floating coefficient for each band,

a reserve word length for each of a first number of the lower frequency bands, and

a reserve block floating coefficient for each of a second number of lower frequency bands, the second number of lower frequency bands being less than the first number of lower frequency bands.

8. The method for compressing a digital audio input signal of claim 7, wherein,

the step of quantizing the spectral coefficients in each band with an adaptive number of bits includes quantizing the spectral coefficients in each band using an additional number of bits, the additional number of bits being a number of bits equivalent to a sum of a first difference and a second difference,

the first difference being a difference between a number of bits required to provide a reserve word length for each band and a number of bits required to provide a reserve word length for each of the first number of lower frequency bands, and

the second difference is a difference between a number of bits required to provide a reserve block floating coefficient for each band and a number of bits required to provide a reserve block floating coefficient for each of the second number of lower frequency bands.

9. The method for compressing a digital audio input signal of claim 1, wherein, in the step of adding a block of data derived from the block of spectral coefficients to the recording signal, the block of data derived from the block of spectral coefficients consists of:

the quantized spectral coefficients,

a main word length for each band,

a main block floating coefficient for each band,

a reserve word length for each band, and

a reserve block floating coefficient for each of the lower frequency bands.

10. The method for compressing a digital audio input signal of claim 9, wherein

the step of quantizing the spectral coefficients in each band with an adaptive number of bits includes quantizing the spectral coefficients in each band using an additional number of bits, the additional number of bits being a number of bits equivalent to a difference between a number of bits required to provide a reserve block floating coefficient for each band and a number of bits required to provide a reserve block floating coefficient for each of the lower frequency bands.

11. A method for compressing a digital audio input signal to provide a recording signal, the method comprising the steps of:

dividing the input signal into frames comprising plural samples;

transforming each frame of plural samples into a block of spectral coefficients and dividing the block of spectral coefficients into plural bands, the plural bands including a lowest frequency band, and a highest frequency band;

generating block floating parameters;

applying block floating to the spectral coefficients in each band in response to a block floating parameter;

quantizing the spectral coefficients in each band with an adaptive number of bits to provide quantized spectral coefficients in response to a block floating parameter, zero bits being allocated to the spectral coefficients in bands higher in frequency than a highest usable band;

adding a block of data derived from the block of spectral coefficients to the recording signal, the block of data derived from the block of spectral coefficients consisting of:

the quantized spectral coefficients for each band up to the highest usable band, there being a number of bands up to the highest useable band,

block floating parameters for each band up to the highest useable band, and

data indicating the number of bands up to the highest usable band.

12. The method for compressing a digital audio input signal of claim 11, wherein, the step of quantizing the spectral coefficients in each band with an adaptive number of bits includes quantizing the spectral coefficients in each band using an additional number of bits, the additional number of bits being a number of bits equivalent to a difference between a number of bits required to provide block floating parameters for each band and a number of bits required to provide block floating parameters for each band up to the highest useable band.

13. The method for compressing a digital audio input signal of claim 11, wherein

the block floating parameters include a word length and a block floating coefficient,

the step of applying block floating includes the step of applying block floating in response to the block floating coefficient,

the step of quantizing the spectral coefficients includes the step of quantizing the spectral coefficients in response to the word length, and

in the step of adding a block of data derived from the block of spectral coefficients to the recording signal, the block floating parameters in block of data derived from the block of spectral coefficients consist of:

a main word length for each band up to the highest useable band,

a main block floating coefficient for each band up to the highest useable band, and

a reserve word length for each of the lower frequency bands.

14. The method for compressing a digital audio input signal of claim 13, wherein in the step of adding a block of data derived from the block of spectral coefficients to the recording signal, the block floating parameters in block of data derived from the block of spectral coefficients additionally consist of a reserve block floating coefficient for each of the lower frequency bands.

15. The method for compressing a digital audio input signal of claim 13, wherein the step of adding a block of data derived from the block of spectral coefficients to the recording signal includes the step of arranging the quantized spectral coefficients sequentially in the block of data derived from the block of spectral coefficients, beginning with the quantized spectral coefficients in the lowest frequency band.

16. The method for compressing a digital audio input signal of claim 15, wherein, in the step of adding a block of data derived from the block of spectral coefficients to the recording signal, the block floating parameters in block of data derived from the block of spectral coefficients consist of:

a main word length for each band up to the highest useable band,

a main block floating coefficient for each band up to the highest useable band,

a reserve word length for each of a first number of the lower frequency bands, and

a reserve block floating coefficient for each of second number of lower frequency bands, the second number of lower frequency bands being less than the first number of lower frequency bands.

17. An apparatus for compressing a digital audio input signal to provide a recording signal, the apparatus comprising:

a means for dividing the input signal into frames comprising plural samples;

a means for transforming each frame of plural samples into a block of spectral coefficients and for dividing the block of spectral coefficients into plural bands, the plural bands including lower frequency bands;

a means for applying block floating to the spectral coefficients in each band and for generating a block floating coefficient for each band;

a quantizing means for quantizing the spectral coefficients in each band with an adaptive number of bits to provide quantized spectral coefficients in each band, and for generating a word length for each band;

a means for adding a block of data derived from the block of spectral coefficients to the recording signal, the block of data derived from the block of spectral coefficients consisting of:

the quantized spectral coefficients,

a main word length for each band,

a main block floating coefficient for each band, and

a reserve word length at least for each of the lower frequency bands.

18. The apparatus for compressing a digital audio input signal of claim 17, wherein, the quantizing means is for quantizing the spectral coefficients in each band with an adaptive number of bits using an additional number of bits, the additional number of bits being a number of bits equivalent to a sum of a first difference and a second difference,

the first difference being a difference between a number of bits required to provide a reserve word length for each band and a number of bits required to provide a reserve word length for each of the lower frequency bands, and

the second difference is a difference between a number of bits required to provide a reserve block floating coefficient for each band and a number of bits required to provide no reserve block floating coefficients.

19. The apparatus for compressing a digital audio input signal of claim 17, wherein, the block of data derived from the block of spectral coefficients additionally consists of a reserve block floating coefficient for each of the lower frequency bands.

20. The apparatus for compressing a digital audio input signal of claim 19, wherein, the quantizing means is for quantizing the spectral coefficients in each band with an adaptive number of bits using an additional number of bits, the additional number of bits being a number of bits equivalent to a sum of a first difference and a second difference,

the first difference being a difference between a number of bits required to provide a reserve word length for each band and a number of bits required to provide a reserve word length for each of the lower frequency bands, and

the second difference is a difference between a number of bits required to provide a reserve block floating coefficient for each band and a number of bits required to provide a reserve block floating coefficient for each of the lower frequency bands.

21. The apparatus for compressing a digital audio input signal of claim 17, wherein the adding means includes a means for arranging the quantized spectral coefficients sequentially in the block of data derived from the block of spectral coefficients, beginning with the quantized spectral coefficients in the lowest frequency band.

22. The apparatus for compressing a digital audio input signal of claim 21, wherein,

the quantizing means is for quantizing the spectral coefficients in each band with an adaptive number of bits using an additional number of bits, the additional number of bits being a number of bits equivalent to a sum of a first difference and a second difference,

the first difference being a difference between a number of bits required to provide a reserve word length for each band and a number of bits required to provide a reserve word length for each of the lower frequency bands, and

the second difference is a difference between a number of bits required to provide a reserve block floating coefficient for each band and a number of bits required to provide no reserve block floating coefficients.

23. The apparatus for compressing a digital audio input signal of claim 21, wherein the block of data derived from the block of spectral coefficients consists of:

the quantized spectral coefficients,

a main word length for each band,

a main block floating coefficient for each band,

a reserve word length for each of a first number of the lower frequency bands, and

a reserve block floating coefficient for each of a second number of lower frequency bands, the second number of lower frequency bands being less than the first number of lower frequency bands.

24. The apparatus for compressing a digital audio input signal of claim 23, wherein, the quantizing means is for quantizing the spectral coefficients in each band with an adaptive number of bits using an additional number of bits, the additional number of bits being a number of bits equivalent to a sum of a first difference and a second difference,

the first difference being a difference between a number of bits required to provide a reserve word length for each band and a number of bits required to provide a reserve word length for each of the first number of lower frequency bands, and

the second difference is a difference between a number of bits required to provide a reserve block floating coefficient for each band and a number of bits required to provide a reserve block floating coefficient for each of the second number of lower frequency bands.

25. The apparatus for compressing a digital audio input signal of claim 17, wherein the block of data derived from the block of spectral coefficients consists of:

the quantized spectral coefficients,

a main word length for each band,

a main block floating coefficient for each band,

a reserve word length for each band, and

a reserve block floating coefficient for each of the lower frequency bands.

26. The apparatus for compressing a digital audio input signal of claim 25, wherein the quantizing means quantizes the spectral coefficients in each band with an adaptive number of bits using an additional number of bits, the additional number of bits being a number of bits equivalent to a difference between a number of bits required to provide a reserve block floating coefficient for each band and a number of bits required to provide a reserve block floating coefficient for each of the lower frequency bands.

27. An apparatus for compressing a digital audio input signal to provide a recording signal, the apparatus comprising:

a means for dividing the input signal into frames comprising plural samples;

a means for transforming each frame of plural samples into a block of spectral coefficients and for dividing the block of spectral coefficients into plural bands, the plural bands including lower frequency bands, and a lowest frequency band;

a means for generating block floating parameters;

a block floating means for applying block floating to the spectral coefficients in each band in response to a block floating parameter;

a quantizing means for quantizing the spectral coefficients in each band with an adaptive number of bits to provide quantized spectral coefficients in response to a block floating parameter, the quantizing means allocating zero bits to the spectral coefficients in bands higher in frequency than a highest usable band;

a means for adding a block of data derived from the block of spectral coefficients to the recording signal, the block of data derived from the block of spectral coefficients consisting of:

the quantized spectral coefficients for each band up to the highest usable band, there being a number of bands up to the highest useable band,

block floating parameters for each band up to the highest useable band, and

data indicating the number of bands up to the highest usable band.

28. The apparatus for compressing a digital audio input signal of claim 27, wherein the quantizing means is for quantizing the spectral coefficients in each band with an adaptive number of bits using an additional number of bits, the additional number of bits being a number of bits equivalent a difference between a number of bits required to provide block floating parameters for each band and a number of bits required to provide block floating parameters for each band up to the highest useable band.

29. The apparatus for compressing a digital audio input signal of claim 27, wherein

the block floating parameters include a word length and a block floating coefficient,

the block floating means applies block floating in response to the block floating coefficient,

the quantizing means quantizes the spectral coefficients in response to the word length, and

the block floating parameters in block of data derived from the block of spectral coefficients consist of:

a main word length for each band up to the highest useable band,

a main block floating coefficient for each band up to the highest useable band, and

a reserve word length for each of the lower frequency bands.

30. The apparatus for compressing a digital audio input signal of claim 29, wherein the block floating parameters in block of data derived from the block of spectral coefficients additionally consist of a reserve block floating coefficient for each of the lower frequency bands.

31. The apparatus for compressing a digital audio input signal of claim 29, wherein the adding means includes a means for arranging the quantized spectral coefficients sequentially in the block of data derived from the spectral coefficients, beginning with the quantized spectral coefficients in the lowest frequency band.

32. The apparatus of claim 31, wherein the block floating parameters in block of data derived from the block of spectral coefficients consist of:

a main word length for each band up to the highest useable band,

a main block floating coefficient for each band up to the highest useable band,

a reserve word length for each of a first number of lower frequency bands

a reserve block floating coefficient for each of a second number of the lower frequency bands, the second number of the lower frequency bands being less than the first number of the lower frequency bands.
 Description Submit all comments and votes
 


FIELD OF THE INVENTION

This invention relates to a method for processing audio signals by block floating and compression.

BACKGROUND OF THE INVENTION

As a high efficiency coding technique for compressing a digital audio signal, it is known to divide digital audio input signal in time into plural frames of a predetermined samples, to transform each frame into spectral coefficients in the frequency domain, and to divide the block of spectral coefficients resulting from transforming a frame into plural frequency bands. The spectral coefficients in each band are processed by block floating and are quantized by adaptive bit allocation.

Block floating is a normalization process applied to a block of data comprising plural words, such as a band of spectral coefficients. Block floating is applied by multiplying each word in the data block by a common value for the data block to improve quantization efficiency. In a typical block floating process, the maximum absolute value of the words in the data block is found and is used as a block floating coefficient common to all the words in the data block. Using the maximum absolute value in the band as the block floating coefficient prevents data overflow because the absolute value of no other word in the data block can be greater than the maximum absolute value. A simplified form of block floating determines the block floating coefficient using a shift quantity, which provides block floating in 6 dB steps.

The data compressor employing block floating generates, for each band, various block floating parameters BF that are transmitted or recorded on a recording medium together with the quantized spectral coefficients, or main information. The block floating parameters include a block floating coefficient SF and a word length WL, which provides information concerning the adaptive bit allocation, indicating the difference between the value of the block floating coefficient SF and the allowable noise level which is determined for each band, taking account of masking.

In the following description, reference will be made to recording on or reproducing from a recording medium. When such references are made, they are to be understood as additionally encompassing transmitting to and receiving from a transmission medium.

Masking is a psychoacoustic phenomenon in which a sound is rendered inaudible, or "masked," by other sounds occurring simultaneously with, or slightly earlier than, or later than, the sound. Masking effects may be classed into time domain masking effects, that is, masking by sounds occurring earlier or later than the masked sound, and concurrent masking effects, which is masking is by simultaneously-occurring sounds having a frequency different from the frequency of the masked sound.

Masking enables a sound to render inaudible any noise within its time or frequency masking range. This means that in the presence of a signal that, when reproduced, produces a sound, a digital encoding system that produces quantizing noise may have quantizing noise levels that are high compared with the noise level that is allowable in the absence of the signal, provided that the quantizing noise lies within the masking range of the sound produced by the signal. Since relatively high levels of quantizing noise are allowable if masked by the sound resulting from the signal, the number of bits required to quantize the signal representing the sound, or parts of the signal, may be significantly reduced.

A critical band is a frequency band that takes advantage of the masking characteristics of the human sense of hearing. A critical band is the band of noise that can be masked by a pure sound that has the same intensity as the noise and has a frequency in the vicinity of the frequency of the noise. The width of the critical band increases with increasing frequency of the pure sound. The entire audio frequency range of 0 Hz to 20 kHz can be divided into, for example, 25 critical bands.

If, for some reason, data is destroyed or lost between the output of the data compressor and the input of a complementary data expander, it is possible to reduce the audible effects of the missing data in the data expander by reducing the signal component in the frequency band corresponding to the missing data to zero.

However, since the block floating parameters BF is related to the spectral coefficients in each band, the effect on sound quality of losing a block floating parameter BF is more noticeable than loss of the main information (i.e., the quantized spectrum signals).

To mitigate the effects of possibly losing a block floating parameter BF, it is known to include the block floating parameters BF in the compressed signal provided by the data compressor twice, so that they are recorded twice on the recording medium. This provides a redundant set of block floating parameters in case a block floating parameter is lost or erroneous.

In the following description, reference will be made to lost data, such as block floating parameters and quantized spectral coefficients on the understanding that this term also covers erroneous or corrupted data, such as block floating parameters and quantized spectral coefficients.

As shown in FIG. 15, the quantized spectral coefficients (main information) are recorded together with the block floating coefficients SF and word length WL as the above-mentioned block floating parameters BF, and are recorded as the block floating coefficients SF1 and the word lengths WL1, respectively, and are recorded a second time as the block floating coefficients SF2 and the word lengths WL2, respectively.

With the above method, since all the block floating parameters BF need to be recorded twice to deal with normal occurrences of data loss, the number of bits allocated to the main information must be reduced to accommodate the additional block floating parameters. Consequently, in systems having a high compression ratio or a low bit rate, a satisfactory sound quality cannot be attained.

In a conventional data compressor, the number of block floating parameters BF recorded per frame of the input signal is usually fixed. FIG. 16 shows how the data corresponding to each frame of the input signal is arranged in the recording signal produced by a conventional data compressor. In the example shown, the values of the block floating parameters BF for the bands to which no bits are allocated must still to be recorded, which reduces the number of bits available for coding the spectral coefficients in the main information. This makes it difficult to achieve a satisfactory sound quality when the compressed signal from the compressor is subject to complementary expansion, and is reproduced. This is especially so in systems having a high compression ratio or a low bit rate.

The system shown in FIG. 17 is also known. In this, no block floating coefficients SF are recorded for those bands to which no bits are actually allocated, i.e., for bands having a word length WL=0. Correspondingly more bits are available for allocation to coding the spectral coefficients. In the example of FIG. 17, the number of recorded block floating coefficients SF is reduced by four, which is the number of bands to which no bits are allocated. In the arrangement shown in FIG. 17, it is still necessary to record the word length WL for all bands, and to determine whether the word length WL of each band is not zero when reading the block floating coefficients SF in the expander.

It is also necessary for the data compressor to calculate the number of bits necessary for quantizing the spectral coefficients in each band by a process that determines masking. The number of bits thus calculated is compared with the total number of bits allocated to the frame, after which, the bit allocation to each band may be adjusted. However, if the change in bit allocation changes whether the block floating coefficient SF of a block is recorded or not, the total number of bits that is allocated to the main information is also changed, which complicates the process of adjusting the bit allocation.

OBJECTS AND SUMMARY OF THE INVENTION

In view of the above-described state of the art, it is an object of the present invention to provide a method for processing an audio signal in which more bits can be allocated to quantized spectral coefficients and which is resistant to data loss.

In view of the above-described state of the art, it is another object of the present invention to provide a method for processing an audio signal wherein the bit allocation may be easily adjusted and wherein the sound quality is not impaired by adjusting the bit allocation.

For accomplishing the above object, the present invention provides a method for compressing a digital audio input signal to provide a recording signal. According to the method, the input signal is divided into frames comprising plural samples. Each frame of plural samples is transformed into a block of spectral coefficients. The block of spectral coefficients is divided into plural bands that include lower frequency bands, and a lowest frequency band. Block floating is applied to the spectral coefficients in each band and a block floating coefficient is generated for each band. The spectral coefficients in each band are quantized with an adaptive number of bits to provide quantized spectral coefficients in each band, and a word length is generated for each band. Finally, a block of data derived from the block of spectral coefficients is added to the recording signal. The block of data derived from the block of spectral coefficients consists of the quantized spectral coefficients, a main word length for each band, a main block floating coefficient for each band, and a reserve word length at least for each of the lower frequency bands.

In a first variation, there is a reserve word length for each band in the block of data derived from the block of spectral coefficients.

In a second variation, the block of data derived from the spectral coefficients that is added to the output signal additionally consists of a reserve block floating coefficient for each of the lower frequency bands.

In a third variation, the method additionally comprises arranging the quantized spectral coefficients sequentially in the block of data derived from the block of spectral coefficients, beginning with the quantized spectral coefficients in the lowest frequency band.

With the arrangement just described, when the block of data derived from the spectral coefficients is added to the recording signal, the block of data derived from the block of spectral coefficients consists of the quantized spectral coefficients, a main word length for each band, a main block floating coefficient for each band, a reserve word length for each of a first number of the lower frequency bands, and a reserve block floating coefficient for each of a second number of lower frequency bands. The second number of lower frequency bands is less than the first number of lower frequency bands.

With the method for processing audio signals according to the present invention, the block floating coefficients for higher frequency bands, which are less critical to the human sense of hearing, are not included in the recording signal twice, which increases the number of bits available for quantizing the spectral coefficients.

Also, only the word lengths for lower frequency bands are recorded twice, so that bits that otherwise would be allocated to the floating coefficients may be allocated to quantizing the spectral coefficients. Further, by arranging the quantized spectral coefficients in the recording signal beginning with lower frequency spectral coefficients, the sound quality impairment resulting from the loss of higher frequency spectral coefficients is reduced because of masking by the lower frequency spectral coefficients.

For further accomplishing the above objects, the present invention additionally provides a method for compressing a digital audio input signal to provide a recording signal. According to the method, the input signal is divided into frames comprising plural samples. Each frame of plural samples is transformed into a block of spectral coefficients. The block of spectral coefficients is divided into plural bands. The plural bands include a lowest frequency band, and a highest frequency band. Block floating parameters are generated. Block floating is applied to the spectral coefficients in each band in response to a block floating parameter. The spectral coefficients in each band are quantized with an adaptive number of bits to provide quantized spectral coefficients in response to a block floating parameter. Zero bits are allocated to the spectral coefficients in bands higher in frequency than a highest usable band. A block of data derived from the block of spectral coefficients is added to the recording signal. The block of data derived from the block of spectral coefficients consists of the quantized spectral coefficients for each band up to the highest usable band, block floating parameters for each band up to the highest useable band, and data indicating the number of bands up to the highest usable band.

If the higher frequency spectral coefficients are not included in the recording signal because these signals make no perceptible contribution to the reproduced audio signal, the block floating parameters for the high frequency bands of the frame, i.e., the block floating coefficient and the word length, are not included in the recording signal. The bits thus saved are allocated to the main information at lower frequencies, which is crucial to the human sense of hearing.

When block floating parameters are omitted from the recording signal, data is included in the recording signal indicating the number of block floating parameters in the recording signal. The number of block floating parameters corresponds to the number of bands up to the highest useable band.

According to the audio data compression method of the present invention, the block parameters for each band are included in the recording signal in each frame for those bands that need such parameters, i.e., for the bands up to the maximum useable band. Block floating parameters for the bands for which the block floating parameters are unnecessary, i.e., for bands above the maximum useable band are omitted from the recording signal, and the bits thus saved are allocated for the coding the lower frequency spectral coefficients in the main information.

The invention also encompasses an apparatus to which the methods of the invention are applied.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a diagrammatic view for illustrating the recording signal produced by the method according to a first aspect of the present invention.

FIG. 2 is a diagrammatic view for illustrating the recording signal produced by the method according to second and third aspects of the present invention.

FIG. 3 is a block circuit diagram for illustrating an arrangement of a data compressor to which the methods according to the present invention are applied.

FIG. 4 is a block circuit diagram showing a practical arrangement of the orthogonal transform circuit of the data compressor.

FIG. 5 is a block circuit diagram showing an arrangement of a complementary data expander.

FIG. 6 is a block circuit diagram showing a practical arrangement of the orthogonal transform circuit of the expander.

FIG. 7 is a flow chart for illustrating the block floating coefficient reading sequence in the block floating coefficient reading circuit in an expander according to the first to sixth aspects of the invention.

FIG. 8 is a flow chart for illustrating the word length reading sequence in the word length reading circuit in an expander according to the first aspect of the invention.

FIG. 9 is a flow chart for illustrating the word length reading sequence in the word length reading circuit in an expander according to the second and sixth aspects of the invention.

FIG. 10 is a flow chart for illustrating the quantized spectral coefficient reading sequence in the quantized spectral coefficient reading circuit in an expander according to the first through sixth aspects of the invention.

FIG. 11 is a diagrammatic view for illustrating data recording according to a fifth aspect of the present invention, in which higher frequency spectral coefficients are not recorded.

FIG. 12 is a diagrammatic view for illustrating data recording according to a sixth aspect of the present invention, in which higher frequency spectral coefficients are recorded.

FIG. 13 is a flow chart for illustrating the processing by the data compressor.

FIG. 14 is a flow chart for illustrating the processing by the data expander.

FIG. 15 is a diagrammatic view for illustrating the recording signal produced by a conventional data compressor.

FIG. 16 is a diagrammatic view for illustrating the recording signal produced by a conventional system in which the number of block floating parameters is constant.

FIG. 17 is a diagrammatic view for illustrating the recording signal produced by a conventional system in which the number of block floating coefficients is variable.

DETAILED DESCRIPTION OF THE INVENTION

The preferred embodiments of the present invention will now be described in detail with reference to the drawings.

In a first aspect, the present invention provides a method for compressing a digital audio input signal comprising transforming a frame of the input audio signal TS in the time domain into plural spectral coefficients SP in the frequency domain. The spectral coefficients are divided into plural frequency bands, block floating is applied to each band, and the spectral coefficients in each band are quantized by adaptive bit allocation. The quantized spectral coefficients QSP, the block floating coefficient SF1, and the word length WL1, for all the bands are recorded once. In addition, the block floating coefficients SF2 for the lower frequency bands and the word lengths WL2 for all the bands are recorded a second time.

FIG. 1 shows how one block of data in the frequency domain resulting from transforming one frame of the input signal is recorded. The main information shown in FIG. 1 is all the quantized spectral coefficients in the block.

Of the block floating parameters BF, the word lengths WL1 and WL2 for all the bands in the block are recorded twice, the block floating coefficients SF1 for all the bands in the block are recorded once, and the block floating coefficients SF2 for only the lower frequency bands in the block are recorded a second time.

The advantage of recording the block floating coefficients SF2 for only the lower frequency bands a second time will now be explained. Of the block floating parameters BF, the word length WL represents the difference between the block floating coefficient SF and the allowable noise level determined for each band, taking account of masking. The word length indicates information concerning adaptive bit allocation for quantizing the spectral coefficients SP, i.e., the number of bits used to quantize the spectral coefficients in the recorded signal. As a result, if only one of the word lengths WL for the block is lost, none of the quantized spectral coefficients in the block following the quantized spectral coefficients QSP corresponding to the lost word length WL can be read. On the other hand, if one of the block floating coefficients SF for a band of spectral coefficients is lost, only the spectral coefficients SP in the band corresponding to the erroneous or lost block floating coefficient SF cannot be restored. Consequently, the impairment of sound quality resulting from a lost block floating coefficient is less than that resulting from the loss of a word length.

Further, as far as a human listener is concerned, lower frequency audio signals represented by the lower frequency spectral coefficients SP, effectively mask higher frequency signals represented by higher frequency spectral coefficients SP. Consequently, the loss of higher frequency spectral coefficients SP has a minimal effect on sound quality.

Because of this, in the audio signal processing method according to the first aspect of the present invention, impairment of the sound quality of the expanded, decoded and reproduced signal due to lost data can be minimized if, of the block floating parameters BF, the block floating coefficients of only the lower frequency bands are recorded twice. An overall improvement of sound quality can be achieved if the bits thus saved are allocated for quantizing the spectral coefficients more accurately.

In the audio signal processing method according to the second aspect of the present invention, the quantized spectral coefficients QSP, are sequentially recorded beginning with the quantized spectral coefficients in the lowest frequency band, as shown by the arrows in FIG. 2. Of the block floating parameters BF, the word length WL1 and block floating coefficient SF1 for each band are recorded once. Additionally, the word lengths WL2 for the lower frequency bands only are recorded a second time. The block floating coefficients are not recorded a second time. Only the block floating coefficient SF1 is recorded.

The third aspect of the present invention is a variation on the second aspect. In the third aspect, the block floating coefficients SF2 for the lower frequency bands are recorded a second time, as shown in FIG. 2. In addition, the number of bands for which the word length WL is recorded twice is set to be larger than the number of bands for which the block floating coefficient SF is recorded twice.

The advantages of the methods according to the second and third aspects of the invention will now be explained. As in the first aspect, as far as a human listener is concerned, lower frequency audio signals represented by the lower frequency spectral coefficients SP, effectively mask higher frequency signals represented by higher frequency spectral coefficients SP. Consequently if higher frequency spectral coefficients SP are lost, the impairment of sound quality is small.

Of the block floating parameters BF, the word length WL represents the difference between the block floating coefficient SF and the allowable noise level determined for each band taking account of masking. The word length indicates information concerning adaptive bit allocation for quantizing the spectral coefficients SP, i.e., the number of bits used to quantize the spectral coefficients in the recorded signal. As a result, if only one of the word lengths WL for the block is lost, none of the quantized spectral coefficients in the recorded signal following the spectral coefficients SP corresponding to the lost word length WL can be read. However, if the quantized spectral coefficients QSP are recorded sequentially, beginning with the lowest frequency band, as in the second aspect of the invention, the quantized spectral coefficients QSP can be correctly read in the decoder up to the frequency band corresponding to the lost word length WL. The correctly expanded lower frequency spectral coefficients mask the defects in the expanded signal resulting from the unexpanded higher frequency spectral coefficients.

If a block floating coefficient SF is destroyed, only the spectral coefficients in the band corresponding to the lost block floating coefficient can:not be restored, so that impairment of sound quality is less than if the word length WL of the band is destroyed.

It is seen from above that, in the second embodiment, in which the quantized spectral coefficients QSP are sequentially recorded, beginning with the lowest frequency band, with reg