WikiPatents - Community Patent Review
Create Free Account  |  License or Sell Your Patent  |  WikiPatents Marketplace  |  WikiPatents Blog
Username:  Password:  
    
Advanced Search
Encoder/decoder for multidimensional sound fields    
United States Patent5583962   
Link to this pagehttp://www.wikipatents.com/5583962.html
Inventor(s)Davis; Mark F. (Pacifica, CA); Todd; Craig C. (Mill Valley, CA); Dolby; Ray M. (San Francisco, CA)
AbstractTwo or more audio channels (i.e.--stereo, 4-channel surround, etc.) are each divided into frequency subbands to be coarsely quantized. An adaptive bit allocation scheme is then applied to subbands which are combined across channels such that equivalent subbands (i.e.--same frequency band) from each channel are grouped together to form a steered subband. The power from each subband is averaged across the channels to form a steered subband level. Bits are conserved by forming a vector in which each channel is represented by the difference between the steered subband level (average) and the actual subband level. Subbands which are not steered are represented by the coarse quantization and are considered unsteered channel subbands. Subbands which are steered are represented by vectors and are considered composite channel subbands. Vectors may be represented using a lookup table and the steered subband levels may be calculated using alternatives such as a peak level rather than an average.
   














 Title Information Submit all comments and votes
 
Patent Text Patent PDF Print Page Summary File History
Plain text PDF images Print Summary File History
Drawing from US Patent 5583962
Encoder/decoder for multidimensional sound fields - US Patent 5583962 Drawing
Encoder/decoder for multidimensional sound fields
Inventor     Davis; Mark F. (Pacifica, CA); Todd; Craig C. (Mill Valley, CA); Dolby; Ray M. (San Francisco, CA)
Owner/Assignee     Dolby Laboratories Licensing Corporation (San Francisco, CA)
Patent assignment
All assignments
Publication Date     December 10, 1996
Application Number     07/927,429
PAIR File History     Application Data   Transaction History
Image File Wrapper   Patent Term   Fees
Litigation
Filing Date     September 4, 1992
US Classification     704/229 704/200.1 704/205 704/230
Int'l Classification     G10L 009/18 G10L 007/00
Examiner     Knepper; David D.
Assistant Examiner    
Attorney/Law Firm     Gallagher; Thomas A. Lathrop; David N. ,
Address
Parent Case    
Priority Data    
USPTO Field of Search     395/2 395/2..19 395/2.38 395/2.39 381/29 381/30 381/31 381/32 381/33 381/34 381/35 381/36 381/37 381/38 381/39 381/40
Patent Tags     encoder/decoder multidimensional sound fields
   
Enter a comma (,) or semicolon (;) between multiple tag words/phrases.
Describe this patent:
 Amusing   
 Clever   
 Complex   
 Efficient   
 Historic   
 Important   
 Innovative   
 Interesting   
 Practical   
 Simple   
[no votes]
Patent WIKI

Share information and news about this patent, including information and news about the technology, inventors, company, ligation and licensing.

 References Submit all comments and votes
 
*references marked with an asterisk below are user-added references
 U.S. References
 
Add a new US reference:  
ReferenceRelevancyCommentsReferenceRelevancyComments
1124580



[0 after 0 votes]
1793772



[0 after 0 votes]
1850130



[0 after 0 votes]
2361490



[0 after 0 votes]
2714633



[0 after 0 votes]
3067287



[0 after 0 votes]
3067292



[0 after 0 votes]
3200207



[0 after 0 votes]
3272906



[0 after 0 votes]
3845572



[0 after 0 votes]
3907412



[0 after 0 votes]
5323396
Lokhoff
370/468
Jun,1994

[0 after 0 votes]
5109417
Fielder
704/205
Apr,1992

[0 after 0 votes]
4713776
Araseki
704/229
Dec,1987

[0 after 0 votes]
4455649
Esteban
370/522
Jun,1984

[0 after 0 votes]
4256389
Engebretson
352/11
Mar,1981

[0 after 0 votes]
4251688
Furner
381/18
Feb,1981

[0 after 0 votes]
3973839
Stumpf
352/5
Aug,1976

[0 after 0 votes]
3952157
Takahashi
381/22
Apr,1976

[0 after 0 votes]
3934086
Takahashi
381/22
Jan,1976

[0 after 0 votes]
3848092
Shamma
369/47.25
Nov,1974

[0 after 0 votes]
3836715
Ito
381/22
Sep,1974

[0 after 0 votes]
4236039
Cooper
381/23
Dec,1969

[0 after 0 votes]
 Foreign References
 Other References
 Market Review Submit all comments and votes
   
Market Size
Estimate the gross annual revenues of the relevant market sector:
> $10B
$5B - $10B
$2B - $5B
$500M - $2B
$100M - $500M
$10M - $100M
$1M - $10M
$500K - $1M
$100K - $500K
< $100K
[No votes]
$0
 
$0   $2.5B   $5B   $7.5B   $10B
Market Share
Estimate the percentage of the relevant market sector this invention will capture:
75% - 100%
50% - 74.99%
25% - 49.99%
10 - 24.99%
5 - 9.99%
2 - 4.99%
1 - 1.99%
< 1%
[No votes]
0.0%
 
0%   25%   50%   75%   100%
Reasonable Royalty
What percentage of gross sales should the inventor or assignee be paid?
75% - 100%
50% - 74.99%
25% - 49.99%
10 - 24.99%
5 - 9.99%
2 - 4.99%
1 - 1.99%
< 1%
[No votes]
0.0%
 
0%   25%   50%   75%   100%
Public's "Guesstimation" of Royalty Value
Market SizeN/A[No votes]
xMarket ShareN/A[No votes]
xReasonable RoyaltyN/A[No votes]

N/A

License Availablity
If you are NOT the owner or assignee, answer here:
Yes, license is available for purchase

No, license is not currently available



[No votes]
License Availablity
If you ARE the owner or assignee, answer here:
Yes, license is available for purchase

No, license is not currently available



[No votes]
Competitive Advantage
Does this invention have a significant competitive advantage over similar technologies?
Yes

No



[No votes]
Most helpful competitive advantage comment
[No comments]

Commercial Alternatives
Are there viable commercial alternatives for this invention?
Yes

No



[No votes]
Most helpful commercial alternative comment
[No comments]

 Technical Review Submit all comments and votes
 Claims Submit all comments and votes
 


What is claimed is:

1. In an encoder for encoding two or more audio channels, the combination comprising:

subband means for generating subband signals, each subband signal representing spectral energy in a respective subband of a respective one of said audio channels,

composite means for forming one or more composite signals, each composite signal formed by combining subband signals in a respective subband of two or more of said audio channels, and

formatting means for assembling an output signal including information representing said one or more composite signals in a form comprising a coarse measure of composite signal contents and a corresponding finer measure of composite signal contents, and including information conveying spectral levels of each subband signal combined into a respective composite signal.

2. The combination of claim 1 further comprising means for encoding said one or more composite signals and subband signals not combined into a respective composite signal, wherein said composite means forms said one or more composite signals only when the amount of information required to encode subband signals generated by said subband means exceeds a limit, and wherein said composite signals are formed only to the extent that the amount of information saved by encoding said composite signals rather than the subband signals combined into said composite signals is sufficient to allow encoding using amount of information which does not exceed said limit.

3. The combination of claim 1 wherein said more finely quantized values are quantized using bits allocated from a common pool of bits.

4. The combination of claim 1 further comprising means for adding, prior to quantization, a noise-like signal to said subband signals and said one or more composite signals.

5. The combination of claim 4 wherein the mean amplitude of said noise-like signal substantially matches the expected quantizing error of said subband signals and said one or more composite signals.

6. The combination of claim 1 wherein said information conveying spectral levels for a respective composite signal includes an indication of amplitude or power level of each constituent subband signal.

7. The combination of claim 6 wherein said indication of amplitude or power level is either

a plurality of elements, each element representing the difference in amplitude or power level between a respective constituent subband signal and a level of said composite signal, or

a plurality of elements, each element representing the ratio of amplitude or power level between a respective constituent subband signal and a level of said composite signal, or

a plurality of elements, each element representing the absolute value of the amplitude or power level of each respective constituent subband signal.

8. The combination of claim 1 wherein said plurality of audio channels represent a sound field and said information conveying spectral levels for a respective composite signal includes sound field localization information for constituent subband signals combined into the respective composite signal.

9. The combination of claim 1 wherein said composite means includes means for compensating out-of-phase signal components between subband signals from which said one or more composite signals are formed.

10. The combination of claim 1 wherein said one or more composite signals and the subband signals not represented by a respective composite signal are represented in a form comprising one or more scale factors each associated with one or more scaled values, the combination further comprising means for adjusting either or both dynamic range and gain by manipulating the values of said one or more scale factors.

11. The combination of claim 1 wherein said composite means includes selection means for selecting each subband from which a respective one of said one or more composite signals is formed.

12. The combination of claim 11 wherein said selection means selects one or more predetermined subbands.

13. The combination of claim 11 wherein said selection means selects one or more of the highest frequency subbands.

14. The combination of claim 11 wherein said selection means selects one or more subbands such that the resulting one or more composite signals are least likely to be subject to errors caused by out-of-phase signal cancellation.

15. The combination of claim 11 further comprising means for allocating a limited number of bits to said one or more composite signals and to subband signals not represented by said one or more composite signals, wherein said selection means selects subbands whose subband signals, if not represented by a respective composite signal, would not be allocated a respective minimum number of bits.

16. The combination of claim 15 wherein said respective minimum number of bits is the number of bits required to render quantizing noise in a respective subband substantially inaudible.

17. The combination of claim 11 wherein said selection means selects a subband starting with subbands in which coding inaccuracies are least objectionable.

18. The combination of claim 11 wherein said selection means selects the highest frequency subband, reiteratively selecting the highest frequency subband not already selected until sufficient bits are made available to allocate at least said respective minimum number of bits to subband signals not represented by a respective composite signal.

19. The combination of claim 11 wherein said means for selecting further selects subband signals according to in which audio channels the subband signals are located.

20. In a decoder for decoding an encoded signal generated by an encoder, said encoded signal including subband information representing respective subbands of a plurality of audio channels and including spectral level information, each subband constituting a portion of the spectrum of said audio channels, said subband information representing one or more composite signals and a plurality of subband signals, each of said composite signals formed in said encoder by combining subband signals of two or more of said plurality of audio channels in a respective subband, the combination in said decoder comprising:

deformatting means for obtaining said subband information and said spectral level information from said encoded signal, wherein said subband information is represented in a form comprising a coarse measure of composite signal contents and a corresponding finer measure of composite signal contents, and said spectral level information conveys spectral levels of each subband signal combined in a respective composite signal,

reconstruction means for obtaining said one or more composite signals and said plurality of subband signals in response to said subband information, and for deriving subband signals in response to said one or more composite signals and said spectral level information, and

synthesis means for generating a plurality of output signals in response to said derived subband signals and said plurality of subband signals obtained from said subband information.

21. The combination of claim 20 further comprising means for substituting a noise-like signal for the least significant bits of said more finely quantized values.

22. The combination of claim 20 wherein said more finely quantized values include a noise-like signal added prior to their quantization in said encoder, wherein said combination further comprises means for generating a noise-like signal substantially the same as that added prior to quantization, and means for subtracting said noise-like signal from said more finely quantized values after dequantization.

23. The combination of claim 20 wherein said more finely quantized values are dequantized using bits allocated from a common pool of bits.

24. The combination of claim 20 wherein said spectral level information includes an indication of amplitude or power level of each constituent subband signal combined into a respective composite signal.

25. The combination of claim 20 wherein said spectral level information includes an indication of sound field localization for constituent subband signals combined into a respective composite signal.

26. The combination of claim 20 further comprising means for inverse out-of-phase compensation of signal components between subband signals from which composite signals are formed.

27. The combination of claim 20 wherein said encoded signal comprises subband information represented in a form comprising one or more scale factors each associated with one or more scaled values, said combination further comprising means for adjusting either or both dynamic range and gain of said subband information by manipulating the values of said one or more scale factors.

28. A method for use in the encoding of two or more audio channels, comprising:

generating subband signals of said audio channels, each subband signal representing spectral energy in a respective subband of a respective one of said channels,

forming one or more composite signals, each composite signal formed by combining subband signals in a respective subband of two or more of said audio channels, and

assembling an output signal including information representing said one or more composite signals in a form comprising a coarse measure of composite signal contents and a corresponding finer measure of composite signal contents, and including information conveying spectral levels of said subband signal combined into a respective composite signal.

29. The method of claim 28 wherein said plurality of audio channels represent a sound field and said information conveying spectral levels for a respective composite signal includes sound field localization information for constituent subband signal combined into the respective composite signal.

30. The method of claim 28 wherein forming one or more composite signals includes compensating out-of-phase signal components between subband signal from which said one or more composite signals are formed.

31. The method of claim 28 wherein said one or more composite signals and the subband signals not represented by a respective composite signal are represented in a form comprising one or more scale factors each associated with one or more scaled values, the method further comprising adjusting either or both dynamic range and gain by manipulating the values of said one or more scale factors.

32. The method of claim 28 further comprising encoding said one or more composite signals and subband signals not combined into a respective composite signal, wherein said one or more composite signals are formed only when the amount of information required to encode subband signals exceeds a limit, and wherein said composite signals are formed only to the extent that the amount of information saved by encoding said composite signals rather than the subband signals combined into said composite signals is sufficient to allow encoding using amount of information which does not exceed said limit.

33. The method of claim 28 wherein said more finely quantized values are quantized using bits allocated from a common pool of bits.

34. The method of claim 28 wherein a noise-like signal is added to said subband signal and said one or more composite signals prior to quantization.

35. The method of claim 34 wherein the mean amplitude of said noise-like signal substantially matches the expected quantizing error of said subband signals and said one or more composite signals.

36. The method of claim 28 wherein said forming one or more composite signals includes selecting each subband from which a respective one of said one or more composite signals is formed.

37. The method of claim 36 wherein one or more predetermined subbands are selected.

38. The method of claim 36 wherein one or more of the highest frequency subbands are selected.

39. The method of claim 36 wherein one or more subbands are selected such that the resulting one or more composite signals are least likely to be subject to errors caused by out-of-phase signal cancellation.

40. The method of claim 36 further comprising allocating a limited number of bits to said one or more composite signals and to subband signals not represented by said one or more composite signals, wherein subbands are selected whose subband signals, when not represented by a respective composite signal, would not be allocated a respective minimum number of bits.

41. The method of claim 40 wherein said respective minimum number of bits is the number of bits required to render quantizing noise in a respective subband substantially inaudible.

42. The method of claim 36 wherein a subband is selected starting with subbands in which coding inaccuracies are least objectionable.

43. The method of claim 36 wherein the highest frequency subband is selected, reiteratively selecting the highest frequency subband not already selected until sufficient bits are made available to allocate at least said respective minimum number of bits to subband signals not represented by a respective composite signal.

44. The method of claim 36 wherein subband signals are selected according to in which audio channels the subband signals are located.

45. The method of claim 28 wherein said information conveying spectra levels for a respective composite signal includes an indication of the amplitude or power level of said constituent subband signals.

46. The method of claim 45 wherein said indication of amplitude or power level is either

a plurality of elements, each element representing the difference in amplitude or power level between a respective constituent subband signal and a level of said composite signal, or

a plurality of elements, each element representing the ratio of amplitude or power level between a respective constituent subband signal and a level of said composite signal, or

a plurality of elements, each element representing the absolute value of the amplitude or power level of each respective constituent subband signal.

47. A method for use in decoding an encoded signal generated by an encoder, said encoded signal including subband information representing respective subbands of a plurality of audio channels and including spectral level information, each subband constituting a portion of the spectrum of said audio channels, said subband information representing one or more composite signals and a plurality of subband signals, each of said composite signals formed in said encoder by combining subband signals of two or more of said plurality of audio channels in a respective subband, the method comprising:

obtaining said subband information and said spectral level information from said encoded signal, wherein said subband information is represented in a form comprising a coarse measure of composite signal contents and a corresponding finer measure of composite signal contents, and said spectral level information conveys spectral levels of each subband signal combined in a respective composite signal,

obtaining said one or more composite signals and said plurality of subband signals in response to said subband information, and for deriving subband signals in response to said one or more composite signals and said spectral level information, and

generating a plurality of output signals in response to said derived subband signals and said plurality of subband signals obtained from said subband information.

48. The method of claim 47 further comprises substituting a noise-like signal for the least significant bits of said more finely quantized values.

49. The method of claim 47 wherein said more finely quantized values include a noise-like signal added prior to quantization in said encoder, wherein said method further comprises generating a noise-like signal substantially the same as that added prior to quantization, and subtracting said noise-like signal from said more finely quantized values after dequantization.

50. The method of claim 47 wherein said more finely quantized values are dequantized using bits allocated from a common pool of bits.

51. The method of claim 47 wherein said spectral level information includes an indication of amplitude or power level of each constituent subband signal combined into a respective composite signal.

52. The method of claim 47 wherein said spectral level information includes an indication of sound field localization for constituent subband signals combined into a respective composite signal.

53. The method of claim 47 further comprising inverse out-of-phase compensating of signal components between subband signals from which composite signals are formed.

54. The method of claim 47 wherein said encoded signal comprises subband information represented in a form comprising one or more scale factors each associated with one or more scaled values, said method further comprising adjusting either or both dynamic range and gain of said subband information by manipulating the values of said one or more scale factors.

55. In an encoder for encoding two or more audio channels, the combination comprising:

subband means for generating subband signals, each subband signal representing spectral energy in a respective subband of a respective one of said audio channels,

composite means for forming one or more composite signals, each composite signal formed by combining subband signals in a respective subband of two or more of said audio channels, wherein said composite means includes means for compensating out-of-phase signal components between subband signals from which said one or more composite signals are formed, and

formatting means for assembling an output signal including information representing said one or more composite signals and subband signals not combined into a respective composite signal.

56. The combination of claim 55 further comprising means for encoding said one or more composite signals and subband signals not combined into a respective composite signal, wherein said composite means forms said one or more composite signals only when the amount of information required to encode subband signals generated by said subband means exceeds a limit, and wherein said composite signals are formed only to the extent that the amount of information saved by encoding said composite signals rather than the subband signals combined into said composite signals is sufficient to allow encoding using amount of information which does not exceed said limit.

57. The combination of claim 55 or 56 wherein said composite means includes selection means for selecting each subband from which a respective one of said one or more composite signals is formed.

58. The combination of claim 57 wherein said selection means selects a subband starting with subbands in which coding inaccuracies are least objectionable.

59. The combination of claim 57 wherein said means for selecting further selects subband signals according to which audio channels the subband signals are located.

60. The combination of claim 55 wherein said information conveying spectral levels for a respective composite signal includes an indication of amplitude or power level of each constituent subband signal.

61. The combination of claim 60 wherein said indication of amplitude or power level is either

a plurality of elements, each element representing the difference in amplitude or power level between a respective constituent subband signal and a level of said composite signal, or

a plurality of elements, each element representing the ratio of amplitude or power level between a respective constituent subband signal and a level of said composite signal, or

a plurality of elements, each element representing the absolute value of the amplitude or power level of each respective constituent subband signal.

62. The combination of claim 55, 56 or 60 wherein said one or more composite signals and the subband signals not represented by a respective composite signal are represented in a form comprising one or more scale factors each associated with one or more scaled values, the combination further comprising means for adjusting either or both dynamic range and gain by manipulating the values of said one or more scale factors.

63. A method for encoding two or more audio channels comprising:

generating subband signals, each subband signal representing spectral energy in a respective subband of a respective one of said audio channels,

forming one or more composite signals, each composite signal formed by combining subband signals in a respective subband of two or more of said audio channels, wherein said combining includes compensating out-of-phase signal components between subband signals from which said one or more composite signals are formed, and

assembling an output signal including information representing said one or more composite signals and subband signals not combined into a respective composite signal.

64. The method of claim 63 further comprising encoding said one or more composite signals and subband signals not combined into a respective composite signal, wherein said one or more composite signals are formed only when the amount of information required to encode said subband signals exceeds a limit, and wherein said composite signals are formed only to the extent that the amount of information saved by encoding said composite signals rather than the subband signals combined into said composite signals is sufficient to allow encoding using amount of information which does not exceed said limit.

65. The method of claim 63 wherein said information conveying spectral levels for a respective composite signal includes an indication of amplitude or power level of each constituent subband signal.

66. The method of claim 65 wherein said indication of amplitude or power level is either

a plurality of elements, each element representing the difference in amplitude or power level between a respective constituent subband signal and a level of said composite signal, or

a plurality of elements, each element representing the ratio of amplitude or power level between a respective constituent subband signal and a level of said composite signal, or

a plurality of elements, each element representing the absolute value of the amplitude or power level of each respective constituent subband signal.

67. The method of claim 63 or 64 wherein said combining includes selecting each subband from which a respective one of said one or more composite signals is formed.

68. The method of claim 67 wherein said selecting selects a subband starting with subbands in which coding inaccuracies are least objectionable.

69. The method of claim 67 wherein said selecting selects subband signals according to in which audio channels the subband signals are located.

70. The method of claim 63, 64 or 65 wherein said one or more composite signals and the subband signals not represented by a respective composite signal are represented in a form comprising one or more scale factors each associated with one or more scaled values, the method further comprising adjusting either or both dynamic range and gain by manipulating the values of said one or more scale factors.
 Description Submit all comments and votes
 


TECHNICAL FIELD

The invention relates in general to the recording, transmitting, and reproducing of multi-dimensional sound fields intended for human hearing. More particularly, the invention relates to the high-fidelity encoding and decoding of signals representing such sound fields, wherein the encoded signals may be carried by a composite audio-information signal and a steering control signal.

BACKGROUND ART

A. Goal of High-Fidelity Reproduction

A goal for high-fidelity reproduction of recorded or transmitted sounds is the presentation at another time or location a faithful representation of an "original" sound field. A sound field is defined as a collection of sound pressures which are a function of time and space. Thus, high-fidelity reproduction attempts to recreate the acoustic pressures which existed in the original sound field in a region about a listener.

Ideally, differences between the original sound field and the reproduced sound field are inaudible, or if not inaudible at least relatively unnoticeable to most listeners. Two general measures of fidelity are "sound quality" and "sound field localization."

Sound quality includes characteristics of reproduction such as frequency range (bandwidth), accuracy of relative amplitude levels throughout the frequency range (timbre), range of sound amplitude level (dynamic range), accuracy of harmonic amplitude and phase (distortion level), and amplitude level and frequency of spurious sounds and artifacts not present in the original sound (noise). Although most aspects of sound quality are susceptible to measurement by instruments, in practical systems characteristics of the human hearing system (psychoacoustic effects)