WikiPatents - Community Patent Review
Create Free Account  |  License or Sell Your Patent  |  WikiPatents Marketplace  |  WikiPatents Blog
Username:  Password:  
    
Advanced Search
Machine readable information recording medium having audio gap information stored therein for indicating a start time and duration of an audio presentation discontinuous period    
United States Patent6173113   
Link to this pagehttp://www.wikipatents.com/6173113.html
Inventor(s)Okada; Tomoyuki (Katano, JP); Mori; Yoshihiro (Hirakata, JP); Tsuga; Kazuhiro (Takarazuka, JP); Hamasaka; Hiroshi (Hirakata, JP); Ishihara; Hideshi (Katano, JP); Nakamura; Kazuhiko (Hirakata, JP); Hasebe; Takumi (Yawata, JP)
AbstractAn optical disk for recording more than one system stream containing audio data and video data. The audio data and video data of the plural system streams are recorded to the optical disk, and are interleaved such that the difference between the input start times of the video data and audio data to the video buffer in the video decoder and the audio buffer in the audio decoder is less than the reproduction time of the number of audio frames that can be stored in the audio buffer plus one audio frame.
   














 Title Information Submit all comments and votes
 
Patent Text Patent PDF Print Page Summary File History
Plain text PDF images Print Summary File History
Drawing from US Patent 6173113
Machine readable information recording medium having audio gap information

     stored therein for indicating a start time and duration of an audio

     presentation discontinuous period - US Patent 6173113 Drawing
Machine readable information recording medium having audio gap information stored therein for indicating a start time and duration of an audio presentation discontinuous period
Inventor     Okada; Tomoyuki (Katano, JP); Mori; Yoshihiro (Hirakata, JP); Tsuga; Kazuhiro (Takarazuka, JP); Hamasaka; Hiroshi (Hirakata, JP); Ishihara; Hideshi (Katano, JP); Nakamura; Kazuhiko (Hirakata, JP); Hasebe; Takumi (Yawata, JP)
Owner/Assignee     Matsushita Electric Industrial Co., Ltd. (Osaka, JP)
Patent assignment
All assignments
Publication Date     January 9, 2001
Application Number     09/263,208
PAIR File History     Application Data   Transaction History
Image File Wrapper   Patent Term   Fees
Litigation
Filing Date     March 5, 1999
US Classification    
Int'l Classification    
Examiner     Nguyen; Huy
Assistant Examiner    
Attorney/Law Firm     Wenderoth, Lind, & Ponack, L.L.P.
Address
Parent Case     This is a Rule 1.53(b) Divisional of application Ser. No. 08/724,232, filed Sep. 27, 1996 U.S. Pat. No. 5,905,845.
Priority Data     Sep 29, 1995 [JP] 7-252735 Feb 28, 1996 [JP] 8-41581
USPTO Field of Search    
Patent Tags     readable information recording medium audio gap information stored indicating start time duration audio presentation discontinuous period
   
Enter a comma (,) or semicolon (;) between multiple tag words/phrases.
Describe this patent:
 Amusing   
 Clever   
 Complex   
 Efficient   
 Historic   
 Important   
 Innovative   
 Interesting   
 Practical   
 Simple   
[no votes]
Patent WIKI

Share information and news about this patent, including information and news about the technology, inventors, company, ligation and licensing.

 References Submit all comments and votes
 
*references marked with an asterisk below are user-added references
 U.S. References
 
Add a new US reference:  
ReferenceRelevancyCommentsReferenceRelevancyComments
 Foreign References
 Other References
 Market Review Submit all comments and votes
   
Market Size
Estimate the gross annual revenues of the relevant market sector:
> $10B
$5B - $10B
$2B - $5B
$500M - $2B
$100M - $500M
$10M - $100M
$1M - $10M
$500K - $1M
$100K - $500K
< $100K
[No votes]
$0
 
$0   $2.5B   $5B   $7.5B   $10B
Market Share
Estimate the percentage of the relevant market sector this invention will capture:
75% - 100%
50% - 74.99%
25% - 49.99%
10 - 24.99%
5 - 9.99%
2 - 4.99%
1 - 1.99%
< 1%
[No votes]
0.0%
 
0%   25%   50%   75%   100%
Reasonable Royalty
What percentage of gross sales should the inventor or assignee be paid?
75% - 100%
50% - 74.99%
25% - 49.99%
10 - 24.99%
5 - 9.99%
2 - 4.99%
1 - 1.99%
< 1%
[No votes]
0.0%
 
0%   25%   50%   75%   100%
Public's "Guesstimation" of Royalty Value
Market SizeN/A[No votes]
xMarket ShareN/A[No votes]
xReasonable RoyaltyN/A[No votes]

N/A

License Availablity
If you are NOT the owner or assignee, answer here:
Yes, license is available for purchase

No, license is not currently available



[No votes]
License Availablity
If you ARE the owner or assignee, answer here:
Yes, license is available for purchase

No, license is not currently available



[No votes]
Competitive Advantage
Does this invention have a significant competitive advantage over similar technologies?
Yes

No



[No votes]
Most helpful competitive advantage comment
[No comments]

Commercial Alternatives
Are there viable commercial alternatives for this invention?
Yes

No



[No votes]
Most helpful commercial alternative comment
[No comments]

 Technical Review Submit all comments and votes
 Claims Submit all comments and votes
 


What is claimed is:

1. A machine readable information recording medium comprising:

a first region having stored therein audio data of a plurality of audio frames in a form of at least one audio stream that is multiplexed with at least one video stream, wherein each of the audio frames have a predetermined duration and wherein the audio data is capable of being decoded and presented;

a second region having stored therein management information used for managing a decoding and presentation of the audio data;

wherein the presentation of the audio data is discontinuous between at least two of the audio frames for a period less than a duration of one audio frame; and

wherein at least one of said first and second regions having stored therein audio gap information which indicates a start time of the audio presentation discontinuous period and a duration of the audio presentation discontinuous period.

2. A machine readable information recording medium as claimed in claim 1, wherein said audio gap information stored in said at least one of said first and second regions is used to suspend the decoding of the audio data during the audio presentation discontinuous period.

3. A machine readable information recording medium as claimed in claim 1, wherein the at least one audio stream contains a plurality of data blocks with time stamps each indicating a presentation time of a corresponding data block, each of said plurality of data blocks containing at least one of the at least two audio frames between which the presentation of the audio data is discontinuous; and

wherein said audio gap information indicates a time stamp of a data block including the audio presentation discontinuous period and the duration of the audio presentation discontinuous period.

4. A recording method for recording audio data of a plurality of audio frames each having a predetermined duration onto a machine readable information recording medium including a first region and a second region, said method comprising:

recording the audio data capable of being decoded and presented in the first region in form of at least one audio stream that is multiplexed with at least one video stream in such a manner that presentation of the audio data is discontinuous between at least two of the audio frames for a period less than the duration of one audio frame;

recording management information used for managing a decoding and the presentation of the audio data in the second region; and

recording audio gap information which indicates a start time of the audio presentation discontinuous period and a duration of the audio presentation discontinuous period in at least one of the first region and the second region.

5. A recording method as claimed in claim 4, wherein said audio gap information stored in said at least one of said first and second regions is used to suspend the decoding of the audio data during the audio presentation discontinuous period.

6. A recording method as claimed in claim 4, wherein the audio data is recorded in such a manner that the at least one audio stream contains a plurality of data blocks with time stamps each indicating a presentation time of a corresponding data block, each of said plurality of data blocks containing at least one of the at least two audio frames between which the presentation of the audio data is discontinuous; and

wherein said audio gap information is recorded to indicate a time stamp of a data block including the audio presentation discontinuous period and the duration of the audio presentation discontinuous period.
 Description Submit all comments and votes
 


BACKGROUND OF INVENTION

1. Field of the Invention

The present invention relates to a method, disk and apparatus for system encoding bitstreams to connect seamlessly thereof and, more particularly, bitstreams for use in an authoring system for variously processing a data bitstream comprising video data, audio data, and sub-picture data constituting each of plural program titles containing related video data, audio data, and sub-picture data content to generate a bitstream from which a new title containing the content desired by the user can be reproduced, and efficiently recording and reproducing the generated bitstream using a particular recording medium.

2. Description of the Prior Art

Authoring systems used to produce program titles comprising related video data, audio data, and sub-picture data by digitally processing, for example, multimedia data comprising video, audio, and sub-picture data recorded to laser disk or video CD formats are currently available.

Systems using Video-CDs in particular are able to record video data to a CD format disk, which was originally designed with an approximately 600 MB recording capacity for storing digital audio data only, by using such high efficiency video compression techniques as MPEG. As a result of the increased effective recording capacity achieved using data compression techniques, karaoke titles and other conventional laser disk applications are gradually being transferred to the video CD format.

Users today expect both sophisticated title content and high reproduction quality. To meet these expectations, each title must be composed from bitstreams with an increasingly deep hierarchical structure. The data size of multimedia titles written with bitstreams having such deep hierarchical structures, however, is ten or more times greater than the data size of less complex titles. The need to edit small image (title) details also makes it necessary to process and control the bitstream using low order hierarchical data units.

It is therefore necessary to develop and prove a bitstream structure and an advanced digital processing method including both recording and reproduction capabilities whereby a large volume, multiple level hierarchical digital bitstream can be efficiently controlled at each level of the hierarchy. Also needed are an apparatus for executing this digital processing method, and a recording media to which the bitstream digitally processed by said apparatus can be efficiently recorded for storage and from which said recorded information can be quickly reproduced.

Means of increasing the storage capacity of conventional optical disks have been widely researched to address the recording medium aspect of this problem. One way to increase the storage capacity of the optical disk is to reduce the spot diameter D of the optical (laser) beam. If the wavelength of the laser beam is l and the aperture of the objective lens is NA, then the spot diameter D is proportional to l/NA, and the storage capacity can be efficiently improved by decreasing l and increasing NA.

As described, for example, in U.S. Pat. No. 5,235,581, however, coma caused by a relative tilt between the disk surface and the optical axis of the laser beam (hereafter "tilt") increases when a large aperture (high NA) lens is used. To prevent tilt-induced coma, the transparent substrate must be made very thin. The problem is that the mechanical strength of the disk is low when the transparent substrate is very thin.

MPEG1, the conventional method of recording and reproducing video, audio, and graphic signal data, has also been replaced by the more robust MPEG2 method, which can transfer large data volumes at a higher rate. It should be noted that the compression method and data format of the MPEG2 standard differ somewhat from those of MPEG1. The specific content of and differences between MPEG1 and MPEG2 are described in detail in the ISO-11172 and ISO-13818 MPEG standards, and further description thereof is omitted below.

Note, however, that while the structure of the encoded video stream is defined in the MPEG2 specification, the hierarchical structure of the system stream and the method of processing lower hierarchical levels are not defined.

As described above, it is therefore not possible in a conventional authoring system to process a large data stream containing sufficient information to satisfy many different user requirements. Moreover, even if such a processing method were available, the processed data recorded thereto cannot be repeatedly used to reduce data redundancy because there is no large capacity recording medium currently available that can efficiently record and reproduce high volume bitstreams such as described above.

More specifically, particular significant hardware and software requirements must be satisfied in order to process a bitstream using a data unit smaller than the title. These specific hardware requirements include significantly increasing the storage capacity of the recording medium and increasing the speed of digital processing; software requirements include inventing an advanced digital processing method including a sophisticated data structure.

Therefore, the object of the present invention is to provide an effective authoring system for controlling a multimedia data bitstream with advanced hardware and software requirements using a data unit smaller than the title to better address advanced user requirements.

To share data between plural titles and thereby efficiently utilize optical disk capacity, multi-scene control whereby scene data common to plural titles and the desired scenes on the same time-base from within multi-scene periods containing plural scenes unique to particular reproduction paths can be freely selected and reproduced is desirable.

However, when plural scenes unique to a reproduction path within the multi-scene period are arranged on the same time-base, the scene data must be contiguous. Unselected multi-scene data is therefore unavoidably inserted between the selected common scene data and the selected multi-scene data. The problem this creates when reproducing multi-scene data is that reproduction is interrupted by this unselected scene data.

When one of the multiple scenes is connected to common scene data, the difference between the video reproduction time and the audio reproduction time differs on each of the reproduction paths because of the offset between the audio and video frame reproduction times. As a result, the audio or video buffer underflows at the scene connection, causing video reproduction to stop ("freeze") or audio reproduction to stop ("mute"), and thus preventing seamless reproduction. It will also be obvious that the difference between the audio and video reproduction times can cause a buffer underflow state even when common scene data is connected 1:1.

Therefore, the object of the present invention is to provide a data structure whereby multi-scene data can be naturally reproduced as a single title without the video presentation stopping ("freezing") at one-to-one, one-to-many, or many-to-many scene connections in the system stream; a method for generating a system stream having said data structure; a recording apparatus and a reproduction apparatus for recording and reproducing said system stream; and a medium to which said system stream can be recorded and from which said system stream can be reproduced by said recording apparatus and reproduction apparatus.

The present application is based upon Japanese Patent Application No. 7-252735 and 8-041581, which were filed on Sep. 29, 1995 and Feb. 28, 1996, respectively, the entire contents of which are expressly incorporated by reference herein.

SUMMARY OF THE INVENTION

The present invention has been developed with a view to substantially solving the above described disadvantages and has for its essential object to provide an optical disk for recording more than one system stream containing audio data and video data, wherein the audio data and video data of the plural system streams recorded to the optical disk are interleaved such that the difference between the input start times of the video data and audio data to the video buffer in the video decoder and the audio buffer in the audio decoder is less than the reproduction time of the number of audio frames that can be stored in the audio buffer plus one audio frame.

BRIEF DESCRIPTION OF THE DRAWINGS

These and other objects and features of the present invention will become clear from the following description taken in conjunction with the preferred embodiments thereof with reference to the accompanying drawings throughout which like parts are designated by like reference numerals, and in which:

FIG. 1 is a graph schematically showing a structure of multi media bit stream according to the present invention,

FIG. 2 is a block diagram showing an authoring encoder according to the present invention,

FIG. 3 is a block diagram showing an authoring decoder according to the present invention,

FIG. 4 is a side view of an optical disk storing the multi media bit stream of FIG. 1,

FIG. 5 is an enlarged view showing a portion confined by a circle of FIG. 4,

FIG. 6 is an enlarged view showing a portion confined by a circle of FIG. 5,

FIG. 7 is a side view showing a variation of the optical disk of FIG. 4,

FIG. 8 is a side view showing another variation of the optical disk of FIG. 4,

FIG. 9 is a plan view showing one example of track path formed on the recording surface of the optical disk of FIG. 4,

FIG. 10 is a plan view showing another example of track path formed on the recording surface of the optical disk of FIG. 4,

FIG. 11 is a diagonal view schematically showing one example of a track path pattern formed on the optical disk of FIG. 7,

FIG. 12 is a plan view showing another example of track path formed on the recording surface of the optical disk of FIG. 7,

FIG. 13 is a diagonal view schematically showing one example of a track path pattern formed on the optical disk of FIG. 8,

FIG. 14 is a plan view showing another example of track path formed on the recording surface of the optical disk of FIG. 8,

FIG. 15 is a flow chart showing details of the decoder synchronization process of FIG. 66,

FIG. 16 is a graph schematically showing the structure of multimedia bit stream for use in Digital Video Disk system according to the present invention,

FIG. 17 is a graph schematically showing the encoded video stream according to the present invention,

FIG. 18 is a graph schematically showing an internal structure of a video zone of FIG. 16.

FIG. 19 is a graph schematically showing the stream management information according to the present invention,

FIG. 20 is a graph schematically showing the structure the navigation pack NV of FIG. 17,

FIG. 21 is a graph in assistance of explaining a concept of parental lock playback control according to the present invention,

FIG. 22 is a graph schematically showing the data structure used in a digital video disk system according to the present invention,

FIG. 23 is a graph in assistance of explaining a concept of Multi-angle scene control according to the present invention,

FIG. 24 is a graph in assistance of explaining a concept of multi scene data connection,

FIG. 25 is a block diagram showing a DVD encoder according to the present invention,

FIG. 26 is a block diagram showing a DVD decoder according to the present invention,

FIG. 27 is a graph schematically showing an encoding information table generated by the encoding system controller of FIG. 25,

FIG. 28 is a graph schematically showing an encoding information tables,

FIG. 29 is a graph schematically showing an encoding parameters used by the video encoder of FIG. 25,

FIG. 30 is a graph schematically showing an example of the contents of the program chain information according to the present invention,

FIG. 31 is a graph schematically showing another example of the contents of the program chain information according to the present invention,

FIG. 32 is a flow chart showing the encode parameters generating operation for a system stream containing a single scene,

FIG. 33 is a graph in assistance of explaining a concept of multi-angle scene control according to the present in invention,

FIG. 34 is a flow chart, formed by FIGS. 34A and 34B, showing an operation of the DVD encoder of FIG. 25,

FIG. 35 is a flow chart showing detailed of the encode parameter production sub-routine of FIG. 34,

FIG. 36 is a flow chart showing the detailed of the VOB data setting routine of FIG. 35,

FIG. 37 is a flow chart showing the encode parameters generating operation for a seamless switching,

FIG. 38 is a flow chart showing the encode parameters generating operation for a system stream,

FIG. 39 is a graph showing simulated results of data input/output to the video buffer and audio buffer of the DVD decoder of FIG. 26,

FIG. 40 is a graph in assistance of explaining a concept of parental control according to the present invention,

FIG. 41 is a graph in assistance of explaining the data input/output to the video buffer of the DVD decoder DCD shown in FIG. 26 during contiguous reproduction,

FIG. 42 is a graph in assistance of explaining a possible problem under a parental lock control example shown in FIG. 40,

FIG. 43 is a graph in assistance of explaining a reproduction gap generated under parental lock control,

FIG. 44 is a graph showing a system streams produced according to the present invention,

FIG. 45 is a graph in assistance of explaining an operation whereby these system streams are connected,

FIG. 46 is a graph in assistance of explaining a method of generating a system streams,

FIG. 47 is a graph in assistance of explaining another method of producing a system stream,

FIG. 48 is a graph showing a structure of the end of the second common system stream and the beginnings of the two parental lock control system streams,

FIG. 49 is a graph in assistance of explaining the difference in the video reproduction time and audio reproduction time of different reproduction paths,

FIG. 50 is a block diagram showing an internal structure of the system encoder in the DVD encoder of FIG. 25,

FIG. 51 is a graph showing a structure of the end of the two parental lock control system streams and the beginning of the following common system stream Sse,

FIG. 52 is a graph in assistance of explaining the difference in the video reproduction time and audio reproduction time of different reproduction paths,

FIG. 53 is a flow chart showing details of system stream producing routine of FIG. 34,

FIG. 54 is a graph in assistance of explaining an operation to calculate an audio data movement MFAp1,

FIG. 55 is a graph in assistance of explaining an operation to calculate an audio data movement MFAp2,

FIG. 56 is a block diagram showing an internal structure of the synchronizer of FIG. 26,

FIG. 57 is a flow chart showing an operation executed by the audio decoder controller of FIG. 26,

FIGS. 58 and 59 are graphs showing decoding information table produced by the decoding system controller of FIG. 26,

FIG. 60 is a flow chart showing the operation of the DVD decoder DCD of FIG. 26,

FIG. 61 is a flow chart showing details of reproduction extracted PGC routing of FIG. 60,

FIG. 62 is a flow chat showing details of the stream buffer data transfer process according to the present invention,

FIG. 63 is a flow chart showing details of the non multi-angle decoding process of FIG. 62,

FIG. 64 is a flow chart showing details of the non-multi-angled interleave process of FIG. 63,

FIG. 65 is a flow chart showing details of the non-multi-angled contiguous block process,

FIG. 66 is a flow chart showing details of decoding data process of FIG. 64, performed by the stream buffer, is shown,

FIG. 67 is a graph schematically showing an actual arrangement of data blocks recorded to a data recording track on a recording medium according to the present invention,

FIG. 68 is a graph schematically showing contiguous block regions and interleaved block regions array,

FIG. 69 is a graph schematically showing a content of a VTS title VOBS according to the present invention, and

FIG. 70 is a graph schematically showing an internal data structure of the interleaved block regions according to the present invention.

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS

Data structure of the authoring system

The logic structure of the multimedia data bitstream processed using the recording apparatus, recording medium, reproduction apparatus, and authoring system according to the present invention is described first below with reference to FIG. 1.

In this structure, one title refers to the combination of video and audio data expressing program content recognized by a user for education, entertainment, or other purpose. Referenced to a motion picture (movie), one title may correspond to the content of an entire movie, or to just one scene within the movie.

A video title set (VTS) comprises the bitstream data containing the information for a specific number of titles. More specifically, each VTS comprises the video, audio, and 20 other reproduction data representing the content of each title in the set, and control data for controlling the content data.

The video zone VZ is the video data unit processed by the authoring system, and comprises a specific number of video title sets. More specifically, each video zone is a linear sequence of K+1 video title sets numbered VTS #0-VTS #K where K is an integer value of zero or greater. One video title set, preferably the first video title set VTS #0, is used as the video manager describing the content information of the titles contained in each video title set.

The multimedia bitstream MBS is the largest control unit of the multimedia data bitstream handled by the authoring system of the present invention, and comprises plural video zones VZ.

Authoring encoder EC

A preferred embodiment of the authoring encoder EC according to the present invention for generating a new multimedia bitstream MBS by re-encoding the original multimedia bitstream MBS according to the scenario desired by the user is shown in FIG. 2. Note that the original multimedia bitstream MBS comprises a video stream St1 containing the video information, a sub-picture stream St3 containing caption text and other auxiliary video information, and the audio stream St5 containing the audio information.

The video and audio streams are the bitstreams containing the video and audio information obtained from the source within a particular period of time. The sub-picture stream is a bitstream containing momentary video information relevant to a particular scene. The sub-picture data encoded to a single scene may be captured to video memory and displayed continuously from the video memory for plural scenes as may be necessary.

When this multimedia source data St1, St3, and St5 is obtained from a live broadcast, the video and audio signals are supplied in real-time from a video camera or other imaging source; when the multimedia source data is reproduced from a video tape or other recording medium, the audio and video signals are not real-time signals.

While the multimedia source stream is shown in FIG. 2 as comprising these three source signals, this is for convenience only, and it should be noted that the multimedia source stream may contain more than three types of source signals, and may contain source data for different titles. Multimedia source data with audio, video, and sub-picture data for plural titles are referred to below as multi-title streams.

As shown in FIG. 2, the authoring encoder EC comprises a scenario editor 100, encoding system controller 200, video encoder 300, video stream buffer 400, sub-picture encoder 500, sub-picture stream buffer 600, audio encoder 700, audio stream buffer 800, system encoder 900, video zone formatter 1300, recorder 1200, and recording medium M.

The video zone formatter 1300 comprises video object (VOB) buffer 1000, formatter 1100, and volume and file structure formatter 1400.

The bitstream encoded by the authoring encoder EC of the present embodiment is recorded by way of example only to an optical disk.

The scenario editor 100 of the authoring encoder EC outputs the scenario data, i.e., the user-defined editing instructions. The scenario data controls editing the corresponding parts of the multimedia bitstream MBS according to the user's manipulation of the video, sub-picture, and audio components of the original multimedia title. This scenario editor 100 preferably comprises a display, speaker(s), keyboard, CPU, and source stream buffer. The scenario editor 100 is connected to an external multimedia bitstream source from which the multimedia source data St1, St3, and St5 are supplied.

The user is thus able to reproduce the video and audio components of the multimedia source data using the display and speaker to confirm the content of the generated title. The user is then able to edit the title content according to the desired scenario using the keyboard, mouse, and other command input devices while confirming the content of the title on the display and speakers. The result of this multimedia data manipulation is the scenario data St7.

The scenario data St7 is basically a set of instructions describing what source data is selected from all or a subset of the source data containing plural titles within a defined time period, and how the selected source data is reassembled to reproduce the scenario (sequence) intended by the user. Based on the instructions received through the keyboard or other control device, the CPU codes the position, length, and the relative time-based positions of the edited parts of the respective multimedia source data streams St1, St3, and St5 to generate the scenario data St7.

The source stream buffer has a specific capacity, and is used to delay the multimedia source data streams St1, St3, and St5 a known time Td and then output streams St1, St3, and St5.

This delay is required for synchronization with the editor encoding process. More specifically, when data encoding and user generation of scenario data st7 are executed simultaneously, i.e., when encoding immediately follows editing, time Td is required to determine the content of the multimedia source data editing process based on the scenario data St7 as will be described further below. As a result, the multimedia source data must be delayed by time Td to synchronize the editing process during the actual encoding operation. Because this delay time Td is limited to the time required to synchronize the operation of the various system components in the case of sequential editing as described above, the source stream buffer is normally achieved by means of a high speed storage medium such as semiconductor memory.

During batch editing in which all multimedia source data is encoded at once ("batch encoded") after scenario data St7 is generated for the complete title, delay time Td must be long enough to process the complete title or longer. In this case, the source stream buffer may be a low speed, high capacity storage medium such as video tape, magnetic disk, or optical disk.

The structure (type) of media used for the source stream buffer may therefore be determined according to the delay time Td required and the allowable manufacturing cost.

The encoding system controller 200 is connected to the scenario editor 100 and receives the scenario data St7 therefrom. Based on the time-base position and length information of the edit segment contained in the scenario data St7, the encoding system controller 200 generates the encoding parameter signals St9, St11, and St13 for encoding the edit segment of the multimedia source data. The encoding signals St9, St11, and St13 supply the parameters used for video, sub-picture, and audio encoding, including the encoding start and end timing. Note that multimedia source data St1, St3, and St5 are output after delay time Td by the source stream buffer, and are therefore synchronized to encoding parameter signals St9, St11, and St13.

More specifically, encoding parameter signal St9 is the video encoding signal specifying the encoding timing of video stream St1 to extract the encoding segment from the video stream St1 and generate the video encoding unit. Encoding parameter signal St11 is likewise the sub-picture stream encoding signal used to generate the sub-picture encoding unit by specifying the encoding timing for sub-picture stream St3. Encoding parameter signal St13 is the audio encoding signal used to generate the audio encoding unit by specifying the encoding timing for audio stream St5.

Based on the time-base relationship between the encoding segments of streams St1, St3, and St5 in the multimedia source data contained in scenario data St7, the encoding system controller 200 generates the timing signals St21, St23, and St25 arranging the encoded multimedia-encoded stream in the specified time-base relationship.

The encoding system controller 200 also generates the reproduction time information IT defining the reproduction time of the title editing unit (video object, VOB), and the stream encoding data St33 defining the system encode parameters for multiplexing the encoded multimedia stream containing video, audio, and sub-picture data. Note that the reproduction time information IT and stream encoding data St33 are generated for the video object VOB of each title in one video zone VZ.

The encoding system controller 200 also generates the title sequence control signal St39, which declares the formatting parameters for formatting the title editing units VOB of each of the streams in a particular time-base relationship as a multimedia bitstream. More specifically, the title sequence control signal St39 is used to control the connections between the title editing units (VOB) of each title in the multimedia bitstream MBS, or to control the sequence of the interleaved title editing unit (VOBs) interleaving the title editing units VOB of plural reproduction paths.

The video encoder 300 is connected to the source stream buffer of the scenario editor 100 and to the encoding system controller 200, and receives therefrom the video stream St1 and video encoding parameter signal St9, respectively. Encoding parameters supplied by the video encoding signal St9 include the encoding start and end timing, bit rate, the encoding conditions for the encoding start and end, and the material type. Possible material types include NTSC or PAL video signal, and telecine converted material. Based on the video encoding parameter signal St9, the video encoder 300 encodes a specific part of the video stream St1 to generate the encoded video stream St15.

The sub-picture encoder 500 is similarly connected to the source stream buffer of the scenario editor 100 and to the encoding system controller 200, and receives therefrom the sub-picture stream St3 and sub-picture encoding parameter signal St11, respectively. Based on the sub-picture encoding parameter signal St11, the sub-picture encoder 500 encodes a specific part of the sub-picture stream St3 to generate the encoded sub-picture stream St17.

The audio encoder 700 is also connected to the source stream buffer of the scenario editor 100 and to the encoding system controller 200, and receives therefrom the audio stream St5 and audio encoding parameter signal St13, which supplies the encoding start and end timing. Based on the audio encoding parameter signal St13, the audio encoder 700 encodes a specific part of the audio stream St5 to generate the encoded audio stream St19.

The video stream buffer 400 is connected to the video encoder 300 and to the encoding system controller 200. The video stream buffer 400 stores the encoded video stream St15 input from the video encoder 300, and outputs the stored encoded video stream St15 as the time-delayed encoded video stream St27 based on the timing signal St21 supplied from the encoding system controller 200.

The sub-picture stream buffer 600 is similarly connected to the sub-picture encoder 500 and to the encoding system controller 200. The sub-picture stream buffer 600 stores the encoded sub-picture stream St17 output from the sub-picture encoder 500, and then outputs the stored encoded sub-picture stream St17 as time-delayed encoded sub-picture stream St29 based on the timing signal St23 supplied from the encoding system controller 200.

The audio stream buffer 800 is similarly connected to the audio encoder 700 and to the encoding system controller 200. The audio stream buffer 800 stores the encoded audio stream St19 input from the audio encoder 700, and then outputs the encoded audio stream St19 as the time-delayed encoded audio stream St31 based on the timing signal St25 supplied from the encoding system controller 200.

The system encoder 900 is connected to the video stream buffer 400, sub-picture stream buffer 600, audio stream buffer 800, and the encoding system controller 200, and is respectively supplied thereby with the time-delayed encoded video stream St27, time-delayed encoded sub-picture stream St29, time-delayed encoded audio stream St31, and the stream encoding data St33. Note that the system encoder 900 is a multiplexer that multiplexes the time-delayed streams St27, St29, and St31 based on the stream encoding data St33 (timing signal) to generate title editing unit (VOB) St35. The stream encoding data St33 contains the system encoding parameters, including the encoding start and end timing.

The video zone formatter 1300 is connected to the system encoder 900 and the encoding system controller 200 from which the title editing unit (VOB) St35 and title sequence control signal St39 (timing signal) are respectively supplied. The title sequence control signal St39 contains the formatting start and end timing, and the formatting parameters used to generate (format) a multimedia bitstream MBS. The video zone formatter 1300 rearranges the title editing units (VOB) St35 in one video zone VZ in the scenario sequence defined by the user based on the title sequence control signal St39 to generate the edited multimedia stream data St43.

The multimedia bitstream MBS St43 edited according to the user-defined scenario is then sent to the recorder 1200. The recorder 1200 processes the edited multimedia stream data St43 to the data stream St45 format of the recording medium M, and thus records the formatted data stream St45 to the recording medium M. Note that the multimedia bitstream MBS recorded to the recording medium M contains the volume file structure VFS, which includes the physical address of the data on the recording medium generated by the video zone formatter 1300.

Note that the encoded multimedia bitstream MBS St35 may be output directly to the decoder to immediately reproduce the edited title content. It will be obvious that the output multimedia bitstream MBS will not in this case contain the volume file structure VFS.

Authoring decoder

A preferred embodiment of the authoring decoder DC used to decode the multimedia bitstream MBS edited by the authoring encoder EC of the present invention, and thereby reproduce the content of each title unit according to the user-defined scenario, is described next below with reference to FIG. 3. Note that in the preferred embodiment described below the multimedia bitstream St45 encoded by the authoring encoder EC is recorded to the recording medium M.

As shown in FIG. 3, the authoring decoder DC comprises a multimedia bitstream producer 2000, scenario selector 2100, decoding system controller 2300, stream buffer 2400, system decoder 2500, video buffer 2600, sub-picture buffer 2700, audio buffer 2800, synchronizer 2900, video decoder 3800, sub-picture decoder 3100, audio decoder 3200, synthesizer 3500, video data output terminal 3600, and audio data output terminal 3700.

The bitstream producer 2000 comprises a recording media drive unit 2004 for driving the recording medium M; a reading head 2006 for reading the information recorded to the recording medium M and producing the binary read signal St57; a signal processor 2008 for variously processing the read signal St57 to generate the reproduced bitstream St61; and a reproduction controller 2002.

The reproduction controller 2002 is connected to the decoding system controller 2300 from which the multimedia bitstream reproduction control signal St53 is supplied, and in turn generates the reproduction control signals St55 and St59 respectively controlling the recording media drive unit (motor) 2004 and signal processor 2008.

So that the user-defined video, sub-picture, and audio portions of the multimedia title edited by the authoring encoder EC are reproduced, the authoring decoder DC comprises a scenario selector 2100 for selecting and reproducing the corresponding scenes (titles). The scenario selector 2100 then outputs the selected titles as scenario data to the authoring decoder DC.

The scenario selector 2100 preferably comprises a keyboard, CPU, and monitor. Using the keyboard, the user then inputs the desired scenario based on the content of the scenario input by the authoring encoder EC. Based on the keyboard input, the CPU generates the scenario selection data St51 specifying the selected scenario. The scenario selector 2100 is connected by an infrared communications device, for example, to the decoding system controller 2300, to which it inputs the scenario selection data St51.

Based on the scenario selection data St51, the decoding system controller 2300 then generates the bitstream reproduction control signal St53 controlling the operation of the bitstream producer 2000.

The stream buffer 2400 has a specific buffer capacity used to temporarily store the reproduced bitstream St61 input from the bitstream producer 2000, extract the address information and initial synchronization data SCR (system clock reference) for each stream, and generate bitstream control data St63. The stream buffer 2400 is also connected to the decoding system controller 2300, to which it supplies the generated bitstream control data St63.

The synchronizer 2900 is connected to the decoding system controller 2300 from which it receives the system clock reference SCR contained in the synchronization control data St81 to set the internal system clock STC and supply the reset system clock St79 to the decoding system controller 2300.

Based on this system clock St79, the decoding system controller 2300 also generates the stream read signal St65 at a specific interval and outputs the read signal St65 to the stream buffer 2400.

Based on the supplied read signal St65, the stream buffer 2400 outputs the reproduced bitstream St61 at a specific interval to the system decoder 2500 as bitstream St67.

Based on the scenario selection data St51, the decoding system controller 2300 generates the decoding signal St69 defining the stream Ids for the video, sub-picture, and audio bitstreams corresponding to the selected scenario, and outputs to the system decoder 2500.

Based on the instructions contained in the decoding signal St69, the system decoder 2500 respectively outputs the video, sub-picture, and audio bitstreams input from the stream buffer 2400 to the video buffer 2600, sub-picture buffer 2700, and audio buffer 2800 as the encoded video stream St71, encoded sub-picture stream St73, and encoded audio stream St75.

The system decoder 2500 detects the presentation time stamp PTS and decoding time stamp DTS of the smallest control unit in each bitstream St67 to generate the time information signal St77. This time information signal St77 is supplied to the synchronizer 2900 through the decoding system controller 2300 as the synchronization control data St81.

Based on this synchronization control data St81, the synchronizer 2900 determines the decoding start timing whereby each of the bitstreams will be arranged in the correct sequence after decoding, and then generates and inputs the video stream decoding start signal St89 to the video decoder 3800 based on this decoding timing. The synchronizer 2900 also generates and supplies the sub-picture decoding start signal St91 and audio stream decoding start signal St93 to the sub-picture decoder 3100 and audio decoder 3200, respectively.

The video decoder 3800 generates the video output request signal St84 based on the video stream decoding start signal St89, and outputs to the video buffer 2600. In response to the video output request signal St84, the video buffer 2600 outputs the video stream St83 to the video decoder 3800. The video decoder 3800 thus detects the presentation time information contained in the video stream St83, and disables the video output request signal St84 when the length of the received video stream St83 is equivalent to the specified presentation time. A video stream equal in length to the specified presentation time is thus decoded by the video decoder 3800, which outputs the reproduced video signal St104 to the synthesizer 3500.

The sub-picture decoder 3100 similarly generates the sub-picture output request signal St86 based on the sub-picture decoding start signal St91, and outputs to the sub-picture buffer 2700. In response to the sub-picture output request signal St86, the sub-picture buffer 2700 outputs the sub-picture stream St85 to the sub-picture decoder 3100. Based on the presentation time information contained in the sub-picture stream St85, the sub-picture decoder 3100 decodes a length of the sub-picture stream St85 corresponding to the specified presentation time to reproduce and supply to the synthesizer 3500 the sub-picture signal St99.

The synthesizer 3500 superimposes the video signal St104 and sub-picture signal St99 to generate and output the multi-picture video signal St105 to the video data output terminal 3600.

The audio decoder 3200 generates and supplies to the audio buffer 2800 the audio output request signal St88 based on the audio stream decoding start signal St93. The audio buffer 2800 thus outputs the audio stream St87 to the audio decoder 3200. The audio decoder 3200 decodes a length of the audio stream St87 corresponding to the specified presentation time based on the presentation time information contained in the audio stream St87, and outputs the decoded audio stream St101 to the audio data output terminal 3700.

It is thus possible to reproduce a user-defined multimedia bitstream MBS in real-time according to a user-defined scenario. More specifically, each time the user selects a different scenario, the authoring decoder DC is able to reproduce the title content desired by the user in the desired sequence by reproducing the multimedia bitstream MBS corresponding to the selected scenario.

It is therefore possible by means of the authoring system of the present invention to generate a multimedia bitstream according to plural user-defined scenarios by real-time or batch encoding multimedia source data in a manner whereby the substreams of the smallest editing units (scenes), which can be divided into plural substreams, expressing the basic title content are arranged in a specific time-base relationship.

The multimedia bitstream thus encoded can then be reproduced according to the one scenario selected from among plural possible scenarios. It is also possible to change scenarios while playback is in progress, i.e., to select a different scenario and dynamically generate a new multimedia bitstream according to the most recently selected scenario. It is also possible to dynamically select and reproduce any of plural scenes while reproducing the title content according to a desired scenario.

It is therefore possible by means of the authoring system of the present invention to encode and not only reproduce but to repeatedly reproduce a multimedia bitstream MBS in real-time.

A detail of the authoring system is disclosed Japanese Patent Application filed September 27, 1996, and entitled and assigned to the same assignee as the present application.

Digital video disk (DVD)

An example of a digital video disk (DVD) with only one recording surface (a single-sided DVD) is shown in FIG. 4.

The DVD recording medium RC1 in the preferred embodiment of the invention comprises a data recording surface RS1 to and from which data is written and read by emitting laser beam LS, and a protective layer PL1 covering the data recording surface RS1. A backing layer BL1 is also provided on the back of data recording surface RS1. The side of the disk on which protective layer PL1 is provided is therefore referred to below as side SA (commonly "side A"), and the opposite side (on which the backing layer BL1 is provided) is referred to as side SB ("side B"). Note that digital video disk recording media having a single data recording surface RS1 on only one side such as this DVD recording medium RC1 is commonly called a single-sided single layer disk.

A detailed illustration of area C1 in FIG. 4 is shown in FIG. 5. Note that the data recording surface RS1 is formed by applying a metallic thin film or other reflective coating as a data layer 4109 on a first transparent layer 4108 having a particular thickness T1. This first transparent layer 4108 also functions as the protective layer PL1. A second transparent substrate 4111 of a thickness T2 functions as the backing layer BL1, and is bonded to the first transparent layer 4108 by means of an adhesive layer 4110 disposed therebetween.

A printing layer 4112 for printing a disk label may also be disposed on the second transparent substrate 4111 as necessary. The printing layer 4112 does not usually cover the entire surface area of the second transparent substrate 4111 (backing layer BL1), but only the area needed to print the text and graphics of the disk label. The area of second transparent substrate 4111 to which the printing layer 4112 is not formed may be left exposed. Light reflected from the data layer 4109 (metallic thin film) forming the data recording surface RS1 can therefore be directly observed where the label is not printed when the digital video disk is viewed from side SB. As a result, the background looks like a silver-white over which the printed text and graphics float when the metallic thin film is an aluminum thin film, for example.

Note that it is only necessary to provide the printing layer 4112 where needed for printing, and it is not necessary to provide the printing layer 4112 over the entire surface of the backing layer BL1.

A detailed illustration of area C2 in FIG. 5 is shown in FIG. 6. Pits and lands are molded to the common contact surface between the first transparent layer 4108 and the data layer 4109 on side SA from which data is read by emitting a laser beam LS, and data is recorded by varying the lengths of the pits and lands (i.e., the length of the intervals between the pits). More specifically, the pit and land configuration formed on the first transparent layer 4108 is transferred to the data layer 4109. The lengths of the pits and lands is shorter, and the pitch of the data tracks formed by the pit sequences is narrower, than with a conventional Compact Disc (CD). The surface recording density is therefore greatly improved.

Side SA of the first transparent layer 4108 on which data pits are not formed is a flat surface. The second transparent substrate 4111 is for reinforcement, and is a transparent panel made from the same material as the first transparent layer 4108 with both sides flat. Thicknesses T1 and T2 are preferably equal and commonly approximately 0.6 mm, but the invention shall not be so limited.

As with a CD, information is read by irradiating the surface with a laser beam LS and detecting the change in the reflectivity of the light spot. Because the objective lens aperture NA can be large and the wavelength l of the light beam small in a digital video disk system, the diameter of the light spot Ls used can be reduced to approximately 1/1.6 the light spot needed to read a CD. Note that this means the resolution of the laser beam LS in the DVD system is approximately 1.6 times the resolution of a conventional CD system.

The optical system used to read data from the digital video disk uses a short 650 nm wavelength red semiconductor la