|
Description  |
|
|
FIELD OF THE INVENTION
This invention relates generally to video production, photographic image processing, and computer graphics design, and, more particularly, to a multi-format video production system capable of professional quality editing and manipulation of
images intended for television and other applications, including HDTV programs.
BACKGROUND OF THE INVENTION
As the number of television channels available through various program delivery methods (cable TV, home video, broadcast, etc.) continues to proliferate, the demand for programming, particularly high-quality ItDTV-format programming, presents
special challenges, both technical and financial, to program producers. While the price of professional editing and image manipulation equipment continues to increase, due to the high cost of research and development and other factors, general-purpose
hardware, including personal computers, can produce remarkable effects at a cost well within the reach of non-professionals, even novices. As a result, the distinction between these two classifications of equipment has become less well defined.
The parent to this application, for example, describes a video production system which integrates equipment supplied by various manufacturers, enabling a consumer to produce and edit video material using an enhanced personal computer. An adapter
unit interfaced to each camera in use with the system connects to a camera interface module, and each camera interface module, in turn, feeds a computer interface unit. These computer interface units communicate with a personal computer over a standard
interconnect, allowing an operator to control the various cameras while viewing individual video programs which appear in separate "windows" on the computer monitor.
This related invention solves many of the problems associated with combining commercially available hardware to create an economical personal-computer-based system capable of very high quality audio/video production. However, the variety of
available and planned program standards and delivery methods places further demands on video production equipment, including the editing and manipulation of images not only from a variety of sources, but in differing pixel formats, frame rates, and so
forth. Although general-purpose PC-based equipment may never allow professional-style rendering of images at full resolution in real-time, each new generation of microprocessors enables progressively faster, higher-resolution applications. In addition,
as the price of memory circuits and other data storage hardware continues to fall, the capacity of such devices has risen dramatically, thereby improving the prospects for enhancing PC-based image manipulation systems for such applications.
In terms of dedicated equipment, attention has traditionally focused on the development of two kinds of professional image-manipulation systems: those intended for the highest quality levels to support film effects, and those intended for
television broadcast to provide "full 35 mm theatrical film quality," within the realities and economics of present broadcasting systems. Conventional thinking holds that 35 mm theatrical film quality is equivalent to 1200 or more lines of resolution,
whereas camera negatives present 2500 or more lines. As a result, image formats under consideration have been directed towards video systems having 2500 or more scan lines for high-level production (such as the Kodak "Electronic Intermediate" system
described by Hunt et al.), with hierarchies of production, HDTV broadcast, and NTSC and PAL compatible standards which are derived by down-converting these formats. Several techniques have been described, including those of Bretyl ("3.times.NTSC
`Leapfrog` Production Standard for HDTV", SMPTE Journal, March 1989), Demos ("An Example Hierarchy of Formats for HDTV", SMPTE Journal, September 1992), and Lim ("A Proposal for an HDTV/ATV Standard with Multiple Transmission Formats", SMPTE Journal,
August 1993). Most proposals employ progressive scanning, although interlace is considered an acceptable alternative as part of an evolutionary process. In particular, Demos addresses the important issue of compatibility to computer-graphics-compatible
formats, although he begins with an 1152-line format, and only considers progressive scanning. And, as pointed out by Thorpe et al., progressive scanning also has drawbacks, and as shown by Kaiser et al. ("Resolution Requirements for HDTV Based Upon the
Performance of 35 mm Motion-Picture Films for Theatrical Viewing", SMPTE Journal, June 1985), even 35 mm theatrical film quality is a misnomer since the realities of mechanical projection systems restrict the typical screen display to less than 700 TV
lines/picture height.
Current technology directions in computers and image processing should allow production equipment based upon fewer than 1200 scan lines, with picture expansions to create a hierarchy of upward-converted formats for theatrical projection, film
effects, and film recording. In addition, general-purpose hardware enhancements should be capable of addressing the economic aspects of production, a subject not considered in detail by any of the available references.
SUMMARY OF THE INVENTION
The present invention takes advantage of general-purpose hardware where possible to provide an economical multi-format video production system. In the preferred embodiment, specialized graphics processing capabilities are included in a
high-performance personal computer or workstation, enabling the user to edit and manipulate an input video program and produce an output version of the program in a final format which may have a different frame rate, pixel dimensions, or both. An
internal production format is chosen which provides the greatest compatibility with existing and planned formats associated with standard and widescreen television, high-definition television, and film. For compatibility with film, the frame rate of the
internal production format is preferably 24 fps. Images are re-sized by the system to larger or smaller dimensions so as to fill the particular needs of individual applications, and frame rates are adapted by inter-frame interpolation or by traditional
schemes, including "3:2 pull-down" for 24-to-30 fps conversions, or by manipulating the frame rate itself for 24 to 25 fps for a PAL-compatible display. The enhancement to a general-purpose platform preferably takes the form of a graphics processor
connected to receive a video signal in an input format. The processor comprises a plurality of interface units, including a standard/widescreen interface unit operative to convert the video program in the input format into an output signal
representative of a standard/widescreen formatted image, and output the signal to an attached display device. A high-definition television interface unit is operative to convert the video program in the input format into an output signal representative
of an HDTV-formatted image, and output the signal to the display device. A centralized controller in operative communication with the video program input, the graphics processor, and an operator interface, enables commands entered by an operator to
cause the graphics processor to perform one or more of the conversions using the television interfaces. The present invention thus encourages production at relatively low pixel dimensions to make use of lower-cost general-purpose hardware and to
maintain high signal-to-noise, then subsequently expands the result into a higher-format final program. This is in contrast to competing approaches, which recommend operating at higher resolution, then down-sizing, if necessary, to less expensive
formats which has led to the high-cost, dedicated hardware, the need for which the present invention seeks to eliminate.
BRIEF DESCRIPTION OF THE DRAWINGS
FIGS. 1A-1D show the preferred and alternative image aspect ratios in pixels;
FIG. 2A shows the mechanical design for a digital camera configured to execute the preferred embodiment;
FIG. 2B shows a digital camera configured to execute the preferred embodiment for several different formats;
FIG. 2C shows a low-cost digital camera configured to execute the preferred embodiment for several different formats;
FIG. 3 shows a functional diagram for disk-based video recording;
FIG. 4 shows the components comprising the multi-format audio/video production system;
FIG. 5 depicts an approach for reducing the chrominance bandwidth of wide-band analog RGB output signals without decreasing the luminance resolution;
FIG. 6 shows the inter-relationship of the multi-format audio/video production system to many of the various existing and planned video formats; and
FIG. 7 shows the implementation of a complete television production system, based on one possible choice for image sizes and aspect ratios.
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT
The present invention builds upon and extends certain of the concepts introduced in the parent to this application, "Personal-Computer-Based Video Production System," Ser. No. 08/050,861 filed Apr. 21, 1993. The system described in that
application allows an operator to control equipment supplied by various manufacturers at a centralized personal computer to produce, edit and record a video program. Each camera to be used with the system described in this previously filed application
feeds a signal to the personal computer through a custom adapter unit, cable and camera interface module the latter containing cable compensation and gain circuitry. The interface modules feed a common video switcher, audio mixer and display means, all
of which may be provided by a variety of sources, including different manufacturers. In the preferred embodiment, the display is the monitor of a programmed personal computer, and computer interface modules connected between each camera interface module
and the computer allow video images generated by the cameras to appear in different windows on the computer monitor. Control signals entered at the computer are routed to the cameras in order to control their functioning.
The present invention is primarily concerned with a different but related aspect of facilitating professional quality audio/video production; namely, the conversion of disparate graphics or television formats, including requisite frame-rate
conversions, to establish an interrelated family of aspect ratios, resolutions, and frame rates, while remaining compatible with available and future graphics/TV formats. These formats include images of pixel dimensions capable of being displayed on
currently available multi-scan computer monitors, and custom hardware will be described whereby frames of higher pixel-count beyond the capabilities of these monitors may be viewed. Images are re-sized by the system to larger or smaller dimensions so as
to fill the particular needs of individual applications, and frame rates are adapted by inter-frame interpolation or by traditional schemes such as using "3:2 pull-down" (for 24 to 30 frame-per-second film-to-NTSC conversions) or by speeding up the frame
rate itself (as for 24 to 25 fps for PAL television display). The resizing operations may involve preservation of the image aspect ratio, or may change the aspect ratio by "cropping" certain areas, by performing non-linear transformations, such as
"squeezing" the picture, or by changing the vision center for "panning," "scanning" and so forth. Inasmuch as film is often referred to as "the universal format," primarily because 35-mm film equipment is standardized and used throughout the world, the
preferred internal or "production" frame rate is preferably 24 fps. This selection also has an additional benefit, in that the 24 fps rate allows the implementation of cameras having greater sensitivity than at 30 fps, which is even more critical in
systems using progressive scanning, for which the rate will be 48 fields per second vs. 60 fields per second in some other proposed systems.
The image dimensions chosen allow the use of conventional CCD-type cameras, but the use of digital processing directly through the entire signal chain is preferred, and this is implemented by replacing the typical analog RGB processing circuitry
with fully digital circuitry. Production effects may be conducted in whatever image size is appropriate, and then re-sized for recording. Images are recorded by writing the digital data to storage devices employing removable hard-disk drives, disk
drives with removable media, optical or magneto-optical based drives, or tape-based drives, preferably in compressed-data form. As data rates for image processing and reading-from or writing-to disk drives increase, many processes that currently require
several seconds will soon become attainable in real-time, which will eliminate the need to record film frames at slower rates. Other production effects, such as slow-motion or fast-motion may be incorporated, and it is only the frame rates of these
effects that are limited in any way by the technology of the day. In particular, techniques such as non-linear-editing, animation, and special-effects will benefit from the implementation of this system. In terms of audio, the data rate requirements
are largely a function of sound quality. The audio signals may be handled separately, as in an "interlocked" or synchronized system for production, or the audio data may be interleaved within the video data stream. The method selected will depend on
the type of production manipulations desired, and by the limitations of the current technology.
Although a wide variety of video formats and apparatus configurations are applicable to the present invention, the system will be described in terms of the alternatives most compatible with currently available equipment and methods. FIG. 1A
illustrates one example of a compatible system of image sizes and pixel dimensions. The selected frame rate is preferably 24 per second (2:1 interlaced), for compatibility with film elements; the selected picture dimension in pixels is preferably
1024.times.576 (0.5625 Mpxl), for compatibility with the 16:9 "widescreen" aspect ratio anticipated for all HDTV systems, and the conventional 4:3 aspect ratio used for PAL systems [768.times.576 (0.421875 Mpxl)]. All implementations preferably rely on
square pixels, though other pixel shapes may be used. Re-sizing (using the well known, sophisticated sampling techniques available in many image-manipulation software packages or, alternatively, using hardware circuitry described herein below) to
2048.times.1152 (2.25 Mpxl) provides an image suitable for HDTV displays or even theatrical projection systems, and a further re-sizing to 4096.times.2304 (9.0 Mpxl) is appropriate for even the most demanding production effects. Images may be data
compressed 5:1 for 16:9 "wide-screen" TV frames, or 10:1 for HDTV; the data files may then be stored on conventional disk drives, requiring only approximately 8.1 MB/sec for wide-screen frames in RGB, and only 16.1 MB/sec for HDTV frames in RGB.
An alternative embodiment of the invention is shown in FIG. 1B. In this case, the user would follow a technique commonly used in film production, in which the film is exposed as a 4:3 aspect ratio image. When projected as a wide-screen format
image, the upper and lower areas of the frame may be blocked by an aperture plate, so that the image shows the desired aspect ratio (typically 1.85:1 or 1.66:1). If the original image format were recorded at 24 frames per second, with a 4:3 ratio and
with a dimension in pixels of 1024.times.768, all image manipulations would preserve these dimensions. Complete compatibility with the existing formats would result, with NTSC and PAL images produced directly from these images by re-scaling, and the
aforementioned wide-screen images would be provided by excluding 96 rows of pixels from the top of the image and 96 rows of pixels from the bottom of the image, resulting in the 1024.times.576 image size as disclosed above. The data content of each of
these frames would be 0.75 Mpxls, and the data storage requirements disclosed above would be affected accordingly.
Another embodiment of the invention is depicted in FIG. 1C. In this alternative, the system would follow the image dimensions suggested in several proposed digital HDTV formats under consideration by the Advanced Television Study Committee of
the Federal Communications Commission. The format to be adopted is expected to assume a wide-screen image having dimensions of 1280.times.720 pixels. Using these image dimensions (but at 24 fps with 2:1 interlace), compatibility with the existing
formats would be available, with NTSC and PAL images derived from this frame size by excluding 160 columns of pixels from each side of the image, thereby resulting in an image having a dimension in pixels of 960.times.720. This new image would then be
re-scaled to produce images having pixel dimensions of 640.times.480 for NTSC, or 768.times.576 for PAL; the corresponding wide-screen formats would be 854.times.480 and 1024.times.576, respectively. In this case, an image having a dimension in pixels
of 1280.times.720 would contain 0.87890625 Mpxl, with 1,000 TV lines of resolution; furthermore, the systems under evaluation by the ATSC of the FCC also assume a decimation of the two chrominance signals, with detail of only 640.times.360 pixels
retained. The data storage requirements disclosed above would be affected accordingly. The development path to 24 fps with progressive scanning is both well-defined and practical, as is the use of the previously described methods to produce images
having a dimension in pixels of 2048.times.1152.
A further alternative embodiment of the invention is shown in FIG. 1D. As with the system described with reference to FIG. 1B, the user follows the technique commonly used in film production, wherein the film is exposed as a 4:3 aspect ratio
image. When projected as a wide-screen format image, the upper and lower areas of the frame area again blocked by an aperture plate, so that the image shows the desired aspect ratio (typically 1.85:1 or 1.66:1). For an original image format recorded at
24 frames per second, with 4:3 ratio and with pixel dimensions of 1280.times.960, all image manipulations preserve these dimensions. Complete compatibility with the existing formats results, with NTSC and PAL images produced directly from these images
by rescaling, and the aforementioned wide-screen images are provided by excluding 120 rows of pixels from the top of the image and 120 rows of pixels from the bottom of the image, thereby resulting in the 1280.times.720 image size as described above.
The data content of each of these frames is 0.87890625 Mpxls, and the data storage requirements disclosed above are affected accordingly.
Currently available CCD elements for PAL/HDTV dual-use cameras provide 600,000 pixels, typically as arrays of 1024.times.592 or similar dimensions. By modifying the camera circuitry, the optical and CCD-driver circuitry may be adapted for use by
the present invention, thereby allowing for economical implementation of the preferred configuration. FIG. 2A shows a camera as modified for this application. A lens 2 and viewfinder 4 are mounted upon the body of the camera frame. The usual
optical-splitter, CCD-sensors and driver circuitry, and the inventive all-digital signal processing circuitry are located at 6, with optional battery-pack capability at 10. The various analog and digital output signals and any input audio, video or
control signals, all shown generally at 16, are interfaced through appropriate connectors disposed on the rear-panel 12 and sub-panel 14. Provisions are included a shown for the input of analog audio signals, and for the output of both analog and
digital audio signals. Preferably fiber-optic cabling is employed to carry the necessary signals. Internal video recording facilities 8 are described herein below.
Conventional CCD-element cameras of the type described above produce images of over 800 TV Lines horizontal Luminance (Y) resolution, with a sensitivity of 2,000 lux at f8, and with a signal-to-noise ratio of 62 dB. However, typical HDTV
cameras, at 1,000 TV Lines resolution and with similar sensitivity, produce an image with only a 54 dB signal-to-noise ratio, due to the constraints of the wideband analog amplifiers and the smaller physical size of the CCD-pixel-elements. By employing
the more conventional CCD-elements in the camera systems of this invention, and by relying upon the computer to create the HDTV-type image by image re-sizing, the improved signal-to-noise ratio is retained. In the practical implementation of cameras
conforming to this new design approach, there will be less of a need for extensive lighting provisions, which in turn, means less demand upon the power generators in remote productions, and for AC-power in studio applications.
FIG. 2B shows the configuration of a digital video camera implementing the preferred embodiment of the invention. A lens assembly 20 is coupled to an optical beam-splitter 22, which focuses red, green and blue images onto CCD-elements 24a, 24b,
and 24c, respectively. The output signals from each of these CCD-elements is directed to its respective analog-to-digital converter 26a, 26b, and 26c. The output of these three analog-to-digital converters is carried to digital signal processor 28,
which provides digital signal outputs 34, configured as RGB, Y/R-Y/B-Y, YUV, YIQ, or any other format, as desired. In addition, these digital output signals are also provided to digital-to-analog converters 30a, 30b, and 30c, and from these converters
to the analog signal processor 32. This processor provides the analog output signals 36 in the format desired, including the RGB, Y/R-Y/B-Y, YUV, YIQ, or other formats as described above, or additionally, in the composite video or Y/C formats commonly
employed in conventional video production equipment and VTRs. A fiber-optic interface 38 accepts digital video signals from the digital signal processor 28 and provides these signals through the fiber-optic cable 40. Control signals are received from
the fiber-optic cable 40 and carried through to the digital signal processor 28; other camera operational and status signals, such as tally signals, remote lens controls, return video signals, and so forth, are carried in the reverse direction along this
same path from the digital signal processor 28, through the fiber-optic interface 38, to the fiber-optic cable 40.
In practice, the implementation of this design using three 600,000-element CCDs and the commonly employed technique of the spatial-shift for the green CCD-element (as described below) will produce Y/R-Y/B-Y signals with 800 TV lines of
resolution, and will provide a luminance bandwidth of 15 MHz and a Chrominance bandwidth of 7.5 MHz. The RGB video signal outputs will provide a full 15 MHz bandwidth for each channel, and the camera will be suitable for the conventional/widescreen
application described herein. However, for HDTV production, a higher performance level is desired. Accordingly, the system of FIG. 2B, as described above, is implemented with three of the latest 2.4 Mpxl CCD-elements, providing images of pixel
dimension 2048.times.1152. In the digital realm, the resultant image is 6.75 MB per frame, and the data rate of 162 MB/sec is subjected to a 10:1 data-compression to 16.2 MB/sec for recording and production. The resulting image exhibits over 1,000 TV
lines of resolution, again relying upon the spatial shift of the green CCD-element as described herein below. For Y/R-Y/B-Y signals, the Luminance bandwidth will be 60 MHz, and the Chrominance bandwidth will be 30 MHz. The RGB video signal outputs will
provide a full 60 MHz bandwidth for each channel. In this case, it will be possible to re-size the picture image to be as large as 8192.times.4608, which would even enable the system to be used for special optical effects, or with other specialized film
formats, such as IMAX and those employing 65 mm camera negatives.
A more economical alternative implementation of the camera system is shown in FIG. 2C. In this case, the camera employs a single 1.2 Mpxl CCD-element, using color filters to produce the color signals. As shown, the camera lens assembly 42 is
coupled to the color-filter assembly 44. The Luminance signal 46, and the Chrominance signals 48 are provided to the inputs of their respective analog-to-digital converters 50 and 52. The outputs of these converters are provided to the digital signal
processor 54, which produces the digital video output signals 62. These signals may be in any of a number of alternative formats, including, for example, RGB, Y/R-Y/B-Y, YUV, or YIQ. These signals are additionally provided to digital-to-analog
converters 56a, 56b, and 56c, respectively, and then to the analog signal processor 60, which provides analog output signals 64 in the format desired, including the RGB, Y/R-Y/B-Y, YUV, YIQ, or other formats as described above, or additionally in the
composite video or Y/C formats commonly employed in conventional video production equipment and VTRs. In this case, the image size will be 1024.times.576 for the luminance channel (producing approximately 600 TV Lines of resolution), and 512.times.576
for each of the chrominance channels. In this case, it is not possible to introduce the green spatial-shift approach, because only a single CCD-element is employed. However, the luminance channel bandwidth achieved will be 15 MHz, and the chrominance
channel bandwidth will be 7.5 MHz.
In CCD-based cameras, it is a common technique to increase the apparent resolution by mounting the red and blue CCD-elements in registration, but offsetting the green CCD-element by one-half pixel width horizontally. In this case, picture
information is in-phase, but spurious information due to aliasing is out-of-phase. When the three color signals are mixed, the picture information is intact, but most of the alias information will be canceled out. This technique will evidently be less
effective when objects are of solid colors, so it is still the usual practice to include low-pass optical filters mounted on each CCD-element to suppress the alias information. In addition, this technique cannot be applied to computer-based graphics, in
which the pixel images for each color are always in registration. However, in general-use video, the result of the application of this spatial-shift offset is to raise the apparent luminance (Y) horizontal resolution to approximately 800 television
lines.
The availability of hard-disk drives of progressively higher capacity and data transmission rates is allowing successively longer and higher resolution image displays in real-time. At the previously cited data rates, wide-screen frames would
require 486 MB/min, so that currently available 10 GB disk drives will store more than 21 minutes of video. When the anticipated 100 GB disk drives (2.5-inch or 3.5-inch disks using Co-Cr, barium ferrite, or other high-density recording magnetic
materials) become available, these units will store 210 minutes, or 31/2 hours of video. For this application, a data storage unit 8 is provided to facilitate editing and production activities, and it is anticipated that these units would be employed in
much the same way as video cassettes are currently used in Betacam and other electronic news gathering (ENG) cameras and in video productions. This data storage unit may be implemented by use of a magnetic, optical, or magneto-optical disk drive with
removable storage media, or by a removable disk-drive unit, such as those based on the PCMCIA standards. Although PCMCIA media are 1.8-inches in dimension, alternative removable media storage units are not restricted to this limit, and could employ
larger media, such as 2.5-inch or 3.5-inch disks; this, in turn, will lead to longer duration program data storage, or could be applied to lower ratios of data compression or higher-pixel-count images within the limits of the same size media.
FIG. 3 shows the functional diagram for the storage-device-based digital recorder employed in the video camera, or separately in editing and production facilities. As shown, a removable hard disk drive 70 is interfaced through a bus controller
72; in practice, alternative methods of storage such as optical or magneto-optical drives could be used, based on various interface bus standards such as SCSI-2 or PCMCIA. This disk drive system currently achieves data transfer rates of 20 MB/sec, and
higher rates on these or other data storage devices, such as high-capacity removable memory modules, is anticipated. The microprocessor 74 controls the 64-bit or wider data bus 80, which integrates the various components. Currently available
microprocessors include the Alpha 21064 by Digital Equipment Corporation, or the MIPS R4400 by MIPS Technologies, Inc.; future implementations would rely on the already announced P6 by Intel Corp. or the PowerPC 620, which is capable of sustained data
transfer rates of 100 MB/sec. Up to 256 MB of ROM, shown at 76, is anticipated for operation, as is 256 MB or more of RAM, shown at 78. Current PC-based video production systems are equipped with at least 64 MB of RAM, to allow sophisticated editing
effects. The graphics processor 82 represents dedicated hardware that performs the various manipulations required to process the input video signals 84 and the output video signals 86; although shown using an RGB format, either the inputs or outputs
could be configured in alternative formats, such as Y/R-Y/B-Y, YIQ, YUV or other commonly used alternatives. In particular, while a software-based implementation of the processor 82 is possible, a hardware-based implementation preferred, with the system
employing a compression ratio of 5:1 for the conventional/widescreen signals ("NTSC/PAL/Widescreen"), and a 10:1 compression ratio for HDTV signals (2048.times.1152, as described herein above). An example of one of the many available options for this
data compression is the currently available Motion-JPEG system. Image re-sizing may alternatively be performed by dedicated microprocessors, such as the gm865.times.1 or gm833.times.3 by Genesis Microchip, Inc. Audio signals may be included within the
data stream, as proposed in the several systems for digital television transmission already under evaluation by the Federal Communications Commission, or by one of the methods available for integrating audio and video signals used in multi-media
recording schemes, such as the Microsoft ".AVI" (Audio/Video Interleave) file format. As an alternative, an independent system for recording audio signals may be implemented, either by employing separate digital recording provisions controlled by the
same system and electronics, or by implementing completely separate equipment external to the camera system described herein above.
FIG. 4 shows the components that comprise a multi-format audio/video production system. As in the case of the computer disk-based recording system of FIG. 3, an interface bus controller 106 provides access to a variety of storage devices,
preferably including an internal hard-disk drive 100, a tape-back-up drive 102, and a hard-disk drive with removable media or a removable hard-disk drive 104. The interface bus standards implemented could include, among others, SCSI-2 or PCMCIA. Data
is transmitted to and from these devices under control of microprocessor 110. Currently, data bus 108 would operate as shown as 64-bits wide, employing microprocessors such as those suggested for the computer-disk-based video recorder of FIG. 3; as
higher-powered microprocessors become available, such as the PowerPC 620, the data bus may be widened to accommodate 128 bits, and the use of multiple parallel processors may be employed, with the anticipated goal of 1,000 MIPS per processor. Up to 256
MB of ROM 112 is anticipated to support the requisite software, and at least 1,024 MB of RAM 114 will allow for the sophisticated image manipulations, inter-frame interpolation, and intra-frame interpolation necessary for sophisticated production
effects, and for conversions between the various image formats.
A key aspect of the system is the versatility of the graphics processor shown generally as 116. Eventually, dedicated hardware will allow the best performance for such operations as image manipulations and re-scaling, but it is not a requirement
of the system that it assume these functions. Three separate sections are employed to process the three classifications of signals. Although the video input and output signals described herein below are shown, by example, as RGB, any alternative format
for video signals, such as Y/R-Y/B-Y, YIQ, YUV, or other alternatives may be employed as part of the preferred embodiment. One possible physical implementation would be to create a separate circuit board for each of the sections as described below, and
manufacture these boards so as to be compatible with existing or future PC-based electrical and physical interconnect standards.
A standard/widescreen video interface 120, intended to operate within the 1024.times.576 or 1024.times.768 image sizes, accepts digital RGB signals for processing and produces digital RGB outputs in these formats, as shown generally at 122.
Conventional internal circuitry comprising D/A converters and associated analog amplifiers are employed to convert the internal images to a second set of outputs, including analog RGB signals and composite video signals. These outputs may optionally be
supplied to either a conventional multi-scan computer video monitor or a conventional video monitor having input provisions for RGB signals (not shown). A third set of outputs supplies analog Y/C video signals. The graphics processor may be configured
to accept or output these signals in the standard NTSC, PAL, or SECAM formats, and may additionally be utilized in other formats as employed in medical imaging or other specialized applications, or for any desired format for computer graphics
applications. Conversion of these 24 frame-per-second images to the 30 fps (actually, 29.97 fps) NTSC and 25 fps PAL formats may be performed in a similar manner to that used for scanned film materials, that is to NTSC by using the conventional 3:2
"pull-down" field-sequence, or to PAL by running the images at the higher 25 fps rate. For other HDTV frame rates, aspect ratios, and line rates, intra-frame and inter-frame interpolation and image conversions may be performed by employing comparable
techniques well known in the art of computer graphics and television.
The management of 25 fps (PAL-type) output signals in a system configured for 24 fps production applications presents technical issues which must be addressed, however. Simple playback of signals to produce PAL output is not a serious problem,
since any stored video images may be replayed at any frame rate desired, and filmed material displayed at 25 fps is not objectionable. Indeed, this is the standard method for performing film-to-tape transfers used PAL- and SECAM-television countries.
However, it is not practical to produce both PAL and NTSC signals concurrently from a single source running at 24 fps. Simultaneous output of both NTSC and film-rate images is performed by exploiting the 3:2 field-interleaving approach:
5.times.24=2.times.60; that is, two film frames are spread over five video fields. This makes it possible to concurrently produce film images at 24 fps and video images at 30 fps. The difference between 30 fps and the exact 29.97 fps rate of NTSC may
be palliated by slightly modifying the system frame rate to 23.976 fps. This is not noticeable in normal film projection, and is an acceptable deviation from the normal film rate. However, if the systemframe rate is adjusted to 25 fps to produce PAL or
SECAM output, there is no convenient technique to produce 30 fps NTSC concurrently, unless multiple-frame storage with motion-interpolation is employed, which tends to create udesirable artifacts in the image produced. Commercial standards-converters
are available to perform this function, however, from companies such as Snell & Wilcox. This system is primarily directed towards production of video-based film and high-definition TV images, for which 24 fps and 30 fps, respectively, are the
established frame rate for film and the proposed frame rate for HDTV (in NTSC-countrie | | |