WikiPatents - Community Patent Review
Create Free Account  |  License or Sell Your Patent  |  WikiPatents Marketplace  |  WikiPatents Blog
Username:  Password:  
    
Advanced Search
System for constructing a partitioned queue of DMA data transfer requests for movements of data between a host processor and a digital signal processor    
United States Patent5404522   
Link to this pagehttp://www.wikipatents.com/5404522.html
Inventor(s)Carmon; Donald E. (Durham, NC); Crouse; William G. (Raleigh, NC); Ware; Malcolm S. (Raleigh, NC)
AbstractA multi-media user task (host) computer is interfaced to a high speed DSP which provides support functions to the host computer via an interprocessor DMA bus master and controller. Support of multiple dynamic hard real-time signal processing task requirements are met by posting signal processor support task requests from the host processor through the interprocessor DMA controller to the signal processor and its operating system. The signal processor builds data transfer packet request execution lists in a partitioned queue in its own memory and executes internal signal processor tasks invoked by users at the host system by extracting signal sample data from incoming data packets presented by the interprocessor DMA controller in response to its execution of the DMA packet transfer request queues built by the signal processor in the partitioned queue. Processed signal values etc. are extracted from signal processor memory by the DMA interprocessor controller executing the partitioned packet request lists and delivered to the host processor. A very large number of packet transfers in support of numerous user tasks and implementing a very large number of DMA channels is thus made possible while avoiding the need for arbitration between the channels on the part of the signal processor or the host processor.
   














 Title Information Submit all comments and votes
 
Patent Text Patent PDF Print Page Summary File History
Plain text PDF images Print Summary File History
Drawing from US Patent 5404522
System for constructing a partitioned queue of DMA data transfer

     requests for movements of data between a host processor and a digital

     signal processor - US Patent 5404522 Drawing
System for constructing a partitioned queue of DMA data transfer requests for movements of data between a host processor and a digital signal processor
Inventor     Carmon; Donald E. (Durham, NC); Crouse; William G. (Raleigh, NC); Ware; Malcolm S. (Raleigh, NC)
Owner/Assignee     International Business Machines Corporation (Armonk, NY)
Patent assignment
All assignments
Publication Date     April 4, 1995
Application Number     08/143,406
PAIR File History     Application Data   Transaction History
Image File Wrapper   Patent Term   Fees
Litigation
Filing Date     October 26, 1993
US Classification     718/107 710/22
Int'l Classification     G06F 013/00 G06F 015/16
Examiner     Chan; Eddie P.
Assistant Examiner    
Attorney/Law Firm     Duffield; Edward H. Phillips; Steven B. ,
Address
Parent Case     CROSS REFERENCE TO RELATED APPLICATION This application is a division of application Ser. No. 07/761,534, filed Sep. 18, 1991, now abandoned.
Priority Data    
USPTO Field of Search     395/200 395/250 395/425 395/375 395/800
Patent Tags     constructing partitioned queue dma data transfer requests movements data between host processor digital signal processor
   
Enter a comma (,) or semicolon (;) between multiple tag words/phrases.
Describe this patent:
 Amusing   
 Clever   
 Complex   
 Efficient   
 Historic   
 Important   
 Innovative   
 Interesting   
 Practical   
 Simple   
[no votes]
Patent WIKI

Share information and news about this patent, including information and news about the technology, inventors, company, ligation and licensing.

 References Submit all comments and votes
 
*references marked with an asterisk below are user-added references
 U.S. References
 
Add a new US reference:  
ReferenceRelevancyCommentsReferenceRelevancyComments
5247626
Firoozmand
709/212
Sep,1993

[0 after 0 votes]
5226041
Waclawsky
370/236
Jul,1993

[0 after 0 votes]
5195092
Wilson
725/146
Mar,1993

[0 after 0 votes]
5165021
Wu
709/234
Nov,1992

[0 after 0 votes]
5003465
Chisholm
710/25
Mar,1991

[0 after 0 votes]
4930069
Batra
709/212
May,1990

[0 after 0 votes]
 Foreign References
 Other References
 Market Review Submit all comments and votes
   
Market Size
Estimate the gross annual revenues of the relevant market sector:
> $10B
$5B - $10B
$2B - $5B
$500M - $2B
$100M - $500M
$10M - $100M
$1M - $10M
$500K - $1M
$100K - $500K
< $100K
[No votes]
$0
 
$0   $2.5B   $5B   $7.5B   $10B
Market Share
Estimate the percentage of the relevant market sector this invention will capture:
75% - 100%
50% - 74.99%
25% - 49.99%
10 - 24.99%
5 - 9.99%
2 - 4.99%
1 - 1.99%
< 1%
[No votes]
0.0%
 
0%   25%   50%   75%   100%
Reasonable Royalty
What percentage of gross sales should the inventor or assignee be paid?
75% - 100%
50% - 74.99%
25% - 49.99%
10 - 24.99%
5 - 9.99%
2 - 4.99%
1 - 1.99%
< 1%
[No votes]
0.0%
 
0%   25%   50%   75%   100%
Public's "Guesstimation" of Royalty Value
Market SizeN/A[No votes]
xMarket ShareN/A[No votes]
xReasonable RoyaltyN/A[No votes]

N/A

License Availablity
If you are NOT the owner or assignee, answer here:
Yes, license is available for purchase

No, license is not currently available



[No votes]
License Availablity
If you ARE the owner or assignee, answer here:
Yes, license is available for purchase

No, license is not currently available



[No votes]
Competitive Advantage
Does this invention have a significant competitive advantage over similar technologies?
Yes

No



[No votes]
Most helpful competitive advantage comment
[No comments]

Commercial Alternatives
Are there viable commercial alternatives for this invention?
Yes

No



[No votes]
Most helpful commercial alternative comment
[No comments]

 Technical Review Submit all comments and votes
 Claims Submit all comments and votes
 


What is claimed is:

1. A multi-media digital signal processor subsystem, comprising: a digital signal processor, DSP, for executing digital signal processor programs;

means in said DSP subsystem for analyzing incoming requests for tasks from a host processor by constructing a partitioned queue of direct memory access (DMA) data transfer requests for movement of data between a system memory and said DSP;

means for inserting pacing markers in said queue at regularly occurring time intervals, said markers for generating a start signal to be sent by an operating system in said DSP to a DMA/IO interprocessor controller to initiate DMA data transfers; and

said DMA/IQ interprocessor controller for controlling the movement of data between said DSP and said system memory by reading from said system memory, in response to said start signal, a partition of data from an address where it last encountered a wait packet from the transfer request built in the partition queue.

2. A multi-media digital signal processor subsystem as described in claim 21 wherein:

said DMA/IO interprocessor controller executes said DMA data transfers in accordance with said data transfer requests in a partition of said partitioned queue during the time interval in which said DSP is constructing another partition in said queue.

3. In a multi-media digital signal processor subsystem having a digital signal processor, DSP, for executing signal processing programs and a DMA/IO interprocessor controller for direct memory access input and output, a method of operation including steps at said DSP of:

analyzing incoming requests for tasks from a host processor by constructing a partitioned queue of direct memory access(DMA) data transfer requests for movement of data between a system memory and said DSP;

inserting pacing markers in said queue at regularly occurring time intervals, said markers for generating a start signal to be sent by an operating system in said DSP to said DMA/IO interprocessor controller to initiate DMA data transfers; and reading from said system memory, in order to control the movement of data between said DSP and said system memory, in response to said start signal, a partition of data from an address where a wait packet from the transfer request built in the partition queue was last encountered.

4. A method as described in claim 3, further comprising the step of:

constructing further partitions in said queue during a time interval while DMA data transfers are being executed in accordance with data transfer requests in another partition of said queue.

5. A multi-media task computer system for executing user task programs which place signal processing demands on said system in supporting of requested user tasks, the system comprising:

a first digital processor, DP, for executing user task programs;

a second digital processor, DSP, for executing digital signal processing programs to support said user task programs;

a host system bus for moving data between said DP and said DSP, said host system bus connected to said DP and to a subsystem including said DSP, said subsystem further including:

means in said DSP subsystem for analyzing incoming requests for tasks by constructing a partitioned queue of direct memory access (DMA) data transfer requests for movement of data between a system memory and said DSP;

means for inserting pacing markers in said queue at regularly occurring time intervals, said markers for generating a start signal to be sent by an operating system in said DSP to a DMA/IO interprocessor controller to initiate DMA data transfers; and

said DMA/IO interprocessor controller for controlling the movement of data between said DSP and said system memory by reading from said system memory, in response to said start signal, a partition of data from an address where it last encountered a wait packet from the transfer request built into the partition queue.

6. A multi-media task computer system as described in claim 5 wherein:

said DMA/IO interprocessor controller executes said DMA data transfers in accordance with said data transfer requests in a partition of said partitioned queue during the time interval in which said DSP is constructing another partition in said queue.
 Description Submit all comments and votes
 


FIELD OF THE INVENTION

This invention relates to signal processor computer systems in general and to direct memory access control; more specifically, it relates to a dynamic, hard real-time, multi-task signal processing demands commonly encountered in multi-media computer systems.

PRIOR ART

Signal processors are well known components in numerous computer systems presently available. Specialized Digital Signal Processors (DSPs) are commercially available from a variety of manufacturers and are utilized for the high speed, iterative execution of algorithms employed to provide digital signal filtering, speech recognition or speech synthesis, servomechanism control, encoded speech generation, compact disk hi fi sampled sound and music generation, modem data modulating and demodulating functions, facsimile data transmission encoding and decoding functions, color and monochrome image data compression and display functions, motion video processing functions and numerous data protocol conversion or encoding, error correcting or similar functions. In fact, the suggested lists of potential digital signal processor applications for high speed, repetitive execution of such algorithms as Fourier transforms, etc. on a high speed stream of digital analog signal samples are widespread in the industry.

A particularly advantageous signal processor architecture is shown in U.S. Pat. No. 4,794,517 assigned to the assignee of this present application; reference may be had thereto for an understanding of how digital signal processors may be built and utilized. However, it is evident to those of skill in the art that the aforementioned signal processor architecture is but one of many competing ones available in the marketplace. Additionally, the uses for digital signal processors are expanding at almost geometric rates, particularly in the growing field of the so called multi-media computer systems. In such systems a user may simultaneously wish to execute numerous functions such as speech encoding for transmission, motion video, modem transmission and reception and perhaps background CD music reproduction to name but a few audiovisual or multi-media applications. These applications may be run on a typical host system such as an IBM Personal System/2 computer or any of a variety of similar available multi-tasking computer systems commonly sold today.

In such multi-media systems, signal processing tasks are usually offloaded (via DMA) to the specialized, high speed digital signal processor (DSP). However, as speed and memory capacity of processors increase, the DSP may take on execution of the user tasks themselves as will become evident later. Indeed, in such a system, DMA function itself could be handled by the DSP. If only a few channels of DMA access are required, a typical DMA controller may allow the signal processor and the host processor to service one or several hardware devices. However, where the host processor is a multi-tasking one and a large number of I/O devices exist, the provision of only a few DMA channels between the host multi-tasking processor and the supporting digital signal processor may become a bottleneck which is insurmountable when the DMA facility is simultaneously shared by all operating I/O devices and various threads of task execution which are running.

Consider a typical multi-media environment that may have numerous high fidelity audio signal samples being processed utilizing independent host memory tables of audio samples and multiple memory control tables and energy and pitch envelopes stored in memory, and one encounters a system with a need for providing more than one hundred separate channels of DMA access in a time period less than a millisecond. With 16 hi fi stereophonic audio signal channels, each audio channel requires transfer of 88,200 sample bytes per second. In such a short time, one may consider that 100 channels of DMA are supporting over 100,000 block transactions per second, with each block having its own unique source and destination address and block transfer size. Using a conventional DMA access device, a system processor such as the digital signal processor would have to be interrupted for a new data transfer to support the required operations on the average of about every 10 microseconds. Several machine cycles would be required to support each transfer and the system would soon become inoperative since no processor resource would remain capable of executing the actual signal processing tasks in addition to those required to control the DMA accesses.

In addition to typical audio applications, there may be other host tasks under execution that require communication between the digital signal processor and the host processor while the audio signals are being "played". An example might be a facsimile modem moving image data to or from the host computer's system memory in conjunction with the audio music, speech or background being "played". In addition a speech recognition task may be running which requires the moving of speech templates to and from the host system memory to the signal processor's memory for matching purposes and then writing back speech recognition tokens as they are recognized. A speech synthesis function may also be operating and is retrieving phoneme data from the system memory bank while a computer-aided display application may be running in which the signal processor is required to rotate a three dimensional object found in an image screen buffer which requires numerous complex iterative calculations. The point being made is that a huge number of effective DMA channels may be required in a complex multiprocessor and multi-tasking environment supported by a signal processor.

Multiple DMA controller chip devices are known such as that shown in U.S. Pat. No. 4,831,523. Such devices operate as peripheral device controllers and are designed to connect a fixed number, four in the case of the presently cited reference, of physical peripheral devices to a system bus. Physical devices are not the equivalent to multi-tasking processes that require hard, real-time, processed signal samples in order to carry out the task processes that a user desires. Four physical peripheral devices show a limit of approximately eight logical DMA channels and, in the cited reference, devices are not serviced within any fixed amount of time since the devices are serviced in a round robin fashion and one device may take an arbitrarily long time to complete its work, thus removing the ability for any other real-time device to complete its work within a fixed period of time.

A direct memory access channel sharing mechanism is also shown in the IBM Technical Disclosure Bulletin, Vol. 30, No. 7 published December, 1987, pages 369 and 370. However, the mechanism shown requires I/O device hardware that can be dynamically reassigned. It appears that this mechanism is a software one which dynamically assigns a small number of hardware DMA channels to requesting external hardware devices which permits the sharing of the DMA channels. There is no deterministic information on the size, number or time period in which grant of service for any requests may be made and no details are given as to how the hardware device may make such a request. The system shown is not real-time in nature, i.e. requesting tasks whose signal samples are to be processed and delivered do not have to be serviced within precise and repetitive time increments such as a CD music reproduction system with 88,200 bytes of information per second to be transferred, processed and the processed signals retransferred back for usage by the requesting task.

U.S. Pat. No. 4,807,121 shows a peripheral interface system having an input/output processor .connected to up to four multiplexing units with each such unit providing an interface for up to four controllers. The I/O processor has a DMA channel that receives multiplexed serial data from the multiplexers. Data is transferred between the I/O processor and any controller unit by filling the storage area in a buffer from local memory of the I/O processor in a serial fashion over a DMA channel. Only a single channel is provided and the multiplexing scheme allows it to be utilized. Data parcels are transferred from the controller to the multiplexer on a time slot basis and from storage of the multiplexer to the memory in a serial fashion. However, there is no indication that this system has any means of supplying the dynamic, hard real-time requirement that would be presented by application tasks of the sort as alluded to above running simultaneously.

OBJECTS OF THE INVENTION

in light of the foregoing difficulties in the known prior art, it is an object of this invention to provide an efficient multi-media computer system and data transfer mechanism to support hard real-time multi-tasking operations in a host processor.

BRIEF SUMMARY OF THE INVENTION

A solution to the foregoing problems is provided in the preferred embodiment of the invention by presenting task requests to the signal processor from the host processor, analyzing incoming requests for tasks, and building a list of packet transfer requests in a partitioned queue in memory, accessing the partitioned queue with an interprocessor DMA controller and moving the necessary data signal samples in or out of the signal processor (DSP) via the DMA mechanism within a fixed minimum prescribed time period. Data transfer packet request lists are made up by the digital signal processor (DSP) in the form of DMA control packets for this embodiment.

Each packet list contains several words of control information and the source and destination address for the movement of the data samples. The operating system of the DSP constructs the packet request lists for transferring data by appending packet requests one by one to the current packet lists contents. The pacing of the execution of the packet list by the DMA control machine is provided for by placing a wait packet or "end of list" marker at the end of each partition in the packet list. Thus, a partitioned queue of DMA data transfer requests with markers inserted at regularly occurring time intervals is constructed within memory.

BRIEF DESCRIPTION OF THE DRAWINGS

The foregoing and still other objects of the invention are met in a preferred embodiment which is further described and illustrated in the drawings in which:

FIG. 1 illustrates a preferred embodiment in schematic form with the operational flow of data to and from the host processor and the digital signal processor via the interprocessor DMA controller which interfaces to the requisite system address and data buses in both the host system and the digital signal processor system.

FIG. 2 illustrates schematically the interconnection between a digital signal processor system and a host computer system via the interprocessor DMA bus master and controller.

FIG. 3 illustrates schematically the flow of data in a programmable form of the interprocessor DMA I/O bus master, controller and arbiter according to the invention.

FIG. 4 illustrates in some detail the format and content of the DMA packet requests built by the digital signal processor and what the encoding of these requests may signify.

FIG. 5 shows the arrangement of FIGS. 5A and 5B which illustrate the flow of data in and out of the packet buffer and registers for data flow and control within the interprocessor DMA controller and arbiter.

FIG. 6 illustrates the schematic flow at the host processor which builds digital signal processing task work lists or requests and manages them not to exceed the available signal processing resource of the digital signal processor.

FIG. 7 shows the arrangement of FIGS. 7A and 7B which illustrate the schematic flow of operation in the interprocessor DMA controller and arbiter as it processes DMA packet request lists built in the digital signal processor.

FIG. 8 illustrates a portion of the DMA handier hardware for a preferred embodiment of the invention.

FIG. 9 shows the arrangement of FIGS. 9A and 9B which illustrate the DMA transfer process executed by the DMA interprocessor arbiter and controller.

FIG. 10 illustrates a high-level flow chart of the packet list construction process in the DSP.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT OF THE INVENTION

A full description of the preferred embodiment of the invention will be given with respect to the illustrations thereof shortly. At the outset, however, it is deemed desirable to illustrate the magnitude of the problem encountered and to further illustrate the efficiency and improvement provided by utilizing the system and DMA packet controller mechanism of the present invention.

The management of data flow to and from a Digital Signal Processor Subsystem (DSP) could, as noted above, pose potentially tremendous loads on the DSP. As an example, one may consider a commercially current computer such as the Motorola 68030 which is a commonly available processor forming the core of many PC's. A typical DSP might be the Motorola 56000 PC processor which may be implemented on the planar board of the processor. There is a DMA path between the 56000 and the 68030; however, the DMA hardware only relieves the host processor 68030 and does nothing to improve the functioning of the DSP 56000. For each word which must be moved between the 68030 and the 56000, there is a brief interruption to the 68030. During the interruption, the bus of the host processor is granted to the DMA device which then moves a word of data or signal sample. However, once the word is acquired by the DSP, the DSP must actually be interrupted. The DSP must stop what it is then doing and move the data word into its memory via an interrupt handler.

An estimate of the number of DSP processor cycles to accomplish this is about 10. There is approximately a one cycle loss due to the three-phase pipeline in the 56000; two to three more processor cycles are required to save the contents of a few registers, several more cycles of processor are required to establish an index pointer to the DSP memory and to the modulo counter index control. Finally, several additional processor cycles are required to restore the DSP machine state and return to the task that was interrupted. This all amounts to approximately 10 processor cycles required to move only a single word of data or sample to or from the DSP. This DSP is of the type noted and referred to earlier as illustrated in U.S. Pat. No. 4,794,517 which is also a three-phase pipelined DSP architecture.

The present invention provides an improved multi-media system using a DMA packet machine, i.e. a programmed mechanism for providing the data transfer in a more efficient way. DMA transfers are initiated by software requests made by the DSP, not by the DMA mechanism. The DMA mechanism, which is the controller and arbiter, gets guaranteed service from the DSP within time windows that repeat and have a length according to the most demanding task to be supported in the host processor's menu of multiple tasks. For example, a time window of 726 microseconds is capable of meeting 32 bit (full word) samples necessary to support the typical 88,200 digital stereo audio samples for playback on the average of every 726 microseconds. At the typical speed of operation of the DSP, over 100 such 32 bit packets could be processed within the 726 millisecond time interval. This would provide capacity for more than 100 unique transactions or communication channels for data flow between tasks in the multiprocessing host system and operations performed in the DSP subsystem.

When the DMA controller reads a packet transfer request placed in a partitioned queue by the service-requesting DSP task as it executes, the DMA arbiter and controller will arbitrate for the host system bus, e.g. the IBM PS/2's microchannel or equivalent host bus and, when access to the bus is granted by the host system to the DMA controller and arbiter for the request placed by the DSP, up to 16 bytes (dependent on buffer size) of data may be moved to a first in first out (FIFO) register buffer in the DMA controller. Once the bytes are in FIFO, the DMA arbiter and controller will arbitrate for the local DSP databus and once granted, will move one 16-bit word and then drop the bus request. This word will then be moved to the DSP's memory and the DSP processor will be halted for one bus cycle. Thus, for each word moved, the process will consume one cycle in the DSP. The DMA controller and arbiter will continue to arbitrate for the local DSP bus until the entire contents of the FIFO within the DMA controller has been emptied. The DMA controller will then make a new request to the host system buses for more data to be moved.

It will be noted that the process for arbitrating for either the system bus or the DSP bus is most efficiently conducted by dedicated programmed hardware and processes such as those implemented in a DMA arbiter and controller. The packet list processing capability of the DMA controller allows efficient utilization of both a host system bus and the DSP bus. Recalling the prior art design briefly described above, it may be seen that a savings of nine out of ten average DSP cycles will be realized with this design.

For example, returning to the aforementioned stereo hi fi signal processing task required to support stereophonic CD music, one must play out the standard 88,200 16-bit digital samples every second. This requires the transfer of 88,200 words of data every second between the host system processor which will be reading the CD disk, transferring the read samples to the DSP, receiving processed audio samples from the DSP and reconstructing them in analog integrated sound segments at the rate of 88,200 per second. In the prior art machine and system noted above, this operation alone would require 88,200 words per second times 10 cycles per word or 882,000 cycles per second of DSP cycle capability to be exercised. In the preferred embodiment of the present invention, only 88,200 DSP cycles will be required, or about one tenth of those utilized in the prior art.

Turning to an example of a DMA arbiter controller and list processor system in use, let us suppose that a speech recognition task and a stereo hi fi CD audio playback task are running simultaneously, as selected at the host processor by a user, and which require the facilities of a DSP to support them. The speech recognition task will need approximately two DMA channels for operation, one incoming and one outgoing, each with a peak rate of approximately 256 words per channel. The CD music task will require one DMA channel with a peak rate of about 64 words for its channel. Each time the CD audio task runs and the signal processing tasks on the samples are run in the DSP, the DSP will need to move 64 more words of data from the host system processor memory to the DSP memory. To do this, the DSP will call its operating service routine for a DMA request each time it runs. On each call, it will provide the next system processor address to be accessed and the next internal memory address and beginning count where the 64 words to be fetched may be stored. Each time the speech recognition task runs in the DSP, it will call the DSP's DMA request service routine twice. Each request will have a unique system address at the host system from which information is to be withdrawn or to which it is to be delivered and a unique DSP memory address where the samples are to be stored or from which they are to be fetched. The DSP will download two recognition templates each time its iterative task runs in the DSP. Each template will be approximately 256 words in length.

The three DMA requests represented by the single request for the CD task and the two requests for the speech recognition task will be placed in a segmented queue by the DSP as they occur. As an example, the speech recognition task my have begun and placed its first request and then have been interrupted while the CD task in the DSP took control and placed its DMA request. After the CD task request was done, the speech recognition task may be restarted and it will place its second DMA request. If only these three requests were placed during one 726 microsecond interval during which a partitioned queue is built by the DSP, they will not be serviced by the DMA controller during the time that they are being placed in the queue. At the next interrupt of a 1,378 hz clock (1/726 microseconds) the next group of DMA request packets that were placed in the previous 726 microseconds will be accessed and processed by the DMA controller and arbiter.

In the present example, the DMA controller arbiter will first find the packet request that asks for 256 words to be moved for the speech recognition task. It will load up the indicated control words from the packet request to see how many words to move and will load the modulo addressing control boundaries. It will then read the host system memory address and the DSP memory address and arbitrate for the host memory bus. Once a grant to the host memory bus is received, the DMA controller and processor will burst a number of bytes (16 bytes in this embodiment) of data from the host system processor memory to its own internal buffer. It will then arbitrate for the DSP bus and each time it receives a grant, it will write another word into the DSP memory according to the DSP beginning memory address which it read from the DMA packet control request. After all 16 bytes have been written, it will go back to arbitrate for the host system memory bus again. The process will continue until all 256 words have been moved. Then the DMA machine will read the next packet in the DMA packet transfer request list. According to this example, this will be the CD task packet. For this packet, 64 words are to be moved utilizing the same operation as previously described. When all 64 words have been moved, the-DMA arbiter and controller will access the next packet transfer request which will be the second packet for the speech recognition task and, after processing it in the same fashion, will see no more packets and will enter a "wait state" until the 726 microsecond tame window has elapsed. If more DMA packet requests have been arriving by being placed in the partitioned queue built by the DSP during the current 726 microsecond window, they will not be serviced until the next time window.

Turning to FIG. 1, the overall operational flow of this type of process may be briefly envisioned beginning in Box 1 of FIG. 1, the user at the host PC starts the operations by invoking or selecting application program requests for execution at the PC which will necessarily involve digital signal processing tasks for their support. Examples might be high fidelity audio playback, speech recognition, modem data transfer and facsimile data functions, motion video, speech synthesis or any of a variety of applications from the multi-media environment that are well understood to those of skill in this art.

The host PC will request DSP tasks to be executed by transmitting identification of the tasks requested to the operating system of the DSP. This occurs in Box 2 of FIG. 1.

In Box 3, the DSP operating system builds partitioned packet lists from any active task requests, placing an end to the partition of requests at repetitive time intervals. FIG. 10 shows the high-level process flow for the DSP's packet list building operations. In one preferred embodiment as discussed above, these repetitive "End of List" (E.O.L.) time intervals occur at every 726 microseconds.

The DSP will pace or clock an interprocessor DMA controller as shown in Box 4 at the clock rate of every 726 microseconds and will continue building partitioned packet request lists in order to carry out any active tasks which may be running in the DSP or which are requested by new user requests coming from the PC as shown in Box 11.

In Box 5, the interprocessor DMA controller will receive the pacing clock signal beginning a 726 microsecond time interval from the DSP as shown. In Box 6, the interprocessor DMA controller begins reading the packet list from a partition of the DMA request packet list built by the DSP. The DMA controller will arbitrate for the system or DSP data or memory buses as appropriate for the requested packet transfer as shown in Box 7, will receive the bus grant for access to the appropriate bus as shown in Box 8 and will then transfer a number of packets to or from the DMA buffer in Box 9 and rearbitrate for access to the system or DSP buses as shown by the linkage between Box 9 and Box 7 until all transfers are complete as shown in Box 10.

As is apparent from the foregoing brief description of flow with respect to FIG. 1, some management by the host system (or by the DSP if it has sufficient capacity, or by an auxiliary processor if desired) must be exercised so that DSP task requests can all be processed within the minimum time interval of, for example, 726 microseconds, or suffer the consequence that any further requesting user task may not have its needs fulfilled in hard real-time. To accommodate this requirement, a DSP resource management and allocation scheme is implemented in the preferred embodiment here, in the host processor. The management and allocation function is illustrated schematically in FIG. 6 and could be practiced by the DSP or auxiliary processor if desired, and is described as follows.

The resource management and allocation function keeps track of the total load that will be presented to the DSP by any user invoked tasks. The load is measured in terms of the total available DMA byte transfer bandwidth, the length of the packet list and the available DSP resource power or speed as measured in DSP execution cycles in millions of instructions per second (MIPS). The management and allocation function assures that sufficient signal processor resource will be available each 726 microsecond interval for all of the requested DSP tasks in order to guarantee that each DSP task's real-time DMA requirements may be met.

The total available resource is a function of the particular system implementation, i.e. the speed of the DSP in MIPS, the bandwidth transfer capability of the DMA mechanism and the length of a partition in a packet transfer request list to be built by the DSP. While these may all be variable according to implementation, once implemented they will be constant for the given system. The DMA bandwidth is bounded, i.e. constrained by the lesser of either the host processor's bus bandwidth, the DSP's instruction cycle time or the DMA mechanism's hardware bandwidth. The packet list size is bounded, as mentioned earlier, by the amount of the available DSP data storage and the DSP instruction clock speed.

In order to implement the resource management and allocation function, each user task at the host system will be required to contain an indication or declaration of the total DSP task resource that will be required in terms of maximum DMA bandwidth, packet list length and DSP MIPS that will be consumed at a maximum by the invoked task. As DSP tasks are requested by the end users at the host system, the resource management and allocation function in FIG. 6 allocates the declared resource requirements to the requested DSP tasks in the DSP. As long as all of the DSP task resource requirements can be met within the constraints of the system, the DSP task will be loaded into the DSP. This is done by the host system placing a DSP task request to the operating system of the DSP. If sufficient resources are not available, the DSP task request placed by the user will be rejected and an appropriate user error message will be given.

In FIG. 6, operation is begun by the user requesting a task at his PC: for example, speech recognition. The speech recognition program in the host PC will be called up and it will contain appropriate parameters for the demands which it will place on bandwidth in terms of maximum words per DMA window time, the maximum DSP MIPS that it will require and the maximum DSP memory storage that it may invoke as shown in Box 1 of FIG. 6. The available maximum DSP resources are also known to the system as having been entered by the user or by the system interrogating hardware encoded registers (not shown ) present in the DMA controller and arbiter and in the DSP. This is shown in Box 2.

In Box 3 the sum of all user task requirements, i.e. the total DMA requirement, is formed by adding together the requirements for all presently active and any newly requested tasks. This is compared with the maximum total DMA word transfer capability and the total DSP resources in Box 3. If the word transfer total demand exceeds the available resource, the task is not loaded as shown in Box 4 and a return to the user selection of tasks in Box 5 is indicated. If the user task total DMA requirement is not exceeded, the system proceeds to Box 6 where the sum of all active user task total DSP instruction execution resources is compared with the maximum available DSP MIPS. If the maximum is exceeded, the new task will not be loaded as shown by the return to Box 4 and 5. If, however, the total DSP MIPS are not exceeded, the-system proceeds to Box 7 where the sum of all active user task total DSP storage requirements is made and compared with the maximum available DSP data storage size.

If the maximum is not exceeded, the DSP workload manager process (which could reside in the DSP or an auxiliary processor if desired) which performs the resource management and allocation in the PC host will proceed to Box 8 where it will load the task by signalling the operating system of the DSP to invoke the beginning of a new user selected task and then the routine ends in Box 9.

As mentioned previously, DMA data packet transfer request lists are built in partitioned form in memory of the DSP by the operating system of the DSP. As DSP task requests are brought to the DSP's operating system, it will form a list of DMA data packet transfer requests necessary to support the requested task execution. The format of the packet requests is shown in FIG. 4.

In FIG. 4, a five-word DMA packet request containing two control words which are stored in control registers 1 and 2, two words of host system memory address (being the lower address and the upper address) and one word of DSP memory address which represents the location where a number of words to be moved to or from begins. The encoding of the specific control words for the control registers is shown in FIG. 4. These control words are utilized by the interprocessor DMA arbiter and controller as will be described in greater detail later.

The operating system of the DSP builds the DMA packet transfer request lists in memory. The addressing is such that the memory operates as a circular buffer within the DSP. The packet request list is a partitioned list in that it contains a group of one or more individual DMA packet transfer requests and an ending "wait state" or "end of list" marker. The end of the list in DSP memory contains a pointer back to the beginning so that the "buffer" will be endlessly traversed. The available DSP memory for constructing the buffer must be large enough to contain at least two complete packet request lists at any instant of time. This is because the DMA controller hardware will be processing the contents of one request list while the operating system of the DSP is busy constructing the contents of the next partition of the list. The operating system of the DSP constructs the DMA packet request lists (as shown in FIG. 10) for transferring data by appending packet requests one by one to the current packet list contents as active tasks in the DSP place DMA requests either to fetch in new signal samples for processing or to transfer processed samples back to the requesting user task as appropriate.

A given task operating in the DSP requests a DMA packet transfer by first loading internally specified DSP registers with the desired source address, destination address and any control information that is necessary, and secondly by calling the operating system which appends this request to the list it is currently building in its partitioned lists. DSP tasks may make packet transfer requests at any time. Such requests may be asynchronous to the DMA arbiter and controller list execution which is conducted in the DMA control machine. The packet list execution by the DMA control machine is paced at precise intervals of time by the DSP's operating system which places a "wait packet" or "end of list marker" as a marker at the end of a partition in the packet list which it is currently building. In the preferred embodiment, these markers are pre-written in the queue in memory so that they occur at regular intervals as the queue is read by the DMA device. At precisely recurring times, the DSP signals the DMA controller to proceed with execution of the packet transfer request list which is next to be processed. The wait packet (E.O.L.) will serve as a means or a marker for stopping the DMA hardware when it has completed processing the current list. As tasks in the DSP continue making DMA packet data transfer requests, the operating system will begin filling in the next partition in the packet list. DSPs such as the Motorola 56000 referred to earlier are well known in the industry and the capability of their operating system to build such lists in memory is well understood by those of skill in the art and needs no further description hero.

Turning to FIG. 2, the overall physical layout and data flow of packets of information from a typical host system, such as the IBM PS/2 or the Motorola 68030 mentioned previously, over their respective host system data and address buses to an interprocessor DMA controller and arbiter, such as the Intel 82325 programmable micro-channel/DMA controller are shown. Interfaces from the interprocessor DMA arbiter and controller to a typical DSP such as that shown in IBM's U.S. Pat. No. 4,794,517 which is a three-phased pipelined DSP or the Motorola 56000 as previously described, are also shown. Packets of data are move to and from the host system memory to a buffer within the DMA arbiter and controller and to and from the buffer in the DMA controller over the DSP memory buses to the DSP memory for data or instructions.

A programmable interprocessor DMA/IO bus master controller and arbiter such as is commercially available in the form of the Intel 82325 chip set may be employed for these purposes. This may be referred to as the "bus master" hardware which performs the actual transfer of requested data packets between the host PC data store or memory and the DSP instruction or data store. The bus master controller may be divided into two major functional components: the packet list processor and the DMA transfer handier. The packet list processor, receives a "start" pacing si