|
|
|
| United States Patent | 4931950 |
| Link to this page | http://www.wikipatents.com/4931950.html |
| Inventor(s) | Isle; Brian A. (Isanti, MN);
Bloom; Charles P. (Eagan, MN);
Butler; Arch W. (Minneapolis, MN);
Spoor; David (Eden Prarie, MN);
Wunderlin; David J. (New Hope, MN);
Bedros; Renee (West St. Paul, MN) |
| Abstract | A multimedia interface presents information and receives user commands for
a computer system. The multimedia interface operates in parallel with
another application software module, such as an expert system. To add
multimedia features to the application software module, the module is
modified so as to generate multimedia commands at the same time as it
displays text on a text monitor. The multimedia commands, which are held
in a queue, provide additional information in the form of video images and
generated speech corresponding to the displayed text. In addition, the
multimedia commands are split into at least two sets: one set which is
dispatched to the user substantially immediately after displaying the
corresponding text, and one set which is dispatched only upon request by
the user. In the preferred embodiment, the multimedia interface presents
information to the user through text, graphics, video speech production,
and printed output. User inputs are made through a special-function keypad
and voice recognition. The preferred embodiment is a portable expert
system which fits in a single portable suitcase sized package. |
|
|
|
Title Information  |
|
|
|
|
|
Drawing from US Patent 4931950 |
|
|
Multimedia interface and method for computer system |
|
|
|
|
|
| Publication Date |
June 5, 1990 |
|
|
|
|
|
| Filing Date |
July 25, 1988 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Title Information  |
|
|
Claims  |
|
|
What is claimed is:
1. In a computer system, a multimedia interface comprising:
input means for entering user commands;
multimedia output means for presenting information to the user of said
system using a plurality of output media; said multimedia output means
including text displaying means for displaying text strings, and at least
one additional output media;
an application program coupled to said input means and multimedia output
means, said application program including means for generating output
strings, said output strings including text strings for immediate delivery
to a user of said computer system via said text display means, a
multiplicity of said output strings further including means for denoting
multimedia commands denoting information that can be presented by said
multimedia output means;
queue means for receiving and storing said multimedia commands; and
dispatch means coupled to said queue means for executing said multimedia
commands stored in said queue means by delivering said multimedia commands
to said multimedia output means and thereby presenting said information
denoted by said multimedia commands via said multimedia output means.
2. A computer system as set forth in claim 1,
said system including output processing means for processing output strings
generated by said application program by storing the multimedia commands,
if any, in said output strings in said queue means, delivering the text
strings, if any, in said output strings to said text displaying means, and
then activating said dispatch means to deliver said multimedia commands
stored in said queue means to said at least one additional output media.
3. A computer system as set forth in claim 1, said multimedia output means
including voice generating means for generating spoken messages;
said multimedia commands including a first set of voice output commands for
denoting a first set of voice messages, and a second set of voice output
command for denoting a second distinct set of voice messages;
said queue means including buffer means for storing specified voice
messages;
said dispatch means including means for delivering said first set of voice
commands to said voice generating means so as to speak said first set of
voice messages, and for storing said second set of voice commands in said
buffer means;
said system further including means, responsive to a predefined user
command entered through said input means, for delivering said voice
commands stored in said buffer means to said voice generating means so as
to speak said stored voice messages.
4. A computer system as set forth in claim 1,
said multimedia output means including a second output means distinct from
said text displaying means for communicating specified sets of
information;
said multimedia commands including first and second sets of output
commands, denoting first and second sets of information to be
communicating by said second output means;
said queue means including buffer means for storing specified sets of
information;
said dispatch means including means for delivering said first set of output
commands to said second output means so as to communicate said first set
of information, and for storing said sets of information denoted by second
set of output commands in said buffer means;
said system further including means, responsive to a predefined user
command entered through said input means, for delivering said sets of
information stored in said buffer means to said second output means so as
to communicate said stored sets of information.
5. An expert system as set forth in claim 1, said multimedia output means
including means, coupled to said input means, for suspending the
presentation of information to the user of said system when said input
means receives a user command.
6. An expert system as set forth in claim 5, said input means including
keyboard input means for receiving user commands and voice recognition
means for receiving spoken user commands.
7. An expert computer system, comprising:
input means for entering user commands;
text display means for displaying text messages;
multimedia output means for presenting information to the user of said
system using a plurality of output media in addition to said text display
means;
a knowledge base storing a specified set of information, including means
for denoting text strings that can be displayed on the text display means
and corresponding multimedia commands denoting additional information that
can be presented by said multimedia output means;
queue means for receiving and storing selected ones of said multimedia
commands;
inference software means, coupled to said input means, text display means,
knowledge base and queue means, for accessing selected portions of the
information in said knowledge base, responding to user commands from said
input means, sending text strings corresponding to selected portions of
said information to said text display means for display, and storing in
said queue means selected multimedia commands corresponding to said
selected portions of said information in said knowledge base; and
dispatch means coupled to said queue means for delivering said multimedia
commands stored in said queue means to said multimedia output means in
response to a predefined dispatch command;
wherein said inference software includes means for generating said
predefined dispatch command at predefined times after accessing selected
portions of the information in said knowledge base.
8. An expert system as set forth in claim 7, said inference software
including means for automatically generating said predefined dispatch
command after sending said text strings corresponding to said accessed
information to said text display means for display.
9. An expert system as set forth in claim 7,
said multimedia output means including voice generating means for
generating spoken messages;
said multimedia commands including a first set of voice output commands for
denoting a first set of voice messages, and a second set of voice output
command for denoting a distinct second set of voice messages;
said queue means including buffer means for storing specified voice
messages;
said dispatch means including means for delivering said first set of voice
commands to said voice generating means so as to speak said first set of
voice messages, and for storing said second set of voice commands in said
buffer means;
said system further including means, responsive to a predefined user
command entered through said input means, for delivering said voice
commands stored in said buffer means to said voice generating means so as
to speak said stored voice messages.
10. An expert system as set forth in claim 7,
said multimedia commands including first and second sets of output
commands, each denoting a distinct set of messages to be generated by said
multimedia output means;
said system including means for automatically delivering said first set of
output commands to said multimedia output means to present the information
denoted by said first set of output commands after said inference software
means sends text strings corresponding to selected portions of said
information in said knowledge base to said text display means for display;
and means responsive to a predefined user command entered through said
input means, for delivering said second set of output commands to said
multimedia output means to present the information denoted by said second
set of output commands
11. An expert system as set forth in claim 7, said multimedia output means
including means, coupled to said input means, for suspending the
presentation of information to the user of said system when said input
means receives a user command.
12. An expert system as set forth in claim 11, said input means including
keyboard input means for receiving user commands and voice recognition
means for receiving spoken user commands.
13. In an expert system having a knowledge base and inference software
which accesses selected portions of the knowledge base, input means for
user entry of commands, text display means for displaying text messages,
the knowledge base including means for storing a multiplicity of text
strings that can be displayed on the text display means, the inference
software including means for responding to user commands from the input
means and for displaying selected ones of said text strings stored in the
knowledge base; the combination comprising:
multimedia output means for presenting information to the user of said
system using a plurality of output media in addition to said text display
means;
means for storing multimedia commands in the knowledge base along with
corresponding ones of said text strings; said multimedia commands being
embedded in said corresponding text strings;
queue means for storing multimedia commands;
output parsing means for removing and storing in said queue means said
multimedia commands embedded in said text strings when said inference
software selects corresponding portions of said knowledge base; and
dispatch means coupled to said queue means for executing said multimedia
commands stored in said queue means by delivering said multimedia commands
to said multimedia output means and thereby presenting said information
denoted by said multimedia commands via said multimedia output means;
wherein said inference software includes means for generating a predefined
dispatch command at predefined times after accessing a selected portion of
the knowledge base.
14. An expert system as set forth in claim 13, said multimedia output means
including voice output means for converting a specified text string into
synthesized speech;
said means for storing multimedia commands in the knowledge base including
means for storing speech production commands in the knowledge base, said
speech production commands including text strings representing words to be
spoken when corresponding portions of the knowledge base are accessed by
said inference software.
15. An expert system as set forth in claim 13, said multimedia output means
including voice output means for converting a specified text string into
synthesized speech;
said multimedia commands including a first set of voice output commands for
denoting a first set of voice messages, and a second set of voice output
command for denoting a distinct second set of voice messages; said voice
messages comprising text strings representing words to be spoken by said
voice output means;
said queue means including buffer means for storing specified voice
messages;
said dispatch means including means for delivering said first set of voice
commands to said voice output means so as to speak said first set of voice
messages, and for storing said second set of voice commands in said buffer
means;
said system further including means, responsive to a predefined user
command entered through said input means, for delivering said voice
commands stored in said buffer means to said voice output means so as to
speak said stored voice messages.
16. A method of processing and presenting information in a computer system,
said computer system including text display means for displaying text
messages; the steps of the method comprising:
providing multimedia output means for presenting information to the user of
said system using a plurality of output media in addition to said text
display means;
running an application program on said computer system said application
program generating text strings to be displayed on said text display means
and corresponding multimedia commands denoting additional information that
can be presented by said multimedia output means;
displaying said text strings generated by said application program, and
storing said multimedia commands generated by said application program in
a queue; and
delivering said stored multimedia commands to said multimedia output means
after said text string displaying step.
17. The method set forth in claim 16,
said multimedia output means including voice generating means for
generating spoken messages;
said multimedia commands including a first set of voice output commands for
denoting a first set of voice messages, and a second set of voice output
command for denoting a second distinct set of voice messages;
said method including the steps of:
providing a voice message buffer for storing voice messages;
said method including the step of delivering said first set of voice
commands to said voice generating means so as to speak said first set of
voice messages, and for storing said second set of voice commands in said
voice message buffer;
receiving user commands;
responding to a predefined user command by delivering said voice commands
stored in said voice message buffer to said voice generating means so as
to speak said stored voice messages.
18. The method set forth in claim 17, said step of receiving user commands
including the step of suspending the operation of said voice generating
means when a user command is received, thereby interrupting the speaking
of said voice messages by said voice generating means.
19. A method of processing and presenting information in an expert computer
system, said expert computer system including text display means for
displaying text messages; the steps of the method comprising:
providing multimedia output means for presenting information to the user of
said system using a plurality of output media in addition to said text
display means;
providing a knowledge base storing a multiplicity of information records,
each information record including means for denoting text strings that can
be displayed on the text display means and corresponding multimedia
commands denoting additional information that can be presented by said
multimedia output means;
receiving user commands;
accessing selected information records in said knowledge base in response
to said user commands;
displaying said text strings denoted by said accessed information records
on said text display means, and storing said multimedia commands denoted
by said accessed information records in a queue; and
delivering said stored multimedia commands to said multimedia output means
after said text string displaying step.
20. A method of processing and presenting information in an expert computer
system, said expert computer system including text display means for
displaying text messages; the steps of the method comprising:
providing multimedia output means, including video display means for
displaying video images and voice generating means for generating spoken
messages;
providing a knowledge base storing a specified set of information including
text strings that can be displayed on the text display means, and
corresponding multimedia commands denoting video images and voice messages
that can be presented by said video display means and said voice
generating means;
receiving user commands;
accessing selected portions of the information in said knowledge base in
response to said user commands;
displaying on said display means text strings corresponding to said
selected portions of the information in said knowledge base, and queuing
corresponding ones of said multimedia commands denoting video images and
voice messages stored in said knowledge base; said multimedia commands
including distinct first and second sets of commands;
delivering said first set of queued multimedia commands to said multimedia
output means after displaying corresponding text strings; and
delivering said second set of queued multimedia commands to said multimedia
output means when a corresponding user command is received
21. A method of processing and presenting information in an expert computer
system, said expert computer system including text display means for
displaying test messages; the steps of the method comprising:
providing multimedia output means, including video display means for
displaying video images and voice generating means for generating spoken
messages;
providing a knowledge base storing a specified set of information related
to a predefined set of tasks to be performed on a predefined set of
equipment; said stored information including text strings, corresponding
video images showing said set of equipment, and corresponding voice
messages;
receiving user commands;
accessing selected portions of the information in said knowledge base in
response to said user commands, including selected text strings, and
corresponding video images and voice messages; and
then displaying said selected text strings on said text display means,
displaying corresponding video images on said video image means, and
generating at least a portion of said corresponding selected voice
messages with said voice generating means.
22. The method set forth in claim 21,
said corresponding selected voice messages including first and second sets
of said voice messages;
said generating voice messages step including the step of generating said
first set of corresponding selected voice messages with said voice
generating means;
said method further including the step of generating said second set of
corresponding selected voice messages with said voice generating means
when a predefined user command is received.
23. A method of processing and presenting information in an expert computer
system, said expert computer system including text display means for
displaying text messages; the steps of the method comprising:
providing multimedia output means, including video display means for
displaying video images and voice generating means for generating spoken
messages;
providing a knowledge base storing a specified set of information including
text strings that can be displayed on the text display means, and
corresponding multimedia commands denoting video images and voice messages
that can be presented by said video display means and said voice
generating means;
said knowledge base storing information related to a predefined set of
tasks to be performed on a predefined set of pieces of equipment; said
video images denoted by multimedia commands including a multiplicity of
images showing said set of pieces of equipment, including images showing
said pieces of equipment in the context of said predefined set of tasks;
said information stored by said knowledge base including text strings and
voice messages related to said predefined set of tasks;
receiving user commands;
accessing selected portions of the information in said knowledge base in
response to said user commands;
displaying on said display means text strings corresponding to said
selected portions of the information in said knowledge base, and queuing
corresponding multimedia commands denoting video images and voice messages
stored in said knowledge base; said multimedia commands including distinct
first and second sets of commands;
dispatching said first set of queued multimedia commands to said video
display means and voice generating means at predefined times after
displaying corresponding text strings; said dispatching step causing said
expert system to simultaneously display text strings and video images
related to selected ones of said predefined set of tasks;
and dispatching said second set of queued multimedia commands to said video
display means and voice generating means when a corresponding user command
is received; said second dispatching step causing said expert system to
provide additional explanatory material related to selected ones of said
predefined set of tasks.
24. A method of processing and presenting information in an computer
system, said computer system including text display means for displaying
text messages; the steps of the method comprising:
providing multimedia output means, including video display means for
displaying video images and voice generating means for generating spoken
messages;
providing an application program which generates output strings, said
output strings including text strings for display on said text display
means, a multiplicity of said output strings including multimedia commands
denoting video images and voice messages that can be presented by said
video display means and said voice generating means; said multimedia
commands including distinct first and second sets of commands;
receiving user commands;
displaying said text strings on said text display means;
queuing said multimedia commands generated by said application program; and
dispatching said first set of queued multimedia commands to said multimedia
output means after displaying corresponding text strings; and
dispatching said second set of queued multimedia commands to said
multimedia output means when a corresponding user command is received.
25. In a computer system, the combination comprising:
selecting means for selecting a piece of equipment from a predefined set of
pieces of equipment; said selecting means including voice recognition
means for receiving spoken user commands;
video means for storing and displaying digitized video images corresponding
to said selected piece of equipment, including video random access memory
means for storing a multiplicity of digitized video images, each said
stored image corresponding to one or more of said pieces of equipment and
display means coupled to said video random access memory means for
accessing and displaying at least one digitized video image stored in said
video random access memory corresponding to said selected piece of
equipment; and
output means including text displaying means for displaying text
corresponding to said displayed video images, and voice generating means
for generating spoken messages corresponding to said display video images;
whereby said computer system can be used to provide video and textual
displays as well as spoken messages when working with a selected piece of
equipment.
26. The computer system set forth in claim 25, wherein said video means
includes means for displaying a sequence of video images corresponding to
said selected piece of equipment.
27. The computer system set forth in claim 26, wherein said video means
includes means for displaying a sequence of video images showing the
location of said selected piece of equipment at a particular predefined
site.
28. The computer system set forth in claim 27, including task specifying
means for specifying an equipment maintenance task to be performed;
said video means including means for displaying a sequence of video images
related to said specified equipment maintenance task; and
said text means includes means for displaying text related to said
specified equipment maintenance task.
29. The computer system set forth in claim 28, including
portable housing which houses said computer system; said portable housing
enabling said computer system to be taken to and used at the place where
said selected piece of equipment is located.
30. The computer system set forth in claim 25, wherein said video means
includes
digitized plant layout images showing the locations of at least a
multiplicity of said predefined set of pieces of equipment at a particular
predefined site stored in said video random access memory means; and
video selecting means coupled to said video random access memory means and
said video display means for displaying selected ones of said plant layout
images corresponding to said selected piece of equipment.
31. The computer system set forth in claim 25, including
portable housing which houses said computer system; said portable housing
enabling said computer system to be taken to and used at the place where
said selected piece of equipment is located.
32. In a computer system, the combination comprising:
means for defining a predefined set of pieces of equipment and a predefined
set of maintenance tasks in conjunction with corresponding ones of said
pieces of equipment;
task specifying means for specifying one of said equipment maintenance
tasks to be performed in conjunction with corresponding ones of said
pieces of equipment; said task specifying means including voice
recognition means for receiving spoken user commands;
video means for storing and displaying one or more digitized video images
corresponding to said selected piece of equipment, including video random
access memory means for storing a multiplicity of digitized video images,
each said stored image corresponding to one or more of said pieces of
equipment, and display means for displaying a sequence of said stored
digitized video images related to said specified equipment maintenance
task; and
text means for simultaneously displaying text corresponding to said video
images, including means for displaying text related to said specified
equipment maintenance task;
whereby said computer system can be used to provide assistance in the form
of video sequences and textual displays to a person performing a specified
equipment maintenance task.
33. The computer system set forth in claim 32, including
portable housing which houses said computer system; said portable housing
enabling said computer system to be taken to and used at the place where
said selected piece of equipment is located.
34. In a computer system, a method of obtaining information regarding
various pieces of equipment, the steps of the method comprising:
storing a multiplicity of digitized video images, each said stored image
corresponding to one or more of said pieces of equipment from a predefined
set of pieces of equipment;
selecting a piece of equipment from said predefined set of pieces of
equipment; said selecting step including the step of receiving spoken user
commands;
displaying at least one digitized video image stored in said video random
access memory corresponding to said selected piece of equipment;
displaying text corresponding to said displayed video images, and
generating spoken messages corresponding to said displayed video images;
whereby a person using said computer system receives information in the
form of digitized video images, textual displays and spoken messages while
working with a selected piece of equipment.
35. The method obtaining information regarding various pieces of equipment
set forth in claim 34, wherein said displaying step includes displaying a
sequence of said digitized video image corresponding to said selected
piece of equipment. |
|
|
|
|
Claims  |
|
|
Description  |
|
|
The present invention relates to the human interface aspects of computer
systems, and particularly to systems and methods for adding video and
voice capabilities to expert computer systems.
BACKGROUND OF THE INVENTION
The effectiveness of many computer systems is a function not only of the
application software program which interacts with the user, but also a
function of how well the computer system communicates with its human
users. In terms of expert systems, effectiveness is a function not only of
the knowledge applied to its task domain, but also the delivery of that
knowledge to the end user. In spite of the recent strides that have been
made in expert system theory and development, no one has produced a truly
effective delivery vehicle for communicating with the users of these
systems.
Most expert systems are either implemented on a main frame computer and
accessed through a terminal, or they are ported to smaller personal
computers. In either case the human-computer dialogue passes through a
standard text screen or monitor in one direction, and through a standard
"QWERTY" keyboard in the other direction. This keyboard/monitor interface
is not an interface that optimizes system performance.
In a large number of situations, the standard keyboard/monitor mode of
communication is inadequate or not suitable. For instance, in situations
where the user's hands are busy, keyboard input is clearly difficult. When
the user must visually concentrate on a task, it may be dangerous for the
user to avert his eyes so as to look at a computer screen. In other
situations, computer monitors are inadequate visual media for conveying
information needed by the user. For instance, computer monitors generally
cannot be used to display photographic images or moving video pictures,
such as a movie sequence showing what a particular piece of equipment
looks like, or how to disassemble that piece of equipment to effect a
particular repair.
All of the communication modules required for a multimedia interface, such
as voice recognition modules, text-to-speech voice production modules,
random access video memories and video displays are commercially
available. However, these multimedia communication modules have generally
not been incorporated into expert systems because of the difficulty of
adding multimedia features to an existing expert system, or to a system
for building expert systems. That is, there was an implicit assumption
that one would have to totally redesign and rebuild an expert system's
knowledge base and control software in order to incorporate multimedia
features.
In other word words, the primary problem has been the difficulty of
integrating the multimedia modules with traditional expert systems. It is
therefore an object of the present invention to provide a multimedia
software interface that can be added to an expert system with minimal
changes to the expert system.
Another potential problem with a multimedia expert system is that
increasing the number of communication modules increases the number of
ways that the system can fail. It is therefore another object of the
present invention to provide a modular multimedia software interface where
the failure of any input or output channel does not impact the
functionality of the remaining parts of the expert system.
Still another potential problem with building a multimedia expert system is
that the multimedia features would be needed or useful mostly in
situations requiring that the expert system be portable. While powerful
portable computers are now available, these computers generally do not
include voice input, voice output, video output and a printer because it
has been assumed that the inclusion of all of these features would make
the resulting system nonportable.
It is therefore an object of the present invention to provide a portable
multimedia expert system delivery vehicle which weighs no more than 30
pounds, and is truly portable.
SUMMARY OF THE INVENTION
In summary, the present invention provides a multimedia interface, which
presents information and receives user commands, for a computer system.
The multimedia interface operates in parallel with another application
software module, such as an expert system. To add multimedia features to
the application software module, the module is modified so as to generate
multimedia commands at the same time as it displays text on a text
monitor. The multimedia commands, which are held in a queue, provide
additional information in the form of video images and generated speech
corresponding to the displayed text. In addition, the multimedia commands
are split into at least two sets: one set which is dispatched to the user
substantially immediately after displaying the corresponding text, and one
set which is dispatched only upon request by the user.
In the preferred embodiment, information is presented through text,
graphics, video, speech production, and printed output; control inputs are
made through a special-function keypad and voice recognition. The
preferred embodiment is a portable expert system which fits in a single
portable suitcase sized package.
BRIEF DESCRIPTION OF THE DRAWINGS
Additional objects and features of the invention will be more readily
apparent from the following detailed description and appended claims when
taken in conjunction with the drawings, in which:
FIG. 1 is a perspective, partially cut away view of a prototype of a
portable expert system incorporating the present invention.
FIG. 2 depicts a conceptual flow chart of the flow of information in the
present information.
FIG. 3 is a diagram of the queue data structure used in a multimedia
interface.
FIG. 4 is a flow chart of the system cycle of an expert system showing the
multimedia interface tasks added to an expert system which previously did
not have such tasks.
FIGS. 5A and 5B depict the keyboard layout for the portable expert system
in FIG. 1.
FIG. 6 is a block diagram of the software routines used in the preferred
embodiment.
FIG. 7 is a block diagram of the hardware for an expert system
incorporating the present invention.
FIG. 8 shows a side by side presentation of a video image and corresponding
text generated by a preferred embodiment of the present invention.
DESCRIPTION OF THE PREFERRED EMBODIMENTS
The preferred embodiment of the present invention is a multimedia interface
for an expert system. It should be understood, however, that the present
invention can be used with many types of computer systems and many types
of application software programs.
Overview of the Preferred Embodiment
The preferred embodiment of the present invention shown in FIG. 1 is a
portable computer system incorporating a multimedia implementation of an
expert system known as The Expert Adviser for Combustion Turbines,
developed under a contract with the Electric Power Research Institute.
This is a knowledge-based system designed to aid maintenanc | | |