|
Claims  |
|
|
We claim:
1. A telecommunications system having a plurality of attendant terminals
comprising
at least one text-to-speech synthesizer unit operative for translating text
received via an associated second port into speech for delivery to an
associated first port,
means, responsive to receipt at one of said attendant terminals a request
identifying a called telephone station, for establishing a first telephone
connection to said called telephone station in accordance with an
instruction entered into said one attendant terminal, said request being
entered via a second telephone connection extending to a data terminal
operated by a sound impaired person,
means, responsive to said instruction, for extending said first connection
to said first port of said one text-to-speech synthesizer and for
extending said second telephone connection associated with said data
terminal to said second port of said text-to-speech synthesizer, and
means, contained in said synthesizer and responsive to receipt of a
predetermined key word via said second connection, for causing said first
connection to be connected to an available one of said attendant terminals
so that an attendant associated with said one terminal may convert speech
signals received via said first connection into text for delivery to said
second connection via said second port of said one text-to-speech
synthesizer.
2. The system set forth in claim 1 further comprising
a local data network, each of said attendant terminals and said at least
one text-to-speech synthesizer unit being interconnected to one another
via said local data network.
3. The system set forth in claim 1 further comprising means responsive to
receipt of said predetermined key word via said first connection for then
causing said first connection to be disconnected from said available
terminal and reconnected to said first port of said text-to-speech
synthesizer.
4. The system set forth in claim 1 wherein if said sound impaired person is
a person having a speech impairment but not a hearing impairment, then
said means for causing, responsive to said predetermined key word, invokes
means for establishing a communications path between said first and second
ports of said one text-to-speech synthesizer unit so that said speech
signals received via said first connection and said first port may be
supplied directly to said impaired person via said second port and said
second connection.
5. The system set forth in claim 4 further comprising means responsive to
receipt of said predetermined keyword via said first port of said
text-to-speech synthesizer unit for then disconnecting said communications
path between said first and second ports.
6. The system set forth in claim 1 further comprising
at least one automatic speech recognition unit having an input terminal and
an output terminal, said automatic speech recognition unit being operative
for translating received speech into respective text based on a
predetermined template of words associated with a speaker, said speaker
being associated with said called telephone station, and
wherein, in the event that said sound impaired person is a person that is
hearing impaired but not speech impaired, then, in that event, said means
for establishing and extending extends said first connection to said input
terminal of said one automatic speech recognition unit, rather than to
said first port of said one text-to-speech unit, and extends said second
telephone connection associated with said data terminal to said output
terminal of said automatic speech recognition unit so that said automatic
speech recognition unit may convert speech signals received via said input
terminal into text for delivery to said output terminal and thence to said
data terminal.
7. The system set forth in claim 2 further comprising
at least one automatic speech recognition unit having a connection to said
local data network and being operative for translating speech signals
received via an associated input into respective text for delivery to said
local data network, said translation being based on a predetermined
template of words associated with a speaker, said speaker being associated
with said telephone station, wherein said means for causing and extending
includes means, responsive to receipt of said predetermined key word, for
transferring said first connection to said associated input of said one
automatic speech recognition unit so that speech received via said first
connection may be translated into respective text and supplied to said
second port of said one text-to-speech unit via said local data network
for delivery to said data terminal.
8. The system set forth in claim 7 wherein said means for transferring
including means, responsive to receipt of said predetermined key word via
said first connection, for then causing said first connection to be
disconnected from said associated input of said one automatic speech
recognition unit and reconnected to said first port of said one
text-to-speech unit.
9. The system set forth in claim 7 wherein said means for transferring
including means, responsive to an absence of speech being received via
said first connection for a predetermined period of time, for then causing
said first connection to be disconnected from said associated input of
said one automatic speech recognition unit and reconnected to said first
port of said one text-to-speech unit.
10. A telecommunications system having a plurality of attendant terminals
comprising
at least one text-to-speech synthesizer,
means, responsive to receipt of a request and associated text at one of
said attendant terminals, in which said request identifies at least a
telephone station and an enhanced communications service, for storing in a
memory via a path including a local data network said request and
associated text, said request and said text being received from a data
terminal connected to a communications path extending to said one
attendant terminal, said data terminal being operated by a sound impaired
person, and
means thereafter operative for establishing another communications path to
said enhanced service and supplying the identification of said telephone
station to said enhanced service via said other communications path, and
wherein said means for establishing includes means for passing said stored
text through said text-to-speech synthesizer and for supplying the
resulting speech signals to said enhanced service via said other
communications path.
11. The system set forth in claim 10 wherein said enhanced service is a
voice mail service.
12. The system set forth in claim 10 wherein said enhanced service is a
language translation service.
13. The system set forth in claim 10 wherein said means for passing
includes means operative in the event that said enhanced service is a
facsimile service for then causing said stored text to be supplied
directly to said enhanced service.
14. A dual party relay communications system comprising
a plurality of attendant terminals,
at least one text-to-speech synthesizer unit operative for translating text
received via an associated second port into speech for delivery to an
associated first port,
means, responsive to receipt of a telephone call associated with a first
communications path established by a data terminal operated by a sound
impaired person, for establishing a second communications path to a
telephone station and for respectively extending to said first and second
ports said second and first communications paths so that text received
from said data terminal may be translated into speech for delivery to said
second communications path, said second communications path being
established in accordance with a request received from said data terminal,
and means, responsive to receiving text indicative of a predetermined key
word from said data terminal, for causing said second communications path
to be transferred to an available one of said attendant terminals so that
an attendant associated with said available terminal may translate speech
received via said second communications path into text for delivery to
said data terminal via said first communications path.
15. The system set forth in claim 14 further comprising means responsive to
receiving said predetermined key word via said second communications for
then causing said second communications path to be disconnected from said
available attendant terminal and reconnected to said first port of said
one text-to-speech synthesizer.
16. The system set forth in claim 14 wherein if said sound impaired person
is a person having a speech impairment but not a hearing impairment, then
said means for causing, responsive to said predetermined key word, invokes
means for establishing a connection between said first and second ports of
said text-to-speech synthesizer unit so that said speech received via said
first port may be supplied directly to said impaired person via said
second port and said first communications path.
17. The system set forth in claim 16 further comprising means responsive to
receipt of said predetermined key word via said first port of said
text-to-speech synthesizer unit or text received via said second port, for
then terminating said direct connection between said first and second
ports.
18. The system set forth in claim 14 wherein said system further comprises
means, operative in the event that text is not received from said data
terminal for a predetermined period of time, for then invoking said means
for causing said transfer.
19. The system set forth in claim 14 further comprising
at least one automatic speech recognition unit operative for translating
received speech into respective text based on a predetermined template of
words associated with a speaker, said speaker being associated with said
telephone station,
wherein, in the event that said sound impaired person is a person that is
hearing impaired but not speech impaired, then, in that event, said means
for establishing and extending extends said second communications path to
one input of said one automatic speech recognition unit, rather than to
said first port of said one text-to-speech unit, and extends said first
communications path associated with said data terminal to another input of
said automatic speech recognition unit so that text indicative of speech
received via said one input may be supplied to said other input for
delivery to said data terminal.
20. The system set forth in claim 14 further comprising
a local data network, and
at least one automatic speech recognition unit operative for translating
received speech into respective text based on a predetermined template of
words associated with a speaker, said speaker being associated with said
telephone station, wherein said means for transferring includes means,
responsive to receipt of said predetermined key word, for causing said
second communications path to be transferred to an input of said one
automatic speech recognition unit so that speech received via said second
communications path may be translated into respective text and supplied to
said second port of said one text-to-speech unit via said local data
network for delivery to said data terminal via said first communications
path.
21. The system set forth in claim 20 wherein said means for transferring to
said input of said automatic speech recognition unit including means,
responsive to receipt of said predetermined keyword via said second
communications path, for then causing said second communications path to
be disconnected from said input of said one automatic speech recognition
unit and reconnected to said first port of said one text-to-speech unit.
22. The system set forth in claim 20 wherein said means for transferring to
said one automatic speech recognition unit including means, responsive to
an absence of speech being received via said second communications path
for a predetermined period of time, for then causing said second
communications path to be disconnected from said input of said one
automatic speech recognition unit and reconnected to said first port of
said one text-to-speech unit. |
|
|
|
|
Claims  |
|
|
Description  |
|
|
FIELD OF THE INVENTION
The invention relates to a dual party relay service that is used to
establish a telephone call between a sound impaired person and a person
who is not so impaired, and more particularly relates to automating
certain aspects of such a service.
BACKGROUND OF THE INVENTION
Dual Party Relay (DPR) service is a service that allows a sound-impaired
(i.e., hearing or speech impaired) individual to communicate, via a
telephone connection, with an individual who is not so impaired. An
impaired individual may do this by dialing a particular telephone
number(s) to establish a connection to a so-called DPR attendant who is
trained to "translate" and relay the ensuing conversation.
In particular, a hearing-impaired person may place a telephone call to a
DPR center via a device commonly referred to as a Telecommunications
Device for the Deaf (TDD). A TDD has an alphanumeric keyboard and either a
full screen or a single line character display, and employs either BAUDOT
or ASCII signaling or both. The hearing-impaired person (hereinafter also
referred to as TDD user) operating the TDD keyboard enters a text message,
which is transmitted via a telephone connection and displayed at an
attendant console. An attendant positioned at the console transmits via a
separate telephone connection to the unimpaired person an oral version of
the displayed text message. The attendant then listens to and enters via a
console keyboard for transmission via the telephone connection to the TDD
for display thereat a text message corresponding to the unimpaired
person's oral response. The attendant continues such translation services
for the duration of the DPR telephone call.
More specifically, if the TDD user can speak but cannot hear, then a
switching state referred to as "Voice CarryOver" (VCO) allows that TDD
user to speak directly to the unimpaired person. If, on the other hand,
the TDD user can hear but cannot speak then a switching state referred to
as "Hearing CarryOver" (HCO) allows that TDD user to receive speech
directly from the unimpaired person. In the first instance, the attendant
translates the unimpaired person's verbal response into text for the
benefit of the hearing impaired TDD user, and in the second instance, the
attendant translates the TDD user's text message into speech for benefit
of the unimpaired person.
It can be appreciated from the foregoing that the cost of a DPR telephone
call is not insignificant, since the aforementioned translation service is
labor intensive.
It has been suggested that a Text-To-Speech (TTS) synthesizer and possibly
automatic speech recognition could be employed to perform such translation
services, thereby decreasing the amount of time an attendant is involved
in serving DPR calls, and thereby decreasing the cost of such calls. The
way in which a TTS synthesizer and/or automatic speech recognition could
be employed in a dual party relay system is not obvious, and the
implementation of either one of those features is not a trival task.
Currently, large interexchange carriers offer a number of so-called
enhanced services, such as, for example, electronic mail services, which
store a voice and/or textual message and then, at a later time, forward
the message to an identified recipient. However, sound impaired persons
cannot take advantage of such enhanced services. The reason for this is
that such services typically require a caller to respond to prerecorded
verbal instructions to invoke the respective service.
SUMMARY OF THE INVENTION
An advance in provisioning dual party relay service is obtained by, in
accordance with the invention, employing a network architecture, which in
combination with automatic call distribution, text-to-speech processing,
data networking and, on a limited basis, automatic speech recognition,
decreases appreciably the amount of time that an attendant is involved in
a dual party relay call. In addition, the inventive architecture may be
adapted to extend to sound impaired persons the advantages offered by
enhanced services, such as voice mail service. In particular, text
provided by a sound impaired person is first stored in memory, and then
translated into speech for processing by an enhanced service.
Further, the inventive architecture may be adapted to accommodate a
partially impaired person who can hear but cannot speak, and who desires
to personally listen to the response of the called speaker, in which case
the impaired person is multiplexed between a text-to-speech synthesizer
and the telephone connection extending to the called party. The inventive
architecture may be similarly adapted to handle the opposite case in which
the impaired person can speak but cannot hear.
BRIEF DESCRIPTION OF THE DRAWING
In the drawing:
FIG. 1 shows a broad block diagram of a telecommunications system in which
the principles of the invention may be practiced;
FIG. 2 shows a block diagram of the automatic speech recognition unit of
FIG. 1; and
FIG. 3 shows a block diagram of the text-to-speech unit of FIG. 1.
DETAILED DESCRIPTION
Turning now to FIG. 1, there is shown a broad block diagram of Dual Party
Relay System (DPRS) 200 coupled to a telecommunications network 100, which
may be, for example, the well-known AT&T network. As is well-known, the
AT&T network comprises a plurality of so-called toll switches (not shown)
interconnected with one another to provide long distance telephone and
data services for its subscribers, such as the subscribers associated with
stations S1 and S2.
AT&T also provides for its subscribers a number of special services, such
as, for example, (1) a Voice Messaging Service (VMS) 115, which stores and
forwards to an intended destination a voice message and which is commonly
referred to as the VoiceMark (service mark of AT&T) messaging service; (2)
a language translation service 120 for subscribers who do not speak the
English language, which service is referred to as Language Line (LL), and
(3) an enhanced facsimile service (EFAX) 125, which stores and forwards to
an intended destination a digital message (i.e., facsimile message). The
manner in which a subscriber, e.g., the subscriber associated with station
S1, establishes a telephone connection to any one of the foregoing
services or to another subscriber served by network 100 is well-known and
will not be discussed herein. However, it suffices to say that a
subscriber positioned at station S1 may establish such a connection by
causing station S1 to go "off-hook" and then dialing a respective
telephone number. The local Central Office (CO) 5 associated with station
S1 collects the telephone digits as they are dialed and establishes a
connection to an associated toll switch (not shown) contained in network
100. The toll switch, in turn, and based on the dialed telephone number
that it receives from CO 5, establishes a network 100 connection to a
so-called destination toll switch. The destination toll switch then
extends the connection to the desired location. If the dialed telephone
number is associated with one of the aforementioned services, then the
destination toll swith extends the connection to the desired service. If,
on the other hand, the dialed telephone number is associated with another
subscriber, then the destination switch extends the connection to a CO
associated with that subscriber. The latter CO then extends the connection
to the telephone station (not shown) associated with the other subscriber.
Each of the services 115, 120 or 125 interacts with a subscriber by
transmitting to the latter via a respective network 100 connection
prerecorded verbal instructions. For example, in the case of services 115
and 125, the subscriber is verbally instructed to enter, inter alia, the
telephone number of the destination that is to receive a message that the
subscriber intends to send. The service then instructs the subscriber to
enter the message, voice or facsimile as the case may be. In the case of
the language translation service 120, the subscriber is verbally
instructed to enter the identity of the language that is to be translated.
It is apparent that as a result of such interaction a sound-impaired person
cannot take advantage of the enhanced services provided by network 100.
Accordingly, a hearing-impaired person cannot arrange to have either a
voice or facsimile message stored and then forwarded to an intended
recipient. In addition, a sound-impaired person cannot communicate via a
telephone connection with a person unfamiliar with the English language.
We have recognized these and other limitations and have designed a Dual
Party Relay System (DPRS) having an architecture that is geared toward
allowing a TDD user, in accord with an aspect of the invention, to
advantageously use such services. Moreover, the architecture of our DPRS
partially automates a dual party relay call by employing text-to-speech
synthesis to significantly decrease attendant involvement, thereby
significantly reducing the cost of the dual party relay service. In
addition, the architecture of our DPRS, in accord with an aspect of the
invention, employs what we call independent and dependent Speech-To-Text
Synthesis (STTS) using Automatic Speech Recognition (ASR) means that is
based on predetermined speech recognition templates.
In particular, DPRS 200 includes Automatic Call Distributor (ACD) 205, a
plurality of Automatic Speech Recognition (ASR) units 210-1 through 210-N,
a plurality of Text-to-Speech Synthesis (TTS) Units 215-1 through 215-N, a
plurality of attendant terminals 220-1 through 220-N, an Applications
Processor (AP) 225 and associated memory 226. As will be discussed below,
the ASR units, TTS units, attendant terminals and applications processor
may communicate with one another via Local Area Network (LAN) 230, which
may be, for example, the Datakit (registered trademark of AT&T) virtual
circuit network available from AT&T.
ACD 205, which may be, for example, a so-called private branch exchange,
such as the System 85 communications system available from AT&T,
interfaces DPRS 200 with network 100. That is, ACD 205, inter alia,
operates in a conventional manner to distribute incoming calls received
via network 100 among the attendants positioned at terminals 220-1 through
220-N. ACD 205 also operates in a conventional manner to establish via
network 100 a telephone connection associated with a call originated at
one of the terminals 220-1 through 220-N, as will be discussed below.
ASR units 210-1 through 210-N are identical to one another and provide a
switching capability between a TDD user and an unimpaired speaker. That
is, each ASR unit connects to two ACD 205 port circuits via a respective
communication path 211. As shown in FIG. 2, one of the two ports connects
the ASR to the TDD user via communication path 302 (also referred to
herein as ASR port 2), ACD 205 and network 200. The other ACD 205 port
connects the ASR to the unimpaired speaker via communication path 301
(also referred to herein as ASR port 1), ACD 205 and network 100. Each ASR
unit may be, for example, a Conversant (registered trademark of AT&T)
voice system that is available from AT&T. The Conversant voice system is
disclosed in the AT&T Technical Journal, September/October 1986, Volume
65, Issue 5, Robert J. Perdue and Eugene L. Rissanen, entitled Conversant
1 Voice System: Architecture and Applications, AT&T Technical Journal,
September/October 1986, Volume 65, Issue 5, which is hereby incorporated
by reference.
Each ASR is also arranged to recognize particular words characterized
herein as speaker independent and speaker dependent speech recognition.
Speaker dependent recognition is used in the instance in which a
particular speaker has developed (i.e., "pretrained") a so-called voice
template that DPRS 200 may use to recognize a limited vocabulary of words
that may be uttered by the speaker. Such pretraining involves an off-line
training session in which the particular speaker in communication with a
voice processor enunciates each word or appropriate "pieceparts" thereof,
of a limited vocabulary of words and does so until the voice processor
develops a template which enables the voice processor to recognize each
such word when it is uttered by the speaker during an actual call. The
template constitutes a digital version and ASCII coded version of each of
the limited vocabulary of words and is stored in memory 226 at a location
associated with the speaker. Thereafter, and at the time that the speaker
is connected to ASR port 1, AP 225 loads the unimpaired speaker's
associated voice template into voice template memory 310.
Speaker independent recognition is based on a common template or set of
templates which enables an ASR unit to recognize particular phrases,
commonly referred to as key words, received from any speaker. The
combination of key words should be chosen so that they are not likely to
be spoken in an ordinary sentence. For example, the words "star over"
("*over") form a unique combination that is unlikely to be contained in a
speaker's response. Such key words prompts the associated ASR to switch
from its present ASR configuration to a particular operating configuration
referred to herein as "carryover", as will be discussed below.
As seen from FIG. 2, ASR unit 210 also includes a pattern matching circuit
315 which is connected to ASR port 1. Analog signals representing a word
uttered by the unimpaired speaker and presented to the input of circuit
315 are converted into a digital pattern (signals) using a conventional
analog-to-digital converter. Appropriate digital features are then
extracted from the pattern using a digital signal processor (not shown).
Circuit 315 then matches (compares) such features with digital patterns of
respective words forming the limited vocabulary template stored in voice
template memory 310. If a match is found, then circuit 315 passes an ASCII
coded version of the digitized word to modem 305 for transmission to the
TDD user via ASR port 302.
If a match is not found, then, circuit 315 notifies processor 320 of that
fact via lead 316. Processor 320, as directed by its associated DPRS
program stored in ROM 325, increments a respective software counter stored
in RAM memory 330, as a means of tracking the number of mis-matches that
occur over the course of the current conversation. Processor 320 then
compares the value contained in the counter with a predetermined
threshold. If that value exceeds the predetermined threshold, then
processor 320 causes circuit 315 to pass to modem 305 an appropriate
message--illustratively--"excessive translation errors"--for transmission
to the TDD user via port 302. In addition, Processor 320, in accordance
with an aspect of the invention, terminates automatic speech recognition
by sending to AP 225 via LAN 230 a message requesting the intervention of
an attendant. AP 225, in turn, sends to ACD 205 via communication path 227
a number of so-called third-party call control messages requesting that
ACD 205 transfer the voice caller to an available attendant's voice
station. ACD 205 responsive thereto and in a conventional manner
reconfigures the call as requested. In doing so, ACD 205 transfers the
call to an available attendant terminal and brings up on the associated
display the particulars associated with the call, e.g., identity of the
parties, their respective telephone numbers, etc.
At that point, the attendant may request that the speaker repeat the
substance of his or her response. As the speaker does so, the attendant
types in via the terminal keyboard each word that is uttered by the
speaker for transmission to the TDD terminal connected to the call. At the
end of the speaker's response, the attendant sends to AP 225 via LAN 230 a
message requesting that control of the call be transferred to the TDD
terminal. As discussed below, AP 225 responsive thereto notifies ACD 205
that the connection between the TDD call and attendant terminal is
terminated so that terminal may be made available to serve other TDD
calls.
Continuing with FIG. 2, pattern matching circuit 315 also compares the
digital pattern of adjacent received words with the aforementioned common
template to determine if it compares with the above-mentioned key word of
"star over". If the comparison is found to be true, then circuit 315
notifies processor 320 via lead 316. If the TDD user had selected the
option to use voice carryover (i.e., the TDD user can speak but cannot
hear, and opted to speak directly to the unimpaired speaker and had so
notified the attendant at the time of setting up the TDD call), then
processor 320 responsive to such notification causes terminals 1 and 2 of
switch 303 to be connected to one another to establish a bridged
connection between ASR ports 1 and 2. As will be discussed below, this
switching configuration is similarly characterized as a "carryover" state,
which allows the TDD user to speak directly to the unimpaired speaker.
As a feature of the invention, pattern matching circuit 315 implements a
timer which is started at the end of a received word and which is stopped
at the beginning of the next received word. This is done to cover the case
where the unimpaired speaker fails to utter the aforementioned keyword, or
the uttered keyword is not detected or recognized, so that control of the
connection may be passed to the TDD user. Thus, if the value of the timer
reaches a predetermined threshold--illustratively 10 seconds--, then, in
that event circuit 315 signals (notifies) processor 320 via signaling path
317. Similarly, responsive to receipt of that signal, processor 320
outputs to path 302 via lead 318 and modem 305 for delivery to the TDD and
display thereat a message characterizing the aforementioned key words,
e.g., "star over", or "*over.
Upon seeing the aforementioned key words displayed at the TDD, the TDD user
may then enter via the TDD keyboard a response. As the TDD user enters a
character it is transmitted via the telephone connection to ACD 205 for
delivery to the respective TTS unit that may be involved in the call.
Returning to FIG. 1, TTS units 215-1 through 215-N are also identical to
one another and each such unit connects to two ACD 205 port circuits via
respective communication paths 216. One of the two ports connects its
associated TTS unit to the TDD terminal via a path including ACD 205 and
network 200. The other port connects the associated TTS unit to the
unimpaired speaker via a similar path. A TTS unit connected to a TDD user
via the aforementioned path receives digitally encoded text which an
impaired user enters ("types in") via the keyboard of his or her TDD unit.
Such text is then passed to a conventional text-to-speech synthesizer,
which may be, for example, the Olive-Liberman Synthesizer, J. P. Olive and
M. Y. Liberman, 1985, Text-To-Speech-An Overview, Journal of the Acoustic
Society of America, Supplement 1 78, S6, which is hereby incorporated by
reference.
Turning now to FIG. 3, there is shown a block diagram of a TTS unit 215.
TTS unit 215 includes Text-to-Speech Synthesizer (TSS) 410, processor 415,
modem 430 and multiplexed switches 405 and 435. ROM 420 and RAM 425 are
associated with processor 415. It is seen that TSS unit 410 has one
connection to TTS port 1 (designated 401 in the FIG.) and another
connection to terminal 2 of switch 435. The former connection extends TSS
410 to the unimpaired speaker, as mentioned above, and the latter
connection extends TSS 410 to one port of modem 430 via terminal 1 of
switch 435. The other port of modem 430 connects to terminal 1 of switch
405.
TSS unit 410 may be bypassed by interconnecting terminals 1 and 2 of switch
405 to establish a direct connection between ports 1 and 2 of the TTS
unit. In addition, an input 417 to processor 415 may be extended to port 2
via terminals 1 and 3 of switch 435, and modem 430. Processor 415 controls
the operation of switches 405 and 435 and may cause either switch to
change an electrical connection extending from one associated terminal to
another associated terminal by "toggling" either lead 416 or 418,
respectively.
In a TTS configuration, as characterized by a connection between terminals
1 and 2 of switch 435, TDD text received via port 2 is presented to the
input of TTS 410 via modem 430. TTS 410 converts into synthesized analog
speech each textual word that it receives via that connection and then
presents the result to TTS port 1 (path 401) for delivery to the
unimpaired speaker via ACD 205 and network 100.
In addition, each TTS unit 215-1 through 215-N is arranged to compare each
pair of adjacent textual words with a predetermined digital template to
determine if the words are equivalent to the aforementioned predetermined
key words (e.g., "star over", or "*over") indicative of the end of a
response originated by the TDD user. If the comparison is true, then TTS
410 notifies processor 415 via lead 412. Processor 415, in turn, and under
the direction of an associated program stored in ROM 420, sends to AP 225
via LAN 230 an appropriate message requesting the intervention of an
attendant. AP 225, in turn, sends to ACD 205 a number of third party call
control messages requesting that ACD 205 transfer the voice call to an
available attendant's voice station. ACD 205 selects an available
attendant terminal and associates the terminal voice station with port 1
of the pertinent TTS unit. In addition, ACD 205 sends to AP 225 via bus
227 the identity of the selected attendant terminal. AP 225, in turn,
sends to processor 415 via LAN 230 a message containing the identity of
associated attendant terminal.
Responsive to receipt of that message, processor 415 "toggles" lead 418 to
establish a connection between terminals 1 and 3 of switch 435 to extend
processor connection 417 to modem 430. In this way, processor 415 extends
the TDD call, via LAN 230, to the required attendant data terminal
previously identified by AP 225. As a result of this call configuration,
the attendant listens to the voice user and enters into the associated
terminal the corresponding text. The attendant terminal forms strings of
such text into respective messages containing the identity of the
pertinent TTS and transmits each message to LAN 230. Processor 415 of the
associated TTS unit removes each such message from LAN 230 and sends the
text portion thereof to modem 430 via the aforementioned connection. Modem
430, in turn, modulates such text in a well-known fashion for transmission
to the TDD user terminal via switch 405, ACD 205 and network 200.
(It is noted that the reason for transporting text messages via a backbone
path including LAN 230 is to maintain the modem carrier signal transmitted
between the TDD terminal and modem 230. As is well-known, if the TDD
terminal is a so-called ASCII terminal, then the loss of such a carrier
signal would cause both modem 430 and the ASCII terminal to terminate the
telephone connection. However, the maintenance of the carrier signal is
not applicable in the case where the TDD is a so-called BAUDOT terminal.)
Continuing, when the unimpaired speaker utters the keyword "star over",
indicating the end of his/her latest response, then the attendant handling
the call sends to ACD 205 a request to reconnect the voice user to port 1
of the pertinent TTS unit. Responsive thereto, ACD 205 reconnects the
voice, or unimpaired, user as requested and then notifies AP 225 via path
227 of that fact. AP 225, in turn, sends to processor 415 via LAN 230 a
message requesting that the TTS unit reconfigure itself for text-to-speech
processing. Processor 415 implements the request by respectively toggling
leads 416 and 418 to interconnect terminals 1 and 2 of switch 435, thereby
returning the call to the TDD configuration, as discussed above.
If the call involves an ASR unit, then the TDD-to-voice portion of the call
uses a TTS unit while the associated voice-to-TDD portion of the call uses
an ASR unit. Specifically, and assuming that the state of the call is at a
point where processor 415 has notified AP 225 that the aforementioned
keyword has been received from the TDD user, then AP 225 in accordance
with the associated call record stored in memory 226 (FIG. 1) sends to ACD
205 a message requesting that ACD 205 transfer the voice portion of an the
call to port 1 (301) of available ASR unit 210 (FIG. 2). ACD 205
responsive to receipt of the message transfers the unimpaired speaker's
ACD 205 connection from port 1 of the pertinent TTS unit 215 to port 1 of
the available ASR unit 210. In addition, ACD 205 notifies AP 225 of the
identity of the selected ASR unit 210. Referring now to FIGS. 2 and 3,
armed with that information, AP 225 sends to processor 415 via LAN 230 a
message identifying processor 320 and sends to processor 320 via LAN 230 a
message identifying processor 415.
Processor 415 responsive to receipt of the message causes terminals 1 and 2
of switch 435 to be connected to one another. As a result of this
configuration, speech received via port 1 of the ASR unit is presented to
pattern matching circuit 315 which then outputs to bus 318 an equivalent
digitized version of the received speech (text). Processor 320, in turn,
collects the digitized speech, and forms it into a data packet containing
the address of the pertinent TTS unit 215 and presents the packet to LAN
230 for delivery to the latter unit.
The TTS unit 215 upon receipt of the packet strips off the header
containing, inter alia, its address (identity), and forwards the remainder
of the packet to ACD 205 via modem 430 and the aforementioned switch 405
and 435 connections. In this call configuration, referred to as "ASR
Configuration", the voice user speaks and the ASR unit converts this
speech to text for transmission to the TDD user via the LAN and TTS unit.
As a result of this reconfiguration, the voice call is connected directly
from ACD 205 to ASR unit 210 and the TDD call is connected from ACD 205 to
ASR unit 210 via a backbone connection including TTS unit 215, LAN 230,
and ASR processor 320. As mentioned above, this "indirect" TDD
configuration is necessary to avoid loss of the modem carrier signal, and
hence the TDD call, if the TDD terminal is an ASCII terminal rather than a
Baudot terminal.
The call flow reverts to the aforementioned TSS configuration when pattern
matching circuit 315 receives the aforementioned keyword, e.g., "star
over" and notifies processor 320 accordingly. In that event then,
processor 320 sends to AP 225 via LAN 230 a message indicative of that
fact. AP 225 then sends to ACD 205 via bus 227 a message requesting that
ACD 205 transfer the unimpaired speaker's voice call to port 1 of TTS | | |