An interactive audio system employs a sound signal processor coupled with a microprocessor as an interactive audio control system. A pair of tranceivers, operated as stereophonic loudspeakers and also as receiving microphones, are coupled with the signal processor for receiving voice commands from a principal user. The voice commands are processed to operate a variety of different devices, such as television, tape, radio or CD player for supplying signals to the processor, from which signals then are supplied to the loudspeakers of the transceivers to produce the desired sound. Additional infrared sensors may be utilized to constantly triangulate the position of the principal listener to supply signals back through the transceiver system to the processor for constantly adjusting the balance of the sound to maintain the "sweet spot" of the sound focused on the principal listener. Additional devices also may be controlled by the signal processor in response to voice commands which are matched with stored commands to produce an output from the signal processor to operate these other devices in accordance with the spoken voice commands. The system is capable of responding to voice commands simultaneously with the reproduction of stereophonic sound from any one of the sources of sound which are operated by the system.
A range finding audio system automatically modifies the audio output of an audio source based on the distance of a listener from the speakers. A speaker in an audio system may include a range device coupled with a controller. The range device may utilize infrared, laser, or acoustic technology to determine the distance between the speaker and the listener. The controller may transfer distance information to an audio interface of a processor unit. The audio interface may include a positioning routine to modify the audio output according to the distance from the speaker to the listener. Alternatively, the controller may perform the functions ascribed to the positioning routine making the necessary modifications to the audio output based on the distance information.
A method and system for location dependent verbal command execution in a computer based control system within an installation having multiple physical locations. A specified function within each physical location, such as a lighting fixture or alarm setting, may be controlled by a selected verbal command. A microphone within each room or physical location within the installation is utilized to detect each utterance of a verbal command and the volume of each verbal command is determined for each physical location at which that command is detected. Thereafter, the physical location having the highest volume for a detected verbal command is identified and the specified function is controlled at only that location. In the event multiple speakers simultaneously utter a verbal command at different physical locations, the location of maximum volume is determined for each speaker and the specified function is controlled at only the maximum volume location associated with each speaker. In this manner, a single verbal command may be utilized to control an identical function at multiple locations within an installation in a location dependent manner.
In a sound producing apparatus and a sound producing method of this invention, at least underlying sound representing situations of the place within virtually actualized space and ornamental sound corresponding to object within the virtually actualized space which is not independent of presence or absence of display on a display unit are produced to set virtual sound space corresponding to the virtually actualized space displayed on the display unit. Thus, speech (sound) can be utilized as means for grasping spatial information as in the actual world within the virtually actualized space, and offer of information utilizing speech (sound) can be made. In addition, the advertisement effect can be also increased.
A speech recognition apparatus including an audio cancellation module is disclosed. The module includes an audio input for receiving an audio signal from a microphone. The module also includes at least two audio inputs for receiving audio signals from respective independent audio sources. The audio cancellation module produces a speech signal by canceling two of the independent audio source signals from the microphone signal. A speech recognizer is used to recognize at least part of the speech signal.
Method, product and system alters audio data for a synthesized voice so that when it is produced on a speaker system, it appears to emanate from a spatial position. First, the voice is synthesized into a speech waveform from a set of stored data representative of a text string using standard techniques. The speech waveform is converted into analog signals for a right and left channel. According to the invention, the analog signals to the right and left channels are altered according to position data stored with the text string so that the synthesized voice appears to originate at the apparent spatial position when the analog signals are sent to a speaker system.