or
Bookmark and Share
Prosody based audio/visual co-analysis for co-verbal gesture recognition
 
   
Document Number
US Patent 7321854
Issued Date
January 22, 2008
Link
Inventors
Sharma; Rajeev (State College, PA)
Map
Abstract
The present method incorporates audio and visual cues from human gesticulation for automatic recognition. The methodology articulates a framework for co-analyzing gestures and prosodic elements of a person's speech. The methodology can be applied to a wide range of algorithms involving analysis of gesticulating individuals. The examples of interactive technology applications can range from information kiosks to personal computers. The video analysis of human activity provides a basis for the development of automated surveillance technologies in public places such as airports, shopping malls, and sporting events.
Tags:
Description:
Amusing 0%
Clever 0%
Complex 0%
Efficient 0%
Historic 0%
Important 0%
Innovative 0%
Interesting 0%
Practical 0%
Simple 0%
Number of Claims:
19
Comments:
no comments yet
Owner
Published
January 22, 2008
Application Number
10/666,460
Filed
September 19, 2003
US Classification
704/243   704/276 704/E15.041
Int'l Classification
G10L   15/00   (20060101)  
Examiner
Parent Case
CROSS-REFERENCE TO RELATED APPLICATIONS This application is based on and claims priority to U.S. Provisional Application No. 60/413,998, filed Sep. 19, 2002, which is fully incorporated herein by reference.
USPTO Field of Search
704/243   704/276   704/246  
Related Patents
7486815 - Method and apparatus for scene learning and three-dimensional tracking using stereo video cameras - Owned by Microsoft Corporation (Redmond, WA)

A method and apparatus are provided for learning a model for the appearance of an object while tracking the position of the object in three dimensions. Under embodiments of the present invention, this is achieved by combining a particle filtering technique for tracking the object's position with an expectation-maximization technique for learning the appearance of the object. Two stereo cameras are used to generate data for the learning and tracking.

7493293 - System and method for extracting entities of interest from text using n-gram models - Owned by International Business Machines Corporation (Armonk, NY)

A document (or multiple documents) is analyzed to identify entities of interest within that document. This is accomplished by constructing n-gram or bi-gram models that correspond to different kinds of text entities, such as chemistry-related words and generic English words. The models can be constructed from training text selected to reflect a particular kind of text entity. The document is tokenized, and the tokens are run against the models to determine, for each token, which kind of text entity is most likely to be associated with that token. The entities of interest in the document can then be annotated accordingly.

Claims
Description
About| FAQs| Terms & Disclaimer| Link to Us| Contact Us