or
Bookmark and Share
Method, apparatus and system for building a compact language model for large vocabulary continuous speech recognition (LVCSR) system
   
Document Number
US Patent 7418386
Issued Date
August 26, 2008
Link
Inventors
Map
Abstract
According to one aspect of the invention, a method is provided in which a set of probabilistic attributes in an N-gram language model is classified into a plurality of classes. Each resultant class is clustered into a plurality of segments to build a code-book for the respective class using a modified K-means clustering process which dynamically adjusts the size and centroid of each segment during each iteration in the modified K-means clustering process. A probabilistic attribute in each class is then represented by the centroid of the corresponding segment to which the respective probabilistic attribute belongs.
Tags:
Description:
Amusing 0%
Clever 0%
Complex 0%
Efficient 0%
Historic 0%
Important 0%
Innovative 0%
Interesting 0%
Practical 0%
Simple 0%
Number of Claims:
30
Comments:
no comments yet
Owner
Intel Corporation (Santa Clara, CA)
Published
August 26, 2008
Application Number
10/297,354
Filed
April 3, 2001
US Classification
704/257  
Int'l Classification
G10L   15/18   (20060101)   G06F   17/20   (20060101)  
Examiner
USPTO Field of Search
704/257  
Related Patents
Claims
Description
About| FAQs| Terms & Disclaimer| Link to Us| Contact Us