or
Bookmark and Share
System and method for inferring user information need in hypermedia linked document collection
   
Document Number
US Patent 7017110
Issued Date
March 21, 2006
Link
Inventors
Chi; Ed H. (Palo Alto, CA)
Pirolli; Peter L. (San Francisco, CA)
Map
Abstract
The present invention provides a system and method for inferring information need in a collection of hypermedia documents that is based on the observation that a user's hypertext link traversal decisions are typically based on the nature of that user's information need. The system identifies the hypermedia linkage structure among the plurality of documents in the collection. The documents include content items that may be relevant to a user information need. The system then accepts a user path item that represents a user's hypermedia link traversal history and applies a network flow model to the user path item in the hypermedia link information in order to create a document vector. The system also determines the distribution of the content items in the document collection, and then compares the document vector to the content item distribution in order to determine an inferred information need.
Tags:
Description:
Amusing 0%
Clever 0%
Complex 0%
Efficient 0%
Historic 0%
Important 0%
Innovative 0%
Interesting 0%
Practical 0%
Simple 0%
Number of Claims:
9
Comments:
no comments yet
Owner
Xerox Corporation (Stamford, CT)
Published
March 21, 2006
Application Number
09/540,063
Filed
March 31, 2000
US Classification
715/501.1  
Int'l Classification
G06F   17/00   (20060101)  
Examiner
Assistant Examiner
USPTO Field of Search
715/501.1   345/854  
Related Patents
7260643 - Systems and methods for identifying user types using multi-modal clustering and information scent - Owned by Xerox Corporation (Stamford, CT)

Techniques for determining user types based on multi-modal clustering are provided. The topology, content and usage of a document collection or web site is determined. The user paths are identified using longest repeating subsequence techniques and a multi-modal information need vector is determined for each significant user path. Multi-modal vectors for each document in the significant path, content, uniform resource locators, inlink and outlink multi-modal vectors are determined and combined based on path position and access frequency. Multi-modal clustering is performed based on a multi-modal similarity function and a specified measure of similarity using a type of multi-modal clustering such as K-means or wavefront clustering. The identified clusters may be further analyzed based on changes to the weighting of the corresponding content, url, inlinks and outlinks multi-modal feature vectors.

Claims
Description
About| FAQs| Terms & Disclaimer| Link to Us| Contact Us