WikiPatents - Community Patent Review
Create Free Account  |  License or Sell Your Patent  |  WikiPatents Marketplace  |  WikiPatents Blog
Username:  Password:  
    
Advanced Search
Eye tracking apparatus and method employing grayscale threshold values    
United States Patent5481622   
Link to this pagehttp://www.wikipatents.com/5481622.html
Inventor(s)Gerhardt; Lester A. (Clifton Park, NY); Sabolcik; Ross M. (Austin, TX)
AbstractAn eye-tracking system determines the position of a user's pupil and maps this position into a point of regard of the user on an interface device, such as a display screen, or other real-world object by a system comprising a camera for acquiring a video image of the pupil; a frame grabber coupled to the camera for accepting and converting analog video data from the camera to digital pixel data; a computer coupled to the frame grabber for processing the digital pixel data to substantially determine the position of the pupil; a display screen coupled to the computer; and a support connected to the camera and display screen for fixing the relative physical positions thereof relative to the user's pupil. The processing performed by the computer may include the selection of a first pixel intensity threshold for the segmentation of the digital pixel data into first and second groups, where the total pixel area of the first group is selected to be substantially equal to a pre-determined value expected to correspond to the area of a user's pupil. The system may be calibrated by the user's following a cursor on the display screen while the system measures the pupil position for known locations of the cursor.
   














 Title Information Submit all comments and votes
 
Patent Text Patent PDF Print Page Summary File History
Plain text PDF images Print Summary File History
Drawing from US Patent 5481622
Eye tracking apparatus and method employing grayscale threshold values - US Patent 5481622 Drawing
Eye tracking apparatus and method employing grayscale threshold values
Inventor     Gerhardt; Lester A. (Clifton Park, NY); Sabolcik; Ross M. (Austin, TX)
Owner/Assignee     Rensselaer Polytechnic Institute (Troy, NY)
Patent assignment
All assignments
Publication Date     January 2, 1996
Application Number     08/204,008
PAIR File History     Application Data   Transaction History
Image File Wrapper   Patent Term   Fees
Litigation
Filing Date     March 1, 1994
US Classification     382/103 345/158 382/171 382/291 708/141
Int'l Classification     G06K 009/00
Examiner     Boudreau; Leo H.
Assistant Examiner     Johns; Andrew W.
Attorney/Law Firm     Heslin & Rothenberg
Address
Parent Case    
Priority Data    
USPTO Field of Search     382/1 382/9 382/48 382/100 382/103 382/117 382/171 382/173 382/291 348/78 345/8 345/157 345/158 364/709.1 364/709.11 351/206 351/209 351/210 351/245
Patent Tags     eye tracking employing grayscale threshold values
   
Enter a comma (,) or semicolon (;) between multiple tag words/phrases.
Describe this patent:
 Amusing   
 Clever   
 Complex   
 Efficient   
 Historic   
 Important   
 Innovative   
 Interesting   
 Practical   
 Simple   
[no votes]
Patent WIKI

Share information and news about this patent, including information and news about the technology, inventors, company, ligation and licensing.

 References Submit all comments and votes
 
*references marked with an asterisk below are user-added references
 U.S. References
 
Add a new US reference:  
ReferenceRelevancyCommentsReferenceRelevancyComments
3236578



[0 after 0 votes]
3542457



[0 after 0 votes]
5325133
Adachi
351/209
Jun,1994

[0 after 0 votes]
5189512
Cameron
348/838
Feb,1993

[0 after 0 votes]
5109425
Lawton
382/107
Apr,1992

[0 after 0 votes]
5094521
Jolson
351/210
Mar,1992

[0 after 0 votes]
5093567
Staveley
250/221
Mar,1992

[0 after 0 votes]
5070883
Kasahara

Dec,1991

[0 after 0 votes]
5049147
Danon
606/10
Sep,1991

[0 after 0 votes]
5034618
Akeel
250/559.33
Jul,1991

[0 after 0 votes]
5016282
Tomono
382/117
May,1991

[0 after 0 votes]
5002385
Kasahara
351/210
Mar,1991

[0 after 0 votes]
4988183
Kasahara
351/210
Jan,1991

[0 after 0 votes]
4852988
Velez
351/210
Aug,1989

[0 after 0 votes]
4836670
Hutchinson
351/210
Jun,1989

[0 after 0 votes]
4815839
Waldorf
351/210
Mar,1989

[0 after 0 votes]
4748502
Friedman
348/78
May,1988

[0 after 0 votes]
4648052
Friedman
348/78
Mar,1987

[0 after 0 votes]
4625329
Ishikawa
382/104
Nov,1986

[0 after 0 votes]
4595990
Garwin
708/141
Jun,1986

[0 after 0 votes]
4102564
Michael
351/210
Jul,1978

[0 after 0 votes]
 Foreign References
 Other References
 Market Review Submit all comments and votes
   
Market Size
Estimate the gross annual revenues of the relevant market sector:
> $10B
$5B - $10B
$2B - $5B
$500M - $2B
$100M - $500M
$10M - $100M
$1M - $10M
$500K - $1M
$100K - $500K
< $100K
[No votes]
$0
 
$0   $2.5B   $5B   $7.5B   $10B
Market Share
Estimate the percentage of the relevant market sector this invention will capture:
75% - 100%
50% - 74.99%
25% - 49.99%
10 - 24.99%
5 - 9.99%
2 - 4.99%
1 - 1.99%
< 1%
[No votes]
0.0%
 
0%   25%   50%   75%   100%
Reasonable Royalty
What percentage of gross sales should the inventor or assignee be paid?
75% - 100%
50% - 74.99%
25% - 49.99%
10 - 24.99%
5 - 9.99%
2 - 4.99%
1 - 1.99%
< 1%
[No votes]
0.0%
 
0%   25%   50%   75%   100%
Public's "Guesstimation" of Royalty Value
Market SizeN/A[No votes]
xMarket ShareN/A[No votes]
xReasonable RoyaltyN/A[No votes]

N/A

License Availablity
If you are NOT the owner or assignee, answer here:
Yes, license is available for purchase

No, license is not currently available



[No votes]
License Availablity
If you ARE the owner or assignee, answer here:
Yes, license is available for purchase

No, license is not currently available



[No votes]
Competitive Advantage
Does this invention have a significant competitive advantage over similar technologies?
Yes

No



[No votes]
Most helpful competitive advantage comment
[No comments]

Commercial Alternatives
Are there viable commercial alternatives for this invention?
Yes

No



[No votes]
Most helpful commercial alternative comment
[No comments]

 Technical Review Submit all comments and votes
 Claims Submit all comments and votes
 


We claim:

1. A system for determining a point of regard, said system comprising:

camera means for acquiring a video image of a first vision means, said first vision means comprising a pupil, a position of said pupil corresponding to said point of regard, and said video image comprising a pupil image;

frame grabber means, coupled to said camera means, for accepting video data corresponding to said video image from said camera means and converting said video data to digital pixel data of a given dynamic range so as to provide said digital pixel data as a grayscale mapping of the video image;

computer means, coupled to said frame grabber means, for processing said digital pixel data to substantially determine the position of said pupil wherein said computer means includes:

memory means for providing an expected area value for the area of said pupil within said video image;

means for determining a grayscale threshold value in accordance with said expected area value and grayscale distribution of said digital pixel data;

means for segregating said digital pixel data into two separate binary groups as first and second groups using said grayscale threshold value as a discriminator; and

means for substantially determining the position of said pupil in accordance with the segregated data;

feedback means, coupled to said computer means, for accepting data corresponding to the determined position of said pupil from said computer means and feeding back to said first vision means feedback information representative of the determined position; and

support means, connected to said camera means and said feedback means, for fixing the relative physical positions of said camera means and said feedback means.

2. The system of claim 1 wherein said determining means of said computer means further comprises:

means for grouping contiguously related individual pixels from one of said first group or said second group into a first set of representative pixel blobs; and

means for selecting from said first set of pixel blobs a pixel blob corresponding to said pupil image.

3. The system of claim 2 wherein said feedback means is a display screen; and

said computer means further comprises:

means for determining the position of said pupil image in an image coordinate space; and

means for mapping the position of said pupil image in the image coordinate space into a position in a display screen coordinate space.

4. The system of claim 3 further comprising a calibration means for determining a mapping function between the image coordinate space and the display screen coordinate space, said calibration means comprising:

means for placing a cursor at a pre-determined position on said display screen;

means for orienting said point of regard to substantially coincide with said pre-determined position;

means for determining the position of the pupil image of said vision means in the image coordinate space;

means for repeating said steps of placing a cursor, orienting said point of regard, and determining the pupil image position a plurality of times to generate a plurality of cursor position-pupil image position data pairs; and

means for fitting a curve to said data pairs to determine said mapping function.

5. The system of claim 3 wherein said position of said pupil image in image coordinate space is determined based on the centroid of said pixel blob selected as corresponding to said pupil image.

6. The system of claim 3 wherein said support means substantially fixes the relative physical position of said vision means relative to said camera and display screen.

7. The system of claim 2 wherein said computer means in selecting said pixel blob corresponding to said pupil image further comprises:

means for calculating a statistic for each pixel blob of said first set;

means for comparing said statistic of each pixel blob with an expected value corresponding to said pupil image to calculate a difference error; and

means for selecting said pixel blob corresponding to said pupil image where said difference error is less than a pre-determined tolerance.

8. The system of claim 7 wherein the computer means further comprises:

a counter that is incremented after each failure to select a pixel blob corresponding to said pupil image for the current video image being processed by said computer means; and

adjustment means for relaxing said tolerance by a pre-determined quantity after said counter reaches a pre-determined upper limit.

9. The system of claim 8 wherein:

said counter is decremented after each successful selection of a pixel blob corresponding to said pupil image for the current video image being processed by said computer means; and

said adjustment means tightens said tolerance to a pre-determined baseline value after said counter reaches a pre-determined lower limit.

10. The system of claim 7 wherein said statistic for each blob is an area, length-to-width ratio, or a centroid.

11. The system of claim 7 wherein said expected value corresponding to said pupil image is a running average of a plurality of statistics calculated for previously-selected pixel blobs corresponding to pupil images for previous video images.

12. The system of claim 2 further comprising at least one light source mounted on said support means, each light source illuminating said vision means and creating an associated radiation intensity highlight on said vision means.

13. The system of claim 12 wherein said computer means in processing said digital pixel data further comprises:

means for selecting a second pixel intensity threshold, greater in intensity than said grayscale threshold value, for the segmentation of said pixel data into third and fourth groups, said second pixel intensity threshold selected so that the total pixel area of said fourth group is substantially equal to a pre-determined expected area for all highlights of said light sources illuminating said vision means;

means for grouping contiguously related individual pixels from said fourth group into a second set of respective pixel blobs;

means for selecting from said second set of pixel blobs a pixel blob corresponding to a first highlight; and

means for comparing the relative positions of said pixel blob corresponding to said pupil image and said pixel blob corresponding to said first highlight to determine said point of regard.

14. The system of claim 13 wherein said feedback means is a display screen and said computer means in processing said pixel data further comprises:

means for determining the relative position of said pupil image relative to said first highlight in an image coordinate space; and

means for mapping said relative position of said pupil image in the image coordinate space into a point of regard in a display screen coordinate space.

15. The system of claim 14 further comprising a second light source mounted on said support means, said second light source illuminating said vision means and creating a second radiation intensity highlight on said vision means.

16. The system of claim 15 wherein the position of said pupil image relative to said second highlight is determined by said computer means.

17. The system of claim 1 wherein said computer means processes data corresponding to a plurality of video images to substantially determine the position of said pupil.

18. An eye-tracking system for determining a point of regard on a display, said eye-tracking system comprising:

a display screen providing said display and displaying information corresponding to said point of regard;

a camera for acquiring a video image of an eye having a pupil, the position of said pupil corresponding to said point of regard, and said video image comprising a pupil image;

a frame grabber, coupled to said camera, for accepting video data corresponding to said video image from said camera and converting said video data to digital pixel data of a given dynamic range so as to provide said digital pixel data as a grayscale mapping of said video image;

a computer, coupled between said frame grabber and said display screen, for processing said digital pixel data to substantially determine said point of regard on said display screen wherein said computer includes:

memory means for providing an expected area value for the area of said pupil within said video image;

means for determining a grayscale threshold value in accordance with said expected area value and grayscale distribution of said digital pixel data;

means for segregating said digital pixel data into two separate binary groups as first and second groups using said grayscale threshold value as a discriminator; and

means for substantially determining the position of said pupil in accordance with the segregated data; and

a support, connected to said camera and said display screen, for substantially fixing the physical positions of said camera means and said display screen relative to said eye.

19. The eye-tracking system of claim 18 wherein said computer in processing said digital pixel data further comprises:

means for grouping contiguously related pixels from said first group into a first set of respective pixel blobs;

means for selecting from said first set a pixel blob corresponding to said pupil image;

means for determining the position of said pupil image by a calculated value based on a property of said selected pixel blob; and

means for mapping the position of said pupil image in an image coordinate space into a position in a display screen coordinate space.

20. The eye-tracking system of claim 19 wherein said computer means in selecting the pixel blob from said first set further comprises:

means for calculating a statistic for each pixel blob in said first set;

means for comparing said statistic for each pixel blob in said first set with an expected value corresponding to said pupil image to calculate a difference error; and

means for selecting from said first set the pixel blob corresponding to said pupil image where said difference error is less than a pre-determined tolerance.

21. The eye-tracking system of claim 20 further comprising at least a first light source mounted on said support, said first light source illuminating said eye and creating a radiation intensity highlight on the corneal surface of said eye.

22. The eye-tracking system of claim 21 wherein said computer in processing said digital pixel data further comprises:

means for selecting a second pixel intensity threshold, greater in intensity than said grayscale threshold valve, for the segmentation of said pixel data into third and fourth groups, said second pixel intensity threshold selected so that the total pixel area of said fourth group is substantially equal to a pre-determined expected total image area corresponding to the highlight of said first light source illuminating said eye;

means for grouping contiguously related individual pixels from said fourth group into a second set of respective pixel blobs;

means for selecting from said second set a pixel blob corresponding to a first highlight image of said first light source; and

means for comparing the relative positions of said pixel blob corresponding to said pupil image and said pixel blob corresponding to said first highlight image to determine said point of regard.

23. The system of claim 22 wherein said computer in processing said digital pixel data further comprises:

means for determining the relative position of said pupil image relative to said first highlight image in the image coordinate space; and

means for mapping said relative position of said pupil image in the image coordinate space into a point of regard in the display screen coordinate space.

24. The system of claim 23 further comprising a second light source mounted on said support means, said second light source illuminating said eye and creating a second radiation intensity highlight on said eye, wherein the position of said pupil image relative to a second highlight image corresponding to said second light source is determined by said computer.

25. The system of claim 22 wherein said first light source is a light-emitting diode.

26. The system of claim 22 further comprising a speech synthesizer coupled to said computer.

27. The system of claim 20 wherein all components of said system including a power supply can be carried on one's person.

28. The system of claim 20 wherein said camera and said display screen are both directed to the same eye.

29. In a vision-tracking system, a tracking method for determining a point of regard, said method comprising the steps of:

acquiring video data from a camera corresponding to a video image of a vision means, said vision means comprising a pupil, the position of said pupil corresponding to said point of regard, and said video image comprising a pupil image;

converting said video data to digital pixel data corresponding to said video image using an analog-to-digital interface coupled to said camera so as to provide said digital pixel data of a given dynamic range as a grayscale mapping of said video image;

providing an expected area value for said pupil;

determining a grayscale threshold value in accordance with said expected area value and grayscale distribution of said digital pixel data;

segregating said digital pixel data into first and second groups using said grayscale threshold value as a discriminator;

determining the position of said pupil in accordance with the segregated data of one of said first group or said second group; and

providing feedback data corresponding to said pupil position from said computer.

30. The tracking method of claim 29 wherein said step of processing said pixel data further comprises the step of grouping contiguously related individual pixels from one of said first group or said second group into a set of respective pixel blobs.

31. The tracking method of claim 30 wherein said step of processing said pixel data further comprises the step of selecting one of said pixel blobs corresponding to said pupil image.

32. The tracking method of claim 29 wherein said step of providing feedback uses a display.

33. The tracking method of claim 32 further comprising the step of mounting said camera and said display, prior to said step of acquiring video data, to fix the relative physical positions of said camera and said display.

34. The tracking method of claim 29 wherein said feedback data is provided by a display screen, said grayscale threshold value is selected so that the total pixel area of said first group is substantially equal to a pre-determined expected pupil area, and said processing by said computer further comprises the steps of:

grouping contiguously related pixels from said first group into a first set of respective pixel blobs;

selecting from said first set a pixel blob corresponding to said pupil image;

determining the position of said pupil image by a calculated value based on a property of said pixel blob as selected; and

mapping the position of said pupil image in an image coordinate space into a position in a display screen coordinate space.

35. The tracking method of claim 34 wherein said step of selecting the pixel blob from said first set comprises the steps of:

calculating a statistic for each pixel blob in said first set;

comparing said statistic for each pixel blob in said first set with an expected value corresponding to said pupil image to calculate a difference error; and

selecting from said first set the pixel blob corresponding to said pupil image where said difference error is less than a pre-determined tolerance.
 Description Submit all comments and votes
 


FIELD OF THE INVENTION

The present invention relates generally to a vision-tracking system and more particularly to an eye-tracking system that uses digital processing of an image of an eye to locate the eye's viewing direction.

DESCRIPTION OF THE PRIOR ART

Often there is a need to interact with a computer without the use of one's hands. This need may arise because one's hands are occupied while executing some task or as the result of a physical disability which prevents one from having sufficient physical control over one's hands to manipulate a traditional input device such as a keyboard. Interaction with a computer through the use of one's eyes is one way to satisfy this need. For example, where a person is afflicted with severe physical disabilities, eye movements can represent one of the few remaining motions that can be readily controlled. A physically-disabled person could interact with the computer through a system able to track and respond to the motion of one or both of his eyes.

Several prior eye-tracking systems have been built to track the motion of the eye in its viewing of objects. Earlier systems determined the eye's motion directly by physical contact with the eye, while current systems determine its motion indirectly by sensing light reflected from the eye's surface. Applications of prior eye-tracking systems have included the determination of the parts of advertising that catch a viewer's attention, and the evaluation of the layout of automotive dashboards and cockpit displays to determine their effectiveness for drivers and pilots.

Some recent eye-tracking systems have permitted a user to use the eye as a control input to a computer. In one example of such an application, a user selects words from a menu on a video screen to produce synthesized speech. This system operates by determining the intersection of the eye's line of sight with the plane of the screen to determine a so-called "point of regard", which is the point which the user is looking at on the screen and corresponds in this case to a menu selection. In other applications, however, the eye's point of regard generally corresponds to the physical point at which the eye is looking, whether on a display screen or elsewhere in three-dimensional space. The location of the point of regard is determined by the eye-tracking system and used as a control input for interactive control by the user.

Although certain prior systems permit a user to have some interactive control of a computer, these systems exhibit several disadvantages. In determining the eye's point of regard it is necessary to know the relative positions of the sensing camera, the display screen, and the user's eye. One of the more recent interactive systems fixes the position of the display screen and the sensing camera relative to one another, but not relative to the user. Thus, the user's physical position must be restrained for proper functioning. This is a disadvantage because the user's head must remain stationary for long periods of time leading to increased fatigue.

Another interactive system places the sensing camera on a helmet worn by the user. Although the camera's position relative to the user is fixed, the display screen's relative position is not. Instead, the display screen is mounted in a fixed position apart from the helmet. Therefore, an additional helmet sensor is required to track the position and orientation of the head. As a result, in this system the positions of the head and the eye must both be calculated to determine a point of regard. This second calculation for the head position increases the computational requirements of the system. In addition, both this and the aforementioned systems suffer from large physical size, lack of portability, and excessive cost.

Thus, there is a need for an eye-tracking system that will not restrict the mobility of the user, is portable, is more affordable, and avoids the additional computational overhead associated with tracking the relative positions of system components (or of the user), other than that of the eye itself.

SUMMARY OF THE INVENTION

This need is satisfied, the limitations of the prior art overcome, and other benefits realized in accordance with the principles of the present invention by a vision-tracking system for determining a point of regard. In one approach, the vision-tracking system determines the point of regard by determining the position of a pupil of a first vision means by digital image processing and then relating this position to a point of regard in the real-world of the first or a second vision means (e.g. the point of regard on a display screen or on a selected object in three-dimensional space). Although the pupil position being determined is that of a first vision means, the point of regard being determined can be that of either the first or the second vision means. The point of regard may be that of the second vision means in situations where the pupil position of the first vision means substantially corresponds to the point of regard of the second vision means. This situation occurs, for example, in a typical pair of human eyes in which the left eye's pupil position closely tracks the right eye's pupil position.

In one aspect of the present invention, the vision-tracking system comprises:

a camera means for acquiring a video image of a vision means, wherein the video image comprises a pupil image;

a frame grabber means, coupled to the camera means, for accepting video data corresponding to the video image from the camera means and converting the video data to digital pixel data;

a computer means, coupled to the frame grabber means, for processing the digital pixel data to substantially determine the position of the pupil;

a feedback means, coupled to the computer means, for accepting feedback data corresponding to the pupil position from the computer means; and

a support means, connected to the camera and feedback means, for fixing the relative physical positions of the camera and feedback means.

The processing performed by the computer means of the vision tracking system may further comprise the selection of a first pixel intensity threshold for the segmentation of the pixel data into first and second groups. This processing may also comprise the following steps:

grouping individual pixels from one of the first or second groups into a first set having at least one pixel blob (note: a blob is a region of connected pixels belonging to the same group); and

selecting from the first set one of the pixel blobs corresponding to the pupil image.

The feedback means may be a display screen, and the processing by the computer means may further comprise determining the position of the pupil image in image coordinate space, and mapping the position of the pupil image in image coordinate space into a position in display screen coordinate space.

In another aspect of the present invention, the first pixel intensity threshold is selected so that the total pixel area of the first group is substantially equal to a pre-determined expected pupil area, and the step of selecting one of the pixel blobs corresponding to the pupil image comprises the steps of:

calculating one or more statistics for each of the pixel blobs;

comparing the statistic for each pixel blob with an expected value corresponding to the pupil image to calculate a difference error; and

selecting the pixel blob corresponding to the pupil image where the difference error is less than a pre-determined tolerance.

In a further aspect of the present invention, the vision tracking system comprises at least one light source mounted on the support means that illuminates the vision means and creates a radiation intensity highlight on the vision means. In this aspect, the processing by the computer means may also comprise the steps of:

selecting a second pixel intensity threshold, greater in intensity than the first pixel intensity threshold, for the segmentation of the pixel data into third and fourth groups, the second pixel intensity threshold being selected so that the total pixel area of the fourth group is substantially equal to a predetermined expected area for all highlights of the light sources illuminating the vision means;

grouping individual pixels from the fourth group into a second set having at least one pixel blob;

selecting from the second set one of the pixel blobs corresponding to a first highlight; and comparing the relative positions of the pixel blob corresponding to the pupil image and the pixel blob corresponding to the first highlight to determine the point of regard of the vision means.

Yet another aspect of the present invention is realized in a digital vision-tracking system by a tracking method for determining a point of regard. This method comprises the steps of:

acquiring video data from a camera corresponding to a video image of a vision means having a pupil, wherein the position of the pupil corresponds to the point of regard and the video image comprises a pupil image;

converting the video data to digital pixel data corresponding to the video image using an analog-to-digital interface coupled to the camera;

processing the pixel data in a computer coupled to the analog-to-digital interface to substantially determine the position of the pupil by a processing method comprising the step of selecting a pixel intensity threshold for the segmentation of the pixel data into first and second groups; and

providing feedback data corresponding to the pupil position.

The feedback data may be provided by a display screen, and the pixel intensity threshold may be selected so that the total pixel area of the first group is substantially equal to a pre-determined expected pupil area. The processing by the computer may also further comprise the steps of:

grouping pixels from the first group into a first set having at least one pixel blob;

selecting from the first set one of the pixel blobs as corresponding to the pupil image;

determining the position of the pupil image by a calculated value based on a property of the selected pixel blob; and

mapping the position of the pupil image in image coordinate space into a position in display screen coordinate space.

An advantage of the present invention is that all system components may be carried on one's person, inclu