|
|
|
| United States Patent | 5481622 |
| Link to this page | http://www.wikipatents.com/5481622.html |
| Inventor(s) | Gerhardt; Lester A. (Clifton Park, NY);
Sabolcik; Ross M. (Austin, TX) |
| Abstract | An eye-tracking system determines the position of a user's pupil and maps
this position into a point of regard of the user on an interface device,
such as a display screen, or other real-world object by a system
comprising a camera for acquiring a video image of the pupil; a frame
grabber coupled to the camera for accepting and converting analog video
data from the camera to digital pixel data; a computer coupled to the
frame grabber for processing the digital pixel data to substantially
determine the position of the pupil; a display screen coupled to the
computer; and a support connected to the camera and display screen for
fixing the relative physical positions thereof relative to the user's
pupil. The processing performed by the computer may include the selection
of a first pixel intensity threshold for the segmentation of the digital
pixel data into first and second groups, where the total pixel area of the
first group is selected to be substantially equal to a pre-determined
value expected to correspond to the area of a user's pupil. The system may
be calibrated by the user's following a cursor on the display screen while
the system measures the pupil position for known locations of the cursor. |
|
|
|
Title Information  |
|
|
|
|
|
Drawing from US Patent 5481622 |
|
|
Eye tracking apparatus and method employing grayscale threshold values |
|
|
|
|
|
| Publication Date |
January 2, 1996 |
|
|
|
|
|
| Filing Date |
March 1, 1994 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Title Information  |
|
|
References  |
|
|
| *references marked with an asterisk below are user-added references |
|
U.S. References |
|
|
| Add a new US reference: |
| | Reference | Relevancy | Comments | Reference | Relevancy | Comments | 3236578
|      Your vote accepted [0 after 0 votes] | | 3542457
|      Your vote accepted [0 after 0 votes] | | 5325133 Adachi 351/209 Jun,1994 |      Your vote accepted [0 after 0 votes] | | 5189512 Cameron 348/838 Feb,1993 |      Your vote accepted [0 after 0 votes] | | 5109425 Lawton 382/107 Apr,1992 |      Your vote accepted [0 after 0 votes] | | 5094521 Jolson 351/210 Mar,1992 |      Your vote accepted [0 after 0 votes] | | 5093567 Staveley 250/221 Mar,1992 |      Your vote accepted [0 after 0 votes] | | 5070883 Kasahara
Dec,1991 |      Your vote accepted [0 after 0 votes] | | 5049147 Danon 606/10 Sep,1991 |      Your vote accepted [0 after 0 votes] | | 5034618 Akeel 250/559.33 Jul,1991 |      Your vote accepted [0 after 0 votes] | | 5016282 Tomono 382/117 May,1991 |      Your vote accepted [0 after 0 votes] | | 5002385 Kasahara 351/210 Mar,1991 |      Your vote accepted [0 after 0 votes] | | 4988183 Kasahara 351/210 Jan,1991 |      Your vote accepted [0 after 0 votes] | | 4852988 Velez 351/210 Aug,1989 |      Your vote accepted [0 after 0 votes] | | 4836670 Hutchinson 351/210 Jun,1989 |      Your vote accepted [0 after 0 votes] | | 4815839 Waldorf 351/210 Mar,1989 |      Your vote accepted [0 after 0 votes] | | 4748502 Friedman 348/78 May,1988 |      Your vote accepted [0 after 0 votes] | | 4648052 Friedman 348/78 Mar,1987 |      Your vote accepted [0 after 0 votes] | | 4625329 Ishikawa 382/104 Nov,1986 |      Your vote accepted [0 after 0 votes] | | 4595990 Garwin 708/141 Jun,1986 |      Your vote accepted [0 after 0 votes] | | 4102564 Michael 351/210 Jul,1978 |      Your vote accepted [0 after 0 votes] | | | | | |
|
|
|
|
U.S. References |
|
|
Foreign References |
|
|
|
|
|
|
Foreign References |
|
|
Other References |
|
|
|
|
|
|
Other References |
|
|
|
|
|
References  |
|
|
|
|
|
| Market Size |
|
Estimate the gross annual revenues of the relevant market
sector:
|
| | |
| |
|
|
| Market Share |
|
Estimate the percentage of the relevant market sector this invention will capture:
|
| | |
| |
|
|
| Reasonable Royalty |
|
What percentage of gross sales should the inventor or assignee be paid?
|
| | |
| |
|
|
|
Public's "Guesstimation" of Royalty Value
|
| Market Size | N/A | [No votes] | | x | Market Share | N/A | [No votes] | | x | Reasonable Royalty | N/A | [No votes] |
| | N/A | |
| |
|
|
|
|
|
|
|
|
|
|
|
|
Market Review  |
|
|
Technical Review  |
|
|
Claims  |
|
|
We claim:
1. A system for determining a point of regard, said system comprising:
camera means for acquiring a video image of a first vision means, said
first vision means comprising a pupil, a position of said pupil
corresponding to said point of regard, and said video image comprising a
pupil image;
frame grabber means, coupled to said camera means, for accepting video data
corresponding to said video image from said camera means and converting
said video data to digital pixel data of a given dynamic range so as to
provide said digital pixel data as a grayscale mapping of the video image;
computer means, coupled to said frame grabber means, for processing said
digital pixel data to substantially determine the position of said pupil
wherein said computer means includes:
memory means for providing an expected area value for the area of said
pupil within said video image;
means for determining a grayscale threshold value in accordance with said
expected area value and grayscale distribution of said digital pixel data;
means for segregating said digital pixel data into two separate binary
groups as first and second groups using said grayscale threshold value as
a discriminator; and
means for substantially determining the position of said pupil in
accordance with the segregated data;
feedback means, coupled to said computer means, for accepting data
corresponding to the determined position of said pupil from said computer
means and feeding back to said first vision means feedback information
representative of the determined position; and
support means, connected to said camera means and said feedback means, for
fixing the relative physical positions of said camera means and said
feedback means.
2. The system of claim 1 wherein said determining means of said computer
means further comprises:
means for grouping contiguously related individual pixels from one of said
first group or said second group into a first set of representative pixel
blobs; and
means for selecting from said first set of pixel blobs a pixel blob
corresponding to said pupil image.
3. The system of claim 2 wherein said feedback means is a display screen;
and
said computer means further comprises:
means for determining the position of said pupil image in an image
coordinate space; and
means for mapping the position of said pupil image in the image coordinate
space into a position in a display screen coordinate space.
4. The system of claim 3 further comprising a calibration means for
determining a mapping function between the image coordinate space and the
display screen coordinate space, said calibration means comprising:
means for placing a cursor at a pre-determined position on said display
screen;
means for orienting said point of regard to substantially coincide with
said pre-determined position;
means for determining the position of the pupil image of said vision means
in the image coordinate space;
means for repeating said steps of placing a cursor, orienting said point of
regard, and determining the pupil image position a plurality of times to
generate a plurality of cursor position-pupil image position data pairs;
and
means for fitting a curve to said data pairs to determine said mapping
function.
5. The system of claim 3 wherein said position of said pupil image in image
coordinate space is determined based on the centroid of said pixel blob
selected as corresponding to said pupil image.
6. The system of claim 3 wherein said support means substantially fixes the
relative physical position of said vision means relative to said camera
and display screen.
7. The system of claim 2 wherein said computer means in selecting said
pixel blob corresponding to said pupil image further comprises:
means for calculating a statistic for each pixel blob of said first set;
means for comparing said statistic of each pixel blob with an expected
value corresponding to said pupil image to calculate a difference error;
and
means for selecting said pixel blob corresponding to said pupil image where
said difference error is less than a pre-determined tolerance.
8. The system of claim 7 wherein the computer means further comprises:
a counter that is incremented after each failure to select a pixel blob
corresponding to said pupil image for the current video image being
processed by said computer means; and
adjustment means for relaxing said tolerance by a pre-determined quantity
after said counter reaches a pre-determined upper limit.
9. The system of claim 8 wherein:
said counter is decremented after each successful selection of a pixel blob
corresponding to said pupil image for the current video image being
processed by said computer means; and
said adjustment means tightens said tolerance to a pre-determined baseline
value after said counter reaches a pre-determined lower limit.
10. The system of claim 7 wherein said statistic for each blob is an area,
length-to-width ratio, or a centroid.
11. The system of claim 7 wherein said expected value corresponding to said
pupil image is a running average of a plurality of statistics calculated
for previously-selected pixel blobs corresponding to pupil images for
previous video images.
12. The system of claim 2 further comprising at least one light source
mounted on said support means, each light source illuminating said vision
means and creating an associated radiation intensity highlight on said
vision means.
13. The system of claim 12 wherein said computer means in processing said
digital pixel data further comprises:
means for selecting a second pixel intensity threshold, greater in
intensity than said grayscale threshold value, for the segmentation of
said pixel data into third and fourth groups, said second pixel intensity
threshold selected so that the total pixel area of said fourth group is
substantially equal to a pre-determined expected area for all highlights
of said light sources illuminating said vision means;
means for grouping contiguously related individual pixels from said fourth
group into a second set of respective pixel blobs;
means for selecting from said second set of pixel blobs a pixel blob
corresponding to a first highlight; and
means for comparing the relative positions of said pixel blob corresponding
to said pupil image and said pixel blob corresponding to said first
highlight to determine said point of regard.
14. The system of claim 13 wherein said feedback means is a display screen
and said computer means in processing said pixel data further comprises:
means for determining the relative position of said pupil image relative to
said first highlight in an image coordinate space; and
means for mapping said relative position of said pupil image in the image
coordinate space into a point of regard in a display screen coordinate
space.
15. The system of claim 14 further comprising a second light source mounted
on said support means, said second light source illuminating said vision
means and creating a second radiation intensity highlight on said vision
means.
16. The system of claim 15 wherein the position of said pupil image
relative to said second highlight is determined by said computer means.
17. The system of claim 1 wherein said computer means processes data
corresponding to a plurality of video images to substantially determine
the position of said pupil.
18. An eye-tracking system for determining a point of regard on a display,
said eye-tracking system comprising:
a display screen providing said display and displaying information
corresponding to said point of regard;
a camera for acquiring a video image of an eye having a pupil, the position
of said pupil corresponding to said point of regard, and said video image
comprising a pupil image;
a frame grabber, coupled to said camera, for accepting video data
corresponding to said video image from said camera and converting said
video data to digital pixel data of a given dynamic range so as to provide
said digital pixel data as a grayscale mapping of said video image;
a computer, coupled between said frame grabber and said display screen, for
processing said digital pixel data to substantially determine said point
of regard on said display screen wherein said computer includes:
memory means for providing an expected area value for the area of said
pupil within said video image;
means for determining a grayscale threshold value in accordance with said
expected area value and grayscale distribution of said digital pixel data;
means for segregating said digital pixel data into two separate binary
groups as first and second groups using said grayscale threshold value as
a discriminator; and
means for substantially determining the position of said pupil in
accordance with the segregated data; and
a support, connected to said camera and said display screen, for
substantially fixing the physical positions of said camera means and said
display screen relative to said eye.
19. The eye-tracking system of claim 18 wherein said computer in processing
said digital pixel data further comprises:
means for grouping contiguously related pixels from said first group into a
first set of respective pixel blobs;
means for selecting from said first set a pixel blob corresponding to said
pupil image;
means for determining the position of said pupil image by a calculated
value based on a property of said selected pixel blob; and
means for mapping the position of said pupil image in an image coordinate
space into a position in a display screen coordinate space.
20. The eye-tracking system of claim 19 wherein said computer means in
selecting the pixel blob from said first set further comprises:
means for calculating a statistic for each pixel blob in said first set;
means for comparing said statistic for each pixel blob in said first set
with an expected value corresponding to said pupil image to calculate a
difference error; and
means for selecting from said first set the pixel blob corresponding to
said pupil image where said difference error is less than a pre-determined
tolerance.
21. The eye-tracking system of claim 20 further comprising at least a first
light source mounted on said support, said first light source illuminating
said eye and creating a radiation intensity highlight on the corneal
surface of said eye.
22. The eye-tracking system of claim 21 wherein said computer in processing
said digital pixel data further comprises:
means for selecting a second pixel intensity threshold, greater in
intensity than said grayscale threshold valve, for the segmentation of
said pixel data into third and fourth groups, said second pixel intensity
threshold selected so that the total pixel area of said fourth group is
substantially equal to a pre-determined expected total image area
corresponding to the highlight of said first light source illuminating
said eye;
means for grouping contiguously related individual pixels from said fourth
group into a second set of respective pixel blobs;
means for selecting from said second set a pixel blob corresponding to a
first highlight image of said first light source; and
means for comparing the relative positions of said pixel blob corresponding
to said pupil image and said pixel blob corresponding to said first
highlight image to determine said point of regard.
23. The system of claim 22 wherein said computer in processing said digital
pixel data further comprises:
means for determining the relative position of said pupil image relative to
said first highlight image in the image coordinate space; and
means for mapping said relative position of said pupil image in the image
coordinate space into a point of regard in the display screen coordinate
space.
24. The system of claim 23 further comprising a second light source mounted
on said support means, said second light source illuminating said eye and
creating a second radiation intensity highlight on said eye, wherein the
position of said pupil image relative to a second highlight image
corresponding to said second light source is determined by said computer.
25. The system of claim 22 wherein said first light source is a
light-emitting diode.
26. The system of claim 22 further comprising a speech synthesizer coupled
to said computer.
27. The system of claim 20 wherein all components of said system including
a power supply can be carried on one's person.
28. The system of claim 20 wherein said camera and said display screen are
both directed to the same eye.
29. In a vision-tracking system, a tracking method for determining a point
of regard, said method comprising the steps of:
acquiring video data from a camera corresponding to a video image of a
vision means, said vision means comprising a pupil, the position of said
pupil corresponding to said point of regard, and said video image
comprising a pupil image;
converting said video data to digital pixel data corresponding to said
video image using an analog-to-digital interface coupled to said camera so
as to provide said digital pixel data of a given dynamic range as a
grayscale mapping of said video image;
providing an expected area value for said pupil;
determining a grayscale threshold value in accordance with said expected
area value and grayscale distribution of said digital pixel data;
segregating said digital pixel data into first and second groups using said
grayscale threshold value as a discriminator;
determining the position of said pupil in accordance with the segregated
data of one of said first group or said second group; and
providing feedback data corresponding to said pupil position from said
computer.
30. The tracking method of claim 29 wherein said step of processing said
pixel data further comprises the step of grouping contiguously related
individual pixels from one of said first group or said second group into a
set of respective pixel blobs.
31. The tracking method of claim 30 wherein said step of processing said
pixel data further comprises the step of selecting one of said pixel blobs
corresponding to said pupil image.
32. The tracking method of claim 29 wherein said step of providing feedback
uses a display.
33. The tracking method of claim 32 further comprising the step of mounting
said camera and said display, prior to said step of acquiring video data,
to fix the relative physical positions of said camera and said display.
34. The tracking method of claim 29 wherein said feedback data is provided
by a display screen, said grayscale threshold value is selected so that
the total pixel area of said first group is substantially equal to a
pre-determined expected pupil area, and said processing by said computer
further comprises the steps of:
grouping contiguously related pixels from said first group into a first set
of respective pixel blobs;
selecting from said first set a pixel blob corresponding to said pupil
image;
determining the position of said pupil image by a calculated value based on
a property of said pixel blob as selected; and
mapping the position of said pupil image in an image coordinate space into
a position in a display screen coordinate space.
35. The tracking method of claim 34 wherein said step of selecting the
pixel blob from said first set comprises the steps of:
calculating a statistic for each pixel blob in said first set;
comparing said statistic for each pixel blob in said first set with an
expected value corresponding to said pupil image to calculate a difference
error; and
selecting from said first set the pixel blob corresponding to said pupil
image where said difference error is less than a pre-determined tolerance. |
|
|
|
|
Claims  |
|
|
Description  |
|
|
FIELD OF THE INVENTION
The present invention relates generally to a vision-tracking system and
more particularly to an eye-tracking system that uses digital processing
of an image of an eye to locate the eye's viewing direction.
DESCRIPTION OF THE PRIOR ART
Often there is a need to interact with a computer without the use of one's
hands. This need may arise because one's hands are occupied while
executing some task or as the result of a physical disability which
prevents one from having sufficient physical control over one's hands to
manipulate a traditional input device such as a keyboard. Interaction with
a computer through the use of one's eyes is one way to satisfy this need.
For example, where a person is afflicted with severe physical
disabilities, eye movements can represent one of the few remaining motions
that can be readily controlled. A physically-disabled person could
interact with the computer through a system able to track and respond to
the motion of one or both of his eyes.
Several prior eye-tracking systems have been built to track the motion of
the eye in its viewing of objects. Earlier systems determined the eye's
motion directly by physical contact with the eye, while current systems
determine its motion indirectly by sensing light reflected from the eye's
surface. Applications of prior eye-tracking systems have included the
determination of the parts of advertising that catch a viewer's attention,
and the evaluation of the layout of automotive dashboards and cockpit
displays to determine their effectiveness for drivers and pilots.
Some recent eye-tracking systems have permitted a user to use the eye as a
control input to a computer. In one example of such an application, a user
selects words from a menu on a video screen to produce synthesized speech.
This system operates by determining the intersection of the eye's line of
sight with the plane of the screen to determine a so-called "point of
regard", which is the point which the user is looking at on the screen and
corresponds in this case to a menu selection. In other applications,
however, the eye's point of regard generally corresponds to the physical
point at which the eye is looking, whether on a display screen or
elsewhere in three-dimensional space. The location of the point of regard
is determined by the eye-tracking system and used as a control input for
interactive control by the user.
Although certain prior systems permit a user to have some interactive
control of a computer, these systems exhibit several disadvantages. In
determining the eye's point of regard it is necessary to know the relative
positions of the sensing camera, the display screen, and the user's eye.
One of the more recent interactive systems fixes the position of the
display screen and the sensing camera relative to one another, but not
relative to the user. Thus, the user's physical position must be
restrained for proper functioning. This is a disadvantage because the
user's head must remain stationary for long periods of time leading to
increased fatigue.
Another interactive system places the sensing camera on a helmet worn by
the user. Although the camera's position relative to the user is fixed,
the display screen's relative position is not. Instead, the display screen
is mounted in a fixed position apart from the helmet. Therefore, an
additional helmet sensor is required to track the position and orientation
of the head. As a result, in this system the positions of the head and the
eye must both be calculated to determine a point of regard. This second
calculation for the head position increases the computational requirements
of the system. In addition, both this and the aforementioned systems
suffer from large physical size, lack of portability, and excessive cost.
Thus, there is a need for an eye-tracking system that will not restrict the
mobility of the user, is portable, is more affordable, and avoids the
additional computational overhead associated with tracking the relative
positions of system components (or of the user), other than that of the
eye itself.
SUMMARY OF THE INVENTION
This need is satisfied, the limitations of the prior art overcome, and
other benefits realized in accordance with the principles of the present
invention by a vision-tracking system for determining a point of regard.
In one approach, the vision-tracking system determines the point of regard
by determining the position of a pupil of a first vision means by digital
image processing and then relating this position to a point of regard in
the real-world of the first or a second vision means (e.g. the point of
regard on a display screen or on a selected object in three-dimensional
space). Although the pupil position being determined is that of a first
vision means, the point of regard being determined can be that of either
the first or the second vision means. The point of regard may be that of
the second vision means in situations where the pupil position of the
first vision means substantially corresponds to the point of regard of the
second vision means. This situation occurs, for example, in a typical pair
of human eyes in which the left eye's pupil position closely tracks the
right eye's pupil position.
In one aspect of the present invention, the vision-tracking system
comprises:
a camera means for acquiring a video image of a vision means, wherein the
video image comprises a pupil image;
a frame grabber means, coupled to the camera means, for accepting video
data corresponding to the video image from the camera means and converting
the video data to digital pixel data;
a computer means, coupled to the frame grabber means, for processing the
digital pixel data to substantially determine the position of the pupil;
a feedback means, coupled to the computer means, for accepting feedback
data corresponding to the pupil position from the computer means; and
a support means, connected to the camera and feedback means, for fixing the
relative physical positions of the camera and feedback means.
The processing performed by the computer means of the vision tracking
system may further comprise the selection of a first pixel intensity
threshold for the segmentation of the pixel data into first and second
groups. This processing may also comprise the following steps:
grouping individual pixels from one of the first or second groups into a
first set having at least one pixel blob (note: a blob is a region of
connected pixels belonging to the same group); and
selecting from the first set one of the pixel blobs corresponding to the
pupil image.
The feedback means may be a display screen, and the processing by the
computer means may further comprise determining the position of the pupil
image in image coordinate space, and mapping the position of the pupil
image in image coordinate space into a position in display screen
coordinate space.
In another aspect of the present invention, the first pixel intensity
threshold is selected so that the total pixel area of the first group is
substantially equal to a pre-determined expected pupil area, and the step
of selecting one of the pixel blobs corresponding to the pupil image
comprises the steps of:
calculating one or more statistics for each of the pixel blobs;
comparing the statistic for each pixel blob with an expected value
corresponding to the pupil image to calculate a difference error; and
selecting the pixel blob corresponding to the pupil image where the
difference error is less than a pre-determined tolerance.
In a further aspect of the present invention, the vision tracking system
comprises at least one light source mounted on the support means that
illuminates the vision means and creates a radiation intensity highlight
on the vision means. In this aspect, the processing by the computer means
may also comprise the steps of:
selecting a second pixel intensity threshold, greater in intensity than the
first pixel intensity threshold, for the segmentation of the pixel data
into third and fourth groups, the second pixel intensity threshold being
selected so that the total pixel area of the fourth group is substantially
equal to a predetermined expected area for all highlights of the light
sources illuminating the vision means;
grouping individual pixels from the fourth group into a second set having
at least one pixel blob;
selecting from the second set one of the pixel blobs corresponding to a
first highlight; and comparing the relative positions of the pixel blob
corresponding to the pupil image and the pixel blob corresponding to the
first highlight to determine the point of regard of the vision means.
Yet another aspect of the present invention is realized in a digital
vision-tracking system by a tracking method for determining a point of
regard. This method comprises the steps of:
acquiring video data from a camera corresponding to a video image of a
vision means having a pupil, wherein the position of the pupil corresponds
to the point of regard and the video image comprises a pupil image;
converting the video data to digital pixel data corresponding to the video
image using an analog-to-digital interface coupled to the camera;
processing the pixel data in a computer coupled to the analog-to-digital
interface to substantially determine the position of the pupil by a
processing method comprising the step of selecting a pixel intensity
threshold for the segmentation of the pixel data into first and second
groups; and
providing feedback data corresponding to the pupil position.
The feedback data may be provided by a display screen, and the pixel
intensity threshold may be selected so that the total pixel area of the
first group is substantially equal to a pre-determined expected pupil
area. The processing by the computer may also further comprise the steps
of:
grouping pixels from the first group into a first set having at least one
pixel blob;
selecting from the first set one of the pixel blobs as corresponding to the
pupil image;
determining the position of the pupil image by a calculated value based on
a property of the selected pixel blob; and
mapping the position of the pupil image in image coordinate space into a
position in display screen coordinate space.
An advantage of the present invention is that all system components may be
carried on one's person, inclu | | |