WikiPatents - Community Patent Review
Create Free Account  |  License or Sell Your Patent  |  WikiPatents Marketplace  |  WikiPatents Blog
Username:  Password:  
    
Advanced Search
Image processing and analysis of individual nucleic acid molecules    

Custom CD of patents similar to US5720928 : Image processing and analysis of individual nucleic acid molecules - $19.95
United States Patent5720928   
Link to this pagehttp://www.wikipatents.com/5720928.html
Inventor(s)Schwartz; David C. (New York, NY)
AbstractA method for observing and determining the size of individual molecules and for determining the weight distribution of a sample containing molecules of varying size, which involves placing a deformable or nondeformable molecule in a medium, subjecting the molecule to an external force, thereby causing conformational and/or positional changes, and then measuring these changes. Preferred ways to measure conformational and positional changes include: (1) determining the rate at which a deformable molecule returns to a relaxed state after termination of the external force, (2) determining the rate at which a molecule becomes oriented in a new direction when the direction of the perturbing force is changed, (3) determining the rate at which a molecule rotates, (4) measuring the length of a molecule, particularly when it is at least partially stretched, or (5) measuring at least one diameter of a spherical or ellipsoidal molecule. Measurements of relaxation, reorientation, and rotation rates, as well as length and diameter can be made using a light microscope connected to an image processor. Molecule relaxation, reorientation and rotation also can be determined using a microscope combined with a spectroscopic device. The invention is particularly useful for measuring polymer molecules, such as nucleic acids, and can be used to determine the size and map location of restriction digests. Breakage of large polymer molecules mounted on a microscope slide is prevented by condensing the molecules before mounting and unfolding the molecules after they have been placed in a matrix.
   














 Title Information Submit all comments and votes
 
Patent Text Patent PDF Print Page Summary File History
Plain text PDF images Print Summary File History
Inventor     Schwartz; David C. (New York, NY)
Owner/Assignee     New York University (New York, NY)
Patent assignment
All assignments
Company News
Publication Date     February 24, 1998
Application Number     08/415,710
PAIR File History     Application Data   Transaction History
Image File Wrapper   Patent Term   Fees
Litigation
Filing Date     April 3, 1995
US Classification     422/186 422/55 422/58 422/99 422/129 435/6
Int'l Classification     B01J 019/08 G01N 021/00 G01N 031/22 B01L 003/00
Examiner     Jones; W. Gary
Assistant Examiner     Atzel; Amy
Attorney/Law Firm     Pennie & Edmonds LLP
Address
Parent Case     This application is a continuation-in-part of application Ser. No. 08/162,379, filed Dec. 7, 1993, now U.S. Pat. No. 5,599,664, which in turn is a continuation of application Ser. No. 07/333,531, filed Apr. 5, 1989, (abandoned). This application is also a continuation-in-part of application Ser. No. 08/128,996, filed Sep. 30, 1993, which is a continuation-in-part of: (a) application Ser. No. 07/879,551, filed May 4, 1992, now U.S. Pat. No. 5,405,519, which in turn, is a continuation of application Ser. No. 07/244,897, filed Sep. 15, 1988, (abandoned); and (b) a continuation-in-part of application Ser. No. 07/333,531, filed Apr. 5, 1989, (abandoned), and application Ser. No. 07/244,897, filed Sep. 15, 1988, (abandoned). The entire contents of each of the foregoing applications is incorporated by reference herein in its entirety.
Priority Data    
USPTO Field of Search     435/6 536/23.1 422/55 422/58 422/99 422/129 422/186
Patent Tags     image processing analysis individual nucleic acid molecules
   
Enter a comma (,) or semicolon (;) between multiple tag words/phrases.
Describe this patent:
 Amusing   
 Clever   
 Complex   
 Efficient   
 Historic   
 Important   
 Innovative   
 Interesting   
 Practical   
 Simple   
[no votes]
Patent WIKI

Share information and news about this patent, including information and news about the technology, inventors, company, ligation and licensing.

 References Submit all comments and votes
 
*references marked with an asterisk below are user-added references
 U.S. References
 
Add a new US reference:  
ReferenceRelevancyCommentsReferenceRelevancyComments
5380833
Urdea
536/22.1
Jan,1995

[0 after 0 votes]
5314829
Coles
436/165
May,1994

[0 after 0 votes]
5079169
Chu

Jan,1992

[0 after 0 votes]
5059294
Lizardi
204/458
Oct,1991

[0 after 0 votes]
4870004
Conroy
204/461
Sep,1989

[0 after 0 votes]
4767700
Wallace
435/6
Aug,1988

[0 after 0 votes]
4737251
Carle
204/458
Apr,1988

[0 after 0 votes]
4695548
Cantor
435/179
Sep,1987

[0 after 0 votes]
4473452
Cantor
204/458
Sep,1984

[0 after 0 votes]
 Foreign References
 Other References
 Market Review Submit all comments and votes
   
Market Size
Estimate the gross annual revenues of the relevant market sector:
> $10B
$5B - $10B
$2B - $5B
$500M - $2B
$100M - $500M
$10M - $100M
$1M - $10M
$500K - $1M
$100K - $500K
< $100K
[No votes]
$0
 
$0   $2.5B   $5B   $7.5B   $10B

[0 market size comments]
Market Share
Estimate the percentage of the relevant market sector this invention will capture:
75% - 100%
50% - 74.99%
25% - 49.99%
10 - 24.99%
5 - 9.99%
2 - 4.99%
1 - 1.99%
< 1%
[No votes]
0.0%
 
0%   25%   50%   75%   100%

[0 market share comments]
Reasonable Royalty
What percentage of gross sales should the inventor or assignee be paid?
75% - 100%
50% - 74.99%
25% - 49.99%
10 - 24.99%
5 - 9.99%
2 - 4.99%
1 - 1.99%
< 1%
[No votes]
0.0%
 
0%   25%   50%   75%   100%

[0 reasonable royalty comments]
Public's "Guesstimation" of Royalty Value
Market SizeN/A[No votes]
xMarket ShareN/A[No votes]
xReasonable RoyaltyN/A[No votes]

N/A

[0 Guesstimation of Royalty Value Comments]
License Availablity
If you are NOT the owner or assignee, answer here:
Yes, license is available for purchase

No, license is not currently available



[No votes]
[0 license availability comments]
License Availablity
If you ARE the owner or assignee, answer here:
Yes, license is available for purchase

No, license is not currently available



[No votes]
[0 owner/assignee comments]
Competitive Advantage
Does this invention have a significant competitive advantage over similar technologies?
Yes

No



[No votes]
Most helpful competitive advantage comment
[No comments]

[0 competitive advantage comments]
Commercial Alternatives
Are there viable commercial alternatives for this invention?
Yes

No



[No votes]
Most helpful commercial alternative comment
[No comments]

[0 commercial alternatives comments]
 Technical Review Submit all comments and votes
 Claims Submit all comments and votes
 


What is claimed is:

1. A system for characterizing a nucleic acid molecule, comprising:

(a) the nucleic acid molecule elongated and fixed onto a planar surface so that the nucleic acid molecule remains accessible for enzymatic reactions and/or hybridization reactions;

(b) an enzyme fixed onto the planar surface;

(c) a divalent cation cofactor included on the planar surface, wherein the cofactor is required for activity of the fixed enzyme and is reversibly chelated with a divalent cation chelator to protect the fixed enzyme from being activated; and

(b) a device for imaging the elongated and fixed nucleic acid molecule to obtain its physical characteristics.

2. The system of claim 1 in which the enzyme is a restriction endonuclease, an exonuclease, a polymerase, a ligase or a helicase.

3. The system of claim 1 in which the cofactor is released upon exposure to a specific wavelength of light, and the fixed enzyme is activated in the location of the exposure.
 Description Submit all comments and votes
 


1. FIELD OF THE INVENTION

This invention relates to methods and compositions for manipulating and characterizing individual polymer molecules, especially nucleic acid molecules, according to, for example, size and/or nucleotide sequence.

2. BACKGROUND OF THE INVENTION

The analysis of nucleic acid molecules at the genome level is an extremely complex endeavor which requires accurate, rapid characterization of large numbers of often very large nucleic acid molecules via high throughput DNA mapping and sequencing. The construction of physical maps, and ultimately of nucleotide sequences, for eukaryotic chromosomes currently remains laborious and difficult. This is due, in part, to the fact that current procedures for mapping and sequencing DNA were originally designed to analyze nucleic acid at the gene, rather than at the genome, level (Chumakov, I. et al., 1992, Nature 359:380; Maier, E. et al., 1992, Nat. Genet. 1:273).

Traditionally, the separation and molecular weight distribution of nucleic acid molecules has been accomplished, most commonly, via gel electrophoresis (see, for example, Freifelder, 1976, Physical Biochemistry, W. H. Freeman), which involves moving a population of molecules through an appropriate medium, such that the molecules are separated according to size. Such electrophoretic methods offer an acceptable level of size resolution, but, especially for purposes of high throughput mapping, suffer from a number of setbacks.

For example, such techniques require the preparation of DNA in bulk amounts. First, with respect to genome mapping, such preparative procedures may require sources such as genomic DNA or DNA from yeast artificial chromosomes (YACs; Burke, D. T. et al., 1987, Science 236:806; Barlow, et al., 1987, Trends in Genetics 3:167-177; Campbell et al., 1991, Proc. Natl. Acad. Sci. U.S.A. 88:5744). Obtaining quantities of DNA from these sources which is sufficient for detailed analyses, such as restriction mapping, is time consuming and often impractical. Second, because populations of molecules of like size migrate through the medium at the same rate, it is impossible to separate individual molecules from within a sample of particles by utilizing such a technique. Additionally, while it is possible to resolve a wide size range of DNA molecule populations gel electrophoresis techniques, optimal techniques can often require the use of several different gel matrix compositions and/or alternative electrophoresis procedures, depending upon the sizes of the molecules of interest. For example, the separation of large molecules of DNA may require such techniques as pulse field electrophoresis (see, e.g., U.S. Pat. No. 4,473,452). Further, standard gel electrophoresis techniques involve the separation of populations of molecules according to size, making it impossible to separate individual molecules within a polydisperse mixture. In summary, therefore, the accurate, rapid, practical, high throughput separation of individual DNA molecules, especially those of highly disparate sizes, which would often be required for genomic mapping purposes, is impossible via gel electrophoresis.

Techniques have been reported for the visualization of be single nucleic acid molecules and complexes. Such techniques include such fluorescence microscopy-based techniques as fluorescence in situ hybridization (FISH; Manuelidis, L. et al., 1982, J. Cell. Biol. 95:619; Lawrence, C. A. et al., 1988, Cell 52:51; Lichter, P. et al., 1990, Science 247:64; Heng, H. H. Q. et al., 1992, Proc. Natl. Acad. Sci. U.S.A. 89:9509; van den Engh, G. et al., 1992, Science 257:1410) and those reported by, for example, Yanagida (Yanagida, M. et al., 1983, Cold Spring Harbor Symp. Quantit. Biol. 47:177; Matsumoto, S. et al., 1981, J. Mol. Biol. 132:501-516); tethering techniques, whereby one or both ends of a nucleic acid molecule are anchored to a surface (U.S. Pat. Nos. 5,079,169; 5,380,833; Perkins, T. T. et al., 1994, Science 264:819; Bensimon, A. et al., 1994, Science 265:2096); and scanning probe microscopy-based visualization techniques, including scanning tunneling microscopy and atomic force microscopy techniques (see, e.g., Karrasch, S. et al., 1993, Biophysical J. 65:2437-2446; Hansma, H. G. et al., 1993, Nucleic Acids Research 21:505-512; Bustamante, C. et al., 1992, Biochemistry 31:22-26; Lyubchenko, Y. L. et al., 1992, J. Biomol. Struct. and Dyn. 10:589-606; Allison, D. P. et al., 1992, Proc. Natl. Acad. Sci. U.S.A. 89:10129-10133; Zenhausern, F. et al., 1992, J. Struct. Biol. 108:69-73).

While single molecule techniques offer the potential advantage of an ordering capability which gel electrophoresis lacks, none of the current single molecule techniques can be used, on a practical level, as, for example, high resolution genomic mapping tools. The molecules described by Yanagida (Yanagida, M. et al., 1983, Cold Spring Harbor Symp. Quantit. Biol. 47:177; Matsumoto, S. et al., 1981, J. Mol. Biol. 132:501-516), for example, were visualized, primarily free in solution, in a manner which would make any practical mapping impossible. Further, while the FISH technique offers the advantage of using only a limited number of immobilized fragments, usually chromosomes, it is not possible to achieve the sizing resolution available with gel electrophoresis.

Single molecule tethering techniques, as listed above, generally involve individual nucleic acid molecules which have, first, been immobilized onto a surface via one or both of their ends, and, second, have been manipulated such that the molecules are stretched out. These techniques, however, are not suited to genome analysis. First, the steps involved are time consuming and can only be accomplished with a small number of molecules per procedure. Further, in general, the tethered molecules cannot be stored and used again.

A combination of the sizing capability of gel electrophoresis and the ordering capability of certain single molecule techniques such as, for example, FISH, would, therefore, be extremely useful for genomic analyses such as genomic mapping. Such analyses would be further aided by the ability to manipulate the single molecules being analyzed. Additionally, an ability to reuse the nucleic acid samples of interest would increase the efficiency and throughput capability of the analysis. Currently, however, there exists no single technology which embodies, in a practical manner, each of these elements.

Citation of documents herein is not intended as an admission that any of the documents cited herein is pertinent prior art, or an admission that the cited documents are considered material to the patentability of the claims of the present application. All statements as to the date or representations as to the contents of these documents are based on the information available to the applicant and does not constitute any admission as to the correctness of the dates or contents of these documents.

3. SUMMARY OF THE INVENTION

The present invention relates to methods and compositions for characterizing and manipulating individual nucleic acid molecules, including mammalian chromosome-sized individual nucleic acid molecules. The methods and compositions described herein can be utilized for the accurate, rapid, high throughput analysis of nucleic acid molecules at the genome level, and may, for example, include the construction of high resolution physical maps, referred to herein as "optical mapping", and the detection of specific nucleotide sequences within a genome, referred to herein as "optical sequencing."

Specifically, methods are described whereby single nucleic acid molecules, including mammalian chromosome-sized DNA molecules, are elongated and fixed in a rapid, controlled and reproducible manner which allows for the nucleic acid molecules to retain their biological function and, further, makes rapid analysis of the molecules possible. In one embodiment of such a procedure, the molecules are elongated in a flow of a molten or unpolymerized gel composition. The elongated molecules become fixed as the gel composition becomes hardened or polymerized. In such an embodiment, the gel composition is preferably an agarose gel composition. The elongated molecules became fixed as the agarose.

In a second embodiment, the single nucleic acid molecules are elongated and fixed in a controllable manner directly onto a solid, planar surface. This solid, planar surface contains a positive charge density which has been controllably modified such that the single nucleic acid molecules will exhibit an optimal balance between the critical parameters of nucleic acid elongation state, degree of relaxation stability and biological activity. Further, methods, compositions and assays are described by which such an optimal balance can precisely and reproducibly be achieved.

In a third embodiment, the single nucleic acid molecules are elongated via flow-based techniques. In such an embodiment, a single nucleic acid molecule is elongated, manipulated (via, for example, a regio-specific restriction digestion), and/or analyzed in a laminar flow elongation device. The present invention further relates to and describes such a laminar flow elongation device.

The elongated, individual nucleic acid molecules can then be utilized in a variety of ways which have applications for the analysis of nucleic acid at the genome level. For example, such nucleic acid molecules may be used to generate ordered, high resolution single nucleic acid molecule restriction maps. This method is referred to herein as "optical mapping" or "optical restriction mapping". Additionally, methods are presented whereby specific nucleotide sequences present within the elongated nucleic acid molecules can be identified. Such methods are referred to herein as "optical sequencing". The optical mapping and optical sequencing techniques can be used independently or in combination on the same individual nucleic acid molecules.

Still further, the elongated nucleic acid molecules of the invention can be manipulated using any standard procedure. For example, the single nucleic acid molecules may be manipulated by any enzymes which act upon nucleic acid molecules, and which may include, but are not limited to, restriction endonucleases, exonucleases, polymerases, ligases or helicases.

Additionally, methods are also presented for the imaging and sizing of the elongated single nucleic acid molecules. These imaging techniques may, for example, include the use of fluorochromes, microscopy and/or image processing computer software and hardware. Such sizing methods include both static and dynamic measuring techniques.

Still further, high throughput methods for utilizing such single nucleic acid molecules in genome analysis are presented. In one embodiment of such high throughput methods, rapid optical mapping approaches are described for the creation of high-resolution restriction maps. In such an embodiment, single nucleic acid molecules are elongated, fixed and gridded to high density onto a solid surface. These molecules can then be digested with appropriate restriction enzymes for the map construction. In an alternative embodiment, the single nucleic acid molecules can be elongated, fixed and gridded at high density onto a solid surface and utilized in a variety of optical sequencing-based diagnostic methods. In addition to speed, such diagnostic grids can be reused. Further, the high throughput and methods can be utilized to rapidly generate information derived from procedures which combine optical mapping and optical sequencing methods.

The present invention is based on the development of techniques, including high throughput techniques, which reproducibly and rapidly generate populations of individual, elongated nucleic acid molecules that not only retain biological function but are accessible to manipulation and make possible rapid genome analysis.

4. BRIEF DESCRIPTION OF THE FIGURES

FIG. 1. Schematic drawing of an electrophoretic microscopy chamber which is specifically adapted to fluorescence microscopy studies.

FIG. 2. Partly schematic and partly block diagram showing an interconnection of exemplary chamber electrodes in an electrophoresis chamber which may be used in the present invention.

FIGS. 3A-3B. Schematic illustration of the instrumentation used in the microscopic study of DNA molecules in a medium according to this invention, and a more detailed diagram showing the instrumentation for measuring birefringence.

FIGS. 4A-4I. Depicted herein are the DNA molecular conformational and positional changes when G bacteriophage molecules are subject to two sequential electric fields in different directions.

FIGS. 5A-5J. Depicted herein are the DNA molecular conformational and positional changes during relaxation of G bacteriophage DNA molecules after electrophoresis for 600 seconds, as revealed by the fluorescence microscopy experiments described in Example 4.

FIG. 6. Optical mapping. DNA molecules and restriction enzyme are dissolved in molten agarose without magnesium ions. The DNA molecules are elongated by the flow generated when the mixture is sandwiched between a slide and coverslip. Stretched molecules are fixed in place by agarose gelation. Magnesium ion diffusion into the gel triggers digestion and cleavage sites appear as growing gaps as the molecular fragments relax.

FIGS. 7A-7D. Histograms of optical mapping. Not 1 cut frequencies, showing variation with molecule size and number of cut sites, are indicated. Cutting frequencies were scored by counting the number of Not 1 cuts in nucleic acid molecules present in microscope fields. Such fields typically contain approximately 3-5 molecules. Because approximately half the fields showed no Not 1 cutting and were, therefore, not scored, this underestimates the number of uncut molecules. The expected number of cut sites and chromosome sizes: 7A: Ch. 1(240 kb) 1; 7B.: Ch. V and VIII(595 kb) 3 and 2; 7C: Ch. XI(675 kb) 2; and 7D: Ch. XIII and XVI(950 and 975 kb) 1. Chromosome pairs V and VIII, and XIII and XVI were present on the same mount.

FIGS. 8A-8H. Depicted are some restriction fragment relaxation modes for a singly cleaved, gel-fixed, elongated molecule. Horizontal arrows indicate direction of relaxation. Relaxation modes illustrated: 8A depicts fixed molecule before cleavage, 8B-8E depict possible relaxation modes producing detectably cleaved molecules, and 8F-8H depict relaxation modes producing undetectably cleaved molecules.

FIG. 9. Schematic representation depicting possible relaxation events to form pools of segments or "balls" at coil ends. Agarose gel is illustrated as a series of pegs with free spaces available for molecules. Gel pegs might intersect the embedded DNA molecule during gelation and possibly entrap it. The coil segments positioned in the pool region comprise a relaxed sub-coil region and have higher entropy than the coil stretched out between them. These pools may act as molecular rivets in some circumstances, particularly if the segment pool mass approaches that of the intervening coil.

FIGS. 10A-10D. Optical mapping sizing results for Not I endonuclease restriction fragments from S. cerevisiae chromosomes I, V, VIII, XI, XIII, and XVI calculated as described, plotted against published results. The diagonal line is for reference. Typical fragment images are shown in this figure. (See example 13). The inset shows the estimate of population standard deviation (kb). Error bars represent 90% confidence (7) on means (main graph) or standard deviation (inset). 10A and 10B: the relative intensity determination of fragment sizes. 10C and 10D: the relative apparent length determination of fragment sizes.

FIGS. 11A-11C. Scatter plot of normalized absolute intensity vs. apparent length. Absolute intensities from six individual images were calculated and plotted against apparent length over a time interval typically used in optical mapping (10-15 minutes). For each sample, the initial intensity was found by averaging absolute intensity values from groups of 5 adjacent images and taking the largest value. The values from several samples were normalized by dividing values from each image by the initial intensity for the sample. 11A: chromosome I 120 kb Not I fragment, 7 samples. 11B: chromosome XI 285 kb Not I fragment, 4 samples. 11C: chromosome XI 360 kb Not I fragment, 4 samples.

FIG. 12. Comparison of Not I endonuclease restriction maps of optical mapping results of S. cerevisiae chromosomal DNA molecules with published restriction maps (L&O). Maps were constructed from length (Len), intensity (Int) or a combination of both (Com). Bar lengths for the optical mapping data are proportional to the means plotted in FIGS. 10A-10D, and typical images are shown in FIGS. 13A-13F.

FIGS. 13A-13F. Typical fluorescence microscopy images of S. cerevisiae chromosomal DNA molecules stained with DAPI and embedded in agarose gel during Not I restriction endonuclease cleavage. Chromosomal DNA molecules were prepared and fixed as described in Example 13 and cited references. Images were background corrected using a smoothed and attenuated background image, smoothed, and stretched, using 16-bit precision. Images show Not I restriction digestion evolution, with arrows highlighting cut sites. Intervals are timed after addition of Mg.sup.2+. 13A: Ch. I (240 kb), 20 and 60 sec; 13B: Ch. XI (675 kb), 500, 880 and 1160 sec; 13C: Ch. V (595 kb), 200, 240, 520 sec; 13D: Ch. VIII (595 kb), 440, 1220 and 1360 sec; 13E: Ch. XIII (950 kb), 100 and 560 sec; 13F: Ch. XVI (975 kb), 460 and 560 sec. Bars, 5 .mu.m. A 100.times. objective was used to image results in FIGS. 13A-13D and a 63.times. objective was used for FIGS. 13E and 13F.

FIG. 14. Optical mapping results from Rsr II and Asc I endonuclease restriction digest of S. cerevisiae chromosomes III and XI. Maps were constructed from fully cut length (Len) or intensity (Int) data, and refined using partial cut length. Bar lengths are proportional to the calculated means, and typical images are shown in FIGS. 15A-15C. Number of cuts was determined as in FIGS. 7A-7D.

FIGS. 15A-15C. Fluorescence microscopy images of S. cerevisiae chromosomal DNA molecules stained with DAPI and embedded in agarose gel during Rsr II or Asc I restriction endonuclease cleavage. Chromosomal DNA molecules were digested and analyzed as in FIGS. 13A-13F. Images show restriction digestion evolution, with arrows highlighting cut sites. 15A: Ch. III, Rsr II, 1100 and 1820 sec; 15B: Ch. XI, Rsr II, 20, 600, 920, 1060 sec; 15C: Ch. XI, Asc I, 1160, 1500, 1780, 1940 sec. An isoschizomer to Rsr II, Csp I, was also used and gave identical results. Bar, 5 .mu.m.

FIG. 16. Glass surface properties as a function of polylysine treatment. Glass surfaces were incubated for 16 hours in different concentrations of poly-D-lysine, MW=350,500. Lambda bacteriophage DNA molecules in EcoRI restriction buffer and ethidium homodimer, minus magnesium ions, were mounted onto the treated glass surfaces. Square and circle show ratio of absorbed DNA and average length of absorbed DNA, respectively. Each point represents roughly 50 molecules measured and bars show the standard deviation about a mean. Sample preparation, imaging techniques and analysis are given in Methodology.

FIGS. 17A-17W. Gallery of fluorescence microscopy images of lambda clones from Optical Mapping results. Clones from a mouse yeast artificial chromosome (YAC) (Burke et al., Science 236:806-812, 1987; Murray and Szostak, Nature 305:189-193, 1983) spanning the Pygmy locus were subcloned into Lambda FIX II and digested with EcoRI and BamHI. Maps for these and other molecules (not shown) were constructed by Optical Mapping techniques (Methodology) and shown in FIG. 19. Images show typical molecules used for map construction. Bars: 5 microns. FIG. 17V is an enlargement of FIG. 17T and FIG. 17W is at the same scale as FIG. 17V. The enzymes used for map construction are indicated as (E) for EcoRI and (B) for BamH I. FIG. 17A uncut lambda DNA; FIG. 17B, B3 (E); FIG. 17C, F (B); FIG. 17D, B (s); FIG. 17E, D (S); FIG. 17F, E (S); FIG. 17G, 914 (S); FIG. 17H, S(E); FIG. 17I, G (S); FIG. 17J, C (E); FIG. 17K, B4 (E); FIG. 17L, Y11 (E); FIG. 17M, 618 (E); FIG. 17N, 617 (E); FIG. 17O, 305 (E); FIG. 17P, A (B); FIG. 17Q, 1004 (B); FIG. 17R, E (E); FIG. 17S, B6 (E); FIG. 17T, A2 (E); FIG. 17U, C3 (E); FIG. 17V, A2 (E); FIG. 17W, F (E).

FIGS. 18A-18D. EcoRI and BamH I endonuclease restriction fragment sizing results for Lambda FIX II clones, calculated as described and plotted against gel electrophoresis data. FIG. 18A, Relative fluorescence intensity results. The diagonal line is for reference. Typical fragment images are shown in FIGS. 17A-17W. Inset: estimate of population standard deviation (kb). Error bars represent 90% confidence on means (main graph) or standard deviation (inset). The size of the whole molecule was determined by gel electrophoresis. b, results for small fragments. The best fit line through the origin (slope 0.665) was used to calibrate fragment originally estimated at less than 6.5 kb prior to incorporation into maps. c, results after correction. d, Relative apparent length sizing results from the same images. The diagonal line is for reference.

FIG. 19. EcoRI and BamH I restriction maps constructed by Optical Mapping. Clones are labeled on the left side. The upper ticks are EcoRI restriction sites and lower ticks are BamH I sites. Table 1 shows the fragment sizes.

FIGS. 20A-20H. Optically sizing insert DNA of lambda FIX II clones. Lambda clones mounted on the surface were digested by an enzyme which cut at the polylinker sites, as described in Methodology. The 20 kb and 9 kb vector arms of FIX II cloning system were used as internal size standards to convert relative sizes to absolute sizes. The results of fluorescence intensity and length were shown in Table 2, together with sizes from PFGE. Cases where the enzyme also cut the insert were easily interpreted. Scale bar is 5 microns. FIGS. 20A-20B, Clone F (Sal I): 20 kb, 7.5 kb, 9.5 kb, 9 kb. FIGS. 20B-20D, Clone G (Sal I): 20 kb, 10.1 kb, 4.1 kb, 9 kb. FIGS. 20E-20F, clone B (NotI): 20 kb, 17.6 kb, 9 kb. FIGS. 20G-20H, B3 (SstI): 20 kb, 13.8 kb, 9 kb.

FIG. 21 DNA binding properties of glass surfaces as a function of APTES deposition. Yeast (AB972) chromosome I molecules (240 kb, 72 mm contour length, assuming B-DNA) in (10 mM Tris pH 7.6, 1 mM EDTA, 50 mM NaCl) were applied in molten agarose to glass surfaces previously treated with APTES for the indicated time. The number and length of molecules was measured by fluorescence microscopy after staining with ethidium homodimer. The plot shows the average number of molecules deposited per 100 m.sup.2 field viewed (square) and the average molecule length (circle), plotted against the time of prior APTES derivatization. Each point represents .sup..about.b 60 molecules imaged. Bars indicate the standard deviation about the means. Sample preparation, imaging techniques and analysis are given in Materials and Methods.

FIGS. 22A-22D. Optical mapping sizing results for NotI endonuclease restriction fragments of S. cerevisiae chromosomes I, V, VIII, and XI calculated as described (Example 13) plotted against published results (Link and Olson, Genetics 127:681, 1991). The diagonal line is for reference. Each point represents 20 to 40 imaged fragments. FIGS. 22B and 22D: estimate of population standard deviation (kb). Error bars represent the 90% confidence intervals. FIGS. 22A and 22B Relative apparent length determination of restriction fragment sizes. FIGS. 22C and 22D Relative fluorescence intensity determination of restriction fragment sizes.

FIGS. 23A-23D. 23A-23C: Typical fluorescence micrographs of S. cerevisiae chromosomal DNA molecules digested with NotI restriction endonuclease. Molecules were stained with ethidium homodimer after digestion. Arrows indicate cleavage sites, bars 10 microns. FIG. 23A, chromosome XI, two cuts; FIG. 23B, Chromosome V, three cuts; and FIG, 23C, chromosome VIII, two cuts. FIG. 23D, graphical comparison of optical mapping results and published PFGE restriction maps of yeast chromosomes digested with NotI. Bar lengths for the optical mapping data are proportional to the means based on the fluorescence intensity measurements plotted in FIGS. 22C-22D.

FIGS. 24A-24I. FIGS. 24A-G: Typical fluorescence micrographs of yeast artificial chromosomes digested with NotI, MluI, EagI and NruI restriction endonucleases and stained with ethidium homodimer. Arrows indicate Cleavage sites, bars 10 microns. YAC 7H6 was digested with: FIG. 24A, NruI; FIG. 24B EagI. YAC 3I4 was digested with: FIG. 24C, NotI; FIG. 24D, MluI; FIG. 24E, EagI; FIG. 24F, NotI and MluI; FIG. 24G, MluI and EagI. Graphical comparison of optical mapping results with PFGE mapping results for YACs: FIG. 24H, 7H6; FIG. 24I, 3I4. Double digestion results are included. Bar lengths for the optical mapping data are proportional to the means based on fluorescence intensity measurements.

FIG. 25 is a diagram depicting a laminar flow elongation device.

FIGS. 26 A, B, and C illustrate the characteristic "sunburst" pattern of fixation of elongated molecules using the spotting technique of the present invention.

FIGS. 27 A and B show relaxation measurements as a function of molecular size.

FIGS. 28 A and B are logarithmic plots of relaxation versus size.

FIG. 29 shows a enlarged view of a DNA spot and one method of spreading molecules onto a derivatized surface.

FIG. 30 is a block diagram of a method for high throughput optical mapping of lambda or cosmid clones.

FIG. 31 is a block diagram of the system used for high throughput optical mapping of gridded YAC DNA.

FIG. 32 is a block diagram of one embodiment of the automated system for high throughput optical mapping.

FIG. 33 illustrates a method of optimizing the image collection process and maximizing the signal-to-noise ratio.

FIG. 34 is a block diagram of the image processing method in accordance with a preferred embodiment of the present invention.

5. DETAILED DESCRIPTION OF THE INVENTION

Described herein are methods and compositions for characterizing and manipulating individual nucleic acid molecules, including mammalian chromosome-sized individual nucleic acid molecules. The methods and compositions described herein can be utilized for optical mapping and optical sequencing purpose to generate accurate, rapid, high throughput analyses of nucleic acid molecules at the genome level.

Specifically, Section 5.1 describes methods for the elongation and fixation of single nucleic acid molecules. Such methods include both agarose-based (Section 5.1.1) and solid surface-based (Section 5.1.2) techniques. Section 5.1 also describes assays for the optimization of parameters important to the production of the solid, planar surfaces used-herein. Further, Section 5.1 also describes flow-based elongation techniques (Section 5.1.3) in which a single nucleic acid molecule is elongated, manipulated and/or analyzed in a laminar flow elongation device.

Section 5.2 describes methods for the imaging and sizing of single nucleic acid molecules. The Section includes, for example, nucleic acid staining, microscopy and photography techniques useful for imaging single nucleic acid molecules. Further, the Section describes methods for the sizing of single nucleic acid molecules including both static and dynamic measurement techniques. Section 5.3 describes genome analysis applications to which the single nucleic acid molecule techniques of the invention may be put. Such applications include, for example, optical mapping and optical sequencing techniques. Finally, Section 5.4 discusses methods for rapid, high throughput utilization of the single nucleic acid techniques of the invention.

5.1. SINGLE NUCLEIC ACID MOLECULE ELONGATION TECHNIQUES

A variety of methods can be utilized for the rapid, controllable and reproducible elongation of single nucleic acid molecules in such a manner that allows rapid, efficient analysis and/or manipulation of the molecules. These techniques can include, for example, gel-based (Section 5.1.1), solid surface-based (Section 5.1.2) and flow-based techniques (Section 5.1.3), each of which will be separately described below.

5.1.1. GEL-BASED TECHNIQUES

Gel-based techniques can be utilized for the elongation of single nucleic acid molecules. The gel-based techniques described herein maintain the biological function of the nucleic acid molecules and, further, allow for the manipulation and/or accurate analysis of the elongated single nucleic acid molecules. Nucleic acid molecules which can be rapidly, efficiently analyzed via such gel-based techniary include nucleic acid molecules which range in length from about 20 kb up to mammalian chromosome-sized lengths (i.e. greater than 1000 kb). Further, such gel-based techniques make possible the utilization of dynamic measurement procedures, may generate a lower level of nucleic acid shearing and make possible the utilization of a wide range of biochemical activities with which the manipulate the elongated nucleic acid molecules.

Briefly, gel-based techniques involve elongating single nucleic acid molecules within a molten or nonpolymerized gel composition such that upon cooling or polymerization, the elongated nucleic acid molecules are maintained in a relatively stationary position, while remaining accessible to, for example, enzymatic manipulation and/or hybridization to complementary nucleic acid molecules or binding to sequence-specific proteins or peptides. Further, the gelation process restrains elongated nucleic acid molecules from appreciably relaxing to a random coil conformation after, for example, their enzymatic cleavage.

For optimal imaging and manipulation potential, the amount which the single nucleic acid molecules are elongated within the gel composition is critical. Excessive elongation or stretching causes the molecule to become difficult to visualize. For example, too much stretching presents too little fluorochrome per imaging pixel, lending the intensities generated by the measured molecular intensities to approach background values. Insufficient stretching, however, generates too low a level of tension, which can interfere with an analysis of single nucleic acid molecule manipulations. For example, when restriction mapping, enough elongation must occur such that, upon digestion, the newly formed nucleic acid fragments pull away from each other, thus revealing restriction sites. An additional requirement for optimal gel-based elongation requires that care be taken to preserve the moisture within the gel, such that the maximum biological function of the nucleic acid can be retained.

For optimal imaging/manipulation potential, the extent to which a nucleic acid molecule is elongated within a gel must be great enough to generate a sufficient level of intramolecular tension while not being so great that the elongated molecule becomes difficult to image. In general, elongation methods which produce single nucleic acid molecules that span approximately 20% to 60% of their curvilnear contour lengths are preferred.

Further, the elongated nucleic acid molecules within the gel must lie within a shallow plane of focus for successful imaging. With respect to larger nucleic acid molecules, for example, it is additionally important for the molecules to lie within a plane approximately 0.2 .mu.m in thickness for focused visualization.

Because gelation or polymerization fixes embedded molecules, systematically varying parameters which affect the rate at which the gelation or polymerization can modulate the degree of fixation and, ultimately, the rate of molecule relaxation. Smaller nucleic acid molecules (i.e., molecules less than about 350 kb) relax quickly. Thus, it is preferred that elongation take place under conditions which hasten gelation/polymerization so that the nucleic acid molecules become trapped in an extended conformation before substantial relaxation takes place. Larger nucleic acid molecules relax at a slower rate, and, therefore, can be elongated under conditions which allow for a slower rate of gelation/polymerization.

With respect to agarose gels, parameters which affect the rate of gelation include, for example, the gel concentration and/or temperature at which the gel is formed. A higher gel concentration or gelation at a low temperature hastens gel formation. With respect to polyacrylamide gels, parameters which affect the rate of polymerization include, for example, the acrylamide/bisacrylamide concentration and ratio, the temperature at which polymerization takes place, and the ammonium sulfate and TEMED concentrations used.

While any gel composition may be used for such elongation techniques, an agarose gel composition is preferred, with an agarose composition exhibiting a low gelling temperature being especially preferred. Such low gelling temperature agarose compositions are the most optically clear agarose compositions available and, further, because such compositions can remain molten at 37.degree. C., the biological activity of enzymes, such as restriction enzymes, within the molten agarose can easily be maintained. Additionally, such agarose compositions are useful in that rapid gelation is often desired for fixation of the elongated nucleic acid molecules. For agarose gel compositions, a gel composition comprising from about 0.1% to about 3.0%, with 0.1-1.5% being preferred.

Any number of techniques can be used to apply an external force which will cause the nucleic acid molecules within the gel composition to become elongated. For example, an elongating external force may include an electrical or mechanical force. While the exact amount of external force required for optimal elongation may vary according to, for example, the specific gel composition and nucleic acid molecules being elongated, the optimization of gel parameters can easily and without undue experimentation be assayed by, for example, utilizing the visualization and measurement techniques described in Section 5.2, below.

Elongation may, for example, be accomplished by generating a flow force within a molten agarose gel containing single nucleic acid molecules. Such a flow force may be set up by placing the nucleic acid/molten gel composition between two solid surfaces, such as, for example, between a slide and a coverslip. In such an embodiment, a hole preferably exists in the slide through which reagents for the manipulation of the elongated nucleic