WikiPatents - Community Patent Review
Create Free Account  |  License or Sell Your Patent  |  WikiPatents Marketplace  |  WikiPatents Blog
Username:  Password:  
    
Advanced Search
System and method of automatically generating chemical compounds with desired properties    
United States Patent5463564   
Link to this pagehttp://www.wikipatents.com/5463564.html
Inventor(s)Agrafiotis; Dimitris K. (Exton, PA); Bone; Roger F. (Bridgewater, NJ); Salemme; Francis R. (Kennett Square, PA); Soll; Richard M. (Lawrenceville, NJ)
AbstractA computer based, iterative process for generating chemical entities with defined physical, chemical and/or bioactive properties. During each iteration of the process, (1) a directed diversity chemical library is robotically generated in accordance with robotic synthesis instructions; (2) the compounds in the directed diversity chemical library are analyzed to identify compounds with the desired properties; (3) structure-property data are used to select compounds to be synthesized in the next iteration; and (4) new robotic synthesis instructions are automatically generated to control the synthesis of the directed diversity chemical library for the next iteration.
   














 Title Information Submit all comments and votes
 
Patent Text Patent PDF Print Page Summary File History
Plain text PDF images Print Summary File History
Drawing from US Patent 5463564
System and method of automatically generating chemical compounds with

     desired properties - US Patent 5463564 Drawing
System and method of automatically generating chemical compounds with desired properties
Inventor     Agrafiotis; Dimitris K. (Exton, PA); Bone; Roger F. (Bridgewater, NJ); Salemme; Francis R. (Kennett Square, PA); Soll; Richard M. (Lawrenceville, NJ)
Owner/Assignee     3-Dimensional Pharmaceuticals, Inc. (Philadelphia, PA)
Patent assignment
All assignments
Publication Date     October 31, 1995
Application Number     08/306,915
PAIR File History     Application Data   Transaction History
Image File Wrapper   Patent Term   Fees
Litigation
Filing Date     September 16, 1994
US Classification     700/268 260/1 422/187 423/659 436/43 702/31
Int'l Classification     G06F 017/50
Examiner     Voeltz; Emanuel T.
Assistant Examiner     Choi; Kyle
Attorney/Law Firm     Sterne, Kessler, Goldstein & Fox
Address
Parent Case    
Priority Data    
USPTO Field of Search     364/496 364/497 364/499 364/500 436/43 436/50 436/55 423/659 424/2 935/85 935/86 935/87 935/88
Patent Tags     automatically generating chemical compounds with desired properties
   
Enter a comma (,) or semicolon (;) between multiple tag words/phrases.
Describe this patent:
 Amusing   
 Clever   
 Complex   
 Efficient   
 Historic   
 Important   
 Innovative   
 Interesting   
 Practical   
 Simple   
[no votes]
Patent WIKI

Share information and news about this patent, including information and news about the technology, inventors, company, ligation and licensing.

 References Submit all comments and votes
 
*references marked with an asterisk below are user-added references
 U.S. References
 
Add a new US reference:  
ReferenceRelevancyCommentsReferenceRelevancyComments
5331573
Balaji
703/11
Jul,1994

[0 after 0 votes]
5288514
Ellman
435/4
Feb,1994

[0 after 0 votes]
5270170
Schatz
435/7.37
Dec,1993

[0 after 0 votes]
5240680
Zuckermann
422/67
Aug,1993

[0 after 0 votes]
4939666
Hardman
700/293
Jul,1990

[0 after 0 votes]
 Foreign References
 Other References
 Market Review Submit all comments and votes
   
Market Size
Estimate the gross annual revenues of the relevant market sector:
> $10B
$5B - $10B
$2B - $5B
$500M - $2B
$100M - $500M
$10M - $100M
$1M - $10M
$500K - $1M
$100K - $500K
< $100K
[No votes]
$0
 
$0   $2.5B   $5B   $7.5B   $10B
Market Share
Estimate the percentage of the relevant market sector this invention will capture:
75% - 100%
50% - 74.99%
25% - 49.99%
10 - 24.99%
5 - 9.99%
2 - 4.99%
1 - 1.99%
< 1%
[No votes]
0.0%
 
0%   25%   50%   75%   100%
Reasonable Royalty
What percentage of gross sales should the inventor or assignee be paid?
75% - 100%
50% - 74.99%
25% - 49.99%
10 - 24.99%
5 - 9.99%
2 - 4.99%
1 - 1.99%
< 1%
[No votes]
0.0%
 
0%   25%   50%   75%   100%
Public's "Guesstimation" of Royalty Value
Market SizeN/A[No votes]
xMarket ShareN/A[No votes]
xReasonable RoyaltyN/A[No votes]

N/A

License Availablity
If you are NOT the owner or assignee, answer here:
Yes, license is available for purchase

No, license is not currently available



[No votes]
License Availablity
If you ARE the owner or assignee, answer here:
Yes, license is available for purchase

No, license is not currently available



[No votes]
Competitive Advantage
Does this invention have a significant competitive advantage over similar technologies?
Yes

No



[No votes]
Most helpful competitive advantage comment
[No comments]

Commercial Alternatives
Are there viable commercial alternatives for this invention?
Yes

No



[No votes]
Most helpful commercial alternative comment
[No comments]

 Technical Review Submit all comments and votes
 Claims Submit all comments and votes
 


What is claimed is:

1. A computer based method of automatically generating compounds having a prescribed set of properties, comprising the steps of:

(1) robotically synthesizing, in accordance with robotic synthesis instructions, a directed diversity chemical library comprising a plurality of chemical compounds;

(2) robotically analyzing said chemical compounds to obtain structure-activity data pertaining thereto;

(3) comparing, under computer control, said structure-activity data of said chemical compounds against said prescribed set of properties to identify any of said chemical compounds substantially conforming to said prescribed set of properties;

(4) classifying, under computer control, said identified chemical compounds as lead compounds;

(5) analyzing, under computer control, said structure-activity data of said compounds and historical structure-activity data pertaining to compounds synthesized and analyzed in the past to derive structure-activity models having enhanced predictive and discriminating capabilities;

(6) identifying, under computer control and in accordance with said structure-activity models, reagents from a reagent database that, when combined, will produce a set of compounds predicted to exhibit activity/properties more closely matching said prescribed set of properties;

(7) generating, under computer control, robotic synthesis instructions that, when executed, enable robotic synthesis of said set of compounds; and

(8) repeating steps (1)-(7), wherein step (1) is repeated using said generated robotic synthesis instructions.

2. The method of claim 1, wherein step (6) comprises the step of:

identifying, under computer control and in accordance with said structure-activity models, reagents from a reagent database that, when combined, will produce a second set of compounds predicted to have a superior ability to validate said structure-activity models, wherein said first and second sets of compounds are not mutually exclusive;

wherein step (7) comprises the step of generating, under computer control, robotic synthesis instructions that, when executed, enable robotic synthesis of said second set of compounds.

3. The method of claim 1, wherein step (6) comprises the step of:

identifying, under computer control and in accordance with said structure-activity models, reagents from a reagent database that, when combined, will produce a second set of compounds predicted to have a superior ability to discriminate between said structure-activity models, wherein said first and second sets of compounds are not mutually exclusive;

wherein step (7) comprises the step of generating, under computer control, robotic synthesis instructions that, when executed, enable robotic synthesis of said second set of compounds.

4. The method of claim 1, wherein step (6) comprises the step of:

identifying, under computer control and in accordance with said structure-activity models, reagents from a reagent database that, when combined, will produce a second set of compounds predicted to have a superior ability to validate said structure-activity models, and a third set of compounds predicted to have a superior ability to discriminate between said structure-activity models, wherein said first, second, and third sets of compounds are not mutually exclusive;

wherein step (7) comprises the step of generating, under computer control, robotic synthesis instructions that, when executed, enable robotic synthesis of said second and third set of compounds.

5. The method of claim 1, wherein step (6) comprises the step of:

identifying, under computer control and in accordance with said structure-activity models, reagents from a reagent database that, when combined, will produce a second set of compounds predicted to have superior three-dimensional receptor fit, wherein said first and second sets of compounds are not mutually exclusive;

wherein step (7) comprises the step of generating, under computer control, robotic synthesis instructions that, when executed, enable robotic synthesis of said second set of compounds.

6. The method of claim 1, wherein step (6) comprises the step of:

identifying under computer control reagents from said reagent database that, when combined, will produce a second set of compounds with structural, physical, or chemical characteristics similar to characteristics of compounds in a structure-activity database whose activity/properties most closely conform to said prescribed set of activity/properties.

7. The method of claim 1, wherein step (6) comprises the steps of:

(a) generating a list of potential chemical compounds to possibly synthesize, each of said potential chemical compounds synthesized using N of said identified reagents; and

(b) selecting from said list of potential chemical compounds a plurality of candidate compounds that are suitable for synthesis.

8. The method of claim 7, wherein N is greater or equal to 1 and less than 500.

9. The method of claim 7, wherein N is of a magnitude such that each compound synthesized contains three variable subunits.

10. The method of claim 7, wherein step (2) comprises the steps of:

analyzing said chemical compounds to obtain structure and composition data pertaining thereto;

analyzing said structure and composition data to generate chemical synthesis indicia indicating which of said chemical compounds were adequately synthesized, and which of said chemical compounds were not adequately synthesized; and

storing said structure and composition data and said chemical synthesis indicia in a structure-activity database, said structure-activity database also having stored therein structure and composition data and chemical synthesis indicia pertaining to previously synthesized chemical compounds.

11. The method of claim 10, wherein step (b) comprises the steps of:

retrieving from said structure-activity database any chemical synthesis indicia pertaining to said potential chemical compounds; and

selecting as candidate compounds any of said potential chemical compounds that were previously adequately synthesized as indicated by said retrieved chemical synthesis indicia, if any, respectively associated with said potential chemical compounds.

12. The method of claim 10, wherein step (b) comprises the steps of:

retrieving from said structure-activity database any chemical synthesis indicia pertaining to said potential chemical compounds; and

eliminating from consideration as candidate compounds any of said potential chemical compounds that were previously inadequately synthesized as indicated by said retrieved chemical synthesis indicia, if any, respectively associated with said potential chemical compounds.

13. The method of claim 7, wherein step (7) comprises the step of:

generating, under computer control, robotic synthesis instructions that, when executed, enable robotic synthesis of said candidate compounds.

14. The method of claim 7, further comprising the step of:

(c) selecting an optimal set of said candidate compounds to synthesize based on at least one of the following factors:

(i) their respective predicted abilities to exhibit activity/properties more closely matching said prescribed set of activity/properties as indicated by said structure-activity models;

(ii) their respective predicted abilities to validate said structure-activity models;

(iii) their respective predicted abilities to discriminate between said structure-activity models;

(iv) their respective predicted abilities to have superior three-dimensional receptor fit; and

(v) similarity between their respective structural, physical, or chemical characteristics and characteristics of compounds in a structure-activity database whose activity/properties most closely conform to said prescribed set of activity/properties.

15. The method of claim 14, wherein step (c) comprises the step of:

selecting said optimal set by individually ranking said candidate compounds based on at least one of factors (i)-(v).

16. The method of claim 14, wherein step (c) comprises the step of:

selecting said optimal set by ranking combinations of said candidate compounds based on at least one of factors (i)-(v).

17. The method of claim 14, wherein step (7) comprises the step of:

generating, under computer control, robotic synthesis instructions that, when executed, enable robotic synthesis of said optimal set of candidate compounds.

18. The method of claim 1, wherein step (6) is performed according to operator input.

19. The method of claim 1, wherein step (7) comprises the steps of:

receiving operator input pertaining to the generation of said robotic synthesis instructions; and

generating said robotic synthesis instructions based, at least in part, on said operator input.

20. The method of claim 1, wherein said reagents include amines suitable for synthesizing thrombin inhibitors.

21. A computer-based system for automatically generating compounds having a prescribed set of activity/properties, comprising:

one or more chemical synthesis robots to robotically synthesize, in accordance with robotic synthesis instructions, a directed diversity chemical library comprising a plurality of chemical compounds;

one or more analysis robots to robotically analyze said chemical compounds to obtain structure-activity data pertaining thereto;

a synthesis protocol generator, comprising:

comparing means for comparing said structure-activity data of said chemical compounds against said prescribed set of activity/properties to identify any of said chemical compounds substantially conforming to said prescribed set of activity/properties;

classifying means for classifying said identified chemical compounds as lead compounds;

structure-activity model derivation means for analyzing said structure-activity data of said compounds and historical structure-activity data pertaining to compounds synthesized and analyzed in the past to derive structure-activity models having enhanced predictive and discriminating capabilities;

reagent identifying means for identifying, in accordance with said structure-activity models, reagents from a reagent database that, when combined, will produce a set of compounds predicted to exhibit activity/properties more closely matching said prescribed set of activity/properties; and

robotic synthesis instruction generating means for generating robotic synthesis instructions that, when executed, enable said chemical synthesis robots to robotically synthesize said set of compounds.

22. The system of claim 21, wherein said reagent identifying means comprises:

means for identifying, in accordance with said structure-activity models, reagents from said reagent database that, when combined, will produce a second set of compounds predicted to have a superior ability to validate said structure-activity models, wherein said first and second sets of compounds are not mutually exclusive;

wherein said robotic synthesis instruction generating means comprises:

means for generating robotic synthesis instructions that, when executed, enable said chemical synthesis robots to robotically synthesize said second set of compounds.

23. The system of claim 21, wherein said reagent identifying means comprises:

means for identifying, in accordance with said structure-activity models, reagents from said reagent database that, when combined, will produce a second set of compounds predicted to have a superior ability to discriminate between said structure-activity models, wherein said first and second sets of compounds are not mutually exclusive;

wherein said robotic synthesis instruction generating means comprises:

means for generating robotic synthesis instructions that, when executed, enable said chemical synthesis robots to robotically synthesize said second set of compounds.

24. The system of claim 21, wherein said reagent identifying means comprises:

means for identifying, in accordance with said structure-activity models, reagents from said reagent database that, when combined, will produce a second set of compounds predicted to have a superior ability to validate said structure-activity models, and a third set of compounds predicted to have a superior ability to discriminate between said structure-activity models, wherein said first, second, and third sets of compounds are not mutually exclusive;

wherein said robotic synthesis instruction generating means comprises:

means for generating robotic synthesis instructions that, when executed, enable said chemical synthesis robots to robotically synthesize said second and third set of compounds.

25. The system of claim 21, wherein said reagent identifying means comprises:

means for identifying, in accordance with said structure-activity models, reagents from said reagent database that, when combined, will produce a second set of compounds predicted to have superior three-dimensional receptor fit, wherein said first and second sets of compounds are not mutually exclusive;

wherein said robotic synthesis instruction generating means comprises:

means for generating robotic synthesis instructions that, when executed, enable said chemical synthesis robots to robotically synthesize said second set of compounds.

26. The system of claim 21, wherein said reagent identifying means comprises:

means for identifying reagents from said reagent database that, when combined, will produce a second set of compounds with structural, physical, or chemical characteristics similar to characteristics of compounds in a structure-activity database whose activity/properties most closely conform to said prescribed set of activity/properties, wherein said first and second sets of compounds are not mutually exclusive;

wherein said robotic synthesis instruction generating means comprises:

means for generating robotic synthesis instructions that, when executed, enable said chemical synthesis robots to robotically synthesize said second set of compounds.

27. The system of claim 21, wherein said synthesis protocol generator further comprises:

means for generating a list of potential chemical compounds to possibly synthesize, each of said potential chemical compounds synthesized using N of said reagents identified by said reagent identifying means; and

candidate compound identifying means for selecting from said list of potential chemical compounds a plurality of candidate compounds that are suitable for synthesis.

28. The system of claim 27, wherein N is of a magnitude such that each compound synthesized contains three variable subunits.

29. The method of claim 27, wherein N is greater or equal to 1 and less than 500.

30. The system of claim 27, wherein said analysis robots comprise:

means for analyzing said chemical compounds to obtain structure and composition data pertaining thereto;

means for analyzing said structure and composition data to generate chemical synthesis indicia indicating which of said chemical compounds were adequately synthesized, and which of said chemical compounds were not adequately synthesized; and

means for storing said structure and composition data and said chemical synthesis indicia in a structure-activity database, said structure-activity database also having stored therein structure and composition data and chemical synthesis indicia pertaining to previously synthesized chemical compounds.

31. The system of claim 30, wherein said candidate compound identifying means comprises:

means for retrieving from said structure-activity database any chemical synthesis indicia pertaining to said potential chemical compounds; and

means for selecting as candidate compounds any of said potential chemical compounds that were previously adequately synthesized as indicated by said retrieved chemical synthesis indicia, if any, respectively associated with said potential chemical compounds.

32. The system of claim 27, wherein said robotic synthesis instruction generating means comprises:

means for generating robotic synthesis instructions that, when executed, enable said chemical synthesis robots to robotically synthesize said candidate compounds.

33. The system of claim 27, wherein said synthesis protocol generator further comprises:

optimal set selecting means for selecting an optimal set of said candidate compounds to synthesize based on at least one of the following factors:

(i) their respective predicted abilities to exhibit activity/properties more closely matching said prescribed set of activity/properties as indicated by said structure-activity models;

(ii) their respective predicted abilities to validate said structure-activity models;

(iii) their respective predicted abilities to discriminate between said structure-activity models;

(iv) their respective predicted abilities to have superior three-dimensional receptor fit; and

(v) similarity between their respective structural, physical, or chemical characteristics and characteristics of compounds in a structure-activity database whose activity/properties most closely conform to said prescribed set of activity/properties.

34. The system of claim 33, wherein said optimal set selecting means comprises:

means for selecting said optimal set by individually ranking said candidate compounds based on at least one of factors (i)-(v).

35. The system of claim 33, wherein said optimal set selecting means comprises:

means for selecting said optimal set by ranking combinations of said candidate compounds based on at least one of factors (i)-(v).

36. The system of claim 33, wherein said robotic synthesis instruction generating means comprises:

means for generating robotic synthesis instructions that, when executed, enable said chemical synthesis robots to robotically synthesize said optimal set of candidate compounds.

37. The system of claim 21, wherein said reagent identifying means operates according to operator input.

38. The system of claim 21, wherein said robotic synthesis instruction generating means comprises:

means for receiving operator input pertaining to the generation of said robotic synthesis instructions; and

means for generating said robotic synthesis instructions based, at least in part, on said operator input.

39. The system of claim 21, wherein said reagents include amines suitable for synthesizing thrombin inhibitors.

40. A computer based method of iteratively generating a plurality of directed diversity chemical libraries each comprising multiple chemical compounds, wherein chemical compounds in said directed diversity chemical libraries more closely conform to a prescribed set of activity/properties during each successive iteration, the method comprising the steps of:

(1) robotically synthesizing, in accordance with robotic synthesis instructions, a directed diversity chemical library comprising a plurality of chemical compounds;

(2) robotically analyzing said chemical compounds to obtain structure-activity data pertaining thereto;

(3) analyzing, under computer control, said structure-activity data of said compounds and historical structure-activity data pertaining to compounds synthesized and analyzed during prior iterations to derive structure-activity models having enhanced predictive and discriminating capabilities;

(4) identifying, under computer control and in accordance with said structure-activity models, reagents from a reagent database that, when combined, will produce a set of compounds predicted to exhibit activity/properties more closely matching said prescribed set of activity/properties;

(5) generating, under computer control, robotic synthesis instructions that, when executed, enable robotic synthesis of said set of compounds; and

(6) repeating steps (1)-(5), wherein step (1) is repeated using said generated robotic synthesis instructions.

41. The method of claim 40, wherein step (4) comprises the step of:

identifying, under computer control and in accordance with said structure-activity models, reagents from said reagent database that, when combined, will produce a second set of compounds predicted to have a superior ability to validate said structure-activity models, wherein said first and second sets of compounds are not mutually exclusive; wherein step (5) comprises the step of generating, under computer control, robotic synthesis instructions that, when executed, enable robotic synthesis of said second set of compounds.

42. The method of claim 40, wherein step (4) comprises the step of:

identifying, under computer control and in accordance with said structure-activity models, reagents from said reagent database that, when combined, will produce a second set of compounds predicted to have a superior ability to discriminate between said structure-activity models, wherein said first and second sets of compounds are not mutually exclusive;

wherein step (5) comprises the step of generating, under computer control, robotic synthesis instructions that, when executed, enable robotic synthesis of said second set of compounds.

43. The method of claim 40, wherein step (4) comprises the step of:

identifying, under computer control and in accordance with said structure-activity models, reagents from said reagent database that, when combined, will produce a second set of compounds predicted to have a superior ability to validate said structure-activity models, and a third set of compounds predicted to have a superior ability to discriminate between said structure-activity models, wherein said first, second, and third sets of compounds are not mutually exclusive;

wherein step (5) comprises the step of generating, under computer control, robotic synthesis instructions that, when executed, enable robotic synthesis of said second and third set of compounds.

44. The method of claim 40, wherein step (4) comprises the step of:

identifying, under computer control and in accordance with said structure-activity models, reagents from said reagent database that, when combined, will produce a second set of compounds predicted to have superior three-dimensional receptor fit, wherein said first and second sets of compounds are not mutually exclusive;

wherein step (5) comprises the step of generating, under computer control, robotic synthesis instructions that, when executed, enable robotic synthesis of said second set of compounds.

45. The method of claim 40, wherein step (4) comprises the step of:

identifying under computer control reagents from said reagent database that, when combined, will produce a second set of compounds with structural, physical, or chemical characteristics similar to characteristics of compounds in a structure-activity database whose activity/properties most closely conform to said prescribed set of activity/properties, wherein said first and second sets of compounds are not mutually exclusive;

wherein step (5) comprises the step of:

generating, under computer control, robotic synthesis instructions that, when executed, enable robotic synthesis of said second set of compounds.

46. The method of claim 40, further comprising the following steps which are performed between steps (4) and (5):

(a) generating a list of potential chemical compounds to possibly synthesize, each of said potential chemical compounds synthesized using N of said identified reagents; and

(b) selecting from said list of potential chemical compounds a plurality of candidate compounds that are suitable for synthesis.

47. The method of claim 46, wherein step (5) comprises the step of:

generating, under computer control, robotic synthesis instructions that, when executed, enable robotic synthesis of said candidate compounds.

48. The method of claim 46, further comprising the step of:

(c) selecting an optimal set of said candidate compounds to synthesize based on at least one of the following factors:

(i) their respective predicted abilities to exhibit activity/properties more closely matching said prescribed set of activity/properties as indicated by said structure-activity models;

(ii) their respective predicted abilities to validate said structure-activity models;

(iii) their respective predicted abilities to discriminate between said structure-activity models;

(iv) their respective predicted abilities to have superior three-dimensional receptor fit; and

(v) similarity between their respective structural, physical, or chemical characteristics and characteristics of compounds in a structure-activity database whose activity/properties most closely conform to said prescribed set of activity/properties.

49. The method of claim 48, wherein step (c) comprises the step of:

selecting said optimal set by individually ranking said candidate compounds based on at least one of factors (i)-(v).

50. The method of claim 48, wherein step (c) comprises the step of:

selecting said optimal set by ranking combinations of said candidate compounds based on at least one of factors (i)-(v).

51. The method of claim 48, wherein step (5) comprises the step of:

generating, under computer control, robotic synthesis instructions that, when executed, enable robotic synthesis of said optimal set of candidate compounds.

52. A synthesis protocol generator for use in a system that automatically generates compounds having a prescribed set of activity/properties, said system comprising at least one chemical synthesis robot to robotically synthesize, in accordance with robotic synthesis instructions, a directed diversity chemical library comprising a plurality of chemical compounds, and at least one analysis robot to robotically analyze said chemical compounds to obtain structure-activity data pertaining thereto, said synthesis protocol generator comprising:

structure-activity model derivation means for analyzing said structure-activity data of said compounds and historical structure-activity data pertaining to compounds synthesized and analyzed in the past to derive structure-activity models having enhanced predictive and discriminating capabilities;

reagent identifying means for identifying, in accordance with said structure-activity models, reagents from a reagent database that, when combined, will produce a set of compounds predicted to exhibit activity/properties more closely matching said prescribed set of activity/properties; and

robotic synthesis instruction generating means for generating robotic synthesis instructions that, when executed, enable said chemical synthesis robot to robotically synthesize said set of compounds.

53. The synthesis protocol generator of claim 52, further comprising:

comparing means for comparing said structure-activity data of said chemical compounds against said prescribed set of activity/properties to identify any of said chemical compounds substantially conforming to said prescribed set of activity/properties; and

classifying means for classifying said identified chemical compounds as lead compounds.

54. The synthesis protocol generator of claim 52, wherein said reagent identifying means comprises:

means for identifying, in accordance with said structure-activity models, reagents from said reagent database that, when combined, will produce a second set of compounds predicted to have a superior ability to validate said structure-activity models, wherein said first and second sets of compounds are not mutually exclusive;

wherein said robotic synthesis instruction generating means comprises:

means for generating robotic synthesis instructions that, when executed, enable said chemical synthesis robot to robotically synthesize said second set of compounds.

55. The synthesis protocol generator of claim 52, wherein said reagent identifying means comprises:

means for identifying, in accordance with said structure-activity models, reagents from said reagent database that, when combined, will produce a second set of compounds predicted to have a superior ability to discriminate between said structure-activity models, wherein said first and second sets of compounds are not mutually exclusive;

wherein said robotic synthesis instruction generating means comprises:

means for generating robotic synthesis instructions that, when executed, enable said chemical synthesis robot to robotically synthesize said second set of compounds.

56. The synthesis protocol generator of claim 52, wherein said reagent identifying means comprises:

means for identifying, in accordance with said structure-activity models, reagent from said reagent database that, when combined, will produce a second set of compounds predicted to have a superior ability to validate said structure-activity models, and a third set of compounds predicted to have a superior ability to discriminate between said structure-activity models, wherein said first, second, and third sets of compounds are not mutually exclusive;

wherein said robotic synthesis instruction generating means comprises:

means for generating robotic synthesis instructions that, when executed, enable said chemical synthesis robot to robotically synthesize said second and third set of compounds.

57. The synthesis protocol generator of claim 52, wherein said reagent identifying means comprises:

means for identifying, in accordance with said structure-activity models, reagents from said reagent database that, when combined, will produce a second set of compounds predicted to have superior three-dimensional receptor fit, wherein said first and second sets of compounds are not mutually exclusive;

wherein said robotic synthesis instruction generating means comprises:

means for generating robotic synthesis instructions that, when executed, enable said chemical synthesis robot to robotically synthesize said second set of compounds.

58. The synthesis protocol generator of claim 52, wherein said reagent identifying means comprises:

means for identifying under computer control reagents from said reagent database that, when combined, will produce a second set of compounds with structural, physical, or chemical characteristics similar to characteristics of compounds in a structure-activity database whose activity/properties most closely conform to said prescribed set of activity/properties, wherein said first and second sets of compounds are not mutually exclusive;

wherein said robotic synthesis instruction generating means comprises:

means for generating, under computer control, robotic synthesis instructions that, when executed, enable robotic synthesis of said second set of compounds.

59. The synthesis protocol generator of claim 52, further comprising:

means for generating a list of potential chemical compounds to possibly synthesize, each of said potential chemical compounds comprising N of said reagents identified by said reagent identifying means; and

candidate compound identifying means for selecting from said list of potential chemical compounds a plurality of candidate compounds that are suitable for synthesis.

60. The synthesis protocol generator of claim 59, wherein said robotic synthesis instruction generating means comprises:

means for generating robotic synthesis instructions that, when executed, enable said chemical synthesis robot to robotically synthesize said candidate compounds.

61. The synthesis protocol generator of claim 59, wherein said synthesis protocol generator further comprises:

optimal set selecting means for selecting an optimal set of said candidate compounds to synthesize based on at least one of the following factors:

(i) their respective predicted abilities to exhibit activity/properties more closely matching said prescribed set of activity/properties as indicated by said structure-activity models;

(ii) their respective predicted abilities to validate said structure-activity models;

(iii) their respective predicted abilities to discriminate between said structure-activity models;

(iv) their respective predicted abilities to have superior three-dimensional receptor fit; and

(v) similarity between their respective structural, physical, or chemical characteristics and characteristics of compounds in a structure-activity database whose activity/properties most closely conform to said prescribed set of activity/properties.

62. The synthesis protocol generator of claim 61, wherein said optimal set selecting means comprises:

means for selecting said optimal set by individually ranking said candidate compounds based on at least one of factors (i)-(v).

63. The synthesis protocol generator of claim 61, wherein said optimal set selecting means comprises:

means for selecting said optimal set by ranking combinations of said candidate compounds based on at least one of factors (i)-(v).

64. The synthesis protocol generator of claim 61, wherein said robotic synthesis instruction generating means comprises:

means for generating robotic synthesis instructions that, when executed, enable said chemical synthesis robot to robotically synthesize said optimal set of candidate compounds.
 Description Submit all comments and votes
 


BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates generally to the generation of chemical entities with defined physical, chemical or bioactive properties, and particularly to the automatic generation of drug leads via computer-based, iterative robotic synthesis and analysis of directed diversity chemical libraries.

2. Related Art

Conventionally, new chemical entities with useful properties are generated by identifying a chemical compound (called a "lead compound") with some desirable property or activity, creating variants of the lead compound, and evaluating the property and activity of those variant compounds. Examples of chemical entities with useful properties include paints, finishes, plasticizers, surfactants, scents, flavorings, and bioactive compounds, but can also include chemical compounds with any other useful property that depends upon chemical structure, composition, or physical state. Chemical entities with desirable biological activities include drugs, herbicides, pesticides, veterinary products, etc. There are a number of flaws with this conventional approach to lead generation, particularly as it pertains to the discovery of bioactive compounds.

One deficiency pertains to the first step of the conventional approach, i.e., the identification of lead compounds. Traditionally, the search for lead compounds has been limited to an analysis of compound banks, for example, available commercial, custom, or natural products chemical libraries. Consequently, a fundamental limitation of the conventional approach is the dependence upon the availability, size, and structural diversity of these chemical libraries. Although chemical libraries cumulatively total an estimated 9 million identified compounds, they reflect only a small sampling of all possible organic compounds with molecular weights less than 1200. Moreover, only a small subset of these libraries is usually accessible for biological testing. Thus, the conventional approach is limited by the relatively small pool of previously identified chemical compounds which may be screened to identify new lead compounds.

Also, compounds in a chemical library am traditionally screened (for the purpose of identifying new lead compounds) using a combination of empirical science and chemical intuition. However, as stated by Rudy M. Baum in his article "Combinatorial Approaches Provide Fresh Leads for Medicinal Chemistry," C&EN, Feb. 7, 1994, pages 20-26, "chemical intuition, at least to date, has not proven to be a particularly good source of lead compounds for the drug discovery process."

Another deficiency pertains to the second step of the conventional approach, i.e., the creation of variants of lead compounds. Traditionally, lead compound variants are generated by chemists using conventional chemical synthesis procedures. Such chemical synthesis procedures are manually performed by chemists. Thus, the generation of lead compound variants is very labor intensive and time consuming. For example, it typically takes many chemist years to produce even a small subset of the compound variants for a single lead compound. Baum, in the article referenced above, states that "medicinal chemists, using traditional synthetic techniques, could never synthesize all of the possible analogs of a given, promising lead compound" (emphasis added). Thus, the use of conventional, manual procedures for generating lead compound variants operates to impose a limit on the number of compounds that can be evaluated as new drug leads. Overall, the traditional approach to new lead generation is an inefficient, labor-intensive, time consuming process of limited scope.

Recently, attention has focused on the use of combinatorial chemical libraries to assist in the generation of new chemical compound leads. A combinatorial chemical library is a collection of diverse chemical compounds generated by either chemical synthesis or biological synthesis by combining a number of chemical "building blocks" such as reagents. For example, a linear combinatorial chemical library such as a polypeptide library is formed by combining a set of chemical building blocks called amino acids in every possible way for a given compound length (i.e., the number of amino acids in a polypeptide compound). Millions of chemical compounds theoretically can be synthesized through such combinatorial mixing of chemical building blocks. For example, one commentator has observed that the systematic, combinatorial mixing of 100 interchangeable chemical building blocks results in the theoretical synthesis of 100 million tetrameric compounds or 10 billion pentameric compounds (Gallop et al., "Applications of Combinatorial Technologies to Drug Discovery, Background and Peptide Combinatorial Libraries," Journal of Medicinal Chemistry, Volume 37, Number 9, pages 1233-1250, Apr. 29, 1994).

To date, most work with combinatorial chemical libraries has been limited only to peptides and oligonucleotides for the purpose of identifying bioactive agents; little research has been performed using non-peptide, non-nucleotide based combinatorial chemical libraries. It has been shown that the compounds in peptide and oligonucleotide based combinatorial chemical libraries can be assayed to identify ones having bioactive properties. However, there is no consensus on how such compounds (identified as having desirable bioactive properties and desirable profile for medicinal use) can be used.

Some commentators speculate that such compounds could be used as orally efficacious drugs. This is unlikely, however, for a number of reasons. First, such compounds would likely lack metabolic stability. Second, such compounds would be very expensive to manufacture, since the chemical building blocks from which they are made most likely constitute high priced reagents. Third, such compounds would tend to have a large molecular weight, such that they would have bioavailability problems (i.e., they could only be taken by injection).

Others believe that the compounds from a combinatorial chemical library that are identified as having desirable biological properties could be used as lead compounds. Variants of these lead compounds could be generated and evaluated in accordance with the conventional procedure for generating new bioactive compound leads, described above. However, the use of combinatorial chemical libraries in this manner does not solve all of the problems associated with the conventional lead generation procedure. Specifically, the problem associated with manually synthesizing variants of the lead compounds is not resolved.

In fact, the use of combinatorial chemical libraries to generate lead compounds exacerbates this problem. Greater and greater diversity has often been achieved in combinatorial chemical libraries by using larger and larger compounds (that is, compounds having a greater number of variable subunits, such as pentameric compounds instead of tetrameric compounds in the case of polypeptides). However, it is more difficult, time consuming, and costly to synthesize variants of larger compounds. Furthermore, the real issues of structural and functional group div