WikiPatents - Community Patent Review
Create Free Account  |  License or Sell Your Patent  |  WikiPatents Marketplace  |  WikiPatents Blog
Username:  Password:  
    
Advanced Search
System and method for creating memory-retained, formatted pages of text, tabulation, and graphic data    

Custom CD of patents similar to US4803643 : System and method for creating memory-retained, formatted pages of text, tabulation, and graphic data - $19.95
United States Patent4803643   
Link to this pagehttp://www.wikipatents.com/4803643.html
Inventor(s)Hickey; Thomas B. (Columbus, OH)
AbstractA system and method for treating publisher's textual and graphical data and converting it to device independent digital form for retention in master memory. Once both textual and graphical data have been converted to digital form, the graphical data are inserted in appropriate locations as determined by a formatting routine. Formatted articles then are collected for retention in master memory, and eventual production of local memory. The formatted page files residing in master memory are in a device independent form such that they are outputtable with high quality on a variety of output devices. Additionally, an index is generated for every word retained in master memory enabling the user to peruse as many as 30,000 complex pages for the desired subject matter.
   














 Title Information Submit all comments and votes
 
Patent Text Patent PDF Print Page Summary File History
Plain text PDF images Print Summary File History
Drawing from US Patent 4803643
System and method for creating memory-retained, formatted pages of text,

     tabulation, and graphic data - US Patent 4803643 Drawing
System and method for creating memory-retained, formatted pages of text, tabulation, and graphic data
Inventor     Hickey; Thomas B. (Columbus, OH)
Owner/Assignee     OCLC Online Computer Library Center Incorporated (Dublin, OH)
Patent assignment
All assignments
Company News
Publication Date     February 7, 1989
Application Number     07/083,779
PAIR File History     Application Data   Transaction History
Image File Wrapper   Patent Term   Fees
Litigation
Filing Date     August 10, 1987
US Classification     715/513 715/523
Int'l Classification     G06F 015/62
Examiner     Pellinen; A. D.
Assistant Examiner     Williams; H.
Attorney/Law Firm     Mueller and Smith
Address
Parent Case    
Priority Data    
USPTO Field of Search     364/518 364/523 340/735 382/9
Patent Tags     creating memory-retained, formatted pages text, tabulation, graphic data
   
Enter a comma (,) or semicolon (;) between multiple tag words/phrases.
Describe this patent:
 Amusing   
 Clever   
 Complex   
 Efficient   
 Historic   
 Important   
 Innovative   
 Interesting   
 Practical   
 Simple   
[no votes]
Patent WIKI

Share information and news about this patent, including information and news about the technology, inventors, company, ligation and licensing.

 References Submit all comments and votes
 
*references marked with an asterisk below are user-added references
 U.S. References
 
Add a new US reference:  
ReferenceRelevancyCommentsReferenceRelevancyComments
4748678
Takeda
382/306
May,1988

[0 after 0 votes]
4745561
Hirosawa

May,1988

[0 after 0 votes]
4679153
Robinson
345/468
Jul,1987

[0 after 0 votes]
4677571
Riseman
358/1.9
Jun,1987

[0 after 0 votes]
4608664
Bartlett
358/1.2
Aug,1986

[0 after 0 votes]
4601003
Yoneyama
715/775
Jul,1986

[0 after 0 votes]
 Foreign References
 Other References
 Market Review Submit all comments and votes
   
Market Size
Estimate the gross annual revenues of the relevant market sector:
> $10B
$5B - $10B
$2B - $5B
$500M - $2B
$100M - $500M
$10M - $100M
$1M - $10M
$500K - $1M
$100K - $500K
< $100K
[No votes]
$0
 
$0   $2.5B   $5B   $7.5B   $10B

[0 market size comments]
Market Share
Estimate the percentage of the relevant market sector this invention will capture:
75% - 100%
50% - 74.99%
25% - 49.99%
10 - 24.99%
5 - 9.99%
2 - 4.99%
1 - 1.99%
< 1%
[No votes]
0.0%
 
0%   25%   50%   75%   100%

[0 market share comments]
Reasonable Royalty
What percentage of gross sales should the inventor or assignee be paid?
75% - 100%
50% - 74.99%
25% - 49.99%
10 - 24.99%
5 - 9.99%
2 - 4.99%
1 - 1.99%
< 1%
[No votes]
0.0%
 
0%   25%   50%   75%   100%

[0 reasonable royalty comments]
Public's "Guesstimation" of Royalty Value
Market SizeN/A[No votes]
xMarket ShareN/A[No votes]
xReasonable RoyaltyN/A[No votes]

N/A

[0 Guesstimation of Royalty Value Comments]
License Availablity
If you are NOT the owner or assignee, answer here:
Yes, license is available for purchase

No, license is not currently available



[No votes]
[0 license availability comments]
License Availablity
If you ARE the owner or assignee, answer here:
Yes, license is available for purchase

No, license is not currently available



[No votes]
[0 owner/assignee comments]
Competitive Advantage
Does this invention have a significant competitive advantage over similar technologies?
Yes

No



[No votes]
Most helpful competitive advantage comment
[No comments]

[0 competitive advantage comments]
Commercial Alternatives
Are there viable commercial alternatives for this invention?
Yes

No



[No votes]
Most helpful commercial alternative comment
[No comments]

[0 commercial alternatives comments]
 Technical Review Submit all comments and votes
 Claims Submit all comments and votes
 


We claim:

1. A system for converting visual illustrations on paper and associated magnetic media retained text and tabulation typesetting data of a printed publication of articles, into universally outputable digital data form retained in master memory, comprising:

graphics locator means for determining the insertion locations referenced within said text and tabulations typesetting data representing the positioning of said visual illustrations;

scanning means for converting said visual illustrations to digital graphic data;

means for generating graphic property data representing the graphics size characteristics of said digital graphic data;

means for providing font size data for the text characters represented by said text typesetting data;

means for providing font image defining data;

formatting means responsive to said insertion location, said text typesetting data, said graphic property data and said font size data for generating device independent formatted page files; and

mastering means responsive to said digital graphic data, said device independent formatted page files, and said font image defining data for deriving said universally outputable digital data retained within said master memory.

2. The system of claim 1 including:

means for altering those portions of said text typesetting data not configured with standardized general mark-up language to SGML configured data incorporating standardized general mark-up language; and

said formatting means is responsive to said SGML configured data as said text typesetting data.

3. The system of claim 1 including:

article identification means responsive to said text and tabulation typesetting data for assembly of a directory of all files corresponding with each article of said printed publication; and

said graphics locator means is responsive to said text and tabulations typesetting data and said article identification means directory for determining said insertion locations.

4. The system of claim 1 including:

tabluation identifier means responsive to said text and tabulations typesetting data for deriving a separate tabulation file corresponding therewith;

said formatting means is responsive to said tabulation file for deriving a device independent formatted tabulation file; and

including tabulation readout means responsive to said device independent formatted tabulation file for deriving a visually inspectable tabulation check readout corresponding therewith.

5. The system of claim 1 including:

equation identifier means responsive to said text and tabulations typesetting data for deriving a separate equation file corresponding therewith;

said formatting means is responsive to said equation file for deriving a device independent formatted equation file; and

including equation readout means responsive to said device independent formatted equation file for deriving a visually inspectable equation check readout corresponding therewith.

6. The system of claim 4 including change means responsive to applied corrections derived by visual inspection of said tabulation check readout for correctively effecting alteration of said tabulation file article identification means directly of files.

7. The system of claim 5 including change means responsive to applied corrections derived by visual inspection of said equation check readout for correctively effecting alteration of said equation file.

8. The system of claim 1 including numbering means for numbering each paragraph of each said article.

9. The system of claim 1 including graphics readout means responsive to said graphics locator means insertion locations, said scanning means digital graphic data and said graphic property data for deriving a visually inspectable graphics check readout corresponding therewith.

10. The system of claim 1 including:

tabulation identifier means responsive to said text and tabulations typesetting data for deriving a separate tabulation file corresponding therewith;

said formatting means is responsive to said tabulation file for deriving a device independent formatted tabulation file;

including tabulation readout means responsive to said device independent formatted tabulation file for deriving a visually inspectable tabulation check readout corresponding therewith;

change means responsive to applied corrections derived by visual inspection of said tabulation check readout for correctively effecting alteration of said tabulation file; and

graphics readout means responsive to said graphics locator means insertion locations, said scanning means digital graphic data and said graphic property data for deriving a visually inspectable graphics check readout corresponding therewith.

11. The system of claim 1 including:

equation identifier means responsive to said text and tabulations typesetting data for deriving a separate equation file corresponding therewith;

said formatting means is responsive to said equation file for deriving a device independent formatted equation file;

including equation readout means responsive to said device independent formatted equation file for deriving a visually inspectable equation check readout corresponding therewith;

change means responsive to applied corrections derive by visual inspection of said equation check readout for correctively effecting alteration of said equation file; and

graphics readout means responsive to said graphics locator means insertion locations, said scanning means digital graphic data and said graphic property data for deriving a visually inspectable graphics check readout corresponding therewith.

12. The system of claim 1 including:

indexing means responsive to said text and tabulation typesetting data for deriving a word list corresponding therewith said, indexing means further associating each said word of said word list with a select field associated therewith to provide a unique word list; and

said mastering means being further responsive to said unique word list for retention in said master memory.

13. The system of claim 12 including a comparison means responsive to a stop word list of non-essential words for deriving a refined unique word list.

14. The system of claim 1 including:

article identification means responsive to said text and tabulation typesetting data and said assembly of a directory of files corresponding with each article of said printed publication;

page readout means responsive to said insertion locations, said text typesetting data, said graphics property data, said digital graphic data and said font size data for deriving a visually inspectable page check readout for each page formatted by said formatting means; and

change means responsive to applied corrections derived by visual inspection of said page check readout for effecting alteration of said page formatted by said formatting means.

15. A method for converting the graphic illustrations on paper and associated magnetic media retained text typesetting data from which has been derived a printed assemblage of articles, into device independent digital data form retained in master memory, comprising the steps of:

identifying the insertion locations within said magnetic media retained text typesetting data for each said graphic illustration and generating a corresponding whatgraph file;

scanning each said graphic illustration and deriving digital graphic data corresponding therewith;

generating graphic property data representing the graphics size characteristics of said digital graphic data;

providing font size data;

providing font image defining data;

formatting said text typesetting data, said whatgraph file, said graphic property data, and said font size data to provide device independent formatted page files; and

combining said device independent formatted page files, said font image defining data, and said digital graphic data in magnetic media for retention in said master memory.

16. The method of claim 15 including the step of collecting and identifiying all files from said magnetic media retained text typesetting data corresponding with each said article of said assemblage and deriving therefrom a corresponding directory of said files.

17. The method of claim 15 including the step of assigning identifying numbers to the paragraphs of each said article.

18. The method of claim 15 including the step of altering said text typesetting data to incorporate standardized general mark-up language to provide said text typesetting data as standard SGML data.

19. The method of claim 15 including the steps of:

generating a visual graphic readout from said digital graphic data, said whatgraph file and said graphic property data for each said graphic illustration; and

rescanning said grahic illustration in the event of a defect represented in said visual graphic readout.

20. The method of claim 15 including the steps of:

generating a list of words from said text typesetting data; and

combining said word list in said magnetic media for retention as index data in said master memory.

21. The method of claim 20 including the steps of:

providing a stop word list of non-essential words; and

removing said non-essential words from said list of words to provide a refined word list for retention as indexed data in said master memory.

22. The method of claim 20 including the step of associating each said word of said word list with a select field associated therewith to provide a unique word list for retention as index data in said master memory.

23. The method of claim 21 including the step of associating each said word of said refined word list with a select field associated therewith to provide a unique refined word list for retention as index data in said master memory.

24. The method of claim 15 including the steps of:

generating a page visual readout from said device independent formatted page files, said font image defining data, and said digital graphic data; and

applying corrections to effect alteration of said page visual readout in the event of a defect represented in said page visual readout.

25. The method of claim 15 including steps of:

generating a graphic readout from said digital graphic data, said whatgraph file and said graphic property data for each said graphic illustration;

rescanning said grahic illustration in the event of a defect represented in said visual graphic readout;

generating a page visual readout from said device independent formatted page files, said font image defining data, and said digital graphic data; and

applying corrections to effect alteration of said page visual readout in the event of a defect represented in said page visual readout.

26. A method for treating the graphic illustration on paper and associated computer storage media retained text and tabulation typesetting data from which has been derived a printed assemblage of articles to provide corresponding device independent digital data for retention in master memory, comprising the steps of:

identifying the insertion locations within said magnetic media retained text typesetting data for each said graphic illustration and generating a corresponding whatgraph file;

scanning each said grahic illustration and deriving digital graphic data corresponding therewith;

generating graphic property data representing the graphics size characteristics of said digital graphic data;

formatting said tabulation typesetting data, and said font size data to provide device independent formatted tabulation files;

generating a tabulation visual image from said device independent formatted tabulation files;

applying corrections to said directory of files in the event of a defect represented in said tabulation visual image;

formatting said text typesetting data, said device independent formatted tabulation files, said whatgraph file, said graphic property data and said font size data to provide device independent formatted page files; and

combining said device independent formatted page files, said font image defining data and said digital graphic data in magnetic media for retention in said master memory.

27. The method of claim 26 including step of collecting and identifying all files from said magnetic media retained text and tabulation typesetting data corresponding with each said article of said assemblage and deriving therefrom a corresponding directory of said files.

28. The method of claim 26 including the step of assigning identifying numbers to paragraphs of each said article.

29. The method of claim 26 including the step of altering said tabulation and text typesetting data to incorporate standardized general mark-up language to provide said text typesetting data as standard SGML data and said tabulation typesetting data as table SGML data.

30. The method of claim 26 including the steps of:

generating a visual graphic readout from said digital graphic data, said whatgraph file and said graphic property data for each said graphic illustration; and

rescanning said graphic illustration in the event of a defect represented in said visual graphic readout.

31. The method of claim 26 including the steps of:

generating a list of words from said tabulation and text typesetting data; and

combining said word list in said magnetic media for retention as index data in said master memory.

32. The method of claim 31 including the steps of:

providing a stop word list of non-essential words; and

removing said non-essential words from said list of words to provide a refined word list for retention as index data in said master memory.

33. The method of claim 31 including step of associating each said word of said word list with a select field associated therewith to provide a unique word list for retention as index data in said mater memory.

34. The method of claim 32 including the step of associating each said word of said refined word list with a select field associated therewith to provide a unique refined word list for retention as index data in said master memory.

35. The method of claim 26 including the steps of:

generating a page visual image from said device independent formatted page files, said font image defining data, and said digital graphic data; and

applying corrections to in the event of a defect represented in said page visual image.

36. The method of claim 26 including the steps of:

generating a visual graphic readout from said digital graphic data, said whatgraph file and said graphic property data for each said graphic illustration; and

rescanning said graphic illustration in the event of a defect represented in said visual graphic readout;

generating a page visual image from said device independent formatted page files, said font image defining data, and said digital graphic data; and

applying corrections to effect alteration of said visual image in the event of a defect represented in said page visual image.

37. A method for converting graphic illustrations retained in an associated storage media, and text typesetting data retained in an associated storage media, into device independent digital form retained in master memory, comprising the steps of:

identifying the insertion locations within said media retained text typesetting data for each said graphic illustration and generating a corresponding whatgraph file;

generating graphic property data representing the graphics size characteristics of said graphic illustration data;

providing font size data;

providing font image defining data;

formatting said text and tabulation typesetting data, said whatgraph file, said graphic property data, and said font size data to provide device independent formatted page files; and

combining said device independent formatted page files, said font image defining data, and said graphic data for retention in said master memory.

38. The method of claim 37 including the step of collecting and identifying all files from said media retained text typesetting data corresponding with each said article of said assemblage and deriving therefrom a corresponding directory of said files.

39. The method of claim 37 including the step of assigning identifying numbers to paragraphs of each said article.

40. The method of claim 37 including the step of altering said text typesetting data to incorporate standardized general make-up language to provide said text typesetting data as standard SGML data.

41. The method of claim 37 including the steps of:

generating a list of words from said text and tabulation typesetting data; and

combining said word list for retaining as index data in said master memory.

42. The method of claim 41 including the steps of:

providing the stop word list of non-essential words; and

removing said non-essential words from said list of words to provide a refined word list for retention as index data in said master memory.

43. The method of claim 41 including the step of associating each said word of said word list with a select field associated therewith to provide a unique word list for retention as index data in said master memory.

44. The method of claim 42 including the step of associating each said word of said refined word list with a select field associated therewith to provide a unique refined word list for retention as index data in said master memory.

45. The method of claim 37 including the steps of:

generating a page visual readout from said device independent formatted page files, said font image defining data, and said graphic data; and

applying corrections to effect alteration of said page visual readout in the event of a defect in said page visual readout.
 Description Submit all comments and votes
 


BACKGROUND

As the amount of published information pertaining to science, business, reference and the like has grown, there has arisen a concomitant requirement to address the difficulties of space restraints associated with storing the great bulk of paper materials generated. Additionally, selective retrieval of documents and articles from these collections has cebome elaborate. As an example of the significance of storage requirements, a typical 10 year collection of merely one scientific journal may amount to 30,000 pages. Thus, costs to a library facility to carry out conventional binding and shelf storage of significant numbers of such publications have become expensive. To at least lessen the storage requirements, resort has been made for providing microfiche copies of the materials. However, ussers of such storage media find the approach somewhat unsatisfactory, particularly with respect to the information search and hard copy aspects.

Investigators have considered and implemented the storage of such information within the magnetic media of principal computer installations in conjunction with on-line communication with satellite or subscriber terminals provided, for example, as the ubiquitous personal computer (PC). Such systems offer advantages in terms of efficient information storage, as well as in providing searching techniques which may be interactive with the user or library patron. However, such advantages are somewhat offset by the costs of communication between host and satellite terminal, as well as the extended time factor often associated with host-to-PC interactive communication of graphical data, and the less than satisfactory visual quality of the retrieved graphical output.

Over the recent past, optical disk technology has introduced the CD ROM, a compact device capable of carrying a database, of approximately 600 megabytes of digital data. Accordingly, one such device is capable of retaining the equivalent of a decade collection of full text and graphics for a significantly sized scientific journal or the like. Further, because of the reasonable production costs involved, once a master is produced, the CD ROMs can be relatively widely distributed for interactive employment by library patrons or users in the field. High communication costs are avoided, as well as the excessive delays occasioned in accession data through communications links while providing high resolution output on a wide variety of screens and printers.

To effectively implement such high density local memory devices for retaining these reference materials, however, a practical technique or system for creating the necessary master from which they are formed is necessary. A practical commercialization of such technology requires that the output of complex typesetting systems used by publishers can be algorithmically translated into a standard format with a minimum of human intervention. Because such publications conventionally incorporate graphics, tabulations and, very often, chemical and mathematical symbols and equations practical techniques also are required for effectively combining such materials with text in a manner wherein page printouts will closely reflect the quality of the original publication. Finally, an effective indexing and searching facility is required of the master structure to permit adequate access on the part of the local user or patron of the data retained in the CD ROM in an effective and efficient interactive manner. For example, Boolean retrieval techniques, searches conducted by the occurrence of words and strings within specificed fields and paragraphs are desirable. The outputs at the local terminal must be adequately presentable both on the screen of a conventional PC monitor and through a conventional, reasonably priced printer. Thus, searching can be carried out at this local situs and full text materials can be retrieved in printed form.

SUMMARY

The present invention is addressed to a system and method for treating data carrying magnetic tape, as well as published graphics, text and illustrative materials used in forming extensive publications so they may be retained in a component storage device such as a small CD ROM. This conversion is carried out employing a technique permitting the production of these local memories at practical costs. The economy of the system and method results from limited operator intervention. In this regard, the conversion of publisher provided graphic and textual data into digital, formatted, device independent pages of master memory is almost completely automated. Once the system has been set-up for converting a particular publisher's data, the process operator's only task is to assure quality control.

As a result of the high level of practicality of the instant system and method, a small CD ROM local memory may be produced to retain, on a practical basis, substantial amounts of data. For example, the single CD ROM may contain a typical 10-year collection of a scientific journal which may amount to 30,000 complex pages. These retained data not only will include text which is reproducable on the monitors and printers of conventional personal computers, but also graphics, illustrations and tables.

Aside from being practical to produce, the local memories created by the present system and method include extensive searching capabilities and a high quality output on a variety of the noted "PC" output devices, i.e. monitors and printers. The local memory index allows the user to jump to the occurrence of any word or words of the 30,000 pages stored on the CD ROM disk. This index is created by listing all the words of every article and addressing each by field, paragraph, and sentence. In addition to the index, the user may take advantage of the table of contents generated by the present invention for each article, allowing the user at a monitor to skip to the appropriate field, heading, or sub-heading as desired. Further, the user may skip to any graphic illustration referenced in the text with a single keystroke and then may magnify this grahic illustration for careful analysis before outputting a hard copy of the article. Since the page format of the article is frozen by the present invention and not determined by the output device employed, the quality of the output obtained will not generally be affected by the user's selection of an output device. The invention provides an inexpensive solution to the storage, indexing, and output of complex reference data.

Another feature of the invention is to provide a system for converting visual illustrations on paper and associated magnetic media retained text and tabulation typesetting data of a printed publication of articles into universally outputtable digital data form retained in master memory. This system includes a graphic locator which determines the insertion location of the reference within the text and tabulation typesetting data representing the positioning of visual illustrations. A scanning arrangement is provided for converting the visual illustrations to digital graphic data and an arrangement is provided for generating graphic property data representing a graphic size characteristic data. An arrangement is made for providing font size data for the text characters represented by the text typesetting data as well as for providing font image defining data. A formatting arrangement then is provided which is responsive to the insertion locations, the text typesetting data, the graphic property data and the font size data for generating the device independent formatted page files. Finally, a mastering arrangement is provided which is responsive to the digital graphic data, the device independent formatted page files and the font image defining data for deriving the universally printable data retained within the master memory.

Another feature of the invention provides a method for converting graphic illustrations on paper and associated magnetic media retained text typesetting data from which has been derived a printed assemblage of articles into device independent digital data form retained in master memory which comprises the steps of identifying the insertion locations within the magnetic media retained text typesetting data for each graphic illustration and generating a corresponding whatgraph file. Each graphic illustration is scanned and there is derived digital graphic data corresponding therewith and there is generated graphic property data representing the graphic size characteristics of the digital graphic data. Font size data are provided and font image defining data are provided whereupon a formatting is carried out of the text typesetting data, the whatgraph file, the graphic property data, and the font size data to provide device independent formatted page files. These device independent formatted page files are combined along with the font image defining data and the digital graphic data in magnetic media for retention in the master memory.

Another feature of the invention is to provide a method for treating the graphic illustration on paper and associated computer storage media retained text and tabulation typesetting data from which has been derived a printed assemblage of articles to provide corresponding device independent digital data for retention in master memory. This method includes the steps of identifying the insertion locations within the magnetic media retained text typesetting data for each graphic illustration and generating a corresponding whatgraph file. The method further includes the step of scanning each graphic illustration and deriving digital graphic data corresponding therewith as well as the step of generating graphic property data representing the graphic size characterisitc of the digital graphic data. The tabulation typesetting data and font size data are formatted to provide device independent formatted tablution files and there is generated a tabulation video image from the device independent formatted tabulation files. The method further includes the step of applying corrections to the directory of files in the event of a defect represented in the tabulation digital image from the step of formatting the text typesetting data, the device independent formatted tabulation files, the whatgraph file, the graphic property data and font size data to provide device independent formatted page file. The device independent formatted page files, the font imaging data and the digital graphic data are combined in magnetic media for retention in master memory.

Another feature of the invention provides a method for converting graphic illustrations retained in an associated storage media, and text typesetting data retained in an associated storage media into device independent digital form retained in master memory. Steps in the method include identifying the insertion locations within the media retained text typesetting data for each graphic illustration and generating a corresponding whatgraph file. Property data representing size characteristics also is generated for each graphic illustration from analysis of the graphic illustration media. Following the provision of font size data, formatting of the text typesetting data, whatgraph file, the graphic property data, and the font size data provides device independent formatted page files. These device independent formatted page files are combined along with the provided font image defining data and graphic illustration data for retention in master memory.

Other objects of the invention will, in part, be obvious and will, in part, appear hereinafter.

The invention, accordingly, comprises the system and method possessing the construction, combination of elements, and arrangements of parts which are exemplified in the following description.

For a fuller understanding of the nature and objects of the invention, reference should be had to the following detailed description taken in connection with the accompanying drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a broad flow diagram of the system and method of the invention;

FIGS. 2A-2D combine to provide a detailed flow diagram of the invention;

FIG. 3 is an exemplary directory file tree for the invention;

FIG. 4 is an exploded view of the SGML conversion step of the invention;

FIG. 5 is an exploded view of the TeX MACROS step of the invention;

FIG. 6 is an exploded view of the TITLE 2.HDR MACRO;

FIG. 7 is an exploded view of the FONTS.TeX MACRO;

FIG. 8 is an exploded view of the GRAPHICS.TeX MACRO;

FIG. 9 is an exploded view of the SPECCHAR.TeX MACRO;

FIG. 10 is an exploded view of the PLAIN.TeX MACRO;

FIG. 11 is an exploded view of the TITLE.HDR MACRO; and

FIG. 12 is an exploded view of the TBLMACS.TeX MACRO.

DETAILED DESCRIPTION

The instant system and process develops master tapes suitable for creating conveniently sized local memory such as CD ROMs essentially from two products generated by a publisher, to wit, the paper text produced by the publisher and the publisher production tapes used in typesetting the textual material. While all textual and graphic data of the original work are retained, a printout generated of any given article or the like, while resembling the original material, will be a unique recreation thereof which has been improved with the addition of comprehensive indexing data to facilitate user searches. The process permits the production of these master tapes with necessary human intervention, but at a level remaining practical to an extent permitting reasonably priced wider dissemination of the resultant CD ROMs or similar local memories.

Referring to FIG. 1, the process and system of the invention are revealed in general fashion at 10. The chart 10 shows the typeset pages made available from the publisher at block 12 and the magnetic media such as tapes carrying the text data employed for typesetting purposes by the publisher at symbol 14. These materials as provided on tape will vary in data architecture from publisher to publisher and thus initial treatment often is necessary due to a lack of standardization. Looking, however, to the typeset pages made available as represented at block 12, as shown at line 16, the paper text carrying graphics and some tabular material which may not be available in magnetic media is SCANned as represented at a station represented by circle 18. Generally, such SCANning will involve only those sheets of paper material carrying such graphics and may, for example, be carried out by a CCITT Group 4 standard type device. The resultant digitized information now present in magnetic media is then directed as represented at line 20 to the display of a computer terminal represented at operational symbol 22. At this station, the operator "CROPs" each graphics image or the like not otherwise available in the data from tape 14. This procedure generally will define the properties of each graphics item, for example, the size of the item and, where appropriate, the rotation. As a result ofthe activity at station 22, as represented at line 24 and block 26, data in magnetic medium representing the noted size os each graphics element has been developed. Similarly, as represented at line 28 and block 30 the image data representing the graphics material as developed by the Group 4 device are present in magnetic media.

Returning to the publisher's magnetic data as described at symbol 14, as represented at lines 32 and 34 leading to the operation represented at symbol 36, a determination by supporting computer program is made as to where graphics materials are intended to be located as represented in the publisher's typesetting tapes. Generally, these tapes will not contain any graphics data but will contain some indicia as to the appropriate location for such graphics. Accordingly, the "WHATGRAPH" program 36 is carried out, and as represented at line 38, these data so developed are made available to reference the location of the graphics for use by the activity as represented at CROPping symbol 22. Thus, a cross-referencing between the two inputs to the system occurs at this juncture.

Line 34 also is seen to be directed to a computer activity as represented at symbol 40 wherein the data available in magnetic media from the publisher are converted to a Standardized General Mark-up Language (SGML). This conversion is to a format representing a current standard in the publishing industry for carrying out character, paragraph, reference, heading, subtitle, and the like definition. Inasmuch as the instant system is one wherein, for example, as many as 10 years worth of a scientific journal may be complied upon a single CD ROM, the data retained in magnetic tape will not always have been developed in accordance with such standards. The SGML data which are developed will not only carry data corresponding with the text of any given article but also will retain corresponding information defining all those tables and the like which formed a portion of the initial data or information carried by the publishers typesetting tapes 14.

Because the data developed from the SGML activites as represented at symbol 40 will contain both text data as well as, where available, data defining tables and equations, these data elements will be treated separately. In this regard by retaining a capability for developing table and equation data without resort to scanning procedures as represented at symbol 18, a substantial savings in memory requirements is realized. Accordingly, as represented at lines 42, 45, and 44 the equational, tabular, and textual data are seen to be separated into discrete magnetic files as represented, respectively, at blocks 47, 46, and 48.

Because the ultimate memory retained materials are intended for a somewhat universal application or are constituted as "device independent" (DVI), a well accepted formatting program is employed with the instant system and method referred to a "TeX." This program, for example, is described in detail in a volume entitled "The TeXbook" by D. E. Knuth of Stanford University, published by Addison-Wesley Publishing Company, Reading, Massachusetts. To employ this particular formatting program, it then becomes necessary to convert the SGML developed data as represented at blocks 47, 46, and 48 to a format acceptable for employment in the noted TeX program. This procedure is represented at lines 49, 50, and 52 emanating, respectively, from the equational, tabular, and textual data represented at blocks 47, 46, and 48. Lines 49, 50, and 52 are shown extending to the operation represented at symbol 54 whereupon the translated data then are submitted to the noted TeX program as represented at line 56 extending to operation symbol 58. The TeX program 58, in carrying out the formatting of each page of materials, employs the graphics size data earlier developed and represented at block 26. This employment is represented by line 60. Additionally, the TeX program as represented at 58 used font size data made available in magnetic medium per election (by macro instructions discussed in detail later herein) of a font by the producing personnel as represented at block 62. These data are shown extending to the TeX operation at symbol 58 as represented at line 64. The data thus supplied to the TeX program includes equational material, tabular material, textual material, font size material, and the size data for any graphics involved. Thus, formatting can be carried out and such formatted material then, as represented at line 66, is made available as a formatted device independent (DVI) page file represented by block 68. As noted above, however, the material available from the TeX program as represented at symbol 58 will not contain data representing the font images, per election, but only size data and further will not contain the image data for the graphics developed from scanning and the like as represented at block 30. Accordingly, as represented at line 70, the graphic image data are made available to the DVI file as at 68 as a separate component. In similar fashion, the data required for imaging individual fonts also are made available to the DVI files represented at block 68. In this regard, the elected font image data are represented at block 72 and the input to the DVI files represented at block 68 is shown at line 74. The materials thus generated, then are retained, as represented at line 76 and symbol 78, in a master tape suitable for producing a readily disseminated local memory such as a CD ROM. Also retained on the master tapes as represented at symbol 78 are the index data which are generated from the SGML files which previously have been developed as discussed in conjunction with symbol 40. In this regard, the data are subject to an index generation operation as represented by line 80 and operation symbol 82 labeled "INDEX." The index will include an alphabetical listing of the unique, refined words in any given article referenced for instance by field, paragraph, and sentence. These materials then are submitted to the master tape as represented at line 84. In similar fashion, it is necessary to provide sufficient data for employment of the local memory with any of a variety of personal computers and, thus, necessary data as represented at block 86 entitled "PC SOFTWARE" are submitted as represented at line 88 to the master tape represented at symbol 78. It may be observed that the device independent materials now contained in the master tape 78 will be uniform in generated output from output device, i.e. monitor or printer, to