|
Claims  |
|
|
We claim:
1. A system for converting visual illustrations on paper and associated
magnetic media retained text and tabulation typesetting data of a printed
publication of articles, into universally outputable digital data form
retained in master memory, comprising:
graphics locator means for determining the insertion locations referenced
within said text and tabulations typesetting data representing the
positioning of said visual illustrations;
scanning means for converting said visual illustrations to digital graphic
data;
means for generating graphic property data representing the graphics size
characteristics of said digital graphic data;
means for providing font size data for the text characters represented by
said text typesetting data;
means for providing font image defining data;
formatting means responsive to said insertion location, said text
typesetting data, said graphic property data and said font size data for
generating device independent formatted page files; and
mastering means responsive to said digital graphic data, said device
independent formatted page files, and said font image defining data for
deriving said universally outputable digital data retained within said
master memory.
2. The system of claim 1 including:
means for altering those portions of said text typesetting data not
configured with standardized general mark-up language to SGML configured
data incorporating standardized general mark-up language; and
said formatting means is responsive to said SGML configured data as said
text typesetting data.
3. The system of claim 1 including:
article identification means responsive to said text and tabulation
typesetting data for assembly of a directory of all files corresponding
with each article of said printed publication; and
said graphics locator means is responsive to said text and tabulations
typesetting data and said article identification means directory for
determining said insertion locations.
4. The system of claim 1 including:
tabluation identifier means responsive to said text and tabulations
typesetting data for deriving a separate tabulation file corresponding
therewith;
said formatting means is responsive to said tabulation file for deriving a
device independent formatted tabulation file; and
including tabulation readout means responsive to said device independent
formatted tabulation file for deriving a visually inspectable tabulation
check readout corresponding therewith.
5. The system of claim 1 including:
equation identifier means responsive to said text and tabulations
typesetting data for deriving a separate equation file corresponding
therewith;
said formatting means is responsive to said equation file for deriving a
device independent formatted equation file; and
including equation readout means responsive to said device independent
formatted equation file for deriving a visually inspectable equation check
readout corresponding therewith.
6. The system of claim 4 including change means responsive to applied
corrections derived by visual inspection of said tabulation check readout
for correctively effecting alteration of said tabulation file article
identification means directly of files.
7. The system of claim 5 including change means responsive to applied
corrections derived by visual inspection of said equation check readout
for correctively effecting alteration of said equation file.
8. The system of claim 1 including numbering means for numbering each
paragraph of each said article.
9. The system of claim 1 including graphics readout means responsive to
said graphics locator means insertion locations, said scanning means
digital graphic data and said graphic property data for deriving a
visually inspectable graphics check readout corresponding therewith.
10. The system of claim 1 including:
tabulation identifier means responsive to said text and tabulations
typesetting data for deriving a separate tabulation file corresponding
therewith;
said formatting means is responsive to said tabulation file for deriving a
device independent formatted tabulation file;
including tabulation readout means responsive to said device independent
formatted tabulation file for deriving a visually inspectable tabulation
check readout corresponding therewith;
change means responsive to applied corrections derived by visual inspection
of said tabulation check readout for correctively effecting alteration of
said tabulation file; and
graphics readout means responsive to said graphics locator means insertion
locations, said scanning means digital graphic data and said graphic
property data for deriving a visually inspectable graphics check readout
corresponding therewith.
11. The system of claim 1 including:
equation identifier means responsive to said text and tabulations
typesetting data for deriving a separate equation file corresponding
therewith;
said formatting means is responsive to said equation file for deriving a
device independent formatted equation file;
including equation readout means responsive to said device independent
formatted equation file for deriving a visually inspectable equation check
readout corresponding therewith;
change means responsive to applied corrections derive by visual inspection
of said equation check readout for correctively effecting alteration of
said equation file; and
graphics readout means responsive to said graphics locator means insertion
locations, said scanning means digital graphic data and said graphic
property data for deriving a visually inspectable graphics check readout
corresponding therewith.
12. The system of claim 1 including:
indexing means responsive to said text and tabulation typesetting data for
deriving a word list corresponding therewith said, indexing means further
associating each said word of said word list with a select field
associated therewith to provide a unique word list; and
said mastering means being further responsive to said unique word list for
retention in said master memory.
13. The system of claim 12 including a comparison means responsive to a
stop word list of non-essential words for deriving a refined unique word
list.
14. The system of claim 1 including:
article identification means responsive to said text and tabulation
typesetting data and said assembly of a directory of files corresponding
with each article of said printed publication;
page readout means responsive to said insertion locations, said text
typesetting data, said graphics property data, said digital graphic data
and said font size data for deriving a visually inspectable page check
readout for each page formatted by said formatting means; and
change means responsive to applied corrections derived by visual inspection
of said page check readout for effecting alteration of said page formatted
by said formatting means.
15. A method for converting the graphic illustrations on paper and
associated magnetic media retained text typesetting data from which has
been derived a printed assemblage of articles, into device independent
digital data form retained in master memory, comprising the steps of:
identifying the insertion locations within said magnetic media retained
text typesetting data for each said graphic illustration and generating a
corresponding whatgraph file;
scanning each said graphic illustration and deriving digital graphic data
corresponding therewith;
generating graphic property data representing the graphics size
characteristics of said digital graphic data;
providing font size data;
providing font image defining data;
formatting said text typesetting data, said whatgraph file, said graphic
property data, and said font size data to provide device independent
formatted page files; and
combining said device independent formatted page files, said font image
defining data, and said digital graphic data in magnetic media for
retention in said master memory.
16. The method of claim 15 including the step of collecting and
identifiying all files from said magnetic media retained text typesetting
data corresponding with each said article of said assemblage and deriving
therefrom a corresponding directory of said files.
17. The method of claim 15 including the step of assigning identifying
numbers to the paragraphs of each said article.
18. The method of claim 15 including the step of altering said text
typesetting data to incorporate standardized general mark-up language to
provide said text typesetting data as standard SGML data.
19. The method of claim 15 including the steps of:
generating a visual graphic readout from said digital graphic data, said
whatgraph file and said graphic property data for each said graphic
illustration; and
rescanning said grahic illustration in the event of a defect represented in
said visual graphic readout.
20. The method of claim 15 including the steps of:
generating a list of words from said text typesetting data; and
combining said word list in said magnetic media for retention as index data
in said master memory.
21. The method of claim 20 including the steps of:
providing a stop word list of non-essential words; and
removing said non-essential words from said list of words to provide a
refined word list for retention as indexed data in said master memory.
22. The method of claim 20 including the step of associating each said word
of said word list with a select field associated therewith to provide a
unique word list for retention as index data in said master memory.
23. The method of claim 21 including the step of associating each said word
of said refined word list with a select field associated therewith to
provide a unique refined word list for retention as index data in said
master memory.
24. The method of claim 15 including the steps of:
generating a page visual readout from said device independent formatted
page files, said font image defining data, and said digital graphic data;
and
applying corrections to effect alteration of said page visual readout in
the event of a defect represented in said page visual readout.
25. The method of claim 15 including steps of:
generating a graphic readout from said digital graphic data, said whatgraph
file and said graphic property data for each said graphic illustration;
rescanning said grahic illustration in the event of a defect represented in
said visual graphic readout;
generating a page visual readout from said device independent formatted
page files, said font image defining data, and said digital graphic data;
and
applying corrections to effect alteration of said page visual readout in
the event of a defect represented in said page visual readout.
26. A method for treating the graphic illustration on paper and associated
computer storage media retained text and tabulation typesetting data from
which has been derived a printed assemblage of articles to provide
corresponding device independent digital data for retention in master
memory, comprising the steps of:
identifying the insertion locations within said magnetic media retained
text typesetting data for each said graphic illustration and generating a
corresponding whatgraph file;
scanning each said grahic illustration and deriving digital graphic data
corresponding therewith;
generating graphic property data representing the graphics size
characteristics of said digital graphic data;
formatting said tabulation typesetting data, and said font size data to
provide device independent formatted tabulation files;
generating a tabulation visual image from said device independent formatted
tabulation files;
applying corrections to said directory of files in the event of a defect
represented in said tabulation visual image;
formatting said text typesetting data, said device independent formatted
tabulation files, said whatgraph file, said graphic property data and said
font size data to provide device independent formatted page files; and
combining said device independent formatted page files, said font image
defining data and said digital graphic data in magnetic media for
retention in said master memory.
27. The method of claim 26 including step of collecting and identifying all
files from said magnetic media retained text and tabulation typesetting
data corresponding with each said article of said assemblage and deriving
therefrom a corresponding directory of said files.
28. The method of claim 26 including the step of assigning identifying
numbers to paragraphs of each said article.
29. The method of claim 26 including the step of altering said tabulation
and text typesetting data to incorporate standardized general mark-up
language to provide said text typesetting data as standard SGML data and
said tabulation typesetting data as table SGML data.
30. The method of claim 26 including the steps of:
generating a visual graphic readout from said digital graphic data, said
whatgraph file and said graphic property data for each said graphic
illustration; and
rescanning said graphic illustration in the event of a defect represented
in said visual graphic readout.
31. The method of claim 26 including the steps of:
generating a list of words from said tabulation and text typesetting data;
and
combining said word list in said magnetic media for retention as index data
in said master memory.
32. The method of claim 31 including the steps of:
providing a stop word list of non-essential words; and
removing said non-essential words from said list of words to provide a
refined word list for retention as index data in said master memory.
33. The method of claim 31 including step of associating each said word of
said word list with a select field associated therewith to provide a
unique word list for retention as index data in said mater memory.
34. The method of claim 32 including the step of associating each said word
of said refined word list with a select field associated therewith to
provide a unique refined word list for retention as index data in said
master memory.
35. The method of claim 26 including the steps of:
generating a page visual image from said device independent formatted page
files, said font image defining data, and said digital graphic data; and
applying corrections to in the event of a defect represented in said page
visual image.
36. The method of claim 26 including the steps of:
generating a visual graphic readout from said digital graphic data, said
whatgraph file and said graphic property data for each said graphic
illustration; and
rescanning said graphic illustration in the event of a defect represented
in said visual graphic readout;
generating a page visual image from said device independent formatted page
files, said font image defining data, and said digital graphic data; and
applying corrections to effect alteration of said visual image in the event
of a defect represented in said page visual image.
37. A method for converting graphic illustrations retained in an associated
storage media, and text typesetting data retained in an associated storage
media, into device independent digital form retained in master memory,
comprising the steps of:
identifying the insertion locations within said media retained text
typesetting data for each said graphic illustration and generating a
corresponding whatgraph file;
generating graphic property data representing the graphics size
characteristics of said graphic illustration data;
providing font size data;
providing font image defining data;
formatting said text and tabulation typesetting data, said whatgraph file,
said graphic property data, and said font size data to provide device
independent formatted page files; and
combining said device independent formatted page files, said font image
defining data, and said graphic data for retention in said master memory.
38. The method of claim 37 including the step of collecting and identifying
all files from said media retained text typesetting data corresponding
with each said article of said assemblage and deriving therefrom a
corresponding directory of said files.
39. The method of claim 37 including the step of assigning identifying
numbers to paragraphs of each said article.
40. The method of claim 37 including the step of altering said text
typesetting data to incorporate standardized general make-up language to
provide said text typesetting data as standard SGML data.
41. The method of claim 37 including the steps of:
generating a list of words from said text and tabulation typesetting data;
and
combining said word list for retaining as index data in said master memory.
42. The method of claim 41 including the steps of:
providing the stop word list of non-essential words; and
removing said non-essential words from said list of words to provide a
refined word list for retention as index data in said master memory.
43. The method of claim 41 including the step of associating each said word
of said word list with a select field associated therewith to provide a
unique word list for retention as index data in said master memory.
44. The method of claim 42 including the step of associating each said word
of said refined word list with a select field associated therewith to
provide a unique refined word list for retention as index data in said
master memory.
45. The method of claim 37 including the steps of:
generating a page visual readout from said device independent formatted
page files, said font image defining data, and said graphic data; and
applying corrections to effect alteration of said page visual readout in
the event of a defect in said page visual readout. |
|
|
|
|
Claims  |
|
|
Description  |
|
|
BACKGROUND
As the amount of published information pertaining to science, business,
reference and the like has grown, there has arisen a concomitant
requirement to address the difficulties of space restraints associated
with storing the great bulk of paper materials generated. Additionally,
selective retrieval of documents and articles from these collections has
cebome elaborate. As an example of the significance of storage
requirements, a typical 10 year collection of merely one scientific
journal may amount to 30,000 pages. Thus, costs to a library facility to
carry out conventional binding and shelf storage of significant numbers of
such publications have become expensive. To at least lessen the storage
requirements, resort has been made for providing microfiche copies of the
materials. However, ussers of such storage media find the approach
somewhat unsatisfactory, particularly with respect to the information
search and hard copy aspects.
Investigators have considered and implemented the storage of such
information within the magnetic media of principal computer installations
in conjunction with on-line communication with satellite or subscriber
terminals provided, for example, as the ubiquitous personal computer (PC).
Such systems offer advantages in terms of efficient information storage,
as well as in providing searching techniques which may be interactive with
the user or library patron. However, such advantages are somewhat offset
by the costs of communication between host and satellite terminal, as well
as the extended time factor often associated with host-to-PC interactive
communication of graphical data, and the less than satisfactory visual
quality of the retrieved graphical output.
Over the recent past, optical disk technology has introduced the CD ROM, a
compact device capable of carrying a database, of approximately 600
megabytes of digital data. Accordingly, one such device is capable of
retaining the equivalent of a decade collection of full text and graphics
for a significantly sized scientific journal or the like. Further, because
of the reasonable production costs involved, once a master is produced,
the CD ROMs can be relatively widely distributed for interactive
employment by library patrons or users in the field. High communication
costs are avoided, as well as the excessive delays occasioned in accession
data through communications links while providing high resolution output
on a wide variety of screens and printers.
To effectively implement such high density local memory devices for
retaining these reference materials, however, a practical technique or
system for creating the necessary master from which they are formed is
necessary. A practical commercialization of such technology requires that
the output of complex typesetting systems used by publishers can be
algorithmically translated into a standard format with a minimum of human
intervention. Because such publications conventionally incorporate
graphics, tabulations and, very often, chemical and mathematical symbols
and equations practical techniques also are required for effectively
combining such materials with text in a manner wherein page printouts will
closely reflect the quality of the original publication. Finally, an
effective indexing and searching facility is required of the master
structure to permit adequate access on the part of the local user or
patron of the data retained in the CD ROM in an effective and efficient
interactive manner. For example, Boolean retrieval techniques, searches
conducted by the occurrence of words and strings within specificed fields
and paragraphs are desirable. The outputs at the local terminal must be
adequately presentable both on the screen of a conventional PC monitor and
through a conventional, reasonably priced printer. Thus, searching can be
carried out at this local situs and full text materials can be retrieved
in printed form.
SUMMARY
The present invention is addressed to a system and method for treating data
carrying magnetic tape, as well as published graphics, text and
illustrative materials used in forming extensive publications so they may
be retained in a component storage device such as a small CD ROM. This
conversion is carried out employing a technique permitting the production
of these local memories at practical costs. The economy of the system and
method results from limited operator intervention. In this regard, the
conversion of publisher provided graphic and textual data into digital,
formatted, device independent pages of master memory is almost completely
automated. Once the system has been set-up for converting a particular
publisher's data, the process operator's only task is to assure quality
control.
As a result of the high level of practicality of the instant system and
method, a small CD ROM local memory may be produced to retain, on a
practical basis, substantial amounts of data. For example, the single CD
ROM may contain a typical 10-year collection of a scientific journal which
may amount to 30,000 complex pages. These retained data not only will
include text which is reproducable on the monitors and printers of
conventional personal computers, but also graphics, illustrations and
tables.
Aside from being practical to produce, the local memories created by the
present system and method include extensive searching capabilities and a
high quality output on a variety of the noted "PC" output devices, i.e.
monitors and printers. The local memory index allows the user to jump to
the occurrence of any word or words of the 30,000 pages stored on the CD
ROM disk. This index is created by listing all the words of every article
and addressing each by field, paragraph, and sentence. In addition to the
index, the user may take advantage of the table of contents generated by
the present invention for each article, allowing the user at a monitor to
skip to the appropriate field, heading, or sub-heading as desired.
Further, the user may skip to any graphic illustration referenced in the
text with a single keystroke and then may magnify this grahic illustration
for careful analysis before outputting a hard copy of the article. Since
the page format of the article is frozen by the present invention and not
determined by the output device employed, the quality of the output
obtained will not generally be affected by the user's selection of an
output device. The invention provides an inexpensive solution to the
storage, indexing, and output of complex reference data.
Another feature of the invention is to provide a system for converting
visual illustrations on paper and associated magnetic media retained text
and tabulation typesetting data of a printed publication of articles into
universally outputtable digital data form retained in master memory. This
system includes a graphic locator which determines the insertion location
of the reference within the text and tabulation typesetting data
representing the positioning of visual illustrations. A scanning
arrangement is provided for converting the visual illustrations to digital
graphic data and an arrangement is provided for generating graphic
property data representing a graphic size characteristic data. An
arrangement is made for providing font size data for the text characters
represented by the text typesetting data as well as for providing font
image defining data. A formatting arrangement then is provided which is
responsive to the insertion locations, the text typesetting data, the
graphic property data and the font size data for generating the device
independent formatted page files. Finally, a mastering arrangement is
provided which is responsive to the digital graphic data, the device
independent formatted page files and the font image defining data for
deriving the universally printable data retained within the master memory.
Another feature of the invention provides a method for converting graphic
illustrations on paper and associated magnetic media retained text
typesetting data from which has been derived a printed assemblage of
articles into device independent digital data form retained in master
memory which comprises the steps of identifying the insertion locations
within the magnetic media retained text typesetting data for each graphic
illustration and generating a corresponding whatgraph file. Each graphic
illustration is scanned and there is derived digital graphic data
corresponding therewith and there is generated graphic property data
representing the graphic size characteristics of the digital graphic data.
Font size data are provided and font image defining data are provided
whereupon a formatting is carried out of the text typesetting data, the
whatgraph file, the graphic property data, and the font size data to
provide device independent formatted page files. These device independent
formatted page files are combined along with the font image defining data
and the digital graphic data in magnetic media for retention in the master
memory.
Another feature of the invention is to provide a method for treating the
graphic illustration on paper and associated computer storage media
retained text and tabulation typesetting data from which has been derived
a printed assemblage of articles to provide corresponding device
independent digital data for retention in master memory. This method
includes the steps of identifying the insertion locations within the
magnetic media retained text typesetting data for each graphic
illustration and generating a corresponding whatgraph file. The method
further includes the step of scanning each graphic illustration and
deriving digital graphic data corresponding therewith as well as the step
of generating graphic property data representing the graphic size
characterisitc of the digital graphic data. The tabulation typesetting
data and font size data are formatted to provide device independent
formatted tablution files and there is generated a tabulation video image
from the device independent formatted tabulation files. The method further
includes the step of applying corrections to the directory of files in the
event of a defect represented in the tabulation digital image from the
step of formatting the text typesetting data, the device independent
formatted tabulation files, the whatgraph file, the graphic property data
and font size data to provide device independent formatted page file. The
device independent formatted page files, the font imaging data and the
digital graphic data are combined in magnetic media for retention in
master memory.
Another feature of the invention provides a method for converting graphic
illustrations retained in an associated storage media, and text
typesetting data retained in an associated storage media into device
independent digital form retained in master memory. Steps in the method
include identifying the insertion locations within the media retained text
typesetting data for each graphic illustration and generating a
corresponding whatgraph file. Property data representing size
characteristics also is generated for each graphic illustration from
analysis of the graphic illustration media. Following the provision of
font size data, formatting of the text typesetting data, whatgraph file,
the graphic property data, and the font size data provides device
independent formatted page files. These device independent formatted page
files are combined along with the provided font image defining data and
graphic illustration data for retention in master memory.
Other objects of the invention will, in part, be obvious and will, in part,
appear hereinafter.
The invention, accordingly, comprises the system and method possessing the
construction, combination of elements, and arrangements of parts which are
exemplified in the following description.
For a fuller understanding of the nature and objects of the invention,
reference should be had to the following detailed description taken in
connection with the accompanying drawings.
BRIEF DESCRIPTION OF THE DRAWINGS
FIG. 1 is a broad flow diagram of the system and method of the invention;
FIGS. 2A-2D combine to provide a detailed flow diagram of the invention;
FIG. 3 is an exemplary directory file tree for the invention;
FIG. 4 is an exploded view of the SGML conversion step of the invention;
FIG. 5 is an exploded view of the TeX MACROS step of the invention;
FIG. 6 is an exploded view of the TITLE 2.HDR MACRO;
FIG. 7 is an exploded view of the FONTS.TeX MACRO;
FIG. 8 is an exploded view of the GRAPHICS.TeX MACRO;
FIG. 9 is an exploded view of the SPECCHAR.TeX MACRO;
FIG. 10 is an exploded view of the PLAIN.TeX MACRO;
FIG. 11 is an exploded view of the TITLE.HDR MACRO; and
FIG. 12 is an exploded view of the TBLMACS.TeX MACRO.
DETAILED DESCRIPTION
The instant system and process develops master tapes suitable for creating
conveniently sized local memory such as CD ROMs essentially from two
products generated by a publisher, to wit, the paper text produced by the
publisher and the publisher production tapes used in typesetting the
textual material. While all textual and graphic data of the original work
are retained, a printout generated of any given article or the like, while
resembling the original material, will be a unique recreation thereof
which has been improved with the addition of comprehensive indexing data
to facilitate user searches. The process permits the production of these
master tapes with necessary human intervention, but at a level remaining
practical to an extent permitting reasonably priced wider dissemination of
the resultant CD ROMs or similar local memories.
Referring to FIG. 1, the process and system of the invention are revealed
in general fashion at 10. The chart 10 shows the typeset pages made
available from the publisher at block 12 and the magnetic media such as
tapes carrying the text data employed for typesetting purposes by the
publisher at symbol 14. These materials as provided on tape will vary in
data architecture from publisher to publisher and thus initial treatment
often is necessary due to a lack of standardization. Looking, however, to
the typeset pages made available as represented at block 12, as shown at
line 16, the paper text carrying graphics and some tabular material which
may not be available in magnetic media is SCANned as represented at a
station represented by circle 18. Generally, such SCANning will involve
only those sheets of paper material carrying such graphics and may, for
example, be carried out by a CCITT Group 4 standard type device. The
resultant digitized information now present in magnetic media is then
directed as represented at line 20 to the display of a computer terminal
represented at operational symbol 22. At this station, the operator
"CROPs" each graphics image or the like not otherwise available in the
data from tape 14. This procedure generally will define the properties of
each graphics item, for example, the size of the item and, where
appropriate, the rotation. As a result ofthe activity at station 22, as
represented at line 24 and block 26, data in magnetic medium representing
the noted size os each graphics element has been developed. Similarly, as
represented at line 28 and block 30 the image data representing the
graphics material as developed by the Group 4 device are present in
magnetic media.
Returning to the publisher's magnetic data as described at symbol 14, as
represented at lines 32 and 34 leading to the operation represented at
symbol 36, a determination by supporting computer program is made as to
where graphics materials are intended to be located as represented in the
publisher's typesetting tapes. Generally, these tapes will not contain any
graphics data but will contain some indicia as to the appropriate location
for such graphics. Accordingly, the "WHATGRAPH" program 36 is carried out,
and as represented at line 38, these data so developed are made available
to reference the location of the graphics for use by the activity as
represented at CROPping symbol 22. Thus, a cross-referencing between the
two inputs to the system occurs at this juncture.
Line 34 also is seen to be directed to a computer activity as represented
at symbol 40 wherein the data available in magnetic media from the
publisher are converted to a Standardized General Mark-up Language (SGML).
This conversion is to a format representing a current standard in the
publishing industry for carrying out character, paragraph, reference,
heading, subtitle, and the like definition. Inasmuch as the instant system
is one wherein, for example, as many as 10 years worth of a scientific
journal may be complied upon a single CD ROM, the data retained in
magnetic tape will not always have been developed in accordance with such
standards. The SGML data which are developed will not only carry data
corresponding with the text of any given article but also will retain
corresponding information defining all those tables and the like which
formed a portion of the initial data or information carried by the
publishers typesetting tapes 14.
Because the data developed from the SGML activites as represented at symbol
40 will contain both text data as well as, where available, data defining
tables and equations, these data elements will be treated separately. In
this regard by retaining a capability for developing table and equation
data without resort to scanning procedures as represented at symbol 18, a
substantial savings in memory requirements is realized. Accordingly, as
represented at lines 42, 45, and 44 the equational, tabular, and textual
data are seen to be separated into discrete magnetic files as represented,
respectively, at blocks 47, 46, and 48.
Because the ultimate memory retained materials are intended for a somewhat
universal application or are constituted as "device independent" (DVI), a
well accepted formatting program is employed with the instant system and
method referred to a "TeX." This program, for example, is described in
detail in a volume entitled "The TeXbook" by D. E. Knuth of Stanford
University, published by Addison-Wesley Publishing Company, Reading,
Massachusetts. To employ this particular formatting program, it then
becomes necessary to convert the SGML developed data as represented at
blocks 47, 46, and 48 to a format acceptable for employment in the noted
TeX program. This procedure is represented at lines 49, 50, and 52
emanating, respectively, from the equational, tabular, and textual data
represented at blocks 47, 46, and 48. Lines 49, 50, and 52 are shown
extending to the operation represented at symbol 54 whereupon the
translated data then are submitted to the noted TeX program as represented
at line 56 extending to operation symbol 58. The TeX program 58, in
carrying out the formatting of each page of materials, employs the
graphics size data earlier developed and represented at block 26. This
employment is represented by line 60. Additionally, the TeX program as
represented at 58 used font size data made available in magnetic medium
per election (by macro instructions discussed in detail later herein) of a
font by the producing personnel as represented at block 62. These data are
shown extending to the TeX operation at symbol 58 as represented at line
64. The data thus supplied to the TeX program includes equational
material, tabular material, textual material, font size material, and the
size data for any graphics involved. Thus, formatting can be carried out
and such formatted material then, as represented at line 66, is made
available as a formatted device independent (DVI) page file represented by
block 68. As noted above, however, the material available from the TeX
program as represented at symbol 58 will not contain data representing the
font images, per election, but only size data and further will not contain
the image data for the graphics developed from scanning and the like as
represented at block 30. Accordingly, as represented at line 70, the
graphic image data are made available to the DVI file as at 68 as a
separate component. In similar fashion, the data required for imaging
individual fonts also are made available to the DVI files represented at
block 68. In this regard, the elected font image data are represented at
block 72 and the input to the DVI files represented at block 68 is shown
at line 74. The materials thus generated, then are retained, as
represented at line 76 and symbol 78, in a master tape suitable for
producing a readily disseminated local memory such as a CD ROM. Also
retained on the master tapes as represented at symbol 78 are the index
data which are generated from the SGML files which previously have been
developed as discussed in conjunction with symbol 40. In this regard, the
data are subject to an index generation operation as represented by line
80 and operation symbol 82 labeled "INDEX." The index will include an
alphabetical listing of the unique, refined words in any given article
referenced for instance by field, paragraph, and sentence. These materials
then are submitted to the master tape as represented at line 84. In
similar fashion, it is necessary to provide sufficient data for employment
of the local memory with any of a variety of personal computers and, thus,
necessary data as represented at block 86 entitled "PC SOFTWARE" are
submitted as represented at line 88 to the master tape represented at
symbol 78. It may be observed that the device independent materials now
contained in the master tape 78 will be uniform in generated output from
output device, i.e. monitor or printer, to | | |