Rochester Institute of Technology
RIT Scholar Works
Theses
Thesis/Dissertation Collections
10-1-1992
SGML-based publishing
Erinne G. Cheney
Follow this and additional works at:
http://scholarworks.rit.edu/theses
This Thesis is brought to you for free and open access by the Thesis/Dissertation Collections at RIT Scholar Works. It has been accepted for inclusion in Theses by an authorized administrator of RIT Scholar Works. For more information, please [email protected].
Recommended Citation
School of Printing Management and Sciences
Rochester Institute of Technology
Rochester, New York
Certificate of Approval
Master's Thesis
This is to certify that the Master's Thesis of
ERINNE CHENEY
name of studentWith a major in
_
has been approved by the Thesis Committee as satisfactory
for the thesis requirement for the Master of Science degree
at the convocation of
OCTOBER 1992
date
Thesis Committee:
Frank J. Cost
Thesis Advisor
Marie Freckleton
Graduate Program Coordinator
George H. Ryan
SGML-Based
Publishing
by
Erinne G.
Cheney
Athesisproject submittedinpartial
fulfillment
oftherequirementsfor thedegreeofMasterofScienceinthe
Schoolof
Printing
ManagementandSciencesintheCollegeofGraphic Arts and
Photography
ofthe RochesterInstitute ofTechnology
October 1992
8I\t1PL[ ototcmcnto to authartzo or
dony
pcrmiooion to rcproduoe on RIT
th~;,.1.
T1tle oj
thesl_S
_
- - - -
-lhereby
grant perm1ss10n to the
Wallece Memor1all1brary of
RIT
to reproduce my thes1s 1n whole or 1n part.
Any
reproduct1on w1ll not be for commerc1al use or prof1t.
Date,
_
2.
I_E_r_in_n_e_C_h_e_n_e...
y
----t"prefer to be contected each
ttme a reQuest for reproduct1on 1s mede.
I
can be reached at the follow1ng
~ress.
?
3
Id:~~j==
.dZuLi
~duj~P/~of
Dat,e~
_
3.
Title
ofThesi:~s_
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _hereby
deny
perm1ss1on to the
Walleoe Memor1all1brary of
RIT
to reproduce my thes1s 1n whole or 1n part.
Dateo-
_
TABLE OF
CONTENTS
1.
INTRODUCTION
11.1.
PURPOSE
21.2.
PROCEDURE
32. REVIEW OF THE LITERATURE 3
2.1. THE NEW
PARADIGM
42.1.1. THE
LOGICAL APPROACH
52.1.2. THE
PROCEDURAL
APPROACH 62.2. WHAT IS SGML? 7
2.2.1. DOCUMENT
INTERCHANGE
STANDARD 92.2.2. META-LANGUAGE 10
2.2.3. DATABASE CREATIONTOOL 10
3. SGML-BASED PUBLISHING SYSTEM 11
3.1. DOCUMENTPREPARATION 12
3.1.1. DOCUMENTTYPES AND STRUCTURES 13
3.1.2. DOCUMENTANALYSIS 16
3.1.3. ELEMENTLIST. 18
3.1.4. CREATING A DOCUMENT TYPE DEFINITION 20
3.2. INPUT. 21
3.2.1. CONTEXT-SENSITIVE EDITOR 21
3.2.2. ASCII EDITOR 22
3.2.3. AUTOTAGGING 23
3.2.4. STORAGE 24
3.3. VALIDATION 25
3.4. CREATING OUTPUT 25
3.4.1. BATCH FORMATTER 27
3.4.2. INTERACTIVE FORMATTER 28
3.4.3 DSSSL 29
3.4.4. OUTPUTTYPES 30
TABLE OF
CONTENTS
FIGURE #1 33
FIGURE #2 34
FIGURE#3 35
FIGURE #4 36
FIGURE #5 39
FIGURE #6 43
FIGURE #7 45
FIGURE#8 47
INTRODUCTION
The 1990's will
be
noted as the start oftrueelectronicpublishing. The requirementsfor
electronic
publishing
lie
inits definition:
asystemthathas
theability
tocreate, manage,manipulate,
integrate
text and/or graphical information for subsequent output to avariety
of output devices. With the introductionof TheStandard
GeneralizedMarkup
Language
(ISO
8879)
in1986,
a new era inelectronicpublishing
began. SGML isa metalanguage
thatcanbe
used to createdescriptive markup language definitions
for document
processing
anddocument
interchange. SGML isproving
to be a tool that willenable electronic
publishing
systems toexpand and embrace thefuture.The
future holds
the promise of an idealpublishing
system that: would handleinformation ina
way
that allowsfor themostflexibility
indataretrieval; could providethe easiest editability; could achieve high productivity; should have the
ability
toencodedata sothatitcanbeinterchanged across platforms and
among
applications.The issue of productivity
has
been the most problematic for electronicpublishing
systems. Thecurrentmethodsformanaging, manipulating, and
integrating
information
before
formatting
for
final output are often slow andextremely
complex. To preparebecause
it is oftennecessary
to re-codedocuments
with newformatting
instructionsfor
each type of output. The
Standard Generalized
Markup
Language can be used toprepare
data only
oncefor
storage sothatitcanbe
re-usedand assembledintoavariety
of formats. SGML
may be
the answerfor
some electronicpublishing
applications.Enterprises that are
currently attempting
to produce a variety of publishing productssuch as:
CD's, books,
manuals, on-demand printing, customizeddocuments,
multimedia, and electronic
documents
facemany
bewildering
options that must beexamined to seeif it can
be
applied to their ownpublishing needs.They
need answersaboutthe potential of anSGML-basedsystem thatwill notbetoo difficultorexpensive
toobtain.
1.1. PURPOSE
Thepurposeofthisresearch projectistoinvestigate how data canbecaptured once and
then output in several forms using the Standard Generalized
Markup
Language. Thistopicrepresentsthe definitionoftrueelectronicpublishing: a system that
has
theability
to create, manage, manipulate, integrate text and/or graphical information for subse
quent output to a variety of output devices. Publishers interested in
finding
out ifproject. Peoplewho
have
an interest inelectronicpublishing
issues willbe able to gainan
understanding
of one ofthemostimportant developments
inthisindustry,
SGML.1.2.
PROCEDURE
This investigationwill be
done
by
outlining
adetailed
model for anSGML-based
publishing
system (Figure#1)
using SGML-based
software, andby
studying
SGML-basedpublishing
systems thatarecurrently
being
used incommercial publishing.Arepresentative document will be used to illustrate how data can be prepared so that it can be
enteredintoanSGML-basedsystemonce
for
subsequent outputinavariety
offorms.This project will be evaluated
by
itsability
to give a clear and complete descriptionofanSGML-basedpublishing system.
REVIEWOFTHELITERATURE
The available information on SGML can be seen as pieces of a complex puzzle.
Figuring
out how the pieces connect is not easy. Aninquiring
mindwishing
toknow
what SGML
is,
and how it shouldbe
used,faces
an Olympic-sized challenge. Thebest
involved
inelectronicpublishing
or computerinformation
systems. After this investigation
into
theliterature
itwillbepossibletounderstandSGML andhow itcanbe
usedto
help
create a productivepublishing
system.The
proceeding
is a topicalliterature
review
outlining
theseissues.2.1. THENEW
PARADIGM
There is a new paradigm inthe
industry
regarding
documentprocessing
and electronicpublishing. Instead of
dealing
withdocuments based
upon physicalformats,
we cannow choose to takea
logical
approach:The
variety
of electronic representation canbeviewed as acontinuumfrom thephysical to thelogical. Atone extreme a
simpledocument imageserves as surrogateforthepaperversion.
Attheother extremeisan electronic document
consisting
ofdescriptive
tags andcontent. This latter form allowsmany
kinds
ofinteractionincluding
editing,full-text
retrieval, andre-formatting,
i.e.,
electronicpublishing
(Handley
&Weibel,
1990).Traditionally,
thejob of documentdesigning
was doneby
editors and graphicdesigners
afterthe content wascreated. Theauthor wouldwrite thecontent and theeditor would
act as the liaison between the author and the graphic designer. Due to the
increasing
amount of new technologies that require
documentation,
therehas been
a significantincrease in the amount of published information. The content of
documents
alsohas
use traditional
document
production methods. Publishers began toemploy
a logicalapproach to
document
designing
by
separating
the form from the content before thedata is
processedfor final
output. Nowmany
publishers viewdocuments
ashaving
two
dimensions: logical
andphysical.2.1.1. The Logical Approach
The logical approach to document
processing
frees the document analyst/designerfrom
having
toworry
about how the document will look. This person isspecifically
interested in the content structure. Where a graphic designer must consider the length
of the
document,
the size of the image area, and other physical constraints; the document analyst must consider the
ability
of the information to conform to the specificstructure of a document
ty
pe.By
viewing a document from a logical perspective, adocument analyst analyzes and identities what iocumentis
by
its type: atextbook,
technical manual, or anovel, andthenuses descriptive
markup
topreparethe data.Markup
is the term used to describe codes added toelectronically
preparedtext to define the structure of the text or the format in which it is to appear
based
upon the structure of the content.This
method uses nouns and adjectives todescribe
thedifferent
elementswithin adocument
(Davis
&Waldt,
1987).2.1.2. The
Procedural
ApproachPublishers
whodeal
with ephemeralinformation,
or information thatdoes not requireconstant
editing
orupdating
after it is published, do not need to follow the logicalparadigm. If a
publishing
operation meets the criteria above and alsohas
a documentproduction environment that produces
many
graphical images and merges them withtext on a
desktop
or workstation system, then the traditional procedural methods ofdocument
production are best.Preparing
the data to be published in this environmentis different. The methods most often used are
very
production oriented.They
facilitate
immediate
publishing
needs,which are to getthe information out as soon as possible,by
any
means necessary. The editor inthis environmentmay
use two typesof markup:proofreader's marks to
convey
instructions to the author, and proceduralcopy markup
to
convey
instructions to the artist. A proceduralmarkup language
is used to encodetext with instructions to the output device on how the
document
should appearThe
procedural method views adocument
asbeing
the same as its physical format.The
format
and content are not separate(Barkes,
1991). Inmany publishing
operationstherules
for
creating
the structureofthe contentby
theeditor and artist are procedural;based
upon thefinal
physical appearance. Theprimary
considerationsduring
production are
based
upon thedesired
format. The contentof the documentmustbe
made tofit
thisformat,
whichisusually
thesecondary
considerationduring
production.Theproblem withthis method is thatin eachphase,a document is tied tosets of physi
cal
formatting
requirements for specific output types. If there is a need to re-use thesame content of a
document,
and produce output in a differentform,
then theinformation must
be
re-entered into an application that produces the desired output. This hasbeen
a productivityproblem withsomepublishing
systemsbecausedocuments,
or output
types,
are stored at this level of the production cycle with theformatting
instructions intertwined with the content. It can
be
difficult to editdocuments
at this stagewithout
interfering
with the final physical appearance. It is also difficult to exchangethesame informationfor differentoutputtypes,a
flexibility
thatisneeded inthis age of2.2. WHAT IS
SGML?
As
a culture, wehave only
recently
begun
exploring
electronicdocuments;
wehave
nocollective notionyet of
how
they
ought tolook
orbehave
(Walter,
1992). The objectiveof
SGML
is tofree
markup
from
thedependencies
of theprocessing
to be performed,and toregularizeitsuseformoderntext
processing
applications(Goldfarb,
p.125).ISO 8879 is the
formal
specification created and publishedby
the InternationalStandards Organization
for the use of the Standard GeneralizedMarkup
Language.SGML
facilitates
the logical approach topublishing
information. SGML is ameta-lan-guage that creates descriptive
markup
languages that willprepare databy
providing
astandard
way
ofcoding
structured information. Thedata
that is markedup using
SGML is independent of the input and output
device(s)
being
used. Data that is prepared with SGML reduces the need to recode the data at
every
production step. Thisgives more control overthe use ofinformation.
SGML
looks
at the role of the different elements of adocument
type andidentifies
them with tags (Figure #3). The language can beused to define or
identify
other typesof elements besides text that
may be
a part of a document.SGMLidentifies
all oftheseas a whole.
This
means thatitwouldbe
possibleto treatevery
elementofthecontent asa separateunit, and structurethe
information
of a
document
in almostany
way. SGMLis not
simply
a set ofmarkup
tags. It provides thelanguage,
or morespecifically
thesyntax, for
creating
thelanguage
of the tags(Alexander
&Walter,
1990). This is called adocument
typedefinition (DTD).
A DTD is anASCIItext
file
thatdescribes
thestructure of aclassof
documents,
whatelementsthey
must ormay
contain andtheorder and
frequency
oftheiroccurrenceina document. ADTD isoftenthoughtof as a structuraloutline of a
document,
but itmay
alsobethoughtof as a
language,
a set oftokens(tags),
and a setofrulesthatgovern
how
those tokensmay be
usedtogetherinadocument
(Alexander&Walter,
1990).2.2.1. Document Interchange Standard
With
SGML,
documents can preserve their inherent information regardless of whatchanges might
be
madeintheirpresentation(Castro,
1990). SGMLwasdesigned
tobe
astandard
for
documentinterchange. Documentinterchange is themoving
ofdata acrossdifferent systems.
Today,
publishing systems are often comprised of severaldifferent
types ofplatforms and operating systems. The users are members ofworkgroups that
share
data
through networks, or database management systems. Or in othercasesdata
is
receivedfrom
different sources in variousformats,
and integrated within a publishThe
conversion process that the data types must go through from systemto system canbe
timeconsuming,orinsomecases,impossible.
The
key
to thesuccessfulinterchange
ofdata
withinanenterpriseor
between larger
entitiesis
standards.Standards
arethe
key
toallowing
diverse
systemstoworktogether toprovide asingle corporate
information
flow.Standards
allowforthedesignof systemsthatevolve withoutthelossof
knowledge
andinformation
(Boyd,
1989).Because of these
difficulties
electronicpublishing
systems require document interchange standards that can provide transparent movement of information between all
types of systems and platforms without
diminishing
theintegrity
of the content andstructure of a document. SGML meets these requirements
by
separating
the form fromthecontent ofdocuments.
2.2.2. Meta-Language
SGML is a meta-language notation that can be used to create
languages
thatdescribe
documents,
databases,
and other structured information (Alexander &Walter,
1990).The language is an
internationally
recognized standard used as thebasis
for severalother related document processing standards such as Document
Style Semantics
andSpecification Language
(DSSSL),
Standard Page DescriptionLanguage
(SPDL),
Standard Music Description Language
(SMDL),
andHypermedia/Time-based
Subset (Hytime).
2.2.3. Database Creation Tool
Using
adatabase
tohelp
publishinformation
is nota newprocess. Withrecentdevelop
ments in network configurations it is possible to give
database
systems a larger role inpublishing
data. SGMLcanbe usedto create a structured documentdatabase. The concepts that
form
thefoundation
of SGML are derived from such traditional databaseconcepts as contentencoding, system and application
independence,
and rigorous definition of information
(Gangemi,
1987). A structured database includes fields of informationand their definitions. A document type definition
(DTD)
canbeused to expressthestructure ofa proposed system.
ADTD canbepresented inwrittenformwhich makesiteasier tocommunicatethe pro
poseddesignstructureandother characteristics of adatabase beforeextensive and cost
ly
programming begins(Gangemi,
1987). Because DTD's created with SGML aredevice-independent,
databases do not have to be created based upon a system thatalready
exists. This is anadvantagebecausea database thatis designedwiththehelp
ofSGML can grow with an organization, and the users of the system do not
have
tobe
concerned about the information contained in the database
being
made obsoleteby
future
developments.SGML-BASED PUBLISHING SYSTEM
This outline of an
SGML-based
publishing
system corresponds to the modelillustratedin Figure#1.
The task of
analyzing documents
andcreating
an SGML document type definition isthe
first step
inusingan SGML-basedpublishing
system. Context-sensitiveeditorshelp
writers create
documents
that conform to a DTD. The tagged document is then compiled and validated through an SGML parser. Most SGML software application pack
ages
have
theability
to create andinteractively
parsedocuments.3.1. DOCUMENT PREPARATION
An SGML-based publishing system: receives input from a variety of sources in
many
different
forms;
enters the data into a database as SGML structureddata;
validates thedata
so that it iscorrectly
organized according to aDTD;
and translates the SGMLtagged
data for
formatted output. An SGML-basedpublishing
system consists of:workstationslinked througha databasemanagementsystem,ortoeach otherthrough
a network/serversystem;
SGML
context-sensitiveeditors;conforming SGML/ISO8879
validating
parsers; andanSGML /formatter
interface.Before data
canbe
entered intoanSGML-based
systemthe informationmustbetagged.How the
document
is tagged isbased
upon the particular document type definitionthatwillbe used. It is thejobofthe
document
analystto createDTDs.Creating
adocu
ment type
definition
involvesanalyzing
a representative document type because oneDTDwillbeusedfora whole class ofdocuments.
The analysis involves
creating
a structured model of the information. The model isexamined with the
help
of the authors and editors of a particular documentinorder todetermine
whatelements willbe included inthe DTD.Themodelbecomesthebasis
forthe
DTD,
thereis anidentifiable correlationbetweenthetwo.3.1.1. Document Types andStructures
A document is one possible representationof a
body
of information ina structured orhierarchical form. It can, for
instance,
be an entire databaseofinformation. Mostdocu
ment types
(novel,
manual, electronicbook,
etc.) have a general structure that a readercan
identify
because ofrecognizable textelements suchas: parts, chapters, sections,andparagraphs that are organized in a
hierarchical
structure. Different hierarchical modelscan
be
used toconvey
the relationshipsbetween
the text elements of variousdocument
types.Document model structures are used to
describe
therepresentation of a document. The"actual"
document
iswhat the readers see and use. The visual structure of a documentshould reflect the logical structure of the text that the document realizes
(Southall,
1989).
Structure editing
isbased
on a single structureor set ofrules for a class ofdocuments. This structureis
formally
described and allowsa consistent structure ofthedoc
uments to
be
maintainedaccording
to certain data models such as a tree structure(Vercoustre,
1990).A document thatcan conformtoa tree-structurehasthese characteristics:
Hierarchy. Ranks can
be
assigned totheelements ofthestructure.Containment.
Higher-ranking
elements containlowerranking
elements.Sequence. Elementsof equal rankfollowone anotherinthe text.
These relationships can
be
visually expressedby
a model (Figure#5)
in order tofully
realize thedocumentstructure
(Southall,
1989).The tree-structure is the visualrepresentation ofthe elements of a document.
Making
amodelof a
document
canclarify
many
ofthe complexities ofthe relationshipsbetween
the elements of a structured
document. Figure
#5 is an example of a tree-structuremodel.
The
complexities of the relationships between the various elements are identified.
Many
ofthe components oftheelementscanbe repeatedin differentplaces.A structure model can
convey
information about the document type. Different document types
vary
in the number of elements and the complexlinks
between the elements. For example: the
document
characteristics of a textbook are more numerousthanthoseof a novel. Atextbook contains
lists,
charts, references, anindex,
and maybefootnotes. A novel usually does not contain these elements. A textbook or a technical
manual canhave
long
andpainfully
complextree-structures thataredifficult tofollow.Creating
a model helps the document analyst know what will be thebest way
toprocess the information without
threatening
any
important document characteristics.Some documents cannot conform to a logical structure
very
easily
orthey
cannotconform at all.
Analyzing
informationby
using a tree model can provide thedesigner
with valuable information about how to prepare the data and
apply
thedescriptive
markup
beforeentering
it intothepublishing system.These
advantages anddisadvantages
ofusing
the logical approach tocreating
documents
arebased
uponthoseidentified
by
Nanard(1987)
andFuruta (1988).Advantages:
Identifying
andcreating
structureincreasestheflexibility
andeditability
of adocument.
Authors andeditors are relievedof
any
formatting
of a document.Maintaining
theconsistency
of adocument
type thathasanidentified structure iseasierthanone without anidentifiedstructure.
Linksbetweenvariouselementscanbemaintained moreeasily.
More controlcanbe administered
during
theprocessing
ofdocuments.Disadvantages:
Hierarchically
structureddocuments donotcorrespondto theauthoring
of adocument. Authorsdonot write inthisway
sotrying
toadhere to astructure canbe restrictive.
Documentmodelsare oftentoorigid.Itis difficulttoswitchfrom
model to model or usedifferentcontentsforthesame models.
3.1.2. Document
Analysis
The best way
to understand this process isby
looking
at an example. Avery
simpleDTDwill
be
createdbased
upon adocument
type of which a hypotheticalexample is amanual entitled The
Handbook of
Print Production forDesktop
Publishers. It is to be areference guide for
desktop
publishers to use for print production. This document isalso meanttobe an"active publication"
meaning
that theinformationwillbeconstantly
updated for theintended
users. What form that it willeventually
be output inhas
notbeen
determined
at thistime. Theproposedoutline iscontainedinFigure#4.It is
necessary for
thedocumentanalystto workwith theauthors and theeditorsinthepainstaking
process ofcreating
a tree model. The document analyst can also find outthings about the information from the authors such as the intended audience. The
intended use of the information
by
the editors and the end users is perhaps the mostimportant piece ofknowledge that the document analyst needs toknow. The
intended
use isimportant because the documentanalyst mustknow how
flexible
the DTD needsto be. Here are some possible results of the document analysis of The Handbook
of
PrintProduction
for
Desktop
Publishers:1. Can the thedocument conformtoahierarchicalstructure?
Yes. Thecontentcanfitthe general structureofa textbook.
2. Who is the
Audience?
Anyone
whouses a personal computertoassemble and prepareinformation
thatwillbe
outputinsome publishedform. Thiswillinclude
amajority
ofthepeopleinthegraphicarts andpublishing
industries. The level
ofinformation
thatis requiredby
the audiencewillvary based
uponthe segment.Artistsmay
needto
know
morethana color separator.Apublishing
managermay only
needtohave
introductory
information. Individualsmay
notneedall ofthe
information
thatis outlined.3. Whatarethe
flexibility
requirements?Based onthe
diversity
of needs oftheaudience,the DTD musthave
ahigh degree
offlexibility
inrelationtoinformationretrieval. Because the
document
willbean "activepublication",It is
necessary
to tohave
flexibility
forthemaximum amount ofeditability
aswell.Withthe possible exception of chapters2, 3, 4,
and8in the contentoutline,the
document
willconstantly
beupdatedtoincludenew information.
Based upon these answers, a tree-model structure of the document type can
be
created (Figure #5). This structure represents all of the elements identifiedby
thedocument
analysis, that can be a part of The Handbookof
Print Productionfor
Desktop
Publishers.3.1.3. ElementList
An element description list (Figure
#6)
will provide thedocument
analystwith a clearer description about the relationships and element structure of
a document type. It identifies all necessary and optional elements, and the occurrence
andsequence
for
eachone.The creationof anelementdescription list isnot anecessary
step.
More
experienceddocument
analystscanbe
successful withoutusing
one.Element lists
are useful aids to thedocument
analystwhencreating
a DTD. Anelementdescription list
is aless formal
DTD.Document
typedefinitions
are not as redundant,and require a specific written format. The lists show the elements, and embedded
sub-elements that are
defined
as a model group. Like the elementsthemselves,
themodel
group
consists of one or moreelementnames thatare connectedby
SGMLgroup
sequenceconnectors
(Figure
#6)
which also occurinthe DTD(Bryan, 1988,
p.110).As with a
DTD,
the element list also qualifies the use of each embedded sub-element.These are called occurrence indicators (Figure #6). Connectors and indicators can be
used together.
However,
whendetermining
the effect of occurrence indicators andgroup
connectors it is important to realize that occurrence indicators have a higherprecedence than connectors
(Bryan,
p.111). The occurrence indicators and operatorsrepresent all of the document possibilities.
They
determine what can and cannotbe
apart ofa documentthatconforms to a particular DTD. Element listsare a good
thing
togivetoauthors and editors tocheck offonbeforethe actualDTDiswritten.
3.1.4. C
eating
ADocument Type
DefinitionOnce
adocument
typeisidentified,
figuring
outthepossible elements thatmake upthedocumenttype and whichshouldbe included in theDTDis nottrivial.
Creating
a DTDinvolves
taking
theinformation
accumulated from the document analysis and thenapplying
the rules forconstructing
anSGML DTD as mandatedby
ISO8879. The DTDinthis example
(Fig
#7)
contains shortened versions ofthe names oftheelements inthetree-model and element list. The names are not minimized to the extent that
they
couldn'tbe understood
by
a human.The DTD that was created using the hypothetical manual The Handbook
of
PrintProductionfor
Desktop
Publishers as a representative document type is a simpleDTD,
and does not contain all of the features that are allowed
by
ISO 8879. A DTD that issuccessfully
parsed should not be confused with a good DTD. A DTD that is wellconstructed andsuccessfully parsed must alsohavethe
ability
toprovide away
for theintendeduses of the information toberealized.If theDTDisnot
flexible
enoughtoproduce avarietyofdocument
instances,
thenitis not avery goodDTD.3.2. INPUT
Data can
be input into
the systemby
eitherkeying
text in through a context-sensitiveeditor,
by inserting
tagsinto
anASCIIfile
thatwaseitherkeyedorscanned, orby
translating
anincompatible file
into SGML (autotagging). Manual re-keying or opticallyscanning
textfrom hard copy
is not uncommon. This introduces alarge
amount ofhumanand machine error. Alotofvaluable timemustbespent
proofing
the results. Ifapublisher were to provide the means for authors to submit text as SGML tagged files
via an
easy
to use,WYSIWYG,
context-sensitive editor, then onestep
in the inputprocess could
be
eliminated, thusincreasing
productivity.When first
starting
up an SGML-based system itmay
be necessary to translateinformation that exists in another file format or language into SGML. There are an
increasing
number of these sophisticated software applications that cando
thistranslationprocess.
3.2.1.
Context-Sensitive
Editor
The
best way
to create anSGML
taggeddocument
is to use anSGML context-sensitiveeditor. An
SGML
context-sensitive editor is a software application that allows the userto create
documents
that conformto aDTD.They
are similarto wordprocessing
applications except that the emphasis in
creating
documents is placed upon the contentstructure rather than physical appearance. The application createsthe tags basedupon
the DTD chosen
by
the user, and theninserts the tags.Many
of these software applications can validate
documents
asthey
are created. Aswellasbeing
usedto createdocuments, SGMLeditors canbeusedto inserttags into a documentthatwas not
originally
createdwith an SGML editor. This is done
by flowing
in the textfrom a document created
by
anASCIIeditor intoanSGMLeditor.3.2.2. ASCII Editor
Most computer platforms have the
ability
to create an ASCII file. The wordprocessing
software applications that run on various operating systems can output an ASCII
file.
Creating
SGML documents with an ASCII editor isvery
tediousbecause
the tags areinserted
by
the user character-by-character. This increases thepossibility
of errors tooccurwhen thedocumentis parsed.
Using
anASCIIeditor canbe
a complexing
because
the user needs to understand the structure of a DTD inorder to create anSGML
taggeddocument.
If aDTD is
extremely
complex, then it wouldbe virtually
impossible
to use anASCII
editortostructure andtag
documents.
The
only
reason to use an ASCII editor to create SGML taggeddocuments
is becausethey
arewidely
available and used.Using
anASCIIeditor insteadof anSGMLcontext-sensitive editor to structure and
tag documents,
is similar towriting
a postscript programto
layout
a pageinsteadofusing
a pagelayout software application.3.2.3.
Autotagging
Using
a translation software package is the bestway
to convert adocument
that wasnot prepared through an SGML editor. This procedure is called autotagging. Most of
these conversion software applications do what is called a "context translation". The
file
that wasoriginally
created using a wordprocessing
application is scanned forpatterns that represent the elements that correspond to a DTD. The application then
generates and inserts the tags into the document based upon the
first
assessment. Thenewly
tagged documentis
thenpassed througha parserthatis a part ofthe conversionprogram. Not all documents that were created with a word processor can
be
convertedin this way. The structure of incompatible documents must correspond to a DTD
because
theautotagging
that takes place isbased
upon the documents
ability
to conform
to a structure. If adocument
created in a wordprocessing
application does notconform to a structure, then it must
be
edited and the conversion program mustbe
employedwith a certain amountof
human
intervention. So inmany
casesautotagging
isnot
completely
automatic.3.2.4.
Storage
The
document database
is at the center of an SGML-basedpublishing
system(Figure#8). As with
many large
enterprises a database management system is used tohelp
facilitate
theproductivity
ofbusiness
operations.By integrating
managementoperations with
publishing
operations intoone database system, theprocessof publishing
information is helped tremendously. SGML can be used to create the structure ofthe integrated database system which will provide
security for
privatefiles,
manageprotocol,andtrack theprogressofdocuments.
An integrated database management system can be used to manage all of the steps in
the creation of SGML tagged documents. It can be used to direct all operations:
from
the collaborationofthedocument analysis andthe authoringof
documents,
to thefinal
preparationforoutput.
3.3.
VALIDATION
An
SGML
parseris
a softwareprogramused tovalidate DTD's anddocument
instances(SGML document). SGML
parsers canhelp
document analysts create DTDs thatconform to
ISO
8879. The parser worksby
examining
the logical elements of adocument instance
and makes certain that the elements correspond to the associatedDTD.
They
create a report onany
designflaws,
or the misuse ofany features
ofSGMLthat are considered errors
by
ISO 8879. There aremany
parsers available that can beused to work
interactively
with anSGML editor to create and validate documents andDTDs.Interactiveparsers
help
create a more productiveinputprocess.SGML parsers also gives information to the formatter which prepares the SGML
documents for output. How this is done is an implementation decision made
by
thepublishing
systemdesigner.3.4. CREATING OUTPUT
A DTDis a list ofingredients foronetypeofa document. A formatter can
be
thoughtofas something that assembles the various ingredients together to create a document. A
formatter is needed because SGML documents
do
not containany
information abouthow
they
shouldbe
processed, this information must be added. A formatter is aprogram that prepares
documents
created in SGML to be outputby
any
device. Theformatter
gathers thedata
that it is sentfrom
the parser, and converts that informationinto
theformat
requiredby
theoutputdevice. Theparser tells theformatter
whichDTDthe
document
conformsto,
and therefore what tags to look for in the conversionprocess. The parser also supplies the formatter with important structural information
about the
document
thatmustbemaintained.The conversion is
based
on pattern matching. Anexample of how thismay
workis asfollows: The SGML
document
is scanned to locate the patterns of the tags andfind
amatch to the tags that are contained in a
look-up
table. The matched tags are replacedby
theformatting
codes that correspond to a particular type of output devices. Thelook-up
table corresponds toonly
one document type definition. The table containstags in one column and corresponding
formatting
codes in an adjacent column. Thismeans that
for every
DTDthere is auniquelook-up
table foreach outputtype.The ways that formatters are built generally depends on the types of output that are
required ofapublishingsystem.
3.4.1.
BatchFormatter
Many
publishing
operationsonly
produce one type ofdocument
and output, such as:printed manuals,
journals,
ortextbooks. Apublishing
operation thatproduces one typeofoutput would
only
use afew
DTDs. Abatch composition system would be used tocreate pages
for
mass reproduction.Documents
are retrieved from some type ofstorage, parsed,
formatted,
composedand paginated forthe output device. Aformatter
built for
this process isvirtually hard-wired
to a particular DTD which affordsno
flexibility.
One benefit
ofbuilding
aformatter
in thisway
is that the composition isvery fast
because there are no intermediate steps in the conversion process. There is no human
intervention to get through the various
processing
steps because thereisonly
onefinal
physical format
for
all documents. This is appropriate for publishers that produce onetype of output such as printed
books,
and on-demand publishing.However,
if apublishing
system contained a formatter built in thisway
and wanted to expand andprocess documents created from different
DTDs,
then a new formatter wouldhave
tobe
configured.3.4.2.
Interactive Formatter
In Figure
#1 theSGML-based publishing
model shows a possiblepublishing
systemthatproduces a
variety
ofoutput with aformatter forevery
type of output.Thissectionof the model represents a
different
conversion for each DTD for each type of output,evenif the output is similar. For example: Two printed versions of a
document
having
different
styles would require unique conversions. As statedpreviously
the conversionis via a
look-up
table for each DTD for each type of output. In order to have moreflexibility
and control over theinformation,
it isnecessary
to create more controlbetweentheSGML documentand the conversionprocess. This controlis usually inthe
form
of another software program that canbe used to give additionalformatting
information to an SGML document. Instead of hard wiring a DTD to a particular set of
formatting
instructions,
this type of software program allows for the creationof
"on-the-fly"
look-up
tables for SGML documents that go througha formatter. This
has
proven to benecessary
because each instance of a DTDmay
require special processing that is not covered
by
the hard-wiredlook-up
table. Thesesoftware programs enhance the conversionprocess, andcan make a
publishing
systemthat produces
many
different types of output more productive thancreating
ahard
wiredformatterforeachtypeofoutput.
3.4.3.
DSSSLThe
Document
Style
Semantic
andSpecification Language
is adraft
ISO standard thatwill allow
formatting
information
to beinterchanged between
various output systems.
The objectives of the
DSSSL
standard is to provide aformal
and precise means ofexpressing
thefull
rangeofdocument
productionspecifications requiredby
the graphicarts
industry
(Adler,
1989). Insteadof usinga conversion software application thatdoes
"on-the-fly"
conversions, DSSSL canbe used toattach
formatting
instructions toSGMLdocuments
so thatthey
can stillbe
outputby
any formatter
andoutputdevice
withoutaltering
theresults.IntheSGML-based
publishing
system modelinFigure#1,
thereare typesof outputthatproduce the same result: printed pages. There isa
different
conversion foreachoftheseoutputs even though
they
will all produce printed pages. DSSSL could be used toattach
processing
informationto an SGML document so thatit couldbeformatted
andoutput through a copier, a postscript device or a typesetter and still produce a
document
thatlooksthe same. DSSSLusedinthisway
woulddrive
theformatter,
but
itcould
be
usedto giveinformation toanotherlanguage thatwillformat
theSGML docu
ment, for
instance,
in the creation of a hypertext or multimediadocument.
DSSSLwould
fit
wellintoapublishing
systemthatproduces avariety
ofpublished products.3.4.4.
OutputTypes
Figure
#1 contains the typesofoutput thatcanbe
possible for anSGML-based publishing
system.They
representthemostpopular options usedby
commercialand corporatepublishers today.
Creating
multimedia and hypertextdocuments
involvesworking
with
diverse datatypes
and formats. People thatcreate these types ofdocuments
needto
have data
sources that canbe
integrated and output from one source. On-demandand custom
publishing
products arebeing
produced at a rate asfast
astraditionally
printed products. The
increasing
number of publishersthatprovide on-lineinformationservices reflectthedesire of consumerstogetinformationas theneedforitarises.
As
previously
statedSGMLcanbe used tocreate structured databases. OneofthewaysthatSGML is
currently
being
used is in the creationof CD-ROM products. Theselarge
databases contained in a small package can be structured using SGML. The value of
information stored on a CD-ROM is that it allows the end-user to search through the
data
ina hypertextmanner.SGML creates a generic source of information that will outlast the output system. As
new output systems are developed the same source of SGML tagged
data
canbe usedto produce new output types. So when a revolutionary new type ofoutput is made
available, for
example voice synthesis, apublishing
systemthatstores its data in SGMLwill
be
able toproduceavoice synthesiswithoutdoing
costly
conversions.CONCLUSION
An enterprise that publishes a
large
amount of information thathas
along
life andmust be
constantly
edited shouldseriously
consider using an SGML-based system.SGML
canbe
usedfor
operations thatreceiveinput frommany different
sources andindifferent formats.
Publishers
have
a larger number ofpublishing
products to offer to their customers.How
they
goabout thisoperation canbevery
difficult andcomplex. Themost economical and productive
way
wouldbe
to have one generic source ofdata that canbe
usedby
all of the output systems that apublishing
operationmay
have.Having
one sourceof
data
sharedby
many
connected users, on divergentoperating
systems, and atdifferent locations througha network database system can increase the
productivity
oftheeditorial functionsof apublishingoperation.
The
key
tousing
SGML to output one source ofinformation in avariety
offorms
is tospend the time and effort to prepare the input data.
Taking
thelogical
approach topublishing instead of a procedural approach can open
up
new possibilitiesfor
apublishing
enterprise.The logical
approachfrees
apublishing
system frombeing
dependent
onany
one particular type of output that is produced. Apublishing
enterprisethat wantsto
be
productive and competitive can nolonger use a system thatis
dependent
on one typeof output. Asystem of this typecannotbe easily
adaptedforfuture file
formats
and output. SGMLfacilitates
thelogical approachby
preparing
datain a
way
that will allow it to beproductively
edited, retrieved, and published in avariety
offorms.A practical
way
to viewSGML
isfrom
the point of view of the problem to which itis applied (Alexander &
Walter,
1990). SGML can be used to implement a newmethodology
and solve the problems that exist in publishing: heterogeneousenvironments, data interchange and
integration,
andproducing
different types ofoutput
from
a single source of information. These problems were theforce behind
thecreation of
SGML,
not tobring
abouttheoften predicted paperlesssociety,orto answerall ofthe problemsinelectronic publishing.SGML is not a magical solutionto integrate
all ofthe wonders of
technology
that canbe a part ofan idealpublishing
system. It isan option that an organization can choose to
help
create productivepublishing
andinformationsystems.
FIGURE
#1
LU
I-(0
>
CO
CD
Z
X
CO
CO
3
a.
Q
LU
CO
<
CO
o
CO
O UJ a oc 0.s
>. I-c CD E 3 O "I
o I- I
Q Q
I
^
I
c CD CO I TJ 1 DCLU I <:<
t
Q QLLI LU 1
<
2
^r
cc
W
->o
LL
ill
FIGURE
#2
PROCEDERAL
MARKUP
SGML
PUBLISHING SYSTEM
16/18 Palatino Bold Caps
INTRODUCTION
1416 Palatino Bold CapsThe
1990's
will be noted as the start of true electronic\publishing. The requirements for electronic
publishing
lie
in itsdefinition:
a systemthathastheability
tocreate,manage, manipulate, integrate text and/or graphical
informationforsubsequent outputtoavarietyof output
devices. The
ability
to encode data so that it can beinterchangedacross platforms andbetweenapplications.
The issueof
productivity has
beenthemostproblematicfor electronic
publishing
systems. Thecurrentmethodsformanaging, manipulating,and
integrating
information/
List Item
'
List Item
List Item
ListItem
110/12 Palatino Romanupper/lower
10/12 Palatino RomanCaps
FIGURE
#3
SGML/
DESCRIPTIVE
MARKUP
<ChapterTitle>
SGML PUBLISHING
SYSTEM<SectionTitle> 1.
INTRODUCTION
<Paragraph> The 1990's will
be
noted as the start of true electronicpublishing. Therequirements for electronic
publishing
lie
in its definition:a system thathastheability
to create,manage, manipulate, integrate text and/or graphical
information forsubsequent outputto a
variety
of outputdevices. The
ability
to encode data so that it can beinterchangedacross platforms andbetweenapplications.
Theissueof
productivity
has been themost problematicforelectronicpublishingsystems. The current methods
formanaging, manipulating,and
integrating
information<ListItem> List Item
List Item
List Item List Item
FIGURE
#4
THE
HANDBOOK
OF
PRODUCTION FOR
DESKTOPPUBLISHERS
(OUTLINE)
1.
Models
fordesktop
publishing-printing
systemsA.
Postscript
B.
System
Components, including
hardware
and softwareC.
Network
Architectures
D.
Workflows
and material flows2.
What
printing
process willbe
used?A. Run
length,
image quality
requirements, sizes, colors B.Lithography,
flexography,
gravure, screenC.
Process
specific requirementsD.
Electronic
processesE.
Decision
tree3.
Planning
for thebindery
A.
Binding
methodsB.
Page
sizesC.
Signatures
D.
Sheetfed,
half web, full web signature sizesE. Impositions
F. Compensation for material behavior
4. Paper and Ink
A. Materials used in different processes
B. How design must accommodate choice of materials
C.
Recycling
andprinting
on recycled materialsD.
Health andsafety
5. Film and Plates
A.
Film requirements for plate makingB.
Reading
orientationC.
Tonal orientationD.
Spreads,
chokes, andtrapping
E.
Stripping
requirements F. ImagesettersG.
Basic
principles of scan-conversion H. Computer-to-plateType
A.
Adobe Type
1
format
B.
TrueType
C.
Other
type
formats
D.
Effects
ofprinting
process characteristics on typereproduction
Input
scanning
A.
Types
of scannersB.
Linear
spatial resolutionC. Pixel
depth
and noise (tonal resolution)D.
Scanning
resolutionrequirements for given output res.
E.
Ideal
copy
forscanning
F.
Raster
to vector conversion for lineart
G.
Data
compression8.
Tone
and color reproductionA.
Synthetic
color BPhotographic
colorC.
Spot
and process colorD.
Color
gamuts forprinting
processesE.
RGB
toCMYK
F.
HSB
color modelG.
Pantone
colorH.
Focoltone
colorI. Halftone reproduction
J. Color separations
K.
Screening
technologiesL. DPI of imagesetter vs. LPI of halftone
M.
Proofing
technologies9. The right tools for the job
A.
Print quality
capabilities of processesB.
Register,
image
structure, tone & color res. capabilitiesC.
Imagesetter requirements determinedby
printexpectations
D. Imagesetter types
E.
Page layout programsF.
Hardware/software platformsG.
Storage
options and requirementsH.
Network structure requirements10.
Working
with a service bureauA. File structures and interchange formats
B. Production processes that work
C.
Standard
proceduresD.
Checklists
11.
Training
issues
A.
Teaching
desktop
publishers about printing B.Information
resourcesC.
Methods
D.
Communication
E.
Software-based
self-trainingAnnotated index
Appendices
1. Guide to file formats
*
FIGURE
#5
o o CO H X uj H c "S-.* a 3 -ss 2 c 6 o U > -i o 5 SO.2 o u U 3 Q 3 C 60 S."> o fl u Si 3 u o a. o "3 U X ^ .5! 2. ^ Q Z a* ^ Q oo 3 : o U c o 39Figure basedontextbook tree-structure inSGML-An
FIGURE
#5
CONT
*-^s a a. o <&. o w 'f H 3 x 1 Hi
c "C a. .* O O c a:$
2^ 1?3
>
x> 3 ;/) .4 3 1/1 > 5 oI
o a 3H ji 3$
J! xi 3H [/> X. 3 t/5 o -* s c ft. HFigure basedontextbook tree-structure
to SGML An Author' Guideto theStandard GeneralizedMarkupUngwigc
FIGURE
#5
CONT
o o CO ' H a. "C B. O o a c 5 o c 9 60 53 6b-o u1
a |3 ai OS x ex. 6b-"8 . e x: 60 C 'll-(/I -x 01 a.. f2 - s OS x: en a.. <v 6b S 2 .2 * aft. a* S3 2 a. X 3 H > c s "5. c
->!
>Figure basedontextbook tree-structure inSGML'
AnAuthor'sGuuie to theStandardGeneraiizea MarkupLanguage
FIGURE
#5
CONT
Figure basedontextbooktree-structure inSGML An AuthorjGuideto theStandard
GeneralizedMarkupLanguage
FIGURE
#6
ELEMENT
LIST
SGML
Group
Sequence Connectors:, All must occur, in the order specified.
& All must occur, in any order.
I
One and only one must occur.OccurrenceIndicators:
+ One or more can occur, but at least one must occur. ? Zero or one can occur.
Zero or many may occur.
Textbook=Front matter, body, back matter
Front Matter=Half title page, title page, title verso, table of contents, preliminary text,
illustration and photo list?
Half Title Page=Title&authors&
Title=#PCDATA Author=#PCDATA Title Page=title&authors&edition&editor&publisher Authors=#PCDATA Edltlon=#PCDATA Editor=#PCDATA Publlsher=#PCDATA
Title Verso=Publisher details, copyright details
Publisher Details=Name, location
Name=#PCDATA
Locatlon=#PCDATA
Copyright Details=Date of publication, copyright holder, copyright declaration, colophon?,
cataloging details, ISBN #
Dateof Publicatlon=#PCDATA
Copyright Holder=#PCDATA
Copyright Declaration=#PCDATA
Colophon=Printing
details, other contributorsPrinting
Details=Name, locationOtherContributors=typesetter, designer, illustrator, researchers
Typesetter=#PCDATA Deslgner=#PCDATA lllustrator=#PCDATA Researchers=#PCDATA
Cataloging
Details=Library
of Congress #Library
ofCongress=#PCDATAISBN #=#PCDATA
Table of Contents=#PC DATA
Preliminary
Text=Forward?, Preface?, Introduction, Acknowledgments?Forward=#PCDATA
Preface=#PCDATA
FIGURE
#6
continued
lntroduction=#PCDATA
Acknowledgments=#PCDATA
Illustration and Photo
Llst=#PCDATA
Body=Part*,
chapter of text +Part=Part title,
introductory
textPart
Tltle=#PCDATA
E?fUt101?
T,ext=Topics*,
(Paragraph+l
notes*), lists*, figures*,footnotes*
Topics=Topic headings Topic
Headlngs=Paragraph+
mlKM?Jih^#0PwCDATA|Hi9hli9htedPhrases*, elements in
paragraphs*|references|
lists-Highlighted
Phrases=#PCDATA
Exea^ep7eSs-n#PCDATAhS=EXamPleS*'indiVid^ ''neS f
,ex,*ltables'l
graph"'cs*|equations*
Individual Lines of Text=#PCDATA
Tables=#PCDATA Graphics=EMPTY Equatlons=#PCDATA
Notes=#PCDATA
Lists=ltemized
lists*)
glossaries*! list of items*Flgures=EMPTY
Footnotes=#PCDATA
Chapter of Text=Chapter title, main text, sections-Chapter Title=#PCDATA
Main
Text=Topics*,
footnotes* special text elements*,(paragraph+|
notes*), lists*, figures*,Sectlon=Section title, section text, subsection
Section Title=#PCDATA
Sectionfootnotes*Text=Topics*. special text elements*,
(paragraph+|
notes*), lists*, figures*,Subsection=Subsection title*, subsection text*, sub-subsection*
Subsection Tltle=#PCDATA
Subsection Text=Topics*, special text elements*,
(paragraph+|
notes*), lists", figures*,footnotes*
Sub-subsection=Title for lowest level*, text for lowest level*
Title for Lowest Level=#PCDATA
Text for Lowest Level=Topics*, special text elements*,
(paragraph+|
notes*), lists*,figures*, footnotes*
Back Matter=Appendices*, bibliographies+, index?
Appendices=Topics*,
special text elements",(paragraph+|
notes*), lists", figures*,footnotes*
Bibliographles=#PCDATA|Highlighted phrases", elements in paragraphs*|references| lists*
Annotated lndex=#PCDATA|Highlighted phrases*, elements in paragraphs*|references|
lists*
FIGURE
#7
DOCUMENT
TYPE
DEFINITION
<!DOCTYPE textbook For
Desktop
Publishers'><!ELEMENT textbook <!ELEMENT front
SYSTEM 'dtd of The Handbook of Print Production
<!ELEMENT
htitlep
<!ELEMENT title <!ELEMENT authors <!ELEMENT titlep <!ELEMENT edition <!ELEMENT editor <!ELEMENT pub <!ELEMENT pubdet <!ELEMENT name <!ELEMENT location <!ELEMENT copydet <!ELEMENT date <!ELEMENT holder <!ELEMENT decl <!ELEMENT colophon <!ELEMENT printdet <!ELEMENT othercon <!ELEMENT typesetter <!ELEMENT designer <!ELEMENT illustrator <!ELEMENT researcher <!ELEMENT catalogdet <!ELEMENT loc <!ELEMENT ISBN <!ELEMENT toe <!ELEMENT prelimtxt <!ELEMENT forward <!ELEMENT preface <!ELEMENT introduc <!ELEMENT acknowledg <!ELEMENT illusli <!ELEMENT photoli <!ELEMENTbody
<!ELEMENT part <!ELEMENT partitle<!ELEMENT (introtxt
|
text:!ELEMENT topics
-(front, body, back)>
--(htitlep, titlep, titlev, toe, prelimtxt,
illusli?, photoli?>
--
(title-*-& authors?)>
-- (#PCDATA)> - (#PCDATA)> - (title &
authors & edition & editor"
& pub)> -(#PCDATA)> - (#PCDATA)> -(pubdet, copydet)> -(name+, location+)> --(#PCDATA)> - (#PCDATA)>
-(date, holder, decl, colophon?, catalogdet, ISBN)> -- (#PCDATA)> - (#PCDATA)> - (#PCDATA)> --(printdet, othercon?)> -(name, iocation)>
- (typesetter & designer & illustrator &
researcher*)> -- (#PCDATA)> -- (#PCDATA)> - (#PCDATA)> - (#PCDATA)> - (loc)> - (#PCDATA)> -- (#PCDATA)> -- (#PCDATA)>
-(forward?, preface?, introduc, acknowledg?)> - (#PCDATA)> -- (#PCDATA)> -- (#PCDATA)> -- (#PCDATA)> -- (#PCDATA)> -- (#PCDATA)> -- (part*, chapte*)> -- (partitle, introtxt)>
-- (#PCDATA)>
|
sectxt|
subtxt|
subsubtxt) - (topics*, para+|
(notes, list,fig, fnotes)*>
-- (topichead*, para+)>
FIGURE
#7
continued <!ELEMENT topichead <!ELEMENT para <!ELEMENT phrases <!ELEMENT elements <!ELEMENT examples <!ELEMENT txtlines <!ELEMENT graphic <!ELEMENT equations <!ELEMENT table <!ELEMENT quotes <!ELEMENT ref <!ELEMENT list <!ELEMENT itemli <!ELEMENT glossary <!ELEMENT listi <!ELEMENT notes <!ELEMENTf>g
<!ELEMENT fnotes <!ELEMENT chapter <!ELEMENT chaptitle <!ELEMENT section <!ELEMENT subsec <!ELEMENT subtitle <!ELEMENT subsubsec <!ELEMENT subsubtitle <!ELEMENT backmatter <!ELEMENT appendices <!ELEMENTbibliography
<!ELEMENT index <!ELEMENT annotated --(#PCDATA)>-(#PCDATA
|
(phrases, elements, quotes)*|
ref*|
list*)> -(#PCDATA)>--(examples
|
txtlines|
graphiclequations)*
|
table?)> -(#PCDATA)> --(#PCDATA)> -- EMPYT)> -(#PCDATA)> - EMPTY)> - (#PCDATA)> - (#PCDATA)> -- (itemli|
glossary|
listi)*)> -- (#PCDATA)>- (#PCDATA &
phrases)> -- (#PCDATA)> -- (#PCDATA)> - EMPTY)> -- (#PCDATA)>
--(chaptitle, text , section)+>
- (#PCDATA)>
-(sectitle+, sectxt+, subsec*)>
-(subtitle-*-, subtxt+, subsubsect*)>
-- (#PCDATA)>
-(subsubtitle+, subsubtxt+)>
-- (#PCDATA)>
-(appendices*, bibliography+, index?)> - (topics*,
FIGURE
#8
WO^m ^^M
Q Q
1
LU LU
i
S
S
i
LU
t
1
Q. 3
!
X
2
!
4<
5
*
cr
O
1
O
H
1
O
!
Q
LU LL
I
BIBLIOGRAPHY
Adler,
Sharon
C,
(1989). DSSSL:
Document
style semantics and
specification
languange.
TAG
The
SGML
Newsletter, 8,
pp.8-9.
Alexander,
G. &
Walter,
M.,
(1990).
A
fresh
look
atSGML:
theconventional wisdom changes.
The Seybold Report
onPublishing
Systems, 20(7),
pp.3-16.
Barkes,
Kevin
G.,
(1991).
Substance
overform:
breaking
down
thebarriers between
the printed page and electronic media.DEC
Professional,
20(10,)
pp.34-38.
Boyd, Barbara,
(1989,
July).DEC's
technicalauthoring
environment. TAG TheSGML
Newsletter,
10,
pp. 5-7.Bryan,
Martin,
(1988).
SGML: AnAuthors Guide
to the StandardGeneralized
Markup
Language.Cheltenham,
UK: AddisonWesley
Publishing
Co.,
Inc.Castro, Christine, (1991,
April).
The
recipefor
document
interchange.
Computer
Publishing
Magazine,
pp.26-31.
Cruz,
G.C. &
Judd,
T.H.,
(1990).
The
role of adescriptive
markup
language
in the creation ofinteractive
multimediadocuments for
customized electronicdelivery. EP90:Proceedings
of
theInternational
Conference
onElectronic
Publishing,
R. Furuta
(Ed.)
VI
10.New York:
Cambridge
University
Press.
Date, C.J.,
(1986). An introduction todatabase
systems, Volume1,
4th Ed. Reading:Addison-Wesley Publishing
Company.Davis,
W. &Waldt, D., (1987,
May/June). An SGML overview. TAG The SGMLNewsletter, 2(1),
pp. 4-5.Exoterica Corporation. (1991). XGML OminMark Sampler.
Ottawa,
Ontario:Exoterica Corporation
Furuta, Richard,
(1989). Concepts and models for structured documents.Structured
Documents,
Andre,
Furuta & Quint(Eds.)
Cambridge: Press Syndicate.
Furuta, R., Quint, V., Andre, J.,
(1989,
April).Interactively
editing
structureddocument. Electronic
Publishing, 1(1),
pp.65-76.