American
Journal
of
C
ompat
at
ional
Linguistics
Mi crofi the 30 3R E V I E W
C O M P U T E R S
I N
T H E
H U M A N I T I E S
J. L.
MITCHELL,
E D I T O R
U n i v e r s i t y o f ~ i n n e s o t a
Edinburgh University Press
University
ofMinnesota
PressMinneapolis
1974Note: The t a b l e of c o n t e n t s appears i n AJCZ, Microfiche 1 4 ,
frames 6 7 - 6 9 , and summaries of several contributions appear on the same f i c h e .
REVIEWED B Y E D I T H J . HOLS AND DAN BURROWS
U n i v e r s i t y o f ~ i n n e s o t a , D u l u t h 5 5 8 1 2
Computers
in
theHumanities is
a s e l e c t i ~ n of papers from115
presented at a conference on computers and the humanities,1973, at the University o f Minnesota in Minneapolis I t s value
is both as a bank of ideas and a s a cross section o f t h e very
broad
area
a t theconfluence
of those two disciplinesThe
va-r i e t y of interests d i s p l a y e d here is an indication of the consi-
arable breadth of opportunity f o r further exploration Its e d i -
t o r hopes it will be "appropriate f o r use in such couraes in
'Computers and the Humanities' as are now found in many m a j o r
Review:
Computersin
theHumanities
course
itshould
serve
r a t h e r w e l lCertainly
i t has littlecompetition
Scholars in
thehumanities
are only
beginning
t o seethe
computer
as the greatestthing
since t h einvention
(discovery?)of
the
s t y l u s ,
and
recent
extensions
i n t o
languages
more
agree-able
to the soft sciences and to the artshave
made more a t t r a c -t i v e a
continuing
growthon
several fronts. Concordances andindexes
are
growlng in
numberand in informatory powers,
l a r g ebanks
of textsare being
created,
and
newand
imaginative usesof
such
stores
are being attemptedMore
sophisticated methodsof
analysis
a r e being devised,and
programs are being developedwhich
perform
increasingly complex tasksMuch
of thiswork
needs
to be done only once, s o i n order that efforts not be du-plicated it is urgent that information on
work
completed or i nprogress
be p u b l i c i z e d .The
Minnesota conference and t h e bookwhich has came from i t share some t h i n g s t h a t have been done
The
collection
i s a mixed bagin
more than
one sense. Dis-ciplines
representedinclude
music, a r t , archaeology, literaryanalysis, dialectology, language history, l e x i c o g r a p h y , and Roman
history Papers v a r y
in
l e n g t h and in readabilityWith
this example, we used a multiplicative application probabilities model which was f a r more consistent w i t hthe data than a non-application model, as measured by a
chi-square comparison of predicted v e r s u s observed fte-
Review
Computers inthe
Hutnanitieslyric p o e t r y tends toward introversion and in-
ternal
movement,
towards
t r a v e l
throughShelley'
s'caverns of t h e m i n d , ' that ' t h o u g h t can w i t h d t f f i - culty v i s i t ' ( C . M a r t i n d a l e , p 57)
Automating poetry is, on the whole, a f a i r l y harmless activity (R.
W.
B a i l e y , p 283)And, i n e v i t a b l y , the papers vary in importance
The only serious f a u l t we find i n the collection is that
e d i t o r i a l comment Fs
ingufficient
This
is especially true inthe
section, 'Art and P o e t r y , ' where titles and appropriatecredits are
given, one o r two c o m p l e t e d designsshown,
but notextual
advice
onthe
nature of t h e programs usedWe
have
chosen
to survey some of t h e papers herein from t w op o i n t s of view, f i r s t as a list of accomplishments, and second
as a source of inspiration for new accomplishments Readers
outside the
field, who
a r e aware that something is going on, b u twho
arenot
quite surewhat,
w i l lfind that
t h e computer isbeing
used most
in
doing the t h i n g s it can do best, that is, indexesand concordances. But i t s use i e being extended to more compli-
cated
tasks,
i t ssymbology is
beFng extended to new alphabetsThese
are some of the things which are being doneTHE COMPUTER AS WORKHORSE
The computer
serves best as a workhorse, d o i n g t h i n g s t h a tare at least
tedious and ttme-consuming, sometimes impossible,Review Computers i n
the
HumanitiesFrancine O u e l l e t t e , i n
"JEUDEMO
a text-handling system," de-scribe j u s t such a system, remarking that "the computer 's main
c o n t r i b u t i o n t o l i t e r a r y endeavour i s i n the provision of con-
cordances, word-indexes, and r a t h e r unsophisticated statistics. I I
The t e x t processing system they describe i s JEUDEMO, designed go
perform " t y p i c a l jobs " The s y s t e m allows for several types od
t e x t s , including s c r i p t s with s e v e r a l a c t o r s , scripts which can
be subdivided, f o r example, i n t o several acts. Output ranges
from vocabulary lists t o a Key Word i n Context (KWIC) l i s t i n g ,
which g i v e s the researcher a good idea of t h e use of a word within
the t e x t . JEUDEMO i s designed t o " meet some of the b a s i c
requirements of the u s e r from t h e h m a r i i t i e s , enabling him t o
realize some r a t h e r s o p h i s t i c a t e d t e x t processing operations with a high degree of computational eEf iciency and comparatively l i t
-
t l e e f f o r t , thus freeing him from much r o u t i n e work and allowing
h i s creativity t o be applied a t
a
much more fruitful stage"
Thew r i t e r s i n s t r u c t t h e reader i n some d e t a i l in the use of the
system, including t h e use of commands, and including an illustra-
tion of a command sequence Options available in t h e program
are given and i l l u s t r a t e d
BEYOND THE C A T A L O G
But w e can go beyond t h e catalog Sara R Jordan's METQA,
described
i n
"A computer program that l e a r n s t o understand natu-ral language," structures and a d a p t s i t s own memory t o reflect
Review Computers in the Humanities
t h a t attempt t o teach n a t u r a l languages u s e dictionaries and
built-
in
linguistic information But METQA "learns" over timeby the usage of each word In t,erms understandable by t h e non-
specialist, Jordan gives a detailed account of the memory s t r u c -
t u r e developed by the program B r i e f l y , t h e t r a i n e r makes hls
input, wlthout s p a c e s . The computer responds e i t h e r w i t h an
equivalent, o r with a q u e s t i o n (What i s i t ? " )
If
questioned,the trainer gives an answer, which i s s t o r e d and assigned t o one
of a system o f n o d e s , connected by labeled links Here machine
acquisition o f language leans i n the d i r e c t i o n o f human a c q u i s i -
t i o n of language
A reminder t h a t t h e machine has l i m i t a t i o n s in that respect
i s s t r u c k by R W B a i l e y in the liveliest paper of t h e book,
"Computer-assis t e d p o e t r y t h e wricing machine is f o r everybody
Bailey finds that mechanical techniques for the production of
p o e t r y work very w e l l , b u t do not produce p o e t r y
The
strategieshe describes are of constructions based on t y p i c a l p o e t i c pat-
terns, without artistic direction An example,
Furtive is mahogany
And delirious are the shadows of i t p pants ( p 288)
The computer manipulated by t h e artist i s another matter
Last in the book, captioned and c r e d i t e d b u t n o t e x p l a i n e d , a r e four examples of computer art Ruth Leavitt's "Computer Graphics'
and "SPLAT. A computer language f o r a r t i s t s , " by
D
Donohue andReview: Computers i n the Humanities 8
generated by the artist-in-command, the computer as tool. Unfor-
tunately, space w a g severely curtailed and explanations of the
programs were not i n c l u d e d .
M E C H A N I C A L PROBLEMS
This book includes accounts of some attempts to solve prob-
lems.peculiar to computer use and problems which arise on exten-
sions of computer use. One is the n e e d to feed alphabets other
than
Roman intothe
computer.K. L. Su, in "The creation of a set of alphabets for the
Chinese language," shows a set of symbols which can be combined to represent nearly all Chipese characters. The symbols chosen by
Su are
s t i l l largerin
numberthan
the number of charactersnecessary for other languages: the keyboard will have 256 keys,
including 210 symbols, 2 6 English l e t t e r s , 10 nuinbers, and 10
notations and punctuations. But without some such breakdown the
tens of thousands of Chinese characters could not b e used at all The author admits a "slight loss of readability" but "not of any
grave consequence. t I
"MUSTRAN I I : foundat i o n for computational musicology"
(J. Wenker) describes a system of notation which c a n b e used in
r e c o r d i n g , and subsequehtly in r e p r o d u c i n g , a musical score.
Another mechanical problem, the necessity that p r o d u c t s of
research be available and u s a b l e in many contexts, is broached
Review: Computers in the Humanities 9
problems in exchange of data.
The
conrmon structure that he findsand
usesin
data records f o r Webster's-
Seventh C o l l e g i a t e-
D i c -tionary, i s a r e v i s i o n of Machine Readable Catalog
( M A R C )
whichis already in use i n lib~aries of the U n i t e d S t a t e s and England.
Sherman's WEBMARC permits the addition of phonetic information
Problems of dimension in catalog assignment are solved t o
some degree by
D.
D. Fisher
in "An information s y s t e m f o r theJoint Caesarea MaritFma (Israel) archaeological excavations
.
t tIn answer to the necessity for precise recording of a r c h a e o l o g i -
cal finds he c a t a l o g s complex structures three-dimensionally
.
Afairly uncomplicated data base kept on punch cards helps keep
information of f i n d s during an excavation in a usable form, w i t h
a precision not easily achieved. The d a t a base can l a t e r be
used to compare artifacts and to work with scientific shapes to
a i d in chronolegical s t u d i e s .
The
computer has increasedwhat
is for other reasons a serious p r o b l a - - t h e paper surge. W. P C o l e , in "Computer-output
microfiche i n the Catalog o f American Portraits" comes up with
the inevitable solution--microfarm. The method he describes for
outputting on microfiche (COM-fiche) involves no paper output Copies for duplicate s e t s are said to b e comparatively low in
c o s t .
SOME DIRECTIONS F O R COMPUTERS IN THE HUMA$IITIES
As an i d e a book; Computers
- -
in the Humanities must give some10 Review : Computers in the Humanities
most
worth-while papers in the collection, is the editor's ownstudy
of
"The
language of the Peterborough Chronicle."
His workwith
the Chronicle isa
continuing eff~rt which is attempting todo
two things: to establish a definitivetext,
and to make con-tributions to
a
gramar forOld
English. What makes hie paperespecially interesting is the
full
andclear
account o f the wayhe
isgoing
about it.CONCORDANCES AND D I C T I O N A R I E S
The obvious and inmediate task for the humanities is the
assemblage of concordances and dictionaries. Two kinds of pro-
gram are presented in Mitchell's book
1)
the programwhich
s i m p l y g e t s the information out and prints on order, and 2) the
program which u s e s the information i n some kind
of
analysis.It
is not surprising that a concordance and a dictionary ofShakespeare would be an early choice.
M.
Spevack,H.
J. Neuhaus,and T.
Finkenstaedt describe the operation of"SHAD
a Shakespearedictionary."
At
the time of their writing, the dictionary wasbeing prepared with the use of an already existing concordance
(Spevack's Complete
-
and Systematic Concordance-
to Shakespeare),the magnetic tapes Urwesen (containing all of Shakespeare), a
Computer Dictionary
(CD)
composed of entries from the-
ShorterOxford English Dictionary (SOED)
,
andLEMCA,
a semi-automatic process of lemmatization.SHAD will
have, according t o theReview: Computers in the Humanities
information
which
may be given for the word gasp include thesethings:
The
word occurs s i x times, c o n t a i n s no grammatical am-biguity,
iscomposed
ofone free morpheme,
itfunctions
as anominal, is not a variant spelling, in uninflected. As a t y p e ,
it has eight tokens (two as a verb).
It
is classed as a noun inS O D D . Its s t a t u s i s cornon, i t i s Germanic in o r i g i n , Old Norse,
it f i r s t appeared
in
p r i n t i n 1 5 7 7 . I t occurs four times i n h i s -tory p l a y s , five i n plays of the 1 5 9 0 t s , i t i s spoken by b o t h
men and women, by both major and minor c h a r a c t e r s . I t i s always
directly preceded by one of these adjectives. last, latter,
l a t e s t . Such i n f o m a t i o n i s only t y p i c a l , n o t exhaustive.
Another p r o j e c t s i m p l e i n s t r u c t u r e but l a r g e i n scope which
can be attempted only with the aid of a computer is
E
J . Jory'sword index described in "New approaches to e p i g r a p h i c problems
in Roman his t o r y '
"
J o r y has coapleted a word index of one volumeof
an already p u b l i s h e d sixteen-volume s e t o f L a t i n epigrams.He
proposes as an e x t e n s i o n of h i s index a central data bank con-taining a l l the information about every s i n g l e known Latin in-
s c r i p t i o n .
LIMITS OF THEORY
It is probably n o t possible to set limits to the cataloging
functions of the computer, and perhaps n o t neceaaary What Fa
needed is an approximate l i n e between what the computer can do
b e s t and what t h e human mind alone can do best. Bailey demon-
Review: Computers in the Humanities 1 2
write poetry. What he did not show, but that we believe can be shown, i s that the poetry writing programs he c r i t i c i z e s are a t
least useful in utldeistanding forms in poetry
Establishing limits is a matter
of
cumulative results fromuninhibited p r o j e c t s
.
Another kind of limitation, boundaries of a theory, may b e
discovered at any time. For example, semantic field theory is
an
attractive method of organization of meaning. The difficultyis that, extended beyond certain neatly arranged categories,
where distincti,ns are clear, the method becomes unmanageably
complex. E . R . Maxwell and R. M. Smith ("A computerized l e x i c o n
of English") describe a process which nay stretch field theory
f a r beyond its present limits, or which may show it to b e so se-
verely limited as t o be o f little value. Beginning with an unde-
fined set of concepts c a l l e d primitives (saneness, difference,
m o t i o n , space, etc.), each word is processed, con t ihuing then
through finer degrees of d i f f e r e n t i a t i o n which ultimately define it and distinguish i t from all other words. Hit
-
"would be de-fined by primitives: an event involving a motion against an
o b j e c t
. . .
by another o b j e c t , the two objects being different I It would then be differentiated from such o t h e r words as punch,jab, and slap.
STYLISTICS
Beyond cataloging, the most urgent need for the computer in
the humanities is in stylistic studies.
We
have already men-Review: Computers in the Humanities 13
recently completed promise to make determination of authorship more certain. One is W.
M.
Baillie's "Authorship attribution inJacobean dramatic t e x t s , " i n Computers
- -
in the Huplanities.
Baillie investigates two t e x t s of Shakespeare s and two of
Fletcher's, looking for automatic distinctions in the dialog of
the two writers. EYEBALL lists part-of-speech and function cate-
gories wherein the investigator finds that several variables, including descriptive adverbs and complements, achieve a diffe-
rentiation success rate of seventy percent or b e t t e r . From ! t h e
information he forms a graph that shows s,ome consistency in dis- crimination between the two playwrights.
Baillie is aware of at least some of the difficulties in
establishment of authorship. 1) The plays are not totally con-
s b t e n t . In certain scenes the distinction is negative, that is,
the norms of one playwright are replaced by the norms of the
other. Such scenes are not destructive to the credibility of the
method, , h t rather they open up new quest5ons of style and author-
ship. 2) The project i s based on the assumption that a writer has
an i d e n t i f i a b l e s t y l e , an assumption which i s yet to be proved.
3)
In
a p l a y each character may have a s t y l e distinct from othersand from the author's personal style. That personal quality may
or may not be given to any one or more characters and may have only an indefinite effect on the style of all of the characters.
4)
Any stylistic study in breadth must assume that over a periodof years a writer's s-tyle w i l l change, that he may revise an
Review
:Computers
in
the Humanities 14these difficulties must
beconsidered,
but not asinsurmountable.
It is just such complexities
which
make the use of the computerurgent and
probably
essential.Another
who
attacks the prbblem of authorship isJ.
R.
Allen,
in
"On the authenticity of the Baligant episode in theChanson
-
de Roland1'. His method is different: he makes a judgmenton the
basisof
thefrequencies of high
frequency lemmata.In
six out of nine cases the Baligant episode vaties significantly
from
the others. Allen commentsthat
t h i sdoes
represent a clearstylistic difference, but not necessarily
a
difference in author-ship, since
i tmay
be the resulc of revision anda
r e t i e c t i o n o rstylistic change
in
the single author. P o e t A f Poet A+
X
years.Any stylistic study is conducted in a vacuum unless two
items
are
in
comparison, for example Writer
A
in
PoemX
vs-
Writer
A
in PoemY,
orWriter
A
vs-
WriterB,
or WriterA
vs-
WriterA
+
ten years. Eventually norms of a sort will be established.
D.
Ross, in "An
EYEBALL
view of Blake% Songs-
ofInnocent-e
- -
and Ex-perience" establishes
standards of "simplicity" againstwhich
other p o e t s ' work can be measured.
He
makes a stylistic compafison of she two poem sequencw, Songs of
-
Innocezlce and Songs-
ofExperiense, making use of
EYEBALL,
which
produces f o r h i m an"augmented t e x t , ' '
noting
for
eachword
syllable length, graxnmatical category and function,
and
location in clause, sentence,and text.
He
finds thestyle
of both groups of poems to he muchReview.
Computers in the Humanities 15Different from a l l these studies is that of
C.
Martindale,who considers categories of thought, as represented by vocabu-
lary items, and in "The semantic significance of spatial move-
ment in narrative verse: patterns of regresshe imagery in the
Divine Comedy," he finds evidence of stylistic progression. He
mslPause of the Regressive Imagery Dictionary (RID) which lists
3647
words "divided into thirty-six categories designed to t a pprimary process and secondary process thinking " His measures
are secondary process thought, i . e . , logical, reality-oriented,
and goal-directed, in contrast with primary process thought,
which is more primitive, free-associative, and sensation- and
drive-oriented.
One last thing w e note here is a direction toward a new meeting place between disciplines..
I. A.
Morton, in "Analysisof tonal music at the level o f perception," describes a system
of
tonal analysis which deals with harmonic relationships betweenchdrds in similar contexts. Chords which are performatively
identical may be ambiguous as to key until the ambiguity is re-
solved i n an ensuing pattern of sounds. Morton insists on the psychalogfcal reality of music,
. . .
much of the 'meaning' in muslc is contained i n anetwork of abstract relationships which is conceived 1 4
the mind of the composef and communicated t o the audi-
tor through a sonic description of it. The musical
score, then, is but a visual encoding of the sonic image
Review: Computers
in the
Humanities 16Everything that Morton says in his paper about the perception of
musical relationships hhs an echo in perception of speech r e l a -
tionships. The concept of music as being somehow parallel to
speech in competence is not new, bux his analysis of musical re-
lationships draws in some verifiable points f o r comparison.
Here we have a renewed realization of what this collection