• No results found

Vocal Production System: Evolution

The human vocal production system is similar in broad outline to that of other terrestrial vertebrates. All tetra-pods (nonfish vertebrates: amphibians, reptiles, birds, and mammals) inherit from a common ancestor three key components: (1) a respiratory system with lungs;

(2) a larynx that acts primarily as a quick-closing gate to protect the lungs, and often secondarily to produce sound; and (3) a supralaryngeal vocal tract which filters this sound before emitting it into the environment. De-spite this shared plan, a wide variety of interesting mod-ifications of the vocal production system are known. The functioning of the basic tetrapod vocal production sys-tem can be understood within the theoretical framework of the myoelastic-aerodynamic and source/filter theories familiar to speech scientists.

The lungs and attendant respiratory musculature provide the air stream powering phonation. In primi-tive air-breathing vertebrates, the lungs were inflated by rhythmic compression of the oral cavity, or ‘‘buccal pumping,’’ and this system is still used by lungfish and amphibians (Brainerd and Ditelberg, 1993). Inspiration by active expansion of the thorax evolved later, in the ancestor of reptiles, birds, and mammals. This was powered originally by the intercostal muscles (as in liz-ards or crocodilians) and later (in mammals only) by a muscular diaphragm (Liem, 1985). Phonation is typi-cally powered by passive deflation of the elastic lungs, or in some cases by active compression of the hypaxial musculature. In many frogs, air expired from the lungs during phonation is captured in an elastic air sac, which then deflates, returning the air to the lungs. This allows frogs to produce multiple calls from the same volume

of air. The inflated sac also increases the e‰ciency with which sound is radiated into the environment (Gans, 1973).

The lungs are protected by a larynx in all tetrapods.

This structure primitively includes a pair of barlike car-tilages that can be separated (for breathing) or pushed together (to seal the airway) (Negus, 1949). Expiration through the partially closed larynx creates a turbulent hiss—perhaps the most primitive vocalization, which virtually all tetrapods can produce. However, more so-phisticated vocalizations became possible after the inno-vation of elastic membranes within the larynx, the vocal cords, which are found in most frogs, vocal reptiles (geckos, crocodilians), and mammals. Although the larynx in these species can support a wide variety of vocalizations, its primary function as a protective gate-way appears to have constrained laryngeal anatomy. In birds, a novel phonatory structure called the syrinx evolved at the base of the trachea. Dedicated to vocal production, and freed from the necessity of tracheal protection, the avian syrinx is a remarkably diverse structure underlying the great variety of bird sounds (King, 1989).

Although our knowledge of animal phonation is still limited, phonation in nonhumans appears to follow the principles of the myoelastic-aerodynamic theory of hu-man phonation. The airflow from the lungs sets the vo-cal folds (or syringeal membranes) into vibration, and the rate of vibration is passively determined by the size and tension of these tissues. Vibration at a particular frequency does not typically require neural activity at that frequency. Thus, relatively normal phonation can be obtained by blowing moist air through an excised larynx, and rodents and bats can produce ultrasonic vocalizations at 40 kHz and higher (Suthers and Fattu, 1973). However, cat purring relies on an active tensing of the vocal fold musculature at the 20–30 Hz funda-mental frequency of the purr (Frazer Sissom, Rice, and Peters, 1991). During phonation, the movements of the vocal folds can be periodic and stable (leading to tonal sounds) or highly aperiodic or even chaotic (e.g., in screams); while such aperiodic vocalizations are rare in nonpathological human voices, they can be important in animal vocal repertoires (Fitch, Neubauer, and Her-zel, 2002).

Because the length of the vocal folds determines the lowest frequency at which they could vibrate (Titze, 1994), with long folds producing lower frequencies, one might expect that a low fundamental would provide a reliable indication of large body size. However, the size of the larynx is not tightly constrained by body size.

Thus, a huge larynx has independently evolved in many mammal species, probably in response to selection for low-pitched voices (Fig. 1A, B). For example, in howler monkeys (genus Alouatta) the larynx and hyoid have grown to fill the space between mandible and sternum, giving these small monkeys remarkably impressive and low-pitched voices (Kelemen and Sade, 1960). The most extreme example of laryngeal hypertrophy is seen in the hammerhead bat Hypsignathus monstrosus, in which the

larynx of males expands to fill the entire thoracic cavity, pushing the heart, lungs, and trachea down into the ab-domen (Schneider, Kuhn, and Keleman, 1967). A simi-lar though less impressive increase in simi-larynx dimensions is observed in human males and is partially responsible for the voice change at puberty (Titze, 1989).

Sounds created by the larynx must pass through the air contained in the pharyngeal, oral, and nasal cavities, collectively termed the supralaryngeal vocal tract or simply vocal tract. Like any column of air, this air has mass and elasticity and vibrates preferentially at certain resonant frequencies. Vocal tract resonances are termed formants (from the Latin formare, to shape): they act as filters to shape the spectrum of the vocal output. Because all tetrapods have a vocal tract, all have formants. For-mant frequencies are determined by the length and shape of the vocal tract. Because the vocal tract in mammals rests within the confines of the head, and skull size and body size are tightly linked (Fitch, 2000b), formant fre-quencies provide a possible indicator of body size not as easily ‘‘faked’’ as the laryngeal cue of fundamental frequency. Large animals have long vocal tracts and low formants. Together with demonstrations of formant perception by nonhuman animals (Sommers et al., 1992;

Fitch and Kelley, 2000), this suggests that formants may have provided a cue to size in primitive vertebrates (Fitch, 1997). However, it is possible to break the ana-tomical link between vocal tract length and body size, and some intriguing morphological adaptations have arisen to elongate the vocal tract (presumably resulting from selection to sound larger; Fig. 1C–E ). Elongations of the nasal vocal tract are seen in the long nose of male proboscis monkeys or the impressive nasal crests of hadrosaur dinosaurs (Weishampel, 1981). Vocal tract elongation can also be achieved by lowering the larynx;

this is seen in extreme form in the red deer Cervus ela-phus, which retract the larynx to the sternum during ter-ritorial roaring (Fitch and Reby, 2001). Again, a similar change occurs in human males at puberty: the larynx descends slightly to give men a longer vocal tract and lower formants than same-sized women (Fitch and Giedd, 1999).

Human speech is thus produced by the same conser-vative vocal production system of lungs, larynx, and vo-cal tract shared by all tetrapods. However, the evolution of the human speech apparatus involved several impor-tant changes. One was the loss of laryngeal air sacs. All great apes posses large balloon-like sacs that open into the larynx directly above the glottis (Negus, 1949; Scho¨n Ybarra, 1995). Parsimony suggests that the common ancestor of apes and humans also had such air sacs, which were subsequently lost in human evolution. How-ever, air sacs are occasionally observed in humans in pathological situations, a laryngocele is a congenital or acquired air sac that is attached to the larynx through the laryngeal ventricle at precisely the same location as in the great apes (Stell and Maran, 1975). Because the function of air sacs in ape vocalizations is not under-stood, the significance of their loss in human evolution is unknown.

Vocal Production System: Evolution 57

A second change in the vocal production system dur-ing human evolution was the descent of the larynx from its normal mammalian position high in the throat to a lower position in the neck (Negus, 1949). In the 1960s, speech scientists realized that this ‘‘descended larynx’’

allows humans to produce a wider variety of formant patterns than would be possible with a high larynx (Lieberman, Klatt, and Wilson, 1969). In particular, the

‘‘point vowels’’ /i, a, u/ seem to be impossible to attain unless the tongue body is bent and able to move freely within the oropharyngeal cavity. Given the existence of these vowels in virtually all languages (Maddieson, 1984), speech typical of modern humans appears to re-quire a descended larynx. Of course, all mammals can produce a diversity of sounds, which could have served a simpler speech system. Also, most mammals appear to lower the larynx during vocalization (Fitch, 2000a), lessening the gap between humans and other animals.

Despite these caveats, the descended larynx is clearly an important component of human spoken language (Lie-berman, 1984). The existence of nonhuman mammals with a descended larynx raises the possibility that this trait initially arose to exaggerate size in early hominids and was later coopted for use in speech (Fitch and Reby, 2001). Finally, recent fossils suggest that an expansion of the thoracic intervertebral canal occurred during the evolution of Homo some time after the earliest Homo erectus (MacLarnon and Hewitt, 1999). This change

may be associated with an increase in breathing control necessary for singing and speech in our own species.

See also vocalization, neural mechanisms of.

—W. Tecumseh Fitch

References

Brainerd, E. L., and Ditelberg, J. S. (1993). Lung ventilation in salamanders and the evolution of vertebrate air-breathing mechanisms. Biological Journal of the Linnean Society, 49, 163–183.

Fitch, W. T. (1997). Vocal tract length and formant fre-quency dispersion correlate with body size in rhesus ma-caques. Journal of the Acoustical Society of America, 102, 1213–1222.

Fitch, W. T. (2000a). The phonetic potential of nonhuman vocal tracts: Comparative cineradiographic observations of vocalizing animals. Phonetica, 57, 205–218.

Fitch, W. T. (2000b). Skull dimensions in relation to body size in nonhuman mammals: The causal bases for acoustic al-lometry. Zoology, 103, 40–58.

Fitch, W. T., and Giedd, J. (1999). Morphology and develop-ment of the human vocal tract: A study using magnetic resonance imaging. Journal of the Acoustical Society of America, 106, 1511–1522.

Fitch, W. T., and Kelley, J. P. (2000). Perception of vocal tract resonances by whooping cranes, Grus americana. Ethology, 106, 559–574.

Fitch, W. T., Neubauer, J., and Herzel, H. (2002). Calls out of chaos: The adaptive significance of nonlinear phenomena in Figure 1.Examples of unusual vocal adaptations among vertebrates (not to scale). A, Hammerheaded bat, Hypsignathus monstrosus, has a huge larynx (gray) enlarged to fill the thoracic cavity. B, Howler monkeys Alouatta spp. have the largest rela-tive larynx size among primates, which together with the enlarged hyoid fills the space beneath the mandible (larynx and hyoid shown in gray). C, Male red deer Cervus elaphus have a permanently de-scended larynx, which they lower to the sternum when roaring, resulting in an extremely elongated vocal tract (shown in gray). D, Humans—

Homo sapiens—have a descended larynx, resulting in an elongated

‘‘two-tube’’ vocal tract (shown in gray). E, The now extinct duck-billed dinosaur Parasaurolophus had a hugely elongated nasal cavity (shown in gray) that filled the bony crest adorning the skull.

mammalian vocal production. Animal Behaviour, 63, 407–

418.

Fitch, W. T., and Reby, D. (2001). The descended larynx is not uniquely human. Proceedings of the Royal Society, Biologi-cal Sciences, 268, 1669–1675.

Frazer Sissom, D. E., Rice, D. A., and Peters, G. (1991). How cats purr. Journal of Zoology (London), 223, 67–78.

Gans, C. (1973). Sound production in the Salientia: Mecha-nism and evolution of the emitter. American Zoologist, 13, 1179–1194.

Kelemen, G., and Sade, J. (1960). The vocal organ of the howling monkey (Alouatta palliata). Journal of Morphology, 107, 123–140.

King, A. S. (1989). Functional anatomy of the syrinx. In A. S.

King and J. McLelland (Eds.), Form and function in birds (pp. 105–192). New York: Academic Press.

Lieberman, P. (1984). The biology and evolution of language.

Cambridge, MA: Harvard University Press.

Lieberman, P. H., Klatt, D. H., and Wilson, W. H. (1969). Vocal tract limitations on the vowel repertoires of rhesus monkey and other nonhuman primates. Science, 164, 1185–1187.

Liem, K. F. (1985). Ventilation. In M. Hildebrand (Ed.), Functional vertebrate morphology (pp. 185–209). Cam-bridge, MA: Belknap Press/Harvard University Press.

MacLarnon, A., and Hewitt, G. (1999). The evolution of human speech: The role of enhanced breathing control.

American Journal of Physical Anthropology, 109, 341–363.

Maddieson, I. (1984). Patterns of sounds. Cambridge, UK:

Cambridge University Press.

Negus, V. E. (1949). The Comparative anatomy and physiology of the larynx. New York: Hafner.

Schneider, R., Kuhn, H.-J., and Kelemen, G. (1967). Der Lar-ynx des ma¨nnlichen Hypsignathus monstrosus Allen, 1861 (Pteropodidae, Megachiroptera, Mammalia). Zeitschrift fu¨r wissenschaftliche Zoologie, 175, 1–53.

Scho¨n Ybarra, M. (1995). A comparative approach to the nonhuman primate vocal tract: Implications for sound pro-duction. In E. Zimmerman and J. D. Newman (Eds.), Cur-rent topics in primate vocal communication (pp. 185–198).

New York: Plenum Press.

Sommers, M. S., Moody, D. B., Prosen, C. A., and Stebbins, W. C. (1992). Formant frequency discrimination by Japa-nese macaques (Macaca fuscata). Journal of the Acoustical Society of America, 91, 3499–3510.

Stell, P. M., and Maran, A. G. D. (1975). Laryngocoele. Jour-nal of Laryngology and Otology, 89, 915–924.

Suthers, R. A., and Fattu, J. M. (1973). Mechanisms of sound production in echolocating bats. American Zoologist, 13, 1215–1226.

Titze, I. R. (1989). Physiologic and acoustic di¤erences be-tween male and female voices. Journal of the Acoustical Society of America, 85, 1699–1707.

Titze, I. R. (1994). Principles of voice production. Englewood Cli¤s, NJ: Prentice Hall.

Weishampel, D. B. (1981). Acoustic analysis of potential vocalization in lambeosaurine dinosaurs (Reptilia: Ornith-ischia). Paleobiology, 7, 252–261.