Acoustic-articulatory correlations in a four-region model of the vocal tract: Experimental evidence for blade features

Mark Pennington

Abstract


In the first part of this report, the formant frequencies F1–F4 and the quality (or gain) factors Q1–Q4 are correlated with the positions, areas, or area ratios formed by the four active articulators: tongue root, tongue body, blade, and lips. Two of the findings concern the blade: (1) when the blade position (the location of the smallest constriction) moves toward the lips, F3 shifts higher in frequency; (2) blade aperture (blade area normalized by lip area) is directly correlated with Q3. In the second part of the report, these two blade relations are applied to actual coronal speech sounds. To this end, an auditorily based estimator of Q3 is developed: the peak energy factor PE3. The asymptotic ERB (equivalent rectangular bandwidth) of the auditory filter is about one-sixth octave wide, so one-sixth octave is adopted as the unit of formant frequency resolution. Measured F3 frequencies span six one-sixth-octave steps (one octave). The six F3 distinctions are classified by the primary and secondary features of blade position, [anterior posterior] and [AB RB], where AB and RB are advanced blade and retracted blade. Dentalveolars are [+anterior –posterior]; postalveolars are [–anterior +posterior]. Blade aperture is captured by the feature pair [elevated depressed]. Laminals are [+elevated –depressed]; apicals are [–elevated –depressed]. As the blade aperture increases from a small value (laminal) through a medium value (apical) to a large value (depressed), PE3 also increases. The coronal fricatives of American English, Toda, and Ubykh are examined, as are the coronal stops, nasals, and liquids of Central Arrernte. Both the palatographic evidence and the PE3 measures consistently show the laminality of [s] and the apicality of [ʃ ʂ]. Furthermore, the [s ʃ] sounds are always [+anterior]. In American English, for example, there is no statistically significant difference in F3 frequency between laminal [s] and apical [ʃ], which indicates very similar blade positions.
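
The one-sixth-octave unit can be checked numerically with a minimal sketch. It assumes the widely used Glasberg & Moore (1990) ERB formula, ERB(f) = 24.7(4.37f/1000 + 1) Hz, and the standard resonance definition Q = F/B (centre frequency over −3 dB bandwidth); the abstract does not state which formulas the report itself uses, and the function names below are hypothetical.

```python
import math

def erb_hz(f_hz):
    """ERB in Hz at centre frequency f_hz (Glasberg & Moore 1990 formula; an assumption here)."""
    return 24.7 * (4.37 * f_hz / 1000.0 + 1.0)

def erb_in_octaves(f_hz):
    """Width in octaves of a band ERB(f) Hz wide, centred geometrically on f_hz.

    With band edges f*2**(-w/2) and f*2**(w/2), the relative bandwidth is
    r = 2**(w/2) - 2**(-w/2) = 2*sinh(w*ln(2)/2), which is solved for w.
    """
    r = erb_hz(f_hz) / f_hz
    return 2.0 * math.asinh(r / 2.0) / math.log(2.0)

def sixth_octave_steps(f_hz, f_ref_hz):
    """Distance from f_ref_hz to f_hz in one-sixth-octave units."""
    return 6.0 * math.log2(f_hz / f_ref_hz)

def q_factor(f_hz, bandwidth_hz):
    """Standard quality factor: centre frequency divided by -3 dB bandwidth."""
    return f_hz / bandwidth_hz

if __name__ == "__main__":
    # Illustrative frequencies only; none of these values come from the report.
    for f in (1000.0, 2500.0, 8000.0):
        print(f"{f:7.0f} Hz: ERB = {erb_hz(f):6.1f} Hz = {erb_in_octaves(f):.3f} octave")
    print("one-sixth octave =", round(1.0 / 6.0, 3))
    print("steps across one octave:", sixth_octave_steps(4000.0, 2000.0))
```

Under this formula the ERB narrows toward roughly 0.16 octave at high frequencies, which is consistent with "about one-sixth octave", and a doubling of F3 corresponds to six such steps.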


Keywords


vocal tract modeling; acoustic-articulatory correlations; formant measures; coronals
