
18 - Perceptual Normalization of Lexical Tones

Behavioral and Neural Evidence

from Part Three - Phonetic-phonological Issues in Chinese

Published online by Cambridge University Press:  04 August 2022

Chu-Ren Huang, The Hong Kong Polytechnic University
Yen-Hwei Lin, Michigan State University
I-Hsuan Chen, University of California, Berkeley
Yu-Yin Hsu, The Hong Kong Polytechnic University

Summary

Despite the complexity and variation of physical signals, human perception of a speech sound uttered by different talkers or in diverse contexts is remarkably constant. Nonetheless, the neurocognitive mechanisms of this fundamental perceptual ability are not well understood, and even less is known about the neural bases of phonetic constancy. We present an emerging picture of this issue based on accumulating behavioral and neuroimaging evidence from lexical tone studies. Because of its variable and dynamic nature, lexical tone offers a useful test of existing theories that were proposed on the basis of segmental studies. We draw evidence from a series of studies on the perceptual normalization of lexical tones to shed light on these prior theories. We also discuss findings from neuroimaging studies that advance our understanding of the temporal and spatial neural signature of phonetic constancy in lexical tone perception. We propose a new model which emphasizes that listeners extract and integrate information from multiple sources in a fast and robust manner to achieve phonetic constancy.

Type
Chapter
Information
Publisher: Cambridge University Press
Print publication year: 2022


