The long-term objective of the proposed work is to understand variability in speech intelligibility, with the ultimate goal of developing speech intelligibility enhancement techniques for a wide range of talkers, listeners and communicative situations. The current objective is to explore variability in speech intelligibility through both listener adaptation to the talker and talker adaptation to the listener. In particular, we focus on the special case of speech communication between all possible combinations of native and non-native English talkers. This focus is in-line with current national and global trends towards increasing contact between native and non- native English speakers, and therefore stands to make both theoretical and practical contributions. Our central claim is that variability in overall speech intelligibility is a function of talker-listener sound structure alignment, which can be adjusted in a bi-directional, dynamic manner according to the current communicative conditions. We hypothesize that a mismatch of language background between interlocutors is a bi-directional source of speech intelligibility variability, as well as of cognitive-linguistic innovation. Two predictions of this hypothesis are: (1) variability in intelligibility is related to talker-listener alignment rather than a simple function of talker and/or listener target language proficiency (bi-directional intelligibility variability), and (2) both native and non-native speakers exhibit speech perception and production changes in response to exposure to native and non-native speech (bi-directional innovation).
The specific aims are:
Aim 1 : To develop a large corpus of speech produced by native and non-native speakers of English that includes both scripted and spontaneous, dialogue-based speech samples. The speakers will be carefully selected to cover a range of native language backgrounds and levels of English proficiency. Moreover, in the dialogue portion of the corpus, talkers will be paired in a principled manner, covering 4 different conversation pair types: 2 native talkers, 1 native and 1 non-native talker, 2 non-native talkers from the same language background, and 2 non-native talkers from different language backgrounds. The corpus will be fully transcribed and partially phonetically aligned, creating a valuable resource for the speech research community.
Aim 2 : To use this corpus to examine bi-directional talker-listener alignment at the level of speech production and perception. In a series of 3 planned experiments we will (i) examine foreign-accented speech intelligibility in relation to inter- and intra-talker acoustic-phonetic variability, (ii) compare overall communicative efficiency across all 4 types of conversation pairs, and (iii) compare the direction and extent of phonetic convergence in dialogues between conversation partners that vary with respect to their levels of proficiency in the target language (English) and in terms of their matched or mismatched native language backgrounds.
By documenting and analyzing bi-directional adaptation under quasi-natural, laboratory-based conditions, this project stands to (a) add critical information to the empirical base on which theories and models of speech perception are built, and (b) forge a conceptual and empirical link between individual-level talker- listener alignment processes and population-level shifts in linguistic sound structure. Importantly, by focusing on the case of speech communication between interlocutors who do not share a "mother tongue," the project will contribute basic information that can ultimately be used to enhance speech communication in the contemporary "globalized" setting. This issue is one of great clinical relevance as the number of non-native English speaking "clients" and "providers" in communication disorder clinics increases (as it will/already has in virtually all educational and commercial settings).
|Brouwer, Susanne; Bradlow, Ann R (2014) Contextual variability during speech-in-speech recognition. J Acoust Soc Am 136:EL26-32|
|Burchfield, L Ann; Bradlow, Ann R (2014) Syllabic reduction in Mandarin and English speech. J Acoust Soc Am 135:EL270-6|
|Calandruccio, Lauren; Brouwer, Susanne; Van Engen, Kristin J et al. (2013) Masking release due to linguistic and phonetic dissimilarity between the target and masker speech. Am J Audiol 22:157-64|
|Baese-Berk, Melissa M; Bradlow, Ann R; Wright, Beverly A (2013) Accent-independent adaptation to foreign accented speech. J Acoust Soc Am 133:EL174-80|
|Brouwer, Susanne; Van Engen, Kristin J; Calandruccio, Lauren et al. (2012) Linguistic contributions to speech-on-speech masking for native and non-native listeners: language familiarity and semantic content. J Acoust Soc Am 131:1449-64|
|Smiljanic, Rajka; Bradlow, Ann R (2011) Bidirectional clear speech perception benefit for native and high-proficiency non-native talkers and listeners: intelligibility and accentedness. J Acoust Soc Am 130:4020-31|
|Clopper, Cynthia G; Smiljanic, Rajka (2011) Effects of gender and regional dialect on prosodic patterns in American English. J Phon 39:237-245|
|Baker, Rachel E; Baese-Berk, Melissa; Bonnasse-Gahot, Laurent et al. (2011) Word Durations in Non-Native English. J Phon 39:1-17|
|Bradlow, Ann; Clopper, Cynthia; Smiljanic, Rajka et al. (2010) A Perceptual Phonetic Similarity Space for Languages: Evidence from Five Native Language Listener Groups. Speech Commun 52:930-942|
|Calandruccio, Lauren; Dhar, Sumitrajit; Bradlow, Ann R (2010) Speech-on-speech masking with variable access to the linguistic content of the masker speech. J Acoust Soc Am 128:860-9|
Showing the most recent 10 out of 22 publications