The long-term objective of the proposed work is to understand variability in speech intelligibility, with the ultimate goal of developing speech intelligibility enhancement techniques for a wide range of talkers, listeners and communicative situations. The current objective is to explore variability in speech intelligibility through both listener adaptation to the talker and talker adaptation to the listener. In particular, we focus on the special case of speech communication between all possible combinations of native and non-native English talkers. This focus is in line with current national and global trends towards increasing contact between native and non-native English speakers, and therefore stands to make both theoretical and practical contributions. Our central claim is that variability in overall speech intelligibility is a function of talker-listener sound structure alignment, which can be adjusted in a bi-directional, dynamic manner according to the current communicative conditions. We hypothesize that a mismatch of language background between interlocutors is a bi-directional source of speech intelligibility variability, as well as of cognitive-linguistic innovation. Two predictions of this hypothesis are: (1) variability in intelligibility is related to talker-listener alignment rather than being a simple function of talker and/or listener target language proficiency (bi-directional intelligibility variability), and (2) both native and non-native speakers exhibit speech perception and production changes in response to exposure to native and non-native speech (bi-directional innovation).
The specific aims are:
Aim 1: To develop a large corpus of speech produced by native and non-native speakers of English that includes both scripted and spontaneous, dialogue-based speech samples. The speakers will be carefully selected to cover a range of native language backgrounds and levels of English proficiency. Moreover, in the dialogue portion of the corpus, talkers will be paired in a principled manner, covering 4 different conversation pair types: 2 native talkers, 1 native and 1 non-native talker, 2 non-native talkers from the same language background, and 2 non-native talkers from different language backgrounds. The corpus will be fully transcribed and partially phonetically aligned, creating a valuable resource for the speech research community.
Aim 2: To use this corpus to examine bi-directional talker-listener alignment at the level of speech production and perception. In a series of 3 planned experiments we will (i) examine foreign-accented speech intelligibility in relation to inter- and intra-talker acoustic-phonetic variability, (ii) compare overall communicative efficiency across all 4 types of conversation pairs, and (iii) compare the direction and extent of phonetic convergence in dialogues between conversation partners that vary with respect to their proficiency in the target language (English) and in terms of their native language background.

Public Health Relevance

By documenting bi-directional adaptation under quasi-natural, laboratory-based conditions, this project stands to (a) add critical information to the empirical base on which theories and models of speech perception are built, and (b) forge a conceptual and empirical link between individual-level talker-listener alignment processes and population-level shifts in linguistic sound structure. Moreover, the project will contribute a large, carefully constructed, searchable database of recorded speech accompanied by orthographic transcriptions. This is inherently valuable for basic and applied speech researchers in clinical, university and technological-industry settings, where concerns range from diagnosis of neurological speech impairments (as distinct from deviant speech patterns due to bilingual or second-language speech acquisition) to the handling of foreign accents for automatic speech recognition systems.

Agency: National Institutes of Health (NIH)
Institute: National Institute on Deafness and Other Communication Disorders (NIDCD)
Type: High Priority, Short Term Project Award (R56)
Project #: 2R56DC005794-05A1
Application #: 7893332
Study Section: Language and Communication Study Section (LCOM)
Program Officer: Shekim, Lana O
Project Start: 2002-12-01
Project End: 2011-06-30
Budget Start: 2009-07-24
Budget End: 2011-06-30
Support Year: 5
Fiscal Year: 2009
Total Cost: $297,706
Indirect Cost:

Name: Northwestern University at Chicago
Department: Miscellaneous
Type: Schools of Arts and Sciences
DUNS #: 160079455
City: Evanston
State: IL
Country: United States
Zip Code: 60201

Kim, Midam; Horton, William S; Bradlow, Ann R (2011) Phonetic convergence in spontaneous conversations as a function of interlocutor language distance. Lab Phonol 2:125-156
Clopper, Cynthia G; Smiljanic, Rajka (2011) Effects of gender and regional dialect on prosodic patterns in American English. J Phon 39:237-245
Baker, Rachel E; Baese-Berk, Melissa; Bonnasse-Gahot, Laurent et al. (2011) Word Durations in Non-Native English. J Phon 39:1-17
Bradlow, Ann; Clopper, Cynthia; Smiljanic, Rajka et al. (2010) A Perceptual Phonetic Similarity Space for Languages: Evidence from Five Native Language Listener Groups. Speech Commun 52:930-942
Clopper, Cynthia G; Bradlow, Ann R (2009) Free classification of American English dialects by native and non-native listeners. J Phon 37:436-451
Smiljanić, Rajka; Bradlow, Ann R (2009) Speaking and Hearing Clearly: Talker and Listener Factors in Speaking Style Changes. Lang Linguist Compass 3:236-264