Every conversation is made up of turns at speaking. Many people are surprised to learn that speakers take turns with very little time between speakers (gaps average ~200ms) with hardly any overlaps. This is true of conversation in languages around the world. Since there's no single clue to indicate when someone will finish speaking, listeners must track speech in real time and attend to a variety of cues to when a speaker will end. But what information do participants in a conversation rely on? This project investigates how children and adults take turns 'on time' by studying natural variation in how quickly speakers take their turns, and by measuring onlookers' eye movements as they watch videos of two people talking. This project will investigate how much participants' ability to anticipate the end of a turn relies on words vs. intonation and rhythm (prosody) in everyday speech. Words give information about content, and also about the structure of upcoming speech. Prosody is a continuous cue to how a phrase is structured, and also relates to the words in many cases. Because young children are good at differentiating between some prosodic patterns, but don't yet know much about words or syntax, we hypothesize that their use of cues will differ from adults'. This project will study turn-taking in natural conversations in order to identify the developmental path for this conversational skill. Conversation is the gateway for analyzing interaction with other social beings. For children, conversation is a way to learn about language and the world. Trivial skills for adults, such as knowing when to come in and how to ask relevant questions, take children years to develop. Understanding how children develop these skills should also shed light on how they gain access to the information all around them.
Across human cultures, children learn language through interaction with caregivers and peers. Children’s early interactions, whatever form they take, are where they learn their first language. Part of learning language in interaction is learning how to use language with others. This project focuses on one conversational skill for interacting with others: turn-taking. Turn-taking in conversation patterns in a similar way across cultures: interlocutors switch between one turn and the next in less than 200 milliseconds on average. This quick timing in back-and-forth turn structure forms a perfect framework for contingent action, allowing us to achieve fine-tuned coordination and mutual estimations of common ground through rapid feedback and conversational repair. A turn-based framework is key to interactive efficiency and effectiveness, and it also shapes children's language-learning environments. Children begin to take turns (of a sort) long before their first words, but their mastery of turn-timing with language is a protracted process during which they respond at a significantly slower rate than adults. In a series of studies focusing on the production and perception of speech by adults and children aged 1-6 years, we explored the development of turn-taking skills in relation to children’s first language development. We used a combination of techniques to test children’s turn-taking and language skills: (1) observational studies of natural interaction during adult-child conversation, and (2) eye-tracking to measure children’s comprehension of conversations they were observing. We found that turn-timing is intimately linked to children's linguistic development. Both in their production and perception of speech, children become sensitive to different types of exchanges as they acquire new linguistic knowledge. Advances in their syntactic and prosodic knowledge result in a non-linear trajectory during development as children hone their turn-timing over their first few years. We are using these results to explore different aspects of the close link between language processing (listening to the speaker’s utterance and planning one’s own next utterance) and turn-structure, especially with respect to children learning language and adults using predictive processing (anticipating what is coming up next) to understand language in conversation. By focusing on this signature property of human conversation, our results will help us attain a better conceptualization of how the fundamental principles of human interaction shape both human language and its use.