Language learners must identify linguistic properties that differ across languages in the language input that surrounds them, and much recent research has explored the potential importance of distributional regularities in the language input for successful learning. However, other recent findings on child sentence understanding have shown that children's immature language comprehension system is prone to mis-parsing of the input. This raises the possibility that informative distributional information might be missed by the learner: if a child misanalyzes sentences in the input, then the true input distribution from the perspective of adults and researchers may be different from the 'intake', i.e., the effective input distribution that feeds into the language learning mechanism. This project investigates this issue through studies of incremental sentence parsing and reanalysis in question constructions in English and Japanese.

Under the direction of Dr. Colin Phillips and Dr. Jeffrey Lidz, Mr. Akira Omaki will conduct studies using eye-tracking, question-after-story and truth value judgment measures in English and Japanese in order to assess a) whether children, like adults, make early commitments to the interpretation of questions ('active dependency processing'), and b) whether children are able to successfully reanalyze in cases where their initial interpretation turns out to be incorrect. The experimental findings will be supplemented with a corpus analysis and a computational modeling study will be combined with the experimental findings, in order to generate an estimate of how the distribution of wh-question constructions appears from a child's perspective. A novel feature of the project is that it combines experimental and corpus-based approaches to gain an understanding of how children's language comprehension system might lead them to apprehend distributional regularities that do not correspond to what is actually present in their input. Thus, one broader impact of the project is that it could have important implications for any research that emphasizes the role of distributional regularities in the language acquisition process. A second broader impact of the project is that it will help to establish new partnerships for language acquisition research on Japanese, based on developing institutional connections with preschools, and will facilitate future international collaborations on comparative language acquisition studies.

Project Report

A key challenge for children learning their first language is to identify its unique grammatical properties. One popular answer to this question is that children are sophisticated data analysts: they can track information across large numbers of sentences and detect distributional regularities that could indicate the rules of grammar. However, this approach leaves aside the issue that children not only have to analyze the data distribution, but also have to gather the data themselves: children must accurately perceive and encode the language input before they can perform distributional analyses. But their comprehension mechanisms are known to often yield non-adultlike interpretations due to their inability to revise moment-by-moment interpretive choices. This project examined children’s interpretation of wh-questions in English and Japanese to shed light on the nature of children’s immature comprehension mechanisms, and how these mechanisms might influence language acquisition processes. A series of story-based comprehension experiments examined preschool-aged children’s interpretation of ambiguous questions such as (1), which provide an ideal testing ground for investigating online interpretation and sentence revision capacities in children. (1) a. Where did Emily tell someone that she hurt herself? b. Doko-de Emily-wa ashi-o kegashita-to itteta-no? where-at Emily-topic foot-accusative hurt-that told-question In (1), the wh-phrase ‘where’ could be associated with either ‘tell’ or ‘hurt’ (i.e., one could answer where the telling event happened, or where the hurting event happened), but previous studies with adults have demonstrated a strong bias for the first verb association. Due to the different word orders of the two languages, this corresponds to the main clause verb ‘tell’ in English (1a), and the embedded clause verb ‘hurt’ in Japanese (1b). Children’s comprehension data revealed a remarkably similar bias to associate ‘where’ to the first verb in the sentence in both English and Japanese, despite the differences in the canonical word order of the language. Next, we manipulated the Japanese examples like (1b) to test whether children were sensitive to cues that indicated that their preferred interpretation was wrong and that revision was required. We found that a syntactic cue was ineffective for children, but that a semantic cue was more effective. Armed with the results of our experiments with children, we investigated how children’s comprehension biases might impact the language learning process. For wh-questions like (1), there are languages that grammatically prohibit the second verb association (e.g., Russian), which suggests that children must learn whether association to the second verb is possible in their language. Learning this property of a language may require a sentence revision process that inhibits the first verb association. Our experimental findings are very relevant to this learning task, as we identified that children generally struggle with the required revision process, except under very specific conditions. To investigate whether the input contains wh-questions that could allow sentence revision, we conducted a corpus analysis of child-directed speech, to find out whether the types of sentences that would be most useful to children are well-represented in the input. From a sample of 14427 wh-questions in child-directed speech, only 86 involved wh-adjunct questions with two finite clauses, suggesting that the critical data for learning the interpretation possibilities in (1) are generally sparse. Of these 86 questions, only 20 (around 0.03%) are of the type that our experiments predict to be useful for children’s learning. This is the first demonstration that the input distribution that children must learn from can be skewed based on properties of the learner’s comprehension mechanisms, and it highlights the importance of children’s sentence processing in understanding language acquisition. In addition to the specific research findings described here, the project contributed to the training of Dr Akira Omaki, whose PhD research this supported. Dr Omaki is now an Assistant Professor in the Department of Cognitive Science at Johns Hopkins University. The project supported the training of Imogen Davidson White, an undergraduate student researcher at the University of Maryland who was recognized with the university’s Undergraduate Researcher of the Year award in 2010. The project also strengthened research collaborations between the US and two Japanese universities. The results of this research have been used in a number of presentations by the PIs that introduce the science of language learning to the general public and to high school students. The methods developed for this project are currently being used to examine adult and child language comprehension in other languages.

Project Start
Project End
Budget Start
2010-02-01
Budget End
2012-01-31
Support Year
Fiscal Year
2009
Total Cost
$11,966
Indirect Cost
Name
University of Maryland College Park
Department
Type
DUNS #
City
College Park
State
MD
Country
United States
Zip Code
20742