Automatic Voice-Based Assessment of Language Abilities

Dolata, Jill; Asgari, Meysam

Abstract

Since untreated language disorder, which has a prevalence of at least 7%, can lead to serious behavioral and educational problems, large-scale early language assessment is urgently needed not only for early identification of language disorder but also for planning interventions and tracking progress. This is all the more so because a recent study found that 71% of children diagnosed with Specific Language Impairment (a type of language disorder) had not been previously identified. However, such large-scale efforts would place a heavy burden on professional staff and on other scarce resources. As a result, clinicians, educators, and researchers have argued for the use of computer-based assessment. Recently, progress has been made with computer-based language assessment, but it has been limited to language comprehension (i.e., receptive vocabulary and grammar). Thus, computer-based assessment of language production (that is, expressive language), and particularly of discourse skills, is still lacking. One contributing factor is that a key technology needed for this, Automatic Speech Recognition (ASR), is perceived as inadequate for accurate scoring of language tests, since even the best ASR systems have word error rates in excess of 20%. However, this perception is based on a limited perspective of how ASR can be used for assessment, in which a general-purpose ASR system provides an (often inaccurate) transcript of the child's speech, which would then be scored automatically according to conventional rules. We take an alternative perspective and propose an innovative approach that comprises two core concepts. The first is that of creating special-purpose, test-specific ASR systems whose search space is carefully matched to the space of responses a test may elicit.
The second is that of integrating these systems with machine-learning-based scoring algorithms, whereby the latter operate not on the final, best transcript generated by the ASR system but on the rich layers of intermediate representations that the ASR system computes in the process of recognizing the input speech (a rich representation). Earlier experiments in our lab with digit and narrative recall tests have demonstrated the feasibility of this approach. In the proposed project we will create computer-based scoring and test administration systems for tests in the expressive modality as well as in the vocabulary, grammar, and discourse domains; we will also create a system for a non-word repetition test. The systems will be applied to a diverse group of 300 children ages 3-9 with typical development and with neurodevelopmental disorders, and will be validated against conventional language measures. The automated language tests developed in the project not only cover core diagnostic criteria for language disorders but also create a technological foundation for the computerization of a much broader array of tests for voice-based language and cognitive assessment.
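
To illustrate the contrast the abstract draws between scoring the single best transcript and scoring richer recognizer output, here is a minimal Python sketch. It is not the project's method: the n-best list, its posterior probabilities, the target words, and the function names `one_best_score` and `expected_score` are all hypothetical, chosen only to show how a recall score might change when every hypothesis contributes in proportion to its probability.

```python
from typing import List, Tuple

# Hypothetical n-best list: (hypothesis text, posterior probability).
# A real recognizer would expose richer structures (e.g. lattices);
# an n-best list is assumed here only to keep the sketch small.
NBest = List[Tuple[str, float]]


def _hits(text: str, targets: List[str]) -> int:
    """Count how many target words occur in one hypothesis."""
    words = set(text.lower().split())
    return sum(1 for t in targets if t.lower() in words)


def one_best_score(nbest: NBest, targets: List[str]) -> int:
    """Score only the most probable hypothesis (the conventional view)."""
    best_text, _ = max(nbest, key=lambda h: h[1])
    return _hits(best_text, targets)


def expected_score(nbest: NBest, targets: List[str]) -> float:
    """Posterior-weighted score over all hypotheses."""
    total = sum(p for _, p in nbest)
    if total <= 0:
        raise ValueError("posteriors must sum to a positive value")
    return sum((p / total) * _hits(text, targets) for text, p in nbest)


if __name__ == "__main__":
    # Invented example data for a narrative recall item.
    targets = ["dog", "ball", "park"]
    nbest = [
        ("the dog took the wall to the park", 0.40),
        ("the dog took the ball to the park", 0.35),
        ("the dog took the ball to the bark", 0.25),
    ]
    print("1-best score:", one_best_score(nbest, targets))
    print("expected score:", round(expected_score(nbest, targets), 2))
```

In this invented example the top hypothesis misrecognizes "ball", so the 1-best score undercounts the child's recall, while the weighted score retains the evidence carried by the lower-ranked hypotheses.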

Public Health Relevance

There is a significant need for language assessment for early detection, diagnosis, screening, and progress tracking of language difficulties. However, assessment involves face-to-face sessions with a professional, which may not always be available or affordable. The project goal is to provide a technology solution by designing, implementing, and evaluating computer-based systems for automated voice-based language assessment (both test administration and test scoring) for narrative recall, picture naming, sentence repetition, sentence completion, and nonword repetition.

Funding Agency

Agency: National Institutes of Health (NIH)
Institute: National Institute on Deafness and Other Communication Disorders (NIDCD)
Type: Research Project (R01)
Project #: 5R01DC013996-04
Application #: 9600692
Study Section: Special Emphasis Panel (ZDC1)
Program Officer: Cooper, Judith

Project Start: 2015-12-10
Project End: 2020-11-30
Budget Start: 2018-12-01
Budget End: 2019-11-30
Support Year: 4
Fiscal Year: 2019
Total Cost
Indirect Cost

Institution

Name: Oregon Health and Science University
Department: Internal Medicine/Medicine
Type: Schools of Medicine
DUNS #: 096997515

City: Portland
State: OR
Country: United States
Zip Code: 97239

Related projects


NIH 2020 R01 DC	Automatic Voice-Based Assessment of Language Abilities Dolata, Jill Kalat; Asgari, Meysam / Oregon Health and Science University
NIH 2019 R01 DC	Automatic Voice-Based Assessment of Language Abilities Dolata, Jill Kalat; Asgari, Meysam / Oregon Health and Science University
NIH 2018 R01 DC	Automatic Voice-Based Assessment of Language Abilities Van Santen, Jan P. / Oregon Health and Science University
NIH 2017 R01 DC	Automatic Voice-Based Assessment of Language Abilities Van Santen, Jan P. / Oregon Health and Science University
NIH 2016 R01 DC	Automatic Voice-Based Assessment of Language Abilities Van Santen, Jan P. / Oregon Health and Science University	$638,494
