Breast cancer continues to be one of the leading causes of cancer death among women in the United States, despite the advances made in the identification of prognostic and predictive markers for breast cancer treatment. Mammographic reporting is the first step in the screening and diagnosis of breast cancer. Abnormal mammographic findings such as a mass, abnormal calcifications, architectural distortion, and asymmetric density can lead to a cancer diagnosis. The American College of Radiology developed the Breast Imaging Reporting and Data System (BI-RADS) lexicon to standardize mammographic reporting to facilitate biopsy decision-making. However, application of the BI-RADS lexicon has resulted in substantial inter-observer variability, including inappropriate term usage and missing data. This observer variability has lead in part to a considerable variation in the rate of biopsy across the US, with a majority of breast biopsies ultimately found to be benign lesions. Hence, there is the need for a system that can better stratify the risk of cancer and define a more optimum threshold for biopsy. To address this need, we propose to develop an intelligent-augmented risk assessment system for breast cancer management based on multimodality image and clinical information with deep learning and data mining techniques. This study aims to develop a well-defined, novel risk assessment system incorporating multi-modality datasets with a novel predictive model that outputs a probability measure of cancer that is more clinically relevant and informative than the six discrete BI-RADS scores. Using mammographic or breast ultrasound BI- RADS reporting signatures and radiomics features, a predictive model that is more precise and clinically relevant may be developed to target well-characterized and defined specific biopsy patient subgroups rather than a broad heterogeneous biopsy group. Our proposed technique entails a novel strategy using Natural Language Processing to extract pertinent clinical risk factors related to breast cancer from vast amounts of patient charts automatically and integrate them with corresponding image-omics data and radiologist- generated reports. We will extract and quantitate image features from both large amounts of mammography and breast ultrasound images and combine them with the radiology reports and pertinent clinical risk profile and other patient characteristics to generate a risk assessment score to aid radiologists and oncologists in breast cancer risk assessment and biopsy decisions. Such a web-based application tool will be the first breast cancer risk assessment system based on integrative radiomics data augmented by AI methods. The iBRISK tool will enhance engagement between the patient and clinician for making an informed decision on whether or not to biopsy. Our hypothesis is that BI-RADS reports and the imaging metrics contain significant features for the breast cancer risk assessment and biopsy decision-making. By using BI-RADS reports and the imaging metrics, we will be able to develop new metrics to better breast cancer risk assessment. The novelty of the breast cancer risk assessment system is that it will incorporate a new predictive model that deploys deep learning and AI technology to provide a more reliable stratification of the BI-RADS subtypes for breast cancer risk assessment and reduce unnecessary breast biopsies and patients? anxiety.

Public Health Relevance

We propose to develop an intelligent-augmented risk assessment system for breast cancer management based on multimodality image and clinical information with deep learning and data mining techniques. Using mammographic or breast ultrasound BI-RADS reporting signatures and radiomics features, a more precise and clinically relevant predictive model will be developed to target well-characterized and defined specific biopsy patient subgroups rather than a broad heterogeneous biopsy group. The more reliable stratification of BI-RADS subtypes for breast cancer risk assessment will reduce the number of unnecessary breast biopsies and save billions of dollars in medical costs annually.

Agency
National Institute of Health (NIH)
Institute
National Cancer Institute (NCI)
Type
Research Project (R01)
Project #
1R01CA251710-01
Application #
10028242
Study Section
Biomedical Computing and Health Informatics Study Section (BCHI)
Program Officer
Tata, Darayash B
Project Start
2020-06-01
Project End
2025-05-31
Budget Start
2020-06-01
Budget End
2021-05-31
Support Year
1
Fiscal Year
2020
Total Cost
Indirect Cost
Name
Methodist Hospital Research Institute
Department
Type
DUNS #
185641052
City
Houston
State
TX
Country
United States
Zip Code
77030