Automated spectral data transformations and analysis pipeline for high-throughput

Rajwa, Bartlomiej

Abstract

High-throughput flow cytometry is an emerging cell-analysis and screening technique employed in various fields of life-sciences, including drug discovery and clinical research. One of the major limitations of HT-FC is the lack of robust, rapid, and reproducible tools for data analysis and data mining. The current paradigm of FC analysis does not fit suit the HT format well. Traditionally, FC data are analyzed employing interactive exploratory visualization, which requires preparing a number of 2-D scatter plots that are used by an FC operator or researcher for visual evaluation of sample characteristics. Although the recent interest of computer science and bioinformatics communities in FC has spurred development of automated compensation and gating techniques, the proposed algorithms still follow the traditional analysis pathway (compensation plus gating), and typically attempt to mimic trained human operators in delineating various cell populations defined by the presence of fluorescent markers of varying intensities. Unfortunately, this model is not sustainable when hundreds or thousands of data sets must be processed in real time. This proposed research attempts to radically re-invent the FC data analysis pipeline for high-throughput FC by employing spectral classification approaches to FC data. In the proposed framework the FC data will be modeled as a mixture of signals that can be quantitatively recovered if certain physical and biological constraints describing the experimental system are rigorously followed. We propose a set of algorithms that will allow us first to define and encode the domain knowledge describing the analyzed specimens, subsequently to approximate the concentrations of labels, and from there recover information about the presence or absence of specific phenotypes of interest. The techniques employed will functionally replace two steps in FC data analysis that have traditionally been viewed as separate: compensation and gating. Instead, a new iterative spectral classification process will recover the quantitative characteristc of samples. This will allow for fast and automated extraction of sample features, as well as for mining the collected specimens for similar datasets. The proposed algorithm will be prototyped using R language for statistical computing, and relevant procedures will be made available to other researchers in the field of FC via the Bioconductor project. Upon successful testing and validation using various datasets contributed by collaborators, the classification algorithms will be implemented in PlateAnalyzer, an HT-FC data analysis package developed at Purdue University.

Public Health Relevance

Flow cytometry (FC) is an important single-cell analysis tool employed in various clinical and research applications. The currently used FC data-analysis paradigm utilizes an exploratory, interactive model requiring operators to evaluate samples manually using expertise and experience. This project attempts to build an automated, robust, reproducible, and operator- independent data-analysis system that can be employed for FC data processing and data mining, limiting subjectivity and enhancing the value of FC techniques.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Institute of Biomedical Imaging and Bioengineering (NIBIB)
Type: Exploratory/Developmental Grants (R21)
Project #: 5R21EB015707-02
Application #: 8502256
Study Section: Modeling and Analysis of Biological Systems Study Section (MABS)
Program Officer: Pai, Vinay Manjunath

Project Start: 2012-07-01
Project End: 2014-06-30
Budget Start: 2013-07-01
Budget End: 2014-06-30
Support Year: 2
Fiscal Year: 2013
Total Cost: $210,579
Indirect Cost: $50,912

Institution

Name: Purdue University
Department: Miscellaneous
Type: Other Domestic Higher Education
DUNS #: 072051394

City: West Lafayette
State: IN
Country: United States
Zip Code: 47907

Related projects


NIH 2013 R21 EB	Automated spectral data transformations and analysis pipeline for high-throughput Rajwa, Bartlomiej / Purdue University	$210,579
NIH 2012 R21 EB	Automated spectral data transformations and analysis pipeline for high-throughput Rajwa, Bartlomiej / Purdue University	$198,990

Publications

Rajwa, Bartek; Wallace, Paul K; Griffiths, Elizabeth A et al. (2017) Automated Assessment of Disease Progression in Acute Myeloid Leukemia by Probabilistic Analysis of Flow Cytometry Data. IEEE Trans Biomed Eng 64:1089-1098

Thomas, Fiona M; Goode, Kourtney M; Rajwa, Bartek et al. (2017) A Chemogenomic Screening Platform Used to Identify Chemotypes Perturbing HSP90 Pathways. SLAS Discov 22:706-719

Azad, Ariful; Rajwa, Bartek; Pothen, Alex (2016) Immunophenotype Discovery, Hierarchical Organization, and Template-Based Classification of Flow Cytometry Samples. Front Oncol 6:188

Azad, Ariful; Rajwa, Bartek; Pothen, Alex (2016) flowVS: channel-specific variance stabilization in flow cytometry. BMC Bioinformatics 17:291

Dundar, Murat; Akova, Ferit; Yerebakan, Halid Z et al. (2014) A non-parametric Bayesian model for joint cell clustering and cluster matching: identification of anomalous sample phenotypes with random effects. BMC Bioinformatics 15:314

Novo, David; Gregori, Gerald; Rajwa, Bartek (2013) Generalized unmixing model for multispectral flow cytometry utilizing nonsquare compensation matrices. Cytometry A 83:508-20

Comments

Be the first to comment on Bartlomiej Rajwa's grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: