This application seeks support for a center of excellence in computational mass spectrometry and a national and international resource in the broad area of proteomics. It proposes to enlarge the current research activities, to branch into previously unexplored areas of computational proteomics, and to support multiple collaborative efforts. The proposal addresses the computational bottleneck that affects the entire proteomics community and impairs interpretation of data in thousands of experimental labs around the world. The goal is to bring the modern algorithmic technologies to mass-spectrometry and to build a new generation of reliable open access software tools to support both new development in mass-spectrometry instrumentation and the emerging applications of mass-spectrometry. The proposal focuses on four directions: (i) enabling complex mass spectrometry searches, (ii) analyzing unknown proteomes without protein databases, (iii) analyzing altered proteomes, and (iv) constructing proteogenomic annotations and analyzing pathways. These directions cover both well-studied but still inadequately addressed problems (like search for mutations and post-translational modifications) and unexplored problems for which there are no computational tools currently available (like antibody sequencing or analyzing fusion proteins in cancer). These projects require two-way collaborative efforts on a wide range of topics involving biomedical and computational scientists from various institutions. While many collaborations have been already established at San Diego (UCSD and Burnham Institute), sixteen other US universities, hospitals and biotechnology companies, as well as foreign research institutions at Germany, Singapore, Spain, Sweden, and United Kingdom, we propose to further extend these collaborations by developing robust open access mass spectrometry software that will catalyze the exchanges between experimental and computational researchers in proteomics. The biomedical applications addressed in these collaborative projects include but are not limited to (i) discovery of cancer biomarkers, (ii) elucidation of changes in aged cataractous lens, (iii) understanding how bacteria adjust to antibiotics and other harsh conditions, (iv) addressing the need to constantly reformulate the influenza vaccine to make it efficient, and (v) sequencing of snake venoms that proved instrumental in design of blood clotting drugs. Educational activities in the area of computational proteomics will also be developed, including short courses, a seminar program, an annual conference, and concerted education of students and postdocs.
Showing the most recent 10 out of 87 publications