The Data and Dissemination Core [Babbitt (UCSF) and Buriey (Lilly), CoDirectors] addresses the requirements for data gathering/coordination, information dissemination, and bioinformatics as specified in the RFA for U54 Centers. This Core combines Cores B (Information Dissemination and Data-Coordinating Core) and C (Bioinformatics). For the Enzyme Function Initiative (EFI), the Data and Dissemination Core will be implemented and managed using four primary resources working together to ensure ease of data input and tracking by EFI members, coordination of experimental and computational data, implementation and access to analysis and visualization tools, and dissemination of data and information to EFI investigators and the public. Instructions for depositing data and information to the EFI and for using the tools and results of EFI efforts will be provided in the form of help files and online tutorials. Strategies and protocols for dissemination of EFI resources and materials, software and experimental results will also be provided by the core resources. Each is listed briefly below and described in detail in the sections following. EFI results will also be disseminated through publications in the peer-reviewed literature, presentations by EFI investigators at meetings and other venues, and through workshops developed for presentation at meetings of scientific societies and other appropriate organizations. 1. The EFI Website, the responsibility of the Administrative Core at UIUC (Gerit, Director), will be the official site for the EFI, providing information about the project and resources and acting as one of the portals for access to its data, results and materials. The homepage will provide tutorials, listings of publications, news, and other relevant information. Links will enable access to the Data Resource (Laboratory Information Management System for EFI members), the Structure Function Linkage Database, and the Computation Core (see sections 2-4 below). 2. The Data Resource (Sauder, Director) will rely on a state-of-the-art Laboratory Information Management System (LIMS) that has been optimized for high-throughput structural biology and studies of protein ligand-interactions. EFI experimental results and methods will be stored in the Data Resource for transfer to appropriate databases for dissemination. Target status will be automatically shared with the Structure Function Linkage Database (SFLD), together with other associated data. Structure and function results will be available through public websites linked to the Data Resource and through the other primary resources, including the EFI website and the SFLD. Because the core architecture of LIMS has already been implemented and its efficacy validated by a large numbers of users, this resource can provide extensive functionality at minimal cost to the EFI, with the requested funds used to extend LIMS to accommodate the specialized needs of the EFI and to integrate it with the other EFI resources described in this proposal. 3. The Structure Function Linkage Database (SFLD) (Babbitt, Director) will provide access to information about the superfamilies targeted by the Bridging Projects hierarchically classified by both sequence/structural and functional similarity into superfamilies and [monofunctional] families. Developed originally to analyze structure-function relationships of functionally diverse enzyme superfamilies, the SFLD will enable user access to EFI data and results in two major ways. First, the SFLD will serve as a continually updated archive of all sequences, structures (obtained experimentally or from models), and functions that are known for Bridging Project superfamilies, including functions discovered by the EFI Bridging Projects and Scientific Cores. Second, the SFLD will provide the results of computational (bioinformatic) and experimental analyses of these superfamilies in the context of their structure-function relationships to EFI investigators and the public. An extensive set of browse and search tools provide access to these data and information. Because the core architecture of the SFLD has already been implemented, EFI funds can be used principally to create new analysis and visualization functionality and to implement integration and sharing of data and information with the other dissemination resources. Computational analysis of superfamily structure-function relationships for target identification and function prediction is described in the Superfamily/Genome Core. 4. The Computation Core (Jacobson, Director) will create or extend existing resources to disseminate modeling and docking results along with relevant protocols and methods. As deemed useful, subsets of the raw data will be available for dissemination by LIMS or the SFLD and via links from the EFI Website. Because data and results are still closely tied to the further development of research methods, data coordination and bioinformatics for the Computation Core is described in that section rather than here. Only the dissemination and outreach efforts for the Computation Core are described in this Core proposal.

National Institute of Health (NIH)
National Institute of General Medical Sciences (NIGMS)
Specialized Center--Cooperative Agreements (U54)
Project #
Application #
Study Section
Special Emphasis Panel (ZGM1-PPBC-3)
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of Illinois Urbana-Champaign
United States
Zip Code
Gizzi, Anthony S; Grove, Tyler L; Arnold, Jamie J et al. (2018) A naturally occurring antiviral ribonucleotide encoded by the human genome. Nature 558:610-614
Kenney, Grace E; Dassama, Laura M K; Pandelia, Maria-Eirini et al. (2018) The biosynthesis of methanobactin. Science 359:1411-1416
Park, Yun Ji; Kenney, Grace E; Schachner, Luis F et al. (2018) Repurposed HisC Aminotransferases Complete the Biosynthesis of Some Methanobactins. Biochemistry 57:3515-3523
Calhoun, Sara; Korczynska, Magdalena; Wichelecki, Daniel J et al. (2018) Prediction of enzymatic pathways by integrative pathway mapping. Elife 7:
Sheng, Xiang; Patskovsky, Yury; Vladimirova, Anna et al. (2018) Mechanism and Structure of ?-Resorcylate Decarboxylase. Biochemistry 57:3167-3175
Zallot, RĂ©mi; Oberg, Nils O; Gerlt, John A (2018) 'Democratized' genomic enzymology web tools for functional assignment. Curr Opin Chem Biol 47:77-85
Barr, Ian; Stich, Troy A; Gizzi, Anthony S et al. (2018) X-ray and EPR Characterization of the Auxiliary Fe-S Clusters in the Radical SAM Enzyme PqqE. Biochemistry 57:1306-1315
Gerlt, John A (2017) Genomic Enzymology: Web Tools for Leveraging Protein Family Sequence-Function Space and Genome Context to Discover Novel Functions. Biochemistry 56:4293-4308
Koo, Byoung-Mo; Kritikos, George; Farelli, Jeremiah D et al. (2017) Construction and Analysis of Two Genome-Scale Deletion Libraries for Bacillus subtilis. Cell Syst 4:291-305.e7
Holliday, Gemma L; Brown, Shoshana D; Akiva, Eyal et al. (2017) Biocuration in the structure-function linkage database: the anatomy of a superfamily. Database (Oxford) 2017:

Showing the most recent 10 out of 91 publications