Biomolecular NMR data computation is complex, requiring both semi-automated and manual processing steps. The NMRbox platform provides easy access to the myriad software tools used during an individual investigation. The efforts of TRD1 ensure that all of this software functions within the same operating system environment. The goals of TRD2 are to provide information management solutions to promote software interoperability across the NMRbox platform (TRD1) and with the analytics of TRD3; to foster reproducibility through the active curation of scientific datasets and workflows; to facilitate the integration of experimental and derived data with information from public databases to generate new knowledge, and to help aggregate this diverse panoply of experimental data and metadata to create and facilitate richer depositions to the appropriate public data repositories. A key component to enriching repository depositions is the collection of provenance metadata which reports on the history or lineage of how computations were orchestrated and intermediate and final results obtained. Knowing the pathway of how results were obtained is a critical component to reproducing and building upon scientific claims.
The aims of this project period are fourfold. The CONNJUR data model and CONNJUR software integration environments (integral components of NMRbox) will be extended to capture more metadata to support the tracking of provenance (or data lineage), to foster reproducibility, to promote software interoperability both within and without NMRbox, and to support a richer metadata set for depositions to the public data repositories. In addition, the workflow management system will be extended to support high-throughput computational workflows by connecting to powerful job management systems such as HTCondor and DAGMan. The data and process logging capabilities of NMRbox will be used for automatic harvesting of whatever metadata can be gleaned by these mechanisms. Finally, more direct access to BMRB resources will be provided through direct machine-to- machine services.
|Dashti, Hesam; Wedell, Jonathan R; Westler, William M et al. (2018) Applications of Parametrized NMR Spin Systems of Small Molecules. Anal Chem 90:10646-10649|
|Pupier, Marion; Nuzillard, Jean-Marc; Wist, Julien et al. (2018) NMReDATA, a standard to report the NMR assignment and parameters of organic compounds. Magn Reson Chem 56:703-715|
|Gryk, Michael R; Ludäscher, Bertram (2018) Semantic Mediation to Improve Reproducibility for Biomolecular NMR Analysis. Transform Digit Worlds (2018) 10766:620-625|
|Maciejewski, Mark W; Schuyler, Adam D; Hoch, Jeffrey C (2018) Practical Nonuniform Sampling and Non-Fourier Spectral Reconstruction for Multidimensional NMR. Methods Mol Biol 1688:341-352|
|Zambrello, Matthew A; Schuyler, Adam D; Maciejewski, Mark W et al. (2018) Nonuniform sampling in multidimensional NMR for improving spectral sensitivity. Methods 138-139:62-68|
|Kaplan, Anne R; Brady, Megan R; Maciejewski, Mark W et al. (2017) Nuclear Magnetic Resonance Structures of GCN4p Are Largely Conserved When Ion Pairs Are Disrupted at Acidic pH but Show a Relaxation of the Coiled Coil Superhelix. Biochemistry 56:1604-1619|
|Zambrello, Matthew A; Maciejewski, Mark W; Schuyler, Adam D et al. (2017) Robust and transferable quantification of NMR spectral quality using IROC analysis. J Magn Reson 285:37-46|
|Hoch, Jeffrey C (2017) Beyond Fourier. J Magn Reson 283:117-123|
|Eghbalnia, Hamid R; Romero, Pedro R; Westler, William M et al. (2017) Increasing rigor in NMR-based metabolomics through validated and open source tools. Curr Opin Biotechnol 43:56-61|
|Fathi, Fariba; Brun, Antonio; Rott, Katherine H et al. (2017) NMR-Based Identification of Metabolites in Polar and Non-Polar Extracts of Avian Liver. Metabolites 7:|
Showing the most recent 10 out of 22 publications