EarthCube is focused on community-driven development of an integrated and interoperable knowledge management system for data in the geo- and environmental sciences. By utilizing a cooperative, as opposed to competitive, process like that which created the Internet and Open Source software, EarthCube will attack the recalcitrant and persistent problems that so far have prevented adequate access to and the analysis, visualization, and interoperability of the vast storehouses of disparate geoscience data and data types residing in distributed and diverse data systems. This awards funds a series of broad, inclusive community interactions to gather adequate information and requirements to create a roadmap for a critical capability (workflow) in the development of EarthCube, a major new NSF initiative. Workflow in the context of EarthCube, and cyberinfrastructure in general, encompasses a broad range of topics including distributed execution management, the coupling of multiple models into composite applications, the integration of a wide range of data sources with processing, and the creation of refined data products from raw data. A key benefit of the funded work in terms of evaluating and creating community consensus on the best way forward for this capability (i.e., workflow) is the ability to document the provenance of data used in modeling and reproduce model and data-enabled scientific results. The funded workshop and information collecting activity will be open to all interested parties and is being led by a diverse and expert team of cyberinfrastructure developers, computer scientists, and geoscientists. Broader impacts of the work include converging on approaches, protocols, and standards that may be applicable across the sciences. They also include the fostering of close interaction between communities that do not commonly interact with one another and focusing them on the common goal of creating a new paradigm in data and knowledge management in the geosciences.
" is a road map document that provides a both a long term vision for scientific workflows supporting geoscience research and a concrete plan of action to make the vision real. The EarthCube Workflow Community Roadmap is compiled from insights gained through a series of workshops, through a community survey, through in-depth discussions with specific groups, and through regular open teleconference meetings. Intellectual Merit: Our key insight is the need to bring workflow tools into the daily use by geoscientists. Scientific workflow research and development have resulted in a wide array of tools that can solve problems ranging from managing complicated simulations on geographically distributed supercomputers and clusters, to capturing the metadata and provenance information needed to document and reproduce a scientific result, to "whiteboarding" and planning for multidisciplinary collaboration teams who need to understand how to work together on challenging new projects. Collecting these overarching usage scenarios (or vignettes as we term them in the roadmap) from a wide range of projects, encoding them into a common format, and identifying the major operational modes for each was a significant effort of our project team. Broader Impacts: By bringing together a diverse community of researchers in geosciences, computer science, and cyberinfrastructure into a single team, the project was able to gain deeper insights into the challenges faced by researchers as well as a better understanding of what has already been accomplished. Our teamâ€™s depth and breadth also aided in bringing many participants to our workshops, to take our survey, and to participate in the authoring of the final roadmap document. The resulting roadmap document represents an open community effort that includes the inputs of volunteers as well as the project team. We believe this effort illustrates a path for open community governance in cyberinfrastructure research and development.