EarthCube is a joint venture between the Directorate of Geosciences and the Office of Cyberinfrastructure at the National Science Foundation. It is a community-driven effort to design and implement an effective data and knowledge management system for the geosciences that will integrates disparate data sets and web services and serves all members of the geoscience community. This 2-day workshop at the University of Colorado brings together investigators and other interested parties who have been tasked with developing cyberinfrastructure components that are potential candidates for inclusion in the final architecture and design of EarthCube. Participants at the workshop will be tasked to synthesize community input from the June EarthCube meeting and sequence the series of proposed milestones and events included in community-created road maps of key cyberinfrastructure capabilities that were generated over the course of the last six months. The overarching goals of the workshop will be to propose a workable cyberinfrastructure framework for EarthCube and work out a provisional timeline for its implementation. Participants will consist of key individuals already working on potential EarthCube architecture components, as well as those from newly formed special interest groups such as those interesting in having EarthCube link to and incorporate biological datasets; those seeking to serve the needs of geoscientists who work with unique sample-based data, as opposed to large homogeneous datasets coming from sensor arrays; and those trying to develop software to allow direct ingestion of data from laboratory instruments. Broader impacts of the workshop lie in the area of building infrastructure for science and engineering.
The goal of EarthCube is a sustainable infrastructure that enables the sharing of all geosciences data, information, and knowledge in an open, transparent and inclusive manner. No such infrastructure currently exists that can meet the information management challenges implicit in the GEO Vision call to action. EarthCube supports the stated goal by providing a well-defined process to drive the evolution of the infrastructure through community engagement. Such engagement must encompass the whole of the GEO community as well as other agencies and international stakeholders. In formulating a path for the creation and evolution of EarthCube, the community was engaged via a number of mechanisms. Two community meetings (called Charrettes) were held, with broad participation. Community groups were funded to address critical areas and gaps. Concept development was stimulated through awards for key areas. The Charettes were an important step forward, but did not fully explore and converge on the potential solutions that could meet the user requirements and the overall objectives of EarthCube. The roadmap development work by the EarthCube-funded activities continued this exploration, refining plans and capability definition. But these roadmaps, coming from groups with different technical and disciplinary backgrounds, ultimately needed to be aligned. The goal of the workshop funded under this project was to broaden the collaboration and convergence between the funded EarthCube projects through a face-to-face exchange between Principal Investigators. All projects were represented. The topics addressed included: architecture concepts; governance; use cases and demonstrations; education and social systems; and, finally, building blocks and next steps. The workshop identified key characteristics of a cyberinfrastructure that would support the EarthCube vision, which were generally consistent across all EarthCube documentation. These include: A collaboration environment, Lower barriers for participation, Openness and extensibility, Ability to accept and store user feedback, Capacity to capture provenance and ensure reproducibility Discoverabilty of materials held by small-project scientists, Education and reward system to encourage data sharing, Cross-domain teams and broad collaboration, A new community paradigm. In addition to the identified challenges in forming and assessing EarthCube, a broader range of relevant key challenges were outlined in a 2010 European document "Riding the Wave - How Europe can gain from the rising tide of scientific data" that impact not only EarthCube, but also the general research community. These global challenges for science research include: Collection Trust Usability Interoperability Diversity Security Education and training Data publication and access Commercial Exploitation New Social Paradigms Preservation and Sustainability These are important goals and have the most value when their meaning and related requirements are refined by the ultimate users of the information system. Thus, at the workshop, emphasis was placed on meeting user needs including nontraditional users. An example of a system for managing arctic data was discussed. The system uses a simple web-based interface to enter data. The data are indexed, searchable, and exposed via web services. It is a microcosm of what could be done in EarthCube. The participants addressed the different categories of interoperability mechanisms that could be pursued to reach the above goals. While simple access to data is a first step, more full-featured services that understand the internal structure of the data and convey some of its semantics to the user are also desirable. It was agreed that mapping between domain information should be a service performed by the cyberinfrastucture, typically in the form of a broker. Use cases can illuminate the effectiveness of this approach and its outcomes. Technical, governance and social issues are intertwined and a successful EarthCube must address all of these. There are social issues associated with the motivation to share data, models, tools, methods, and software. Recommended objectives in this area include: advancing education functionality (sharing and reusing learning objects, etc.) and incorporating educational stakeholders and interests in EarthCube planning and operations; and establishing mechanisms to engender trust, innovation, and collaboration between the geoscience and cyberinfrasturcture communities, eventually encompassing all relevant Federal and international agencies and organizations. Details of the discussions are presented in the workshop proceedings, which are available online. The workshop outcomes were provided to the EarthCube Project Office for planning and were made available to the broad geosciences community through an interactive web forum that included a review of the meeting and provided opportunities for discussion.