EarthCube, a major new NSF initiative, is focused on community-driven development of an integrated and interoperable knowledge management system for data in the geo- and environmental sciences. By utilizing a cooperative, as opposed to competitive process like that which created the Internet and Open Source software, EarthCube will attack the recalcitrant and persistent problems that so far have prevented adequate access to and the analysis, visualization, and interoperability of the vast storehouses of disparate geoscience data and data types residing in distributed and diverse data systems. This awards funds a series of broad, inclusive community interactions to gather adequate information and requirements to create a roadmap for a critical capability (data mining and algorithm/data analytic techniques) that will enable the development of EarthCube. The focus of the community conversations include exploration of present data analysis approaches and possible new novel approaches; data filtering; pattern recognition and machine learning as applied to geoscience datasets; and search algorithm identification and development. Intensive interaction will also take place with awardees and participants in the other EarthCube community groups and concept development awards. Broader impacts of the work include: helping EarthCube realize its potential to dramatically improve the infrastructure for science, actively engaging early career geoscience researchers in the process, and supporting investigators at an institution in an EPSCoR state.

Project Report

EarthCube, a National Science Foundation (NSF) initiative, aims to transform the conduct of research in geosciences and encourage the community-guided cyberinfrastructure development of a coherent framework to integrate and use data and information for knowledge management across the research enterprise with the goal of creating a more sustainable future through improvements in our understanding of Earth as a complex and changing planet. Geoscientists, Computer Scientists, Information Technologists, Social Scientists, and others are collaborating to define a shared vision for the next generation of research over the next decade. To enable transformational research and education within and across geoscience disciplines, NSF has sponsored 26 end-user workshops in diverse geoscience domains over the past two years. The workshops engaged over 1,200 participants from the academic, government and private sectors of the Earth, atmospheric, ocean, computer, information and social science communities; with the overarching objectives of better understanding the limitations of existing cyberinfrastructure, and identifying synergistic opportunities that will help make web-based technologies for sharing data, tools and models accessible to the majority of geoscientists. This project supported a workshop to initiate a process of community dialogue in the area of data including data mining/analytics through "virtual" discussions to leverage existing data mining capabilities and explore emerging data analytics techniques with the over-arching objective of enhancing the scientific productivity of EarthCube datasets and promoting new multi-disciplinary scientific collaborations. The EarthCube Data Discovery, Mining and Access (DDMA) virtual workshop reviewed existing tools/approaches used by the geosciences community, identified challenges that must be overcome, identified promising new approaches, and developed a roadmap for the future of data in EarthCube. Data analytics included data exploration, mining and visualization. The aim was to achieve, through discussions, convergence between needs of the geoscientist and the current and the proposed cyberinfrastructure solutions. The workshop was open to participation by all interested parties, including individuals, institutions, and networks, and across the entire geosciences domain, as well as information sciences. The group prepared a document entitled, "A Community Roadmap for EarthCube Data Discovery, Mining and Access" to recommend a path forward. A two-day EarthCube End-User Principal Investigator workshop was also held, bringing together 40 community leaders tasked with synthesizing the outcomes of the end-user workshops. The goal of the meeting was to articulate a common vision to facilitate EarthCube’s ongoing development by identifying opportunities and mechanisms to further engage their scientific communities; encouraging collaboration across disciplines and domains; and understanding social and technical requirements for future infrastructure. These efforts culminated in the expression of forward-looking and actionable steps that will motivate and guide the development of EarthCube. Information from the end-user workshops showed individual geoscience domains routinely utilize ~30 distinct data sources. These data increasingly overlap traditional disciplinary boundaries and, to provide concrete value to the geoscience community, >70% of end-users thought EarthCube should establish guidelines that would render the mechanisms used to find and store data uniform and interoperable, and ensure the context(s) within which data are generated are reliably documented. For these reasons, the meeting’s common vision for EarthCube highlighted the need to develop a cyberinfrastructure that will: (1) make data discovery, access and integration easier; and (2) enable the construction of shared knowledge bases, communication and coordination through the adoption of community conventions for data management, documentation, exchange and analysis. Small and whole group discussion served to emphasize that, although information-technology is used across the spectrum of geoscience communities, the levels of its penetration into daily workflow are quite variable and to enable transformational science significant adjustments will need to be made to the workstream paths students and researchers follow. Bringing cyberinfrastructure into the mainstream, by developing a user-driven framework that addresses the computational, data, software, knowledge management, and services needs of diverse stakeholders, also requires opportunities and mechanisms that actively engage the geoscience community. To this end, participants recommended that EarthCube should actively encourage the use of community resources to: (1) establish a connection to the next generation of geoscientists by developing intuitive, modular learning objects and self-directed lessons for K-12 through graduate students; and (2) help geoscience domains mature their cyberinfrastructure framework by executing prototype cross-cutting science initiatives. To join the EarthCube community, be informed about the full range of end-user workshops, participate in ongoing community activities (including the emerging systems for governance), and learn more about NSF-funding opportunities, see www.earthcube.org.

Agency
National Science Foundation (NSF)
Institute
Division of Earth Sciences (EAR)
Type
Standard Grant (Standard)
Application #
1240394
Program Officer
Barbara Ransom
Project Start
Project End
Budget Start
2012-05-01
Budget End
2013-10-31
Support Year
Fiscal Year
2012
Total Cost
$99,394
Indirect Cost
Name
University of Alabama in Huntsville
Department
Type
DUNS #
City
Huntsville
State
AL
Country
United States
Zip Code
35805