This award to the American Anthropological Association will provide support for a workshop on creating a national registry for extant and future cultural anthropological databases. The workshop will focus on archive selection; best practices for data and meta data formatting; eliciting researcher participation; and providing for on-going technical and financial support. The overarching goal is to create a central site where researchers can go to locate sources of visual field recordings, digital data sets, websites, and archival collections of fieldnotes, photographs, coding sheets, and other materials. The workshop will bring together archivists, anthropologists, representatives of key anthropological institutions, and technical specialists. The resulting report will be widely disseminated through multiple publication venues to encourage a wider discussion and support from anthropological researchers.
This grant (BCS-1159109) supported the American Anthropological Association’s efforts to move forward on the creation of a registry for anthropological data sets. A data registry has the potential to address a key problem in anthropology: How to help researchers and other interested groups discover diverse, widely-distributed, heterogeneous data in a digital world; how to improve the access, sharing, and building upon existing data knowledge while respecting ethical mandates to protect human subjects. A data registry could provide a means by which a wide range of widely dispersed source materials--be they photographs, physical specimens like bones, spoken language samples, archaeological data, field notes, LIDaR scans, biometric databases, or recorded songs and rituals—could be located in one central open access finding aid. Such a resource would help groups interested in their cultural heritage, historians, international relations experts, social science scholars and anyone interested in "the human condition" locate documentary evidence about a vast range of cultural, linguistic, archaeological and biological processes. The grant funded two primary activities: 1) the creation of a wiki to canvass the location and variety of data sets currently available See: http://anthroregistry.wikia.com/wiki/Registry_of_Anthropological_Data_Wiki and 2) assemble leading archivists and data collectors at a two-day workshop to identify the necessary fields, define important datasets, and locate the best opportunities for a potential anthropological data registry. Key findings from the two day workshop: Descriptions of Records in the RegistryThe workshop attendees discussed essential fields for a data registry to support aiding discovery of relevant anthropological materials. The overwhelming consensus of these experts was less is more. Participants of the workshop felt that users no longer are fluent in structured searching, further undermining reasons to create a highly structured registry. Interestingly, the participants agreed that crowd-sourced records produce richer description, and there was a widespread sentiment that no taxonomy can adequately cover the real circumstances and nuances of data sets. There were three places, however, where the workshop conversation suggested the value of a controlled vocabulary outweighed its liabilities. Primary Investigator: The group came to a consensus that the "primary investigator" field (the data field that captures the creator and primary research collector of the data set) is critical for any registry. A significant contribution of a registry would be to concatenate the products of individual authors. This collation would offer a huge user benefit to the registry. The current state of records is that the researcher’s corpus is often divided, such as when the sound files go to a separate archive than the papers and field notes of the same individual. Some participants pointed out that authorship has become cloudier because informant communities are increasingly appearing as co-authors. The key take-away from the workshop, however, emphasized that searching for data sets is very likely to continue around the "primary investigator." Relationship of Data Sets: Workshop attendees felt that a data registry would also provide value if it could codify relationships between respective data sets. For instance, a registry would create a linkage between longitudinal studies or a reanalysis or recoding of an initial data set or secondary transcriptions. Level of Granularity: The participating organizations create records with varying levels of granularity. A registry would need to support this range of granularity, in which some archives would contribute collection level records, and some data repositories and institutional repositories would provide record-level entries. Importance of "the abstract" Discussions also identified "the abstract" as an extremely important field which leads to the discovery of data sets. Abstracts offer tremendous flexibility and depth of rich description of the object. The abstract of the dataset must be full-text searchable for registry users to locate data sets of significance to them. Data registry link to data object There was consensus that the registry should link to the data within its archive (as mediated by any restrictions set by the host institution). If a repository or archive assigns a DOI, this would be the preferred handle. If a repository or archive uses a different fixed handle, such as to a MARC record, this could serve. Barriers to archiving and registering data sources Participants discussed some of the barriers to archiving that are specific to anthropology, particularly concerns about management of confidentiality (for instance when research is on mental health or when the specimen is human remains or any other sensitive area of inquiry), Institutional Review Boards requirements, concerns of being scooped by other researchers, and respect for data ownership and copyright. Workshop participants represented the archivist and funding community, so discussion centered on actions that these institutions could take. Supportive steps that funders, societies and archives could take to encourage archiving include: 1) identify strategies that will help generate a group of "early adopters" in archiving, and 2) produce a set of guidelines and best practices that can easily be made available through funding agencies and professional associations.