The design of novel synthetic proteins is a key challenge in computational biochemistry with the potential to lead to a better understanding of the processes that underlie life and allow the discovery of molecules with applications in therapeutics, materials, and scienti?c tools. However, due to the high number of degrees of freedom and rugged energy landscape in even small proteins, protein design remains a challenging computational problem. Researchers have begun turning to video games as a means to crowdsource human problem solving for proteomics at a mass scale. This project aims to signi?cantly adapt the existing proteomics video game Foldit to incorporate big data from protein databases into computational structural protein design. This data will be used to inform the manipulation of structural components of proteins. Foldit, a scienti?c discovery game featuring an interactive protein manipulation interface, allows the public to contribute directly t scienti?c research involving the study of proteins. Previous work with Foldit has shown that with an appropriate interface and introduction, even amateur players with no formal background in biochemistry can make contributions to our knowledge of proteomics. Additionally, preliminary protein design work has shown that players can contribute to the successful redesign of existing protein enzymes. In this work we propose to build upon the existing successes of Foldit in crowdsourcing protein design. To do so, we will leverage the huge amount of data on protein structures that exists in protein databases like the RCSB Protein Data Bank. By integrating this data into the mechanics of the Foldit game, we will be able to both improve the tools available to the players and allow them to construct more realistic protein-like structures. We will additionall be able to reward players for staying closer to these structures when making future modi?cations and for ?nding novel sequences that do not exist in databases.

Public Health Relevance

The design of novel synthetic proteins is a key challenge in computational biochemistry due to the high number of degrees of freedom and rugged energy landscape in even small proteins. However, the structural nature of the problem is similar to other proteomics problems that have been shown amenable to human spatial reasoning skills; therefore we propose to develop, deploy, and re?ne signi?cant adaptations to the Foldit crowdsourcing proteomics game that integrate data from protein databases into the mechanics of the game. This additional 'big data' aims to assist players in designing novel, realistic protei structures and advance our understanding of synthetic protein design.

Agency
National Institute of Health (NIH)
Institute
National Cancer Institute (NCI)
Type
Exploratory/Developmental Cooperative Agreement Phase I (UH2)
Project #
1UH2CA203780-01
Application #
9078833
Study Section
Special Emphasis Panel (ZRG1)
Program Officer
Miller, David J
Project Start
2016-05-01
Project End
2018-04-30
Budget Start
2016-05-01
Budget End
2017-04-30
Support Year
1
Fiscal Year
2016
Total Cost
Indirect Cost
Name
Northeastern University
Department
Type
Schools of Arts and Sciences
DUNS #
001423631
City
Boston
State
MA
Country
United States
Zip Code
Cooper, Seth; Sterling, Amy L R; Kleffner, Robert et al. (2018) Repurposing Citizen Science Games as Software Tools for Professional Scientists. FDG 2018:
Kleffner, Robert; Flatten, Jeff; Leaver-Fay, Andrew et al. (2017) Foldit Standalone: a video game-derived protein structure manipulation interface using Rosetta. Bioinformatics 33:2765-2767
Gaston, Jacqueline; Cooper, Seth (2017) To Three or not to Three: Improving Human Computation Game Onboarding with a Three-Star System. Proc SIGCHI Conf Hum Factor Comput Syst 2017:5034-5039
Bauer, Aaron; Popovi?, Zoran (2017) Collaborative Problem Solving in an Open-Ended Scientific Discovery Game. Proc ACM Hum Comput Interact 1: