Efforts to integrate contact network data with molecular surveillance data provide enormous promise for HIV tracking and intervention. However, the lack of tools to facilitate integrated molecular-social surveillance remains a substantial barrier to progress. For example, most contact network data only contains information on the immediate sexual and drug use partners of a single individual. Yet, the same partners can appear across the contact networks of multiple individuals. Therefore, partners must be matched across contact networks - a process called entity resolution (ER) - in order to provide an accurate view of the overall contact network structure. The process of ER currently requires either substantial resources to manually match individuals or considerable technological expertise in programming to more efficiently match individuals using probabilistic models. Accordingly, this project will 1) develop a machine learning algorithm to match individuals across personal contact networks and validate it using a large existing dataset of young men who have sex with men, and 2) create a graphical user interface to implement the algorithm as an add-on package to an existing tool for network data capture and processing (Network Canvas). The results of this project will provide an open- source and freely available tool that can drastically reduce barriers to matching individuals across contact networks, thereby providing researchers and public health officials with unencumbered access to the underlying structure of drug use and sexual networks, and a potent tool for integrating contact network data with molecular surveillance.

Public Health Relevance

Developing an accurate picture of the drug use and sexual contact networks of men who have sex with men (MSM) and other high risk populations can revolutionize the way in which HIV spread is tracked and intervened upon by public health practitioners. However, these data are usually limited to personal networks, which only include immediate sex and drug use partners. The current project will employ state-of-the-art methods in machine learning to improve the accuracy of matching individuals across personal network data and will integrate this tool into a user-friendly graphical user interface and data management tool, thereby substantially reducing barriers to studying contact networks and providing a potent tool for HIV surveillance and interventions.

Agency
National Institute of Health (NIH)
Institute
National Library of Medicine (NLM)
Type
Exploratory/Developmental Grants (R21)
Project #
1R21LM012578-01
Application #
9348928
Study Section
Behavioral and Social Science Approaches to Preventing HIV/AIDS Study Section (BSPH)
Program Officer
Sim, Hua-Chuan
Project Start
2017-09-30
Project End
2019-09-29
Budget Start
2017-09-30
Budget End
2018-09-29
Support Year
1
Fiscal Year
2017
Total Cost
Indirect Cost
Name
Northwestern University at Chicago
Department
Public Health & Prev Medicine
Type
Schools of Medicine
DUNS #
005436803
City
Chicago
State
IL
Country
United States
Zip Code
60611