People spend a great deal of their lives socializing, or interacting with other people. On a typical day a person might collaborate on a project with their work colleagues, play softball with their teammates, and converse with their family. Social interactions are inherently part of most of our activities, therefore, understanding social interactions is a fundamental part of understanding human behavior. As the amount of time people spend online rapidly grows, social interactions which were once limited to in-person meetings, letters, and telephone calls, are increasingly occurring through the use of online resources such as email, Facebook, and online chats. Social and cognitive scientists who strive to understand human behavior can analyze online interactions to illuminate social behavior in this new setting, and benefit from the wealth of data that it provides. However, social interactions are extremely complex, so analyzing and modeling them is not easy in any setting.

Fortunately Bayesian probabilistic methods offer rich, flexible, generative models for data, which can be used to model complex, highly structured, social interactions. In general, Bayesian methods provide a principled framework for reasoning about an uncertain world. Bayesian latent variable models allow us to reason about, or discover, the potentially quite complex, unobserved structure that underlies what we do observe. This research develops methods which discover the unobserved structure necessary to model complex social interactions which occur online, explore group interactions, evaluate how context effects social interactions, and explore social influence. This work has the potential to improve science (e.g. by improving long-distance collaborations), commerce (e.g. by identifying whom businesses should inform about their products), and society at large (e.g. by improving social networking).

Project Report

In this work we developed Bayesian models for social interactions between individuals in a social network. For example, we model people who interact with each other via email within a common social network, such as their workplace network. Results were obtained from two main projects. The goal of the first was to predict the times of future interaction events, such as sent emails, based on previous behavior, while simultaneously discovering groups of individuals who perform similar roles in the social network. The goal of the second was to predict the spread of infection within a college dormitory network. In the first project we used Bayesian statistical models to predict who sends emails to whom and when those emails are sent within a social network. Reciprocity behavior, or how people respond when emails are sent to them from other individuals, was used to group people in the social network together, thereby learning which individuals share similar roles. For example, we can leverage patterns in how people respond to each other’s emails to potentially distinguish business managers from computer administrators. These models were also applied to group conversations, and to militarized interstate disputes between countries. In the latter countries were grouped together by how they responded to threats or use of force by other countries. In the second project we modeled health data collected from students in a college dormitory. This data was collected using smartphones, leveraging health survey information, and proximities of individuals within the dormitory network over time. Our analysis led to the ability to successfully predict how likely an individual is to become ill based on whom they have recently interacted with, and their associated symptom reports. This work will potentially lead to the ability to give more personalized and relevant health advice to individuals.

Agency
National Science Foundation (NSF)
Institute
Division of Advanced CyberInfrastructure (ACI)
Type
Standard Grant (Standard)
Application #
1048563
Program Officer
Sushil K Prasad
Project Start
Project End
Budget Start
2011-01-01
Budget End
2013-12-31
Support Year
Fiscal Year
2010
Total Cost
$157,084
Indirect Cost
Name
Heller Katherine A
Department
Type
DUNS #
City
London
State
Country
United Kingdom
Zip Code