As many children are connected to their peers and spend a significant amount of time on social web sites (Facebook, Twitter, etc.), cyber bullying in social media is becoming a severe problem that can lead to serious social, psychological, and health effects. To alleviate the problem, intervention from adults (teachers, parents, law enforcement, social web site moderators, etc.) is the key. However, many victims do not report bullying to adults, and bullies can use aliases and act anonymously, and thus they are difficult to identify. The goal of this exploratory project is to eliminate or at least reduce these problems by developing an intelligent system to automatically detect and track cyber bullies on the social web.

This project explores a solution to cyber bullying based on a combination of machine learning, natural language processing, information filtering, and recommendation and social network modeling techniques. The expected results of the project include: (1) algorithm(s) that can detect cyber bullies and bullying messages automatically; (2) piloting results that suggest what prediction accuracy to expect; (3) a preliminary social web bullying detection prototype system and (4) the first labeled social cyber-bullying data set for testing of the prototype and future research in this direction. The project has high risk, as whether such a system can be developed is an untested idea and the task is challenging, largely due to the diversity of bully behaviors, ambiguity, and the special language used by bullies in social media.

Results from this research project are expected to create a foundation for future larger-scale projects investigating the cyber-bullying problem in social media, which will eventually make the social interaction much safer for hundreds of millions of children and beyond. Teaching, training and learning will be promoted directly for the graduate students who serve as research assistants and programmers, and several undergraduates will contribute to the programming. The impact will be strengthened by UCSC's ethnic and cultural diversity, its proximity to Silicon Valley, and the PI's industry collaborations. Results of this research, including data generated, publications, and demo software will be available via the project web site (http://users.soe.ucsc.edu/~yiz/bullying/).

Project Report

Social network websites such as Twitter and Facebook are among the most popular web sites on the Internet. While the impact of the social web can be positive, there can be a dark side as well. One example is social bullying, which can have huge negative effects on children, possibly leading to serious social, psychological and health effects. Unfortunately, social bullying can easily go undetected due to a lack of supervision. This is a pilot proof of concept research to determine whether or not we can build an intelligent information system to automatically detect bullies and bullying messages on the social web. The project team identified and tracked thousands of children’s social web account and found many social cyber bullying messages. The research data is made available for researchers who are interested in working on social bullying problem. Based on information filtering, social network analysis, machine learning and natural language processing techniques, an intelligent bully information filtering software is dveloped, which can automatically identities some potential bullying messages. This project leads to a demo system SocialFilter (http://kideroo.net/), a social web bully monitoring software to help parents and educators track children’s status on Twitter, and detect potential bullying messages. Given a location or a children’s account, SocialFilter tracks the messages related at real time, automatically classifies each message into bullying or non-bullying, and presents all potential bullying messages as a list or on a map. Guidance on how to handle bully cases are also provided for parents on this web site. The core of SocialFilter is an intelligent software trained using human labeled bullying and non-bullying message. Users of the system can also give feedback on whether a potential bullying message is indeed a bullying message or not, and software can further improve it’s classification accuracy by learning from the feedback information. More social web sites are beinng added into kideroo.net. This project has provided teaching, training and learning opportunities for the graduate students, undergraduate students and high school students involved in this project.

Agency
National Science Foundation (NSF)
Institute
Division of Information and Intelligent Systems (IIS)
Type
Standard Grant (Standard)
Application #
1144564
Program Officer
Maria Zemankova
Project Start
Project End
Budget Start
2011-09-01
Budget End
2013-08-31
Support Year
Fiscal Year
2011
Total Cost
$163,332
Indirect Cost
Name
University of California Santa Cruz
Department
Type
DUNS #
City
Santa Cruz
State
CA
Country
United States
Zip Code
95064