The goal of this project is to provide support for efficient search of multi-dimensional data in Peer-to-Peer (P2P) systems. The project achieves its goal using the following approaches: (1) Develop a new P2P overlay, called semantic small world (SSW), to serve as a general infrastructure (which includes a network, a distributed index, as well as a logical file structure) for supporting a variety of queries at the same time; (2) Develop an innovative dimensionality reduction method that maps data clustered in high-dimensional space to a low-dimensional SSW to reduce the high maintenance overhead required for storing high-dimensional data in constantly changing P2P environments; (3) Develop search algorithms for point query, range query, K nearest neighbor (KNN) search, Top-K query, similarity search and joins based on SSW; (4) Prototype SSW and the search algorithms on PlanetLab; and (5) Develop simulators and analytical models to evaluate the performance of the developed techniques. The research results will have great impacts on distributed data management in e-commerce, information retrieval, digital government and scientific studies. This project supports Ph.D students to pursue research in the areas of distributed data management and P2P systems. A new graduate-level course covering these areas and integrating the research results from the project will be introduced into the curriculum. Publications, technical reports, software and experimental data from this research will be disseminated via the project web site (www.cse.psu.edu/pda/P2P).

Agency
National Science Foundation (NSF)
Institute
Division of Information and Intelligent Systems (IIS)
Application #
0534343
Program Officer
Gia-Loi Le Gruenwald
Project Start
Project End
Budget Start
2006-01-15
Budget End
2009-12-31
Support Year
Fiscal Year
2005
Total Cost
$350,000
Indirect Cost
Name
Pennsylvania State University
Department
Type
DUNS #
City
University Park
State
PA
Country
United States
Zip Code
16802