This project is developing efficient solid-state data storage device and system design solutions. Although solid-state data storage has shown a high potential to significantly advance storage systems to achieve ultra-high performance with low energy consumption, continuous semiconductor technology scaling makes it increasingly challenging to realize efficient and reliable solid-state data storage devices and systems for the storage hierarchy in large-scale systems, such as data centers and cloud systems.

The intellectual merit of this proposal lies in the theme of developing solid-state data storage design techniques from both software and hardware. Major solid-state data storage device architecture functions, including error correction coding, wear-leveling and garbage collection algorithms, are being developed. Workload and device-aware solid-state data storage system design solutions are also being developed for large data center applications. Design solutions will be evaluated using an experimental hardware platform and the Linux operating system.

The broader impact of the proposed research is that it will enable the development of highly reliable and low-cost solid-state data storage devices and systems for large-scale computing and storage systems. The adoption of the design methodology by solid-state data storage architects, system designers, and data-intensive practitioners may provide a direct benefit to this strategically important high technology sector critical to the economic health of the nation. This team will also introduce new research results into courses.

Project Report

Although flash-based Solid State Device (SSD) technology has shown a high potential to significantly advance storage systems to achieve ultra-high performance with low energy consumption, there are several fundamental research issues to be systematically addressed before SSDs can be efficiently and reliably merged into the storage hierarchy in large-scale production systems, such as data centers and cloud systems. One major barrier is the existing simplified system software interface with SSDs, which prevents applications from fully utilizing SSDs in a high performance and highly reliable way measured by speed, longevity, and capacity of SSDs. Another issue is related to error correction codes (ECCs) currenlty used in SSD. Conventional ECCs, such as commonly used BCH code have become increasingly inadequate for SSD as the capacity of SSD continues to increase and its reliability continues to degrade. We have made strong efftorts to address these two issuse in this project. In our new interface designs and implementation in operating system, in databases, and in virtual machine environment, we have provided mechanisms to deliver certain important information items from the system/application layers to the SSD device, such as whether the data should be placed in SSD due to performance and energy reasons, where the data should be placed inside SSD due to a non-uniform reliability distribution of blocks, what data sets in SSD should be replaced, and what data sets should be exchanged betwen SSD and hard disks. Our solutions are highly effective to best utilize the performance of SSD and hard disks in various applications. We have also studied effective methods to deploy the low-density parity-check (LDPC) code, a powerful ECC, in SSD to significanlty improve the reliability of SSD. With our methods, LDPC can be used in SSD with a small increase of response tme delay to achieve a huge gain in SSD capacity, reliability and the price reduction. A research in this project has made a direct impact to influence the design and implementation of Apple's storage product Fusion Drive. We have published our research work in the best venues of the fields under rigorous reviews, including conferences of FAST, PACT, and VLDB. Two Ph.D. students are supported for their Ph.D. dissertation in the project. The new research results have been timely introduced in the classrooms of graduate studies.

Agency
National Science Foundation (NSF)
Institute
Division of Computer and Network Systems (CNS)
Application #
1162165
Program Officer
Anita J. LaSalle
Project Start
Project End
Budget Start
2012-07-01
Budget End
2014-06-30
Support Year
Fiscal Year
2011
Total Cost
$225,000
Indirect Cost
Name
Ohio State University
Department
Type
DUNS #
City
Columbus
State
OH
Country
United States
Zip Code
43210