ACM Computing Surveys 28A(4), December 1996, http://www-cse.ucsd.edu/~pasquale/SDCR96-IO/MuntzR-PasqualeJ.html. Copyright © 1996 by the Association for Computing Machinery, Inc. See the permissions statement below.


Strategic Directions in Computing Research

Working Group on Storage I/O for Large-Scale Computing

System-level Design Issues for Storage I/O


Richard R. Muntz

Computer Science Department, U.C.L.A.
3277A Boelter Hall, Los Angeles, CA 90095-1596, USA
muntz@cs.ucla.edu, http://www.cs.ucla.edu/~muntz/

Joseph Pasquale
Dept. of Computer Science and Engineering, U. C. San Diego
La Jolla, CA 92093-0114, USA
pasquale@cs.ucsd.edu, http://www-cse.ucsd.edu/~pasquale/



Abstract: I/O systems are becoming more complex, and must be designed by considering the entire system, end-to-end. We make a number of recommendations to address this problem, including the following. (1) There needs to be more emphasis on tertiary storage and on the whole (multilevel) storage hierarchy in general. (2) We must pay more attention to issues of resource management and availability in the network, especially if network-attached storage devices become more viable. (3) To improve performance, the operating system must give user-level processes more control over the data path between the storage device and the process, or be able to accept and exploit high-level hints about the application's behavior and its most important performance metrics/quality of service. (4) Finally, more emphasis should be placed on content-based or semantic-based compression, where we believe the greatest advances remain ahead of us. See also the citation page [Muntz Pasquale 1996] for this position statement.

Categories and Subject Descriptors: D.4.2 [Operating Systems]: Storage Management - storage hierarchies; D.4.4 [Operating Systems]: Communications Management - input/output, network communication; B.4.2 [Input/Output and Data Communications]: Input/Output Devices - disks, channels and controllers; E.4 [Data]: Coding and Information Theory - data compaction and compression;

General Terms: Algorithms, Design, Management, Measurement, Performance.

Additional Key Words and Phrases: I/O, communication.



1 I/O Architectures

I/O can no longer be viewed from the point perspective what happens between an I/O device and the machine it is connected to. I/O systems are becoming more complex, and must be designed by considering the entire system, end-to-end. For example, storage systems are themselves distributed systems, comprised of a hierarchy of storage devices of different speeds and sizes and connected by (different types of) networks.

Recommendations:

2 OS-Related Issues

Operating systems are getting more and more "in the way" between the user and the storage system. The buffering and caching done by the OS may actually be detrimental to performance.

Ultimately, the application (or, more typically, a server or middleware) should be provided with more control over low-level functions and let it do what it thinks is best. A good example is the old story about letting database systems control their own buffering because they know best how to do it, rather than the OS try to do it.

Recommendations:

3 Data Compression

How will future data compression techniques influence I/O and vice versa (in addition to simply reducing bandwidth and storage requirements). For example, lossy compression is influencing I/O in requiring variable retrieval rates. What about in the future? What will be the effects of important schemes currently being researched like content (or object) based compression rather than the more common pixel based compression (like JPEG). How will this influence (if at all) storage and retrieval?

Recommendations:

4 Understanding Device Technology Trends

Finally, researchers need a better understanding of where device technology is likely to be going and how that will affect I/O problems 5 years and 10 years from now. Will holographic memories make it? What exactly will they look like? Will semiconductor memories overtake disk in price per MB? Will that make solid state disks the dominant secondary storage device? For DVD technology, what will be the price tradeoffs? Will the technology stay at a plateau for 20 years after reaching the "blue light special" capacity of 40GB and a MB/sec bandwidth?

Recommendations:

References

[Muntz Pasquale 1996]
Muntz, R., and Pasquale, J., 1996. System-level Design Issues for Storage I/O, Strategic Directions in Computing Research: Working Group on Storage I/O for Large-Scale Computing, Computing Surveys, 28A(4), December 1996, http://www-cse.ucsd.edu/~pasquale/SDCR96-IO/MuntzR-PasqualeJ.html.


Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, to republish, to post on servers, or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from Publications Dept, ACM Inc., fax +1 (212) 869-0481, or permissions@acm.org.


Last modified: Thu Oct 17 13:11:11 EDT
Joseph Pasquale <pasquale@cse.ucsd.edu>