TY - JOUR
T1 - IO workload characterization revisited
T2 - A data-mining approach
AU - Seo, Bumjoon
AU - Kang, Sooyong
AU - Choi, Jongmoo
AU - Cha, Jaehyuk
AU - Won, Youjip
AU - Yoon, Sungroh
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2014/12
Y1 - 2014/12
N2 - Over the past few decades, IO workload characterization has been a critical issue for operating system and storage community. Even so, the issue still deserves investigation because of the continued introduction of novel storage devices such as solid-state drives (SSDs), which have different characteristics from traditional hard disks. We propose novel IO workload characterization and classification schemes, aiming at addressing three major issues: (i) deciding right mining algorithms for IO traffic analysis, (ii) determining a feature set to properly characterize IO workloads, and (iii) defining essential IO traffic classes state-of-the-art storage devices can exploit in their internal management. The proposed characterization scheme extracts basic attributes that can effectively represent the characteristics of IO workloads and, based on the attributes, finds representative access patterns in general workloads using various clustering algorithms. The proposed classification scheme finds a small number of representative patterns of a given workload that can be exploited for optimization either in the storage stack of the operating system or inside the storage device.
AB - Over the past few decades, IO workload characterization has been a critical issue for operating system and storage community. Even so, the issue still deserves investigation because of the continued introduction of novel storage devices such as solid-state drives (SSDs), which have different characteristics from traditional hard disks. We propose novel IO workload characterization and classification schemes, aiming at addressing three major issues: (i) deciding right mining algorithms for IO traffic analysis, (ii) determining a feature set to properly characterize IO workloads, and (iii) defining essential IO traffic classes state-of-the-art storage devices can exploit in their internal management. The proposed characterization scheme extracts basic attributes that can effectively represent the characteristics of IO workloads and, based on the attributes, finds representative access patterns in general workloads using various clustering algorithms. The proposed classification scheme finds a small number of representative patterns of a given workload that can be exploited for optimization either in the storage stack of the operating system or inside the storage device.
KW - Classification
KW - Clustering
KW - IO workload characterization
KW - SSD
KW - Storage and operating systems
UR - https://www.scopus.com/pages/publications/84910032820
U2 - 10.1109/TC.2013.187
DO - 10.1109/TC.2013.187
M3 - Article
AN - SCOPUS:84910032820
SN - 0018-9340
VL - 63
SP - 3026
EP - 3038
JO - IEEE Transactions on Computers
JF - IEEE Transactions on Computers
IS - 12
M1 - 187
ER -