TY - JOUR
T1 - Investigating the effects of SQL hints on diverse storage devices through empirical analysis
AU - Kim, Changjong
AU - Son, Yongseok
AU - Kim, Sunggon
N1 - Publisher Copyright:
© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.
PY - 2025/4
Y1 - 2025/4
N2 - 3-1) With the increasing demand for large-scale data management and analytics, applications have diverse I/O needs, and various storage devices are being adapted for database systems to meet these specific I/O requirements. For example, traditional Hard Disk Drives (HDDs), known for their cost-efficiency and high capacity, are used to store cold data and handle large-scale analytics. In addition, emerging Serial Advanced Technology Attachment (SATA) and Non-Volatile Memory Express (NVMe) Solid-State Drives (SSDs), which offer high performance, are utilized for storing hot data and executing real-time analytical tasks. However, due to the distinct characteristics of these devices, careful consideration of queries, databases, I/O operations, and storage devices is essential to effectively utilize underlying storage devices in database systems. In this paper, we analyze SQL hints, which are directives or suggestions provided to the database optimizer, and examine their implications on I/O performance using two popular databases with various storage devices, including HDD, SATA SSD, and NVMe SSD. Our analysis shows that when SQL hints are not used, databases can fully leverage HDD performance but cannot exploit the full potential of SATA and NVMe SSDs. Notably, the performance with NVMe SSDs can even be worse than that of HDDs in certain situations. By adopting SQL hints, while performance with HDD decreases by up to 77.14% due to the characteristics of the device, the performance of SATA and NVMe SSDs can be improved by up to 83.74% and 92.07%, respectively. These results emphasize the importance of carefully adapting SQL hints by considering the unique characteristics of devices, and with correct adoption, SQL hints can improve the performance of databases.
AB - 3-1) With the increasing demand for large-scale data management and analytics, applications have diverse I/O needs, and various storage devices are being adapted for database systems to meet these specific I/O requirements. For example, traditional Hard Disk Drives (HDDs), known for their cost-efficiency and high capacity, are used to store cold data and handle large-scale analytics. In addition, emerging Serial Advanced Technology Attachment (SATA) and Non-Volatile Memory Express (NVMe) Solid-State Drives (SSDs), which offer high performance, are utilized for storing hot data and executing real-time analytical tasks. However, due to the distinct characteristics of these devices, careful consideration of queries, databases, I/O operations, and storage devices is essential to effectively utilize underlying storage devices in database systems. In this paper, we analyze SQL hints, which are directives or suggestions provided to the database optimizer, and examine their implications on I/O performance using two popular databases with various storage devices, including HDD, SATA SSD, and NVMe SSD. Our analysis shows that when SQL hints are not used, databases can fully leverage HDD performance but cannot exploit the full potential of SATA and NVMe SSDs. Notably, the performance with NVMe SSDs can even be worse than that of HDDs in certain situations. By adopting SQL hints, while performance with HDD decreases by up to 77.14% due to the characteristics of the device, the performance of SATA and NVMe SSDs can be improved by up to 83.74% and 92.07%, respectively. These results emphasize the importance of carefully adapting SQL hints by considering the unique characteristics of devices, and with correct adoption, SQL hints can improve the performance of databases.
KW - Benchmark
KW - Database system
KW - Performance analysis
KW - Storage
KW - TPC-H
UR - http://www.scopus.com/inward/record.url?scp=85210374243&partnerID=8YFLogxK
U2 - 10.1007/s10586-024-04853-1
DO - 10.1007/s10586-024-04853-1
M3 - Article
AN - SCOPUS:85210374243
SN - 1386-7857
VL - 28
JO - Cluster Computing
JF - Cluster Computing
IS - 2
M1 - 107
ER -