A2FL: Autonomous and Adaptive File Layout in HPC through Real-time Access Pattern Analysis

Dong Kyu Sung, Yongseok Son, Alex Sim, Kesheng Wu, Suren Byna, Houjun Tang, Hyeonsang Eom, Changjong Kim, Sunggon Kim

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Various scientific applications with different I/O characteristics are executed in HPC systems. However, underlying parallel file systems are unaware of these characteristics of applications, and using a single fixed file layout for all applications can degrade the performance of HPC systems. In this paper, we propose A2FL, an autonomous and adaptive file layout adjustment scheme that optimizes parallel file system configurations by analyzing the access pattern of the applications. The key steps of A2FL are as follows: (1) A2FL initially intercepts the I/O operations of the application, recording their access patterns in real-time. (2) The access patterns are then transformed into a graphical representation used for predicting I/O performance and providing adjustment recommendations. (3) A2FL autonomously adjusts the file layout based on the prediction results, delivering an optimal file layout within the parallel file system. Moreover, we propose A2FL-Compound which analyzes an access pattern by dividing it into smaller components to optimize the file layout in a fine-grained manner. Our evaluations demonstrate that A2FL significantly enhances I/O performance, with improvements of up to 65.9× compared to the default file layout.

Original languageEnglish
Title of host publicationProceedings - 2024 IEEE International Parallel and Distributed Processing Symposium, IPDPS 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages506-518
Number of pages13
ISBN (Electronic)9798350337662
DOIs
StatePublished - 2024
Event38th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2024 - San Francisco, United States
Duration: 27 May 202431 May 2024

Publication series

NameProceedings - 2024 IEEE International Parallel and Distributed Processing Symposium, IPDPS 2024

Conference

Conference38th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2024
Country/TerritoryUnited States
CitySan Francisco
Period27/05/2431/05/24

Keywords

  • Access Pattern
  • HPC
  • I/O
  • Machine Learning
  • Parallel File System
  • Storage

Fingerprint

Dive into the research topics of 'A2FL: Autonomous and Adaptive File Layout in HPC through Real-time Access Pattern Analysis'. Together they form a unique fingerprint.

Cite this