Skip to main navigation Skip to search Skip to main content

KLUE: Korean Language Understanding Evaluation

  • Sungjoon Park
  • , Jihyung Moon
  • , Sungdong Kim
  • , Won Ik Cho
  • , Jiyoon Han
  • , Jangwon Park
  • , Chisung Song
  • , Junseong Kim
  • , Youngsook Song
  • , Taehwan Oh
  • , Joohong Lee
  • , Juhyun Oh
  • , Sungwon Lyu
  • , Younghoon Jeong
  • , Inkwon Lee
  • , Sangwoo Seo
  • , Dongjun Lee
  • , Hyunwoo Kim
  • , Myeonghwa Lee
  • , Seongbo Jang
  • Seungwon Do, Sunkyoung Kim, Kyungtae Lim, Jongwon Lee, Kyumin Park, Jamin Shin, Seonghyun Kim, Lucy Park, Alice Oh, Jung Woo Ha, Kyunghyun Cho
  • Upstage Co., Ltd.
  • Korea Advanced Institute of Science and Technology
  • NAVER AI Lab
  • Seoul National University
  • Yonsei University
  • Scatter Lab.
  • Kyung Hee University
  • Kakao Enterprise Corp.
  • Sogang University
  • Riiid AI Research
  • New York University

Research output: Contribution to journalConference articlepeer-review

108 Scopus citations

Abstract

We introduce Korean Language Understanding Evaluation (KLUE) benchmark. KLUE is a collection of eight Korean natural language understanding (NLU) tasks, including Topic Classification, Semantic Textual Similarity, Natural Language Inference, Named Entity Recognition, Relation Extraction, Dependency Parsing, Machine Reading Comprehension, and Dialogue State Tracking. We create all of the datasets from scratch in a principled way. We design the tasks to have diverse formats and each task to be built upon various source corpora that respect copyrights. Also, we propose suitable evaluation metrics and organize annotation protocols in a way to ensure quality. To prevent ethical risks in KLUE, we proactively remove examples reflecting social biases, containing toxic content or personally identifiable information (PII). Along with the benchmark datasets, we release pretrained language models (PLM) for Korean, KLUE-BERT and KLUE-RoBERTa, and find KLUE-RoBERTaLARGE outperforms other baselines including multilingual PLMs and existing open-source Korean PLMs. The fine-tuning recipes are publicly open for anyone to reproduce our baseline result. We believe our work will facilitate future research on cross-lingual as well as Korean language models and the creation of similar resources for other languages.

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 4 - Quality Education
    SDG 4 Quality Education

Fingerprint

Dive into the research topics of 'KLUE: Korean Language Understanding Evaluation'. Together they form a unique fingerprint.

Cite this