한국어 기술문서 분석을 위한 BERT 기반의 분류모델

Translated title of the contribution: BERT-based Classification Model for Korean Documents

Research output: Contribution to journal › Article › peer-review

Abstract

Classifying technical documents such as patents and R&D project reports is necessary for understanding trends in technology convergence, interdisciplinary joint research, and technology development. Text mining techniques have mainly been used for this classification. However, text mining algorithms have the disadvantage that the features representing a technical document must be extracted manually. In this study, we propose a BERT-based document classification model that automatically extracts document features from the text of national R&D projects and classifies the documents. We then verify the applicability and performance of the proposed model for document classification.
Original language: Korean
Pages (from-to): 203-214
Number of pages: 12
Journal: 한국전자거래학회지
Volume: 25
Issue number: 1
State: Published - 2020
