Abstract
It is necessary to classify technical documents such as patents, R&D project reports in order to understand the trends of technology convergence and interdisciplinary joint research, technology development and so on. Text mining techniques have been mainly used to classify these technical documents. However, in the case of classifying technical documents by text mining algorithms, there is a disadvantage that the features representing technical documents must be directly extracted. In this study, we propose a BERT-based document classification model to automatically extract document features from text information of national R&D projects and to classify them. Then, we verify the applicability and performance of the proposed model for classifying documents.
| Translated title of the contribution | BERT-based Classification Model for Korean Documents |
|---|---|
| Original language | Korean |
| Pages (from-to) | 203-214 |
| Number of pages | 12 |
| Journal | 한국전자거래학회지 |
| Volume | 25 |
| Issue number | 1 |
| DOIs | |
| State | Published - 2020 |