TY - JOUR
T1 - Automated classification of patents
T2 - A topic modeling approach
AU - Yun, Junghwan
AU - Geum, Youngjung
N1 - Publisher Copyright:
© 2020 Elsevier Ltd
PY - 2020/9
Y1 - 2020/9
N2 - Due to the rapid increase in technological innovation and corresponding increase in patent applications, automatic patent classification systems are very helpful for both individual inventors and patent attorneys in classifying patents. However, previous studies have neglected the question of what content patents include and how to represent patent content effectively in a structured form to predict the patent class. In response, this study suggests a topic model based on support vector machine (SVM) prediction for automatic patent classification. This study considers two important issues for patent classification: text representation and class prediction. For text representation, we use the topic modeling technique and employ latent Dirichlet allocation (LDA). The result of LDA is then used as the input for the second aspect: class prediction. We use SVM prediction for automatic patent classification. We also suggest potential improvement strategies to enhance the prediction performance of our suggested approach. This study contributes to the field in that it can lead to the automatic classification of patents without the need for any expert judgment during the process.
AB - Due to the rapid increase in technological innovation and corresponding increase in patent applications, automatic patent classification systems are very helpful for both individual inventors and patent attorneys in classifying patents. However, previous studies have neglected the question of what content patents include and how to represent patent content effectively in a structured form to predict the patent class. In response, this study suggests a topic model based on support vector machine (SVM) prediction for automatic patent classification. This study considers two important issues for patent classification: text representation and class prediction. For text representation, we use the topic modeling technique and employ latent Dirichlet allocation (LDA). The result of LDA is then used as the input for the second aspect: class prediction. We use SVM prediction for automatic patent classification. We also suggest potential improvement strategies to enhance the prediction performance of our suggested approach. This study contributes to the field in that it can lead to the automatic classification of patents without the need for any expert judgment during the process.
KW - Automatic patent classification
KW - Latent Dirichlet allocation
KW - LDA
KW - Support vector machine
KW - SVM
UR - http://www.scopus.com/inward/record.url?scp=85087927034&partnerID=8YFLogxK
U2 - 10.1016/j.cie.2020.106636
DO - 10.1016/j.cie.2020.106636
M3 - Article
AN - SCOPUS:85087927034
SN - 0360-8352
VL - 147
JO - Computers and Industrial Engineering
JF - Computers and Industrial Engineering
M1 - 106636
ER -