TY - GEN
T1 - An embedding inversion approach to interpretation of patent vacancy
AU - Lee, Sungsoo
AU - Lee, Hakyeon
AU - Jeon, Jeonghwan
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - This study presents an approach to identifying emerging technology opportunities by extracting patent vacancies and concretizing their meaning in textual form. Patent abstracts are mapped into a high-dimensional vector space using a text embedding model, then reduced to a two-dimensional map using an autoencoder. Density estimation is applied to these coordinates to identify hotspots and define vacant cells as patent vacancies. The two-dimensional coordinates of these patent vacancies are then converted back into high-dimensional embedding vectors using the decoder of a trained autoencoder. Finally, the embedding inversion model converts the embedding vectors into text describing the technology overview. For validation, 7,413 patents related to solar cell technology registered in the last ten years as of 2023 were collected. The first eight years of patent data were used to extract vacancies and generate technical text. Consequently, patents exhibiting a resemblance to the generated text were observed to emerge in the subsequent two years, thereby substantiating the innovative potential of our approach.
AB - This study presents an approach to identifying emerging technology opportunities by extracting patent vacancies and concretizing their meaning in textual form. Patent abstracts are mapped into a high-dimensional vector space using a text embedding model, then reduced to a two-dimensional map using an autoencoder. Density estimation is applied to these coordinates to identify hotspots and define vacant cells as patent vacancies. The two-dimensional coordinates of these patent vacancies are then converted back into high-dimensional embedding vectors using the decoder of a trained autoencoder. Finally, the embedding inversion model converts the embedding vectors into text describing the technology overview. For validation, 7,413 patents related to solar cell technology registered in the last ten years as of 2023 were collected. The first eight years of patent data were used to extract vacancies and generate technical text. Consequently, patents exhibiting a resemblance to the generated text were observed to emerge in the subsequent two years, thereby substantiating the innovative potential of our approach.
KW - Autoencoder
KW - Embedding inversion
KW - Patent vacancy
KW - Technology opportunity analysis
UR - https://www.scopus.com/pages/publications/85218007190
U2 - 10.1109/IEEM62345.2024.10857259
DO - 10.1109/IEEM62345.2024.10857259
M3 - Conference contribution
AN - SCOPUS:85218007190
T3 - IEEE International Conference on Industrial Engineering and Engineering Management
SP - 873
EP - 877
BT - IEEE International Conference on Industrial Engineering and Engineering Management, IEEM 2024
PB - IEEE Computer Society
T2 - 2024 IEEE International Conference on Industrial Engineering and Engineering Management, IEEM 2024
Y2 - 15 December 2024 through 18 December 2024
ER -