The Hybrid Method for Accurate Patent Classification

V. V. Yadrintsev; I. V. Sochenkov

doi:10.1134/S1995080219110325

Authors: Yadrintsev V.V.¹,², Sochenkov I.V.¹,³
Affiliations:
1. Federal Research Center Computer Science and Control of the Russian Academy of Sciences
2. Peoples’ Friendship University of Russia (RUDN University)
3. Lomonosov Moscow State University
Issue: Vol 40, No 11 (2019)
Pages: 1873-1880
Section: Article
URL: https://journals.rcsi.science/1995-0802/article/view/206101
DOI: https://doi.org/10.1134/S1995080219110325
ID: 206101

Abstract

This article presents a stacking of two approaches to patent classification. The first is a linguistically supported k-nearest neighbors algorithm that searches for topically similar documents by comparing vectors of lexical descriptors. The second is fastText, a word-embedding-based classifier in which a sentence (or document) vector is obtained by averaging n-gram embeddings, and a multinomial logistic regression then uses these vectors as features. On Russian and English datasets, we show that the stacking classifier outperforms each of the single classifiers.
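The stacking step described in the abstract can be illustrated with a minimal sketch. It assumes that each base classifier (the kNN similarity search and fastText) already returns a per-class probability vector for a document. The abstract does not say which meta-level model the authors use, so the softmax regression below is an assumption, and all names (`meta_features`, `train_meta`, `predict`) and all numbers are hypothetical illustrations rather than the authors' implementation or data.

```python
import math

def softmax(z):
    """Numerically stable softmax over a list of scores."""
    m = max(z)
    e = [math.exp(v - m) for v in z]
    s = sum(e)
    return [v / s for v in e]

def meta_features(knn_probs, ft_probs):
    # Stacking: the base classifiers' outputs become the meta-level input.
    # The trailing 1.0 acts as a bias term.
    return list(knn_probs) + list(ft_probs) + [1.0]

def train_meta(X, y, n_classes, epochs=500, lr=0.5):
    """Fit a softmax-regression meta-classifier by batch gradient descent.

    The choice of meta-classifier is an assumption made for this sketch.
    """
    n_features = len(X[0])
    W = [[0.0] * n_features for _ in range(n_classes)]
    for _ in range(epochs):
        grad = [[0.0] * n_features for _ in range(n_classes)]
        for x, label in zip(X, y):
            p = softmax([sum(w * v for w, v in zip(row, x)) for row in W])
            for c in range(n_classes):
                err = p[c] - (1.0 if c == label else 0.0)
                for j, v in enumerate(x):
                    grad[c][j] += err * v
        for c in range(n_classes):
            for j in range(n_features):
                W[c][j] -= lr * grad[c][j] / len(X)
    return W

def predict(W, x):
    """Return the index of the class with the highest meta-level score."""
    scores = [sum(w * v for w, v in zip(row, x)) for row in W]
    return max(range(len(scores)), key=scores.__getitem__)

# Made-up held-out predictions for two classes:
# (kNN probabilities, fastText probabilities, true class index).
# In practice these should be out-of-fold predictions, so the meta-classifier
# is not trained on outputs the base models produced for their own training data.
held_out = [
    ([0.9, 0.1], [0.6, 0.4], 0),
    ([0.4, 0.6], [0.8, 0.2], 0),
    ([0.1, 0.9], [0.4, 0.6], 1),
    ([0.6, 0.4], [0.2, 0.8], 1),
]
X = [meta_features(k, f) for k, f, _ in held_out]
y = [label for _, _, label in held_out]
W = train_meta(X, y, n_classes=2)

# The base classifiers disagree here: kNN leans weakly towards class 1,
# fastText leans strongly towards class 0. The meta-classifier arbitrates.
new_doc = meta_features([0.45, 0.55], [0.9, 0.1])
print(predict(W, new_doc))
```

The point of the sketch is that the meta-classifier learns from held-out data how much to trust each base classifier, instead of applying a fixed voting rule.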

Keywords

stacking, similarity search, KNN, word embeddings, fastText, patent classification

About the authors

V. V. Yadrintsev

Federal Research Center Computer Science and Control of the Russian Academy of Sciences; Peoples’ Friendship University of Russia (RUDN University)

Author for correspondence.
Email: vvyadrincev@gmail.com
Russian Federation, Moscow, 119333; Moscow, 117198

I. V. Sochenkov

Federal Research Center Computer Science and Control of the Russian Academy of Sciences; Lomonosov Moscow State University

Author for correspondence.
Email: sochenkov@isa.ru
Russian Federation, Moscow, 119333; Moscow, 119991

