The TF-IDF measure and analysis of links between words within N-grams in the formation of knowledge units for open tests
- Authors: Emelyanov G.M.1, Mikhailov D.V.1, Kozlov A.P.1
- Affiliations:
  - Yaroslav-the-Wise Novgorod State University
- Issue: Vol 27, No 4 (2017)
- Pages: 825-831
- Section: Applied Problems
- URL: https://journals.rcsi.science/1054-6618/article/view/195266
- DOI: https://doi.org/10.1134/S1054661817040058
- ID: 195266
Abstract
A method is proposed for searching a text corpus for the phrases that are most similar to an original phrase in the knowledge fragment they describe, including its linguistic forms of expression. Similarity is based on a numerical evaluation of the coupling strength between related words of the original phrase that occur in the candidate phrases. The links themselves are extended from traditional bigrams to three or more elements, and they are identified by dividing the words of the original phrase into classes according to their TF-IDF values, as an alternative to syntactic dependences.
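The abstract does not give the authors' formulas, so the following is only a minimal sketch of the general idea, under stated assumptions: words of the original phrase are scored with the classic TF-IDF formula, split into classes by rank (an illustrative rule, not the paper's), and a link of two or more words is scored by a simple co-occurrence ratio that stands in for the paper's coupling strength. The function names `tf_idf`, `split_into_classes`, and `coupling_strength`, as well as the toy corpus, are hypothetical.

```python
import math

def tf_idf(word, doc, corpus):
    """Classic TF-IDF: relative frequency in doc times log(N / document frequency)."""
    tf = doc.count(word) / len(doc)
    df = sum(1 for d in corpus if word in d)
    return tf * math.log(len(corpus) / df) if df else 0.0

def split_into_classes(phrase, doc, corpus, n_classes=2):
    """Rank the phrase's distinct words by TF-IDF and cut the ranking
    into classes of equal size (assumed rule, for illustration only)."""
    words = list(dict.fromkeys(phrase))  # de-duplicate, keep phrase order
    ranked = sorted(words, key=lambda w: tf_idf(w, doc, corpus), reverse=True)
    size = math.ceil(len(ranked) / n_classes)
    return [ranked[i:i + size] for i in range(0, len(ranked), size)]

def coupling_strength(words, corpus):
    """Stand-in measure: among documents containing any of the words,
    the share that contains all of them. Works for 2, 3 or more words."""
    any_hits = sum(1 for d in corpus if any(w in d for w in words))
    all_hits = sum(1 for d in corpus if all(w in d for w in words))
    return all_hits / any_hits if any_hits else 0.0

# Toy corpus: each document is a list of tokens (made-up data).
corpus = [
    ["tf", "idf", "weights", "rare", "words"],
    ["rare", "words", "carry", "meaning"],
    ["the", "weights", "of", "words"],
]
phrase = ["rare", "words", "weights"]

classes = split_into_classes(phrase, corpus[0], corpus)
pair_strength = coupling_strength(["rare", "words"], corpus)
triple_strength = coupling_strength(["rare", "words", "weights"], corpus)
```

Here "words" occurs in every document, so its TF-IDF is zero and it falls into the last class, while the three-word link is weaker than the two-word link because fewer documents contain all three words together.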
About the authors
G. Emelyanov
Yaroslav-the-Wise Novgorod State University
Corresponding author.
Email: Gennady.Emelyanov@novsu.ru
Russia, Velikii Novgorod, 173003
D. Mikhailov
Yaroslav-the-Wise Novgorod State University
Email: Gennady.Emelyanov@novsu.ru
Russia, Velikii Novgorod, 173003
A. Kozlov
Yaroslav-the-Wise Novgorod State University
Email: Gennady.Emelyanov@novsu.ru
Russia, Velikii Novgorod, 173003