RusEmoLex: Russian Emotion Lexicon
- 作者: Iaroshenko P.V.1,2, Loukachevitch N.V.1,2
-
隶属关系:
- Lomonosov Moscow State University
- Ivannikov Institute for System Programming of the Russian Academy of Sciences
- 期: 卷 29, 编号 3 (2025)
- 页面: 659-687
- 栏目: Articles
- URL: https://journals.rcsi.science/2687-0088/article/view/332251
- DOI: https://doi.org/10.22363/2687-0088-44439
- EDN: https://elibrary.ru/CDQHHV
- ID: 332251
如何引用文章
全文:
详细
Emotion detection in natural language is an important area of computational linguistics. It involves the description, categorisation and systematization of emotion lexicon that can be further employed to enhance the performance of machine learning algorithms. The aim of the study is twofold: firstly, to provide a comprehensive description of the Russian-language resources pertaining to emotion vocabulary, and secondly, to create an emotion lexicon based on these resources. The study describes the following types of Russian-language sources: lexicographic resources (dictionaries and thesauri); data from the semantically tagged part of the Russian National Corpus; datasets consisting of texts labelled by emotion type; and word-emotion association lexicons. The methodology of creating RusEmoLex encompasses the following stages: the creation of initial lists of emotion words derived from available sources; the compilation of the lists; and the selection of lexis in RusEmoLex. A lexical unit has to meet the following criteria to be included in RusEmoLex: repeated occurrence in the sources (a lexical unit needs to be present in two or more sources); the quality of the sources (a lexical unit needs to be included either in a dictionary or a corpus resource and it should not be a translation); the class label (the lexical units have an emotional label established on the basis of the majority of sources). RusEmoLex is an open access resource which includes 1024 lexical units and can be used for linguistic studies and in the field of natural language processing.
作者简介
Polina Iaroshenko
Lomonosov Moscow State University; Ivannikov Institute for System Programming of the Russian Academy of Sciences
Email: polina.iaroshenko@yandex.ru
ORCID iD: 0000-0002-9425-5842
кандидат филологических наук, научный сотрудник лаборатории анализа информационных ресурсов Научноисследовательского вычислительного центра Московского государственного университета имени М.В. Ломоносова; младший научный сотрудник Института системного программирования им. В.П. Иванникова Российской академии наук. Сфера научных интересов: компьютерная семантика, обработка естественного языка, корпусная лингвистика
Moscow, RussiaNatalia Loukachevitch
Lomonosov Moscow State University; Ivannikov Institute for System Programming of the Russian Academy of Sciences
编辑信件的主要联系方式.
Email: louk_nat@mail.ru
ORCID iD: 0000-0002-1883-4121
доктор технических наук, ведущий научный сотрудник лаборатории анализа информационных ресурсов Научноисследовательского вычислительного центра Московского государственного университета имени М.В. Ломоносова; главный научный сотрудник Института системного программирования им. В.П. Иванникова Российской академии наук. Сфера научных интересов: компьютерная семантика, обработка естественного языка, искусственный интеллект
Moscow, Russia参考
- Апресян В.Ю. Речевые стратегии выражения эмоций в русском языке // Русский язык в научном освещении. 2010. № 2. С. 26–57. [Apresyan, Valetina. Yu. 2010. Rechevye strategii vyrazheniya emotsii v russkom yazyke (Speech strategies for expressing emotions in Russian). Russkiy yazyk v nauchnom osveshchenii 2 (20). 26–57. (In Russ.)].
- Апресян Ю.Д. Избранные труды. Т. 2: Интегральное описание языка и системная лексикография. М.: Школа «Языки русской культуры», 1995. [Apresian, Yury D. 1995. Izbrannye trudy. Integral’noe opisanie yazyka i sistemnaya leksikografiya (Selected Works. Integral Description of a Language and Systematic Lexicography). Vol. 2. Moscow: Shkola Iazyki russkoi kultury Publ., Moscow. (In Russ.)].
- Гладкова А.Н. Русская культурная семантика: Эмоции, ценности, жизненные установки. М.: Языки славянской культуры, 2010. [Gladkova, Anna 2010. Russkaya kul'turnaya semantika: Emotsii, tsennosti, zhiznennye ustanovki (Russian Cultural Sematics: Emotions, Values, Attitudes). Vol. 2. Мoscow: Iazyki slavianskoi kul’tury Publ. (In Russ.)].
- Иоанесян Е.Р. Исследования по семантике эмоциональных предикатов в типологическом аспекте. 2-е изд., расширенное и доп. М.; Ярославль: Издательство Канцлер, 2024. [Ioanesian, Evgenia R. 2024. Issledovaniya po semantike emotsional’nykh predikatov v tipologicheskom aspekte (Research on the Semantics of Emotional Predicates in the Typological Aspect). 2nd edn. Moscow; Yaroslavl: Kantsler Publ. (In Russ.)].
- Ионова С.В., Шаховский В.И. Проспекция лингвокультурологической теории эмоций Анны Вежбицкой // Russian Journal of Linguistics. 2018. Т. 22. No 4. С. 966–987. [Ionova, Svetlana & Viktor Shakhovsky. 2018. Anna Wierzbicka’s linguocultural theory of emotions in the development dynamics. Russian Journal of Linguistics 22 (4). 966–987(In Russ.)]. https://doi.org/10.22363/2312-9182-2018-22-4-966-987.
- Ионова С.В., Штеба А.А. Смешанные эмоции: к вопросу о лингвистической репрезентации и метаязыке описания // Вопросы психолингвистики. 2019. № 2 (40). С. 63–81. [Ionova, Svetlana V. & Alexey A. Shteba. 2019. Contradictory Emotions: The question of the linguistic representation and the metalanguage description. Journal of Psycholinguistics 2 (40). 63–81. (In Russ.)].
- Ионова С.В. Эмоциональная доминанта текста: некоторые лингвистические аспекты исследования // Вестник Волгоградского государственного университета. Сер. 2: Языкознание. 2023. Т. 22. № 1. С. 13–27. [Ionova, Svetlana V. 2023. Emotional dominant of the text: Some linguistic aspects of research. Science Journal of Volgograd State University. Linguistics 22 (1). 13–27. (In Russ.)].
- Котельников Е.В., Разова Е.В., Котельникова А.В., Вычегжанин С.В. Современные словари оценочной лексики для анализа мнений на русском и английском языках (аналитический обзор) // Научно-техническая информация. Серия 2: Информационные процессы и системы. 2020. № 12. С. 16–33. [Kotelnikov, Evgeny V., Elena V. Razova, Anastasia V. Kotelnikova, Sergey V. Vychegzhanin. 2020. Modern sentiment lexicons for opinion mining in English and Russian (analytical survey). Nauchno-tekhnicheskaya informaciya. Seria 2. Informacionnye processy i sistem 12. 16–33. (In Russ.)].
- Ларина Т.В. Прагматика эмоций в межкультурном контексте // Russian Journal of Linguistics. 2015. № 1. С. 144−163. [Larina, Tatiana V. 2015. Pragmatics of emotions in intercultural context. Russian Journal of Linguistics 1. 144−163. (In Russ.)].
- Ларина Т.В. Эмотивная экологичность и эмотивная вежливость в жанре английской и русской анонимной рецензии // Вопросы психолингвистики. 2019. T. 1. № 39. С. 38–57. [Larina, Tatiana V. 2015. Emotive ecology and emotive politeness in English and Russian: Blind peer-review. Journal of Psycholinguistics 1 (39). 38−57. (In Russ.)]. https://doi.org/10.30982/2077-5911-2019-39-1-38-57.
- Ларина Т.В. Эмотивная вежливость в лицеугрожающих речевых актах: кросс-культурный аспект // Слово.ру: балтийский акцент. 2025. Т. 16. №2. С. 118−135. [Larina, Tatiana V. Emotive politeness in face-threatening speech acts: Cross-cultural perspectives. Slovo.ru: Baltic Accent 16 (2). 118−135. (In Russ.)] https://doi.org/10.5922/2225-5346-2025-2-7.
- Озюменко В.И., Ларина Т.В. Искусственный интеллект в переводе: сильные и слабые стороны // Вестник Волгоградского государственного университета. 2025. Серия 2. Языкознание. 24 (1). C. 122−135. [Ozyumenko, Vladimir I. & Tatiana V. Larina. 2025. Artificial intelligence in translation: Advantages and limitations. Vestnik Volgogradskogo gosudarstvennogo universiteta. Seriya 2. Yazykoznanie / Science Journal of Volgograd State University. Linguistics 24 (1). 122−135. (In Russ.)]. https://doi.org/10.15688/jvolsu2.2025.1.10
- Падучева Е.В. Динамические модели в семантике лексики. М.: Языки славянской культуры, 2004. [Paducheva, Elena V. 2004. Dynamic Models in Lexical Semantics. Мoscow: Iazyki slavianskoi kul’tury Publ. (In Russ.)].
- Савчук С.О., Архангельский Т.А., Бонч-Осмоловская А.А., Донина О.В., Кузнецова Ю.Н., Ляшевская О.Н., Орехов Б.В., Подрядчикова М.В. Национальный корпус русского языка 2.0: новые возможности и перспективы развития // Вопросы языкознания. 2024. № 2. С. 7–34. [Savchuk, Svetlana O., Timofey Arkhangelskiy, Anastasiya A. Bonch-Osmolovskaya, Ol’ga V. Donina, Yuliya N. Kuznetsova, Ol’ga N. Lyashevskaya, Boris V. Orekhov & Mariya V. Podryadchikova. 2024. Russian National Corpus 2.0: New opportunities and development prospects. Voprosy yazykoznanija 2. 7–34. (In Russ.)].
- Сысоева Т.А., Люсин Д.В. Разработка расширенной базы данных с эмоциональными оценками существительных ENRuN-2: успехи, проблемы и перспективы // Психология познания: материалы Всероссийской научной конференции. ЯрГУ, 6–8 декабря 2024 г. / под ред. И.Ю. Владимирова, С.Ю. Коровкина. Ярославль: Филигрань, 2024. С. 316–320. [Sysoeva, Tatiana A. & Dmitrii V. Lyusin. 2024. Development of an extended database with emotional ratings of nouns ENRuN-2: Successes, problems and prospects. In Ilya Yu. Vladimirov, Sergey Yu. Korovkin (eds.), Psychology of cognition: Proceedings of the All-Russian Scientific Conference, YARSU, December 6-8, 316–320. Yaroslavl : YARSU. (In Russ.)].
- Шаховский В.И. Лингвистическая теория эмоций. М.: Гнозис, 2008. [Shakhovsky, Viktor I. 2008. Lingvisticheskaya teoriya emotsii (Linguistic theory of emotions). Moscow: Gnosis Publ. (In Russ.)].
- Шаховский В.И. Язык и эмоции в аспекте лингвокультурологии. Волгоград: Перемена, 2009. [Shakhovsky, Viktor I. 2008. Yazyk i emotsii v aspekte lingvokul’turologii (Language and emotions in the aspect of linguoculturology). Volgograd: Peremena Publ. (In Russ.)].
- Alba-Juez, Laura & Michael Haugh (eds.). 2025. The Sociopragmatics of Emotion. Cambridge: Cambridge University Press.
- Alba-Juez, Laura & Tatiana Larina. 2018. Language and emotion: Discourse-pragmatic perspectives. Russian Journal of Linguistics 22 (1). 9−37.
- Buechel, Sven, Susanna Rücker & Udo Hahn. 2020. Learning and evaluating emotion lexicons for 91 languages. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 1202–1217. http://doi.org/10.18653/v1/2020.acl-main.112
- Cavicchio, Federica. 2025. Emotion Detection in Natural Language Processing. Cham: Springer.
- Cortal, Gustave, Alain Finkel, Patrick Paroubek & Lina Ye. 2023. Emotion recognition based on psychological components in guided narratives for emotion regulation. Proceedings of the 7th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature. 72–81. http://doi.org/10.18653/v1/2023.latechclfl-1.8.
- Dalal, Dhairya, Gaurav Negi & Davide Picca. 2025. LLMs and Emotional Intelligence: Evaluating emotional understanding through psychometric tools. Proceedings of the 33rd ACM Conference on User Modeling, Adaptation and Personalization (UMAP ’25). 323–328. https://doi.org/10.1145/3699682.3728315.
- Demszky, Dorottya, Dana Movshovitz-Attias, Jeongwoo Ko, Alan Cowen, Gaurav Nemade & Sujith Ravi. 2020. GoEmotions: A dataset of fine-grained emotions. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 4040–4054. http://doi.org/10.18653/v1/2020.acl-main.372.
- Ekman, Paul. 1992. Are there basic emotions? Psychological Review 99 (3). 550–553.
- Gladkova, Anna. 2010. ‘Sympathy’, ‘compassion’, and ‘empathy’ in English and Russian: A linguistic and cultural analysis. Culture & Psychology 16 (2). 267–285.
- Gladkova, Anna. 2022. Emotions and attitudes in present day Russian through the prism of new words: Cultural semantics of zhest’ and related concepts. Russian Journal of Linguistics 26 (4). 970–994. https://doi.org/10.22363/2687-0088-32167.
- Izard, Carroll. E. 1991. The Psychology of Emotions. New York: Plenum.
- Larina, Tatiana & Douglas M. Ponton. 2022. I wanted to honour your journal, and you spat in my face: Emotive (im)politeness and face in the English and Russian blind peer review. Journal of Politeness Research 18 (1). 201–226. https://doi.org/10.1515/pr-2019-0035.
- Loukachevitch, Natalia & Anatolii Levchik. 2016. Creating a general Russian sentiment lexicon. Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC’16). 1171–1176.
- Kazyulina, Marina, Aleksandr Babii & Alexey Malafeev. 2021. Emotion classification in Russian: Feature engineering and analysis. Analysis of Images, Social Networks and Texts, AIST 2020, Lecture Notes in Computer Science, vol. 12602. 135–148.
- Kondratenko, Vladimir, Artem Sokolov, Nikolay Karpov, Oleg Kutuzov, Nikita Savushkin, & Fyodor Minkin. 2022. Large Raw Emotional Dataset with Aggregation Mechanism. https://arxiv.org/abs/2212.12266 (accessed 25 May 2025).
- Mackenzie, John Lachlan & Laura Alba-Juez (eds.). 2019. Emotion in Discourse [Pragmatics and Beyond New Series 302]. Amsterdam/Philadelphia: John Benjamins Publishing Company.
- Mohammad, Saif M. 2023. Best practices in the creation and use of emotion lexicons. Findings of the Association for Computational Linguistics: EACL 2023. 1825–1836. http://doi.org/10.18653/v1/2023.findings-eacl.136
- Mohammad, Saif M. & Peter D. Turney. 2013. Crowdsourcing a word-emotion association lexicon. Computational Intelligence 29 (3). 436–465. http://doi.org/10.1111/j.1467-8640.2012.00460.x
- Muhammad, Shamsuddeen Hassan, Nedjma Ousidhoum, Idris Abdulmumin et al. 2025. BRIGHTER: BRIdging the Gap in Human-annotated Textual Emotion Recognition. Datasets for 28 languages. https://arxiv.org/abs/2502.11926. (accessed 25 May 2025).
- Park Seo-Hui, Byung-Chull Bae & Yun-Gyung Cheong. 2020. Emotion recognition from text stories using an emotion embedding model. IEEE International Conference on Big Data and Smart Computing. Busan, Korea (South), 2020. 579–583. http://doi.org/10.1109/BigComp48618.2020.00014.
- Plaza-del-Arco, Flor Miriam, Alba A. Cercas Curry, Amanda Cercas Curry & Dirk Hovy. 2024. Emotion analysis in NLP: Trends, gaps and roadmap for future directions. Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation. 5696–5710.
- Plutchik, Robert. 1980. A general psycho-evolutionary theory of emotion. In Robert Plutchik & Henry Kellerman (eds.), Theories of emotion, 3–33. Cambridge: Academic Press.
- Sboev, Alexander, Aleksandr Naumov & Roman Rybka. 2021. Data-driven model for emotion detection in Russian texts. Procedia Computer Science 190. 637–642.
- Wang, Shuo, Aishan Maoliniyazi, Xinle Wu & Xiaofeng Meng. 2020. Emo2Vec: Learning emotional embeddings via multi-emotion category. ACM Transactions on Internet Technology 20 (2). 1–17. https://doi.org/10.1145/3372152.
- Wierzbicka, Аnna. 1992. Talking about emotions: Semantics, culture, and cognition. Cognition and Emotion 6 (3–4). 285–319.
- Wierzbicka, Anna. 1999. Emotions across Languages and Cultures. Diversity and Universals. Cambridge University Press.
- Wierzbicka, Anna. 2010. The semantics of emotions: Fear and it relatives in English. Australian Journal of Linguistics 2. 359–375.
- Wierzbicka, Anna. 2018. Emotions of Jesus. Russian Journal of Linguistics 22 (1). 38–53. doi.org/10.22363/2312-9182-2018-22-1-38-53.
- Zappettini, Franco, Douglas M. Ponton & Tatiana V. Larina. 2021. Emotionalisation of contemporary media discourse: A research agenda. Russian Journal of Linguistics 25 (3). 586–610. https://doi.org/10.22363/2687-0088-2021-25-3-586-610.
补充文件
