The numbers reveal the author: a stylometric comparison of German-language modernist texts

Cover Page

Cite item

Full Text

Abstract

The present study pertains to stylometry (and, more broadly, to quantitative linguistics). The novel quantitative method of studying the author's style of literary texts, based on the analysis of statistics of numerals found in them, is applied to literary texts in German. A computer program has been developed to search in the text for cardinal and ordinal numerals expressed both in numbers and verbally (in different word forms). The program automatically removes phraseological units and stable combinations from the text that accidentally (without the author's intention) contain numerals. Previously, the text is manually cleared of auxiliary numerals such as pagination, chapter numbers, etc. It is shown that the numerals used by the author in the (artistic) text are individual for each author; their totality is a characteristic feature (author's invariant, "fingerprint") that distinguishes the texts written by different authors. A comparative stylometric analysis of a number of literary works by Thomas Mann, Hermann Broch, Robert Musil, and Elias Canetti – the representatives of German-language literary modernism of the 20th century – is performed. Substantial authorial differences in the manner of using numerals were discovered. The results of the analysis were subjected to hierarchical clustering process (the Manhattan metric; Complete linkage and Between-groups methods). The cluster analysis correctly distributed the texts according to their authorship. The use of various clustering methods for text analysis enhances the significance of the results obtained and confirms their non-random nature. This demonstrates that the novel method of stylometry is able to accurately attribute literary texts to their correct authors.

References

  1. Зенков А. В. Новый метод стилеметрии на основе статистики числительных, Компьютерные исследования и моделирование, 2017, Т. 9, № 5. С. 837–850.
  2. Zenkov A.V. A Method of Text Attribution Based on the Statistics of Numerals // J. of Quantitative Linguistics. 2018. No. 25(3). Pp. 256–270.
  3. Zenkov A.V., Místecký M. The Romantic Clash: Influence of Karel Sabina over Mácha’s Cikáni from the Perspective of the Numerals Usage Statistics // Glottometrics. 2019, No. 46, Pp. 12–28.
  4. Zenkov A.V. Stylometry and Numerals Usage: Benford’s Law and Beyond // Stats 2021. No. 4. Pp. 1051–1068.
  5. Zenkov A., Místecký M. Young Vladimír Vašek? – A Numerals Analysis Contribution to the Bezruč−Hrzánský Identity Issue // Naše řeč, 2022. No. 105(3). Pp. 151–161.
  6. Зенков А.В. Литературные мистификации и авторское использование числительных // Филологические науки. Вопросы теории и практики. 2023. № 16(11). С. 3696–3709. URL: https://doi.org/10.30853/phil20230568.
  7. Zenkov A.V. Under a False Flag: Literary Hoaxes and the Use of Numerals // Litera. 2023. № 10. С. 86–109. doi: 10.25136/2409-8698.2023.10.68743 EDN: TYDRFD URL: https://e-notabene.ru/fil/article_68743.html
  8. Зенков А.В., Ермаков Н.Е. Числительные в текстах как характерная особенность авторского стиля // Russian Linguistic Bulletin. 2023. № 45(9). URL: https://doi.org/10.18454/RULB.2023.45.28.
  9. Stamatatos E. A survey of modern authorship attribution methods // J. Amer. Soc. for Information Science and Technology. 2009. No. 60(3). Pp. 538–556.
  10. Tempestt N., Kalaivani S., Aneez F., Yiming Y., Yingfei X., and Damon W. Surveying Stylometry Techniques and Applications // ACM Comput. Surv. 2017, No. 50(6), Article 86, 36 pages.
  11. Burrows J. Delta: a Measure of Stylistic Difference and a Guide to Likely Authorship / J. Burrows // Literary and Linguistic Computing. – 2002. – 17(3). – P. 267–287.
  12. La Inteligencia Artificial ayuda a descubrir una obra desconocida de Lope de Vega en los fondos de la BNE, Biblioteca Nacional de España, https://www.bne.es/es/noticias/inteligencia-artificial-ayuda-descubrir-obra-desconocida-lope-vega-fondos-bne (Accessed: October 25, 2024).
  13. Schröter, J. (2020). Die apokryphen Evangelien: Jesusüberlieferungen außerhalb der Bibel. Munich: C. H. Beck.
  14. Vickers, B. (2002). 'Counterfeiting' Shakespeare: Evidence, Authorship and John Ford's Funerall Elegye. Cambridge: Cambridge University Press.
  15. Сорокина М. Ю., Суперфин Г. Г. «Был такой писатель Агеев…»: версия судьбы или о пользе наивного биографизма // Минувшее: Исторический альманах. Вып. 16. М., СПб: Феникс-Атенеум, 1994. С. 265–289.
  16. Dammann, G. (ed.) (2012). B. Traven, Autor – Werk – Werkgeschichte. Würzburg: Königshausen & Neumann.
  17. Bellos, D. (2010). Romain Gary: A Tall Story. London: Harvill Secker.
  18. Hupertz, H. (2021). Wie eine Frau sich als Holocaust-Überlebende ausgab. Frankfurter Allgemeine, 23 November. Available at https://www.faz.net/aktuell/feuilleton/medien/frau-gab-sich-als-holocaust-ueberlebende-aus-dokumentation-bei-arte-17646920.html (Accessed: October 25, 2024).
  19. Arnold, H. L. Thomas Mann. München: Edition Text u. Kritik, 1976. ISBN: 9783921402221. 226 S.
  20. M. Travers, Thomas Mann. London: Macmillan Education, 1992. Pp. vii + 146. ISBN :‎ 978-0333517079.
  21. Thomas Mann-Handbuch: Leben – Werk – Wirkung, A. Blödorn, F. Marx (Eds.), DOI: https://doi.org/10.1007/978-3-476-05341-1, Verlag J.B. Metzler Stuttgart, Springer-Verlag Berlin Heidelberg 2015. ISBN 978-3-476-02456-5. IX + 425 pages.
  22. Thomas Mann: neue kulturwissenschaftliche Lektüren. S. Börnchen, G. Mein, G. Schmidt (Eds.), Wilhelm Fink Verlag, 2012. ISBN 9783846753897. 457 Seiten.
  23. C. Grawe, Sprache im Prosawerk. Beispiele von Goethe, Fontane, Thomas Mann, Bergengruen, Kleist und Johnson. Bonn: Bouvier Verlag Herbert Grundmann, 1987. ISBN: 9783416009584. 111 Seiten.
  24. Dowden, S. D.: Sympathy for the abyss: a study in the novel of German modernism: Kafka, Broch, Musil, and Thomas Mann. Tübingen: Niemeyer, 1986. ISBN 3-484-18090-0. 195 p.
  25. Nübel, B. Robert Musil – Essayismus als Selbstreflexion der Moderne, Berlin, New York: De Gruyter, 2006. URL: https://doi.org/10.1515/9783110201857. 548 S.
  26. Nübel, B. and Wolf, N. Ch. Robert-Musil-Handbuch, Berlin, Boston: De Gruyter, 2016. URL: https://doi.org/10.1515/9783110255577. 1064 S.
  27. Boelderl, A. R. and Neymeyr, B. Robert Musil im Spannungsfeld zwischen Psychologie und Phänomenologie, Berlin, Boston: De Gruyter, 2024. URL: https://doi.org/10.1515/9783110988352. 366 S.
  28. H. Bloom, Robert Musil's the Man Without Qualities. Chelsea House Publishers, 2005. ISBN 9780791081228. 211 pages.
  29. J. Bouveresse, La Voix de l'âme et les Chemins de l'esprit. Dix études sur Robert Musil. Éditions du Seuil, 2001, ISBN: 9782020362894.462 p.
  30. A Companion to the Works of Robert Musil, P. Payne, G. Bartram, and G. Tihanov (Eds.). Camden House, Rochester, New York. 2007. ISBN: 978–1–57113–110–2. 472 p.
  31. P. Payne. Robert Musil’s ‘The Man Without Qualities’: A Critical Study. Cambridge University Press, 1988. ISBN: 978-0-521-11060-0. 271 p.
  32. Th. Sebastian, The Intersection of Science and Literature in Musil's The Man Without Qualities. Camden House, an imprint of Boydell & Brewer Inc., Rochester, 2005. ISBN: 1–57113–116–7. 159 p.
  33. F. Schwarzwälder, Der Weltanschauungsroman 2. Ordnung: Probleme literarischer Modellbildung bei Hermann Broch und Robert Musil. transcript Verlag, Bielefeld, 2019, 372 Seiten. ISBN: 978-3-8376-4996-3.
  34. A Companion to the Works of Hermann Broch, G. Bartram, S. McGaughey and G. Tihanov (Eds.), 2019. Camden House, an imprint of Boydell & Brewer Inc., Rochester, ISBN: 9781571135414, 290 p.
  35. Hermann-Broch-Handbuch: Zeit – Werk – Forschung, M. Kessler, P. M. Lützeler (Eds.), De Gruyter, 2015. ISBN:‎ 978-3110200713. 685 S.
  36. Wohlleben, D. and Lützeler, P. M. (Eds.). Hermann Broch und die Romantik, Berlin, Boston: De Gruyter, 2014. https://doi.org/10.1515/9783110351958. 235 S.
  37. Hermann Broch, Visionary in Exile, The 2001 Yale Symposium, P. M. Lützeler, M. Konzett and W. Riemer (Eds.). Camden House, an imprint of Boydell & Brewer Inc., Rochester. ISBN: 9781571132727. 280 p.
  38. W. C. Donahue, The End of Modernism: Elias Canetti’s Auto-da-Fé. The University of North Carolina Press, 2001. ISBN: 978-1-4696-5742-4. 302 p.
  39. J. P. Arnason and D. Roberts, Elias Canetti's Counter-Image of Society: Crowds, Power, Transformation. Camden House, an imprint of Boydell & Brewer Inc., Rochester. 2004. ISBN: 9781571131607. 174 p.
  40. A Companion to the Works of Elias Canetti, D. C. G. Lorenz (Ed.). Camden House, an imprint of Boydell & Brewer Inc., Rochester. 2004. ISBN: 9781571134080. 364 p.
  41. J S Mcclelland, The Crowd and the Mob: From Plato to Canetti. Unwin Hyman Ltd, 2011. ISBN 9780415602495. 356 Pages.
  42. B. Neumann, G. Wimmer, Elias Canetti in seiner Zeit: Kulturelle, wissenschaftliche und politische Deskriptionen. J.B. Metzler, ein Imprint des Springer-Verlages, 2020. ISBN 978-3-476-05649-8. 264 S.
  43. Radaelli, G. Literarische Mehrsprachigkeit: Sprachwechsel bei Elias Canetti und Ingeborg Bachmann, Berlin: Akademie Verlag, 2011. https://doi.org/10.1524/9783050053592. 304 S.
  44. Th. Mann, Die Erzählungen, Zweiter Band. Fischer Taschenbuch Verlag GmbH, Frankfurt am Main. 1979.
  45. H. Broch, Gedichte. Kommentierte Werkausgabe, Band 8. Suhrkamp, Frankfurt, 1980. ISBN:‎ 978-3518370728. 244 S.
  46. Moisl H. Cluster Analysis for Corpus Linguistics. De Gruyter Mouton, 2015. – 381 p. ISBN:9783110350258.
  47. Gan G., Ma C., Wu J., Data Clustering: Theory, Algorithms, and Applications. Society for Industrial and Applied Mathematics, 2007. – 466 p. doi: 10.1137/1.9780898718348.

Supplementary files

Supplementary Files
Action
1. JATS XML

Согласие на обработку персональных данных

 

Используя сайт https://journals.rcsi.science, я (далее – «Пользователь» или «Субъект персональных данных») даю согласие на обработку персональных данных на этом сайте (текст Согласия) и на обработку персональных данных с помощью сервиса «Яндекс.Метрика» (текст Согласия).