Using data compression to build a method for statistically verified attribution of literary texts

Boris Ryabko, Nadezhda Savina

Research output: Contribution to journalArticlepeer-review

Abstract

We consider the problems of the authorship of literary texts in the framework of the quantitative study of literature. This article proposes a methodology for authorship attribution of literary texts based on the use of data compressors. Unlike other methods, the suggested one gives a possibility to make statistically verified results. This method is used to solve two problems of attribution in Russian literature.

Original languageEnglish
Article number1302
JournalEntropy
Volume23
Issue number10
DOIs
Publication statusPublished - Oct 2021

Keywords

  • Authorship attribution of literary texts
  • Data compression
  • Hypothesis testing
  • Quantitative study of literature

OECD FOS+WOS

  • 1.02 COMPUTER AND INFORMATION SCIENCES
  • 1.01 MATHEMATICS
  • 1.03 PHYSICAL SCIENCES AND ASTRONOMY

Fingerprint

Dive into the research topics of 'Using data compression to build a method for statistically verified attribution of literary texts'. Together they form a unique fingerprint.

Cite this