So What’s the Plan? Mining Strategic Planning Documents

Ekaterina Artemova, Tatiana Batura, Anna Golenkovskaya, Vitaly Ivanin, Vladimir Ivanov, Veronika Sarkisyan, Ivan Smurov, Elena Tutubalina

Research output: Chapter in Book/Report/Conference proceedingConference contributionResearchpeer-review


In this paper we present a corpus of Russian strategic planning documents, RuREBus. This project is grounded both from language technology and e-government perspectives. Not only new language sources and tools are being developed, but also their applications to e-government research. We demonstrate the pipeline for creating a text corpus from scratch. First, the annotation schema is designed. Next texts are marked up using human-in-the-loop strategy, so that preliminary annotations are derived from a machine learning model and are manually corrected. The amount of annotated texts is large enough to showcase what insights can be gained from RuREBus.

Original languageEnglish
Title of host publicationDigital Transformation and Global Society - 5th International Conference, DTGS 2020, Revised Selected Papers
EditorsDaniel A. Alexandrov, Alexander V. Boukhanovsky, Andrei V. Chugunov, Yury Kabanov, Olessia Koltsova, Ilya Musabirov
PublisherSpringer Science and Business Media Deutschland GmbH
Number of pages15
ISBN (Print)9783030652173
Publication statusPublished - 2020
Event5th International Conference on Digital Transformation and Global Society, DTGS 2020 - St. Petersburg, Russian Federation
Duration: 17 Jun 202019 Jun 2020

Publication series

NameCommunications in Computer and Information Science
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937


Conference5th International Conference on Digital Transformation and Global Society, DTGS 2020
CountryRussian Federation
CitySt. Petersburg


  • Named entity recognition
  • Relation extraction
  • Strategic planning documents


Dive into the research topics of 'So What’s the Plan? Mining Strategic Planning Documents'. Together they form a unique fingerprint.

Cite this