Extraction of explicit consumer intentions from social network messages

Ivan Pimenov, Natalia Salomatina

Research output: Publications in books, reports, collections, conference proceedings › Article in conference proceedings › Research › Peer-reviewed


In this paper we address the problem of automatically extracting facts from Russian texts. The facts under examination are the intentions of social network users to purchase certain goods or use certain services. The approach is machine learning on expert-annotated data. The training set for expert annotation consists of messages from the “VKontakte” social network, selected through the LeadScanner API. The proposed system of semantic tags distinguishes between various intentional blocks: objects, their different properties, and emphatic constructions. Pre-processing of the training set includes lemmatization and grammatical tagging with PyMorphy2. A directed graph is then constructed from the training set. Each node in this graph corresponds to an intentional block and stores its expert-assigned intentional tag together with the grammatical and/or lexical properties of its main word. The edges of the graph connect intentional blocks that are found in adjacent positions in the messages of the training set. Intention objects and their properties are extracted by analyzing the test set in accordance with the constructed graph. The test set also includes messages containing non-consumer intentions or no intentions at all. The macro-averaged precision and recall of intention extraction are 82% and 74%, respectively.
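
The graph-based step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the tag names (INTENT, PROPERTY, OBJECT), the toy training data, the START/END marker nodes, and the greedy matching without backtracking are all assumptions made for illustration. Tokens are assumed to be already lemmatized and POS-tagged (the paper uses PyMorphy2 for this; the sketch takes the result as given).

```python
from collections import defaultdict

# Hypothetical marker nodes for message boundaries (not from the paper).
START, END = ("START", ""), ("END", "")

# Toy annotated training messages. Each intentional block is a node
# (tag, key), where key is a lemma (lexical) or a POS label (grammatical).
TRAINING = [
    [("INTENT", "хотеть"), ("INTENT", "купить"), ("OBJECT", "NOUN")],
    [("INTENT", "искать"), ("PROPERTY", "ADJF"), ("OBJECT", "NOUN")],
]

def build_graph(messages):
    """Add a directed edge between blocks that are adjacent in a message."""
    graph = defaultdict(set)
    for blocks in messages:
        path = [START, *blocks, END]
        for src, dst in zip(path, path[1:]):
            graph[src].add(dst)
    return graph

def walk(graph, tokens):
    """Greedily follow the graph over (lemma, pos) tokens.

    Returns a list of (tag, lemma) pairs, or None when the message does
    not fit any path learned from the training set.
    """
    current, tagged = START, []
    for lemma, pos in tokens:
        candidates = sorted(graph.get(current, ()))
        nxt = next((n for n in candidates if n[1] in (lemma, pos)), None)
        if nxt is None:
            return None
        tagged.append((nxt[0], lemma))
        current = nxt
    return tagged if END in graph.get(current, ()) else None

graph = build_graph(TRAINING)
message = [("искать", "VERB"), ("красный", "ADJF"), ("платье", "NOUN")]
print(walk(graph, message))
```

A message whose tokens match no outgoing edge is rejected with `None`, which loosely corresponds to test messages that carry no consumer intention. A fuller version would need backtracking and richer node properties.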

Original language: English
Title of host publication: Analysis of Images, Social Networks and Texts - 7th International Conference, AIST 2018, Revised Selected Papers
Editors: Alexander Panchenko, Wil M. van der Aalst, Michael Khachay, Panos M. Pardalos, Vladimir Batagelj, Natalia Loukachevitch, Goran Glavaš, Dmitry I. Ignatov, Sergei O. Kuznetsov, Olessia Koltsova, Irina A. Lomazova, Andrey V. Savchenko, Amedeo Napoli, Marcello Pelillo
Publisher: Springer-Verlag GmbH and Co. KG
Number of pages: 7
ISBN (print): 9783030110260
Publication status: Published - 2018
Event: 7th International Conference on Analysis of Images, Social Networks and Texts, AIST 2018 - Moscow, Russian Federation
Duration: 5 Jul 2018 to 7 Jul 2018

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 11179 LNCS
ISSN (print): 0302-9743
ISSN (electronic): 1611-3349


Conference: 7th International Conference on Analysis of Images, Social Networks and Texts, AIST 2018
Country/Territory: Russian Federation

