An approach to filtering prohibited content on the web

E. A. Sidorova, I. S. Kononenko, Yu. A. Zagorulko

Research output: Contribution to journal, Conference article, peer-reviewed

2 Citations (Scopus)

Abstract

Legislative regulation of the content of information resources has sharpened the problem of automatically detecting and blocking prohibited content. We propose an approach to this problem in which thematic analysis of websites is complemented by genre analysis. Genre analysis identifies the activity carried out through a website and thus allows more accurate recognition and localization of illicit content. The decision on whether a website page contains prohibited content is based on both analysis of the page text and the results of thematic and genre analysis of the site as a whole. Software and Russian-language resources for detecting prohibited content on the topic "Drug addiction and drugs" have been developed.
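The decision rule described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the lexicon-based page score, the genre weights, the combination formula, the threshold, and all names (`SiteProfile`, `GENRE_WEIGHT`, `page_term_score`, `is_prohibited`) are assumptions made for illustration only. It shows only the general idea that page-level evidence is modulated by site-level thematic and genre evidence.

```python
from dataclasses import dataclass


@dataclass
class SiteProfile:
    """Hypothetical result of site-level analysis (names are assumptions)."""
    topic_score: float  # site-level thematic score in [0, 1]
    genre: str          # site genre label, e.g. "shop", "forum", "news"


# Assumed weights: how strongly each genre suggests illicit activity.
# These values are illustrative, not taken from the paper.
GENRE_WEIGHT = {"shop": 1.0, "forum": 0.7, "news": 0.2, "encyclopedia": 0.1}


def page_term_score(text: str, lexicon: set[str]) -> float:
    """Fraction of page tokens that appear in a topic lexicon (lowercase)."""
    tokens = [t.strip(".,!?;:\"'()") for t in text.lower().split()]
    if not tokens:
        return 0.0
    hits = sum(1 for t in tokens if t in lexicon)
    return hits / len(tokens)


def is_prohibited(page_text: str, site: SiteProfile, lexicon: set[str],
                  threshold: float = 0.4) -> bool:
    """Combine page-level and site-level evidence into one decision.

    The page score is scaled by a factor in [0.5, 1.0] derived from the
    site's thematic score and genre, so a page with no topical terms is
    never flagged on site evidence alone.
    """
    page = page_term_score(page_text, lexicon)
    site_factor = site.topic_score * GENRE_WEIGHT.get(site.genre, 0.5)
    combined = page * (0.5 + 0.5 * site_factor)
    return combined >= threshold


# Placeholder lexicon entries; a real lexicon would hold topic terms.
lexicon = {"terma", "termb"}
text = "buy termA termB now"
shop_result = is_prohibited(text, SiteProfile(0.9, "shop"), lexicon)
news_result = is_prohibited(text, SiteProfile(0.9, "news"), lexicon)
```

In this sketch the same page text leads to different decisions depending on the genre of the hosting site, which mirrors the role the abstract assigns to genre analysis.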

Original language: English
Pages (from-to): 64-71
Number of pages: 8
Journal: CEUR Workshop Proceedings
Volume: 2022
Publication status: Published - 1 Jan 2017

Keywords

  • Filtering prohibited content
  • Thematic text analysis
  • Website classification
  • Website genre analysis
