Abstract
A method for the automatic classification of scientific texts based on data compression is proposed. The method was implemented and evaluated on data from an archive of scientific texts (arXiv.org) and from the CyberLeninka scientific electronic library (CyberLeninka.ru). Experiments showed that the method correctly identified the topics of scientific texts with a probability of 75-95%; its accuracy depends on the quality of the original data.
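The abstract does not specify the compressor or the decision rule, so the following is only a minimal sketch of one common form of compression-based classification, not the authors' implementation. It assumes `zlib` as the compressor and assigns a document to the class whose reference corpus grows least, in compressed size, when the document is appended to it. The names `compressed_size`, `classify`, and `corpora` are hypothetical.

```python
import zlib


def compressed_size(data: bytes) -> int:
    """Return the length in bytes of the zlib-compressed input."""
    return len(zlib.compress(data, 9))


def classify(document: str, corpora: dict[str, str]) -> str:
    """Return the label whose corpus absorbs the document most cheaply.

    Assumption: the cost of a document under a class is the increase in
    compressed size when the document is appended to that class corpus.
    """
    if not corpora:
        raise ValueError("at least one class corpus is required")
    doc = document.encode("utf-8")

    def added_cost(label: str) -> int:
        base = corpora[label].encode("utf-8")
        return compressed_size(base + b"\n" + doc) - compressed_size(base)

    # The class with the smallest added cost shares the most repeated
    # substrings with the document, as seen by the compressor.
    return min(corpora, key=added_cost)
```

In such a setup, `corpora` would map each topic label to the concatenated text of documents already known to belong to that topic. Note that `zlib` only finds repetitions within a 32 KB window, so a sketch like this is meaningful only for small reference corpora; a compressor with a longer context would be needed for larger ones.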
Original language | English |
---|---|
Pages (from-to) | 120-126 |
Number of pages | 7 |
Journal | Automatic Documentation and Mathematical Linguistics |
Volume | 51 |
Issue number | 3 |
DOI | |
Status | Published - 1 Jun 2017 |
OECD FOS+WOS subject areas
- 1.02 COMPUTER AND INFORMATION SCIENCES