Development of parallel FRiS-Tax text document clustering algorithm based on MPI technology

M. E. Mansurova, V. B. Barakhnin, S. S. Aubakirov, Ye Khibatkhanuly, A. B. Mussina

Research output: Publications in periodicals › conference article

Abstract

This paper describes a parallel implementation of the FRiS-Tax algorithm for clustering text documents. The algorithm assesses the similarity between objects in a competitive situation, which leads to the concept of a function of competitive similarity (the FRiS-function). Attributes of the bibliographic description of documents are used as the scales for determining the similarity measures. Parallelization is applied at the step where a genetic algorithm tunes the coefficients of the similarity measure formula, as well as directly at the clustering step. The clustering algorithm is implemented on the high-performance MPJ Express platform. A quantitative evaluation of the execution time is performed, demonstrating the advantages of the parallel implementation of the algorithm.
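The abstract does not state the formulas, so the following is only a minimal sketch of the idea of competitive similarity. It assumes the form of the FRiS-function commonly given in the FRiS literature, F = (r_competitor - r_own) / (r_competitor + r_own), and a hypothetical weighted distance over bibliographic attributes in which each attribute contributes its coefficient when the two documents differ on it. The function names, attribute names, and coefficient values are illustrative assumptions, not the authors' implementation.

```python
from typing import Dict


def weighted_distance(doc_a: Dict[str, str], doc_b: Dict[str, str],
                      weights: Dict[str, float]) -> float:
    """Hypothetical distance: sum of coefficients of attributes that differ.

    The coefficients play the role of the values that the paper tunes
    with a genetic algorithm; the 0/1 per-attribute comparison is an
    assumption made only for this illustration.
    """
    return sum(w * (0.0 if doc_a[k] == doc_b[k] else 1.0)
               for k, w in weights.items())


def fris(r_own: float, r_competitor: float) -> float:
    """Competitive similarity (assumed standard form of the FRiS-function).

    r_own is the distance from the object to its own reference object,
    r_competitor is the distance to the nearest competing reference.
    The value lies in [-1, 1]: positive when the object is closer to its
    own reference, negative when it is closer to the competitor.
    """
    total = r_own + r_competitor
    if total == 0:
        return 0.0  # object coincides with both references
    return (r_competitor - r_own) / total


# Illustrative coefficients and documents (all values are made up).
weights = {"authors": 0.5, "keywords": 0.3, "journal": 0.2}
z = {"authors": "A", "keywords": "clustering", "journal": "J1"}
a = {"authors": "A", "keywords": "clustering", "journal": "J2"}
b = {"authors": "B", "keywords": "parsing", "journal": "J2"}

similarity = fris(weighted_distance(z, a, weights),
                  weighted_distance(z, b, weights))
print(similarity)
```

In this sketch the document z differs from a only in the journal and from b in every attribute, so the competitive similarity of z to a in competition with b is positive.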

Original language: English
Pages (from-to): 244-256
Number of pages: 13
Journal: CEUR Workshop Proceedings
Volume: 1576
Publication status: Published - 2016
Event: 10th Annual International Scientific Conference on Parallel Computing Technologies, PCT 2016 - Arkhangelsk, Russian Federation
Duration: 29 Mar 2016 to 31 Mar 2016

  • Cite this

    Mansurova, M. E., Barakhnin, V. B., Aubakirov, S. S., Khibatkhanuly, Y., & Mussina, A. B. (2016). Development of parallel FRiS-Tax text document clustering algorithm based on MPI technology. CEUR Workshop Proceedings, 1576, 244-256.