Discovery of the Similarities for Parasites

YILDIRIM P., ÇEKEN K.

14th Turkish National Software Engineering Symposium (UYMS), ELECTR NETWORK, 7 - 09 Ekim 2020, ss.59-62, (Tam Metin Bildiri)

Yayın Türü: Bildiri / Tam Metin Bildiri
Doi Numarası: 10.1109/uyms50627.2020.9247052
Basıldığı Ülke: ELECTR NETWORK
Sayfa Sayıları: ss.59-62
Anahtar Kelimeler: biomedical text mining, clustering analysis, k-means algorithm, parasites
Akdeniz Üniversitesi Adresli: Evet

Özet

In this paper we report on a study for discovering hidden patterns in commonly seen parasites by using abstracts from MEDLINE database. Parasites affect millions of people in the world and cause tremendous morbidity and mortality. Diagnosing parasites can be difficult because some symptoms and related to gene-proteins can be common to some of them. We utilize a web based biomedical text mining tool to find symptoms and gene-proteins. After selecting the most common symptoms and gene-proteins, we create two datasets with the frequencies of symptoms and gene-proteins for each parasite. For this work we selected the k-means algorithm for clustering analysis and apply it on the datasets. In addition, we compared different algorithms to observe the performance of k-means. Clustering analysis generated different types of groups of parasites. Although the results are not 100% certain, they can make positive contributions to medical researchers and experts for the diagnosis of parasites.