Discovery of the Similarities for Parasites


14th Turkish National Software Engineering Symposium (UYMS), ELECTR NETWORK, 7 - 09 October 2020, pp.59-62 identifier identifier

  • Publication Type: Conference Paper / Full Text
  • Doi Number: 10.1109/uyms50627.2020.9247052
  • Page Numbers: pp.59-62
  • Keywords: biomedical text mining, clustering analysis, k-means algorithm, parasites
  • Akdeniz University Affiliated: Yes


In this paper we report on a study for discovering hidden patterns in commonly seen parasites by using abstracts from MEDLINE database. Parasites affect millions of people in the world and cause tremendous morbidity and mortality. Diagnosing parasites can be difficult because some symptoms and related to gene-proteins can be common to some of them. We utilize a web based biomedical text mining tool to find symptoms and gene-proteins. After selecting the most common symptoms and gene-proteins, we create two datasets with the frequencies of symptoms and gene-proteins for each parasite. For this work we selected the k-means algorithm for clustering analysis and apply it on the datasets. In addition, we compared different algorithms to observe the performance of k-means. Clustering analysis generated different types of groups of parasites. Although the results are not 100% certain, they can make positive contributions to medical researchers and experts for the diagnosis of parasites.