Classification of News Texts by Categories Using Machine Learning Methods

Creative Commons License

Kayakuş M., Yiğit Açıkgöz F.

Alphanumeric Journal, vol.10, no.2, pp.155-166, 2022 (Peer-Reviewed Journal)

  • Publication Type: Article / Article
  • Volume: 10 Issue: 2
  • Publication Date: 2022
  • Doi Number: 10.17093/alphanumeric.1149753
  • Journal Name: Alphanumeric Journal
  • Journal Indexes: TR DİZİN (ULAKBİM)
  • Page Numbers: pp.155-166
  • Akdeniz University Affiliated: Yes


In parallel with the advances in technology, digital journalism is preferred more than printed journalism day by day. Due to the fast and up-to-date sense of journalism provided by digital journalism and its ubiquitous accessibility features, it is read more by users. In addition to these advantages provided by digital journalism, it also has some difficulties compared to printed journalism. The stage of preparation and delivery of the news to the user requires more technological knowledge and equipment compared to printed journalism. The processes of title selection, text creation, photo selection and determination of the appropriate news category in the preparation phase of the news are designed to be both faster and user-friendly compared to printed publishing. The news created to be presented to the target audience may belong to one or more of different categories such as economy, politics, sports, technology, and health. The inclusion of the news in the appropriate category provides convenience in terms of reaching the right audience and archiving the news correctly. In this study, news texts were classified according to their categories based on the machine learning methods. In the study, news of five newspapers in three different categories were used. Bayesian classifier and decision tree methods were used to classify the news in the dataset including a total of 10.500 news. In the results of the study, it was observed that the Bayesian classifier classified the news more successfully according to their categories.