Indonesian News Classification Using Weighted K-nearest Neighbour

Authors

  • Muhammad Ihsan Amien Ismandiya Telkomuniversity
  • Yuliant Sibaroni Telkomuniversity
  • Niken Dwi Wahyu Cahyani Telkomuniversity

Abstract

Abstract News is one means of information for the general public. Today, the number of news articles that reach 2 million articles per day can make it difficult for users to find news articles they want to read. In order to make it easier for users, most Indonesian newspapers classify their articles into certain categories, but there are also many blogs, or amateur articles that have not classified the news they circulated. Therefore this paper aims to categorize Indonesian language news using the weighted k-nearest neighbor method. In this paper there are several stages in classifying the news, namely preprocessing, feature extraction, and classification using wKNN. The study used the wK-NN method where K = 6. In this study feature extraction was carried out in unigram and bigram which resulted in accuracy that was not much different. So it is recommended to use unigram because it is more efficient

Keywords: Classification, Prepocessing, feature extraction, weighted K-nearest neighbor

Downloads

Published

2020-08-01

Issue

Section

Program Studi S1 Informatika