A Systematic Review on Stopword Removal Algorithms

Main Article Content

Jashanjot Kaur, Preetpal Kaur Buttar

Abstract

Stopwords, also known as noise words, are the words that contain a little information which is not usually required. Stopwords were discovered by H.P. Luhn in 1958. In the domain of information retrieval, an effective indexing can be achieved by removing the stopwords. Indexing is a technique of connecting or tagging documents with different search terms or criteria. The main motive behind the elimination of stopwords is to increase the execution speed and the accuracy. It not only decreases the vector space but also helps to improve overall performance. It also helps in reducing the size of text. Till now, techniques for automatic stopwords removal have been developed for languages such as English, Sanskrit, Arabic, Chinese, etc. In this paper, we discuss the different techniques which have been used by the researchers to construct automated stopword lists in different languages.

Article Details

How to Cite
, J. K. P. K. B. (2018). A Systematic Review on Stopword Removal Algorithms. International Journal on Future Revolution in Computer Science &Amp; Communication Engineering, 4(4), 207–210. Retrieved from http://www.ijfrcsce.org/index.php/ijfrcsce/article/view/1499
Section
Articles