Measuring Semantic Similarity among Text Snippets and Page Counts in Data Mining

V. Sobana, Mr. T. Muthusamy, Mrs. K. K. Kavitha

PDF

Published: Nov 30, 2017

V. Sobana, Mr. T. Muthusamy, Mrs. K. K. Kavitha

Abstract

Measuring the semantic similarity between words is an important component in various tasks on the web such as relation extraction, community mining, document clustering, and automatic metadata extraction. Despite the usefulness of semantic similarity measures in these applications, accurately measuring semantic similarity between two words (or entities) remains a challenging task. We propose an empirical method to estimate semantic similarity using page counts and text snippets retrieved from a web search engine for two words. Specifically, we define various word co-occurrence measures using page counts and integrate those with lexical patterns extracted from text snippets. To identify the numerous semantic relations that exist between two given words, we propose a novel pattern extraction algorithm and a pattern clustering algorithm. The optimal combination of page counts-based co-occurrence measures and lexical pattern clusters is learned using support vector machines. The proposed method outperforms various baselines and previously proposed web-based semantic similarity measures on three benchmark data sets showing a high correlation with human ratings. Moreover, the proposed method significantly improves the accuracy in a community mining task.

How to Cite

, V. S. M. T. M. M. K. K. K. (2017). Measuring Semantic Similarity among Text Snippets and Page Counts in Data Mining. International Journal on Future Revolution in Computer Science &Amp; Communication Engineering, 3(11), 383–389. Retrieved from http://www.ijfrcsce.org/index.php/ijfrcsce/article/view/319

Issue

Vol. 3 No. 11 (2017): November (2017) Issue

Section

Articles

Measuring Semantic Similarity among Text Snippets and Page Counts in Data Mining

Abstract

Contact Us:

Auricle Global Society of Education and Research
Y-18-A, Near Sanskar Play School,
Sudarshana Nagar,
Bikaner. Rajasthan (India).
Pin 334004

Article Sidebar

Main Article Content

Abstract

Article Details