Semantic Similarity Measure in Biomedical Domain Leverage Web Search Engine

Chi-Huang Chen, Sheau-Ling Hsieh, Yung-Ching Weng, Wen-Yung Chang, Feipei Lai

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

6 Scopus citations


Semantic similarity measure plays an essential role in Information Retrieval and Natural Language Processing. In this paper we propose a page-count-based semantic similarity measure and apply it in biomedical domains. Previous researches in semantic web related applications have deployed various semantic similarity measures. Despite the usefulness of the measurements in those applications, measuring semantic similarity between two terms remains a challenge task. The proposed method exploits page counts returned by the Web Search Engine. We define various similarity scores for two given terms P and Q, using the page counts for querying P, Q and P AND Q. Moreover, we propose a novel approach to compute semantic similarity using lexico-syntactic patterns with page counts. These different similarity scores are integrated adapting support vector machines, to leverage the robustness of semantic similarity measures. Experimental results on two datasets achieve correlation coefficients of 0.798 on the dataset provided by A. Hliaoutakis, 0.705 on the dataset provide by T. Pedersen with physician scores and 0.496 on the dataset provided by T. Pedersen et al. with expert scores.
Original languageEnglish
Title of host publication32nd Annual International Conference of the IEEE Engineering-in-Medicine-and-Biology-Society (EMBC 10)
Number of pages4
StatePublished - 2010

Fingerprint Dive into the research topics of 'Semantic Similarity Measure in Biomedical Domain Leverage Web Search Engine'. Together they form a unique fingerprint.

Cite this