Analysis of word co-occurrence in human literature for supporting semantic correspondence discovery

Authors Jorge Martínez Gil
Mario Pichler
Editors
Title Analysis of word co-occurrence in human literature for supporting semantic correspondence discovery
Booktitle Proceedings of the 14th International Conference on Knowledge Technologies and Data-driven Business - i-Know 2014
Type in proceedings
ISBN 978-1-4503-2769-5
Month November
Year 2014
Pages DOI 10.1145/2637748.2638422
SCCH ID# 1434
Abstract

Semantic similarity measurement aims to determine the likeness between two text expressions that use different lexicographies for representing the same real object or idea. In this work, we describe the way to exploit broad cultural trends for identifying semantic similarity. This is possible through the quantitative analysis of a vast digital book collection representing the digested history of humanity. Our research work has revealed that appropriately analyzing the co-occurrence of words in some periods of human literature can help us to determine the semantic similarity between these words by means of computers with a high degree of accuracy.