Analysis of word co-occurrence in human literature for supporting semantic correspondence discovery
Jorge Martínez Gil
|Title||Analysis of word co-occurrence in human literature for supporting semantic correspondence discovery|
|Booktitle||Proceedings of the 14th International Conference on Knowledge Technologies and Data-driven Business - i-Know 2014|
Semantic similarity measurement aims to determine the likeness between two text expressions that use different lexicographies for representing the same real object or idea. In this work, we describe the way to exploit broad cultural trends for identifying semantic similarity. This is possible through the quantitative analysis of a vast digital book collection representing the digested history of humanity. Our research work has revealed that appropriately analyzing the co-occurrence of words in some periods of human literature can help us to determine the semantic similarity between these words by means of computers with a high degree of accuracy.