Semantic Similarities between Locations based on Ontology
SHERWANI, MOIZ KHAN
Francesco Calimeri
2017-01-01
Abstract
Toponym disambiguation, or location name resolution, is a critical task when processing unstructured text such as articles and other documents. Our research explores how to link ambiguous location names mentioned in documents and news articles to latitude/longitude coordinates. We designed an evaluation system for toponym disambiguation based on annotated GeoCLEF data. We implemented a node-based approach that takes population into account and an approach based on geographic distance. We propose a new approach based on the edges between pairs of toponyms in an ontology, which also takes the population attribute into account. Our edge-based approach gave better results than the population-only and distance-only approaches. The results could be used in any information system that deals with texts containing geographic locations, such as news texts.
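To make the two baseline ideas concrete, the following is a minimal sketch of population-based and distance-based candidate selection. It is not the authors' implementation: the `Candidate` type, the function names, and the use of the mean haversine distance to already-resolved context locations are all assumptions made for illustration, and the coordinates and populations are rough placeholder values rather than data from the evaluation. The edge-based approach is not sketched because the abstract does not specify how edges are scored.

```python
from dataclasses import dataclass
from math import asin, cos, radians, sin, sqrt

EARTH_RADIUS_KM = 6371.0


@dataclass(frozen=True)
class Candidate:
    """One possible referent of an ambiguous toponym (hypothetical type)."""
    name: str
    lat: float
    lon: float
    population: int


def haversine_km(a: Candidate, b: Candidate) -> float:
    """Great-circle distance between two candidates in kilometres."""
    lat1, lon1, lat2, lon2 = map(radians, (a.lat, a.lon, b.lat, b.lon))
    h = sin((lat2 - lat1) / 2) ** 2 + cos(lat1) * cos(lat2) * sin((lon2 - lon1) / 2) ** 2
    return 2 * EARTH_RADIUS_KM * asin(sqrt(h))


def pick_by_population(candidates: list[Candidate]) -> Candidate:
    """Population baseline: prefer the most populous candidate."""
    return max(candidates, key=lambda c: c.population)


def pick_by_distance(candidates: list[Candidate], context: list[Candidate]) -> Candidate:
    """Distance baseline (assumed form): prefer the candidate with the
    smallest mean distance to locations already resolved in the same text."""
    def mean_distance(c: Candidate) -> float:
        return sum(haversine_km(c, other) for other in context) / len(context)
    return min(candidates, key=mean_distance)


# Illustrative placeholder values only (approximate, not from the paper).
paris_candidates = [
    Candidate("Paris, France", 48.8566, 2.3522, 2_100_000),
    Candidate("Paris, Texas", 33.6609, -95.5555, 25_000),
]
dallas = Candidate("Dallas, Texas", 32.7767, -96.7970, 1_300_000)

by_population = pick_by_population(paris_candidates)
by_distance = pick_by_distance(paris_candidates, [dallas])
```

In this toy case the two baselines disagree: the population rule selects the French capital, while a text that also mentions Dallas pulls the distance rule toward the Texan town. This kind of disagreement is what motivates combining ontology relations with population, as the abstract proposes.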