Show simple item record

dc.contributor.authorSantos, Rui
dc.contributor.authorMurrieta-Flores, Patricia
dc.contributor.authorCalado, Pável
dc.contributor.authorMartins, Bruno
dc.date.accessioned2019-10-14T00:42:46Z
dc.date.available2019-10-14T00:42:46Z
dc.date.issued2017-10-31
dc.identifierdoi: 10.1080/13658816.2017.1390119
dc.identifier.citationSantos, R., Murrieta-Flores, P., Calado, P., & Martins, B. (2018). Toponym matching through deep neural networks. International Journal of Geographical Information Science, 32(2), 324-348. https://doi.org/10.1080/13658816.2017.1390119
dc.identifier.issn1365-8816
dc.identifier.doi10.1080/13658816.2017.1390119
dc.identifier.urihttp://hdl.handle.net/10034/622710
dc.descriptionThis article is not available on ChesterRep
dc.description.abstractToponym matching, i.e. pairing strings that represent the same real-world location, is a fundamental problemfor several practical applications. The current state-of-the-art relies on string similarity metrics, either specifically developed for matching place names or integrated within methods that combine multiple metrics. However, these methods all rely on common sub-strings in order to establish similarity, and they do not effectively capture the character replacements involved in toponym changes due to transliterations or to changes in language and culture over time. In this article, we present a novel matching approach, leveraging a deep neural network to classify pairs of toponyms as either matching or nonmatching. The proposed network architecture uses recurrent nodes to build representations from the sequences of bytes that correspond to the strings that are to be matched. These representations are then combined and passed to feed-forward nodes, finally leading to a classification decision. We present the results of a wide-ranging evaluation on the performance of the proposed method, using a large dataset collected from the GeoNames gazetteer. These results show that the proposed method can significantly outperform individual similarity metrics from previous studies, as well as previous methods based on supervised machine learning for combining multiple metrics.
dc.publisherTaylor & Francis
dc.relation.urlhttps://www.tandfonline.com/doi/full/10.1080/13658816.2017.1390119
dc.sourcepissn: 1365-8816
dc.sourceeissn: 1362-3087
dc.subjectGeography, Planning and Development
dc.subjectLibrary and Information Sciences
dc.subjectInformation Systems
dc.titleToponym matching through deep neural networks
dc.typeArticle
dc.identifier.eissn1365-8824
dc.contributor.departmentUniversity of Lisbon; University of Chester
dc.identifier.journalInternational Journal of Geographical Information Science
dc.date.updated2019-10-14T00:42:46Z
rioxxterms.funderFunder: Fundação para a Ciência e a Tecnologia; FundRef: 10.13039/501100001871; Grant(s): PTDC/EEI-SCR/1743/2014
rioxxterms.funderFunder: Trans-Atlantic Platform for the Social Sciences and Humanities; Grant(s): HJ-253525


This item appears in the following Collection(s)

Show simple item record