Paperback Edition
Paperback
289 pages
$25.95
Choose vendor to order paperback edition
BrownWalker Press
Amazon.com
Barnes & Noble
Harvard Book Store
Return policy
PDF eBook
Entire PDF eBook
5492k
$15
Get instant access to an entire eBook
Buy PDF Password
Download Complete PDF
eBook editions
Toponym Resolution in Text
Annotation, Evaluation and Applications of Spatial Grounding of Place Names
Publisher: Dissertation
Pub date: 2008
Pages: 289
ISBN-10: 1581123841
ISBN-13: 9781581123845
Categories: Computer Science Computers
Abstract
Toponym resolution is the task of computing the mapping from occurrences of place names found in a text to an unambiguous spatial footprint of the location referred to, such as a geographic latitude/longitude centroid. The task is difficult to automate because geographic databases are insufficient and error-prone, and because place names are highly ambiguous: common words need to be distinguished from proper names (geo/non-geo ambiguity), and the mapping between names and locations is ambiguous (London can refer to the capital of the UK, to London, Ontario, Canada, or to about forty other Londons on earth). This thesis investigates how referentially ambiguous spatial named entities can be grounded, or resolved, with respect to an extensional coordinate model robustly on open-domain news text by collecting a repertoire of linguistic heuristics and extra-linguistic knowledge sources such as population. I then investigate how to combine these sources of evidence to obtain a superior method. Noise effects introduced by the named entity tagging that toponym resolution relies on are also studied. While a few attempts have been made to solve toponym resolution, these were either not evaluated, or evaluated by manual inspection of system output rather than against a re-usable reference corpus. A systematic comparison leads to an inventory of heuristics and other sources of evidence. A comparative evaluation requires an evaluation resource, so a reference gazetteer and an associated novel reference corpus with human-labelled referent annotation were created for this thesis. They are used to benchmark a selection of the reconstructed algorithms and a novel re-combination of the heuristics catalogued in the inventory.
Performance of the same resolution algorithms is compared under different conditions, namely applying them to the output of human named entity annotation and to automatic annotation produced by an existing Maximum Entropy sequence tagging model.
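To make the idea of a population-based knowledge source concrete, here is a minimal sketch in Python of one simple heuristic: given an ambiguous place name, pick the candidate with the largest population. This is not the thesis's algorithm or data. The `Candidate` type, the `GAZETTEER` dictionary, and the `resolve_by_population` function are hypothetical names introduced for illustration, and the coordinates and population figures are rough, rounded values chosen only to show the mechanism.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass(frozen=True)
class Candidate:
    """One possible referent of a place name (hypothetical record type)."""
    name: str
    region: str
    lat: float        # latitude of the centroid, in degrees
    lon: float        # longitude of the centroid, in degrees
    population: int   # rough figure, for illustration only


# Toy gazetteer: approximate, rounded values, NOT taken from the thesis.
GAZETTEER: Dict[str, List[Candidate]] = {
    "London": [
        Candidate("London", "England, UK", 51.51, -0.13, 8_800_000),
        Candidate("London", "Ontario, Canada", 42.98, -81.25, 420_000),
    ],
}


def resolve_by_population(
    toponym: str, gazetteer: Dict[str, List[Candidate]] = GAZETTEER
) -> Optional[Candidate]:
    """Return the most populous candidate for the toponym, or None if unknown."""
    candidates = gazetteer.get(toponym, [])
    if not candidates:
        return None
    return max(candidates, key=lambda c: c.population)


if __name__ == "__main__":
    best = resolve_by_population("London")
    if best is not None:
        print(f"{best.name} -> {best.region} ({best.lat}, {best.lon})")
```

A heuristic like this ignores the surrounding text entirely, which is one reason to combine it with linguistic evidence, as the abstract describes.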
About the Author
Jochen L. Leidner holds an M.A. in computational linguistics, English and computer science from the University of Erlangen, an M.Phil. in computer speech, text and Internet technologies from the University of Cambridge, and a Ph.D. from the University of Edinburgh. His thesis work won the first ACM SIGIR Doctoral Consortium Award 2004.