Paperback Edition
Paperback
289 pages
$25.95
Choose vendor to order paperback edition
BrownWalker Press Amazon.com Barnes & Noble Harvard Book Store Return policy
PDF eBook
Sample Preview
Size 249k
Free
Download a sample of the first 25 pages
Download Preview

Entire PDF eBook
5492k
$15
Get instant access to an entire eBook
Buy PDF Password Download Complete PDF
eBook editions

Toponym Resolution in Text

Annotation, Evaluation and Applications of Spatial Grounding of Place Names

small book icon  Paperback   small ebook icon   eBook PDF
Publisher:  Dissertation
Pub date:  2008
Pages:  289
ISBN-10:  1581123841
ISBN-13:  9781581123845
Categories:  Computer Science  Computers  

Abstract

The problem of automatic toponym resolution, or computing the mapping from occurrences of names for places as found in a text to an unambiguous spatial footprint of the location referred to, such as a geographic latitude/longitude centroid is difficult to automate due to insufficient and error-prone geographic databases, and a large degree of place name ambiguity: common words need to be distinguished from proper names (geo/non-geo ambiguity), and the mapping between names and locations is ambiguous (London can refer to the capital of the UK or to London, Ontario, Canada, or to about forty other Londons on earth).

This thesis investigates how referentially ambiguous spatial named entities can be grounded, or resolved, with respect to an extensional coordinate model robustly on open-domain news text by collecting a repertoire of linguistic heuristics and extra-linguistic knowledge sources such as population. I then investigate how to combine these sources of evidence to obtain a superior method. Noise effects introduced by the named entity tagging that toponym resolution relies on are also studied. While few attempts have been made to solve toponym resolution, these were either not evaluated, or evaluation was done by manual inspection of system output instead of creating a re-usable reference corpus. A systematic comparison leads to an inventory of heuristics and other sources of evidence. In order to carry out a comparative evaluation procedure, an evaluation resource is required, so a reference gazetteer and an associated novel reference corpus with human-labelled referent annotation were created for this thesis, to be used to benchmark a selection of the reconstructed algorithms and a novel re-combination of the heuristics catalogued in the inventory. Performance of the same resolution algorithms is compared under different conditions, namely applying it to the output of human named entity annotation and automatic annotation using an existing Maximum Entropy sequence tagging model.

About the Author

Jochen L. Leidner holds an M.A. in computational linguistics, English and computer science from the University of Erlangen, an M.Phil. in computer speech, text and Internet technologies from the University of Cambridge, and a Ph.D. from the University of Edinburgh. His thesis work won the first ACMSIGIR Doctoral Consortium Award 2004.



Paperback Edition
Paperback
289 pages
$25.95
Choose vendor to order paperback edition
BrownWalker Press Amazon.com Barnes & Noble Harvard Book Store Return policy
PDF eBook
Sample Preview
Size 249k
Free
Download a sample of the first 25 pages
Download Preview

Entire PDF eBook
5492k
$15
Get instant access to an entire eBook
Buy PDF Password Download Complete PDF
eBook editions
Share this book



Relevant events
DEC
27
MCVR 2024
2024 International Conference on Measurement, Communication and Virtual Reality (MCVR 2024) Publication: Submitted paper will be peer reviewed by technical committee, and accepted pape...
27 - 29 Dec 2024
Harbin, China
DEC
28
ITCAU 2024
2nd International Conference on Information Technology, Control and Automation (ITCAU 2024) 2nd International Conference on Information Technology, Control and Automation (ITCAU 2024) ...
28 - 29 Dec 2024
, United Arab Emirates
JAN
3
ICIGP 2025
2025 The 8th International Conference on Image and Graphics Processing (ICIGP 2025) Publication: Submitted papers will be peer reviewed by conference committees, and accepted p...
03 - 05 Jan 2025
Macau, China
JAN
10
AEIT 2025
2025 6th International Conference on Advances in Education and Information Technology (AEIT 2025) Publication: Accepted and presented papers of AEIT 2025 will be published as a volume of Spr...
10 - 12 Jan 2025
Fukuoka, Japan
JAN
10
ACIE 2025
2025 The 5th Asia Conference on Information Engineering (ACIE 2025) Proceedings: Accepted papers that fall within the technical scope of the IEEE will be publis...
10 - 12 Jan 2025
Phuket, Thailand
JAN
10
IPMV 2025
2025 7th International Conference on Image Processing and Machine Vision (IPMV 2025) Publication: Accepted and presented papers will be published into Conference Proceedings by ...
10 - 12 Jan 2025
Hong Kong, China
JAN
10
APCT 2025
2025 4th Asia-Pacific Computer Technologies Conference (APCT 2025) Publication: Accepted papers will be published into APCT 2025 conference proceedings by AIP,...
10 - 12 Jan 2025
Bangkok, Thailand
JAN
10
ICMSS 2025
2025 the 9th International Conference on Management Engineering, Software Engineering and Service Sciences (ICMSS 2025) Publication: Accepted and presented papers will be published into ICMSS 2025 Conference Publ...
10 - 12 Jan 2025
Bangkok, Thailand
JAN
10
APIT 2025
2025 7th Asia Pacific Information Technology Conference (APIT 2025) Accepted and registered papers can be publishe in the ACM international conference proceeding...
10 - 12 Jan 2025
Hong Kong, China
JAN
10
CVCI 2025
2025 6th International Conference on Computer Vision and Computational Intelligence (CVCI 2025) Accepted papers will be published in the ACM Conference Proceedings, which will be indexed b...
10 - 12 Jan 2025
Hong Kong, China