Paperback Edition

Paperback

289 pages

$25.95

Choose vendor to order paperback edition

BrownWalker Press Amazon.com Barnes & Noble Return policy

PDF eBook

Sample Preview

Size 249k

Free

Download a sample of the first 25 pages

Download Preview

Entire PDF eBook

5492k

$15

Get instant access to an entire eBook

Buy PDF Password Download Complete PDF

eBook editions

Google Books Apple iBook CompleteBooks PDF

Books

Computer Science

Purchase Options Free Preview

Toponym Resolution in Text

Annotation, Evaluation and Applications of Spatial Grounding of Place Names

by Jochen L. Leidner

Paperback

eBook PDF

Publisher:	Dissertation
Pub date:	2008
Pages:	289
ISBN-10:	1581123841
ISBN-13:	9781581123845
Categories:	Computer Science Computers

Abstract

The problem of automatic toponym resolution, or computing the mapping from occurrences of names for places as found in a text to an unambiguous spatial footprint of the location referred to, such as a geographic latitude/longitude centroid is difficult to automate due to insufficient and error-prone geographic databases, and a large degree of place name ambiguity: common words need to be distinguished from proper names (geo/non-geo ambiguity), and the mapping between names and locations is ambiguous (London can refer to the capital of the UK or to London, Ontario, Canada, or to about forty other Londons on earth).

This thesis investigates how referentially ambiguous spatial named entities can be grounded, or resolved, with respect to an extensional coordinate model robustly on open-domain news text by collecting a repertoire of linguistic heuristics and extra-linguistic knowledge sources such as population. I then investigate how to combine these sources of evidence to obtain a superior method. Noise effects introduced by the named entity tagging that toponym resolution relies on are also studied. While few attempts have been made to solve toponym resolution, these were either not evaluated, or evaluation was done by manual inspection of system output instead of creating a re-usable reference corpus. A systematic comparison leads to an inventory of heuristics and other sources of evidence. In order to carry out a comparative evaluation procedure, an evaluation resource is required, so a reference gazetteer and an associated novel reference corpus with human-labelled referent annotation were created for this thesis, to be used to benchmark a selection of the reconstructed algorithms and a novel re-combination of the heuristics catalogued in the inventory. Performance of the same resolution algorithms is compared under different conditions, namely applying it to the output of human named entity annotation and automatic annotation using an existing Maximum Entropy sequence tagging model.

About the Author

Jochen L. Leidner holds an M.A. in computational linguistics, English and computer science from the University of Erlangen, an M.Phil. in computer speech, text and Internet technologies from the University of Cambridge, and a Ph.D. from the University of Edinburgh. His thesis work won the first ACMSIGIR Doctoral Consortium Award 2004.

Paperback Edition

Paperback

289 pages

$25.95

Choose vendor to order paperback edition

BrownWalker Press Amazon.com Barnes & Noble Return policy

PDF eBook

Sample Preview

Size 249k

Free

Download a sample of the first 25 pages

Download Preview

Entire PDF eBook

5492k

$15

Get instant access to an entire eBook

Buy PDF Password Download Complete PDF

eBook editions

Google Books Apple iBook CompleteBooks PDF

Share this book

Relevant events

AUG

18 - 22 Aug 2024 Santa Barbara, United States

APR

26 - 29 Apr 2024 Bali, Indonesia

APR

26 - 29 Apr 2024 Bali, Indonesia

APR

27 - 29 Apr 2024 Singapore, Singapore

APR

27 - 28 Apr 2024 Copenhagen, Denmark

APR

27 - 28 Apr 2024 Copenhagen, Denmark, Denmark

APR

27 - 28 Apr 2024 Copenhagen, Denmark

APR

27 - 28 Apr 2024 Copenhagen, Denmark

APR

27 - 28 Apr 2024 Copenhagen, Denmark, Denmark

APR

27 - 28 Apr 2024 Copenhagen, Denmark