Using co-occurrence models for place name disambiguation

From WikiLit
Jump to: navigation, search
Publication (help)
Using co-occurrence models for place name disambiguation
Authors: Simon Overell, Stefan Rüger [edit item]
Citation: International Journal of Geographical Information Science 22 (3): 265-87. 2008 March.
Publication type: Journal article
Peer-reviewed: Yes
Database(s):
DOI: 10.1080/13658810701626236.
Google Scholar cites: Citations
Link(s): Paper link
Added by Wikilit team: Added on initial load
Search
Article: Google Scholar BASE PubMed
Other scholarly wikis: AcaWiki Brede Wiki WikiPapers
Web search: Bing Google Yahoo!Google PDF
Other:
Services
Format: BibTeX
Using co-occurrence models for place name disambiguation is a publication by Simon Overell, Stefan Rüger.


[edit] Abstract

This paper describes the generation of a model capturing information on how place names co-occur together. The advantages of the co-occurrence model over traditional gazetteers are discussed and the problem of place name disambiguation is presented as a case study. We begin by outlining the problem of ambiguous place names. We demonstrate how analysis of Wikipedia can be used in the generation of a co-occurrence model. The accuracy of our model is compared to a handcrafted ground truth; then we evaluate alternative methods of applying this model to the disambiguation of place names in free text (using the GeoCLEF evaluation forum). We conclude by showing how the inclusion of place names in both the text and geographic parts of a query provides the maximum mean average precision and outline the benefits of a co-occurrence model as a data source for the wider field of geographic information retrieval (GIR).

[edit] Research questions

"This paper describes the generation of a model capturing information on how placenames co‐occur together. The advantages of the co‐occurrence model over traditional gazetteers are discussed and the problem of placename disambiguation is presented as a case study."

Research details

Topics: Geographic information retrieval [edit item]
Domains: Geography, Information science [edit item]
Theory type: Design and action [edit item]
Wikipedia coverage: Other [edit item]
Theories: "Undetermined" [edit item]
Research design: Mathematical modeling [edit item]
Data source: Wikipedia pages [edit item]
Collected data time dimension: Cross-sectional [edit item]
Unit of analysis: Article [edit item]
Wikipedia data extraction: Dump [edit item]
Wikipedia page type: Article [edit item]
Wikipedia language: English [edit item]

[edit] Conclusion

"We conclude by showing how the inclusion of placenames in both the text and geographic parts of a query provides the maximum mean average precision and outline the benefits of a co‐occurrence model as a data source for the wider field of geographic information retrieval (GIR)."

[edit] Comments

"We conclude by showing how the inclusion of placenames in both the text and geographic parts of a query provides the maximum mean average precision and outline the benefits of a co‐occurrence model as a data source for the wider field of geographic information retrieval (GIR)."


Further notes[edit]

Facts about "Using co-occurrence models for place name disambiguation"RDF feed
AbstractThis paper describes the generation of a mThis paper describes the generation of a model capturing information on how place names co-occur together. The advantages of the co-occurrence model over traditional gazetteers are discussed and the problem of place name disambiguation is presented as a case study. We begin by outlining the problem of ambiguous place names. We demonstrate how analysis of Wikipedia can be used in the generation of a co-occurrence model. The accuracy of our model is compared to a handcrafted ground truth; then we evaluate alternative methods of applying this model to the disambiguation of place names in free text (using the GeoCLEF evaluation forum). We conclude by showing how the inclusion of place names in both the text and geographic parts of a query provides the maximum mean average precision and outline the benefits of a co-occurrence model as a data source for the wider field of geographic information retrieval (GIR).of geographic information retrieval (GIR).
Added by wikilit teamAdded on initial load +
Collected data time dimensionCross-sectional +
CommentsWe conclude by showing how the inclusion oWe conclude by showing how the inclusion of placenames in both the text and geographic parts of a query provides the maximum mean average precision and outline the benefits of a co‐occurrence model as a data source for the wider field of geographic information retrieval (GIR).of geographic information retrieval (GIR).
ConclusionWe conclude by showing how the inclusion oWe conclude by showing how the inclusion of placenames in both the text and geographic parts of a query provides the maximum mean average precision and outline the benefits of a co‐occurrence model as a data source for the wider field of geographic information retrieval (GIR).of geographic information retrieval (GIR).
Data sourceWikipedia pages +
Doi10.1080/13658810701626236 +
Google scholar urlhttp://scholar.google.com/scholar?ie=UTF-8&q=%22Using%2Bco-occurrence%2Bmodels%2Bfor%2Bplace%2Bname%2Bdisambiguation%22 +
Has authorSimon Overell + and Stefan Rüger +
Has domainGeography + and Information science +
Has topicGeographic information retrieval +
Issue3 +
MonthMarch +
Pages265-87 +
Peer reviewedYes +
Publication typeJournal article +
Published inInternational Journal of Geographical Information Science +
Research designMathematical modeling +
Research questionsThis paper describes the generation of a mThis paper describes the generation of a model capturing information on how placenames co‐occur together. The advantages of the co‐occurrence model over traditional gazetteers are discussed and the problem of placename disambiguation is presented as a case study.sambiguation is presented as a case study.
Revid11,023 +
TheoriesUndetermined
Theory typeDesign and action +
TitleUsing co-occurrence models for place name disambiguation
Unit of analysisArticle +
Urlhttp://dx.doi.org/10.1080/13658810701626236 +
Volume22 +
Wikipedia coverageOther +
Wikipedia data extractionDump +
Wikipedia languageEnglish +
Wikipedia page typeArticle +
Year2008 +