Browse wiki

Jump to: navigation, search
Learning to link with Wikipedia
Abstract This paper describes how to automatically This paper describes how to automatically cross-reference documents with Wikipedia: the largest knowledge base ever known. It explains how machine learning can be used to identify significant terms within unstructured text, and enrich it with links to the appropriate Wikipedia articles. The resulting link detector and disambiguator performs very well, with recall and precision of almost 75%. This performance is constant whether the system is evaluated on Wikipedia articles or -real world- documents. This work has implications far beyond enriching documents with explanatory links. It can provide structured knowledge about any unstructured fragment of text. Any task that is currently addressed with bags of words-indexing, clustering, retrieval, and summarization to name a few-could use the techniques described here to draw on a vast network of concepts and semantics. a vast network of concepts and semantics.
Added by wikilit team Added on initial load  +
Collected data time dimension Cross-sectional  +
Comments "The resulting link detector and disambiguator performs very well, with recall and precision of almost 75%." p. 509
Conclusion The resulting link detector and disambiguator performs very well, with recall and precision of almost 75%. This performance is constant whether the system is evaluated on Wikipedia articles or “real world” documents.
Conference location Napa Valley, CA, United states +
Data source Experiment responses  + , Wikipedia pages  +
Dates 26-30 +
Doi 10.1145/1458082.1458150 +
Google scholar url http://scholar.google.com/scholar?ie=UTF-8&q=%22Learning%2Bto%2Blink%2Bwith%2BWikipedia%22  +
Has author David N. Milne + , Ian H. Witten +
Has domain Computer science +
Has topic Information extraction +
Month October  +
Pages 509-518  +
Peer reviewed Yes  +
Publication type Conference paper  +
Published in Proceeding of the 17th ACM conference on Information and knowledge management +
Publisher Association for Computing Machinery +
Research design Experiment  +
Research questions This paper describes how to automatically This paper describes how to automatically cross-reference documents with Wikipedia: the largest knowledge base ever known. It explains how machine learning can be used to identify significant terms within unstructured text, and enrich it with links to the appropriate Wikipedia articles.nks to the appropriate Wikipedia articles.
Revid 10,849  +
Theories Undetermined
Theory type Design and action  +
Title Learning to link with Wikipedia
Unit of analysis Article  +
Url http://dl.acm.org/citation.cfm?id=1458150  +
Wikipedia coverage Main topic  +
Wikipedia data extraction Dump  +
Wikipedia language English  +
Wikipedia page type Article  +
Year 2008  +
Creation dateThis property is a special property in this wiki. 15 March 2012 20:29:27  +
Categories Information extraction  + , Computer science  + , Publications  +
Modification dateThis property is a special property in this wiki. 30 January 2014 20:29:24  +
hide properties that link here 
  No properties link to this page.
 

 

Enter the name of the page to start browsing from.