Browse wiki

Jump to: navigation, search
The Wikipedia XML corpus
Abstract Wikipedia is a well known free content, muWikipedia is a well known free content, multilingual encyclopedia written collaboratively by contributors around the world. Anybody can edit an article using a wiki markup language that offers a simplified alternative to HTML. This encyclopedia is composed of millions of articles in different languages.llions of articles in different languages.
Added by wikilit team Added on initial load  +
Collected data time dimension Cross-sectional  +
Conclusion In this article, we describe a set of XML collections based on Wikipedia.
Data source Wikipedia pages  +
Doi 10.1145/1147197.1147210 +
Google scholar url http://scholar.google.com/scholar?ie=UTF-8&q=%22The%2BWikipedia%2BXML%2Bcorpus%22  +
Has author Ludovic Denoyer + , Patrick Gallinari +
Has domain Computer science +
Has topic Other corpus topics + , Research platform +
Issue 1  +
Month June  +
Pages 64-69  +
Peer reviewed Yes  +
Publication type Journal article  +
Published in ACM SIGIR Forum +
Research design Other  +
Research questions In this article, we describe a set of XML collections based on Wikipedia.
Revid 10,964  +
Theories Undetermined
Theory type Analysis  +
Title The Wikipedia XML corpus
Unit of analysis Article  +
Url http://0-dl.acm.org.mercury.concordia.ca/citation.cfm?doid=1147197.1147210  +
Volume 40  +
Wikipedia coverage Main topic  +
Wikipedia data extraction Secondary dataset  +
Wikipedia language Multiple  +
Wikipedia page type Article  +
Year 2006  +
Creation dateThis property is a special property in this wiki. 15 March 2012 20:31:46  +
Categories Other corpus topics  + , Research platform  + , Computer science  + , Publications with missing comments  + , Publications  +
Modification dateThis property is a special property in this wiki. 30 January 2014 20:31:36  +
show properties that link here 

 

Enter the name of the page to start browsing from.