Taxonomy and clustering in collaborative systems: the case of the on-line encyclopedia Wikipedia
Abstract In this paper we investigate the nature and structure of the relation between imposed classifications and real clustering in a particular case of a scale-free network given by the on-line encyclopedia Wikipedia. We find a statistical similarity in the distributions of community sizes both by using the top-down approach of the categories division present in the archive and in the bottom-up procedure of community detection given by an algorithm based on the spectral properties of the graph. Regardless of the statistically similar behaviour, the two methods provide a rather different division of the articles, thereby signaling that the nature and presence of power laws is a general feature for these systems and cannot be used as a benchmark to evaluate the suitability of a clustering method.
Added by wikilit team Added on initial load  +
Collected data time dimension Cross-sectional  +
Comments The varying agreement between clustering and categorization across the studied versions of Wikipedia suggests that links in Wikipedia do not necessarily imply similarity or relatedness relations.
Conclusion We find a statistical similarity in the distributions of community sizes both by using the top-down approach of the categories division present in the archive and in the bottom-up procedure of community detection given by an algorithm based on the spectral properties of the graph. Regardless of the statistically similar behaviour, the two methods provide a rather different division of the articles, thereby signaling that the nature and presence of power laws is a general feature for these systems and cannot be used as a benchmark to evaluate the suitability of a clustering method.
Data source Wikipedia pages  +
Doi 10.1209/0295-5075/81/28006 +
Google scholar url http://scholar.google.com/scholar?ie=UTF-8&q=%22Taxonomy%2Band%2Bclustering%2Bin%2Bcollaborative%2Bsystems%3A%2Bthe%2Bcase%2Bof%2Bthe%2Bon-line%2Bencyclopedia%2BWikipedia%22  +
Has author Andrea Capocci + , Francesco Rao + , Guido Caldarelli +
Has domain Information science +
Has topic Ontology building +
Issue 2  +
Month January  +
Pages 28006-1  +
Peer reviewed Yes  +
Publication type Journal article  +
Published in Europhysics Letters +
Research design Case study  +
Research questions In this paper we investigate the nature and structure of the relation between imposed classifications and real clustering in a particular case of a scale-free network given by the on-line encyclopedia Wikipedia.
Revid 10,956  +
Theories Undetermined
Theory type Analysis  +
Title Taxonomy and clustering in collaborative systems: the case of the on-line encyclopedia Wikipedia
Unit of analysis Category  +
Url http://dx.doi.org/10.1209/0295-5075/81/28006  +
Volume 81  +
Wikipedia coverage Case  +
Wikipedia data extraction Dump  +
Wikipedia language All languages  +
Wikipedia page type Article  +
Year 2008  +
Creation date 15 March 2012 20:30:37  +
Categories Ontology building  + , Information science  + , Publications  +
Modification date 30 January 2014 20:31:30  +