Size matters: word count as a measure of quality on Wikipedia

From WikiLit
Jump to: navigation, search
Publication (help)
Size matters: word count as a measure of quality on Wikipedia
Authors: Joshua E. Blumenstock [edit item]
Citation: International conference on World Wide Web  : 1095--1096. 2008. Beijing, China. Association for Computing Machinery.
Publication type: Conference paper
Peer-reviewed: Yes
Database(s):
DOI: 10.1145/1367497.1367673.
Google Scholar cites: Citations
Link(s): Paper link
Added by Wikilit team: Added on initial load
Search
Article: Google Scholar BASE PubMed
Other scholarly wikis: AcaWiki Brede Wiki WikiPapers
Web search: Bing Google Yahoo!Google PDF
Other:
Services
Format: BibTeX
Size matters: word count as a measure of quality on Wikipedia is a publication by Joshua E. Blumenstock.


[edit] Abstract

Wikipedia, "the free encyclopedia", now contains over two million English articles, and is widely regarded as a high-quality, authoritative encyclopedia. Some Wikipedia articles, however, are of questionable quality, and it is not always apparent to the visitor which articles are good and which are bad. We propose a simple metric -- word count -- for measuring article quality. In spite of its striking simplicity, we show that this metric significantly outperforms the more complex methods described in related work.

[edit] Research questions

"We propose a simple metric, word count, for measuring article quality. In spite of its striking simplicity, we show that this metric significantly outperforms the more complex methods described in related work."

Research details

Topics: Featured articles, Computational estimation of trustworthiness [edit item]
Domains: Information systems [edit item]
Theory type: Analysis [edit item]
Wikipedia coverage: Main topic [edit item]
Theories: "Undetermined" [edit item]
Research design: Statistical analysis [edit item]
Data source: Wikipedia pages [edit item]
Collected data time dimension: Cross-sectional [edit item]
Unit of analysis: Article [edit item]
Wikipedia data extraction: Dump [edit item]
Wikipedia page type: Article [edit item]
Wikipedia language: English [edit item]

[edit] Conclusion

"Blumenstock **** investigated multiple metrics such as word count and character count to predict the quality of Wikipedia articles. To compare these metrics, Blumenstock used them to classify featured versus non-featured articles. Results showed that the word count is a good predictor of good quality articles."

[edit] Comments


Further notes[edit]

Facts about "Size matters: word count as a measure of quality on Wikipedia"RDF feed
AbstractWikipedia, "the free encyclopedia", now coWikipedia, "the free encyclopedia", now contains over two million English articles, and is widely regarded as a high-quality, authoritative encyclopedia. Some Wikipedia articles, however, are of questionable quality, and it is not always apparent to the visitor which articles are good and which are bad. We propose a simple metric -- word count -- for measuring article quality. In spite of its striking simplicity, we show that this metric significantly outperforms the more complex methods described in related work.complex methods described in related work.
Added by wikilit teamAdded on initial load +
Collected data time dimensionCross-sectional +
ConclusionBlumenstock **** investigated multiple metBlumenstock **** investigated multiple metrics such as word count and character count to predict the quality of Wikipedia articles. To compare these metrics, Blumenstock used them to classify featured versus non-featured articles. Results showed that the word count is a good predictor of good quality articles.a good predictor of good quality articles.
Conference locationBeijing, China +
Data sourceWikipedia pages +
Doi10.1145/1367497.1367673 +
Google scholar urlhttp://scholar.google.com/scholar?ie=UTF-8&q=%22Size%2Bmatters%3A%2Bword%2Bcount%2Bas%2Ba%2Bmeasure%2Bof%2Bquality%2Bon%2BWikipedia%22 +
Has authorJoshua E. Blumenstock +
Has domainInformation systems +
Has topicFeatured articles + and Computational estimation of trustworthiness +
Pages1095--1096 +
Peer reviewedYes +
Publication typeConference paper +
Published inInternational conference on World Wide Web +
PublisherAssociation for Computing Machinery +
Research designStatistical analysis +
Research questionsWe propose a simple metric, word count, for measuring article quality. In spite of its striking simplicity, we show that this metric significantly outperforms the more complex methods described in related work.
Revid10,944 +
TheoriesUndetermined
Theory typeAnalysis +
TitleSize matters: word count as a measure of quality on Wikipedia
Unit of analysisArticle +
Urlhttp://wwwconference.org/www2008/papers/pdf/p1095-jblumenstock.pdf +
Wikipedia coverageMain topic +
Wikipedia data extractionDump +
Wikipedia languageEnglish +
Wikipedia page typeArticle +
Year2008 +