Cultural configuration of Wikipedia: measuring autoreferentiality in different languages

From WikiLit
Jump to: navigation, search
Publication (help)
Cultural Configuration of Wikipedia: Measuring Autoreferentiality in Different Languages
Authors: Marc Miquel-Ribé, Horacio Rodríguez [edit item]
Citation: Recent Advances in Natural Language Processing  : 316-322. 2011 September 2011. Hissar, Bulgaria.
Publication type: Conference paper
Peer-reviewed: Unknown
Database(s):
DOI: Define doi.
Google Scholar cites: Citations
Link(s): Paper link
Added by Wikilit team: Yes
Search
Article: Google Scholar BASE PubMed
Other scholarly wikis: AcaWiki Brede Wiki WikiPapers
Web search: Bing Google Yahoo!Google PDF
Other:
Services
Format: BibTeX
Cultural Configuration of Wikipedia: Measuring Autoreferentiality in Different Languages is a publication by Marc Miquel-Ribé, Horacio Rodríguez.


[edit] Abstract

Among the motivations to write in Wikipedia given by the current literature there is often coincidence, but none of the studies presents the hypothesis of contributing for the visibility of the own national or language related content. Similar to topical coverage studies, we outline a method which allows collecting the articles of this content, to later analyse them in several dimensions. To prove its universality, the tests are repeated for up to twenty language editions of Wikipedia. Finally, through the best indicators from each dimension we obtain an index which represents the degree of autoreferentiality of the encyclopedia. Last, we point out the impact of this fact and the risk of not considering its existence in the design of applications based on user generated content.

[edit] Research questions

"Could it not be then that some editors get involved due to some sort of cultural motivation related to their own national or linguistic sphere too? ... Autoreferentiality concept we propose stands out to describe the interest of a culture on itself, which in WP translates to the interest of editors for their own local content in a WP language edition. Our study makes two contributions: first, we show empirically how by an algorithm using the relations among categories and articles it is possible to retrieve a kind of content which is local to a language; second, how by the use of all kinds of WP features we can understand the importance of this content."

Research details

Topics: Contributor motivation, Cultural and linguistic effects on participation [edit item]
Domains: Computer science, Sociology [edit item]
Theory type: Analysis [edit item]
Wikipedia coverage: Main topic [edit item]
Theories: "Undetermined" [edit item]
Research design: Content analysis, Mathematical modeling [edit item]
Data source: Wikipedia pages [edit item]
Collected data time dimension: Cross-sectional [edit item]
Unit of analysis: Language [edit item]
Wikipedia data extraction: Live Wikipedia [edit item]
Wikipedia page type: Article, Information categorization and navigation [edit item]
Wikipedia language: Multiple [edit item]

[edit] Conclusion

"In this study, first we determined with a simple technique method the scope of the local content in WP language editions, which is in average a 24%. Choosing key words which are very tight to each language like the territories where they are spoken proved right to obtain local content, although a good choice of key words like the territory names and gentilics from the language edition was key to avoid losing content. Most of content comes from the main territory name. While this selection could have been influenced by the noisy category structure, studying after the category memberships as a feature of the content and discovering local content has more categories memberships reinforced the method. Our results according to our methodology for creating an index showed that autoreferentiality value can increase due to several dimensions."

[edit] Comments


Further notes[edit]

Facts about "Cultural configuration of Wikipedia: measuring autoreferentiality in different languages"RDF feed
AbstractAmong the motivations to write in WikipediAmong the motivations to write in Wikipedia

given by the current literature there is often coincidence, but none of the studies presents the hypothesis of contributing for the visibility of the own national or language related content. Similar to topical coverage studies, we outline a method which allows collecting the articles of this content, to later analyse them in several dimensions. To prove its universality, the tests are repeated for up to twenty language editions of Wikipedia. Finally, through the best indicators from each dimension we obtain an index which represents the degree of autoreferentiality of the encyclopedia. Last, we point out the impact of this fact and the risk of not considering its existence in the design of applications based on user generated content.ications based on

user generated content.
Added by wikilit teamYes +
Collected data time dimensionCross-sectional +
ConclusionIn this study, first we determined with a In this study, first we determined with a simple

technique method the scope of the local content in WP language editions, which is in average a 24%. Choosing key words which are very tight to each language like the territories where they are spoken proved right to obtain local content, although a good choice of key words like the territory names and gentilics from the language edition was key to avoid losing content. Most of content comes from the main territory name. While this selection could have been influenced by the noisy category structure, studying after the category memberships as a feature of the content and discovering local content has more categories memberships reinforced the method. Our results according to our methodology for creating an index showed that autoreferentiality

value can increase due to several dimensions.
ue can increase due to several dimensions.
Conference locationHissar, Bulgaria +
Data sourceWikipedia pages +
Dates2011 +
Google scholar urlhttp://scholar.google.com/scholar?ie=UTF-8&q=%22Cultural%2BConfiguration%2Bof%2BWikipedia%3A%2BMeasuring%2BAutoreferentiality%2Bin%2BDifferent%2BLanguages%22 +
Has authorMarc Miquel-Ribé + and Horacio Rodríguez +
Has domainComputer science + and Sociology +
Has topicContributor motivation + and Cultural and linguistic effects on participation +
MonthSeptember +
Pages316-322 +
Peer reviewedUnknown +
Publication typeConference paper +
Published inRecent Advances in Natural Language Processing +
Research designContent analysis + and Mathematical modeling +
Research questionsCould it not be then that some editors getCould it not be then that some editors get involved due to some sort of cultural motivation related to their own national or linguistic sphere too? ... Autoreferentiality concept we propose stands out to describe the interest of a culture on itself, which in WP translates to the interest of editors for their own local content in a WP language edition. Our study makes two contributions: first, we show empirically how by an algorithm using the relations among categories and articles it is possible to retrieve a kind of content which is local to a language; second, how by the use of all kinds of WP features we can understand the importance of this content.understand the importance of this content.
Revid10,724 +
TheoriesUndetermined
Theory typeAnalysis +
TitleCultural Configuration of Wikipedia: Measuring Autoreferentiality in Different Languages
Unit of analysisLanguage +
Urlhttp://aclweb.org/anthology/R11-1044 +
Wikipedia coverageMain topic +
Wikipedia data extractionLive Wikipedia +
Wikipedia languageMultiple +
Wikipedia page typeArticle + and Information categorization and navigation +
Year2011 +