WikiLit: A literature review of scholarly research on Wikipedia
Please contact us at ***.
To attribute your use of this data in accordance with the CC-BY-SA license, please cite our working paper.
Key details of this literature review
- We describe the methodology in detail in a working paper.
- We focus only on research on Wikipedia, not on any other wiki.
- We focused mainly on peer-reviewed journal articles and PhD dissertations, and have systematically sought to include these. It is a huge project, and we had to draw a limit to what we would mainly focus on.
- We have also included around 70 or so of the most highly-cited conference papers. Because of our limited time and resources, unfortunately we were unable to systematically include more; however, we can certainly include any important conference papers that we left out--please point them out!
- Our cut-off date for inclusion is June 2011, after which the Wikimedia Research Newsletter was formally inaugurated; we're letting them pick up from where we stop.
- We have submitted a presentation proposal for Wikimania 2012.
Request for help
Please help us verify the accuracy of our data extraction so far. Practically, if you could take a look at your own publications and the publications you know well, that would be great. It's an open wiki, so please make any corrections directly, even anonymously. (However, if you want us to acknowledge your contributions, please create a user account and identify yourself on your user page.) In particular, please help us with the following:
- Please correct any inaccuracies you see, or e-mail us at *** to notify us of them.
- Please point out any peer-reviewed journal articles or PhD dissertations we have missed that were published before July 2011; we will certainly add these.
- Please point out any other scholarly studies (especially conference articles and significant non-peer-reviewed work) that you feel should definitely be included. Since our time and resources are quite limited, unfortunately we will only include these if you can help us see why they are particularly important.
- Please suggest any data analysis or visualizations you would like to see as we synthesize the data.
- Please give any other feedback or suggestion that can help us make this dataset more useful to researchers!
The data is publicly available (the license is CC-****), but this is a beta release and there are probably a lot of errors. We hope to have a stable and very clean dataset within a couple months, both from community help and from our own internal quality control processes; we'll make another announcement when we feel it has reached "featured" quality. In particular, please wait a bit before exporting the data to other research collection websites and wikis until it is in a cleaner state; by then, we'll help make it available in as many export formats as practical.