Browse wiki

Jump to: navigation, search
Harvesting wiki consensus: using Wikipedia entries as vocabulary for knowledge management
Abstract Vocabularies that provide unique identifieVocabularies that provide unique identifiers for conceptual elements of a domain can improve precision and recall in knowledge-management applications. Although creating and maintaining such vocabularies is generally hard, wiki users easily manage to develop comprehensive, informal definitions of terms, each one identified by a {URI.} Here, the authors show that the {URIs} of Wikipedia entries are reliable identifiers for conceptual entities. They also demonstrate how Wikipedia entries can be used for annotating Web resources and knowledge assets and give precise estimates of the amount of Wikipedia {URIs} in terms of the popular Proton ontology's top-level concepts.ular Proton ontology's top-level concepts.
Added by wikilit team Yes  +
Collected data time dimension N/A  +
Comments We demonstrate how Wikipedia entries can be used for annotating Web resources and knowledge assets and give precise estimates of the amount of Wikipedia URIs in terms of the popular Proton ontology’s top-level concepts.
Conclusion The analysis of Wikipedia entries reveals The analysis of Wikipedia entries reveals two interesting results. First, despite the unsupervised, community- driven editing process, the conceptual entity associated with Wikipedia URIs rarely changes. Second, among the 1.5 million entries are very substantial amounts of concepts that are relevant for annotating Web resources, such as popular actors, research fields, cities, or universities. The analysis of Wikipedia entries’ ontological nature shows that the majority of URIs in our sample (87 percent) denote instances or subconcepts to the Proton top-level category object. This is defined as “entities that could be claimed to exist” (see http://proton.semanticweb.org). Nine percent are some sort of abstract, and 4 percent are classified as a happening. Figure 3a illustrates the proportion of entries in each main Proton category. The breakdown of Wikipedia entries that fall into the protont.Object branch is very interesting. Figure 3b shows the proportions in the sample. (For statistical reasons, the point estimates for the population don’t necessarily add up to 100 percent, which is why we based Figure 3 on the sample proportions and not on the population estimates from Table 3.) We always assigned each Wikipedia entry to the most specific subclass of protont.Object. This means that agent here counts only those conceptual entities for which no more specific subclass of protont.Object.Agent exists. We can see that the majority of the URIs denote people (23 percent), locations (23 percent), organizations (13 percent), product types (13 percent), and groups (6 percent). Our analysis shows that for the vast majority of Wikipedia entries, a community consensus exists about the URIs’ meaning from the very first to the most recent version. In other words, open communities seem able to achieve consensus about named conceptual entities as very lightweight ontological agreements in an unsupervised fash- 62 www. ion and relying only on the known mechanisms of standard wiki software to prevent destructive changes. We assume that the ease of access and using complementing multimedia elements for conceptualizing an entry are important factors in this process.try are important factors in this process.
Data source N/A  +
Google scholar url http://scholar.google.com/scholar?ie=UTF-8&q=%22Harvesting%2Bwiki%2Bconsensus%3A%2Busing%2BWikipedia%2Bentries%2Bas%2Bvocabulary%2Bfor%2Bknowledge%2Bmanagement%22  +
Has author Martin Hepp + , Katharina Siorpaes + , Daniel Bachlechner +
Has domain Computer science + , Knowledge management +
Has topic Ontology building +
Issue 5  +
Month October  +
Pages 54-65  +
Peer reviewed Yes  +
Publication type Journal article  +
Published in IEEE Internet Computing +
Research design Statistical analysis  +
Research questions Vocabularies that provide unique identifieVocabularies that provide unique identifiers for conceptual elements of a domain can improve precision and recall in knowledge-management applications. Although creating and maintaining such vocabularies is generally hard, wiki users easily manage to develop comprehensive, informal definitions of terms, each one identified by a URI. Here, the authors show that the URIs of Wikipedia entries are reliable identifiers for conceptual entities. They also demonstrate how Wikipedia entries can be used for annotating Web resources and knowledge assets and give precise estimates of the amount of Wikipedia URIs in terms of the popular Proton ontology’s top-level concepts.ular Proton ontology’s top-level concepts.
Revid 10,801  +
Theories Undetermined
Theory type Analysis  +
Title Harvesting wiki consensus: using Wikipedia entries as vocabulary for knowledge management
Unit of analysis Article  +
Url http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=4305569&tag=1  +
Volume 11  +
Wikipedia coverage Sample data  +
Wikipedia data extraction Live Wikipedia  +
Wikipedia language English  +
Wikipedia page type Article  +
Year 2007  +
Creation dateThis property is a special property in this wiki. 15 March 2012 20:28:46  +
Categories Ontology building  + , Computer science  + , Knowledge management  + , Publications  +
Modification dateThis property is a special property in this wiki. 30 January 2014 20:28:39  +
hide properties that link here 
  No properties link to this page.
 

 

Enter the name of the page to start browsing from.