Browse wiki

Jump to: navigation, search
Deriving a large scale taxonomy from Wikipedia
Abstract We take the category system in Wikipedia aWe take the category system in Wikipedia as a conceptual network. We label the semantic relations between categories using methods based on connectivity in the network and lexico-syntactic matching. As a result we are able to derive a large scale taxonomy containing a large amount of subsumption, i.e. isa, relations. We evaluate the quality of the created resource by comparing it with ResearchCyc, one of the largest manually annotated ontologies, as well as computing semantic similarity between words in benchmarking datasets.ty between words in benchmarking datasets.
Added by wikilit team Added on initial load  +
Collected data time dimension Cross-sectional  +
Comments "Our Wikipedia-based taxonomy proved to be competitive with the two arguably largest and best developed existing ontologies. We believe that these results are caused by taking already structured and well-maintained knowledge as input." p. 1445
Conclusion Our Wikipedia- based taxonomy proved to be competitive with the two arguably largest and best developed existing ontologies. We believe that these results are caused by taking already structured and well-maintained knowledge as input.
Conference location Vancouver, BC, Canada +
Data source Experiment responses  + , Wikipedia pages  +
Dates 22-26 +
Google scholar url http://scholar.google.com/scholar?ie=UTF-8&q=%22Deriving%2Ba%2Blarge%2Bscale%2Btaxonomy%2Bfrom%2BWikipedia%22  +
Has author Simone Paolo Ponzetto + , Michael Strube +
Has domain Computer science +
Has topic Semantic relatedness +
Month July  +
Pages 1440-1445  +
Peer reviewed Yes  +
Publication type Conference paper  +
Published in AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 2 +
Publisher American Association for Artificial Intelligence +
Research design Experiment  +
Research questions We take the category system inWikipedia asWe take the category system inWikipedia as a conceptual network. We label the semantic relations between categories using methods based on connectivity in the network and lexicosyntactic matching. As a result we are able to derive a large scale taxonomy containing a large amount of subsumption, i.e. isa, relations.mount of subsumption, i.e. isa, relations.
Revid 10,732  +
Theories Undetermined
Theory type Design and action  +
Title Deriving a large scale taxonomy from Wikipedia
Unit of analysis Article  +
Url http://en.scientificcommons.org/43568891  +
Volume 2  +
Wikipedia coverage Sample data  +
Wikipedia data extraction Dump  +
Wikipedia language English  +
Wikipedia page type Article  + , Information categorization and navigation  +
Year 2007  +
Creation dateThis property is a special property in this wiki. 15 March 2012 20:25:40  +
Categories Semantic relatedness  + , Computer science  + , Publications  +
Modification dateThis property is a special property in this wiki. 30 January 2014 20:23:16  +
show properties that link here 

 

Enter the name of the page to start browsing from.