Browse wiki

Jump to: navigation, search
A knowledge-based search engine powered by Wikipedia
Abstract This paper describes Koru, a new search inThis paper describes Koru, a new search interface that offers effective domain-independent knowledge-based information retrieval. Koru exhibits an understanding of the topics of both queries and documents. This allows it to (a) expand queries automatically and (b) help guide the user as they evolve their queries interactively. Its understanding is mined from the vast investment of manual effort and judgment that is Wikipedia. We show how this open, constantly evolving encyclopedia can yield inexpensive knowledge structures that are specifically tailored to expose the topics, terminology and semantics of individual document collections. We conducted a detailed user study with 12 participants and 10 topics from the 2005 TREC HARD track, and found that Koru and its underlying knowledge base offers significant advantages over traditional keyword search. It was capable of lending assistance to almost every query issued to it; making their entry more efficient, improving the relevance of the documents they return, and narrowing the gap between expert and novice seekers.the gap between expert and novice seekers.
Added by wikilit team Added on initial load  +
Collected data time dimension Cross-sectional  +
Comments "[Wikipedia was tested] with a varied doma"[Wikipedia was tested] with a varied domain independent collection of documents and retrieval tasks, and it was able to recognize and lend assistance to almost all queries issued to it, and significantly improve retrieval performance." p. 453 search results "Resign design" should be "Design science" (the construction of the Koru system is described and "experiment" (a user study with 12 subjects are made). "Data source" can (apart from those listed) also be Wikipedia pages as these are used to constructed a thesaurus. "Unit of analysis" should probably not be "user", - it is not a user of Wikipedia, but a user of the Koru system. Pages are used for analysis in the construction of the thesaurus. "Wikipedia page type" is probably "implicit" articles. "Wikipedia language" is very likely English, but should be set to "Not specified". From discussion in 2013-01-16: "“A knowledge-based search engine powered by Wikipedia”: Should also be “design science”, “data source”: also “Wikipedia pages” (because of use in the thesaurus), “Unit of analysis”: keep it as “User” (Wikipedia pages are not studied per se), “Wikipedia pagetype: “Article”." per se), “Wikipedia pagetype: “Article”."
Conclusion This paper has introduced Koru, a new searThis paper has introduced Koru, a new search engine that harnesses Wikipedia to provide domain-independent knowledgebased retrieval. Our intuition that Wikipedia could provide a knowledge base that matched both documents and queries has so far been borne out. We have tested it with a varied domainindependent collection of documents and retrieval tasks, and it was able to recognize and lend assistance to almost all queries issued to it, and significantly improve retrieval performance. Koru’s design was also validated, in that it allowed users to apply the knowledge found in Wikipedia to their retrieval process easily, effectively and efficiently. The following quote, given by one participant at the conclusion of their session, summarizes Koru’s performance best: It feels like a more powerful searching method, and allows you to search for topics that you may not have thought of… …it could use some improvements but the ability to graphically turn topics on/off is useful, and the way the system compresses synonymous terms together saves the user from having to search for the variations themselves. The ability to see a list of related terms also makes it easier to refine a search, where as with keyword searching you have to think up related terms yourself.u have to think up related terms yourself.
Conference location Lisboa, Portugal +
Data source Computer usage logs  + , Direct observation  + , Experiment responses  + , Survey responses  + , Wikipedia pages  +
Dates 6-9 +
Doi 10.1145/1321440.1321504 +
Google scholar url http://scholar.google.com/scholar?ie=UTF-8&q=%22A%2Bknowledge-based%2Bsearch%2Bengine%2Bpowered%2Bby%2BWikipedia%22  +
Has author David N. Milne + , Ian H. Witten + , David M. Nichols +
Has domain Computer science +
Has topic Query processing +
Month November  +
Pages 445-454  +
Peer reviewed Yes  +
Publication type Conference paper  +
Published in CIKM '07 Proceedings of the sixteenth ACM conference on Conference on information and knowledge management +
Publisher Association for Computing Machinery +
Research design Design science  + , Experiment  +
Research questions This paper describes Koru, a new search inThis paper describes Koru, a new search interface that offers effective domain-independent knowledge-based information retrieval. Koru exhibits an understanding of the topics of both queries and documents. This allows it to (a) expand queries automatically and (b) help guide the user as they evolve their queries interactively. Its understanding is mined from the vast investment of manual effort and judgment that is Wikipedia. We show how this open, constantly evolving encyclopedia can yield inexpensive knowledge structures that are specifically tailored to expose the topics, terminology and semantics of individual document collections. We conducted a detailed user study with 12 participants and 10 topics from the 2005 TREC HARD track, and found that Koru and its underlying knowledge base offers significant advantages over traditional keyword search. It was capable of lending assistance to almost every query issued to it; making their entry more efficient, improving the relevance of the documents they return, and narrowing the gap between expert and novice seekers.the gap between expert and novice seekers.
Revid 11,127  +
Theories Undetermined
Theory type Design and action  +
Title A knowledge-based search engine powered by Wikipedia
Unit of analysis User  +
Url http://researchcommons.waikato.ac.nz/handle/10289/5379  +
Wikipedia coverage Sample data  +
Wikipedia data extraction Dump  +
Wikipedia language Not specified  +
Wikipedia page type Article  +
Year 2007  +
Creation dateThis property is a special property in this wiki. 13 March 2012 12:20:12  +
Categories Query processing  + , Computer science  + , Publications  +
Modification dateThis property is a special property in this wiki. 30 January 2014 20:53:39  +
show properties that link here 

 

Enter the name of the page to start browsing from.