Abstract
|
This paper describes Koru, a new search interface that offers effective domain-independent knowledge-based information retrieval. Koru exhibits an understanding of the topics of both queries and documents. This allows it to (a) expand queries automatically and (b) help guide the user as they evolve their queries interactively. Its understanding is mined from the vast investment of manual effort and judgment that is Wikipedia. We show how this open, constantly evolving encyclopedia can yield inexpensive knowledge structures that are specifically tailored to expose the topics, terminology and semantics of individual document collections. We conducted a detailed user study with 12 participants and 10 topics from the 2005 TREC HARD track, and found that Koru and its underlying knowledge base offers significant advantages over traditional keyword search. It was capable of lending assistance to almost every query issued to it; making their entry more efficient, improving the relevance of the documents they return, and narrowing the gap between expert and novice seekers.
|
Added by wikilit team
|
Added on initial load +
|
Collected data time dimension
|
Cross-sectional +
|
Comments
|
"[Wikipedia was tested] with a varied domain independent collection of documents and retrieval tasks, and it was able to recognize and lend assistance to almost all queries issued to it, and significantly improve retrieval performance." p. 453
search results
"Research design" should be "Design science" (the construction of the Koru system is described) and "Experiment" (a user study with 12 subjects was conducted).
"Data source" can (apart from those listed) also be Wikipedia pages, as these are used to construct a thesaurus.
"Unit of analysis" should probably not be "User": it is not a user of Wikipedia, but a user of the Koru system. Pages are used for analysis in the construction of the thesaurus.
"Wikipedia page type" is probably "implicit" articles.
"Wikipedia language" is very likely English, but should be set to "Not specified".
From discussion on 2013-01-16: "A knowledge-based search engine powered by Wikipedia": "Research design" should also be "Design science"; "Data source": also "Wikipedia pages" (because of use in the thesaurus); "Unit of analysis": keep it as "User" (Wikipedia pages are not studied per se); "Wikipedia page type": "Article".
|
Conclusion
|
This paper has introduced Koru, a new search engine that
harnesses Wikipedia to provide domain-independent knowledge-based
retrieval. Our intuition that Wikipedia could provide a
knowledge base that matched both documents and queries has so
far been borne out. We have tested it with a varied domain-independent
collection of documents and retrieval tasks, and it
was able to recognize and lend assistance to almost all queries
issued to it, and significantly improve retrieval performance.
Koru’s design was also validated, in that it allowed users to apply
the knowledge found in Wikipedia to their retrieval process
easily, effectively and efficiently. The following quote, given by
one participant at the conclusion of their session, summarizes
Koru’s performance best:
It feels like a more powerful searching method, and allows
you to search for topics that you may not have thought of…
…it could use some improvements but the ability to
graphically turn topics on/off is useful, and the way the system
compresses synonymous terms together saves the user from
having to search for the variations themselves. The ability to
see a list of related terms also makes it easier to refine a
search, whereas with keyword searching you have to think up
related terms yourself.
|
Conference location
|
Lisboa, Portugal +
|
Data source
|
Computer usage logs +
, Direct observation +
, Experiment responses +
, Survey responses +
, Wikipedia pages +
|
Dates
|
6-9 +
|
Doi
|
10.1145/1321440.1321504 +
|
Google scholar url
|
http://scholar.google.com/scholar?ie=UTF-8&q=%22A%2Bknowledge-based%2Bsearch%2Bengine%2Bpowered%2Bby%2BWikipedia%22 +
|
Has author
|
David N. Milne +
, Ian H. Witten +
, David M. Nichols +
|
Has domain
|
Computer science +
|
Has topic
|
Query processing +
|
Month
|
November +
|
Pages
|
445-454 +
|
Peer reviewed
|
Yes +
|
Publication type
|
Conference paper +
|
Published in
|
CIKM '07 Proceedings of the sixteenth ACM Conference on information and knowledge management +
|
Publisher
|
Association for Computing Machinery +
|
Research design
|
Design science +
, Experiment +
|
Research questions
|
This paper describes Koru, a new search interface that offers
effective domain-independent knowledge-based information
retrieval. Koru exhibits an understanding of the topics of both
queries and documents. This allows it to (a) expand queries
automatically and (b) help guide the user as they evolve their
queries interactively. Its understanding is mined from the vast
investment of manual effort and judgment that is Wikipedia. We
show how this open, constantly evolving encyclopedia can yield
inexpensive knowledge structures that are specifically tailored to
expose the topics, terminology and semantics of individual
document collections. We conducted a detailed user study with 12
participants and 10 topics from the 2005 TREC HARD track, and
found that Koru and its underlying knowledge base offers
significant advantages over traditional keyword search. It was
capable of lending assistance to almost every query issued to it;
making their entry more efficient, improving the relevance of the
documents they return, and narrowing the gap between expert and
novice seekers.
|
Revid
|
11,127 +
|
Theories
|
Undetermined
|
Theory type
|
Design and action +
|
Title
|
A knowledge-based search engine powered by Wikipedia
|
Unit of analysis
|
User +
|
Url
|
http://researchcommons.waikato.ac.nz/handle/10289/5379 +
|
Wikipedia coverage
|
Sample data +
|
Wikipedia data extraction
|
Dump +
|
Wikipedia language
|
Not specified +
|
Wikipedia page type
|
Article +
|
Year
|
2007 +
|
Creation date
|
13 March 2012 12:20:12 +
|
Categories
|
Query processing +
, Computer science +
, Publications +
|
Modification date
|
30 January 2014 20:53:39 +
|