Browse wiki

Jump to: navigation, search
Mobile information retrieval with search results clustering: prototypes and evaluations
Abstract Web searches from mobile devices such as PWeb searches from mobile devices such as PDAs and cell phones are becoming increasingly popular. However, the traditional list-based search interface paradigm does not scale well to mobile devices due to their inherent limitations. In this article, we invthe application of search results clustering, used with some success for desktop computer searches, to the mobile scenario. Building on CREDO (Conceptual Reorganization of Documents), a Web clustering engine based on concept lattices, we present imobile versions Credino and SmartCREDO, for PDAs and cell phones, respectively. Next, we evaluate the retrieval performance of the three prototype systems. We measure the effectiveness of their clustered results compared to a ranked list of results on a retrieval task, by means of the device-independent notion of subtopic reach time with a reusable test collection built from Wikipedia ambiguous entries. Then, we make a crosscomparison of methods (i.e., clustering and ranked list) and dev(i.e., desktop, PDA, and cell phone), using an interactive information-finding task performed by external participants. The main finding is that clustering engines are a viable complementary approach to plain search engines both for desktop and mobilsearches especially, but not only, for multitopic informational queries.nly, for multitopic informational queries.
Added by wikilit team Added on initial load  +
Collected data time dimension Cross-sectional  +
Comments Websites (Twitter), Wikipedia articles
Conclusion We have shown that mobile search results cWe have shown that mobile search results clustering is both feasible and effective. In particular, our results support the viewthat mobile clustering engines can be faster and more accurate than the corresponding mobile search engines, especially for subtopic retrieval tasks.We also found that although mobile retrieval becomes, in general, less effective as the search device gets smaller, the adoption of clustering may help expand the usage patterns beyond mere informational search while mobile.nd mere informational search while mobile.
Conference location United States, California +
Data source Experiment responses  + , Websites  + , Wikipedia pages  +
Doi 10.1002/asi.21036 +
Google scholar url http://scholar.google.com/scholar?ie=UTF-8&q=%22Mobile%2Binformation%2Bretrieval%2Bwith%2Bsearch%2Bresults%2Bclustering%3A%2Bprototypes%2Band%2Bevaluations%22  +
Has author Claudio Carpineto + , Stefano Mizzaro + , Giovanni Romano + , Matteo Snidero +
Has domain Computer science +
Has topic Ranking and clustering systems +
Issue 5  +
Pages 877-895  +
Peer reviewed Yes  +
Publication type Journal article  +
Published in Journal of the American Society for Information Science and Technology +
Research design Experiment  + , Mathematical modeling  +
Research questions In this article, we investigate the applicIn this article, we investigate the application of search results clustering, used with some success for desktop computer searches, to the mobile scenario. Building on CREDO (Conceptual Reorganization of Documents), a Web clustering engine based on concept lattices, we present its mobile versions Credino and SmartCREDO, for PDAs and cell phones, respectively. Next, we evaluate the retrieval performance of the three prototype systems. We measure the effectiveness of their clustered results compared to a ranked list of results on a subtopic retrieval task, by means of the device-independent notion of subtopic reach time together with a reusable test collection built from wkipedia ambiguous entries. Then, we make a crosscomparison of methods (i.e., clustering and ranked list) and devices (i.e., desktop, PDA, and cell phone), using an interactive information-finding task performed by external participants. Our study is split in two parts. In the first part, we consider the theoretical retrieval performance of cluster hierarchies and ranked lists, regardless of the specific device used to display and interact with the results. In the second part, we do not assume that there is a predefined model of access to information, as in the first experiment, and we explicitly consider not only the retrieval method (i.e., clustering and ranked list) but also the device (i.e., desktop, PDA, and cell phone)evice (i.e., desktop, PDA, and cell phone)
Revid 10,874  +
Theories In practice, CREDO starts from the clusterIn practice, CREDO starts from the cluster placed at the hierarchy root, which is usually described by the query terms and covers all retrieved results, and then iteratively builds two lower levels of the hierarchy. Each level contains the most general of the concepts that are theoretically more specific than the concepts in the preceding level, according to the definition of formal concepts. To increase the utility of the clustering process for subtopic retrieval, the first level is generated using only the terms contained in the title of search results, and the second level using both the title and the snippet. The CREDO hierarchy is then visualized using a simple folder tree layout. The system initially shows the hierarchy root and the first level of the hierarchy. The user can click on each cluster to see the results associated with it and expand its subclusters (if any). All the documents of one cluster that are not covered by its “children” are grouped in a dummy cluster named “other.” CREDO does not neatly fit in either of the two classes discussed earlier. Similar to data-centric algorithms, it uses strict single-word indexing. Its monothetic clusters are mostly described by a single word, but they also can accommodate labels with multiple contiguous words, reflecting the causal (or deterministic) associations between words in the given query context. For instance, for the query “metamorphosis” (see Figure 1), CREDO returns some multiple-word concepts such as “hilary duff” and “star trek,” consistent with the fact that in the limited context represented by the results of “metamorphosis,” “hilary” always co-occurs with “duff” and “star” with “trek.” In CREDO, cluster labeling is integrated with cluster formation by definition because a concept intent is univocally determined by a concept extent, and vice versa. Thus, CREDO builds the cluster structure and cluster descriptions at once. By contrast, these two operations are usually treated separately. The disadvantage of the common approach is that there may be a mismatch between the criterion used to find a common description and that used to group the search results, thus increasing the chance that the contents will not correspond to the labels (or vice versa). correspond to the labels (or vice versa).
Theory type Design and action  +
Title Mobile information retrieval with search results clustering: prototypes and evaluations
Unit of analysis Article  +
Url http://dx.doi.org/10.1002/asi.21036  +
Volume 60  +
Wikipedia coverage Sample data  +
Wikipedia data extraction Live Wikipedia  +
Wikipedia language English  +
Wikipedia page type Article  +
Year 2009  +
Creation dateThis property is a special property in this wiki. 15 March 2012 20:29:38  +
Categories Ranking and clustering systems  + , Computer science  + , Publications  +
Modification dateThis property is a special property in this wiki. 30 January 2014 20:29:50  +
hide properties that link here 
  No properties link to this page.
 

 

Enter the name of the page to start browsing from.