Extracting content holes by comparing community-type content with Wikipedia

From WikiLit
Jump to: navigation, search
Publication (help)
Extracting content holes by comparing community-type content with Wikipedia
Authors: Akiyo Nadamoto, Eiji Aramaki, Takeshi Abekawa, Yohei Murakami [edit item]
Citation: International Journal of Web Information Systems 6 (3): 248-260. 2010 August. Emerald Group Publishing.
Publication type: Journal article
Peer-reviewed: Yes
Database(s):
DOI: 10.1108/17440081011070178.
Google Scholar cites: Citations
Link(s): Paper link
Added by Wikilit team: Yes
Search
Article: Google Scholar BASE PubMed
Other scholarly wikis: AcaWiki Brede Wiki WikiPapers
Web search: Bing Google Yahoo!Google PDF
Other:
Services
Format: BibTeX
Extracting content holes by comparing community-type content with Wikipedia is a publication by Akiyo Nadamoto, Eiji Aramaki, Takeshi Abekawa, Yohei Murakami.


[edit] Abstract

Purpose – Community-type content that are social network services and blogs are maintained by communities of people. Occasionally, community members do not understand the nature of the content from multiple perspectives, and so the volume of information is often inadequate. The authors thus consider it necessary to present users with missing information. The purpose of this paper is to search for the content “hole” where users of community-type content missed information.

Design/methodology/approach – The proposed content hole is defined as different information that is obtained by comparing community-type content with other content, such as other community-type content, other conventional web content, and real-world content. The paper suggests multiple types of content holes and proposes a system that compares community-type content with Wikipedia articles and identifies the content hole. The paper first identifies structured keywords from the community-type content, and extracts target articles from Wikipedia using the keywords. It then extracts other related articles from Wikipedia using the link graph. Finally, it compares community-type content with the articles in Wikipedia and extracts and presents content holes.

Findings – Information retrieval looks for similar data. In contrast, a content-hole search looks for information that is different. This paper defines the type of content hole on the basis of viewpoints. The proposed viewpoints are coverage, detail, semantics, and reputation.

Originality/value – The paper proposes a system for extracting coverage content holes. The system compares community-type content with Wikipedia and extracts content holes in the community-type content.

[edit] Research questions

"Community-type content that are social network services and blogs are maintained by communities of people. Occasionally, community members do not understand the nature of the content from multiple perspectivesm and so the volume of information is often inadequate. The authors thus consider it necessary to present users with missing information. The purpose of this paper is to search for the content gile where users of communty-type content missed information... The paper first identifies structured keywords from the community-type content, and extracts target articles from Wikipedia using the keywords. It then extracts other related articles from wikipedia using the link graph. Finally, it compares community-type content with the articles in wikipedia and extracts and present content holes."

Research details

Topics: Reading support [edit item]
Domains: Computer science [edit item]
Theory type: Design and action [edit item]
Wikipedia coverage: Sample data [edit item]
Theories: "Undetermined" [edit item]
Research design: Experiment [edit item]
Data source: Experiment responses, Wikipedia pages [edit item]
Collected data time dimension: Cross-sectional [edit item]
Unit of analysis: Article [edit item]
Wikipedia data extraction: Live Wikipedia [edit item]
Wikipedia page type: Article [edit item]
Wikipedia language: Not specified [edit item]

[edit] Conclusion

"The conclusions of this article consist mainly of the results reported by users interacting with the proposed system. Advantages and disadvantages are reported."

[edit] Comments

"Wikipedia pages and other documents"


Further notes[edit]

Two conference papers are associated: "Content hole search in community-type content" and "Content hole search in community-type content using Wikipedia".

The paper could be categorized under "Other information retrieval topics".

"Data source" should be "Wikipedia page".

"Unit of analysis" is (rather than "Article") sections of the article.

"Wikipedia data extraction" is unclear. Probably Live Wikipedia

"Wikipedia language" is not specified. A figure shows English Wikipedia, but it is not clear if other language versions also work.

Facts about "Extracting content holes by comparing community-type content with Wikipedia"RDF feed
AbstractPurpose – Community-type content that are Purpose – Community-type content that are social network services and blogs are maintained by communities of people. Occasionally, community members do not understand the nature of the content from multiple perspectives, and so the volume of information is often inadequate. The authors thus consider it necessary to present users with missing information. The purpose of this paper is to search for the content “hole” where users of community-type content missed information.

Design/methodology/approach – The proposed content hole is defined as different information that is obtained by comparing community-type content with other content, such as other community-type content, other conventional web content, and real-world content. The paper suggests multiple types of content holes and proposes a system that compares community-type content with Wikipedia articles and identifies the content hole. The paper first identifies structured keywords from the community-type content, and extracts target articles from Wikipedia using the keywords. It then extracts other related articles from Wikipedia using the link graph. Finally, it compares community-type content with the articles in Wikipedia and extracts and presents content holes.

Findings – Information retrieval looks for similar data. In contrast, a content-hole search looks for information that is different. This paper defines the type of content hole on the basis of viewpoints. The proposed viewpoints are coverage, detail, semantics, and reputation.

Originality/value – The paper proposes a system for extracting coverage content holes. The system compares community-type content with Wikipedia and extracts content holes in the community-type content.
ntent holes in the community-type content.
Added by wikilit teamYes +
Collected data time dimensionCross-sectional +
CommentsWikipedia pages and other documents
ConclusionThe conclusions of this article consist mainly of the results reported by users interacting with the proposed system. Advantages and disadvantages are reported.
Data sourceExperiment responses + and Wikipedia pages +
Doi10.1108/17440081011070178 +
Google scholar urlhttp://scholar.google.com/scholar?ie=UTF-8&q=%22Extracting%2Bcontent%2Bholes%2Bby%2Bcomparing%2Bcommunity-type%2Bcontent%2Bwith%2BWikipedia%22 +
Has authorAkiyo Nadamoto +, Eiji Aramaki +, Takeshi Abekawa + and Yohei Murakami +
Has domainComputer science +
Has topicReading support +
Issue3 +
MonthAugust +
Pages248-260 +
Peer reviewedYes +
Publication typeJournal article +
Published inInternational Journal of Web Information Systems +
PublisherEmerald Group Publishing +
Research designExperiment +
Research questionsCommunity-type content that are social netCommunity-type content that are social network services and blogs are maintained by communities of people. Occasionally, community members do not understand the nature of the content from multiple perspectivesm and so the volume of information is often inadequate. The authors thus consider it necessary to present users with missing information. The purpose of this paper is to search for the content gile where users of communty-type content missed information... The paper first identifies structured keywords from the community-type content, and extracts target articles from Wikipedia using the keywords. It then extracts other related articles from wikipedia using the link graph. Finally, it compares community-type content with the articles in wikipedia and extracts and present content holes.ia and extracts and present content holes.
Revid11,130 +
TheoriesUndetermined
Theory typeDesign and action +
TitleExtracting content holes by comparing community-type content with Wikipedia
Unit of analysisArticle +
Urlhttp://www.emeraldinsight.com/journals.htm?articleid=1881407&show=abstract +
Volume6 +
Wikipedia coverageSample data +
Wikipedia data extractionLive Wikipedia +
Wikipedia languageNot specified +
Wikipedia page typeArticle +
Year2010 +