The value of everything: ranking and association with encyclopedic knowledge

From WikiLit
Revision as of 16:12, October 31, 2013 by Ochado (Talk | contribs) (Added simulation)

Jump to: navigation, search
Publication (help)
The value of everything: ranking and association with encyclopedic knowledge
Authors: Kino High Coursey [edit item]
Citation: University of North Texas  : . 2009. United States, Texas.
Publication type: Thesis
Peer-reviewed: Yes
Database(s):
DOI: Define doi.
Google Scholar cites: Not available
Link(s): Paper link
Added by Wikilit team: Added on initial load
Search
Article: Google Scholar BASE PubMed
Other scholarly wikis: AcaWiki Brede Wiki WikiPapers
Web search: Bing Google Yahoo!Google PDF
Other:
Services
Format: BibTeX
The value of everything: ranking and association with encyclopedic knowledge is a publication by Kino High Coursey.


[edit] Abstract

This dissertation describes {WikiRank}, an unsupervised method of assigning relative values to elements of a broad coverage encyclopedic information source in order to identify those entries that may be relevant to a given piece of text. The valuation given to an entry is based not on textual similarity but instead on the links that associate entries, and an estimation of the expected frequency of visitation that would be given to each entry based on those associations in context. This estimation of relative frequency of visitation is embodied in modifications to the random walk interpretation of the {PageRank} algorithm. {WikiRank} is an effective algorithm to support natural language processing applications. It is shown to exceed the performance of previous machine learning algorithms for the task of automatic topic identification, providing results comparable to that of human annotators. Second, {WikiRank} is found useful for the task of recognizing text-based paraphrases on a semantic level, by comparing the distribution of attention generated by two pieces of text using the encyclopedic resource as a common reference. Finally, {WikiRank} is shown to have the ability to use its base of encyclopedic knowledge to recognize terms from different ontologies as describing the same thing, and thus allowing for the automatic generation of mapping links between ontologies. The conclusion of this thesis is that the knowledge access heuristic" is valuable and that a ranking process based on a large encyclopedic resource can form the basis for an extendable general purpose mechanism capable of identifying relevant concepts by association which in turn can be effectively utilized for enumeration and comparison at a semantic level."

[edit] Research questions

"The primary focus of my research was to explore the use of a form of encyclopedic knowledge to aid automatic tasks. To do this I developed a method to implement a context sensitive simulation of a visitation process applied to a graph of encyclopedic knowledge."

Research details

Topics: Other natural language processing topics, Ontology building [edit item]
Domains: Information science [edit item]
Theory type: Design and action [edit item]
Wikipedia coverage: Other [edit item]
Theories: "Undetermined" [edit item]
Research design: Simulation, Statistical analysis [edit item]
Data source: [edit item]
Collected data time dimension: Cross-sectional [edit item]
Unit of analysis: Article [edit item]
Wikipedia data extraction: Clone [edit item]
Wikipedia page type: Article [edit item]
Wikipedia language: English [edit item]

[edit] Conclusion

"I developed a method to implement a context sensitive simulation of a visitation process applied to a graph of encyclopedic knowledge. The result of the process is an estimate of the frequency of accessing each entry (and by extension each concept) in an encyclopedia. The relative visitation values assigned offer what I think of as a ‚knowledge access heuristic‛: that the more often a piece of knowledge or concept is accessed the more important it is to that context. Using such a visitation simulation one can suggest better allocation of processing resources, perform analysis and inferences based on the distribution of knowledge access (allowing topic identification), and analyze the way the simulated knowledge access varied based on different stimuli (a form of semantic or topical similarity)."

[edit] Comments

"This dissertation describes WikiRank, an unsupervised method of assigning relative values to elements of a broad coverage encyclopedic information source in order to identify those entries that may be relevant to a given piece of text"


Further notes[edit]

Facts about "The value of everything: ranking and association with encyclopedic knowledge"RDF feed
AbstractThis dissertation describes {WikiRank}, anThis dissertation describes {WikiRank}, an unsupervised method of assigning relative values to elements of a broad coverage encyclopedic information source in order to identify those entries that may be relevant to a given piece of text. The valuation given to an entry is based not on textual similarity but instead on the links that associate entries, and an estimation of the expected frequency of visitation that would be given to each entry based on those associations in context. This estimation of relative frequency of visitation is embodied in modifications to the random walk interpretation of the {PageRank} algorithm. {WikiRank} is an effective algorithm to support natural language processing applications. It is shown to exceed the performance of previous machine learning algorithms for the task of automatic topic identification, providing results comparable to that of human annotators. Second, {WikiRank} is found useful for the task of recognizing text-based paraphrases on a semantic level, by comparing the distribution of attention generated by two pieces of text using the encyclopedic resource as a common reference. Finally, {WikiRank} is shown to have the ability to use its base of encyclopedic knowledge to recognize terms from different ontologies as describing the same thing, and thus allowing for the automatic generation of mapping links between ontologies. The conclusion of this thesis is that the knowledge access heuristic" is valuable and that a ranking process based on a large encyclopedic resource can form the basis for an extendable general purpose mechanism capable of identifying relevant concepts by association which in turn can be effectively utilized for enumeration and comparison at a semantic level."ation and comparison at a semantic level."
Added by wikilit teamAdded on initial load +
Collected data time dimensionCross-sectional +
CommentsThis dissertation describes WikiRank, an unsupervised method of assigning relative values to elements of a broad coverage encyclopedic information source in order to identify those entries that may be relevant to a given piece of text
ConclusionI developed a method to implement a contexI developed a method to implement a context sensitive simulation of a visitation process applied to a graph of encyclopedic knowledge. The result of the process is an estimate of the frequency of accessing each entry (and by extension each concept) in an encyclopedia. The relative visitation values assigned offer what I think of as a ‚knowledge access heuristic‛: that the more often a piece of knowledge or concept is accessed the more important it is to that context. Using such a visitation simulation one can suggest better allocation of processing resources, perform analysis and inferences based on the distribution of knowledge access (allowing topic identification), and analyze the way the simulated knowledge access varied based on different stimuli (a form of semantic or topical similarity).a form of semantic or topical similarity).
Conference locationUnited States, Texas +
Google scholar urlhttp://scholar.google.com/scholar?ie=UTF-8&q=%22The%2Bvalue%2Bof%2Beverything%3A%2Branking%2Band%2Bassociation%2Bwith%2Bencyclopedic%2Bknowledge%22 +
Has authorKino High Coursey +
Has domainInformation science +
Has topicOther natural language processing topics + and Ontology building +
Peer reviewedYes +
Publication typeThesis +
Published inUniversity of North Texas +
Research designSimulation + and Statistical analysis +
Research questionsThe primary focus of my research was to exThe primary focus of my research was to explore the use of a form of encyclopedic knowledge to aid automatic tasks. To do this I developed a method to implement a context sensitive simulation of a visitation process applied to a graph of encyclopedic knowledge.lied to a graph of encyclopedic knowledge.
Revid10,004 +
TheoriesUndetermined
Theory typeDesign and action +
TitleThe value of everything: ranking and association with encyclopedic knowledge
Unit of analysisArticle +
Urlhttp://proquest.umi.com/pqdweb?did=2005595041&Fmt=7&clientId=10306&RQT=309&VName=PQD +
Wikipedia coverageOther +
Wikipedia data extractionClone +
Wikipedia languageEnglish +
Wikipedia page typeArticle +
Year2009 +