YAGO: a large ontology from Wikipedia and WordNet

From WikiLit
Jump to: navigation, search
Publication (help)
YAGO: a large ontology from Wikipedia and WordNet
Authors: Fabian M. Suchanek, Gjergji Kasneci, Gerhard Weikum [edit item]
Citation: Journal of Web Semantics 6 (3): 203-217. 2008.
Publication type: Journal article
Peer-reviewed: Yes
Database(s):
DOI: 10.1016/j.websem.2008.06.001.
Google Scholar cites: Citations
Link(s): Paper link
Added by Wikilit team: Added on initial load
Search
Article: Google Scholar BASE PubMed
Other scholarly wikis: AcaWiki Brede Wiki WikiPapers
Web search: Bing Google Yahoo!Google PDF
Other:
Services
Format: BibTeX
YAGO: a large ontology from Wikipedia and WordNet is a publication by Fabian M. Suchanek, Gjergji Kasneci, Gerhard Weikum.


[edit] Abstract

This article presents YAGO, a large ontology with high coverage and precision. YAGO has been automatically derived from Wikipedia and WordNet. It comprises entities and relations, and currently contains more than 1.7 million entities and 15 million facts. These include the taxonomic Is-A hierarchy as well as semantic relations between entities. The facts for YAGO have been extracted from the category system and the infoboxes of Wikipedia and have been combined with taxonomic relations from WordNet. Type checking techniques help us keep YAGO's precision at 95%-as proven by an extensive evaluation study. YAGO is based on a clean logical model with a decidable consistency. Furthermore, it allows representing n-ary relations in a natural way while maintaining compatibility with RDFS. A powerful query model facilitates access to YAGO's data.

[edit] Research questions

"This article presents YAGO, a large ontology with high coverage and precision. YAGO has been automatically derived from Wikipedia and WordNet. It comprises entities and relations, and currently contains more than 1.7 million entities and 15 million facts. These include the taxonomic Is-A hierarchy as well as semantic relations between entities. The facts for YAGO have been extracted from the category system and the infoboxes of Wikipedia and have been combined with taxonomic relations fromWordNet. Type checking techniques help us keep YAGO’s precision at 95% – as proven by an extensive evaluation study. YAGO is based on a clean logical model with a decidable consistency. Furthermore, it allows representing n-ary relations in a natural way while maintaining compatibility with RDFS. A powerful query model facilitates access to YAGO’s data."

Research details

Topics: Ontology building [edit item]
Domains: Computer science [edit item]
Theory type: Design and action [edit item]
Wikipedia coverage: Sample data [edit item]
Theories: "Undetermined" [edit item]
Research design: Experiment [edit item]
Data source: Archival records, Experiment responses, Wikipedia pages [edit item]
Collected data time dimension: Cross-sectional [edit item]
Unit of analysis: Article [edit item]
Wikipedia data extraction: Dump [edit item]
Wikipedia page type: Article [edit item]
Wikipedia language: English [edit item]

[edit] Conclusion

"YAGO opens up new opportunities and challenges. On the theoretical side, we plan to investigate how the YAGO model and OWL 1.1 can be reconciled, once OWL 1.1 has been fully developed. Furthermore, the efficiency of the query engine deserves attention. On the practical side, we plan to enrich YAGO by further facts from other sources. We hope that future information extraction can profit from the knowledge that YAGO already provides – for example to do type checks or to generate seed pairs. This could result in a positive feedback loop, in which the addition of knowledge helps the extraction of new knowledge."

[edit] Comments


Further notes[edit]

Facts about "YAGO: a large ontology from Wikipedia and WordNet"RDF feed
AbstractThis article presents YAGO, a large ontoloThis article presents YAGO, a large ontology with high coverage and precision. YAGO has been automatically derived from Wikipedia and WordNet. It comprises entities and relations, and currently contains more than 1.7 million entities and 15 million facts. These include the taxonomic Is-A hierarchy as well as semantic relations between entities. The facts for YAGO have been extracted from the category system and the infoboxes of Wikipedia and have been combined with taxonomic relations from WordNet. Type checking techniques help us keep YAGO's precision at 95%-as proven by an extensive evaluation study. YAGO is based on a clean logical model with a decidable consistency. Furthermore, it allows representing n-ary relations in a natural way while maintaining compatibility with RDFS. A powerful query model facilitates access to YAGO's data.y model facilitates access to YAGO's data.
Added by wikilit teamAdded on initial load +
Collected data time dimensionCross-sectional +
ConclusionYAGO opens up new opportunities and challeYAGO opens up new opportunities and challenges. On the theoretical side,

we plan to investigate how the YAGO model and OWL 1.1 can be reconciled, once OWL 1.1 has been fully developed. Furthermore, the efficiency of the query engine deserves attention. On the practical side, we plan to enrich YAGO by further facts from other sources. We hope that future information extraction can profit from the knowledge that YAGO already provides – for example to do type checks or to generate seed pairs. This could result in a positive feedback loop, in which the addition of knowledge helps the extraction of new knowledge.dge helps the

extraction of new knowledge.
Data sourceArchival records +, Experiment responses + and Wikipedia pages +
Doi10.1016/j.websem.2008.06.001 +
Google scholar urlhttp://scholar.google.com/scholar?ie=UTF-8&q=%22YAGO%3A%2Ba%2Blarge%2Bontology%2Bfrom%2BWikipedia%2Band%2BWordNet%22 +
Has authorFabian M. Suchanek +, Gjergji Kasneci + and Gerhard Weikum +
Has domainComputer science +
Has topicOntology building +
Issue3 +
Pages203-217 +
Peer reviewedYes +
Publication typeJournal article +
Published inJournal of Web Semantics +
Research designExperiment +
Research questionsThis article presents YAGO, a large ontoloThis article presents YAGO, a large ontology with high coverage and precision.

YAGO has been automatically derived from Wikipedia and WordNet. It comprises entities and relations, and currently contains more than 1.7 million entities and 15 million facts. These include the taxonomic Is-A hierarchy as well as semantic relations between entities. The facts for YAGO have been extracted from the category system and the infoboxes of Wikipedia and have been combined with taxonomic relations fromWordNet. Type checking techniques help us keep YAGO’s precision at 95% – as proven by an extensive evaluation study. YAGO is based on a clean logical model with a decidable consistency. Furthermore, it allows representing n-ary relations in a natural way while maintaining compatibility with RDFS. A powerful query model facilitates access to YAGO’s data.y model

facilitates access to YAGO’s data.
Revid11,114 +
TheoriesUndetermined
Theory typeDesign and action +
TitleYAGO: a large ontology from Wikipedia and WordNet
Unit of analysisArticle +
Urlhttp://mx1.websemanticsjournal.org/index.php/ps/article/download/144/142 +
Volume6 +
Wikipedia coverageSample data +
Wikipedia data extractionDump +
Wikipedia languageEnglish +
Wikipedia page typeArticle +
Year2008 +