Yago: A core of semantic knowledge

Fabian M. Suchanek, Gjergji Kasneci, Gerhard Weikum

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

3221 Scopus citations

Abstract

We present YAGO, a light-weight and extensible ontology with high coverage and quality. YAGO builds on entities and relations and currently contains more than 1 million entities and 5 million facts. This includes the Is-A hierarchy as well as non-taxonomic relations between entities (such as HASWONPRIZE). The facts have been automatically extracted from Wikipedia and unified with WordNet, using a carefully designed combination of rule-based and heuristic methods described in this paper. The resulting knowledge base is a major step beyond WordNet: in quality by adding knowledge about individuals like persons, organizations, products, etc. with their semantic relationships - and in quantity by increasing the number of facts by more than an order of magnitude. Our empirical evaluation of fact correctness shows an accuracy of about 95%. YAGO is based on a logically clean model, which is decidable, extensible, and compatible with RDFS. Finally, we show how YAGO can be further extended by state-of-the-art information extraction techniques.

Original language: English
Title of host publication: 16th International World Wide Web Conference, WWW2007
Pages: 697-706
Number of pages: 10
State: Published - 2007
Externally published: Yes
Event: 16th International World Wide Web Conference, WWW2007 - Banff, AB, Canada
Duration: 8 May 2007 - 12 May 2007

Publication series

Name: 16th International World Wide Web Conference, WWW2007

Conference

Conference: 16th International World Wide Web Conference, WWW2007
Country/Territory: Canada
City: Banff, AB
Period: 8/05/07 - 12/05/07

Keywords

  • Wikipedia
  • WordNet
