This is a subset of the YAGO 4 knowledge base (paper), which is based on Wikidata; the version used is from February 24, 2020. This dataset includes only the fact annotations in RDF-star, that is, facts about facts. Each stream element corresponds to one item in Wikidata.
This README is a snapshot of documentation for the latest development version of the dataset. Full documentation for all versions can be found on the website.
- Title: YAGO annotated facts (en)
- Identifier: yago-annotated-facts
- Has version: dev
- Theme:
- Encyclopaedia (eurovoc:4137)
- Metadata (eurovoc:c_40f54e0c)
- Open data (eurovoc:c_5ea6e5c4)
- Creator:
- The creators and contributors of Wikidata (1)
- Name: The creators and contributors of Wikidata
- Homepage: https://www.wikidata.org/
- The YAGO team of Télécom Paris and the Max Planck Institute for Informatics (2)
- Name: The YAGO team of Télécom Paris and the Max Planck Institute for Informatics
- Homepage: https://yago-knowledge.org/contributors
- Piotr Sowiński (3)
- Name: Piotr Sowiński
- Nickname: Ostrzyciel
- Homepage:
- License: https://spdx.org/licenses/CC-BY-SA-3.0
- Source:
- Date Issued: 2023-04-30
- Date Modified: 2024-08-29
- Landing page: yago-annotated-facts (dev)
- Conforms To: Metadata (https://w3id.org/riverbench/schema/metadata)
- Has stream type usage:
- RDF stream type usage (1)
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a flattened stream of triples. (en)
- Has stream type: Flat RDF triple stream (stax:flatTripleStream)
- RDF stream type usage (2)
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a stream of graphs. Each graph corresponds to the RDF-star annotations of one Wikidata item. (en)
- Has stream type: RDF subject graph stream (stax:subjectGraphStream)
- Has stream element count: 617,768
- Has stream element split:
- Type: Stream elements split by topic (rb:TopicStreamElementSplit)
- Comment: Every stream element corresponds to one Wikidata item. (en)
- Has subject shape:
- Comment: Custom target – subject of any quoted triple in the subject position. (en)
- Target custom: YAGO annotated facts target (rb:yagoTarget)
- Uses vocabulary: http://schema.org/
- Conforms to W3C RDF 1.1 specification: no
- Conforms to W3C RDF-star draft specification as of December 17, 2021: yes
- Uses generalized triples: no
- Uses generalized RDF datasets: no
- Uses RDF-star: yes
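The subject shape above targets the subject of any quoted triple that itself appears in the subject position. As a rough illustration, the sketch below pulls that inner subject out of a single N-Triples-star line with a regular expression. The function name `quoted_subject` and the sample statement (with `example.org` IRIs) are hypothetical and not taken from the dataset, and the pattern deliberately ignores nested quoted triples; a full RDF-star parser would be the safer choice for real processing.

```python
import re
from typing import Optional

# Matches a statement whose subject is a quoted triple ("<< s p o >>") and
# captures the subject of that quoted triple: an IRI or a blank node label.
# Nested quoted triples ("<< << ... >> ... >>") are intentionally not matched.
_QUOTED_SUBJECT = re.compile(r"^\s*<<\s*(<[^<>\s]*>|_:[^\s]+)\s")


def quoted_subject(line: str) -> Optional[str]:
    """Return the inner subject of a quoted-triple subject, or None."""
    match = _QUOTED_SUBJECT.match(line)
    return match.group(1) if match else None


# Hypothetical sample statement, written only to illustrate the syntax.
sample = (
    "<< <http://example.org/Alice> <http://schema.org/spouse> "
    "<http://example.org/Bob> >> "
    '<http://schema.org/startDate> "2001-01-01" .'
)

if __name__ == "__main__":
    print(quoted_subject(sample))
```

Grouping statements by the value this function returns is one way to approximate the per-item stream elements from a flat file, assuming each item's annotations share the same inner subject.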
- Title: Full stream distribution
- Identifier: stream-full
- Has file name: stream_full.tar.gz
- Has stream type usage:
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a stream of graphs. Each graph corresponds to the RDF-star annotations of one Wikidata item. (en)
- Has stream type: RDF subject graph stream (stax:subjectGraphStream)
- Has distribution type:
- Full distribution (rb:fullDistribution)
- Stream distribution (rb:streamDistribution)
- Has stream element count: 617,768
- Byte size: 36.2 MB
- Media type: text/turtle
- Packaging format: application/tar
- Compression format: application/gzip
- Download URL: https://w3id.org/riverbench/datasets/yago-annotated-facts/dev/files/stream_full.tar.gz
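The stream distribution is described above as Turtle packaged with tar and compressed with gzip. A minimal sketch for walking such an archive with Python's standard library follows. It assumes that every regular file in the archive holds one stream element encoded as UTF-8 and that the archive order is the stream order; neither assumption is stated in this README, and the function name `iter_stream_elements` is made up for the example.

```python
import tarfile
from typing import Iterator, Tuple


def iter_stream_elements(archive_path: str) -> Iterator[Tuple[str, str]]:
    """Yield (member name, Turtle text) for each regular file in the archive."""
    # "r|gz" opens the archive as a forward-only stream, so members are read
    # sequentially without extracting anything to disk.
    with tarfile.open(archive_path, mode="r|gz") as archive:
        for member in archive:
            if not member.isfile():
                continue  # skip directories and other non-file entries
            handle = archive.extractfile(member)
            if handle is None:
                continue
            # Assumption: the payload is UTF-8 encoded Turtle text.
            yield member.name, handle.read().decode("utf-8")
```

Each yielded text could then be handed to an RDF-star-aware Turtle parser of your choice.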
- Title: Full Jelly distribution
- Identifier: jelly-full
- Has file name: jelly_full.jelly.gz
- Has stream type usage:
- RDF stream type usage (1)
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a stream of graphs. Each graph corresponds to the RDF-star annotations of one Wikidata item. (en)
- Has stream type: RDF subject graph stream (stax:subjectGraphStream)
- RDF stream type usage (2)
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a flattened stream of triples. (en)
- Has stream type: Flat RDF triple stream (stax:flatTripleStream)
- Has distribution type:
- Full distribution (rb:fullDistribution)
- Jelly distribution (rb:jellyDistribution)
- Has stream element count: 617,768
- Byte size: 28.9 MB
- Media type: application/x-jelly-rdf
- Compression format: application/gzip
- Download URL: https://w3id.org/riverbench/datasets/yago-annotated-facts/dev/files/jelly_full.jelly.gz
- Title: Full flat distribution
- Identifier: flat-full
- Has file name: flat_full.nt.gz
- Has stream type usage:
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a flattened stream of triples. (en)
- Has stream type: Flat RDF triple stream (stax:flatTripleStream)
- Has distribution type:
- Flat distribution (rb:flatDistribution)
- Full distribution (rb:fullDistribution)
- Has stream element count: 617,768
- Byte size: 28.7 MB
- Media type: application/n-triples
- Compression format: application/gzip
- Download URL: https://w3id.org/riverbench/datasets/yago-annotated-facts/dev/files/flat_full.nt.gz
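Because the flat distribution is a gzip-compressed, line-oriented file, it can be scanned without loading it into memory. The sketch below counts statements and, separately, statements whose subject begins with `<<` (a quoted triple). It assumes one statement per line, as in N-Triples; the function name `count_statements` is hypothetical. The resulting statement count should not be expected to match the stream element count listed above, since one element may hold several statements.

```python
import gzip
from typing import Tuple


def count_statements(path: str) -> Tuple[int, int]:
    """Return (total statements, statements with a quoted-triple subject)."""
    total = 0
    annotated = 0
    # Text mode decompresses on the fly and yields one line at a time.
    with gzip.open(path, mode="rt", encoding="utf-8") as lines:
        for line in lines:
            stripped = line.strip()
            if not stripped or stripped.startswith("#"):
                continue  # skip blank lines and comments
            total += 1
            if stripped.startswith("<<"):
                annotated += 1
    return total, annotated
```

For example, `count_statements("flat_10K.nt.gz")` would scan a locally downloaded copy of the smallest flat distribution.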
- Title: 100K elements stream distribution
- Identifier: stream-100k
- Has file name: stream_100K.tar.gz
- Has stream type usage:
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a stream of graphs. Each graph corresponds to the RDF-star annotations of one Wikidata item. (en)
- Has stream type: RDF subject graph stream (stax:subjectGraphStream)
- Has distribution type:
- Partial distribution (rb:partialDistribution)
- Stream distribution (rb:streamDistribution)
- Has stream element count: 100,000
- Byte size: 3.6 MB
- Media type: text/turtle
- Packaging format: application/tar
- Compression format: application/gzip
- Download URL: https://w3id.org/riverbench/datasets/yago-annotated-facts/dev/files/stream_100K.tar.gz
- Title: 100K elements Jelly distribution
- Identifier: jelly-100k
- Has file name: jelly_100K.jelly.gz
- Has stream type usage:
- RDF stream type usage (1)
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a flattened stream of triples. (en)
- Has stream type: Flat RDF triple stream (stax:flatTripleStream)
- RDF stream type usage (2)
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a stream of graphs. Each graph corresponds to the RDF-star annotations of one Wikidata item. (en)
- Has stream type: RDF subject graph stream (stax:subjectGraphStream)
- Has distribution type:
- Jelly distribution (rb:jellyDistribution)
- Partial distribution (rb:partialDistribution)
- Has stream element count: 100,000
- Byte size: 2.7 MB
- Media type: application/x-jelly-rdf
- Compression format: application/gzip
- Download URL: https://w3id.org/riverbench/datasets/yago-annotated-facts/dev/files/jelly_100K.jelly.gz
- Title: 100K elements flat distribution
- Identifier: flat-100k
- Has file name: flat_100K.nt.gz
- Has stream type usage:
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a flattened stream of triples. (en)
- Has stream type: Flat RDF triple stream (stax:flatTripleStream)
- Has distribution type:
- Flat distribution (rb:flatDistribution)
- Partial distribution (rb:partialDistribution)
- Has stream element count: 100,000
- Byte size: 2.4 MB
- Media type: application/n-triples
- Compression format: application/gzip
- Download URL: https://w3id.org/riverbench/datasets/yago-annotated-facts/dev/files/flat_100K.nt.gz
- Title: 10K elements stream distribution
- Identifier: stream-10k
- Has file name: stream_10K.tar.gz
- Has stream type usage:
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a stream of graphs. Each graph corresponds to the RDF-star annotations of one Wikidata item. (en)
- Has stream type: RDF subject graph stream (stax:subjectGraphStream)
- Has distribution type:
- Partial distribution (rb:partialDistribution)
- Stream distribution (rb:streamDistribution)
- Has stream element count: 10,000
- Byte size: 376.5 KB
- Media type: text/turtle
- Packaging format: application/tar
- Compression format: application/gzip
- Download URL: https://w3id.org/riverbench/datasets/yago-annotated-facts/dev/files/stream_10K.tar.gz
- Title: 10K elements Jelly distribution
- Identifier: jelly-10k
- Has file name: jelly_10K.jelly.gz
- Has stream type usage:
- RDF stream type usage (1)
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a flattened stream of triples. (en)
- Has stream type: Flat RDF triple stream (stax:flatTripleStream)
- RDF stream type usage (2)
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a stream of graphs. Each graph corresponds to the RDF-star annotations of one Wikidata item. (en)
- Has stream type: RDF subject graph stream (stax:subjectGraphStream)
- Has distribution type:
- Jelly distribution (rb:jellyDistribution)
- Partial distribution (rb:partialDistribution)
- Has stream element count: 10,000
- Byte size: 260.8 KB
- Media type: application/x-jelly-rdf
- Compression format: application/gzip
- Download URL: https://w3id.org/riverbench/datasets/yago-annotated-facts/dev/files/jelly_10K.jelly.gz
- Title: 10K elements flat distribution
- Identifier: flat-10k
- Has file name: flat_10K.nt.gz
- Has stream type usage:
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a flattened stream of triples. (en)
- Has stream type: Flat RDF triple stream (stax:flatTripleStream)
- Has distribution type:
- Flat distribution (rb:flatDistribution)
- Partial distribution (rb:partialDistribution)
- Has stream element count: 10,000
- Byte size: 256.7 KB
- Media type: application/n-triples
- Compression format: application/gzip
- Download URL: https://w3id.org/riverbench/datasets/yago-annotated-facts/dev/files/flat_10K.nt.gz
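The nine download URLs listed above follow one naming pattern: a distribution kind (`stream`, `jelly`, `flat`), a size (`10K`, `100K`, `full`), and a kind-specific extension. The sketch below reconstructs the URLs from that observed pattern. The helper name `download_url` is hypothetical, and the pattern is inferred only from the URLs in this README, so it may not hold for other datasets or versions.

```python
# Base path shared by all download URLs listed in this README (dev version).
BASE_URL = "https://w3id.org/riverbench/datasets/yago-annotated-facts/dev/files"

# File extension per distribution kind, as seen in the listed file names.
EXTENSIONS = {"stream": "tar.gz", "jelly": "jelly.gz", "flat": "nt.gz"}

# Sizes available for this dataset according to the listing above.
SIZES = ("10K", "100K", "full")


def download_url(kind: str, size: str) -> str:
    """Build the download URL for one distribution, e.g. ("flat", "10K")."""
    if kind not in EXTENSIONS:
        raise ValueError(f"unknown distribution kind: {kind!r}")
    if size not in SIZES:
        raise ValueError(f"unknown distribution size: {size!r}")
    return f"{BASE_URL}/{kind}_{size}.{EXTENSIONS[kind]}"


if __name__ == "__main__":
    for kind in EXTENSIONS:
        for size in SIZES:
            print(download_url(kind, size))
```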