[1] Crawled from the Wikipedia knowledge base.
[2] Extracted from linked open data on famous persons; stored in relational format.
[3] Generated using our own db-tesma data generator (binaries for Windows 32-bit).
[4] Streamed anonymized web log data from Plista.
[5] Generated using the dbgen data generator (binaries for Debian 64-bit, DB2).
[6] Obtained by partitioning the Freebase triples from the BTC 2012 dataset by their predicate.
[7] Generated using this generator.
*Numbers in brackets are the values we obtained when these datasets were used for the experimental comparison of IND algorithms under NULL-equals-NULL semantics.