Crawled from the Wikipedia knowledge base.
 Extracted from linked open data on famous persons; stored in relational format.
 Generated using our own db-tesma data generator (binaries for Windows 32 Bit).
 Streamed anonymized web log data from Plista.
 Generated using the dbgen data generator (binaries for Debian 64 Bit DB2).
 Obtained by partitioning the Freebase triples from the BTC 2012 dataset by their predicate.
 Generated using this generator.
*Numbers in brackets are the numbers we got when these datasets used for the experimental comparison among IND algorithms with NULL equal semantic