voidGen documentation

Packages
de.hpi.fgis.voidgen.hadoop Contains the entry point for running the different tasks for generating VoID descriptions.
de.hpi.fgis.voidgen.hadoop.closure Contains the classes for computing the transitive closure of graph-based data with Hadoop MapReduce, using the transitive closure algorithm developed for the BTC 2010 by Dandy Fenz and Matthias Pohl.
de.hpi.fgis.voidgen.hadoop.config Contains classes for parsing Hadoop XML configuration files, including a class that converts an XML configuration file to generic options.
de.hpi.fgis.voidgen.hadoop.datatypes Collects the basic data types that many of the data types used in the different MapReduce jobs build on.
de.hpi.fgis.voidgen.hadoop.parsing Contains the classes for parsing and filtering RDF quadruples.
de.hpi.fgis.voidgen.hadoop.tasks Collects all tasks used for creating VoID descriptions.
de.hpi.fgis.voidgen.hadoop.tasks.clusterinformation Contains classes for creating information about the clusters.
de.hpi.fgis.voidgen.hadoop.tasks.clustering1 Collects the classes used for URI-based clustering.
de.hpi.fgis.voidgen.hadoop.tasks.clustering2 Contains classes for connected clustering and hierarchical clustering.
de.hpi.fgis.voidgen.hadoop.tasks.clustersize Contains classes for counting the size of each generated cluster.
de.hpi.fgis.voidgen.hadoop.tasks.datasetdescription Contains Mapper and Reducer classes for aggregating the different cluster description parts into a single description per cluster.
de.hpi.fgis.voidgen.hadoop.tasks.distinctclustering Contains the classes for creating a distinct clustering for void:datasets.
de.hpi.fgis.voidgen.hadoop.tasks.inputstatistics Contains the classes for counting the resources, subjects, predicates, objects and contexts of RDF input data.
de.hpi.fgis.voidgen.hadoop.tasks.joining Contains the classes for enriching each RDF quadruple with the cluster of the subject and the cluster of the object.
de.hpi.fgis.voidgen.hadoop.tasks.ksimilarity Contains Mapper and Reducer classes for creating fuzzy link sets via k-similarity.
de.hpi.fgis.voidgen.hadoop.tasks.linksets Contains classes for creating link sets between clusters.
de.hpi.fgis.voidgen.hadoop.tasks.vocabularydetection Contains classes for detecting vocabularies.
de.hpi.fgis.voidgen.hadoop.tests Contains classes used for debug output.
de.hpi.fgis.voidgen.hadoop.util Contains frequently used classes.
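
Example: transitive closure (illustrative sketch)
The closure package above computes the transitive closure of graph data. As a rough illustration of what that means, the following is a minimal in-memory sketch in plain Java. It is not the MapReduce algorithm by Fenz and Pohl and does not use any voidGen or Hadoop classes; the class name TransitiveClosureSketch and the method closure are hypothetical and exist only for this example. It assumes the graph fits in memory and is given as a map from each node to its direct successors.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical illustration only; not part of voidGen.
public class TransitiveClosureSketch {

    /**
     * Returns, for every node in the input, the set of nodes reachable
     * over one or more edges. Naive fixed-point iteration: keep merging
     * successor sets until nothing changes.
     */
    public static Map<String, Set<String>> closure(Map<String, Set<String>> edges) {
        // Copy the input so the caller's graph is left untouched.
        Map<String, Set<String>> reach = new HashMap<>();
        for (Map.Entry<String, Set<String>> e : edges.entrySet()) {
            reach.put(e.getKey(), new HashSet<>(e.getValue()));
        }

        boolean changed = true;
        while (changed) {
            changed = false;
            for (Set<String> targets : reach.values()) {
                // Iterate over a snapshot because targets grows inside the loop.
                List<String> snapshot = new ArrayList<>(targets);
                for (String mid : snapshot) {
                    Set<String> next = reach.get(mid);
                    // Skip nodes without outgoing edges and self-references.
                    if (next != null && next != targets && targets.addAll(next)) {
                        changed = true;
                    }
                }
            }
        }
        return reach;
    }

    public static void main(String[] args) {
        Map<String, Set<String>> edges = new HashMap<>();
        edges.put("a", Set.of("b"));
        edges.put("b", Set.of("c"));
        edges.put("c", Set.of());
        // "a" reaches "b" directly and "c" through "b".
        System.out.println(closure(edges).get("a"));
    }
}
```

A distributed implementation would typically replace the inner loop with repeated join-style MapReduce rounds until no new pairs are produced; the fixed-point idea stays the same.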