Package de.hpi.fgis.voidgen.hadoop.tasks.distinctclustering

Contains the classes for creating distinct clustering for void:datasets.

See:
          Description

Class Summary
LeftPartComparator The comparator only comparing the left parts of pairs.
LeftPartPartitioner Partitions the data only dependent on the real-key part of the whole key.
TypeSelectionMapper Identity Mapper
TypeSelectionReducer For each subject selects the type which number of appearances is closest to the desired cluster size.
TypeStatisticsMapper Reads RDF quadruples and outputs for each quadruple where the predicate is rdf:type the subjects of this type and an integer for counting the appearances of the predicate.
TypeStatisticsReducer Sums up all appearances of a type.
 

Package de.hpi.fgis.voidgen.hadoop.tasks.distinctclustering Description

Contains the classes for creating distinct clustering for void:datasets.