de.hpi.fgis.voidgen.hadoop.tasks
Class DistinctClustering

java.lang.Object
  extended by de.hpi.fgis.voidgen.hadoop.Driver
      extended by de.hpi.fgis.voidgen.hadoop.tasks.DistinctClustering
All Implemented Interfaces:
org.apache.hadoop.conf.Configurable, org.apache.hadoop.util.Tool

public class DistinctClustering
extends Driver

Selects a single concept type for each subject and assigns the resource to the respective data set.

The following table lists the properties for this task. Properties not marked as optional must be set.

property name | description | example value
de.hpi.fgis.voidgen.hadoop.tasks.DistinctClustering.input_paths | The input path containing RDF quadruples. | voidGen/input3
de.hpi.fgis.voidgen.hadoop.tasks.DistinctClustering.temporary_path | Temporary output for MapReduce jobs. | voidGen/distinct_temp
de.hpi.fgis.voidgen.hadoop.tasks.DistinctClustering.output_path | Output of the Distinct Clustering job. | voidGen/distinct_clustering
de.hpi.fgis.voidgen.hadoop.tasks.distinctclustering.TypeStatisticsReducer.threshold | Optional; defaults to 0. The number of appearances a type must exceed to be considered the most fitting type for a subject. | 0
de.hpi.fgis.voidgen.hadoop.tasks.distinctclustering.TypeSelectionReducer.desired_cluster_size | Optional; defaults to Integer.MAX_VALUE. The number of subjects assumed to be optimal for forming a cluster. Use smaller values to prevent overly common types from being chosen. | 100000
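
One possible reading of how the threshold and desired_cluster_size properties interact is sketched below. This is an illustration only, not the actual reducer code: the class TypeSelectionSketch, the method selectType, and the rule "pick the qualifying type whose count is closest to the desired cluster size" are assumptions made for this example.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Optional;

// Hypothetical sketch; not part of de.hpi.fgis.voidgen.
class TypeSelectionSketch {

    /**
     * Picks one type for a subject.
     * typeCounts maps each candidate type to an (assumed) global count.
     * A type qualifies only if its count strictly exceeds the threshold.
     * Among qualifying types, the one whose count is closest to
     * desiredClusterSize wins (assumed tie-break: first one encountered).
     */
    static Optional<String> selectType(Map<String, Long> typeCounts,
                                       long threshold,
                                       long desiredClusterSize) {
        String best = null;
        long bestDistance = Long.MAX_VALUE;
        for (Map.Entry<String, Long> entry : typeCounts.entrySet()) {
            long count = entry.getValue();
            if (count <= threshold) {
                continue; // must exceed the threshold, not merely reach it
            }
            long distance = Math.abs(count - desiredClusterSize);
            if (distance < bestDistance) {
                best = entry.getKey();
                bestDistance = distance;
            }
        }
        return Optional.ofNullable(best);
    }

    public static void main(String[] args) {
        Map<String, Long> counts = new LinkedHashMap<>();
        counts.put("Thing", 5_000_000L);
        counts.put("Person", 120_000L);
        counts.put("Politician", 3L);

        // With the documented defaults the largest type is favoured.
        System.out.println(selectType(counts, 0, Integer.MAX_VALUE));
        // A smaller desired cluster size steers away from very common types.
        System.out.println(selectType(counts, 0, 100_000));
    }
}
```

Under this reading, the default of Integer.MAX_VALUE favours the most frequent type, while a value such as 100000 moves the choice toward more specific types.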

Author:
Martin Linkhorst, Hasso Plattner Institute at University of Potsdam, Germany
Johannes Gosda, Hasso Plattner Institute at University of Potsdam, Germany
See Also:
distinct clustering

Constructor Summary
DistinctClustering()
           
 
Method Summary
 int run(java.lang.String[] args)
           
 
Methods inherited from class de.hpi.fgis.voidgen.hadoop.Driver
getConf, getPath, getPaths, setConf
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

DistinctClustering

public DistinctClustering()
Method Detail

run

public int run(java.lang.String[] args)
        throws java.lang.Exception
Throws:
java.lang.Exception
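
Because the class implements org.apache.hadoop.util.Tool, it is presumably launched through Hadoop's ToolRunner, whose generic option parser accepts -D property=value pairs. The sketch below only assembles such an argument list from the property names and example values in the table above; it does not call Hadoop. The class DistinctClusteringArgs and its helper methods are hypothetical, and whether run expects any further positional arguments is not documented here.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper; not part of de.hpi.fgis.voidgen.
class DistinctClusteringArgs {

    static final String TASK =
            "de.hpi.fgis.voidgen.hadoop.tasks.DistinctClustering";
    static final String SELECTION_REDUCER =
            "de.hpi.fgis.voidgen.hadoop.tasks.distinctclustering.TypeSelectionReducer";

    /** Property names and example values copied from the table above. */
    static Map<String, String> exampleProperties() {
        Map<String, String> props = new LinkedHashMap<>();
        props.put(TASK + ".input_paths", "voidGen/input3");
        props.put(TASK + ".temporary_path", "voidGen/distinct_temp");
        props.put(TASK + ".output_path", "voidGen/distinct_clustering");
        props.put(SELECTION_REDUCER + ".desired_cluster_size", "100000");
        return props;
    }

    /** Turns each property into a "-D", "name=value" argument pair. */
    static List<String> toGenericOptions(Map<String, String> properties) {
        List<String> args = new ArrayList<>();
        for (Map.Entry<String, String> entry : properties.entrySet()) {
            args.add("-D");
            args.add(entry.getKey() + "=" + entry.getValue());
        }
        return args;
    }

    public static void main(String[] unused) {
        // Prints the arguments one could pass on to ToolRunner or `hadoop jar`.
        System.out.println(String.join(" ", toGenericOptions(exampleProperties())));
    }
}
```

With Hadoop on the classpath, the resulting list could be converted to an array and handed to ToolRunner.run(new DistinctClustering(), args), assuming the driver reads these properties from its Configuration.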