de.hpi.fgis.voidgen.hadoop.tasks
Class ClusterDescription

java.lang.Object
  extended by de.hpi.fgis.voidgen.hadoop.Driver
      extended by de.hpi.fgis.voidgen.hadoop.tasks.ClusterDescription
All Implemented Interfaces:
org.apache.hadoop.conf.Configurable, org.apache.hadoop.util.Tool

public class ClusterDescription
extends Driver

Generates the textual description of clusters.

The following table lists the properties necessary to set.

property name description example value
de.hpi.fgis.voidgen.hadoop.tasks.ClusterDescription.input_paths The input paths containing RDF quadruples with subject and object cluster set. voidGen/joined
de.hpi.fgis.voidgen.hadoop.tasks.ClusterDescription.temporary_path The path for temporary MapReduce output. voidGen/temporary_path
de.hpi.fgis.voidgen.hadoop.tasks.ClusterDescription.output_path The output path for the created cluster descriptions. voidGen/descriptions
de.hpi.fgis.voidgen.hadoop.tasks.clusterinformation.ClusterInfoStep1Reducer.max_subjects Optional. The default value is '100000'. Describes how many subjects can be connected to a cluster via the same predicate until the cluster-predicate pair is too common. If the pair is too common it will be ignored. 100000
de.hpi.fgis.voidgen.hadoop.tasks.clusterinformation.ClusterInfoStep2Reducer.max_map_size Optional. The default value is '10000'. The maximum size of the map storing the subjects of a cluster. 10000
de.hpi.fgis.voidgen.hadoop.tasks.clusterinformation.ClusterInfoStep2Reducer.max_predicates Optional. The default value is '5'. The maximum number of predicates used for textual description of a cluster. 5

Author:
Johannes Gosda, Hasso Plattner Institute at University of Potsdam, Germany

Constructor Summary
ClusterDescription()
           
 
Method Summary
 int run(java.lang.String[] arg0)
           
 
Methods inherited from class de.hpi.fgis.voidgen.hadoop.Driver
getConf, getPath, getPaths, setConf
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

ClusterDescription

public ClusterDescription()
Method Detail

run

public int run(java.lang.String[] arg0)
        throws java.lang.Exception
Throws:
java.lang.Exception