de.hpi.fgis.voidgen.hadoop.tasks
Class ClusteringUriBased

java.lang.Object
  extended by de.hpi.fgis.voidgen.hadoop.Driver
      extended by de.hpi.fgis.voidgen.hadoop.tasks.ClusteringUriBased
All Implemented Interfaces:
org.apache.hadoop.conf.Configurable, org.apache.hadoop.util.Tool

public class ClusteringUriBased
extends Driver

Clusters the input data based solely on the subject and object URIs. The cluster identifier is the URI's host or, if the URI has no host, its scheme.
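
The host-or-scheme rule described above can be illustrated with a minimal sketch. It assumes that `java.net.URI` is used for parsing; this page does not say how the job itself parses URIs. The class `ClusterIdSketch` and the method `clusterId` are hypothetical names for illustration only and are not part of voidGen.

```java
import java.net.URI;

// Hypothetical illustration class, not part of de.hpi.fgis.voidgen.
class ClusterIdSketch {

    // Returns the URI's host, or its scheme when the URI has no host.
    // Assumption: parsing via java.net.URI; the actual job may differ.
    static String clusterId(String uri) {
        URI parsed = URI.create(uri);
        String host = parsed.getHost();
        return host != null ? host : parsed.getScheme();
    }

    public static void main(String[] args) {
        // Hierarchical URI with a host: the host is the cluster identifier.
        System.out.println(clusterId("http://dbpedia.org/resource/Berlin")); // prints "dbpedia.org"
        // Opaque URI without a host: the scheme is the cluster identifier.
        System.out.println(clusterId("mailto:someone@example.org")); // prints "mailto"
    }
}
```

Under this rule, all subjects and objects from the same host fall into one cluster, while host-less URIs such as `mailto:` or `urn:` identifiers are grouped by scheme.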

The following table lists the properties that must be set.

property name | description | example value
de.hpi.fgis.voidgen.hadoop.tasks.ClusteringUriBased.input_paths | The paths used as input for the URI-based clustering job. | voidGen/input3
de.hpi.fgis.voidgen.hadoop.tasks.ClusteringUriBased.output_path | The path that will contain the output pairs of cluster identifier and URI. | voidGen/clustering1

Author:
Johannes Gosda, Hasso Plattner Institute at University of Potsdam, Germany

Constructor Summary
ClusteringUriBased()
           
 
Method Summary
 int run(java.lang.String[] arg0)
           
 
Methods inherited from class de.hpi.fgis.voidgen.hadoop.Driver
getConf, getPath, getPaths, setConf
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

ClusteringUriBased

public ClusteringUriBased()
Method Detail

run

public int run(java.lang.String[] arg0)
        throws java.lang.Exception
Throws:
java.lang.Exception