This task generates textual descriptions of clusters.
The following table lists the properties to set. Properties marked as optional fall back to their default values when omitted.
property name | description | example value |
de.hpi.fgis.voidgen.hadoop.tasks.ClusterDescription.input_paths | The input paths containing RDF quadruples with the subject and object cluster set. | voidGen/joined |
de.hpi.fgis.voidgen.hadoop.tasks.ClusterDescription.temporary_path | The path for temporary MapReduce output. | voidGen/temporary_path |
de.hpi.fgis.voidgen.hadoop.tasks.ClusterDescription.output_path | The output path for the created cluster descriptions. | voidGen/descriptions |
de.hpi.fgis.voidgen.hadoop.tasks.clusterinformation.ClusterInfoStep1Reducer.max_subjects | Optional. The default value is '100000'. The maximum number of subjects that can be connected to a cluster via the same predicate before the cluster-predicate pair is considered too common. Pairs that are too common are ignored. | 100000 |
de.hpi.fgis.voidgen.hadoop.tasks.clusterinformation.ClusterInfoStep2Reducer.max_map_size | Optional. The default value is '10000'. The maximum size of the map storing the subjects of a cluster. | 10000 |
de.hpi.fgis.voidgen.hadoop.tasks.clusterinformation.ClusterInfoStep2Reducer.max_predicates | Optional. The default value is '5'. The maximum number of predicates used for the textual description of a cluster. | 5 |
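
The properties above can be collected as plain key-value pairs. The following is a minimal sketch, assuming they are handled like entries of a `java.util.Properties` object; how the task actually loads its configuration is not described here. The class `ClusterDescriptionConfigSketch` and the method `withDefaults` are hypothetical names used only for illustration. The sketch fills in the documented defaults for the three optional properties and rejects a configuration that lacks one of the three required paths.

```java
import java.util.List;
import java.util.Properties;
import java.util.TreeSet;

// Hypothetical helper, not part of the project: it only illustrates the table above.
public class ClusterDescriptionConfigSketch {
    public static final String TASK =
        "de.hpi.fgis.voidgen.hadoop.tasks.ClusterDescription.";
    public static final String INFO =
        "de.hpi.fgis.voidgen.hadoop.tasks.clusterinformation.";

    // The three properties without a documented default.
    public static final List<String> REQUIRED = List.of(
        TASK + "input_paths",
        TASK + "temporary_path",
        TASK + "output_path");

    // Returns the user settings layered over the documented default values.
    public static Properties withDefaults(Properties user) {
        Properties defaults = new Properties();
        defaults.setProperty(INFO + "ClusterInfoStep1Reducer.max_subjects", "100000");
        defaults.setProperty(INFO + "ClusterInfoStep2Reducer.max_map_size", "10000");
        defaults.setProperty(INFO + "ClusterInfoStep2Reducer.max_predicates", "5");

        // Lookups on 'merged' fall back to 'defaults' for keys the user did not set.
        Properties merged = new Properties(defaults);
        for (String name : user.stringPropertyNames()) {
            merged.setProperty(name, user.getProperty(name));
        }

        for (String key : REQUIRED) {
            if (merged.getProperty(key) == null) {
                throw new IllegalArgumentException("Missing required property: " + key);
            }
        }
        return merged;
    }

    public static void main(String[] args) {
        // Example values taken from the table above.
        Properties user = new Properties();
        user.setProperty(TASK + "input_paths", "voidGen/joined");
        user.setProperty(TASK + "temporary_path", "voidGen/temporary_path");
        user.setProperty(TASK + "output_path", "voidGen/descriptions");

        Properties config = withDefaults(user);
        // Print all effective settings, including defaults, in sorted order.
        for (String name : new TreeSet<>(config.stringPropertyNames())) {
            System.out.println(name + "=" + config.getProperty(name));
        }
    }
}
```

Setting only the three paths is enough in this sketch. An optional property such as `max_predicates` can be overridden by adding it to the user settings before calling `withDefaults`.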