Package de.hpi.fgis.voidgen.hadoop.tasks.clusterinformation

Contains classes for creating information to the clusters.

See:
          Description

Class Summary
ClusterInfoPatternStep1Mapper Outputs for subject and object of each quadruple each URL part of subject and object URL.
ClusterInfoPatternStep1MapperTask2 For every input quadruple where subject and object are URLs the subject and object URLs are processed.
ClusterInfoPatternStep1Reducer Generates regular expressions for the possible parts after the host of a URL.
ClusterInfoPatternStep2Mapper Identity Mapper.
ClusterInfoPatternStep2Reducer Aggregates by the authority of a URL.
ClusterInfoStep1Mapper Takes RDF quadruples and outputs key-value pairs for each quadruple.
ClusterInfoStep1MapperTask2 This Mapper reads quadruple files where each line contains an RDF quadruple in N-Quad format.
ClusterInfoStep1Reducer Collects all subject-predicate pairs of a cluster (key) and adds to every subject-predicate pair the number of subjects using this predicate in the given cluster.
ClusterInfoStep2Mapper Identity Mapper.
ClusterInfoStep2Reducer Creates sample and textual description of a cluster.
 

Package de.hpi.fgis.voidgen.hadoop.tasks.clusterinformation Description

Contains classes for creating information to the clusters. This includes creating textual descriptions for the clusters.