de.hpi.fgis.voidgen.hadoop.tasks.clusterinformation
Class ClusterInfoPatternStep2Reducer

java.lang.Object
  extended by org.apache.hadoop.mapreduce.Reducer<org.apache.hadoop.io.Text,StringIntPair,org.apache.hadoop.io.Text,Description>
      extended by de.hpi.fgis.voidgen.hadoop.tasks.clusterinformation.ClusterInfoPatternStep2Reducer

public class ClusterInfoPatternStep2Reducer
extends org.apache.hadoop.mapreduce.Reducer<org.apache.hadoop.io.Text,StringIntPair,org.apache.hadoop.io.Text,Description>

Aggregates by the authority of a URL. Collects all pairs of position and regular expression. Stores the regular expressions in a map dependend on their position (this leads to the loss of regex because only one regex per position will be stored)

Note: If there are multiple patterns for a part and it's position for a single URL authority then only one pattern is kept.

Input

Output

Author:
Dandy Fenz, Hasso Plattner Institute at University of Potsdam, Germany, Matthias Pohl, Hasso Plattner Institute at University of Potsdam, Germany, Johannes Gosda, Hasso Plattner Institute at University of Potsdam, Germany

Nested Class Summary
 
Nested classes/interfaces inherited from class org.apache.hadoop.mapreduce.Reducer
org.apache.hadoop.mapreduce.Reducer.Context
 
Constructor Summary
ClusterInfoPatternStep2Reducer()
           
 
Method Summary
 void reduce(org.apache.hadoop.io.Text key, java.lang.Iterable<StringIntPair> values, org.apache.hadoop.mapreduce.Reducer.Context context)
           
 
Methods inherited from class org.apache.hadoop.mapreduce.Reducer
cleanup, run, setup
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

ClusterInfoPatternStep2Reducer

public ClusterInfoPatternStep2Reducer()
Method Detail

reduce

public void reduce(org.apache.hadoop.io.Text key,
                   java.lang.Iterable<StringIntPair> values,
                   org.apache.hadoop.mapreduce.Reducer.Context context)
            throws java.io.IOException,
                   java.lang.InterruptedException
Overrides:
reduce in class org.apache.hadoop.mapreduce.Reducer<org.apache.hadoop.io.Text,StringIntPair,org.apache.hadoop.io.Text,Description>
Throws:
java.io.IOException
java.lang.InterruptedException