de.hpi.fgis.voidgen.hadoop.tasks.clusterinformation
Class ClusterInfoPatternStep2Reducer
java.lang.Object
org.apache.hadoop.mapreduce.Reducer<org.apache.hadoop.io.Text,StringIntPair,org.apache.hadoop.io.Text,Description>
de.hpi.fgis.voidgen.hadoop.tasks.clusterinformation.ClusterInfoPatternStep2Reducer
public class ClusterInfoPatternStep2Reducer
- extends org.apache.hadoop.mapreduce.Reducer<org.apache.hadoop.io.Text,StringIntPair,org.apache.hadoop.io.Text,Description>
Aggregates by the authority of a URL.
Collects all pairs of position and regular expression.
Stores the regular expressions in a map dependend on their position (this leads to the loss of regex
because only one regex per position will be stored)
Note: If there are multiple patterns for a part and it's position for a single URL authority
then only one pattern is kept.
Input
- Key: the authority of the URL
- Value: a pair
of the position of the URL part and
a regular expression describing the possible URL parts at this position
Output
- Key: the authority of the URL
- Value: a list of regular expression belonging to the parts of the URL
- Author:
- Dandy Fenz, Hasso Plattner Institute at University of Potsdam, Germany, Matthias Pohl, Hasso Plattner Institute at University of Potsdam, Germany, Johannes Gosda, Hasso Plattner Institute at University of Potsdam, Germany
Nested classes/interfaces inherited from class org.apache.hadoop.mapreduce.Reducer |
org.apache.hadoop.mapreduce.Reducer.Context |
Method Summary |
void |
reduce(org.apache.hadoop.io.Text key,
java.lang.Iterable<StringIntPair> values,
org.apache.hadoop.mapreduce.Reducer.Context context)
|
Methods inherited from class org.apache.hadoop.mapreduce.Reducer |
cleanup, run, setup |
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
ClusterInfoPatternStep2Reducer
public ClusterInfoPatternStep2Reducer()
reduce
public void reduce(org.apache.hadoop.io.Text key,
java.lang.Iterable<StringIntPair> values,
org.apache.hadoop.mapreduce.Reducer.Context context)
throws java.io.IOException,
java.lang.InterruptedException
- Overrides:
reduce
in class org.apache.hadoop.mapreduce.Reducer<org.apache.hadoop.io.Text,StringIntPair,org.apache.hadoop.io.Text,Description>
- Throws:
java.io.IOException
java.lang.InterruptedException