de.hpi.fgis.dude.algorithm.recordlinkage
Class NaiveRecordLinkage

java.lang.Object
  extended by de.hpi.fgis.dude.util.AbstractCleanable
      extended by de.hpi.fgis.dude.algorithm.AbstractAlgorithm
          extended by de.hpi.fgis.dude.algorithm.AbstractRecordLinkage
              extended by de.hpi.fgis.dude.algorithm.recordlinkage.NaiveRecordLinkage
All Implemented Interfaces:
Algorithm, Cleanable, AutoJsonable, Jsonable, Iterable<DuDeObjectPair>

public class NaiveRecordLinkage
extends AbstractRecordLinkage

NaiveRecordLinkage implements the naive approach to record linkage: each element of a DataSource is paired with each element of every other DataSource. Elements of the same DataSource are never paired with each other.

Example:

A    B    C
a1   b1   c1
a2   b2
a3

Each column lists the elements of one DataSource (A, B, and C). This input generates the following pairs:
  1. (a1, b1)
  2. (a1, b2)
  3. (a1, c1)
  4. (a2, b1)
  5. (a2, b2)
  6. (a2, c1)
  7. (a3, b1)
  8. (a3, b2)
  9. (a3, c1)
  10. (b1, c1)
  11. (b2, c1)
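
The pairing scheme in this example can be illustrated with a small standalone sketch. It is not the DuDe implementation: the class NaivePairingSketch and the method crossSourcePairs are hypothetical names, plain strings stand in for DuDeObject instances, and the loop order is chosen only so that the output matches the listing above.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration class; not part of the DuDe library.
class NaivePairingSketch {

    /**
     * Pairs every element of each source with every element of all
     * later sources. Elements of the same source are never paired.
     */
    static List<String> crossSourcePairs(List<List<String>> sources) {
        List<String> pairs = new ArrayList<>();
        for (int i = 0; i < sources.size(); i++) {
            for (String left : sources.get(i)) {
                // Only sources after the current one, so each pair appears once.
                for (int j = i + 1; j < sources.size(); j++) {
                    for (String right : sources.get(j)) {
                        pairs.add("(" + left + ", " + right + ")");
                    }
                }
            }
        }
        return pairs;
    }

    public static void main(String[] args) {
        List<List<String>> sources = Arrays.asList(
                Arrays.asList("a1", "a2", "a3"),  // source A
                Arrays.asList("b1", "b2"),        // source B
                Arrays.asList("c1"));             // source C
        for (String pair : crossSourcePairs(sources)) {
            System.out.println(pair);
        }
    }
}
```

For the three sources above, the sketch produces 3*2 + 3*1 + 2*1 = 11 pairs. In general the number of pairs is the sum of the products of the sizes of every two distinct sources, which grows quadratically with the total amount of data.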

Author:
Matthias Pohl

Nested Class Summary
protected static class NaiveRecordLinkage.NaiveRecordLinkageIterator
          NaiveRecordLinkageIterator implements the actual functionality of the naive record-linkage approach.
 
Nested classes/interfaces inherited from class de.hpi.fgis.dude.algorithm.AbstractAlgorithm
AbstractAlgorithm.AlgorithmIteratorWrapper
 
Constructor Summary
NaiveRecordLinkage()
           
 
Method Summary
protected  Iterator<DuDeObjectPair> createIteratorInstance()
          Returns a new Iterator instance.
 
Methods inherited from class de.hpi.fgis.dude.algorithm.AbstractRecordLinkage
addSource, dataSourceAttached, equals, fromJson, getData, getData, getDataSize, getMaximumPairCount, hashCode, iterator, preprocessData, toJson, unregisterDataSources
 
Methods inherited from class de.hpi.fgis.dude.algorithm.AbstractAlgorithm
addDataSource, addPreprocessor, addPreprocessor, analyzeDuDeObject, createStorage, dataExtracted, disableInMemoryProcessing, enableInMemoryProcessing, finishExtraction, finishPreprocessing, forceExtraction, getDataSize, getExtractedData, inMemoryProcessingEnabled
 
Methods inherited from class de.hpi.fgis.dude.util.AbstractCleanable
cleanUp, registerCleanable, registerCloseable
 
Methods inherited from class java.lang.Object
clone, finalize, getClass, notify, notifyAll, toString, wait, wait, wait
 
Methods inherited from interface de.hpi.fgis.dude.util.Cleanable
cleanUp, registerCleanable, registerCloseable
 

Constructor Detail

NaiveRecordLinkage

public NaiveRecordLinkage()

Method Detail

createIteratorInstance

protected Iterator<DuDeObjectPair> createIteratorInstance()
Description copied from class: AbstractRecordLinkage
Returns a new Iterator instance.

Specified by:
createIteratorInstance in class AbstractRecordLinkage
Returns:
The Iterator instance.


Copyright © 2011 Hasso Plattner Institute - Chair of Information Systems. All Rights Reserved.