de.hpi.fgis.dude.exec.duplicatedetection
Class CoraExec

java.lang.Object
  extended by de.hpi.fgis.dude.exec.duplicatedetection.CoraExec

public class CoraExec
extends Object

This execution class runs the naive duplicate detection algorithm on the CORA data source. Two records are similar if their titles match based on a relative Levenshtein distance of 0.8.

Author:
Matthias Pohl

Constructor Summary
CoraExec()
           
 
Method Summary
static void main(String[] args)
          Executes the naive duplicate detection on the CORA data source.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

CoraExec

public CoraExec()
Method Detail

main

public static void main(String[] args)
                 throws IOException,
                        SAXException
Executes the naive duplicate detection on the CORA data source. During the process all duplicates will be written onto the console.

Parameters:
args - No arguments will be processed.
Throws:
SAXException - If an error would occur during the XML parsing process. This exception should not be thrown since the CORA.xml can be parsed without any problems.
IOException - If an error occurs while reading from the file.


Copyright © 2011 Hasso Plattner Institute - Chair of Information Systems. All Rights Reserved.