de.hpi.fgis.dude.exec.duplicatedetection
Class CoraExec
java.lang.Object
de.hpi.fgis.dude.exec.duplicatedetection.CoraExec
public class CoraExec
- extends Object
This execution class runs the naive duplicate detection algorithm on the CORA
data source. Two records are considered similar if their titles
match under a relative Levenshtein distance threshold of 0.8.
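The title comparison described above can be illustrated with a minimal sketch. It does not reproduce the DuDe implementation: the class name RelativeLevenshtein and its methods are hypothetical, the normalization (one minus the edit distance divided by the length of the longer string) is an assumption, and 0.8 is assumed to be a minimum similarity.

```java
/** Hypothetical helper, not part of DuDe: a normalized Levenshtein comparison. */
final class RelativeLevenshtein {

    /** Classic edit distance (insert, delete, substitute) using two rolling rows. */
    static int distance(String a, String b) {
        int[] prev = new int[b.length() + 1];
        int[] curr = new int[b.length() + 1];
        for (int j = 0; j <= b.length(); j++) {
            prev[j] = j; // turning "" into b[0..j) takes j insertions
        }
        for (int i = 1; i <= a.length(); i++) {
            curr[0] = i; // turning a[0..i) into "" takes i deletions
            for (int j = 1; j <= b.length(); j++) {
                int cost = a.charAt(i - 1) == b.charAt(j - 1) ? 0 : 1;
                curr[j] = Math.min(
                        Math.min(curr[j - 1] + 1, prev[j] + 1),
                        prev[j - 1] + cost);
            }
            int[] tmp = prev; // swap rows for the next iteration
            prev = curr;
            curr = tmp;
        }
        return prev[b.length()];
    }

    /** Assumed normalization: 1.0 for identical strings, 0.0 for entirely different ones. */
    static double similarity(String a, String b) {
        int maxLen = Math.max(a.length(), b.length());
        if (maxLen == 0) {
            return 1.0; // two empty strings are treated as identical
        }
        return 1.0 - (double) distance(a, b) / maxLen;
    }

    /** Assumed reading of the threshold: titles match if similarity >= threshold. */
    static boolean titlesMatch(String a, String b, double threshold) {
        return similarity(a, b) >= threshold;
    }
}
```

Under these assumptions, RelativeLevenshtein.titlesMatch(t1, t2, 0.8) accepts two titles of equal length that differ in at most one character out of every five.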
- Author:
- Matthias Pohl
Method Summary
static void
main(String[] args)
Executes the naive duplicate detection on the CORA data source.
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
CoraExec
public CoraExec()
main
public static void main(String[] args)
throws IOException,
SAXException
- Executes the naive duplicate detection on the CORA data source. During the
process, all detected duplicates are written to the console.
- Parameters:
args
- No arguments are processed.
- Throws:
SAXException
- If an error occurs during XML parsing. This exception is not expected, since CORA.xml can be parsed
without any problems.
IOException
- If an error occurs while reading from the file.
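As a rough illustration of what main does, the sketch below assumes that "naive" means comparing every pair of records exactly once and printing each match to the console. The class NaivePairwiseSketch, its method findDuplicates, and the sample titles are hypothetical and are not taken from DuDe. A case-insensitive equality check stands in for the title similarity test.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.function.BiPredicate;

/** Hypothetical sketch of an all-pairs duplicate search; not the DuDe implementation. */
final class NaivePairwiseSketch {

    /**
     * Compares every unordered pair of titles once, i.e. n * (n - 1) / 2 comparisons,
     * and returns the index pairs {i, j} with i < j that the matcher accepts.
     */
    static List<int[]> findDuplicates(List<String> titles, BiPredicate<String, String> matcher) {
        List<int[]> pairs = new ArrayList<>();
        for (int i = 0; i < titles.size(); i++) {
            for (int j = i + 1; j < titles.size(); j++) {
                if (matcher.test(titles.get(i), titles.get(j))) {
                    pairs.add(new int[] {i, j});
                }
            }
        }
        return pairs;
    }

    public static void main(String[] args) {
        // Made-up sample titles, used only for illustration.
        List<String> titles = Arrays.asList(
                "Learning in Graphical Models",
                "learning in graphical models",
                "Reinforcement Learning");

        // Stand-in matcher (assumption): case-insensitive equality instead of
        // a relative Levenshtein comparison.
        BiPredicate<String, String> matcher = String::equalsIgnoreCase;

        // Write each detected pair to the console, as the method description states.
        for (int[] pair : findDuplicates(titles, matcher)) {
            System.out.println(titles.get(pair[0]) + " <-> " + titles.get(pair[1]));
        }
    }
}
```

The quadratic number of comparisons is what makes this approach naive. It is simple and misses no pair, but it becomes expensive as the number of records grows.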
Copyright © 2011 Hasso Plattner Institute - Chair of Information Systems. All Rights Reserved.