A B C D E F G H I J K L M N O P Q R S T U V W X Z

A

abortIncrease(boolean) - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.DuplicateCountSNM.AdaptiveWindowSizeSNMBuilder
Determines whether the continuous increase of the window size may be aborted.
abortThreshold(float) - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.DuplicateCountSNM.AdaptiveWindowSizeSNMBuilder
Sets the threshold for aborting the continuous increase of the window size.
AbsoluteNumberDiffFunction - Class in de.hpi.fgis.dude.similarityfunction.contentbased.impl
This SimilarityFunction implementation checks the absolute variation of the numbers of two DuDeObject attributes.
AbsoluteNumberDiffFunction() - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.AbsoluteNumberDiffFunction
Internal constructor for Jsonable deserialization.
AbsoluteNumberDiffFunction(int, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.AbsoluteNumberDiffFunction
The constructor initializes the RelativeNumberDiffFunction.
AbsoluteNumberDiffFunction(int, int, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.AbsoluteNumberDiffFunction
The constructor initializes the RelativeNumberDiffFunction.
AbstractAlgorithm - Class in de.hpi.fgis.dude.algorithm
AbstractAlgorithm implements the functionality that is needed by each algorithm type.
AbstractAlgorithm() - Constructor for class de.hpi.fgis.dude.algorithm.AbstractAlgorithm
 
AbstractAlgorithm.AlgorithmIteratorWrapper - Class in de.hpi.fgis.dude.algorithm
AlgorithmIteratorWrapper is used for setting some common properties of the generated DuDeObjectPairs.
AbstractAlgorithm.AlgorithmIteratorWrapper(Iterator<DuDeObjectPair>) - Constructor for class de.hpi.fgis.dude.algorithm.AbstractAlgorithm.AlgorithmIteratorWrapper
Initializes a AbstractAlgorithm.AlgorithmIteratorWrapper with the passed Iterator.
AbstractCleanable - Class in de.hpi.fgis.dude.util
AbstractCleanable is implemented by classes that collect Closeable instances that shall be closed at the end of a process.
AbstractCleanable() - Constructor for class de.hpi.fgis.dude.util.AbstractCleanable
 
AbstractDataSource<S extends AbstractDataSource<S>> - Class in de.hpi.fgis.dude.datasource
AbstractDataSource provides the common functionality of all DataSource classes.
AbstractDataSource() - Constructor for class de.hpi.fgis.dude.datasource.AbstractDataSource
Internal constructor for Jsonable deserialization.
AbstractDataSource(String) - Constructor for class de.hpi.fgis.dude.datasource.AbstractDataSource
Initializes a AbstractDataExtractor with the passed identifier.
AbstractDataSource.AbstractDataSourceIterator<T extends AbstractDataSource<?>> - Class in de.hpi.fgis.dude.datasource
AbstractDataSourceIterator can be used to generate valid DuDeObjects.
AbstractDataSource.AbstractDataSourceIterator(T) - Constructor for class de.hpi.fgis.dude.datasource.AbstractDataSource.AbstractDataSourceIterator
Initializes an AbstractDataSourceIterator with for the passed AbstractDataSource.
AbstractDuDeObjectSorter - Class in de.hpi.fgis.dude.util.sorting.sorter
AbstractDuDeObjectSorter implements the DuDeObjectSorter interface partially.
AbstractDuDeObjectSorter() - Constructor for class de.hpi.fgis.dude.util.sorting.sorter.AbstractDuDeObjectSorter
Initializes an AbstractDuDeObjectSorter with no SortingKey.
AbstractDuDeObjectSorter(SortingKey) - Constructor for class de.hpi.fgis.dude.util.sorting.sorter.AbstractDuDeObjectSorter
Initializes an AbstractDuDeObjectSorter with the passed SortingKey.
AbstractDuDeOutput - Class in de.hpi.fgis.dude.output
AbstractDuDeOuput is an abstract class which provides the common functionality of every class that implements DuDeOutput.
AbstractDuDeOutput(OutputStream) - Constructor for class de.hpi.fgis.dude.output.AbstractDuDeOutput
Initializes a DuDeOutput with the given OutputStream.
AbstractDuDeOutput(OutputStream, String) - Constructor for class de.hpi.fgis.dude.output.AbstractDuDeOutput
Initializes a DuDeOutput with the given OutputStream.
AbstractDuDeOutput(OutputStream, String, String) - Constructor for class de.hpi.fgis.dude.output.AbstractDuDeOutput
Initializes a DuDeOutput with the given OutputStream.
AbstractDuDeOutput(File) - Constructor for class de.hpi.fgis.dude.output.AbstractDuDeOutput
Initializes a DuDeOutput with the given File.
AbstractDuDeOutput(File, String) - Constructor for class de.hpi.fgis.dude.output.AbstractDuDeOutput
Initializes a DuDeOutput with the given File.
AbstractDuDeOutput(File, String, String) - Constructor for class de.hpi.fgis.dude.output.AbstractDuDeOutput
Initializes a DuDeOutput with the given File.
AbstractDuDeOutput() - Constructor for class de.hpi.fgis.dude.output.AbstractDuDeOutput
Internal constructor for Jsonable deserialization.
AbstractDuDeStorage<T extends Jsonable> - Class in de.hpi.fgis.dude.util.data.storage
AbstractDuDeStorage stores Jsonable instances.
AbstractDuDeStorage() - Constructor for class de.hpi.fgis.dude.util.data.storage.AbstractDuDeStorage
 
AbstractDuDeStorage.AbstractJsonableWriter<T extends Jsonable> - Class in de.hpi.fgis.dude.util.data.storage
AbstractDuDeStorage.AbstractJsonableWriter implements some functionality that shall be provided by all JsonableWriter sub-classes.
AbstractDuDeStorage.AbstractJsonableWriter() - Constructor for class de.hpi.fgis.dude.util.data.storage.AbstractDuDeStorage.AbstractJsonableWriter
 
AbstractDuplicateDetection - Class in de.hpi.fgis.dude.algorithm
AbstractDuplicateDetection provides the common functionality that is needed by every duplicate-detection algorithm.
AbstractDuplicateDetection() - Constructor for class de.hpi.fgis.dude.algorithm.AbstractDuplicateDetection
Initializes a new AbstractDuplicateDetection instance.
AbstractExpander - Class in de.hpi.fgis.dude.util.bibtex.expander
 
AbstractExpander(boolean) - Constructor for class de.hpi.fgis.dude.util.bibtex.expander.AbstractExpander
 
AbstractIterator<ElementType> - Class in de.hpi.fgis.dude.util
AbstractIterator is an abstract class that should be used within all iterator-like classes.
AbstractIterator() - Constructor for class de.hpi.fgis.dude.util.AbstractIterator
 
AbstractMerger - Class in de.hpi.fgis.dude.merger
AbstractMerger splits the merge functionality into the merging of data, which should be implemented in subclasses and the merging of identifiers, which is done in this class.
AbstractMerger() - Constructor for class de.hpi.fgis.dude.merger.AbstractMerger
 
AbstractMerger - Class in de.hpi.fgis.dude.util.merger
AbstractMerger splits the merge functionality into the merging of data, which should be implemented in subclasses and the merging of identifiers, which is done in this class.
AbstractMerger() - Constructor for class de.hpi.fgis.dude.util.merger.AbstractMerger
 
AbstractRecordLinkage - Class in de.hpi.fgis.dude.algorithm
AbstractRecordLinkage provides the common functionality that is needed by every record-linkage algorithm.
AbstractRecordLinkage() - Constructor for class de.hpi.fgis.dude.algorithm.AbstractRecordLinkage
 
AbstractSimilarityFunction - Class in de.hpi.fgis.dude.similarityfunction
AbstractSimilarityFunction is a skeleton implementation for providing the common functionality of a SimilarityFunction implementation.
AbstractSimilarityFunction() - Constructor for class de.hpi.fgis.dude.similarityfunction.AbstractSimilarityFunction
 
AbstractStatisticOutput - Class in de.hpi.fgis.dude.output.statisticoutput
AbstractStatisticOuput is an abstract class that provides functionality common to most classes implementing StatisticOutput.
AbstractStatisticOutput(StatisticComponent) - Constructor for class de.hpi.fgis.dude.output.statisticoutput.AbstractStatisticOutput
Initializes a AbstractStatisticOutput with the passed StatisticComponent.
AbstractStatisticOutput() - Constructor for class de.hpi.fgis.dude.output.statisticoutput.AbstractStatisticOutput
Internal constructor for Jsonable deserialization.
AbstractSubkey - Class in de.hpi.fgis.dude.util.sorting.sortingkey
AbstractSubkey is an abstract class that should be extended by each subkey class.
AbstractSubkey(String...) - Constructor for class de.hpi.fgis.dude.util.sorting.sortingkey.AbstractSubkey
Initializes a AbstractSubkey instance with the given default attribute.
activate() - Method in enum de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent.Config
Activates usage of the current config measure.
activated - Variable in enum de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent.Config
To check if the current config measure should be used for evaluation.
activateDefault() - Method in class de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent
Activates the usage of the Default GMD as evaluation measure.
activateF1() - Method in class de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent
Activates the usage of PairwiseF1 (and by that Precision and Recall) as GMD evaluation measure.
activateHybrid() - Method in class de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent
Activates the usage of Hybrid GMD as evaluation measure.
activatePRECISION() - Method in class de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent
Activates the usage of Precision as GMD evaluation measure.
activateRECALL() - Method in class de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent
Activates the usage of Recall as GMD evaluation measure.
activateVI() - Method in class de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent
Activates the usage of the Variation of Information as GMD evaluation measure.
actualComparisonCount - Variable in class de.hpi.fgis.dude.postprocessor.StatisticComponent
 
AdaptiveSNM_Yan2007 - Class in de.hpi.fgis.dude.algorithm.duplicatedetection
Implementation of the adative Sorted Neighborhood Methods presented by Yan et.al.
AdaptiveSNM_Yan2007(AdaptiveSNM_Yan2007.AlgorithmVariant, SortingKey, float) - Constructor for class de.hpi.fgis.dude.algorithm.duplicatedetection.AdaptiveSNM_Yan2007
Initializes a AdaptiveSNM_Yan2007 instance with the passed windows size.
AdaptiveSNM_Yan2007.AA_SNM_Iterator - Class in de.hpi.fgis.dude.algorithm.duplicatedetection
Iterator implementation that implements the behavior of the Accumulatively-Adaptive Sorted-Neighborhood Method.
AdaptiveSNM_Yan2007.AA_SNM_Iterator(SortingKey, float, Iterator<DuDeObject>) - Constructor for class de.hpi.fgis.dude.algorithm.duplicatedetection.AdaptiveSNM_Yan2007.AA_SNM_Iterator
AdaptiveSNM_Yan2007.AlgorithmVariant - Enum in de.hpi.fgis.dude.algorithm.duplicatedetection
This enumeration collects the possible SNM variants.
AdaptiveSNM_Yan2007.IA_SNM_Iterator - Class in de.hpi.fgis.dude.algorithm.duplicatedetection
Iterator implementation that implements the behavior of the Incrementally-Adaptive Sorted-Neighborhood Method.
AdaptiveSNM_Yan2007.IA_SNM_Iterator(SortingKey, float, Iterator<DuDeObject>) - Constructor for class de.hpi.fgis.dude.algorithm.duplicatedetection.AdaptiveSNM_Yan2007.IA_SNM_Iterator
AdaptiveSNM_Yan2007.YanIterator - Class in de.hpi.fgis.dude.algorithm.duplicatedetection
Abstract Iterator implementation that is used by the different adaptive Sorted Neighborhood methods.
AdaptiveSNM_Yan2007.YanIterator(SortingKey, float, Iterator<DuDeObject>) - Constructor for class de.hpi.fgis.dude.algorithm.duplicatedetection.AdaptiveSNM_Yan2007.YanIterator
Constructor
AdaptiveSNMExec_Yan2007 - Class in de.hpi.fgis.dude.exec.duplicatedetection
This execution class runs the AdaptiveSNM_Yan2007 algorithm.
AdaptiveSNMExec_Yan2007() - Constructor for class de.hpi.fgis.dude.exec.duplicatedetection.AdaptiveSNMExec_Yan2007
 
add(DuDeObjectPair) - Method in class de.hpi.fgis.dude.postprocessor.NaiveTransitiveClosureGenerator
Adds a pair to the NaiveTransitiveClosureGenerator.
add(Collection<DuDeObjectPair>) - Method in class de.hpi.fgis.dude.postprocessor.NaiveTransitiveClosureGenerator
Adds a Collection of pairs to the NaiveTransitiveClosureGenerator.
add(DuDeObjectPair) - Method in class de.hpi.fgis.dude.postprocessor.NaiveTransitiveClosureGenerator.TransitiveClosure
Adds the elements of the passed pair to the transitive closure.
add(DuDeObjectPair) - Method in class de.hpi.fgis.dude.postprocessor.WarshallTransitiveClosureGenerator
Adds a pair to the WarshallClosureGenerator.
add(SimilarityFunction) - Method in class de.hpi.fgis.dude.similarityfunction.aggregators.Aggregator
Adds a SimilarityFunction to this Aggregator with no special multiplier.
add(SimilarityFunction, int) - Method in class de.hpi.fgis.dude.similarityfunction.aggregators.Aggregator
Adds a SimilarityFunction to this Aggregator with the passed multiplier.
add(BibtexPerson) - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexPersonList
Adds a BibtexPerson to this list.
add(JsonValue) - Method in class de.hpi.fgis.dude.util.data.json.JsonArray
Adds the JsonValue to the end of the JsonArray.
add(boolean) - Method in class de.hpi.fgis.dude.util.data.json.JsonArray
Adds a JsonBoolean to the end of the JsonArray.
add(int) - Method in class de.hpi.fgis.dude.util.data.json.JsonArray
Adds a JsonNumber to the end of the JsonArray.
add(long) - Method in class de.hpi.fgis.dude.util.data.json.JsonArray
Adds a JsonNumber to the end of the JsonArray.
add(double) - Method in class de.hpi.fgis.dude.util.data.json.JsonArray
Adds a JsonNumber to the end of the JsonArray.
add(String) - Method in class de.hpi.fgis.dude.util.data.json.JsonArray
Adds a JsonString to the end of the JsonArray.
add(T) - Method in interface de.hpi.fgis.dude.util.data.storage.JsonableWriter
Adds the passed value to the underlying storage.
add(String) - Method in class de.hpi.fgis.dude.util.DataSourceIdentifierManager
Adds the id to the set of identifiers that is already in use.
add(DuDeObject) - Method in class de.hpi.fgis.dude.util.sorting.sorter.AbstractDuDeObjectSorter
 
add(DuDeObject) - Method in interface de.hpi.fgis.dude.util.sorting.sorter.DuDeObjectSorter
Adds a DuDeObject to the collection that will be sorted.
add(DuDeObject) - Method in class de.hpi.fgis.dude.util.sorting.sorter.TwoPhaseMultiWayMergeSorter
 
addAll(Collection<T>) - Method in class de.hpi.fgis.dude.util.data.storage.AbstractDuDeStorage.AbstractJsonableWriter
 
addAll(Collection<T>) - Method in interface de.hpi.fgis.dude.util.data.storage.JsonableWriter
Adds content of the passed collection to the underlying DuDeStorage.
addAll(Iterable<DuDeObject>) - Method in class de.hpi.fgis.dude.util.sorting.sorter.AbstractDuDeObjectSorter
Adds all elements of the passed Iterable to the AbstractDuDeObjectSorter.
addAll(Iterable<DuDeObject>) - Method in interface de.hpi.fgis.dude.util.sorting.sorter.DuDeObjectSorter
Adds the passed several DuDeObjects to this DuDeObjectSorter.
addAttribute(DataSource, String...) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.ContentBasedSimilarityFunction
Adds a DataSource-related attribute to this ContentBasedSimilarityFunction.
addAttributeValue(JsonRecord, String, JsonValue) - Method in class de.hpi.fgis.dude.datasource.AbstractDataSource.AbstractDataSourceIterator
Adds a JsonValue instance to an attribute of the passed JsonRecord.
addCollection(Collection<? extends JsonValue>) - Method in class de.hpi.fgis.dude.util.data.json.JsonArray
Adds a JsonArray generated out of the passed Collection to the end of the JsonArray.
addDataSource(DataSource) - Method in class de.hpi.fgis.dude.algorithm.AbstractAlgorithm
 
addDataSource(DataSource) - Method in interface de.hpi.fgis.dude.algorithm.Algorithm
Adds a DataSource to the algorithm.
addDataSource(DataSource) - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.Lego
Adds a DataSource to the algorithm.
addDataSource(DataSource) - Method in class de.hpi.fgis.dude.util.Experiment
Adds a DataSource to this Experiment.
addDataSources(DataSource...) - Method in class de.hpi.fgis.dude.util.Experiment
Adds several DataSources to this Experiment.
addDuDeOutput(DuDeOutput) - Method in class de.hpi.fgis.dude.util.Experiment
Adds a new DuDeOutput to this Experiment.
addDuDeOutputs(DuDeOutput...) - Method in class de.hpi.fgis.dude.util.Experiment
Adds several DuDeOutputs to this Experiment.
addDuplicate(DuDeObjectPair, boolean) - Method in class de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent
Adds a DuDeObjectPair to the knowledge base that is labeled as a detected duplicate and the gold standard's duplicate pairs.
addDuplicate(DuDeObjectPair) - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Adds a DuDeObjectPair to the knowledge base that is labeled as a detected duplicate.
addDuplicate(DuDeObjectPair, boolean) - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Adds a DuDeObjectPair to the knowledge base that is labeled as a detected duplicate.
addDuplicate(Iterable<DuDeObjectPair>) - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Adds several DuDeObjectPairs to the knowledge base that are labeled as detected duplicates.
addEntry(BibtexAbstractEntry) - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexFile
Adds an entry to this file.
addFuzzyDuDeOutput(DuDeOutput) - Method in class de.hpi.fgis.dude.util.Experiment
Adds a new DuDeOutput for fuzzy duplicates to this Experiment.
addFuzzyDuDeOutputs(DuDeOutput...) - Method in class de.hpi.fgis.dude.util.Experiment
Adds several DuDeOutputs for fuzzy duplicates to this Experiment.
addIdAttributes(String...) - Method in class de.hpi.fgis.dude.datasource.AbstractDataSource
 
addIdAttributes(String...) - Method in interface de.hpi.fgis.dude.datasource.DataSource
Sets the attributes that shall be used for id generation.
addIdAttributes(String...) - Method in class de.hpi.fgis.dude.datasource.DuDeObjectSource
 
addIdentifier(DuDeObjectId) - Method in class de.hpi.fgis.dude.util.data.DuDeObject
Adds an identifier to this DuDeObject.
ADDITIONAL_AUTHORS_EXISTS_ATTRIBUTE - Static variable in class de.hpi.fgis.dude.datasource.BibtexSource
The name of the attribute for storing information, whether there is an explicit mention that not all authors are listed.
addMap(Map<String, ? extends JsonValue>) - Method in class de.hpi.fgis.dude.util.data.json.JsonArray
Adds a JsonRecord generated out of the passed Map to the end of the JsonArray.
addNonDuplicate(DuDeObjectPair) - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Adds a DuDeObjectPair to the knowledge base that is labeled as a detected non-duplicate.
addNonDuplicate(DuDeObjectPair, boolean) - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Adds a DuDeObjectPair to the knowledge base that is labeled as a detected non-duplicate.
addNonDuplicate(Iterable<DuDeObjectPair>) - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Adds several DuDeObjectPairs to the knowledge base that are labeled as a detected non-duplicates.
addNull() - Method in class de.hpi.fgis.dude.util.data.json.JsonArray
Adds JsonNull to the end of the JsonArray.
addOutput(DuDeOutput) - Method in class de.hpi.fgis.dude.output.MultipleOutput
Adds an output to the list of DuDeOutputs.
addPair(DuDeObjectPair, boolean) - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Adds a DuDeObjectPair to the knowledge base.
addPair(Iterable<DuDeObjectPair>, boolean) - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Adds several DuDeObjectPairs to the knowledge base.
addPreprocessor(Preprocessor) - Method in class de.hpi.fgis.dude.algorithm.AbstractAlgorithm
 
addPreprocessor(DataSource, Preprocessor) - Method in class de.hpi.fgis.dude.algorithm.AbstractAlgorithm
 
addPreprocessor(Preprocessor) - Method in interface de.hpi.fgis.dude.algorithm.Algorithm
Adds a default Preprocessor to this algorithm.
addPreprocessor(DataSource, Preprocessor) - Method in interface de.hpi.fgis.dude.algorithm.Algorithm
Adds a Preprocessor for a specific DataSource to this algorithm.
addPrimitive(Class<? extends T>, Primitive<T>) - Method in class de.hpi.fgis.dude.util.data.json.auto.JsonTypeManager
Adds a primitive for type T.
addSortingKey(SortingKey) - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.Lego
Adds a sorting key, which defines a blocking criterion
addSortingKey(DataSource, SortingKey) - Method in class de.hpi.fgis.dude.algorithm.SortingRecordLinkage
Adds a DataSource-related SortingKey.
addSource(DataSource) - Method in class de.hpi.fgis.dude.algorithm.AbstractAlgorithm
Adds the DataSource to this instance.
addSource(DataSource) - Method in class de.hpi.fgis.dude.algorithm.AbstractDuplicateDetection
 
addSource(DataSource) - Method in class de.hpi.fgis.dude.algorithm.AbstractRecordLinkage
 
addStatisticOutput(StatisticOutput) - Method in class de.hpi.fgis.dude.util.Experiment
Adds a StatisticOutput instance to this Experiment.
addStatisticOutputs(StatisticOutput...) - Method in class de.hpi.fgis.dude.util.Experiment
Adds several StatisticOutput instances to this Experiment.
addSubkey(Subkey) - Method in class de.hpi.fgis.dude.util.sorting.sortingkey.SortingKey
Adds a sub-key to the SortingKey.
Aggregator - Class in de.hpi.fgis.dude.similarityfunction.aggregators
Aggregator aggregates the similarities returned by different SimilarityFunctions.
Aggregator() - Constructor for class de.hpi.fgis.dude.similarityfunction.aggregators.Aggregator
Internal constructor for Jsonable deserialization.
Aggregator(SimilarityFunction...) - Constructor for class de.hpi.fgis.dude.similarityfunction.aggregators.Aggregator
Initializes a MultiDuDeObjectComparator with a number of sub-comparators.
Algorithm - Interface in de.hpi.fgis.dude.algorithm
Algorithm collects all the methods that are needed by each algorithm implementation.
algorithm - Variable in class de.hpi.fgis.dude.postprocessor.StatisticComponent
 
algorithmSet() - Method in class de.hpi.fgis.dude.util.Experiment
Checks whether a Algorithm was set.
allNecessaryPropertiesAreSet() - Method in class de.hpi.fgis.dude.database.util.DBInfo
Checks whether all necessary properties are set (host, port, database name, user and password).
analyzeDuDeObject(DuDeObject) - Method in class de.hpi.fgis.dude.algorithm.AbstractAlgorithm
Initiates the preprocessing for the passed DuDeObject.
analyzeDuDeObject(DuDeObject) - Method in class de.hpi.fgis.dude.preprocessor.CountPreprocessor
 
analyzeDuDeObject(DuDeObject) - Method in class de.hpi.fgis.dude.preprocessor.DocumentFrequencyPreprocessor
Retrieves the value frequencies within the considered attribute and ads them to the total document frequency of the terms
analyzeDuDeObject(DuDeObject) - Method in interface de.hpi.fgis.dude.preprocessor.Preprocessor
Passes the currently extracted DuDeObject to the Preprocessor for further analysis.
ArrayConversionStrategy - Class in de.hpi.fgis.dude.similarityfunction.contentbased.calculationstrategy
ArrayConversionStrategy generates a one-element JsonArray with the passed JsonAtomic value and runs the passed ContentBasedSimilarityFunction on both JsonArrays.
ArrayConversionStrategy() - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.calculationstrategy.ArrayConversionStrategy
 
ArrayJsonSerialization<E> - Class in de.hpi.fgis.dude.util.data.json.auto
Json serialization for arrays.
ArrayJsonSerialization(BoundType) - Constructor for class de.hpi.fgis.dude.util.data.json.auto.ArrayJsonSerialization
Initializes the serialization for the given array type.
arrayOf(Class<?>...) - Static method in class de.hpi.fgis.dude.util.BoundType
Wraps an array of Class to an array of BoundTypes without parameters.
asList(Iterator<T>) - Static method in class de.hpi.fgis.dude.util.CollectionUtil
Materializes the elements of the given Iterator into a List.
autoGeneratedIds() - Method in class de.hpi.fgis.dude.datasource.AbstractDataSource
Checks whether the object id is generated automatically.
autoGeneratedIds() - Method in class de.hpi.fgis.dude.datasource.DuDeObjectSource
 
AutoJsonable - Interface in de.hpi.fgis.dude.util.data
Tag interface for classes which can be stored into Json and restored from it.
AutoJsonSerialization<T> - Class in de.hpi.fgis.dude.util.data.json.auto
Provides json serialization for a specific BoundType.
AutoJsonSerialization(BoundType) - Constructor for class de.hpi.fgis.dude.util.data.json.auto.AutoJsonSerialization
Initialization for the given type
Average - Class in de.hpi.fgis.dude.similarityfunction.aggregators
Average returns the average similarity of all added SimilarityFunctions.
Average() - Constructor for class de.hpi.fgis.dude.similarityfunction.aggregators.Average
Internal constructor for Jsonable deserialization.
Average(SimilarityFunction...) - Constructor for class de.hpi.fgis.dude.similarityfunction.aggregators.Average
Initializes a Average instance.
AverageArrayArrayStrategy - Class in de.hpi.fgis.dude.similarityfunction.contentbased.calculationstrategy
Calculates the average similarity of the arrays' elements (e.g.
AverageArrayArrayStrategy() - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.calculationstrategy.AverageArrayArrayStrategy
 

B

BestMatchCalculationStrategy - Class in de.hpi.fgis.dude.similarityfunction.contentbased.calculationstrategy
BestMatchCalculationStrategy compares a JsonArray with a JsonAtomic by selecting the best match.
BestMatchCalculationStrategy() - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.calculationstrategy.BestMatchCalculationStrategy
 
BibtexAbstractEntry - Class in de.hpi.fgis.dude.util.bibtex.data
Abstract entries are the building blocks of BibTex files.
BibtexAbstractEntry(BibtexFile) - Constructor for class de.hpi.fgis.dude.util.bibtex.data.BibtexAbstractEntry
Instantiates a new BibtexAbstractEntry.
BibtexAbstractValue - Class in de.hpi.fgis.dude.util.bibtex.data
Abstract values are the values that can be used as field values in entries or as bodies of macros.
BibtexAbstractValue(BibtexFile) - Constructor for class de.hpi.fgis.dude.util.bibtex.data.BibtexAbstractValue
Instantiates a new BibtexAbstractValue.
BibtexConcatenatedValue - Class in de.hpi.fgis.dude.util.bibtex.data
Two abstract values concatenated by the hash-operator (#).
BibtexConcatenatedValue(BibtexFile, BibtexAbstractValue, BibtexAbstractValue) - Constructor for class de.hpi.fgis.dude.util.bibtex.data.BibtexConcatenatedValue
Instantiates a new BibtexConcatenatedValue using the specified values.
BibtexEntry - Class in de.hpi.fgis.dude.util.bibtex.data
An entry is something that can be referenced by a latex file using the \cite command.
BibtexEntry(BibtexFile, String, String) - Constructor for class de.hpi.fgis.dude.util.bibtex.data.BibtexEntry
Instantiates a new BibtexEntry.
BibtexFile - Class in de.hpi.fgis.dude.util.bibtex.data
This is the root of a BibTex DOM tree and the factory for any BibTex model - the only way to create nodes.
BibtexFile() - Constructor for class de.hpi.fgis.dude.util.bibtex.data.BibtexFile
Instantiates a BibTexFile.
BibtexMacroDefinition - Class in de.hpi.fgis.dude.util.bibtex.data
BibTex let's you define macros which are essentially just shortcuts for strings.
BibtexMacroDefinition(BibtexFile, String, BibtexAbstractValue) - Constructor for class de.hpi.fgis.dude.util.bibtex.data.BibtexMacroDefinition
Instantiates a new BibtexMacroDefinition.
BibtexMacroReference - Class in de.hpi.fgis.dude.util.bibtex.data
A BibtexMacroReference references a BibtexMacroDefinition.
BibtexMacroReference(BibtexFile, String) - Constructor for class de.hpi.fgis.dude.util.bibtex.data.BibtexMacroReference
Instantiates a new BibtexMacroReference.
BibtexNode - Class in de.hpi.fgis.dude.util.bibtex.data
An abstract superclass for all BibTex model nodes.
BibtexNode(BibtexFile) - Constructor for class de.hpi.fgis.dude.util.bibtex.data.BibtexNode
Some people have asked why this constructor has a parameter of type BibtexFile (which in turn extends BibtexNode).
BibtexParser - Class in de.hpi.fgis.dude.util.bibtex.parser
The parser will parse the BibTex into a basic AST.
BibtexParser(boolean) - Constructor for class de.hpi.fgis.dude.util.bibtex.parser.BibtexParser
Creates a new BibTexParser.
BibtexPerson - Class in de.hpi.fgis.dude.util.bibtex.data
BibtexPerson objects are elements of BibtexPersonLists, which can be used in author or editor fields.
BibtexPerson(BibtexFile, String, String, String, String, boolean) - Constructor for class de.hpi.fgis.dude.util.bibtex.data.BibtexPerson
Instantiates a new BibtexPerson.
BibtexPersonList - Class in de.hpi.fgis.dude.util.bibtex.data
A list of BibtexPerson objects that can be used for author or editor fields - use the PersonListExpander to convert all editor/author field values of a particular BibtexFile to BibtexPersonLists.
BibtexPersonList(BibtexFile) - Constructor for class de.hpi.fgis.dude.util.bibtex.data.BibtexPersonList
Instantiates a new BibtexPersonList.
BibtexPersonListParser - Class in de.hpi.fgis.dude.util.bibtex.expander
 
BibtexPersonListParser() - Constructor for class de.hpi.fgis.dude.util.bibtex.expander.BibtexPersonListParser
 
BibtexPersonListParser.StringIterator - Class in de.hpi.fgis.dude.util.bibtex.expander
 
BibtexPersonListParser.StringIterator(String) - Constructor for class de.hpi.fgis.dude.util.bibtex.expander.BibtexPersonListParser.StringIterator
 
BibtexPreamble - Class in de.hpi.fgis.dude.util.bibtex.data
A preamble can be used to include pretty much arbitrary LaTex/Tex at the beginning of a generated bibliography.
BibtexPreamble(BibtexFile, BibtexAbstractValue) - Constructor for class de.hpi.fgis.dude.util.bibtex.data.BibtexPreamble
Instantiates a new BibtexPreamble.
BibtexSource - Class in de.hpi.fgis.dude.datasource
BibtexSource represents *.bib files containing BibTeX syntax.
BibtexSource() - Constructor for class de.hpi.fgis.dude.datasource.BibtexSource
Internal constructor for Jsonable deserialization.
BibtexSource(String, File) - Constructor for class de.hpi.fgis.dude.datasource.BibtexSource
Initializes a BibtexSource.
BibtexSource(String, File, boolean) - Constructor for class de.hpi.fgis.dude.datasource.BibtexSource
Initializes a BibtexSource object.
BibtexSource.BibtexSourceIterator - Class in de.hpi.fgis.dude.datasource
BibtexSourceIterator is used for generating DuDeObjects out of BibtexSources.
BibtexSource.BibtexSourceIterator(BibtexSource) - Constructor for class de.hpi.fgis.dude.datasource.BibtexSource.BibtexSourceIterator
Initializes a BibtexSourceIterator using the passed BibtexSource.
BibtexStandardMacros - Class in de.hpi.fgis.dude.util.bibtex.data
With standard macros we mean the ones defined in plain.bst
BibtexStandardMacros() - Constructor for class de.hpi.fgis.dude.util.bibtex.data.BibtexStandardMacros
 
BibtexString - Class in de.hpi.fgis.dude.util.bibtex.data
A String - this class is used for numbers as well - if there's a number wrapped in here, the toString() method will be smart enough to leave out the braces, and thus print {1979} as 1979.
BibtexString(BibtexFile, String) - Constructor for class de.hpi.fgis.dude.util.bibtex.data.BibtexString
Creates a new BibTexString.
BibtexToplevelComment - Class in de.hpi.fgis.dude.util.bibtex.data
Top-level comments wrap everything that is at the top-level of a BibTex file and not parsable as some other entry.
BibtexToplevelComment(BibtexFile, String) - Constructor for class de.hpi.fgis.dude.util.bibtex.data.BibtexToplevelComment
Creates a new BibtexToplevelComment.
BlockDistanceFunction - Class in de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics
BlockDistanceFunction compares two DuDeObjects based on the (city) Block Distance of the given attribute.
BlockDistanceFunction() - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.BlockDistanceFunction
Internal constructor for Jsonable deserialization.
BlockDistanceFunction(String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.BlockDistanceFunction
Initializes the BlockDistanceFunction with the default tokenizer.
BlockDistanceFunction(int, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.BlockDistanceFunction
Initializes the BlockDistanceFunction with the default tokenizer.
BlockDistanceFunction(InterfaceTokeniser, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.BlockDistanceFunction
Initializes the BlockDistanceFunction with the passed InterfaceTokeniser.
BlockDistanceFunction(InterfaceTokeniser, int, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.BlockDistanceFunction
Initializes the BlockDistanceFunction with the passed InterfaceTokeniser.
blockQueue - Variable in class de.hpi.fgis.dude.algorithm.duplicatedetection.AdaptiveSNM_Yan2007.YanIterator
Queue that contains the records of the current block
blockQueueIterator - Variable in class de.hpi.fgis.dude.algorithm.duplicatedetection.AdaptiveSNM_Yan2007.YanIterator
Iterator for the records of the current block
BoundType - Class in de.hpi.fgis.dude.util
Represents an rectified, generic type, which is only equal to another BoundType iff all bounds are exactly the same.
BoundType(Class<?>) - Constructor for class de.hpi.fgis.dude.util.BoundType
Initializes a BoundType around the given raw type.
BoundType(ParameterizedType) - Constructor for class de.hpi.fgis.dude.util.BoundType
Initializes a BoundType around the given parameterized type.
build() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.DuplicateCountSNM.AdaptiveWindowSizeSNMBuilder
Initializes a AdaptiveWindowSizeSNM instance

C

calculateSimilarity(DuDeObject, DuDeObject) - Method in class de.hpi.fgis.dude.similarityfunction.AbstractSimilarityFunction
Calculates the similarity of the passed DuDeObjects.
calculateSimilarity(DuDeObject, DuDeObject) - Method in class de.hpi.fgis.dude.similarityfunction.aggregators.Aggregator
 
calculateSimilarity(ContentBasedSimilarityFunction<?>, JsonArray, JsonAtomic) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.calculationstrategy.ArrayConversionStrategy
 
calculateSimilarity(ContentBasedSimilarityFunction<?>, JsonArray, JsonArray) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.calculationstrategy.AverageArrayArrayStrategy
 
calculateSimilarity(ContentBasedSimilarityFunction<?>, JsonArray, JsonAtomic) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.calculationstrategy.BestMatchCalculationStrategy
 
calculateSimilarity(ContentBasedSimilarityFunction<?>, T1, T2) - Method in interface de.hpi.fgis.dude.similarityfunction.contentbased.calculationstrategy.CalculationStrategy
Calculates the similarity of the passed values.
calculateSimilarity(ContentBasedSimilarityFunction<?>, JsonArray, JsonArray) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.calculationstrategy.CrossProductStrategy
 
calculateSimilarity(ContentBasedSimilarityFunction<?>, T1, T2) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.calculationstrategy.IgnoreStrategy
 
calculateSimilarity(ContentBasedSimilarityFunction<?>, T1, T2) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.calculationstrategy.NotSupportedStrategy
 
calculateSimilarity(ContentBasedSimilarityFunction<?>, JsonArray, JsonArray) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.calculationstrategy.StableMarriageStrategy
 
calculateSimilarity(DuDeObject, DuDeObject) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.ContentBasedSimilarityFunction
 
calculateSimilarity(JsonValue, JsonValue) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.ContentBasedSimilarityFunction
Calculates the similarity of the two passed JsonValues.
calculateTransitiveClosure() - Method in class de.hpi.fgis.dude.postprocessor.WarshallTransitiveClosureGenerator.AdjacencyList
 
calculateTransitiveClosure() - Method in class de.hpi.fgis.dude.postprocessor.WarshallTransitiveClosureGenerator.AdjacencyMatrix
 
calculateTransitiveClosure() - Method in class de.hpi.fgis.dude.postprocessor.WarshallTransitiveClosureGenerator.GraphRepresentation
Iterating over the matrix and searching for new connections between elements.
CalculationStrategy<T1 extends JsonValue,T2 extends JsonValue> - Interface in de.hpi.fgis.dude.similarityfunction.contentbased.calculationstrategy
CalculationStrategy is an interface for different strategies, that can be used within ContentBasedSimilarityFunctions for defining the behavior of the similarity calculation, if at least one value is not an atomic one.
canBeEscaped(char) - Method in class de.hpi.fgis.dude.util.csv.CSVReader
Checks whether the passed character can be escaped.
caseSensitivityEnabled() - Method in class de.hpi.fgis.dude.util.sorting.sortingkey.TextBasedSubkey
Checks if case-sensitivity is enabled.
CDExec - Class in de.hpi.fgis.dude.exec.duplicatedetection
This execution class runs the naive duplicate detection algorithm on the CD data source.
CDExec() - Constructor for class de.hpi.fgis.dude.exec.duplicatedetection.CDExec
 
ChainedIterator<T> - Class in de.hpi.fgis.dude.util
Logically concatenates several Iterators to one big iterator.
ChainedIterator(Iterable<Iterator<T>>) - Constructor for class de.hpi.fgis.dude.util.ChainedIterator
Initializes the chained iterator from multiple single iterators.
ChainedIterator(Iterator<Iterator<T>>) - Constructor for class de.hpi.fgis.dude.util.ChainedIterator
Initializes the chained iterator from multiple single iterators.
ChainedIterator(Iterator<T>...) - Constructor for class de.hpi.fgis.dude.util.ChainedIterator
Initializes the chained iterator from multiple single iterators.
checkMemory - Variable in class de.hpi.fgis.dude.postprocessor.StatisticComponent
 
checkMemoryUsage() - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Starts Memoryusage task
choice - Variable in class de.hpi.fgis.dude.util.bibtex.parser.PseudoLexer.Token
The choice of this token.
CitySimilarityFunction - Class in de.hpi.fgis.dude.similarityfunction.domainspecific.address
CitySimilarityFunction compares two strings and treats them as cities, allowing for some special normalization and comparison techniques.
CitySimilarityFunction(String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.domainspecific.address.CitySimilarityFunction
 
Cleanable - Interface in de.hpi.fgis.dude.util
Cleanable is an interface that provides methods for easily closing a bunch of Closeable or Cleanable instances.
cleanUp() - Method in class de.hpi.fgis.dude.database.DatabaseSource
 
cleanUp() - Method in class de.hpi.fgis.dude.datasource.XMLSource
 
cleanUp() - Method in class de.hpi.fgis.dude.util.AbstractCleanable
 
cleanUp() - Method in interface de.hpi.fgis.dude.util.Cleanable
Closes all registered Closeable and Cleanable instances.
cleanUp() - Method in class de.hpi.fgis.dude.util.Experiment
Performs a clean-up.
clear() - Method in class de.hpi.fgis.dude.util.GoldStandard
Clears all gathered data.
clear() - Method in class de.hpi.fgis.dude.util.sorting.sorter.AbstractDuDeObjectSorter
 
clear() - Method in interface de.hpi.fgis.dude.util.sorting.sorter.DuDeObjectSorter
Clears the already added data.
clear() - Method in class de.hpi.fgis.dude.util.sorting.sorter.TwoPhaseMultiWayMergeSorter
 
clearData() - Method in class de.hpi.fgis.dude.preprocessor.CountPreprocessor
 
clearData() - Method in class de.hpi.fgis.dude.preprocessor.DocumentFrequencyPreprocessor
 
clearData() - Method in interface de.hpi.fgis.dude.preprocessor.Preprocessor
Clears statistics that were already gathered.
clearDuplicateInfo() - Method in class de.hpi.fgis.dude.util.data.DuDeObjectPair
Clears the is-duplicate property.
clearLineage() - Method in class de.hpi.fgis.dude.util.data.DuDeObjectPair
Clears the lineage property.
clearSimilarity() - Method in class de.hpi.fgis.dude.util.data.DuDeObjectPair
Clears the similarity of this pair.
clone() - Method in class de.hpi.fgis.dude.util.data.json.JsonNull
There is only intended to be a single instance of the NULL object, so the clone method returns itself.
close() - Method in class de.hpi.fgis.dude.datasource.AbstractDataSource
 
close() - Method in class de.hpi.fgis.dude.datasource.BibtexSource
 
close() - Method in class de.hpi.fgis.dude.datasource.DuDeObjectSource
 
close() - Method in class de.hpi.fgis.dude.output.AbstractDuDeOutput
 
close() - Method in class de.hpi.fgis.dude.output.CSVOutput
 
close() - Method in interface de.hpi.fgis.dude.output.DuDeOutput
Closes the stream.
close() - Method in class de.hpi.fgis.dude.output.MultipleOutput
 
close() - Method in class de.hpi.fgis.dude.output.statisticoutput.AbstractStatisticOutput
 
close() - Method in class de.hpi.fgis.dude.output.statisticoutput.CSVStatisticOutput
Closes the CSVWriter.
close() - Method in class de.hpi.fgis.dude.output.statisticoutput.SimpleStatisticOutput
 
close() - Method in interface de.hpi.fgis.dude.output.statisticoutput.StatisticOutput
Closes the underlying stream.
close() - Method in class de.hpi.fgis.dude.util.csv.CSVReader
Closes the internally used reader.
close() - Method in class de.hpi.fgis.dude.util.csv.CSVWriter
Closes the underlying writer.
close() - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonGenerator
 
close() - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonParser
Closes the used reader.
close() - Method in class de.hpi.fgis.dude.util.GoldStandard
Closes the stream of the DataSource.
close() - Method in class de.hpi.fgis.dude.util.sorting.sorter.SortedDataFile
 
closeDataSources() - Method in class de.hpi.fgis.dude.util.Experiment
Closes all added DataSourcess.
closeFuzzyOutputs() - Method in class de.hpi.fgis.dude.util.Experiment
Closes all added fuzzy DuDeOutputs.
closeOutputs() - Method in class de.hpi.fgis.dude.util.Experiment
Closes all added DuDeOutputs.
closeStatisticOutputs() - Method in class de.hpi.fgis.dude.util.Experiment
Closes all added StatisticOutputs.
CollectionJsonSerialization<E,C extends Collection<E>> - Class in de.hpi.fgis.dude.util.data.json.auto
Collection json serialization support.
CollectionJsonSerialization(BoundType) - Constructor for class de.hpi.fgis.dude.util.data.json.auto.CollectionJsonSerialization
Initializes the json serialization for the given type.
CollectionUtil - Class in de.hpi.fgis.dude.util
Convenience methods for collections.
CollectionUtil() - Constructor for class de.hpi.fgis.dude.util.CollectionUtil
 
collectRelevantValues(JsonArray, JsonValue) - Method in class de.hpi.fgis.dude.util.sorting.sortingkey.AbstractSubkey
Collects all relevant values and put them into the passed JsonArray.
collectRelevantValues(JsonArray, JsonValue) - Method in class de.hpi.fgis.dude.util.sorting.sortingkey.NumberBasedSubkey
 
collectRelevantValues(JsonArray, JsonValue) - Method in class de.hpi.fgis.dude.util.sorting.sortingkey.TextBasedSubkey
 
column - Variable in class de.hpi.fgis.dude.util.bibtex.parser.PseudoLexer.Token
The column information of this token.
ColumnInfo - Class in de.hpi.fgis.dude.database.util
ColumnInfo represents a column with its name and type.
ColumnInfo(String, int) - Constructor for class de.hpi.fgis.dude.database.util.ColumnInfo
Creates a ColumnInfo object with the name and the Types data type representation of the column.
columnNamesAreUnique() - Method in class de.hpi.fgis.dude.database.util.Schema
Checks whether the set columns are unique.
columns - Variable in class de.hpi.fgis.dude.database.util.Schema
A list of all column representations which are part of this schema.
comparator - Variable in class de.hpi.fgis.dude.algorithm.duplicatedetection.AdaptiveSNM_Yan2007.YanIterator
comparator used to calculate the distance between two sorting keys
compare(DuDeObject, DuDeObject) - Method in class de.hpi.fgis.dude.util.sorting.sortingkey.AbstractSubkey
 
compare(DuDeObject, DuDeObject) - Method in class de.hpi.fgis.dude.util.sorting.sortingkey.SortingKey
Compares two DuDeObjects.
compareAtomicValues(JsonAtomic, JsonAtomic) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.ContentBasedSimilarityFunction
Calculates the similarity of the two passed JsonAtomics.
compareAtomicValues(JsonAtomic, JsonAtomic) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.impl.EquationSimilarityFunction
 
compareAtomicValues(JsonAtomic, JsonAtomic) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.impl.RelativeNumberDiffFunction
 
compareAtomicValues(JsonAtomic, JsonAtomic) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.SimmetricsFunction
 
compareAtomicValues(JsonAtomic, JsonAtomic) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.impl.SoundExFunction
 
compareAtomicValues(JsonAtomic, JsonAtomic) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.impl.TFIDFSimilarityFunction
 
compareAtomicValues(JsonAtomic, JsonAtomic) - Method in class de.hpi.fgis.dude.similarityfunction.domainspecific.address.CitySimilarityFunction
 
compareAtomicValues(JsonAtomic, JsonAtomic) - Method in class de.hpi.fgis.dude.similarityfunction.domainspecific.address.DateSimilarityFunction
 
compareAtomicValues(JsonAtomic, JsonAtomic) - Method in class de.hpi.fgis.dude.similarityfunction.domainspecific.address.FamilyNameSimilarityFunction
 
compareAtomicValues(JsonAtomic, JsonAtomic) - Method in class de.hpi.fgis.dude.similarityfunction.domainspecific.address.GivenNameSimilarityFunction
 
compareAtomicValues(JsonAtomic, JsonAtomic) - Method in class de.hpi.fgis.dude.similarityfunction.domainspecific.address.HonorificSimilarityFunction
 
compareAtomicValues(JsonAtomic, JsonAtomic) - Method in class de.hpi.fgis.dude.similarityfunction.domainspecific.address.HouseNumberSimilarityFunction
 
compareAtomicValues(JsonAtomic, JsonAtomic) - Method in class de.hpi.fgis.dude.similarityfunction.domainspecific.address.PhoneNumberSimilarityFunction
 
compareAtomicValues(JsonAtomic, JsonAtomic) - Method in class de.hpi.fgis.dude.similarityfunction.domainspecific.address.StreetSimilarityFunction
 
compareAtomicValues(JsonAtomic, JsonAtomic) - Method in class de.hpi.fgis.dude.similarityfunction.domainspecific.address.TitleSimilarityFunction
 
compareAtomicValues(JsonAtomic, JsonAtomic) - Method in class de.hpi.fgis.dude.similarityfunction.domainspecific.address.ZIPSimilarityFunction
 
compareJsonValues(JsonValue, JsonValue) - Method in class de.hpi.fgis.dude.util.sorting.sortingkey.AbstractSubkey
Executes the text-based comparison for each Json type.
compareJsonValues(JsonValue, JsonValue) - Method in class de.hpi.fgis.dude.util.sorting.sortingkey.NumberBasedSubkey
 
compareJsonValues(JsonValue, JsonValue) - Method in class de.hpi.fgis.dude.util.sorting.sortingkey.TextBasedSubkey
 
compareTo(DuDeObject) - Method in class de.hpi.fgis.dude.util.data.DuDeObject
 
compareTo(DuDeObjectId) - Method in class de.hpi.fgis.dude.util.data.DuDeObjectId
 
compareTo(JsonValue) - Method in class de.hpi.fgis.dude.util.data.json.JsonArray
 
compareTo(JsonValue) - Method in class de.hpi.fgis.dude.util.data.json.JsonBoolean
 
compareTo(JsonValue) - Method in class de.hpi.fgis.dude.util.data.json.JsonNull
 
compareTo(JsonValue) - Method in class de.hpi.fgis.dude.util.data.json.JsonNumber
 
compareTo(JsonValue) - Method in class de.hpi.fgis.dude.util.data.json.JsonRecord
 
compareTo(JsonValue) - Method in class de.hpi.fgis.dude.util.data.json.JsonString
 
compareTo(JsonValue) - Method in interface de.hpi.fgis.dude.util.data.json.JsonValue
Compares the current instance with the passed JsonValue.
CompositeJsonSerialization<T> - Class in de.hpi.fgis.dude.util.data.json.auto
The json serialization support for classes.
CompositeJsonSerialization(BoundType) - Constructor for class de.hpi.fgis.dude.util.data.json.auto.CompositeJsonSerialization
Initializes the serialization with the given type.
computeF1(double, double) - Method in class de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent
Computation of the f1 measure by calculating the harmonic mean of the given precision and recall.
computeGMD(Collection<? extends Collection<DuDeObject>>, Collection<? extends Collection<DuDeObject>>) - Method in class de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent
Returns the Generalized Merge Distance based on the current cost configurations.
computeSingleClusters(Vector<Vector<DuDeObject>>) - Method in class de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent
Returns a list of clusters with each record alone in its cluster.
consolidatePushBack() - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonParser
Cleans up helper structures used with DuDeJsonParser.createPushBackGenerator().
This method should especially be invoked when all pushed json elements have been read.
Without calling this method, a slight performance decrease should be observable.
ConstantSimilarityFunction - Class in de.hpi.fgis.dude.similarityfunction.structurebased
ConstantSimilarityFunction returns a similarity that is independent from the passed DuDeObjectPair and can be specified by the user.
ConstantSimilarityFunction() - Constructor for class de.hpi.fgis.dude.similarityfunction.structurebased.ConstantSimilarityFunction
Internal constructor for Jsonable deserialization.
ConstantSimilarityFunction(double) - Constructor for class de.hpi.fgis.dude.similarityfunction.structurebased.ConstantSimilarityFunction
Initializes the ConstantSimilarityFunction with the passed similarity.
contains(DuDeObjectPair) - Method in class de.hpi.fgis.dude.postprocessor.NaiveTransitiveClosureGenerator.TransitiveClosure
Checks whether at least one element of the pair is already within this transitive closure.
contains(DuDeObject) - Method in class de.hpi.fgis.dude.postprocessor.NaiveTransitiveClosureGenerator.TransitiveClosure
Checks whether the passed DuDeObject is already member of the transitive closure.
contains(DuDeObjectPair) - Method in class de.hpi.fgis.dude.util.GoldStandard
Checks whether the passed pair is in the gold standard.
content - Variable in class de.hpi.fgis.dude.util.bibtex.parser.PseudoLexer.Token
The content of this token.
ContentBasedSimilarityFunction<T extends ContentBasedSimilarityFunction<T>> - Class in de.hpi.fgis.dude.similarityfunction.contentbased
ContentBasedSimilarityFunction is a skeleton implementation with common functionality that is used by any content-based SimilarityFunction.
ContentBasedSimilarityFunction() - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.ContentBasedSimilarityFunction
Internal constructor for Jsonable deserialization.
ContentBasedSimilarityFunction(String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.ContentBasedSimilarityFunction
Initializes a ContentBasedSimilarityFunction with the passed default attribute.
ContentBasedSimilarityFunction(int, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.ContentBasedSimilarityFunction
Initializes a ContentBasedSimilarityFunction with the passed default attribute.
copyPropertiesFrom(OrderedPair<T>) - Method in class de.hpi.fgis.dude.util.OrderedPair
Copies the properties from the passed pair into the current instance.
CoraExec - Class in de.hpi.fgis.dude.exec.duplicatedetection
This execution class runs the naive duplicate detection algorithm on the CORA data source.
CoraExec() - Constructor for class de.hpi.fgis.dude.exec.duplicatedetection.CoraExec
 
CosineSimilarityFunction - Class in de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics
CosineSimilarityFunction compares two DuDeObjects based on the cosine similarity of the given attribute.
CosineSimilarityFunction() - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.CosineSimilarityFunction
Internal constructor for Jsonable deserialization.
CosineSimilarityFunction(String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.CosineSimilarityFunction
Initializes the CosineSimilarityFunction with the default tokenizer.
CosineSimilarityFunction(int, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.CosineSimilarityFunction
Initializes the CosineSimilarityFunction with the default tokenizer.
CosineSimilarityFunction(InterfaceTokeniser, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.CosineSimilarityFunction
Initializes the CosineSimilarityFunction with the passed InterfaceTokeniser.
CosineSimilarityFunction(InterfaceTokeniser, int, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.CosineSimilarityFunction
Initializes the CosineSimilarityFunction with the passed InterfaceTokeniser.
CountPreprocessor - Class in de.hpi.fgis.dude.preprocessor
CountPreprocessor is a sample class, that shows how the Preprocessor interface can be used.
CountPreprocessor() - Constructor for class de.hpi.fgis.dude.preprocessor.CountPreprocessor
Initializes a CountPreprocessor.
createConnection() - Method in class de.hpi.fgis.dude.database.adapter.Database
Returns a Connection object, which represents a new connection to the database.
createData() - Method in class de.hpi.fgis.dude.exec.duplicatedetection.SortedBlocksExec
Populate the data source with data
createIteratorInstance() - Method in class de.hpi.fgis.dude.algorithm.AbstractDuplicateDetection
Returns a new Iterator instance.
createIteratorInstance() - Method in class de.hpi.fgis.dude.algorithm.AbstractRecordLinkage
Returns a new Iterator instance.
createIteratorInstance() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.AdaptiveSNM_Yan2007
 
createIteratorInstance() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.DuplicateCountSNM
 
createIteratorInstance() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.GSwoosh
Creates a new instance of the GSwooshIterator.
createIteratorInstance() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.Lego
 
createIteratorInstance() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.NaiveBlockingAlgorithm
 
createIteratorInstance() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.NaiveDuplicateDetection
 
createIteratorInstance() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.RSwoosh
Creates a new instance of the RSwooshIterator.
createIteratorInstance() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.SortedBlocks
Initializes a SortedBlocks instance using variable block sizes.
createIteratorInstance() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.SortedNeighborhoodMethod
 
createIteratorInstance() - Method in class de.hpi.fgis.dude.algorithm.recordlinkage.NaiveRecordLinkage
 
createIteratorInstance() - Method in class de.hpi.fgis.dude.algorithm.SortingDuplicateDetection
 
createIteratorInstance() - Method in class de.hpi.fgis.dude.algorithm.SortingRecordLinkage
 
createJsonArray(String) - Static method in class de.hpi.fgis.dude.util.data.json.JsonArray
Creates a JsonArray instance based on the passed Json code.
createJsonBoolean(String) - Static method in class de.hpi.fgis.dude.util.data.json.JsonBoolean
Returns a Json-convertible boolean based on the passed String.
createJsonBoolean(boolean) - Static method in class de.hpi.fgis.dude.util.data.json.JsonBoolean
Returns the Json representation of the passed boolean value.
createJsonNumber(String) - Static method in class de.hpi.fgis.dude.util.data.json.JsonNumber
Creates a JsonNumber instance based on the passed Json code.
createJsonParseException(String) - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonParser
Creates a new JsonParseException.
createJsonParseException(String, Throwable) - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonParser
Creates a new JsonParseException.
createJsonRecord(String) - Static method in class de.hpi.fgis.dude.util.data.json.JsonRecord
Creates a JsonRecord instance based on the passed Json code.
createJsonString(String) - Static method in class de.hpi.fgis.dude.util.data.json.JsonString
Creates a JsonString instance based on the passed Json code.
createPushBackGenerator() - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonParser
Creates a push back generator for the parser similar to PushbackInputStream.
The returned DuDeJsonGenerator might be used to generate arbitrary json elements in front of the current parser position in a FIFO manner.
In order to adjust the internal state to the newly generated json elements, a closing the returned generator with DuDeJsonGenerator.close() is necessary.
createResultCluster() - Method in class de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent
Creates a Vector of Vector of DuDeObjects from the given data structure (Transform gold standard from duplicate pairs to cluster format).
createStorage(String) - Method in class de.hpi.fgis.dude.algorithm.AbstractAlgorithm
Creates the DuDeStorage instance based on the in-memory-processing flag.
createXMLStreamReader() - Method in class de.hpi.fgis.dude.datasource.XMLSource
Instantiates a new XMLStreamReader.
CrossProductStrategy - Class in de.hpi.fgis.dude.similarityfunction.contentbased.calculationstrategy
CrossProductStrategy compares a each member of the first JsonArray with all elements of the second JsonArray.
CrossProductStrategy() - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.calculationstrategy.CrossProductStrategy
 
CrossReferenceExpander - Class in de.hpi.fgis.dude.util.bibtex.expander
This expander expands the crossreferences defined by the crossref fields - you should run the MacroReferenceExpander first.
CrossReferenceExpander() - Constructor for class de.hpi.fgis.dude.util.bibtex.expander.CrossReferenceExpander
Equivalent to CrossReferenceExpander(true)
CrossReferenceExpander(boolean) - Constructor for class de.hpi.fgis.dude.util.bibtex.expander.CrossReferenceExpander
 
CSVOutput - Class in de.hpi.fgis.dude.output
Writes passed DudeObjectPairs , their similarity value and selected optional value in a CSV file row by row.
CSVOutput(File) - Constructor for class de.hpi.fgis.dude.output.CSVOutput
Initializes a new CSVOutput.
CSVOutput(OutputStream) - Constructor for class de.hpi.fgis.dude.output.CSVOutput
Initializes a new CSVOutput with the passed OutputStream.
CSVOutput(Writer) - Constructor for class de.hpi.fgis.dude.output.CSVOutput
Initializes a new CSVOutput.
CSVOutput() - Constructor for class de.hpi.fgis.dude.output.CSVOutput
Internal constructor for Jsonable deserialization.
CSVReader - Class in de.hpi.fgis.dude.util.csv
CSVReader reads CSV formatted data.
CSVReader(Reader) - Constructor for class de.hpi.fgis.dude.util.csv.CSVReader
Initializes a CSVReader with the given Reader.
CSVSource - Class in de.hpi.fgis.dude.datasource
CSVSource represents *.csv files.
CSVSource() - Constructor for class de.hpi.fgis.dude.datasource.CSVSource
Internal constructor for Jsonable deserialization.
CSVSource(String, File) - Constructor for class de.hpi.fgis.dude.datasource.CSVSource
Initializes a CSVSource.
CSVSource(String, File, String...) - Constructor for class de.hpi.fgis.dude.datasource.CSVSource
Initializes a CSVSource with column names.
CSVSource.CSVSourceIterator - Class in de.hpi.fgis.dude.datasource
CSVSourceIterator is used for generating DuDeObjects out of CSVSources.
CSVSource.CSVSourceIterator(CSVSource) - Constructor for class de.hpi.fgis.dude.datasource.CSVSource.CSVSourceIterator
Initializes a CSVSourceIterator using the passed CSVSource.
CSVStatisticOutput - Class in de.hpi.fgis.dude.output.statisticoutput
CSVStatisticOutput writes the statistics provided by a StatisticComponent instance into a CSV file.
CSVStatisticOutput(OutputStream, char) - Constructor for class de.hpi.fgis.dude.output.statisticoutput.CSVStatisticOutput
Initializes a CSVStatisticOutput with no statistics.
CSVStatisticOutput(OutputStream, StatisticComponent, char) - Constructor for class de.hpi.fgis.dude.output.statisticoutput.CSVStatisticOutput
Initializes the CSVStatisticOutput.
CSVStatisticOutput(File, char) - Constructor for class de.hpi.fgis.dude.output.statisticoutput.CSVStatisticOutput
Initializes a CSVStatisticOutput with no statistics.
CSVStatisticOutput(File, StatisticComponent, char) - Constructor for class de.hpi.fgis.dude.output.statisticoutput.CSVStatisticOutput
Initializes the CSVStatisticOutput.
CSVStatisticOutput() - Constructor for class de.hpi.fgis.dude.output.statisticoutput.CSVStatisticOutput
Internal constructor for Jsonable deserialization.
CSVWriter - Class in de.hpi.fgis.dude.util.csv
CSVWriter generates CSV formatted data.
CSVWriter(OutputStream) - Constructor for class de.hpi.fgis.dude.util.csv.CSVWriter
Initializes a CSVWriter with the given OutputStream.
CSVWriter(Writer) - Constructor for class de.hpi.fgis.dude.util.csv.CSVWriter
Initializes a CSVWriter with the given Writer.
current() - Method in class de.hpi.fgis.dude.util.bibtex.expander.BibtexPersonListParser.StringIterator
 
current() - Method in class de.hpi.fgis.dude.util.sorting.sorter.SortedDataFile
Returns the element on which the iterating pointer points right now.
currentFieldName() - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonParser
Returns the current field name without setting the cursor to the next token.
currentInputChar() - Method in class de.hpi.fgis.dude.util.bibtex.parser.PseudoLexer
make sure to query enforceNoEof first!
currentRec - Variable in class de.hpi.fgis.dude.algorithm.duplicatedetection.AdaptiveSNM_Yan2007.YanIterator
current record that is used to create record pairs with all other records in the block
currentToken() - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonParser
Returns the current token that was loaded during the last DuDeJsonParser.nextToken() call.
currentTokenName() - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonParser
Returns the type name of the current token.

D

Database - Class in de.hpi.fgis.dude.database.adapter
Database is an abstract class that encapsulates the database related information.
Database(DBInfo) - Constructor for class de.hpi.fgis.dude.database.adapter.Database
Initializes the database instance members and loads the settings provided by the parameter dbInfo.
Database(InputStream) - Constructor for class de.hpi.fgis.dude.database.adapter.Database
Initializes the database using the passed InputStream.
Database(Properties) - Constructor for class de.hpi.fgis.dude.database.adapter.Database
Initializes the database using the passed Properties.
DatabaseSource - Class in de.hpi.fgis.dude.database
DatabaseSource represents databases.
DatabaseSource() - Constructor for class de.hpi.fgis.dude.database.DatabaseSource
Internal constructor for Jsonable deserialization.
DatabaseSource(String, Database, String) - Constructor for class de.hpi.fgis.dude.database.DatabaseSource
Initializes DatabaseSource for the passed Database and table.
DatabaseSource.DatabaseSourceIterator - Class in de.hpi.fgis.dude.database
DatabaseSourceIterator is used for generating DuDeObjects out of DatabaseSources.
DatabaseSource.DatabaseSourceIterator(DatabaseSource) - Constructor for class de.hpi.fgis.dude.database.DatabaseSource.DatabaseSourceIterator
Initializes a DatabaseSourceIterator using the passed DatabaseSource.
dataExtracted() - Method in class de.hpi.fgis.dude.algorithm.AbstractAlgorithm
Checks whether the data extraction was already done.
dataIterator - Variable in class de.hpi.fgis.dude.algorithm.duplicatedetection.AdaptiveSNM_Yan2007.YanIterator
Iterator for the extracted records from the data sources
dataSource - Variable in class de.hpi.fgis.dude.datasource.AbstractDataSource.AbstractDataSourceIterator
The internally used DataSource.
DataSource - Interface in de.hpi.fgis.dude.datasource
DataSource is used for extracting data out of different data sources.
dataSourceAttached(DataSource) - Method in class de.hpi.fgis.dude.algorithm.AbstractAlgorithm
Checks whether the passed DataSource is attached to this AbstractAlgorithm instance.
dataSourceAttached(DataSource) - Method in class de.hpi.fgis.dude.algorithm.AbstractDuplicateDetection
 
dataSourceAttached(DataSource) - Method in class de.hpi.fgis.dude.algorithm.AbstractRecordLinkage
 
DataSourceIdentifierManager - Class in de.hpi.fgis.dude.util
IdentifierManager manages the DataSource identifiers.
dataSourcesSet() - Method in class de.hpi.fgis.dude.util.Experiment
Checks whether any DataSource is added.
DateSimilarityFunction - Class in de.hpi.fgis.dude.similarityfunction.domainspecific.address
DateSimilarityFunction compares two strings and treats them as dates, allowing for some special normalization and comparison techniques.
DateSimilarityFunction(String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.domainspecific.address.DateSimilarityFunction
 
DB2Database - Class in de.hpi.fgis.dude.database.adapter
DB2Database encapsulates all the necessary information for establishing a connection to a DB2 database.
DB2Database(DBInfo) - Constructor for class de.hpi.fgis.dude.database.adapter.DB2Database
Initializes the DB2Database instance members and loads the settings provided by the parameter dbInfo.
DB2Database(InputStream) - Constructor for class de.hpi.fgis.dude.database.adapter.DB2Database
Initializes the DB2Database using the passed InputStream.
DB2Database(Properties) - Constructor for class de.hpi.fgis.dude.database.adapter.DB2Database
Initializes the DB2Database using the passed Properties.
DBInfo - Class in de.hpi.fgis.dude.database.util
DBInfo encapsulates the settings which are needed for establishing a database connection.
DBInfo() - Constructor for class de.hpi.fgis.dude.database.util.DBInfo
Initializes a DBInfo instance with no initial information.
DBInfo(Properties) - Constructor for class de.hpi.fgis.dude.database.util.DBInfo
Initializes a DBInfo instance with the passed properties.
DBInfo(InputStream) - Constructor for class de.hpi.fgis.dude.database.util.DBInfo
Initializes a DBInfo instance where the initial properties are read from the passed InputStream.
DBInfo(String) - Constructor for class de.hpi.fgis.dude.database.util.DBInfo
Initializes a DBInfo instance with the properties provided by the properties file whose path was passed.
DCSNMMissingNotificationException - Exception in de.hpi.fgis.dude.exceptions
This exception should be raised by DuplicateCountSNM, if the algorithm has not been notified of the comparison result of the latest pair.
DCSNMMissingNotificationException() - Constructor for exception de.hpi.fgis.dude.exceptions.DCSNMMissingNotificationException
Calls NoSuchElementException.NoSuchElementException().
DCSNMMissingNotificationException(String) - Constructor for exception de.hpi.fgis.dude.exceptions.DCSNMMissingNotificationException
Calls NoSuchElementException.NoSuchElementException(String).
DCSNMMissingNotificationException(Throwable) - Constructor for exception de.hpi.fgis.dude.exceptions.DCSNMMissingNotificationException
Calls NoSuchElementException.NoSuchElementException() and stores the passed cause.
DCSNMMissingNotificationException(String, Throwable) - Constructor for exception de.hpi.fgis.dude.exceptions.DCSNMMissingNotificationException
Calls NoSuchElementException.NoSuchElementException(String) and stores the passed cause.
de.hpi.fgis.dude - package de.hpi.fgis.dude
The base package of the DuDe tool implementation.
de.hpi.fgis.dude.algorithm - package de.hpi.fgis.dude.algorithm
Contains all algorithm implementations (and interfaces) that are supported by DuDe.
de.hpi.fgis.dude.algorithm.duplicatedetection - package de.hpi.fgis.dude.algorithm.duplicatedetection
Contains all Duplicate-Detection algorithm implementations.
de.hpi.fgis.dude.algorithm.recordlinkage - package de.hpi.fgis.dude.algorithm.recordlinkage
Contains all Record-Linkage algorithm implementations.
de.hpi.fgis.dude.database - package de.hpi.fgis.dude.database
Provides all classes that encapsulate the database abstraction layer.
de.hpi.fgis.dude.database.adapter - package de.hpi.fgis.dude.database.adapter
Provides the concrete implementations for the different database types like MySQL or DB2.
de.hpi.fgis.dude.database.util - package de.hpi.fgis.dude.database.util
Provides all classes that are needed for encapsulating the database schema within the database abstraction layer.
de.hpi.fgis.dude.datasource - package de.hpi.fgis.dude.datasource
Contains all DataSources supported by DuDe.
de.hpi.fgis.dude.exceptions - package de.hpi.fgis.dude.exceptions
Contains all exceptions used within DuDe.
de.hpi.fgis.dude.exec - package de.hpi.fgis.dude.exec
Contains runnable test classes.
de.hpi.fgis.dude.exec.duplicatedetection - package de.hpi.fgis.dude.exec.duplicatedetection
Contains runnable test classes dealing with AbstractDuplicateDetection implementations.
de.hpi.fgis.dude.exec.recordlinkage - package de.hpi.fgis.dude.exec.recordlinkage
Contains runnable test classes dealing with AbstractRecordLinkage implementations.
de.hpi.fgis.dude.merger - package de.hpi.fgis.dude.merger
Contains classes related to record merging.
de.hpi.fgis.dude.output - package de.hpi.fgis.dude.output
Provides all interfaces and classes that can be used for printing results.
de.hpi.fgis.dude.output.statisticoutput - package de.hpi.fgis.dude.output.statisticoutput
Contains all classes that can be used for printing statistics of a run that are provided by the StatisticComponent.
de.hpi.fgis.dude.postprocessor - package de.hpi.fgis.dude.postprocessor
Provides all interfaces and classes dealing with the filtering of duplicate lists.
de.hpi.fgis.dude.preprocessor - package de.hpi.fgis.dude.preprocessor
Preprocessor is a component for manipulating DuDeObjects and gathering statistics while extracting the data.
de.hpi.fgis.dude.similarityfunction - package de.hpi.fgis.dude.similarityfunction
Contains all interfaces and classes for comparing DuDeObjectPairs.
de.hpi.fgis.dude.similarityfunction.aggregators - package de.hpi.fgis.dude.similarityfunction.aggregators
Contains aggregators for SimilarityFunctions.
de.hpi.fgis.dude.similarityfunction.contentbased - package de.hpi.fgis.dude.similarityfunction.contentbased
Contains all resources that belong to the content-based SimilarityFunction component.
de.hpi.fgis.dude.similarityfunction.contentbased.calculationstrategy - package de.hpi.fgis.dude.similarityfunction.contentbased.calculationstrategy
 
de.hpi.fgis.dude.similarityfunction.contentbased.impl - package de.hpi.fgis.dude.similarityfunction.contentbased.impl
Contains SimilarityFunction implementations that compare attribute values of DuDeObjects.
de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics - package de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics
Contains content-based SimilarityFunction implementations that use the Simmetrics library.
de.hpi.fgis.dude.similarityfunction.contentbased.util - package de.hpi.fgis.dude.similarityfunction.contentbased.util
Contains all the utility classes that are used by any ContentBasedSimilarityFunction.
de.hpi.fgis.dude.similarityfunction.domainspecific - package de.hpi.fgis.dude.similarityfunction.domainspecific
Contains similarity functions for domain specific data, e.g.
de.hpi.fgis.dude.similarityfunction.domainspecific.address - package de.hpi.fgis.dude.similarityfunction.domainspecific.address
 
de.hpi.fgis.dude.similarityfunction.domainspecific.address.misc - package de.hpi.fgis.dude.similarityfunction.domainspecific.address.misc
 
de.hpi.fgis.dude.similarityfunction.structurebased - package de.hpi.fgis.dude.similarityfunction.structurebased
Contains SimilarityFunction implementations that compare the structure of DuDeObjects.
de.hpi.fgis.dude.util - package de.hpi.fgis.dude.util
Provides DuDe's utility classes.
de.hpi.fgis.dude.util.bibtex - package de.hpi.fgis.dude.util.bibtex
The BibTeX parser package.
de.hpi.fgis.dude.util.bibtex.data - package de.hpi.fgis.dude.util.bibtex.data
This is an object model for parsed BibTeX files.
de.hpi.fgis.dude.util.bibtex.expander - package de.hpi.fgis.dude.util.bibtex.expander
The BibTeX expander package contains all classes that are used for extending the functionality of the BibTeX parser.
de.hpi.fgis.dude.util.bibtex.parser - package de.hpi.fgis.dude.util.bibtex.parser
Provides all the classes that are necessary for parsing the a BibTeX file.
de.hpi.fgis.dude.util.csv - package de.hpi.fgis.dude.util.csv
Contains all classes necessary for parsing and generating CSV-formatted data.
de.hpi.fgis.dude.util.data - package de.hpi.fgis.dude.util.data
Contains all the classes dealing with the actual data storage (in-memory or file-based).
de.hpi.fgis.dude.util.data.json - package de.hpi.fgis.dude.util.data.json
Contains all Json data types.
de.hpi.fgis.dude.util.data.json.auto - package de.hpi.fgis.dude.util.data.json.auto
 
de.hpi.fgis.dude.util.data.storage - package de.hpi.fgis.dude.util.data.storage
Contains all classes that deal with accessing Json code within files.
de.hpi.fgis.dude.util.merger - package de.hpi.fgis.dude.util.merger
Contains classes related to record merging.
de.hpi.fgis.dude.util.sorting - package de.hpi.fgis.dude.util.sorting
Contains all the classes and interfaces that deal with sorting of data.
de.hpi.fgis.dude.util.sorting.sorter - package de.hpi.fgis.dude.util.sorting.sorter
Contains different implementations of sorting algorithms.
de.hpi.fgis.dude.util.sorting.sortingkey - package de.hpi.fgis.dude.util.sorting.sortingkey
Contains all classes that are needed for creating a sorting key.
deactivate() - Method in enum de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent.Config
Deactivates usage of the current config measure.
deactivateDefault() - Method in class de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent
Deactivates the usage of the Default GMD as evaluation measure.
deactivateF1() - Method in class de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent
Deactivates the usage of PairwiseF1 as GMD evaluation measure.
deactivateHybrid() - Method in class de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent
Deactivates the usage of Hybrid GMD as evaluation measure.
deactivatePRECISION() - Method in class de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent
Deactivates the usage of Precision as GMD evaluation measure.
deactivateRECALL() - Method in class de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent
Deactivates the usage of Recall as GMD evaluation measure.
deactivateVI() - Method in class de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent
Deactivates the usage of the Variation of Information as GMD evaluation measure.
DEFAULT_ESCAPE_CHARACTER - Static variable in class de.hpi.fgis.dude.output.CSVOutput
The default escape character.
DEFAULT_ESCAPE_CHARACTER - Static variable in class de.hpi.fgis.dude.util.csv.CSVReader
The default escape character.
DEFAULT_ESCAPE_CHARACTER - Static variable in class de.hpi.fgis.dude.util.csv.CSVWriter
The default escape character.
DEFAULT_FOOTER - Static variable in class de.hpi.fgis.dude.output.JsonOutput
The default footer of the JsonOutput.
DEFAULT_HEADER - Static variable in class de.hpi.fgis.dude.output.JsonOutput
The default header of the JsonOutput.
DEFAULT_MAX_TOLERANCE_FACTOR - Static variable in class de.hpi.fgis.dude.similarityfunction.contentbased.impl.RelativeNumberDiffFunction
The default maximum tolerance factor that is used, if no other factor is set.
DEFAULT_QUOTE_CHARACTER - Static variable in class de.hpi.fgis.dude.output.CSVOutput
The default quote character.
DEFAULT_QUOTE_CHARACTER - Static variable in class de.hpi.fgis.dude.util.csv.CSVReader
The default quote character.
DEFAULT_QUOTE_CHARACTER - Static variable in class de.hpi.fgis.dude.util.csv.CSVWriter
The default quote character.
DEFAULT_SEPARATOR - Static variable in class de.hpi.fgis.dude.output.CSVOutput
The default separator character.
DEFAULT_SEPARATOR - Static variable in class de.hpi.fgis.dude.output.SimpleTextOutput
The default separator string that is used for separating both DuDeObjects, if no separator string is passed.
DEFAULT_SEPARATOR - Static variable in class de.hpi.fgis.dude.util.csv.CSVReader
The default separator character.
DEFAULT_SEPARATOR - Static variable in class de.hpi.fgis.dude.util.csv.CSVWriter
The default separator character.
DEFAULT_WINDOW_SIZE - Static variable in class de.hpi.fgis.dude.algorithm.duplicatedetection.SortedNeighborhoodMethod
The default window size that is used, if no window size was specified.
defaultColumnNames - Variable in class de.hpi.fgis.dude.output.CSVOutput
The default header.
defaultLabels - Static variable in class de.hpi.fgis.dude.output.statisticoutput.AbstractStatisticOutput
Default labels.
DefaultMerger - Class in de.hpi.fgis.dude.merger
DefaultMerger implements merge functionality.
DefaultMerger() - Constructor for class de.hpi.fgis.dude.merger.DefaultMerger
 
DefaultMerger - Class in de.hpi.fgis.dude.util.merger
DefaultMerger implements merge functionality.
DefaultMerger() - Constructor for class de.hpi.fgis.dude.util.merger.DefaultMerger
 
delete() - Method in class de.hpi.fgis.dude.util.data.storage.FileBasedStorage
Deletes the underlying file.
deleteFile() - Method in class de.hpi.fgis.dude.util.sorting.sorter.SortedDataFile
Deletes the file from the file system and clears the member data of this instance.
deleteWorkingDirOnExit() - Method in class de.hpi.fgis.dude.util.GlobalConfig
 
DiceCoefficientFunction - Class in de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics
DiceCoefficientFunction compares two DuDeObjects based on the Dice's Coefficient of the given attribute.
DiceCoefficientFunction() - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.DiceCoefficientFunction
Internal constructor for Jsonable deserialization.
DiceCoefficientFunction(String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.DiceCoefficientFunction
Initializes the DiceCoefficientFunction with the default tokenizer.
DiceCoefficientFunction(int, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.DiceCoefficientFunction
Initializes the DiceCoefficientFunction with the default tokenizer.
DiceCoefficientFunction(InterfaceTokeniser, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.DiceCoefficientFunction
Initializes the DiceCoefficientFunction with the passed InterfaceTokeniser.
DiceCoefficientFunction(InterfaceTokeniser, int, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.DiceCoefficientFunction
Initializes the DiceCoefficientFunction with the passed InterfaceTokeniser.
disableCaseSensitivity(Locale) - Method in class de.hpi.fgis.dude.util.sorting.sortingkey.TextBasedSubkey
Disables case-sensitivity for the comparisons.
disableCaseSensitivity() - Method in class de.hpi.fgis.dude.util.sorting.sortingkey.TextBasedSubkey
Disables case-sensitivity for the comparisons.
disableFormattedJson() - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonGenerator
Disables formatted Json.
disableFormattedJson() - Method in class de.hpi.fgis.dude.util.data.storage.AbstractDuDeStorage
 
disableFormattedJson() - Method in interface de.hpi.fgis.dude.util.data.storage.JsonWritable
Disables formatted Json for this storage.
disableFormattedJson() - Method in class de.hpi.fgis.dude.util.GlobalConfig
Disables formatted Json.
disableHeader() - Method in class de.hpi.fgis.dude.datasource.CSVSource
Disables header reading.
disableHeader() - Method in class de.hpi.fgis.dude.output.statisticoutput.CSVStatisticOutput
Disables writing the header into the output.
disableInMemoryProcessing() - Method in class de.hpi.fgis.dude.algorithm.AbstractAlgorithm
 
disableInMemoryProcessing() - Method in interface de.hpi.fgis.dude.algorithm.Algorithm
Disables in-memory processing.
disableInMemoryProcessing() - Method in class de.hpi.fgis.dude.util.Experiment
Disables in-memory processing.
disablePrintingCompleteIdentifier() - Method in class de.hpi.fgis.dude.output.CSVOutput
If this is disabled, the source id won't be printed.
disableRetainingUnresolvedMacroReferences() - Method in class de.hpi.fgis.dude.datasource.BibtexSource
Disables retaining the macros, if their value cannot be resolved.
disableSourceIdMapping() - Method in class de.hpi.fgis.dude.util.GoldStandard
Disables source-id mapping.
disableStatistics() - Method in class de.hpi.fgis.dude.util.Experiment
Disables gathering statistics.
disableTransitiveClosures() - Method in class de.hpi.fgis.dude.util.Experiment
Disables transitive closure processing.
DocumentFrequencyPreprocessor - Class in de.hpi.fgis.dude.preprocessor
The DocumentFrequencyPreprocessor collects frequencies of terms within an attribute value.
DocumentFrequencyPreprocessor(String) - Constructor for class de.hpi.fgis.dude.preprocessor.DocumentFrequencyPreprocessor
Initializes a DocumentFrequencyPreprocessor object for the passed attribute.
DOUBLE_EQUALITY_EPSILON - Static variable in class de.hpi.fgis.dude.similarityfunction.contentbased.impl.RelativeNumberDiffFunction
The allowed variation of two doubles, so that the two values are regarded as equal.
DuDeJsonGenerator - Class in de.hpi.fgis.dude.util.data.json
DuDeJsonGenerator is another implementation for generating Json code.
DuDeJsonGenerator(OutputStream) - Constructor for class de.hpi.fgis.dude.util.data.json.DuDeJsonGenerator
Initializes a new DuDeJsonGenerator.
DuDeJsonGenerator(Writer) - Constructor for class de.hpi.fgis.dude.util.data.json.DuDeJsonGenerator
Initializes a new DuDeJsonGenerator.
DuDeJsonGenerator(JsonGenerator) - Constructor for class de.hpi.fgis.dude.util.data.json.DuDeJsonGenerator
Initializes a new DuDeJsonGenerator with the given generator.
DuDeJsonParser<T extends Jsonable> - Class in de.hpi.fgis.dude.util.data.json
DuDeJsonParser can be used for converting a String containing Json syntax into its Java representation.
DuDeJsonParser(Class<T>) - Constructor for class de.hpi.fgis.dude.util.data.json.DuDeJsonParser
Internal constructor for sub classing.
DuDeJsonParser(Class<T>, InputStream) - Constructor for class de.hpi.fgis.dude.util.data.json.DuDeJsonParser
Initializes a new DuDeJsonParser using the passed InputStream.
DuDeJsonParser(Class<T>, Reader) - Constructor for class de.hpi.fgis.dude.util.data.json.DuDeJsonParser
Initializes a new DuDeJsonParser using the passed Reader.
DuDeJsonParser(Class<T>, String) - Constructor for class de.hpi.fgis.dude.util.data.json.DuDeJsonParser
Initializes a new DuDeJsonParser that parses the passed String.
DuDeJsonParser(InputStream) - Constructor for class de.hpi.fgis.dude.util.data.json.DuDeJsonParser
Initializes a new DuDeJsonParser using the passed InputStream.
DuDeJsonParser(Reader) - Constructor for class de.hpi.fgis.dude.util.data.json.DuDeJsonParser
Initializes a new DuDeJsonParser using the passed Reader.
DuDeJsonParser(String) - Constructor for class de.hpi.fgis.dude.util.data.json.DuDeJsonParser
Initializes a new DuDeJsonParser that parses the passed String.
DuDeObject - Class in de.hpi.fgis.dude.util.data
DuDeObject encapsulates the data of the original object and two ids (for the source and a local one) for identifying each DuDeObject.
DuDeObject(JsonRecord, String, JsonArray) - Constructor for class de.hpi.fgis.dude.util.data.DuDeObject
Initializes a DuDeObject with the passed ids and the given data.
DuDeObject(String, JsonArray) - Constructor for class de.hpi.fgis.dude.util.data.DuDeObject
Creates a DuDeObject reference.
DuDeObject(JsonRecord, String, String) - Constructor for class de.hpi.fgis.dude.util.data.DuDeObject
Initializes a DuDeObject using the given ids.
DuDeObject(String, String) - Constructor for class de.hpi.fgis.dude.util.data.DuDeObject
Creates a DuDeObject reference.
DuDeObjectId - Class in de.hpi.fgis.dude.util.data
DuDeObjectId encapsulates the identifying information of each DuDeObject.
DuDeObjectId() - Constructor for class de.hpi.fgis.dude.util.data.DuDeObjectId
Initializes an invalid DuDeObjectId.
DuDeObjectId(String, JsonArray) - Constructor for class de.hpi.fgis.dude.util.data.DuDeObjectId
Initializes a DuDeObjectId with the passed identifiers.
DuDeObjectPair - Class in de.hpi.fgis.dude.util.data
DuDeObjectPair is an extension of the OrderedPair class, that encapsulates pairs of DuDeObjects.
DuDeObjectPair(DuDeObject, DuDeObject) - Constructor for class de.hpi.fgis.dude.util.data.DuDeObjectPair
Initializes a pair of DuDeObjects.
DuDeObjectPair(String, JsonArray, String, JsonArray) - Constructor for class de.hpi.fgis.dude.util.data.DuDeObjectPair
Initializes a DuDeObjectPair that contains no real data.
DuDeObjectPair(String, String, String, String) - Constructor for class de.hpi.fgis.dude.util.data.DuDeObjectPair
Initializes a DuDeObjectPair that contains no real data and where the object id is given as a single String value.
DuDeObjectPair.DuplicateType - Enum in de.hpi.fgis.dude.util.data
DuDeObjectPair.DuplicateType declares all possible values for the is-duplicate property.
DuDeObjectPair.GeneratedBy - Enum in de.hpi.fgis.dude.util.data
DuDeObjectPair.GeneratedBy declares the possible values for the lineage property.
DuDeObjectSorter - Interface in de.hpi.fgis.dude.util.sorting.sorter
The interface DuDeObjectSorter provides the method signatures for sorting a collection of DuDeObjects.
DuDeObjectSource - Class in de.hpi.fgis.dude.datasource
DuDeObjectSource is an in-memory DataSource.
DuDeObjectSource(String, DuDeStorage<DuDeObject>) - Constructor for class de.hpi.fgis.dude.datasource.DuDeObjectSource
Initializes the DuDeObjectSource with the passed identifier and DuDeStorage instance.
DuDeObjectSource(String, Collection<DuDeObject>) - Constructor for class de.hpi.fgis.dude.datasource.DuDeObjectSource
Initializes the DuDeObjectSource with the passed identifier and collection.
DuDeOutput - Interface in de.hpi.fgis.dude.output
DuDeOutput is an interface for writing DuDeObjectPairs onto an stream.
DuDeStorage<T extends Jsonable> - Interface in de.hpi.fgis.dude.util.data.storage
DuDeStorage stores Jsonable instances.
DUPLICATE_PROPERTY - Static variable in class de.hpi.fgis.dude.util.data.DuDeObjectPair
The name of the property that indicates, if the current pair is a duplicate.
DuplicateCountSNM - Class in de.hpi.fgis.dude.algorithm.duplicatedetection
AdaptiveWindowSizeSNM implements the Adaptive-Window-Size Sorted-Neighborhood Method that was introduced by Oliver Wonneberg.
DuplicateCountSNM() - Constructor for class de.hpi.fgis.dude.algorithm.duplicatedetection.DuplicateCountSNM
For serialization
DuplicateCountSNM.AdaptionMode - Enum in de.hpi.fgis.dude.algorithm.duplicatedetection
This enumeration collects all the modes which can be used.
DuplicateCountSNM.AdaptiveWindowSizeSNMBuilder - Class in de.hpi.fgis.dude.algorithm.duplicatedetection
The AdaptiveWindowSizeSNM.AdaptiveWindowSizeSNMBuilder maintains the adaptable window size of the AdaptiveWindowSizeSNM.
DuplicateCountSNM.AdaptiveWindowSizeSNMBuilder(SortingKey, DuplicateCountSNM.AdaptionMode) - Constructor for class de.hpi.fgis.dude.algorithm.duplicatedetection.DuplicateCountSNM.AdaptiveWindowSizeSNMBuilder
Initializes the AdaptiveWindowSizeSNM.AdaptiveWindowSizeSNMBuilder with a SortingKey and the mode that shall be used.
DuplicateCountSNM.AdaptiveWindowSizeSNMIterator - Class in de.hpi.fgis.dude.algorithm.duplicatedetection
AdaptiveWindowSizeSNMIterator implements the behavior of the Adaptive-Window-Size SNM algorithm.
DuplicateCountSNM.AdaptiveWindowSizeSNMIterator(DuplicateCountSNM) - Constructor for class de.hpi.fgis.dude.algorithm.duplicatedetection.DuplicateCountSNM.AdaptiveWindowSizeSNMIterator
Initializes a AdaptiveWindowSizeSNMIterator.
DuplicateCountSNM.ComparisonResult - Enum in de.hpi.fgis.dude.algorithm.duplicatedetection
The comparison of a DuDeObjectPair can either yield a DUPLICATE or a NON_DUPLICATE
DuplicateCountSNMExec - Class in de.hpi.fgis.dude.exec.duplicatedetection
Executes the DuplicateCountSNM.
DuplicateCountSNMExec() - Constructor for class de.hpi.fgis.dude.exec.duplicatedetection.DuplicateCountSNMExec
 
duplicates - Variable in class de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent
Collection of all duplicate pairs of the comparisons.

E

elementIsSet(int, int) - Method in class de.hpi.fgis.dude.postprocessor.WarshallTransitiveClosureGenerator.AdjacencyList
 
elementIsSet(int, int) - Method in class de.hpi.fgis.dude.postprocessor.WarshallTransitiveClosureGenerator.AdjacencyMatrix
 
elementIsSet(int, int) - Method in class de.hpi.fgis.dude.postprocessor.WarshallTransitiveClosureGenerator.GraphRepresentation
Checks whether there is an edge between the two elements in the graph (element is already set in the matrix).
enableAdjacencyListRepresentation() - Method in class de.hpi.fgis.dude.postprocessor.WarshallTransitiveClosureGenerator
Uses a WarshallTransitiveClosureGenerator.AdjacencyList to represent the graph.
enableAdjacencyMatrixRepresentation() - Method in class de.hpi.fgis.dude.postprocessor.WarshallTransitiveClosureGenerator
Uses a WarshallTransitiveClosureGenerator.AdjacencyMatrix to represent the graph.
enableCaseSensitivity() - Method in class de.hpi.fgis.dude.util.sorting.sortingkey.TextBasedSubkey
Enables case-sensitivity for the comparisons.
enableFormattedJson() - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonGenerator
Enables formatted Json.
enableFormattedJson() - Method in class de.hpi.fgis.dude.util.data.storage.AbstractDuDeStorage
 
enableFormattedJson() - Method in interface de.hpi.fgis.dude.util.data.storage.JsonWritable
Enables formatted Json for this storage.
enableFormattedJson() - Method in class de.hpi.fgis.dude.util.GlobalConfig
Enables formatted Json.
enableHeader() - Method in class de.hpi.fgis.dude.datasource.CSVSource
Enables header reading.
enableHeader() - Method in class de.hpi.fgis.dude.output.statisticoutput.CSVStatisticOutput
Enables writing the header into the output.
enableInMemoryProcessing() - Method in class de.hpi.fgis.dude.algorithm.AbstractAlgorithm
 
enableInMemoryProcessing() - Method in interface de.hpi.fgis.dude.algorithm.Algorithm
Enables in-memory processing.
enableInMemoryProcessing() - Method in class de.hpi.fgis.dude.util.Experiment
Enables in-memory processing.
enablePrintingCompleteIdentifier() - Method in class de.hpi.fgis.dude.output.CSVOutput
If this is enabled, the complete identifier is printed.
enableRetainingUnresolvedMacroReferences() - Method in class de.hpi.fgis.dude.datasource.BibtexSource
Enables retaining the macros, if their value cannot be resolved.
enableSourceIdMapping() - Method in class de.hpi.fgis.dude.util.GoldStandard
Enables source-id mapping.
enableStatistics() - Method in class de.hpi.fgis.dude.util.Experiment
Enables gathering statistics.
enableTransitiveClosures() - Method in class de.hpi.fgis.dude.util.Experiment
Enables transitive closure processing.
endDate - Variable in class de.hpi.fgis.dude.postprocessor.StatisticComponent
 
endOfContentReached() - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonParser
Checks whether the end of the content was reached.
endOfWindowIsReached() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.DuplicateCountSNM.AdaptiveWindowSizeSNMIterator
Checks whether the current object was already compared with each other object of the window queue.
endTime - Variable in class de.hpi.fgis.dude.postprocessor.StatisticComponent
 
endTokenReached() - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonParser
Checks whether the current token declares the end of a complex Json structure.
enforceNoEof(String, boolean) - Method in class de.hpi.fgis.dude.util.bibtex.parser.PseudoLexer
make sure you call
enoughMemoryAvailable() - Method in class de.hpi.fgis.dude.util.MemoryChecker
Checks whether there is enough memory available.
EnumJsonSerialization<E extends Enum<E>> - Class in de.hpi.fgis.dude.util.data.json.auto
Enum constant serialization.
EnumJsonSerialization(BoundType) - Constructor for class de.hpi.fgis.dude.util.data.json.auto.EnumJsonSerialization
Initializes the serialization for the given type.
eof() - Method in class de.hpi.fgis.dude.util.bibtex.parser.LookAheadReader
Checks if the end-of-content was reached.
equals(Object) - Method in class de.hpi.fgis.dude.algorithm.AbstractAlgorithm
 
equals(Object) - Method in class de.hpi.fgis.dude.algorithm.AbstractDuplicateDetection
 
equals(Object) - Method in class de.hpi.fgis.dude.algorithm.AbstractRecordLinkage
 
equals(Database) - Method in class de.hpi.fgis.dude.database.adapter.Database
Checks whether two Database instances have the same information stored.
equals(Object) - Method in class de.hpi.fgis.dude.database.util.ColumnInfo
 
equals(DBInfo) - Method in class de.hpi.fgis.dude.database.util.DBInfo
Checks whether two DBInfo instances have the same information stored.
equals(Object) - Method in class de.hpi.fgis.dude.datasource.AbstractDataSource
 
equals(Object) - Method in class de.hpi.fgis.dude.output.statisticoutput.SimpleStatisticOutput
 
equals(Object) - Method in class de.hpi.fgis.dude.postprocessor.WarshallTransitiveClosureGenerator.IntPair
 
equals(Object) - Method in class de.hpi.fgis.dude.similarityfunction.aggregators.Aggregator
 
equals(Object) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.ContentBasedSimilarityFunction
 
equals(Object) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.impl.AbsoluteNumberDiffFunction
 
equals(Object) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.impl.RelativeNumberDiffFunction
 
equals(Object) - Method in class de.hpi.fgis.dude.util.BoundType
 
equals(Object) - Method in class de.hpi.fgis.dude.util.data.DuDeObject
Two objects are equal, if both have the same ids.
equals(Object) - Method in class de.hpi.fgis.dude.util.data.DuDeObjectId
 
equals(Object) - Method in class de.hpi.fgis.dude.util.data.DuDeObjectPair
 
equals(Object) - Method in class de.hpi.fgis.dude.util.data.json.JsonBoolean
 
equals(Object) - Method in class de.hpi.fgis.dude.util.data.json.JsonNull
A JsonRecord.JsonNull object is equal to the null value and to itself.
equals(Object) - Method in class de.hpi.fgis.dude.util.data.json.JsonNumber
 
equals(Object) - Method in class de.hpi.fgis.dude.util.data.json.JsonString
 
equals(Object) - Method in class de.hpi.fgis.dude.util.Experiment
 
equals(Object) - Method in class de.hpi.fgis.dude.util.GoldStandard
 
equals(Object) - Method in class de.hpi.fgis.dude.util.Pair
 
equals(Object) - Method in class de.hpi.fgis.dude.util.sorting.sortingkey.AbstractSubkey
 
equals(Object) - Method in class de.hpi.fgis.dude.util.sorting.sortingkey.SortingKey
 
equals(Object) - Method in class de.hpi.fgis.dude.util.sorting.sortingkey.TextBasedSubkey
 
equalsID(DuDeObject) - Method in class de.hpi.fgis.dude.util.data.DuDeObject
Two objects are equal, if both have the same ids.
EquationSimilarityFunction - Class in de.hpi.fgis.dude.similarityfunction.contentbased.impl
EquationSimilarityFunction checks if two values are equal to each other.
EquationSimilarityFunction() - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.EquationSimilarityFunction
Internal constructor for Jsonable deserialization.
EquationSimilarityFunction(String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.EquationSimilarityFunction
Initializes a EquationSimilarityFunction with the passed default attribute.
EquationSimilarityFunction(int, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.EquationSimilarityFunction
Initializes a EquationSimilarityFunction with the passed default attribute.
EuclideanDistanceFunction - Class in de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics
EuclideanDistanceFunction compares two DuDeObjects based on the Euclidean Distance of the given attribute.
EuclideanDistanceFunction() - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.EuclideanDistanceFunction
Internal constructor for Jsonable deserialization.
EuclideanDistanceFunction(String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.EuclideanDistanceFunction
Initializes the EuclideanDistanceFunction with the default tokenizer.
EuclideanDistanceFunction(int, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.EuclideanDistanceFunction
Initializes the EuclideanDistanceFunction with the default tokenizer.
EuclideanDistanceFunction(InterfaceTokeniser, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.EuclideanDistanceFunction
Initializes the EuclideanDistanceFunction with the passed InterfaceTokeniser.
EuclideanDistanceFunction(InterfaceTokeniser, int, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.EuclideanDistanceFunction
Initializes the EuclideanDistanceFunction with the passed InterfaceTokeniser.
expand(BibtexFile) - Method in class de.hpi.fgis.dude.util.bibtex.expander.CrossReferenceExpander
Note: If you don't use the MacroReferenceExpander first, this function may lead to inconsistent macro references.
expand(BibtexFile) - Method in interface de.hpi.fgis.dude.util.bibtex.expander.Expander
 
expand(BibtexFile) - Method in class de.hpi.fgis.dude.util.bibtex.expander.MacroReferenceExpander
This method walks over all entries in a BibtexFile and expands macro references.
expand(BibtexFile) - Method in class de.hpi.fgis.dude.util.bibtex.expander.PersonListExpander
This method will expand all author and editor fields (if configured in the constructor) into BibtexPersonList values.
Expander - Interface in de.hpi.fgis.dude.util.bibtex.expander
An expander is a transformer that makes a bibtex model more elaborate.
ExpansionException - Exception in de.hpi.fgis.dude.util.bibtex.expander
Exception thrown by an Expander object.
ExpansionException(Throwable) - Constructor for exception de.hpi.fgis.dude.util.bibtex.expander.ExpansionException
 
ExpansionException(String) - Constructor for exception de.hpi.fgis.dude.util.bibtex.expander.ExpansionException
 
Experiment - Class in de.hpi.fgis.dude.util
Experiment is a Wrapper for hiding the actual process of checking each pair of records.
Experiment() - Constructor for class de.hpi.fgis.dude.util.Experiment
Initializes an Experiment.
ExperimentExec - Class in de.hpi.fgis.dude.exec.duplicatedetection
The execution class for the Experiment class.
ExperimentExec() - Constructor for class de.hpi.fgis.dude.exec.duplicatedetection.ExperimentExec
 
ExtendedStatisticComponent - Class in de.hpi.fgis.dude.postprocessor
ExtendedStatisticComponent provides functionality for gathering statistics concerning different measures that can be realized with the Generalized Merge Distance (GMD).
ExtendedStatisticComponent(Algorithm) - Constructor for class de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent
Initializes an ExtendedStatisticComponent with no gold standard and default configuration for GMD.
ExtendedStatisticComponent(GoldStandard, Algorithm, ExtendedStatisticComponent.Config) - Constructor for class de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent
Initializes an ExtendedStatisticComponent using the passed DuDeObjectPairs as real duplicates.
ExtendedStatisticComponent() - Constructor for class de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent
Internal constructor for Jsonable deserialization.
ExtendedStatisticComponent.Config - Enum in de.hpi.fgis.dude.postprocessor
 
ExtractionFailedException - Exception in de.hpi.fgis.dude.exceptions
This exception should be raised by a DataSource implementation, if an object can't be extracted.
ExtractionFailedException() - Constructor for exception de.hpi.fgis.dude.exceptions.ExtractionFailedException
Calls NoSuchElementException.NoSuchElementException().
ExtractionFailedException(String) - Constructor for exception de.hpi.fgis.dude.exceptions.ExtractionFailedException
Calls NoSuchElementException.NoSuchElementException(String).
ExtractionFailedException(Throwable) - Constructor for exception de.hpi.fgis.dude.exceptions.ExtractionFailedException
Calls NoSuchElementException.NoSuchElementException() and stores the passed cause.
ExtractionFailedException(String, Throwable) - Constructor for exception de.hpi.fgis.dude.exceptions.ExtractionFailedException
Calls NoSuchElementException.NoSuchElementException(String) and stores the passed cause.

F

FALSE - Static variable in class de.hpi.fgis.dude.util.data.json.JsonBoolean
A Json-convertible false.
falseNegativesByComparison - Variable in class de.hpi.fgis.dude.postprocessor.StatisticComponent
 
falsePositives - Variable in class de.hpi.fgis.dude.postprocessor.StatisticComponent
 
falsePositivesByComparison - Variable in class de.hpi.fgis.dude.postprocessor.StatisticComponent
 
FamilyNameSimilarityFunction - Class in de.hpi.fgis.dude.similarityfunction.domainspecific.address
FamilyNameSimilarityFunction compares two strings and treats them as family names, allowing for some special normalization and comparison techniques.
FamilyNameSimilarityFunction(String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.domainspecific.address.FamilyNameSimilarityFunction
 
FileBasedStorage<T extends Jsonable> - Class in de.hpi.fgis.dude.util.data.storage
FileBasedStorage stores Jsonable instances in files.
FileBasedStorage(String) - Constructor for class de.hpi.fgis.dude.util.data.storage.FileBasedStorage
Initializes a FileBasedStorage instance with the passed name.
FileBasedStorage(Class<T>, String) - Constructor for class de.hpi.fgis.dude.util.data.storage.FileBasedStorage
Initializes a FileBasedStorage instance with the passed type information and a name.
FileBasedStorage(String, Collection<T>) - Constructor for class de.hpi.fgis.dude.util.data.storage.FileBasedStorage
Initializes a FileBasedStorage instance with the passed name, and the initial content.
FileBasedStorage(Class<T>, String, Collection<T>) - Constructor for class de.hpi.fgis.dude.util.data.storage.FileBasedStorage
Initializes a FileBasedStorage instance with the passed type information, its name, and the initial content.
FileBasedStorage(String, String) - Constructor for class de.hpi.fgis.dude.util.data.storage.FileBasedStorage
Initializes a FileBasedStorage instance with the type information, a directory and its name.
FileBasedStorage(Class<T>, String, String) - Constructor for class de.hpi.fgis.dude.util.data.storage.FileBasedStorage
Initializes a FileBasedStorage instance with the type information, a directory and its name.
FileBasedStorage(String, String, Collection<T>) - Constructor for class de.hpi.fgis.dude.util.data.storage.FileBasedStorage
Initializes a FileBasedStorage instance with the type information, a directory, its name, and the initial content.
FileBasedStorage(Class<T>, String, String, Collection<T>) - Constructor for class de.hpi.fgis.dude.util.data.storage.FileBasedStorage
Initializes a FileBasedStorage instance with the type information, a directory, its name, and the initial content.
FileBasedStorage(Class<T>, File, Collection<T>) - Constructor for class de.hpi.fgis.dude.util.data.storage.FileBasedStorage
Initializes a FileBasedStorage instance with the type information, the underlying file, and the initial content.
FileBasedStorage(Class<T>, File) - Constructor for class de.hpi.fgis.dude.util.data.storage.FileBasedStorage
Initializes a FileBasedStorage instance with the type information, the underlying file, and the initial content.
FilenameManager - Class in de.hpi.fgis.dude.util
The FilenameManager manages all filenames within a directory.
FilenameManager(String) - Constructor for class de.hpi.fgis.dude.util.FilenameManager
Initializes a FilenameManager for the passed directory.
finalize() - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonParser
Closes all streams if it is not already done.
findCluster(HashMap<DuDeObject, Integer>, DuDeObject) - Method in class de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent
Searches for the value of a given key in a hashmap by comparing their ids (from source and object)
finish() - Method in class de.hpi.fgis.dude.preprocessor.CountPreprocessor
 
finish() - Method in class de.hpi.fgis.dude.preprocessor.DocumentFrequencyPreprocessor
 
finish() - Method in interface de.hpi.fgis.dude.preprocessor.Preprocessor
This method is called after finishing the data extraction process.
finishExpansion() - Method in class de.hpi.fgis.dude.util.bibtex.expander.AbstractExpander
Call this at the end of your expand(BibtexFile) implementation.
finishExtraction() - Method in class de.hpi.fgis.dude.algorithm.AbstractAlgorithm
Sets a flag which indicates that the extraction process is finished.
finishPreprocessing() - Method in class de.hpi.fgis.dude.algorithm.AbstractAlgorithm
Executes Preprocessor.finish() method of each added Preprocessor.
finishTypeSerializationRead(DuDeJsonParser<?>, Class<T>) - Method in class de.hpi.fgis.dude.util.data.json.auto.AutoJsonSerialization
Cleanup method used in conjunction with #readSerializedType(DuDeJsonParser, boolean).
first(List<T>) - Static method in class de.hpi.fgis.dude.util.CollectionUtil
Returns the first element of the collection.
first(Iterable<T>) - Static method in class de.hpi.fgis.dude.util.CollectionUtil
Returns the first element of the collection.
FIRST_ELEMENT_ATTRIBUTE_NAME - Static variable in class de.hpi.fgis.dude.util.data.DuDeObjectPair
The first element's Json attribute name.
FIRST_ELEMENTS_DEFAULT_OBJECT_ID_ATTRIBUTE - Static variable in class de.hpi.fgis.dude.util.GoldStandard
The default attribute for extracting the first element's object id.
FIRST_OBJECT_ATTRIBUTE - Static variable in class de.hpi.fgis.dude.output.JsonOutput
The attribute name of the first object within each pair.
flush() - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonGenerator
Flushes the internal buffer.
fMerge(double, double) - Method in enum de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent.Config
Computes the costs for a merge operation
forceExtraction() - Method in class de.hpi.fgis.dude.algorithm.AbstractAlgorithm
Forces a new initialization phase before returning the next Iterator.
formattedJsonIsEnabled() - Method in class de.hpi.fgis.dude.util.GlobalConfig
Checks whether formatted Json is enabled.
fromIterables(Iterable<Iterable<T>>) - Static method in class de.hpi.fgis.dude.util.ChainedIterator
Creates a chained iterator from Iterables.
fromJson(DuDeJsonParser<?>) - Method in class de.hpi.fgis.dude.algorithm.AbstractRecordLinkage
 
fromJson(DuDeJsonParser<?>) - Method in class de.hpi.fgis.dude.datasource.BibtexSource
 
fromJson(DuDeJsonParser<?>) - Method in class de.hpi.fgis.dude.datasource.CSVSource
 
fromJson(DuDeJsonParser<?>) - Method in class de.hpi.fgis.dude.datasource.JSONSource
 
fromJson(DuDeJsonParser<?>) - Method in class de.hpi.fgis.dude.datasource.XMLSource
 
fromJson(DuDeJsonParser<?>) - Method in class de.hpi.fgis.dude.output.AbstractDuDeOutput
 
fromJson(DuDeJsonParser<?>) - Method in class de.hpi.fgis.dude.output.CSVOutput
 
fromJson(DuDeJsonParser<?>) - Method in class de.hpi.fgis.dude.output.statisticoutput.CSVStatisticOutput
 
fromJson(DuDeJsonParser<?>) - Method in class de.hpi.fgis.dude.output.statisticoutput.SimpleStatisticOutput
 
fromJson(DuDeJsonParser<?>) - Method in class de.hpi.fgis.dude.util.data.DuDeObject
 
fromJson(DuDeJsonParser<?>) - Method in class de.hpi.fgis.dude.util.data.DuDeObjectId
 
fromJson(DuDeJsonParser<?>) - Method in class de.hpi.fgis.dude.util.data.DuDeObjectPair
 
fromJson(DuDeJsonParser<?>) - Method in class de.hpi.fgis.dude.util.data.json.JsonArray
 
fromJson(DuDeJsonParser<?>) - Method in class de.hpi.fgis.dude.util.data.json.JsonBoolean
 
fromJson(DuDeJsonParser<?>) - Method in class de.hpi.fgis.dude.util.data.json.JsonNull
 
fromJson(DuDeJsonParser<?>) - Method in class de.hpi.fgis.dude.util.data.json.JsonNumber
 
fromJson(DuDeJsonParser<?>) - Method in class de.hpi.fgis.dude.util.data.json.JsonRecord
 
fromJson(DuDeJsonParser<?>) - Method in class de.hpi.fgis.dude.util.data.json.JsonString
 
fromJson(String, Class<T>) - Static method in class de.hpi.fgis.dude.util.data.json.JsonUtil
Parses the given json string to an instance of the given Jsonable type.
fromJson(DuDeJsonParser<?>) - Method in interface de.hpi.fgis.dude.util.data.Jsonable
Initializes the current instance using the passed DuDeJsonParser.
fromJson(DuDeJsonParser<?>) - Method in class de.hpi.fgis.dude.util.Experiment
 
fromSameSource(DuDeObjectId) - Method in class de.hpi.fgis.dude.util.data.DuDeObjectId
Checks if the passed id has the same source information.
fSplit(double, double) - Method in enum de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent.Config
Computes the costs for a split operation
FunctionSelector - Class in de.hpi.fgis.dude.similarityfunction.domainspecific.address.misc
maps from a class/datatype/semantics name to similarity function
FunctionSelector() - Constructor for class de.hpi.fgis.dude.similarityfunction.domainspecific.address.misc.FunctionSelector
 
fuzzyOutputSet() - Method in class de.hpi.fgis.dude.util.Experiment
Checks whether any output is set.

G

generateClusters() - Method in class de.hpi.fgis.dude.util.GoldStandard
If not already existing, generates a list of clusters containing the duplicates.
generateClustersForUnique(Vector<Vector<DuDeObject>>) - Method in class de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent
Generates for each unique item (that is not listed in the pairwise er result) its own cluster and adds it to the given clusters.
generateClustersForUnique(Vector<Vector<DuDeObject>>) - Method in class de.hpi.fgis.dude.util.GoldStandard
Generates for each unique item (that is not listed in the pairwise gold standard) its own cluster and adds it to the given goldCluster.
generateDataLine(String...) - Method in class de.hpi.fgis.dude.util.csv.CSVWriter
Converts the String array into a valid CSV form.
generateDuplicatePairs() - Method in class de.hpi.fgis.dude.util.GoldStandard
Generates a gold standard in duplicate pair format from a given one in cluster format.
generateEmptyJsonFile() - Method in class de.hpi.fgis.dude.util.data.storage.FileBasedStorage
Overwrites any existing file and initializes a new one containing an empty Json array.
generateSortedFile() - Method in class de.hpi.fgis.dude.util.sorting.sorter.TwoPhaseMultiWayMergeSorter
Sorts the in-memory data and writes it into a file.
generateString() - Method in class de.hpi.fgis.dude.util.data.json.JsonArray
Generates a concatenated String out of all values stored in this JsonArray.
generateString() - Method in class de.hpi.fgis.dude.util.data.json.JsonRecord
Generates a concatenated String out of all values stored in this JsonRecord.
getAbortThreshold() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.DuplicateCountSNM
Returns the abort threshold.
getAdaptionMode() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.DuplicateCountSNM
Returns the set adaptation mode.
getAggregatedSimilarity(double[], int[]) - Method in class de.hpi.fgis.dude.similarityfunction.aggregators.Aggregator
Aggregates the passed similarities and returns the aggregated similarity.
getAggregatedSimilarity(double[], int[]) - Method in class de.hpi.fgis.dude.similarityfunction.aggregators.Average
 
getAggregatedSimilarity(double[], int[]) - Method in class de.hpi.fgis.dude.similarityfunction.aggregators.HarmonicMean
 
getAggregatedSimilarity(double[], int[]) - Method in class de.hpi.fgis.dude.similarityfunction.aggregators.Maximum
 
getAggregatedSimilarity(double[], int[]) - Method in class de.hpi.fgis.dude.similarityfunction.aggregators.Minimum
 
getAlgorithm() - Method in class de.hpi.fgis.dude.util.Experiment
Returns the Algorithm.
getAnnotation(Class<?>, Class<? extends Annotation>) - Static method in class de.hpi.fgis.dude.util.ReflectUtil
Returns the first annotation of the specified annotation type for the given type.
If no annotation is found for the type, the hierarchical ancestors are examined.
getAttribute(DuDeObject) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.ContentBasedSimilarityFunction
Returns the attribute path that is valid for the passed DuDeObject.
getAttribute(DuDeObject) - Method in class de.hpi.fgis.dude.util.sorting.sortingkey.AbstractSubkey
Returns the attribute that corresponds to source of the passed DuDeObject.
getAttributeValue(String, int) - Method in class de.hpi.fgis.dude.util.data.DuDeObject
Returns the concrete value of the given attribute at the passed position.
getAttributeValue(String) - Method in class de.hpi.fgis.dude.util.data.DuDeObject
Returns the first value of the given attribute.
getAttributeValues(String) - Method in class de.hpi.fgis.dude.util.data.DuDeObject
Looks within the current DuDeObject for the given attribute.
getAttributeValuesByPath(String...) - Method in class de.hpi.fgis.dude.util.data.DuDeObject
Traverses the passed path and returns the corresponding attribute value or null, if the passed path is invalid.
getAverageMemoryUsed() - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Gets the registered average amount of memory used during the experiment.
getAverageMemoryUsed() - Method in class de.hpi.fgis.dude.util.MemoryCheckerTask
Gets the average amount of memory used by the current application.
getCallCount() - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.calculationstrategy.IgnoreStrategy
Returns the current call count.
getCause() - Method in exception de.hpi.fgis.dude.exceptions.DCSNMMissingNotificationException
 
getCause() - Method in exception de.hpi.fgis.dude.exceptions.ExtractionFailedException
 
getCause() - Method in exception de.hpi.fgis.dude.exceptions.InvalidJsonException
 
getCharBlockKey() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.SortedBlocks
Returns the number of characters of the sorting key that are used for defining the blocks.
getClassForPrimtive(Class<?>) - Static method in class de.hpi.fgis.dude.util.ReflectUtil
Returns the boxing class for the given primitive type.
getCluster() - Method in class de.hpi.fgis.dude.util.GoldStandard
Triggers the generation of a gold standard in cluster format.
getColumn() - Method in class de.hpi.fgis.dude.util.bibtex.parser.LookAheadReader
Returns the current column.
getColumnCount() - Method in class de.hpi.fgis.dude.util.csv.CSVReader
Returns the number of columns of the currently processed data source.
getColumnInfo(int) - Method in class de.hpi.fgis.dude.database.util.Schema
Returns the information of the column with the given index.
getColumnName(int) - Method in class de.hpi.fgis.dude.database.util.Schema
Returns the name of the specified column.
getColumnNames() - Method in class de.hpi.fgis.dude.datasource.CSVSource
Returns the column names or null, if no column names were set.
getColumnSQLType(int) - Method in class de.hpi.fgis.dude.database.util.Schema
Returns the data type of the specified column.
getComparisonCount() - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Returns the number of pairs that were already compared.
getComplexFields() - Method in class de.hpi.fgis.dude.util.data.json.auto.CompositeJsonSerialization
Returns the complex fields of the wrapped type.
getConfig() - Method in class de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent
Returns the current cost configuration set for GMD processing
getContent() - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexPreamble
Returns the content of this preamble.
getContent() - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexString
Returns the content String of this instance (without surrounding quotes or curly braces!).
getContent() - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexToplevelComment
Returns the content of this comment.
getConvertedStatistics() - Method in class de.hpi.fgis.dude.output.statisticoutput.CSVStatisticOutput
Converts the data of the StatisticComponent into a Map.
getCoreERAlgorithm() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.Lego
Returns the entity resolution algorithm that is used to process blocks internally
getCurrent() - Method in class de.hpi.fgis.dude.util.bibtex.parser.LookAheadReader
Returns the current character.
getData() - Method in class de.hpi.fgis.dude.algorithm.AbstractDuplicateDetection
Returns the extracted data.
getData(DataSource) - Method in class de.hpi.fgis.dude.algorithm.AbstractRecordLinkage
Returns a JsonableReader that can be used to return the extracted data of the passed DataSource.
getData() - Method in class de.hpi.fgis.dude.algorithm.AbstractRecordLinkage
Returns the DataSources and their extracted data.
getData() - Method in class de.hpi.fgis.dude.util.data.DuDeObject
Returns the data that describes the real-world object of which this instance is the representation.
getData() - Method in class de.hpi.fgis.dude.util.GoldStandard
Extracts the gold standard if necessary and returns it.
getDatabase() - Method in class de.hpi.fgis.dude.database.DatabaseSource
Returns the underlying Database.
getDatabaseDriverName() - Method in class de.hpi.fgis.dude.database.adapter.Database
Returns the Driver's name used for loading the Driver class.
getDatabaseDriverName() - Method in class de.hpi.fgis.dude.database.adapter.DB2Database
 
getDatabaseDriverName() - Method in class de.hpi.fgis.dude.database.adapter.MySQLDatabase
 
getDatabaseDriverName() - Method in class de.hpi.fgis.dude.database.adapter.OracleDatabase
 
getDatabaseDriverName() - Method in class de.hpi.fgis.dude.database.adapter.PostGreSQLDatabase
 
getDatabaseName() - Method in class de.hpi.fgis.dude.database.adapter.Database
Returns the name of the database.
getDatabaseName() - Method in class de.hpi.fgis.dude.database.util.DBInfo
Returns the name of the currently used database.
getDataFile() - Method in class de.hpi.fgis.dude.util.sorting.sorter.SortedDataFile
Returns the FileBasedStorage that is encapsulated in this instance.
getDataLine(DuDeObjectPair) - Method in class de.hpi.fgis.dude.output.CSVOutput
Generates the data that shall be printed.
getDataLine(String) - Method in class de.hpi.fgis.dude.util.csv.CSVReader
Extracts the data out of the passed String.
getDataSize(DataSource) - Method in class de.hpi.fgis.dude.algorithm.AbstractAlgorithm
 
getDataSize() - Method in class de.hpi.fgis.dude.algorithm.AbstractAlgorithm
 
getDataSize() - Method in class de.hpi.fgis.dude.algorithm.AbstractDuplicateDetection
 
getDataSize() - Method in class de.hpi.fgis.dude.algorithm.AbstractRecordLinkage
 
getDataSize() - Method in interface de.hpi.fgis.dude.algorithm.Algorithm
Returns the overall data size after the extraction process is finished.
getDataSize(DataSource) - Method in interface de.hpi.fgis.dude.algorithm.Algorithm
Returns the data size of the passed DataSource.
getDataSources() - Method in class de.hpi.fgis.dude.util.Experiment
Returns all added DataSources.
getdCostFunc() - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.util.SmithWatermanDistance
get the d(i,j) cost function.
getDefaultEncoding() - Method in class de.hpi.fgis.dude.util.GlobalConfig
Returns the default encoding used by DuDe.
getDefaultFooter() - Method in class de.hpi.fgis.dude.output.AbstractDuDeOutput
Returns the default footer of the implementation.
getDefaultFooter() - Method in class de.hpi.fgis.dude.output.JsonOutput
Returns JsonOutput.DEFAULT_FOOTER.
getDefaultHeader() - Method in class de.hpi.fgis.dude.output.AbstractDuDeOutput
Returns the default header of the implementation.
getDefaultHeader() - Method in class de.hpi.fgis.dude.output.JsonOutput
Returns JsonOutput.DEFAULT_HEADER.
getDuplicateInfo() - Method in class de.hpi.fgis.dude.util.data.DuDeObjectPair
Checks whether the current pair represents a duplicate.
getEndDate() - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Gets the date of the specified end time of an algorithm.
getEntries() - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexFile
Returns an unmodifiable view of the entries.
getEntryIterator() - Method in class de.hpi.fgis.dude.datasource.BibtexSource
Returns the entry iterator.
getEntryKey() - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexEntry
Returns the key of this entry.
getEntryType() - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexEntry
Returns the type of this entry.
getEscapeCharacter() - Method in class de.hpi.fgis.dude.output.CSVOutput
Returns the escape character.
getEscapeCharacter() - Method in class de.hpi.fgis.dude.util.csv.CSVReader
Returns the current escape character.
getEscapeCharacter() - Method in class de.hpi.fgis.dude.util.csv.CSVWriter
Returns the current escape character.
getExceptions() - Method in class de.hpi.fgis.dude.util.bibtex.expander.AbstractExpander
 
getExceptions() - Method in interface de.hpi.fgis.dude.util.bibtex.expander.Expander
 
getExceptions() - Method in class de.hpi.fgis.dude.util.bibtex.parser.BibtexParser
Returns the list of non-fatal exceptions that occurred during parsing.
getExtractedData() - Method in class de.hpi.fgis.dude.algorithm.AbstractAlgorithm
 
getExtractedData() - Method in interface de.hpi.fgis.dude.algorithm.Algorithm
 
getExtractedDataSize() - Method in class de.hpi.fgis.dude.util.GoldStandard
Returns the amount of records extracted (formerly retrieved from Algorithm-Object).
getExtractedRecordCount() - Method in class de.hpi.fgis.dude.datasource.AbstractDataSource
 
getExtractedRecordCount() - Method in interface de.hpi.fgis.dude.datasource.DataSource
Returns the number of already extracted records.
getExtractedRecordCount() - Method in class de.hpi.fgis.dude.datasource.DuDeObjectSource
 
getFalseNegatives() - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Returns the false negatives count.
getFalseNegativesByComparison() - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Returns the false negatives count that are explicitly classified by the comparator.
getFalsePositives() - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Returns the false positives count.
getFalsePositivesByComparison() - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Returns the false positives count that are explicitly classified by the comparator..
getFields() - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexEntry
Returns a read only view of the field map.
getFieldValue(String) - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexEntry
Returns the value of field specified by the passed field name.
getFieldValue(Object, String) - Static method in class de.hpi.fgis.dude.util.ReflectUtil
Dynamically retrieves the value of the specified field of an object.
getFile() - Method in class de.hpi.fgis.dude.util.data.storage.FileBasedStorage
Returns the underlying file.
getFilePath() - Method in class de.hpi.fgis.dude.datasource.CSVSource
Returns the path of the CSV file.
getFirst() - Method in class de.hpi.fgis.dude.postprocessor.WarshallTransitiveClosureGenerator.IntPair
Returns the first integer value of the integer pair.
getFirst() - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexPerson
Returns the first name of the person.
getFirstElement() - Method in class de.hpi.fgis.dude.util.Pair
Returns the first element.
getFirstElementObjectData() - Method in class de.hpi.fgis.dude.util.data.DuDeObjectPair
Returns the data of the first DuDeObject.
getFirstElementsObjectIdAttributes() - Method in class de.hpi.fgis.dude.util.GoldStandard
Returns the names of the attributes that store the object id of the pair's first element.
getFirstElementsSourceId() - Method in class de.hpi.fgis.dude.util.GoldStandard
Returns the source id of the pair's first element.
getFirstPosition() - Method in class de.hpi.fgis.dude.util.sorting.sortingkey.TextBasedSubkey
Returns the first set position or 0 if no position is set.
getFixBlockSize() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.SortedBlocks
Returns the fix block size.
getFMeasure() - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Returns the f-measure based on the current knowledge base.
getFMeasureByComparison() - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Returns the f-measure based on the current knowledge base and the actual comparisons.
getGapCost() - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.util.SmithWatermanDistance
gets the gap cost for the distance function.
getGoldStandard() - Method in class de.hpi.fgis.dude.util.Experiment
Returns the gold standard.
getHeader() - Method in class de.hpi.fgis.dude.output.CSVOutput
Returns the header.
getHost() - Method in class de.hpi.fgis.dude.database.adapter.Database
Returns the host information of the underlying database system.
getHost() - Method in class de.hpi.fgis.dude.database.util.DBInfo
Returns the host of the currently used database system.
getIdAttributes() - Method in class de.hpi.fgis.dude.datasource.AbstractDataSource
Returns the set id attributes.
getIdAttributes() - Method in class de.hpi.fgis.dude.datasource.DuDeObjectSource
 
getIdentifier() - Method in class de.hpi.fgis.dude.datasource.AbstractDataSource
 
getIdentifier() - Method in interface de.hpi.fgis.dude.datasource.DataSource
Returns the identifier of this DataSource.
getIdentifier() - Method in class de.hpi.fgis.dude.util.data.DuDeObject
Returns the first identifier of this DuDeObject.
getIdentifiers() - Method in class de.hpi.fgis.dude.util.data.DuDeObject
Returns all identifiers of this DuDeObject.
getIgnoreRegex() - Method in class de.hpi.fgis.dude.util.sorting.sortingkey.TextBasedSubkey
Returns the regular expression that specifies which character classes are ignored within the current instance.
getIncreaseFactor() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.DuplicateCountSNM
Returns the set increase factor.
getIncreaseThreshold() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.DuplicateCountSNM
Returns the threshold for increasing the window size.
getInMemoryData() - Method in class de.hpi.fgis.dude.util.sorting.sorter.AbstractDuDeObjectSorter
Returns the in-memory data.
getInMemoryObjectThreshold() - Method in class de.hpi.fgis.dude.util.GlobalConfig
Returns the maximum number of DuDeObject that will be stored in memory, if file-based processing is enabled.
getInstance() - Static method in class de.hpi.fgis.dude.util.data.json.auto.JsonTypeManager
Returns the singleton instance
getInstance() - Static method in class de.hpi.fgis.dude.util.DataSourceIdentifierManager
Returns the Singleton instance of this class.
getInstance() - Static method in class de.hpi.fgis.dude.util.GlobalConfig
Returns the Singleton instance of GlobalConfig.
getInstance() - Static method in class de.hpi.fgis.dude.util.MemoryChecker
Singleton method for returning the Singleton instance.
getInverseDocumentFrequency(String) - Method in class de.hpi.fgis.dude.preprocessor.DocumentFrequencyPreprocessor
Retrieves the inverse document frequency of the passed term.
getJDBCString() - Method in class de.hpi.fgis.dude.database.adapter.Database
Returns the JDBC String which can be used for representing a connection to this database.
getJDBCString() - Method in class de.hpi.fgis.dude.database.adapter.DB2Database
 
getJDBCString() - Method in class de.hpi.fgis.dude.database.adapter.MySQLDatabase
 
getJDBCString() - Method in class de.hpi.fgis.dude.database.adapter.OracleDatabase
 
getJDBCString() - Method in class de.hpi.fgis.dude.database.adapter.PostGreSQLDatabase
 
getJDBCString() - Method in class de.hpi.fgis.dude.database.DatabaseSource
Returns the JDBC String of the underlying database.
getJsonValue(String) - Method in class de.hpi.fgis.dude.util.data.json.JsonArray
Returns the first occurrence of the passed attribute (using a recursive look-up).
getJsonValue(String, Iterator<String>) - Method in class de.hpi.fgis.dude.util.data.json.JsonArray
Iterates over the elements and looks for a JsonRecord that contains currentAttributeName.
getJsonValue(String...) - Method in class de.hpi.fgis.dude.util.data.json.JsonRecord
Returns the JsonValue specified by the passed path or null if the specified attribute does not exist.
getJsonValue(String, Iterator<String>) - Method in class de.hpi.fgis.dude.util.data.json.JsonRecord
Returns the value that corresponds to the passed path.
getKey() - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexMacroDefinition
Returns the key of the macro definition.
getKey() - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexMacroReference
Returns the key of this reference.
getKeyDistance(DuDeObject, DuDeObject) - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.AdaptiveSNM_Yan2007.YanIterator
Calculates the sorting key distance of two DuDeObjects.
getKeyString(DuDeObject) - Method in class de.hpi.fgis.dude.util.sorting.sortingkey.SortingKey
Returns the sorting key value of the passed DuDeObject.
getKeyString(DuDeObject, String) - Method in class de.hpi.fgis.dude.util.sorting.sortingkey.SortingKey
Returns the sorting key value of the passed DuDeObject.
getKeyValue(DuDeObject) - Method in class de.hpi.fgis.dude.util.sorting.sortingkey.SortingKey
Returns the sorting key value of the passed DuDeObject.
getLabel() - Method in enum de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent.Config
Returns the label of the configuration for the output components
getLabels() - Method in class de.hpi.fgis.dude.output.statisticoutput.AbstractStatisticOutput
 
getLabels() - Method in interface de.hpi.fgis.dude.output.statisticoutput.StatisticOutput
Returns the labels for the measurements.
getLast() - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexPerson
Returns the last name of the person.
getLastValidationState() - Method in class de.hpi.fgis.dude.similarityfunction.AbstractSimilarityFunction
 
getLastValidationState() - Method in interface de.hpi.fgis.dude.similarityfunction.SimilarityFunction
Returns the validation state of the last SimilarityFunction.getSimilarity(DuDeObjectPair) call.
getLastValidationState() - Method in class de.hpi.fgis.dude.similarityfunction.structurebased.ConstantSimilarityFunction
Since ConstantSimilarityFunction is not based on actual values, it returns SimilarityValidationState.BothValid for each calculated pair.
getLeft() - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexConcatenatedValue
Returns the left value of this concatenation.
getLine() - Method in class de.hpi.fgis.dude.util.bibtex.parser.LookAheadReader
Returns the current line.
getLineage() - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexPerson
Returns the lineage of this person.
getLineage() - Method in class de.hpi.fgis.dude.util.data.DuDeObjectPair
Returns the lineage value of the current pair.
getList() - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexPersonList
Returns a read-only list whose members are instances of BibtexPerson.
getLongDescriptionString() - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.util.MongeElkanSimilarity
returns the long string identifier for the metric.
getLongDescriptionString() - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.util.SmithWatermanDistance
returns the long string identifier for the metric.
getLowerThreshold() - Method in class de.hpi.fgis.dude.util.Experiment
Gets the lower threshold for this experiment.
getMaxAllowedVariation(double, double) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.impl.AbsoluteNumberDiffFunction
Just returns the AbsoluteNumberDiffFunction.maxAbsoluteVariation set in constructor.
getMaxAllowedVariation(double, double) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.impl.RelativeNumberDiffFunction
Gets the maximum allowed variation based on the RelativeNumberDiffFunction.maxToleranceFactor.
getMaxBlockSize() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.SortedBlocks
Returns the maximum block size.
getMaximumMemory() - Method in class de.hpi.fgis.dude.util.MemoryChecker
Returns the maximum amount of memory that can be used.
getMaximumMemoryUsage() - Method in class de.hpi.fgis.dude.util.GlobalConfig
Returns the maximum relative memory used during the sorting phase.
getMaximumMemoryUsage() - Method in class de.hpi.fgis.dude.util.MemoryChecker
Returns the maximum memory usage in percent.
getMaximumMemoryUsed() - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Gets the registered maximum amount of memory during the experiment.
getMaximumPairCount() - Method in class de.hpi.fgis.dude.algorithm.AbstractDuplicateDetection
 
getMaximumPairCount() - Method in class de.hpi.fgis.dude.algorithm.AbstractRecordLinkage
 
getMaximumPairCount() - Method in interface de.hpi.fgis.dude.algorithm.Algorithm
Returns the number of pairs, that would be generated by the naive algorithm of the current instance's algorithm type based on the extracted data size.
getMaxMemoryUsed() - Method in class de.hpi.fgis.dude.util.MemoryCheckerTask
Gets the maximum amount of memory used by the current application.
getMemoryCheckFrequency() - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Gets the frequency of memory checks.
getMergeCount() - Method in class de.hpi.fgis.dude.util.data.DuDeObject
Returns the number of objects this DuDeObject was merged from.
getMerger() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.Lego
Returns the merger that merges several DuDeObjects into one
getMinimumMemoryUsed() - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Gets the registered minimum amount of memory during the experiment.
getMinMemoryUsed() - Method in class de.hpi.fgis.dude.util.MemoryCheckerTask
Gets the minimum amount of memory used by the current application.
getName() - Method in class de.hpi.fgis.dude.database.util.ColumnInfo
Returns the column's name.
getNextBlock() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.AdaptiveSNM_Yan2007.YanIterator
Calculates the elements of the next block.
getNextRecord() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.AdaptiveSNM_Yan2007.YanIterator
Returns the next object from the record queue.
getNextTemporaryFilename() - Method in class de.hpi.fgis.dude.util.sorting.sorter.TwoPhaseMultiWayMergeSorter
Returns the name of the next temporary DuDeObjectFile.
getNextValidFilename(String) - Method in class de.hpi.fgis.dude.util.FilenameManager
Returns a valid alternative for the passed filename.
getNotification() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.DuplicateCountSNM
Returns the category that was set for the last processed pair.
getNotification() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.Lego
Returns the category that was set for the last processed pair.
getNrCharForBlocking() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.NaiveBlockingAlgorithm
Returns the number of characters of the sorting key that are used as blocking criterion.
getNumBaseRecords() - Method in enum de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent.Config
Getter for total number of base records, needed for computation of VI.
getNumberAssignedRecords() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.AdaptiveSNM_Yan2007
Returns the sum of records that are already assigned to a block.
getNumberCreatedBlocks() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.AdaptiveSNM_Yan2007
Returns the number of created blocks
getNumberOfCandidateComparisons() - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Returns the maximum number of pairs that would be generated by the naive approach.
getNumberOfRealDuplicates() - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Returns the size of the gold standard.
getNumRecordsOfBlock() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.AdaptiveSNM_Yan2007.AA_SNM_Iterator
 
getNumRecordsOfBlock() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.AdaptiveSNM_Yan2007.IA_SNM_Iterator
 
getNumRecordsOfBlock() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.AdaptiveSNM_Yan2007.YanIterator
Calculates the number of records within the next block.
getObjectCount() - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Returns the number of records that were processed by the algorithm.
getObjectCount() - Method in class de.hpi.fgis.dude.preprocessor.CountPreprocessor
Returns the number of objects that were extracted during the data extraction phase.
getObjectId() - Method in class de.hpi.fgis.dude.util.data.DuDeObject
Returns the object identifier of this object.
getObjectId() - Method in class de.hpi.fgis.dude.util.data.DuDeObjectId
Returns the object identifier.
getOptionalEntries() - Method in class de.hpi.fgis.dude.output.statisticoutput.AbstractStatisticOutput
Returns all extension columns' label and value.
getOverlapSize() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.SortedBlocks
Returns the current overlap size.
getOwnerFile() - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexNode
Returns the owner file of this node.
getPairCount() - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Returns the number of pairs that were already considered.
getParameterizedType() - Method in class de.hpi.fgis.dude.util.BoundType
Returns the wrapped parameterized type or null if this BoundType was not created around a ParameterizedType.
getParameters() - Method in class de.hpi.fgis.dude.util.BoundType
Returns the bound types or an empty array if none exists.
getPassword() - Method in class de.hpi.fgis.dude.database.util.DBInfo
Returns the password which is used for establishing the current database connection.
getPort() - Method in class de.hpi.fgis.dude.database.adapter.Database
Returns the port of the underlying database system.
getPort() - Method in class de.hpi.fgis.dude.database.util.DBInfo
Returns the port of the currently used database system.
getPositions() - Method in class de.hpi.fgis.dude.util.sorting.sortingkey.TextBasedSubkey
Returns an iterable instance that stores all specified character positions.
getPrecision() - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Returns the precision based on the current knowledge base.
getPrecisionByComparison() - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Returns the precision based on the current knowledge base and the actual comparisons.
getPreLast() - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexPerson
Returns the middle name or any middle initials of this persons.
getPrimitive(Class<T>) - Method in class de.hpi.fgis.dude.util.data.json.auto.JsonTypeManager
Returns the primitive for type T.
getPrimitiveFields() - Method in class de.hpi.fgis.dude.util.data.json.auto.CompositeJsonSerialization
Returns the primitive fields of the wrapped type.
getProperty(String) - Method in class de.hpi.fgis.dude.util.OrderedPair
Returns the value of the passed property.
getQuery() - Method in class de.hpi.fgis.dude.database.DatabaseSource
Returns the complete query that is used for querying the result.
getQuoteCharacter() - Method in class de.hpi.fgis.dude.datasource.CSVSource
Returns the set quote character.
getQuoteCharacter() - Method in class de.hpi.fgis.dude.output.CSVOutput
Returns the quote character.
getQuoteCharacter() - Method in class de.hpi.fgis.dude.util.csv.CSVReader
Returns the current quote character.
getQuoteCharacter() - Method in class de.hpi.fgis.dude.util.csv.CSVWriter
Returns the current quote character.
getRawType() - Method in class de.hpi.fgis.dude.util.data.json.auto.AutoJsonSerialization
Returns the raw type for which this serialization class was created.
getReader() - Method in class de.hpi.fgis.dude.util.data.storage.FileBasedStorage
 
getReader() - Method in class de.hpi.fgis.dude.util.data.storage.InMemoryStorage
 
getReader() - Method in class de.hpi.fgis.dude.util.data.storage.InputStreamReadable
 
getReader() - Method in interface de.hpi.fgis.dude.util.data.storage.JsonReadable
Returns the JsonableReader that can be used to access the content of this DuDeStorage.
getRecall() - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Returns the recall based on the current knowledge base.
getRecallByComparison() - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Returns the recall based on the current knowledge base and the actual comparisons.
getReductionRatio() - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Returns the reduction ratio based on the current knowledge base.
getReductionRatioByComparison() - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Returns the reduction ratio based on the current knowledge base and the actual comparisons.
getReference() - Method in class de.hpi.fgis.dude.util.data.DuDeObjectPair
Returns a DuDeObjectPair that refers to the current pair.
getRight() - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexConcatenatedValue
Returns the right value of this concatenation.
getRootElementTag() - Method in class de.hpi.fgis.dude.datasource.XMLSource
Returns the set root element or null, if no root was set.
getRuntime() - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Gets the time difference between beginning time and finishing time.
getScalarValue(String) - Static method in class de.hpi.fgis.dude.util.data.json.DuDeJsonParser
Converts the passed String into the corresponding atomic JsonValue.
getSecond() - Method in class de.hpi.fgis.dude.postprocessor.WarshallTransitiveClosureGenerator.IntPair
Returns the second integer value of the integer pair.
getSecondElement() - Method in class de.hpi.fgis.dude.util.Pair
Returns the second element.
getSecondElementObjectData() - Method in class de.hpi.fgis.dude.util.data.DuDeObjectPair
Returns the data of the second DuDeObject.
getSecondElementsObjectIdAttributes() - Method in class de.hpi.fgis.dude.util.GoldStandard
Returns the names of the attributes that store the object id of the pair's second element.
getSecondElementsSourceId() - Method in class de.hpi.fgis.dude.util.GoldStandard
Returns the source id of the pair's second element.
getSeparator() - Method in class de.hpi.fgis.dude.output.CSVOutput
Returns the separator character.
getSeparator() - Method in class de.hpi.fgis.dude.util.csv.CSVReader
Returns the current separator character.
getSeparator() - Method in class de.hpi.fgis.dude.util.csv.CSVWriter
Returns the current separator character.
getSeparatorCharacter() - Method in class de.hpi.fgis.dude.datasource.CSVSource
Returns the set separator character.
getShortDescriptionString() - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.util.MongeElkanSimilarity
returns the string identifier for the metric.
getShortDescriptionString() - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.util.SmithWatermanDistance
returns the string identifier for the metric .
getSimilarity(DuDeObjectPair) - Method in class de.hpi.fgis.dude.similarityfunction.AbstractSimilarityFunction
 
getSimilarity(String, String) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.impl.EquationSimilarityFunction
 
getSimilarity(String, String) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.SimmetricsFunction
 
getSimilarity(String, String) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.impl.SoundExFunction
 
getSimilarity(String, String) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.impl.TFIDFSimilarityFunction
 
getSimilarity(String, String) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.util.MongeElkanSimilarity
gets the similarity of the two strings using Monge Elkan.
getSimilarity(String, String) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.util.SmithWatermanDistance
gets the similarity of the two strings using Needleman Wunch distance.
getSimilarity(DuDeObjectPair) - Method in interface de.hpi.fgis.dude.similarityfunction.SimilarityFunction
Calculates the similarity of passed DuDeObjectPair's members.
getSimilarity(String, String) - Method in interface de.hpi.fgis.dude.similarityfunction.StringSimilarity
Returns the similarity of the passed Strings, where 0.0 means that Strings are completely different, and 1.0 indicates that the passed Strings are the same.
getSimilarity(DuDeObjectPair) - Method in class de.hpi.fgis.dude.similarityfunction.structurebased.ConstantSimilarityFunction
 
getSimilarity() - Method in class de.hpi.fgis.dude.util.data.DuDeObjectPair
Returns the similarity of the DuDeObjectPair or DuDeObjectPair.NO_SIMILARITY_SET_VALUE, if the similarity wasn't set.
getSimilarityExplained(String, String) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.util.MongeElkanSimilarity
gets a div class xhtml similarity explaining the operation of the metric.
getSimilarityExplained(String, String) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.util.SmithWatermanDistance
gets a div class xhtml similarity explaining the operation of the metric.
getSimilarityFunction() - Method in class de.hpi.fgis.dude.util.Experiment
Returns the SimilarityFunction.
getSimilarityFunctionForClassAsString(String) - Static method in class de.hpi.fgis.dude.similarityfunction.domainspecific.address.misc.FunctionSelector
returns the class object of the similarity function represented by the provided string
getSimilarityTimingEstimated(String, String) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.util.MongeElkanSimilarity
gets the estimated time in milliseconds it takes to perform a similarity timing.
getSimilarityTimingEstimated(String, String) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.util.SmithWatermanDistance
gets the estimated time in milliseconds it takes to perform a similarity timing.
getSize() - Method in class de.hpi.fgis.dude.postprocessor.WarshallTransitiveClosureGenerator.AdjacencyList
 
getSize() - Method in class de.hpi.fgis.dude.postprocessor.WarshallTransitiveClosureGenerator.AdjacencyMatrix
 
getSize() - Method in class de.hpi.fgis.dude.postprocessor.WarshallTransitiveClosureGenerator.GraphRepresentation
Returns the number of elements in the matrix.
getSortedCollection() - Method in class de.hpi.fgis.dude.util.sorting.sorter.AbstractDuDeObjectSorter
Returns the sorted data.
getSortedCollection() - Method in interface de.hpi.fgis.dude.util.sorting.sorter.DuDeObjectSorter
Returns the sorted data.
getSortedCollection() - Method in class de.hpi.fgis.dude.util.sorting.sorter.InMemorySorter
 
getSortedCollection() - Method in class de.hpi.fgis.dude.util.sorting.sorter.TwoPhaseMultiWayMergeSorter
 
getSortedDataFilename() - Method in class de.hpi.fgis.dude.util.sorting.sorter.TwoPhaseMultiWayMergeSorter
Returns the name of the DuDeObjectFile containing the sorted data.
getSortingKey() - Method in class de.hpi.fgis.dude.algorithm.SortingDuplicateDetection
Returns the set SortingKey.
getSortingKey() - Method in class de.hpi.fgis.dude.util.sorting.sorter.AbstractDuDeObjectSorter
Returns the SortingKey that defines the sorting order.
getSortingKeyComparisons() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.AdaptiveSNM_Yan2007
Returns the number of distance comparisons of two sorting key values.
getSortingKeys() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.Lego
Return the blocking criteria
getSoundEx(String) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.util.SoundEx
Generates the SoundEx value of the passed String.
getSourceId() - Method in class de.hpi.fgis.dude.util.data.DuDeObject
Returns the source identifier of this object.
getSourceId() - Method in class de.hpi.fgis.dude.util.data.DuDeObjectId
Returns the source identifier.
getSplitToken() - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.impl.TFIDFSimilarityFunction
Returns the split token.
getSQLSchema() - Method in class de.hpi.fgis.dude.database.adapter.Database
Returns the schema, which is used by this database connection.
getSQLSchema() - Method in class de.hpi.fgis.dude.database.util.DBInfo
Returns the sqlSchema which is used in the current database connection.
getSQLType() - Method in class de.hpi.fgis.dude.database.util.ColumnInfo
Returns the Types datatype of the column.
getStartDate() - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Gets the date of the specified start time of an algorithm.
getStaticBoundTypes(ParameterizedType) - Static method in class de.hpi.fgis.dude.util.ReflectUtil
Returns the static bounds for the given type.
getStaticBoundTypes(Class<?>) - Static method in class de.hpi.fgis.dude.util.ReflectUtil
Returns the static bounds for the given type.
getStaticBoundTypes(Field) - Static method in class de.hpi.fgis.dude.util.ReflectUtil
Returns the static bounds for the given field.
getStatisticOutputs() - Method in class de.hpi.fgis.dude.util.Experiment
Returns the added StatisticOutputs.
getStatistics() - Method in class de.hpi.fgis.dude.output.statisticoutput.AbstractStatisticOutput
 
getStatistics() - Method in interface de.hpi.fgis.dude.output.statisticoutput.StatisticOutput
Returns the current statistic component that is used by the output.
getString(Object) - Static method in class de.hpi.fgis.dude.output.CSVOutput
Returns the String representation of the object or null, if null was passed.
getStringValue() - Method in interface de.hpi.fgis.dude.util.data.json.JsonAtomic
Returns the actual value converted into a String.
getStringValue() - Method in class de.hpi.fgis.dude.util.data.json.JsonBoolean
 
getStringValue() - Method in class de.hpi.fgis.dude.util.data.json.JsonNull
 
getStringValue() - Method in class de.hpi.fgis.dude.util.data.json.JsonNumber
 
getStringValue() - Method in class de.hpi.fgis.dude.util.data.json.JsonString
 
getSubkeyValue(DuDeObject) - Method in class de.hpi.fgis.dude.util.sorting.sortingkey.AbstractSubkey
 
getSubkeyValue(DuDeObject) - Method in interface de.hpi.fgis.dude.util.sorting.sortingkey.Subkey
Returns a JsonArray that collects all relevant values for the subkey of the passed DuDeObject.
getSuperTypeInfo() - Method in class de.hpi.fgis.dude.util.data.json.auto.CompositeJsonSerialization
Returns the super type json serialization.
getTableName() - Method in class de.hpi.fgis.dude.database.DatabaseSource
Returns the table name.
getThreshold() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.AdaptiveSNM_Yan2007
Returns the sorting key.
getThreshold() - Method in class de.hpi.fgis.dude.util.Experiment
Gets the threshold for this experiment.
getTransitiveClosures() - Method in class de.hpi.fgis.dude.postprocessor.NaiveTransitiveClosureGenerator
Returns the transitive closures as a 2-dimensional collection.
getTrueNegatives() - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Returns the true negatives count.
getTrueNegativesByComparison() - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Returns the true negatives count that are explicitly classified by the comparator.
getTruePositives() - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Returns the true positives count.
getTruePositivesByComparison() - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Returns the true positives count that are explicitly classified by the comparator..
getType() - Method in class de.hpi.fgis.dude.util.BoundType
Returns the raw type.
getType() - Method in class de.hpi.fgis.dude.util.data.json.auto.AutoJsonSerialization
Returns the BoundType for which this serialization class was created.
getType() - Method in class de.hpi.fgis.dude.util.data.json.auto.Primitive
Returns the type of the primitive
getType() - Method in class de.hpi.fgis.dude.util.data.json.JsonArray
Returns JsonType.Array.
getType() - Method in class de.hpi.fgis.dude.util.data.json.JsonBoolean
Returns JsonType.Boolean.
getType() - Method in class de.hpi.fgis.dude.util.data.json.JsonNull
Returns JsonType.Null.
getType() - Method in class de.hpi.fgis.dude.util.data.json.JsonNumber
Returns JsonType.Number.
getType() - Method in class de.hpi.fgis.dude.util.data.json.JsonRecord
Returns JsonType.Record.
getType() - Method in class de.hpi.fgis.dude.util.data.json.JsonString
Returns JsonType.String.
getType() - Method in interface de.hpi.fgis.dude.util.data.json.JsonValue
Returns the type of the current instance.
getTypeInfo(BoundType) - Method in class de.hpi.fgis.dude.util.data.json.auto.JsonTypeManager
Returns the AutoJsonSerialization for the given BoundType.
getTypeInfo(Class<T>) - Method in class de.hpi.fgis.dude.util.data.json.auto.JsonTypeManager
Returns the AutoJsonSerialization for the given class.
getTypeInfo(Class<T>, Type) - Method in class de.hpi.fgis.dude.util.data.json.auto.JsonTypeManager
Returns the AutoJsonSerialization for the given class and the type parameters of the declarations if existent.
getTypeInfo(ParameterizedType) - Method in class de.hpi.fgis.dude.util.data.json.auto.JsonTypeManager
Returns the AutoJsonSerialization for the given type.
getTypeInfo(Type) - Method in class de.hpi.fgis.dude.util.data.json.auto.JsonTypeManager
Returns the AutoJsonSerialization for the given type.
getUnNormalisedSimilarity(String, String) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.util.MongeElkanSimilarity
gets the un-normalised similarity measure of the metric for the given strings.
getUnNormalisedSimilarity(String, String) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.util.SmithWatermanDistance
implements the Smith-Waterman distance function //see http://www.gen.tcd.ie/molevol/nwswat.html for details .
getUpperThreshold() - Method in class de.hpi.fgis.dude.util.Experiment
Gets the thresholds for this experiment.
getUsedVMMemory() - Static method in class de.hpi.fgis.dude.util.MemoryChecker
Returns the amount of already used memory.
getUser() - Method in class de.hpi.fgis.dude.database.util.DBInfo
Returns the user name which is used for establishing the current database connection.
getValue(String, String) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.util.LevenshteinDistance
Returns the Levenshtein Distance of the passed Strings.
getValue() - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexMacroDefinition
Returns the value of the macro definition.
getValue() - Method in class de.hpi.fgis.dude.util.data.json.JsonBoolean
Returns the actual value of this JsonBoolean.
getValue() - Method in class de.hpi.fgis.dude.util.data.json.JsonNumber
Returns the actual value.
getWindowSize() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.DuplicateCountSNM
Returns the current window size.
getWindowSize() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.SortedNeighborhoodMethod
Returns the window size of this instance.
getWorkingDirectory() - Method in class de.hpi.fgis.dude.util.GlobalConfig
Returns the working directory path.
getWriter() - Method in class de.hpi.fgis.dude.util.data.storage.FileBasedStorage
 
getWriter() - Method in class de.hpi.fgis.dude.util.data.storage.InMemoryStorage
 
getWriter() - Method in interface de.hpi.fgis.dude.util.data.storage.JsonWritable
Returns the JsonableWriter that can be used to add instances to this DuDeStorage.
GivenNameSimilarityFunction - Class in de.hpi.fgis.dude.similarityfunction.domainspecific.address
GivenNameSimilarityFunction compares two strings and treats them as given names, allowing for some special normalization and comparison techniques.
GivenNameSimilarityFunction(String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.domainspecific.address.GivenNameSimilarityFunction
 
GlobalConfig - Class in de.hpi.fgis.dude.util
GlobalConfig manages the configuration parameters of DuDe.
GMDEvaluationExec_CORA - Class in de.hpi.fgis.dude.exec.duplicatedetection
GMDEvaluationExec_CORA is an example experiment for the usage of the ExtendedStatisticComponent.
GMDEvaluationExec_CORA() - Constructor for class de.hpi.fgis.dude.exec.duplicatedetection.GMDEvaluationExec_CORA
 
GMDEvaluationExec_Restaurant - Class in de.hpi.fgis.dude.exec.duplicatedetection
GMDEvaluationExec_Restaurant is an example experiment for the usage of the ExtendedStatisticComponent.
GMDEvaluationExec_Restaurant() - Constructor for class de.hpi.fgis.dude.exec.duplicatedetection.GMDEvaluationExec_Restaurant
 
goldStandard - Variable in class de.hpi.fgis.dude.postprocessor.StatisticComponent
 
GoldStandard - Class in de.hpi.fgis.dude.util
GoldStandard implements the functionality for extracting the gold standard out of a given DataSource.
GoldStandard() - Constructor for class de.hpi.fgis.dude.util.GoldStandard
Internal constructor for Jsonable deserialization.
GoldStandard(DataSource) - Constructor for class de.hpi.fgis.dude.util.GoldStandard
Initializes the GoldStandard with the passed DataSource.
GoldStandard(DataSource, String) - Constructor for class de.hpi.fgis.dude.util.GoldStandard
Initializes the GoldStandard with the passed DataSource and the filename of the gold standard in cluster format to read it in.
GoldStandard(String) - Constructor for class de.hpi.fgis.dude.util.GoldStandard
Initializes the GoldStandard with the passed filename of the gold standard in cluster format to read it in.
goldStandardSet() - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Checks whether a gold standard was passed.
goldStandardSet() - Method in class de.hpi.fgis.dude.util.Experiment
Checks whether a GoldStandard was set.
GSwoosh - Class in de.hpi.fgis.dude.algorithm.duplicatedetection
GSwoosh implements the GSwoosh duplicate detection (and merging) algorithm as described in the paper Swoosh: a generic approach for entity resolution.
GSwoosh() - Constructor for class de.hpi.fgis.dude.algorithm.duplicatedetection.GSwoosh
Initializes the GSwoosh algorithm with the DefaultMerger.
GSwoosh(Merger) - Constructor for class de.hpi.fgis.dude.algorithm.duplicatedetection.GSwoosh
Initializes the GSwoosh algorithm with the passed Merger.
GSwoosh.ComparisonResult - Enum in de.hpi.fgis.dude.algorithm.duplicatedetection
 
GSwooshExec - Class in de.hpi.fgis.dude.exec.duplicatedetection
This execution class runs the GSwoosh duplicate detection algorithm on the Restaurant data source.
GSwooshExec() - Constructor for class de.hpi.fgis.dude.exec.duplicatedetection.GSwooshExec
 

H

HarmonicMean - Class in de.hpi.fgis.dude.similarityfunction.aggregators
HarmonicMean returns the harmonic mean of the added SimilarityFunctions.
HarmonicMean() - Constructor for class de.hpi.fgis.dude.similarityfunction.aggregators.HarmonicMean
Internal constructor for Jsonable deserialization.
HarmonicMean(SimilarityFunction...) - Constructor for class de.hpi.fgis.dude.similarityfunction.aggregators.HarmonicMean
Initializes a HarmonicMean instance.
hasCurrentToken() - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonParser
Checks whether a token is loaded.
hasData() - Method in class de.hpi.fgis.dude.util.data.DuDeObject
Checks whether the current DuDeObject contains real data.
hasDuplicateInfo() - Method in class de.hpi.fgis.dude.util.data.DuDeObjectPair
Checks whether the is-duplicate property is set.
hasFunction(Object, String, Object...) - Static method in class de.hpi.fgis.dude.util.ReflectUtil
Checks dynamically whether the object has the specified function, which takes the given parameters.
hasGMD() - Method in class de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent
Checks whether this StatisticComponent calculates the Generalized Merge Distance.
hasGMD() - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Checks whether this StatisticComponent calculates the Generalized Merge Distance.
hashCode() - Method in class de.hpi.fgis.dude.algorithm.AbstractAlgorithm
 
hashCode() - Method in class de.hpi.fgis.dude.algorithm.AbstractDuplicateDetection
 
hashCode() - Method in class de.hpi.fgis.dude.algorithm.AbstractRecordLinkage
 
hashCode() - Method in class de.hpi.fgis.dude.database.util.ColumnInfo
 
hashCode() - Method in class de.hpi.fgis.dude.datasource.AbstractDataSource
 
hashCode() - Method in class de.hpi.fgis.dude.output.statisticoutput.SimpleStatisticOutput
 
hashCode() - Method in class de.hpi.fgis.dude.postprocessor.WarshallTransitiveClosureGenerator.IntPair
 
hashCode() - Method in class de.hpi.fgis.dude.similarityfunction.aggregators.Aggregator
 
hashCode() - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.ContentBasedSimilarityFunction
 
hashCode() - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.impl.AbsoluteNumberDiffFunction
 
hashCode() - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.impl.RelativeNumberDiffFunction
 
hashCode() - Method in class de.hpi.fgis.dude.util.BoundType
 
hashCode() - Method in class de.hpi.fgis.dude.util.data.DuDeObject
 
hashCode() - Method in class de.hpi.fgis.dude.util.data.DuDeObjectId
 
hashCode() - Method in class de.hpi.fgis.dude.util.data.DuDeObjectPair
 
hashCode() - Method in class de.hpi.fgis.dude.util.data.json.JsonBoolean
 
hashCode() - Method in class de.hpi.fgis.dude.util.data.json.JsonNull
 
hashCode() - Method in class de.hpi.fgis.dude.util.data.json.JsonNumber
 
hashCode() - Method in class de.hpi.fgis.dude.util.data.json.JsonString
 
hashCode() - Method in class de.hpi.fgis.dude.util.Experiment
 
hashCode() - Method in class de.hpi.fgis.dude.util.GoldStandard
 
hashCode() - Method in class de.hpi.fgis.dude.util.Pair
 
hashCode() - Method in class de.hpi.fgis.dude.util.sorting.sortingkey.AbstractSubkey
 
hashCode() - Method in class de.hpi.fgis.dude.util.sorting.sortingkey.SortingKey
 
hashCode() - Method in class de.hpi.fgis.dude.util.sorting.sortingkey.TextBasedSubkey
 
hasLineage() - Method in class de.hpi.fgis.dude.util.data.DuDeObjectPair
Checks whether a lineage is specified.
hasNext() - Method in class de.hpi.fgis.dude.algorithm.AbstractAlgorithm.AlgorithmIteratorWrapper
 
hasNext() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.DuplicateCountSNM.AdaptiveWindowSizeSNMIterator
 
hasNext() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.Lego.LegoIterator
hasNext() - Method in class de.hpi.fgis.dude.util.AbstractIterator
 
hasNext() - Method in class de.hpi.fgis.dude.util.bibtex.expander.BibtexPersonListParser.StringIterator
 
hasNext() - Method in class de.hpi.fgis.dude.util.sorting.sorter.SortedDataFile
Checks whether the end of the file is reached.
hasProperty(String) - Method in class de.hpi.fgis.dude.util.OrderedPair
Checks whether the passed property is set.
hasSimilarity() - Method in class de.hpi.fgis.dude.util.data.DuDeObjectPair
Checks whether the similarity is set.
headerIsEnabled() - Method in class de.hpi.fgis.dude.output.CSVOutput
Checks whether the header shall be written.
HonorificSimilarityFunction - Class in de.hpi.fgis.dude.similarityfunction.domainspecific.address
HonorificSimilarityFunction compares two strings and treats them as honorifics, allowing for some special normalization and comparison techniques.
HonorificSimilarityFunction(String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.domainspecific.address.HonorificSimilarityFunction
 
HouseNumberSimilarityFunction - Class in de.hpi.fgis.dude.similarityfunction.domainspecific.address
HouseNumberSimilarityFunction compares two strings and treats them as ZIP codes, allowing for some special normalization and comparison techniques.
HouseNumberSimilarityFunction(String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.domainspecific.address.HouseNumberSimilarityFunction
 
HugePersonDataTestExecClass - Class in de.hpi.fgis.dude.exec.duplicatedetection
Executes a run using a large data file with the SortedNeighborhoodMethod.
HugePersonDataTestExecClass() - Constructor for class de.hpi.fgis.dude.exec.duplicatedetection.HugePersonDataTestExecClass
 

I

IdentifierIsAlreadyInUseException - Exception in de.hpi.fgis.dude.exceptions
This exception is thrown by runtime if an identifier is used more than one time (e.g.
IdentifierIsAlreadyInUseException() - Constructor for exception de.hpi.fgis.dude.exceptions.IdentifierIsAlreadyInUseException
Calls RuntimeException.RuntimeException()
IdentifierIsAlreadyInUseException(String) - Constructor for exception de.hpi.fgis.dude.exceptions.IdentifierIsAlreadyInUseException
Calls RuntimeException.RuntimeException(String)
IdentifierIsAlreadyInUseException(Throwable) - Constructor for exception de.hpi.fgis.dude.exceptions.IdentifierIsAlreadyInUseException
Calls RuntimeException.RuntimeException(Throwable)
IdentifierIsAlreadyInUseException(String, Throwable) - Constructor for exception de.hpi.fgis.dude.exceptions.IdentifierIsAlreadyInUseException
Calls RuntimeException.RuntimeException(String, Throwable)
ignoreCapitalization() - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.ContentBasedSimilarityFunction
Enables ignoring capitalization.
IgnoreStrategy<T1 extends JsonValue,T2 extends JsonValue> - Class in de.hpi.fgis.dude.similarityfunction.contentbased.calculationstrategy
IgnoreStrategy ignores the actual values and returns always the same default similarity.
IgnoreStrategy() - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.calculationstrategy.IgnoreStrategy
Initializes a IgnoreStrategy that returns a default similarity of 0.0.
IgnoreStrategy(double) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.calculationstrategy.IgnoreStrategy
Initializes a IgnoreStrategy that returns the passed default similarity.
ignoringCapitalizationEnabled() - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.ContentBasedSimilarityFunction
Checks whether this ContentBasedSimilarityFunction shall make a distinction between lower case and upper case or not.
increaseFactor(float) - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.DuplicateCountSNM.AdaptiveWindowSizeSNMBuilder
Sets the factor by which the window size will be multiplied.
increaseThreshold(float) - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.DuplicateCountSNM.AdaptiveWindowSizeSNMBuilder
Sets the threshold for increasing the window size.
initializeAlgorithm() - Method in class de.hpi.fgis.dude.util.Experiment
Initializes the algorithm instance.
inMemoryProcessingEnabled() - Method in class de.hpi.fgis.dude.algorithm.AbstractAlgorithm
 
inMemoryProcessingEnabled() - Method in interface de.hpi.fgis.dude.algorithm.Algorithm
Checks, whether in-memory processing is enabled.
inMemoryProcessingEnabled() - Method in class de.hpi.fgis.dude.util.Experiment
Checks whether in-memory processing is enabled.
InMemorySorter - Class in de.hpi.fgis.dude.util.sorting.sorter
InMemorySorter implements a in-memory sort.
InMemorySorter() - Constructor for class de.hpi.fgis.dude.util.sorting.sorter.InMemorySorter
Initializes a InMemorySorter with no SortingKey.
InMemorySorter(SortingKey) - Constructor for class de.hpi.fgis.dude.util.sorting.sorter.InMemorySorter
Initializes a InMemorySorter with the passed SortingKey.
InMemoryStorage<T extends Jsonable> - Class in de.hpi.fgis.dude.util.data.storage
InMemoryStorage stores Jsonable instances in memory.
InMemoryStorage() - Constructor for class de.hpi.fgis.dude.util.data.storage.InMemoryStorage
Initializes a InMemoryStorage instance.
InMemoryStorage(Collection<T>) - Constructor for class de.hpi.fgis.dude.util.data.storage.InMemoryStorage
Initializes a InMemoryStorage instance with the passed initial content.
InputStreamReadable<T extends Jsonable> - Class in de.hpi.fgis.dude.util.data.storage
InputStreamReadable can be used to read Json data from any InputStream.
InputStreamReadable(InputStream) - Constructor for class de.hpi.fgis.dude.util.data.storage.InputStreamReadable
Initializes an InputStreamReadable with no type information.
InputStreamReadable(Class<T>, InputStream) - Constructor for class de.hpi.fgis.dude.util.data.storage.InputStreamReadable
Initializes an InputStreamReadable with the passed type information.
internalType - Variable in class de.hpi.fgis.dude.util.data.json.DuDeJsonParser
is needed for creating new instances (@see Jsonable#fromJson(DuDeJsonParser))
InvalidJsonException - Exception in de.hpi.fgis.dude.exceptions
This exception should be raised if an Json String could not be converted into an object.
InvalidJsonException() - Constructor for exception de.hpi.fgis.dude.exceptions.InvalidJsonException
Calls NoSuchElementException.NoSuchElementException().
InvalidJsonException(String) - Constructor for exception de.hpi.fgis.dude.exceptions.InvalidJsonException
Calls NoSuchElementException.NoSuchElementException(String).
InvalidJsonException(Throwable) - Constructor for exception de.hpi.fgis.dude.exceptions.InvalidJsonException
Calls NoSuchElementException.NoSuchElementException() and stores the passed cause.
InvalidJsonException(String, Throwable) - Constructor for exception de.hpi.fgis.dude.exceptions.InvalidJsonException
Calls NoSuchElementException.NoSuchElementException(String) and stores the passed cause.
InvalidSchemaException - Exception in de.hpi.fgis.dude.exceptions
This Exception can be thrown in every schema-related case.
InvalidSchemaException() - Constructor for exception de.hpi.fgis.dude.exceptions.InvalidSchemaException
Calls Exception.Exception().
InvalidSchemaException(String) - Constructor for exception de.hpi.fgis.dude.exceptions.InvalidSchemaException
Calls Exception.Exception(String).
InvalidSchemaException(Throwable) - Constructor for exception de.hpi.fgis.dude.exceptions.InvalidSchemaException
Calls Exception.Exception(Throwable).
InvalidSchemaException(String, Throwable) - Constructor for exception de.hpi.fgis.dude.exceptions.InvalidSchemaException
Calls Exception.Exception(String, Throwable).
invoke(Object, String, Object...) - Static method in class de.hpi.fgis.dude.util.ReflectUtil
Dynamically invokes the specified function on an object with the given parameters.
isAbortIncrease() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.DuplicateCountSNM
Checks whether aborting the increase is enabled.
isAccessible() - Method in class de.hpi.fgis.dude.util.sorting.sorter.SortedDataFile
Checks whether the file can be accessed.
isActivated() - Method in enum de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent.Config
Checks if current config measure is activated for usage.
isAlreadyInUse(String) - Method in class de.hpi.fgis.dude.util.DataSourceIdentifierManager
Checks whether the passed id is already in use.
isAtomic() - Method in enum de.hpi.fgis.dude.util.data.json.JsonValue.JsonType
Checks, whether the current type is atomic.
isCheckMemory() - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Gets the boolean flag that indicates the activation status of memory checking.
isClosed() - Method in class de.hpi.fgis.dude.util.csv.CSVReader
Checks whether the reader was already closed.
isClosed() - Method in class de.hpi.fgis.dude.util.csv.CSVWriter
Checks whether the writer was already closed.
isDuplicate(DuDeObjectPair) - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Returns true if the DuDeObjectPair exists in the set of real duplicate pairs.
isDuplicate() - Method in class de.hpi.fgis.dude.util.data.DuDeObjectPair
Checks whether the current pair represents a duplicate.
isEmpty() - Method in class de.hpi.fgis.dude.database.util.Schema
Checks whether the current schema contains no columns.
isEmpty() - Method in class de.hpi.fgis.dude.util.sorting.sortingkey.SortingKey
Checks whether any Subkey was added.
isFormattedJson() - Method in class de.hpi.fgis.dude.util.data.storage.AbstractDuDeStorage
 
isFormattedJson() - Method in interface de.hpi.fgis.dude.util.data.storage.JsonWritable
Checks whether formatted Json is enabled in this storage.
isInstantiable(Class<? extends Object>) - Method in class de.hpi.fgis.dude.util.data.json.auto.AutoJsonSerialization
Determines whether the given class has an accessible default constructor and caches the result.
isInstantiable(Class<?>) - Static method in class de.hpi.fgis.dude.util.ReflectUtil
Returns true if the given type has an accessible default constructor.
Note: this method is thread-safe
isMerged() - Method in class de.hpi.fgis.dude.util.data.DuDeObject
Checks, if this DuDeObject is a merged object.
isMonthAbbreviation(String) - Static method in class de.hpi.fgis.dude.util.bibtex.data.BibtexStandardMacros
Checks whether the passed String is a standard macro key for month names.
isNonDuplicate(DuDeObjectPair) - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Checks whether a specific pair exists in the set of real duplicate pairs.
isOthers() - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexPerson
Returns true, if this instance represents the BibTex "Other authors" value; otherwise false.
isReflexive() - Method in class de.hpi.fgis.dude.util.Pair
Checks whether this instance is reflexive.
isSameTypeOrPrimitive(Class<?>, Class<?>) - Static method in class de.hpi.fgis.dude.util.ReflectUtil
Returns true if both types are the same or represent the same primitive or boxing type.
isStandardMacro(String) - Static method in class de.hpi.fgis.dude.util.bibtex.data.BibtexStandardMacros
Checks whether the passed String is the name of a standard macro.
isSymmetricTo(Pair<T1, T2>) - Method in class de.hpi.fgis.dude.util.Pair
Checks whether the passed Pair is the symmetric correspondent of this instance.
isValid() - Method in class de.hpi.fgis.dude.util.data.DuDeObjectId
Checks if the id is valid.
iterator() - Method in class de.hpi.fgis.dude.algorithm.AbstractAlgorithm
Starts the extraction and preprocessing phase if necessary and returns an Iterator instance for iterating over the algorithm's result.
iterator() - Method in class de.hpi.fgis.dude.algorithm.AbstractDuplicateDetection
 
iterator() - Method in class de.hpi.fgis.dude.algorithm.AbstractRecordLinkage
 
iterator() - Method in class de.hpi.fgis.dude.database.DatabaseSource
 
iterator() - Method in class de.hpi.fgis.dude.database.util.Schema
Returns an iterator that iterates over the Schema instance.
iterator() - Method in class de.hpi.fgis.dude.datasource.AbstractDataSource
 
iterator() - Method in class de.hpi.fgis.dude.datasource.BibtexSource
 
iterator() - Method in class de.hpi.fgis.dude.datasource.CSVSource
 
iterator() - Method in class de.hpi.fgis.dude.datasource.DuDeObjectSource
 
iterator() - Method in class de.hpi.fgis.dude.datasource.JSONSource
 
iterator() - Method in class de.hpi.fgis.dude.datasource.XMLSource
 
iterator() - Method in class de.hpi.fgis.dude.postprocessor.NaiveTransitiveClosureGenerator
 
iterator() - Method in class de.hpi.fgis.dude.postprocessor.NaiveTransitiveClosureGenerator.TransitiveClosure
 
iterator() - Method in class de.hpi.fgis.dude.postprocessor.WarshallTransitiveClosureGenerator
 

J

JaccardSimilarityFunction - Class in de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics
JaccardSimilarityFunction compares two DuDeObjects based on the Jaccard Coefficient of the given attribute.
JaccardSimilarityFunction() - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.JaccardSimilarityFunction
Internal constructor for Jsonable deserialization.
JaccardSimilarityFunction(String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.JaccardSimilarityFunction
Initializes the JaccardSimilarityFunction with the default tokenizer.
JaccardSimilarityFunction(int, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.JaccardSimilarityFunction
Initializes the JaccardSimilarityFunction with the default tokenizer.
JaccardSimilarityFunction(InterfaceTokeniser, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.JaccardSimilarityFunction
Initializes the JaccardSimilarityFunction with the passed InterfaceTokeniser.
JaccardSimilarityFunction(InterfaceTokeniser, int, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.JaccardSimilarityFunction
Initializes the JaccardSimilarityFunction with the passed InterfaceTokeniser.
JaroDistanceFunction - Class in de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics
JaroDistanceFunction compares two DuDeObjects based on the Jaro Distance of the given attribute.
JaroDistanceFunction() - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.JaroDistanceFunction
Internal constructor for Jsonable deserialization.
JaroDistanceFunction(String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.JaroDistanceFunction
Initializes the JaroDistanceFunction with the default tokenizer.
JaroDistanceFunction(int, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.JaroDistanceFunction
Initializes the JaroDistanceFunction with the default tokenizer.
JaroWinklerFunction - Class in de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics
JaroWinklerFunction compares two DuDeObjects based on the extended JaroWinkler distance of the given attribute.
JaroWinklerFunction() - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.JaroWinklerFunction
Internal constructor for Jsonable deserialization.
JaroWinklerFunction(String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.JaroWinklerFunction
Initializes the JaroWinklerFunction with the default tokenizer.
JaroWinklerFunction(int, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.JaroWinklerFunction
Initializes the JaroWinklerFunction with the default tokenizer.
JSON_FILE_EXTENSION - Static variable in class de.hpi.fgis.dude.util.data.storage.FileBasedStorage
The file extension that is used for each file.
Jsonable - Interface in de.hpi.fgis.dude.util.data
Jsonable can be used by classes whose instances shall be Json-convertible.
JsonableReader<T extends Jsonable> - Interface in de.hpi.fgis.dude.util.data.storage
A JsonableReader can be used to read the content of a Jsonable storage.
JsonableWriter<T extends Jsonable> - Interface in de.hpi.fgis.dude.util.data.storage
A JsonableWriter can be used to add data to a JsonWritable.
JsonArray - Class in de.hpi.fgis.dude.util.data.json
JsonArray represents an ordered collection and provides functionality for collecting multiple instances of JsonValue.
JsonArray() - Constructor for class de.hpi.fgis.dude.util.data.json.JsonArray
Initializes a new JsonArray.
JsonArray(JsonValue...) - Constructor for class de.hpi.fgis.dude.util.data.json.JsonArray
Initializes a new JsonArray with the passed JsonValue.
JsonArray(Collection<? extends JsonValue>) - Constructor for class de.hpi.fgis.dude.util.data.json.JsonArray
Initializes a new JsonArray with the passed data.
JsonArray(int) - Constructor for class de.hpi.fgis.dude.util.data.json.JsonArray
Initializes a new JsonArray with a predefined capacity.
JsonAtomic - Interface in de.hpi.fgis.dude.util.data.json
JsonAtomic represents all atomic Json types.
JsonBoolean - Class in de.hpi.fgis.dude.util.data.json
JsonBoolean represents a boolean value that can be converted into Json.
jsonize(String) - Static method in class de.hpi.fgis.dude.util.data.json.JsonString
Converts the passed String into its Json representation.
JsonNull - Class in de.hpi.fgis.dude.util.data.json
JsonNull represents the Json null value.
JsonNumber - Class in de.hpi.fgis.dude.util.data.json
JsonNumber represents a Json-convertible number.
JsonNumber() - Constructor for class de.hpi.fgis.dude.util.data.json.JsonNumber
Initializes a JsonNumber with 0.
JsonNumber(Number) - Constructor for class de.hpi.fgis.dude.util.data.json.JsonNumber
Initializes a JsonNumber.
JsonOutput - Class in de.hpi.fgis.dude.output
JsonOutput converts the passed DuDeObject pairs into Json syntax.
JsonOutput(OutputStream) - Constructor for class de.hpi.fgis.dude.output.JsonOutput
Initializes a JsonOutput formatter.
JsonOutput(File) - Constructor for class de.hpi.fgis.dude.output.JsonOutput
Initializes a JsonOutput formatter.
JsonOutput() - Constructor for class de.hpi.fgis.dude.output.JsonOutput
Internal constructor for Jsonable deserialization.
JsonReadable<T extends Jsonable> - Interface in de.hpi.fgis.dude.util.data.storage
JsonReadable is an interface for adding readable functionality to some Jsonable storage.
JsonRecord - Class in de.hpi.fgis.dude.util.data.json
JsonRecord represents a Json record.
JsonRecord() - Constructor for class de.hpi.fgis.dude.util.data.json.JsonRecord
Initializes a JsonRecord.
JsonRecord(int) - Constructor for class de.hpi.fgis.dude.util.data.json.JsonRecord
Initializes a JsonRecord with the passed initial capacity.
JsonRecord(Map<? extends String, ? extends JsonValue>) - Constructor for class de.hpi.fgis.dude.util.data.json.JsonRecord
Initializes a new JsonRecord with the passed data.
JsonRecord(int, float) - Constructor for class de.hpi.fgis.dude.util.data.json.JsonRecord
Initializes a JsonRecord with the passed initial capacity and its load factor.
JSONSource - Class in de.hpi.fgis.dude.datasource
JSONSource represents files containing Json syntax.
JSONSource() - Constructor for class de.hpi.fgis.dude.datasource.JSONSource
Internal constructor for Jsonable deserialization.
JSONSource(String, File) - Constructor for class de.hpi.fgis.dude.datasource.JSONSource
Initializes a JSONSource.
JSONSource.JSONSourceIterator - Class in de.hpi.fgis.dude.datasource
JSONSourceIterator is used for generating DuDeObjects out of JSONSources.
JSONSource.JSONSourceIterator(JSONSource) - Constructor for class de.hpi.fgis.dude.datasource.JSONSource.JSONSourceIterator
Initializes a JSONSourceIterator using the passed JSONSource.
JsonString - Class in de.hpi.fgis.dude.util.data.json
JsonString represents a Json-convertible String.
JsonString() - Constructor for class de.hpi.fgis.dude.util.data.json.JsonString
Initializes an empty Json-convertible String.
JsonString(String) - Constructor for class de.hpi.fgis.dude.util.data.json.JsonString
Initializes a new Json-convertible String.
JsonTypeManager - Class in de.hpi.fgis.dude.util.data.json.auto
Singleton for managing the AutoJsonSerializations.
JsonUtil - Class in de.hpi.fgis.dude.util.data.json
Convenience methods for dealing with Jsonables.
JsonUtil() - Constructor for class de.hpi.fgis.dude.util.data.json.JsonUtil
 
JsonValue - Interface in de.hpi.fgis.dude.util.data.json
JsonValue provides methods that has to be implemented by every Json data type.
JsonValue.JsonType - Enum in de.hpi.fgis.dude.util.data.json
JsonValue.JsonType includes all Json types that can be returned by JsonValue.getType().
JsonWritable<T extends Jsonable> - Interface in de.hpi.fgis.dude.util.data.storage
JsonWritable is an interface for adding writable functionality to some Jsonable storage.

K

KEY_ATTRIBUTE - Static variable in class de.hpi.fgis.dude.datasource.BibtexSource
The name of the attribute that stores the key.

L

last(List<T>) - Static method in class de.hpi.fgis.dude.util.CollectionUtil
Returns the last element of the collection.
last(Iterable<T>) - Static method in class de.hpi.fgis.dude.util.CollectionUtil
Returns the last element of the collection.
Lego - Class in de.hpi.fgis.dude.algorithm.duplicatedetection
Lego is an iterative blocking approach.
Lego() - Constructor for class de.hpi.fgis.dude.algorithm.duplicatedetection.Lego
For serialization.
Lego(SortingKey...) - Constructor for class de.hpi.fgis.dude.algorithm.duplicatedetection.Lego
Initializes Lego with the passed SortingKey's.
Lego.ComparisonResult - Enum in de.hpi.fgis.dude.algorithm.duplicatedetection
 
Lego.LegoIterator - Class in de.hpi.fgis.dude.algorithm.duplicatedetection
 
Lego.LegoIterator(Algorithm, Set<SortingKey>, Merger, JsonableReader<DuDeObject>) - Constructor for class de.hpi.fgis.dude.algorithm.duplicatedetection.Lego.LegoIterator
Initializes a LegoIterator.
LegoExec - Class in de.hpi.fgis.dude.exec.duplicatedetection
The execution class for the Lego algorithm.
LegoExec() - Constructor for class de.hpi.fgis.dude.exec.duplicatedetection.LegoExec
 
LevenshteinDistance - Class in de.hpi.fgis.dude.similarityfunction.contentbased.util
LevenshteinDistance implements an Edit-Distance approach using the Levenshtein Distance algorithm.
LevenshteinDistance() - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.util.LevenshteinDistance
 
LevenshteinDistanceFunction - Class in de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics
LevenshteinDistanceFunction compares two DuDeObjects based on the Levenshtein Distance of the given attribute.
LevenshteinDistanceFunction() - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.LevenshteinDistanceFunction
Internal constructor for Jsonable deserialization.
LevenshteinDistanceFunction(String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.LevenshteinDistanceFunction
Initializes the LevenshteinDistanceFunction with the default tokenizer.
LevenshteinDistanceFunction(int, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.LevenshteinDistanceFunction
Initializes the LevenshteinDistanceFunction with the default tokenizer.
line - Variable in class de.hpi.fgis.dude.util.bibtex.parser.PseudoLexer.Token
The line information of this token.
LINEAGE_PROPERTY - Static variable in class de.hpi.fgis.dude.util.data.DuDeObjectPair
The name of the lineage property.
loadClusters() - Method in class de.hpi.fgis.dude.util.GoldStandard
Reads in the gold standard in cluster format [objectId1 sourceId1];[objectId2 sourceId2] from the given file.
loadConfig(Properties) - Method in class de.hpi.fgis.dude.util.GlobalConfig
Loads the configuration from the passed Properties.
loadData() - Method in class de.hpi.fgis.dude.util.GoldStandard
Extracts the gold standard either in cluster or duplicate pair format.
loadDatabaseInformation(InputStream) - Method in class de.hpi.fgis.dude.database.adapter.Database
Loads the data from the InputStream.
loadDatabaseInformation(Properties) - Method in class de.hpi.fgis.dude.database.adapter.Database
Loads the data from a Properties instance.
loadDuplicatePairs() - Method in class de.hpi.fgis.dude.util.GoldStandard
Extracts the gold standard out of a file with duplicate pairs.
loadNextElement() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.AdaptiveSNM_Yan2007.YanIterator
 
loadNextElement() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.DuplicateCountSNM.AdaptiveWindowSizeSNMIterator
Returns the element in the iteration.
loadNextElement() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.Lego.LegoIterator
 
loadNextElement() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.SortedNeighborhoodMethod.SortedNeighborhoodMethodIterator
 
loadNextElement() - Method in class de.hpi.fgis.dude.algorithm.recordlinkage.NaiveRecordLinkage.NaiveRecordLinkageIterator
 
loadNextElement() - Method in class de.hpi.fgis.dude.datasource.AbstractDataSource.AbstractDataSourceIterator
Generates a DuDeObject based on the data returned by AbstractDataSource.AbstractDataSourceIterator.loadNextRecord().
loadNextElement() - Method in class de.hpi.fgis.dude.postprocessor.NaiveTransitiveClosureGenerator.TransitiveClosureIterator
 
loadNextElement() - Method in class de.hpi.fgis.dude.postprocessor.WarshallTransitiveClosureGenerator.TransitiveClosureIterator
 
loadNextElement() - Method in class de.hpi.fgis.dude.util.AbstractIterator
Returns the element of the next iteration step.
loadNextElement() - Method in class de.hpi.fgis.dude.util.ChainedIterator
 
loadNextElement() - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonParser
 
loadNextRecord() - Method in class de.hpi.fgis.dude.database.DatabaseSource.DatabaseSourceIterator
 
loadNextRecord() - Method in class de.hpi.fgis.dude.datasource.AbstractDataSource.AbstractDataSourceIterator
Returns the data of the next object.
loadNextRecord() - Method in class de.hpi.fgis.dude.datasource.BibtexSource.BibtexSourceIterator
 
loadNextRecord() - Method in class de.hpi.fgis.dude.datasource.CSVSource.CSVSourceIterator
 
loadNextRecord() - Method in class de.hpi.fgis.dude.datasource.JSONSource.JSONSourceIterator
 
loadNextRecord() - Method in class de.hpi.fgis.dude.datasource.XMLSource.XMLSourceIterator
 
loadProperties(Properties) - Method in class de.hpi.fgis.dude.database.util.DBInfo
Loads the settings into this DBInfo object using a Properties table.
loadProperties(InputStream) - Method in class de.hpi.fgis.dude.database.util.DBInfo
Loads the settings out of a stream into the DBInfo object.
loadProperties(String) - Method in class de.hpi.fgis.dude.database.util.DBInfo
Loads the properties out of a file specified by its path.
LookAheadReader - Class in de.hpi.fgis.dude.util.bibtex.parser
This implementation now features a buffer.
LookAheadReader(Reader) - Constructor for class de.hpi.fgis.dude.util.bibtex.parser.LookAheadReader
Initializes a LookAheadReader that reads from input.

M

MacroReferenceExpander - Class in de.hpi.fgis.dude.util.bibtex.expander
This expander expands macro references into strings - have a look at the options that can be given in the constructor.
MacroReferenceExpander(boolean, boolean, boolean) - Constructor for class de.hpi.fgis.dude.util.bibtex.expander.MacroReferenceExpander
This is just a convenience / backward compatibility constructor.
MacroReferenceExpander(boolean, boolean, boolean, boolean) - Constructor for class de.hpi.fgis.dude.util.bibtex.expander.MacroReferenceExpander
 
main(String[]) - Static method in class de.hpi.fgis.dude.exec.duplicatedetection.AdaptiveSNMExec_Yan2007
Executes a AdaptiveSNM_Yan2007 example experiment.
main(String[]) - Static method in class de.hpi.fgis.dude.exec.duplicatedetection.CDExec
Executes the naive duplicate detection on the CD data source.
main(String[]) - Static method in class de.hpi.fgis.dude.exec.duplicatedetection.CoraExec
Executes the naive duplicate detection on the CORA data source.
main(String[]) - Static method in class de.hpi.fgis.dude.exec.duplicatedetection.DuplicateCountSNMExec
Sample main method in which DuDe extracts data from an XML file and runs the DuplicateCountSNM algorithm.
main(String[]) - Static method in class de.hpi.fgis.dude.exec.duplicatedetection.ExperimentExec
Sample main method in which DuDe extracts data from a Json file.
main(String[]) - Static method in class de.hpi.fgis.dude.exec.duplicatedetection.GMDEvaluationExec_CORA
 
main(String[]) - Static method in class de.hpi.fgis.dude.exec.duplicatedetection.GMDEvaluationExec_Restaurant
 
main(String[]) - Static method in class de.hpi.fgis.dude.exec.duplicatedetection.GSwooshExec
Executes the GSwoosh duplicate detection on the restaurant data source.
main(String[]) - Static method in class de.hpi.fgis.dude.exec.duplicatedetection.HugePersonDataTestExecClass
Runs the SortedNeighborhoodMethod on a huge data set.
main(String[]) - Static method in class de.hpi.fgis.dude.exec.duplicatedetection.LegoExec
Sample main method in which DuDe extracts data from an XML file and runs the Sorted-Neighborhood-Method algorithm.
main(String[]) - Static method in class de.hpi.fgis.dude.exec.duplicatedetection.RestaurantExec
Executes the naive duplicate detection on the restaurant data source.
main(String[]) - Static method in class de.hpi.fgis.dude.exec.duplicatedetection.RSwooshExec
Executes the RSwoosh duplicate detection on the restaurant data source.
main(String[]) - Static method in class de.hpi.fgis.dude.exec.duplicatedetection.SortedBlocksExec
 
main(String[]) - Static method in class de.hpi.fgis.dude.exec.duplicatedetection.SortedNeighborhoodMethodExec
Sample main method in which DuDe extracts data from an XML file and runs the Sorted-Neighborhood-Method algorithm.
main(String[]) - Static method in class de.hpi.fgis.dude.exec.duplicatedetection.TFIDFExperimentExec
Runs the SortedNeighborhoodMethod on a huge data set.
main(String[]) - Static method in class de.hpi.fgis.dude.exec.recordlinkage.NaiveRecordLinkageExec
Sample main method.
main(String[]) - Static method in class de.hpi.fgis.dude.output.statisticoutput.SimpleStatisticOutput
 
makeConcatenatedValue(BibtexAbstractValue, BibtexAbstractValue) - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexFile
Creates a BibtexConcatenatedValue.
makeEntry(String, String) - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexFile
Creates a BibtexEntry.
makeMacroDefinition(String, BibtexAbstractValue) - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexFile
Creates a BibtexMacroDefinition.
makeMacroReference(String) - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexFile
Creates a BibtexMacroReference.
makeMemoryUsageSnapshot() - Method in class de.hpi.fgis.dude.util.MemoryChecker
Checks the memory usage status and makes a statistical snapshot.
makePerson(String, String, String, String, boolean) - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexFile
Creates a BibtexPerson.
makePersonList() - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexFile
Creates a BibtexPersonList.
makePreamble(BibtexAbstractValue) - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexFile
Creates a BibtexPreamble.
makeString(String) - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexFile
Creates a BibtexString.
makeToplevelComment(String) - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexFile
Creates a BibtexToplevelComment.
MatchingCoefficientFunction - Class in de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics
MatchingCoefficientFunction compares two DuDeObjects based on the Matching Coefficient of the given attribute.
MatchingCoefficientFunction() - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.MatchingCoefficientFunction
Internal constructor for Jsonable deserialization.
MatchingCoefficientFunction(String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.MatchingCoefficientFunction
Initializes the MatchingCoefficientFunction with the default tokenizer.
MatchingCoefficientFunction(int, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.MatchingCoefficientFunction
Initializes the MatchingCoefficientFunction with the default tokenizer.
MatchingCoefficientFunction(InterfaceTokeniser, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.MatchingCoefficientFunction
Initializes the MatchingCoefficientFunction with the passed InterfaceTokeniser.
MatchingCoefficientFunction(InterfaceTokeniser, int, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.MatchingCoefficientFunction
Initializes the MatchingCoefficientFunction with the passed InterfaceTokeniser.
maxAbsoluteVariation - Variable in class de.hpi.fgis.dude.similarityfunction.contentbased.impl.AbsoluteNumberDiffFunction
The absolute value of maximum allowed variation.
Maximum - Class in de.hpi.fgis.dude.similarityfunction.aggregators
Maximum returns the maximal similarity of the added SimilarityFunctions.
Maximum() - Constructor for class de.hpi.fgis.dude.similarityfunction.aggregators.Maximum
Internal constructor for Jsonable deserialization.
Maximum(SimilarityFunction...) - Constructor for class de.hpi.fgis.dude.similarityfunction.aggregators.Maximum
Initializes a Maximum instance.
maxToleranceFactor - Variable in class de.hpi.fgis.dude.similarityfunction.contentbased.impl.RelativeNumberDiffFunction
The maximum percentage of allowed absolute variation.
MemoryChecker - Class in de.hpi.fgis.dude.util
MemoryChecker is a Singleton implementation, that maintains the memory usage.
MemoryChecker() - Constructor for class de.hpi.fgis.dude.util.MemoryChecker
Initializes the singleton instance.
MemoryCheckerTask - Class in de.hpi.fgis.dude.util
This task retrieves the memory usage of the current process and stores the maximum memory usage ever occurred.
MemoryCheckerTask() - Constructor for class de.hpi.fgis.dude.util.MemoryCheckerTask
 
memoryCheckFrequency - Variable in class de.hpi.fgis.dude.postprocessor.StatisticComponent
 
memoryLimitReached() - Method in class de.hpi.fgis.dude.util.sorting.sorter.AbstractDuDeObjectSorter
Checks whether the memory limit was reached.
merge(DuDeObject, DuDeObject) - Method in class de.hpi.fgis.dude.merger.AbstractMerger
Merges two DuDeObjects into one new DuDeObject.
merge(DuDeObject, DuDeObject) - Method in interface de.hpi.fgis.dude.merger.Merger
Merges two DuDeObjects into one new DuDeObject.
merge(NaiveTransitiveClosureGenerator.TransitiveClosure) - Method in class de.hpi.fgis.dude.postprocessor.NaiveTransitiveClosureGenerator.TransitiveClosure
Merges the passed TransitiveClosure with the current one.
merge(DuDeObject, DuDeObject) - Method in class de.hpi.fgis.dude.util.merger.AbstractMerger
Merges two DuDeObjects into one new DuDeObject.
merge(DuDeObject, DuDeObject) - Method in interface de.hpi.fgis.dude.util.merger.Merger
Merges two DuDeObjects into one new DuDeObject.
mergeData(DuDeObject, DuDeObject) - Method in class de.hpi.fgis.dude.merger.AbstractMerger
Creates a new data set containing the merged data from the passed DuDeObjects.
mergeData(DuDeObject, DuDeObject) - Method in class de.hpi.fgis.dude.merger.DefaultMerger
Merges the data from the DuDeObjects into a new JsonRecord.
mergeData(DuDeObject, DuDeObject) - Method in class de.hpi.fgis.dude.util.merger.AbstractMerger
Creates a new data set containing the merged data from the passed DuDeObjects.
mergeData(DuDeObject, DuDeObject) - Method in class de.hpi.fgis.dude.util.merger.DefaultMerger
Merges the data from the DuDeObjects into a new JsonRecord.
Merger - Interface in de.hpi.fgis.dude.merger
Merger is used to merge two DuDeObjects into one new DuDeObject.
Merger - Interface in de.hpi.fgis.dude.util.merger
Merger is used to merge two DuDeObjects into one new DuDeObject.
Minimum - Class in de.hpi.fgis.dude.similarityfunction.aggregators
Minimum returns the minimal similarity of the added SimilarityFunctions.
Minimum() - Constructor for class de.hpi.fgis.dude.similarityfunction.aggregators.Minimum
Internal constructor for Jsonable deserialization.
Minimum(SimilarityFunction...) - Constructor for class de.hpi.fgis.dude.similarityfunction.aggregators.Minimum
Initializes a Minimum instance.
MongeElkanFunction - Class in de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics
MongeElkanFunction compares two DuDeObjects based on the Monge Elkan Distance of the given attribute.
MongeElkanFunction() - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.MongeElkanFunction
Internal constructor for Jsonable deserialization.
MongeElkanFunction(String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.MongeElkanFunction
Initializes the MongeElkanFunction with the default tokenizer.
MongeElkanFunction(int, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.MongeElkanFunction
Initializes the MongeElkanFunction with the default tokenizer.
MongeElkanFunction(InterfaceTokeniser, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.MongeElkanFunction
Initializes the MongeElkanFunction with the passed InterfaceTokeniser.
MongeElkanFunction(InterfaceTokeniser, int, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.MongeElkanFunction
Initializes the MongeElkanFunction with the passed InterfaceTokeniser.
MongeElkanSimilarity - Class in de.hpi.fgis.dude.similarityfunction.contentbased.util
This class is a derived implementation of the Monge Elkan metric from the simmetric library.
MongeElkanSimilarity() - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.util.MongeElkanSimilarity
constructor - default (empty).
MongeElkanSimilarity(InterfaceTokeniser) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.util.MongeElkanSimilarity
constructor.
MongeElkanSimilarity(InterfaceTokeniser, AbstractStringMetric) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.util.MongeElkanSimilarity
constructor.
MongeElkanSimilarity(AbstractStringMetric) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.util.MongeElkanSimilarity
constructor.
MultipleOutput - Class in de.hpi.fgis.dude.output
The MultipleOutput to support more than one output formats.
MultipleOutput(DuDeOutput...) - Constructor for class de.hpi.fgis.dude.output.MultipleOutput
The default constructor initialized the MultipleOutput.
MultipleOutput() - Constructor for class de.hpi.fgis.dude.output.MultipleOutput
The default constructor initialized the MultipleOutput.
MySQLDatabase - Class in de.hpi.fgis.dude.database.adapter
MySQLDatabase encapsulates all the necessary information for establishing a connection to a MySQL database.
MySQLDatabase(DBInfo) - Constructor for class de.hpi.fgis.dude.database.adapter.MySQLDatabase
Initializes the MySQLDatabase instance members and loads the settings provided by the parameter dbInfo.
MySQLDatabase(InputStream) - Constructor for class de.hpi.fgis.dude.database.adapter.MySQLDatabase
Initializes the MySQLDatabase using the passed InputStream.
MySQLDatabase(Properties) - Constructor for class de.hpi.fgis.dude.database.adapter.MySQLDatabase
Initializes the MySQLDatabase using the passed Properties.

N

n - Variable in enum de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent.Config
Total number of base records, needed for computation of VI measure.
NaiveBlockingAlgorithm - Class in de.hpi.fgis.dude.algorithm.duplicatedetection
NaiveBlockingAlgorithm is the naive blocking approach.
NaiveBlockingAlgorithm() - Constructor for class de.hpi.fgis.dude.algorithm.duplicatedetection.NaiveBlockingAlgorithm
For serialization.
NaiveBlockingAlgorithm(SortingKey) - Constructor for class de.hpi.fgis.dude.algorithm.duplicatedetection.NaiveBlockingAlgorithm
Initializes a NaiveBlockingAlgorithm with the passed SortingKey.
NaiveBlockingAlgorithm(SortingKey, int) - Constructor for class de.hpi.fgis.dude.algorithm.duplicatedetection.NaiveBlockingAlgorithm
Initializes a NaiveBlockingAlgorithm with the passed SortingKey.
NaiveDuplicateDetection - Class in de.hpi.fgis.dude.algorithm.duplicatedetection
NaiveDuplicateDetection implements the naive approach of checking all possible pairs.
NaiveDuplicateDetection() - Constructor for class de.hpi.fgis.dude.algorithm.duplicatedetection.NaiveDuplicateDetection
Initializes a NaiveDuplicateDetection instance.
NaiveRecordLinkage - Class in de.hpi.fgis.dude.algorithm.recordlinkage
NaiveRecordLinkage implements the naive approach for record-linkage.
NaiveRecordLinkage() - Constructor for class de.hpi.fgis.dude.algorithm.recordlinkage.NaiveRecordLinkage
 
NaiveRecordLinkage.NaiveRecordLinkageIterator - Class in de.hpi.fgis.dude.algorithm.recordlinkage
NaiveRecordLinkageIterator implements the actual functionality of the naive record-linkage approach.
NaiveRecordLinkage.NaiveRecordLinkageIterator(Iterable<Map.Entry<DataSource, DuDeStorage<DuDeObject>>>) - Constructor for class de.hpi.fgis.dude.algorithm.recordlinkage.NaiveRecordLinkage.NaiveRecordLinkageIterator
Initializes the NaiveRecordLinkageIterator with the passed data.
NaiveRecordLinkageExec - Class in de.hpi.fgis.dude.exec.recordlinkage
NaiveRecordLinkageExec contains a code-snippet that illustrates, how to use the NaiveRecordLinkage implementation.
NaiveRecordLinkageExec() - Constructor for class de.hpi.fgis.dude.exec.recordlinkage.NaiveRecordLinkageExec
 
NaiveTransitiveClosureGenerator - Class in de.hpi.fgis.dude.postprocessor
NaiveTransitiveClosureGenerator implements the naive way of generating transitive closures.
NaiveTransitiveClosureGenerator() - Constructor for class de.hpi.fgis.dude.postprocessor.NaiveTransitiveClosureGenerator
 
NaiveTransitiveClosureGenerator.TransitiveClosure - Class in de.hpi.fgis.dude.postprocessor
TransitiveClosure represents one transitive closure.
NaiveTransitiveClosureGenerator.TransitiveClosure(DuDeObjectPair) - Constructor for class de.hpi.fgis.dude.postprocessor.NaiveTransitiveClosureGenerator.TransitiveClosure
Initializes a NaiveTransitiveClosureGenerator.TransitiveClosure.
NaiveTransitiveClosureGenerator.TransitiveClosureIterator - Class in de.hpi.fgis.dude.postprocessor
NaiveTransitiveClosureGenerator.TransitiveClosureIterator is used to iterate over all pairs collected or generated by the NaiveTransitiveClosureGenerator.
NeedlemanWunschFunction - Class in de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics
NeedlemanWunschFunction compares two DuDeObject s based on the Needleman Wunch Distance of the given attribute.
NeedlemanWunschFunction() - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.NeedlemanWunschFunction
Internal constructor for Jsonable deserialization.
NeedlemanWunschFunction(String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.NeedlemanWunschFunction
Initializes the NeedlemanWunschFunction.
NeedlemanWunschFunction(int, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.NeedlemanWunschFunction
Initializes the NeedlemanWunschFunction.
NeedlemanWunschFunction(float, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.NeedlemanWunschFunction
Initializes the NeedlemanWunschFunction.
NeedlemanWunschFunction(float, int, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.NeedlemanWunschFunction
Initializes the NeedlemanWunschFunction.
NeedlemanWunschFunction(AbstractSubstitutionCost, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.NeedlemanWunschFunction
Initializes the NeedlemanWunschFunction.
NeedlemanWunschFunction(AbstractSubstitutionCost, int, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.NeedlemanWunschFunction
Initializes the NeedlemanWunschFunction.
NeedlemanWunschFunction(float, AbstractSubstitutionCost, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.NeedlemanWunschFunction
Initializes the NeedlemanWunschFunction.
NeedlemanWunschFunction(float, AbstractSubstitutionCost, int, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.NeedlemanWunschFunction
Initializes the NeedlemanWunschFunction.
newInstance(Class<T>) - Static method in class de.hpi.fgis.dude.util.ReflectUtil
Creates a new instance of the given type by invoking the default constructor.
newInstance(Class<T>, Object...) - Static method in class de.hpi.fgis.dude.util.ReflectUtil
Creates a new instance of the given type by invoking the best public constructor for the given parameter.
If there are multiple compatible constructors, the most specific one is chosen.
next() - Method in class de.hpi.fgis.dude.algorithm.AbstractAlgorithm.AlgorithmIteratorWrapper
 
next() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.DuplicateCountSNM.AdaptiveWindowSizeSNMIterator
 
next() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.Lego.LegoIterator
next() - Method in class de.hpi.fgis.dude.util.AbstractIterator
 
next() - Method in class de.hpi.fgis.dude.util.bibtex.expander.BibtexPersonListParser.StringIterator
 
next() - Method in class de.hpi.fgis.dude.util.sorting.sorter.SortedDataFile
Sets the the iteration pointer to the next element and returns it.
nextBlockExists() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.AdaptiveSNM_Yan2007.YanIterator
Checks whether a next block exists.
nextBoolean() - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonParser
Returns the current boolean value.
nextFieldName() - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonParser
Returns the current field name.
nextJsonable(Class<ReturnType>) - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonParser
Returns the next Json instance.
nextJsonArray() - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonParser
Returns the next JsonArray.
nextJsonBoolean() - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonParser
Returns the next JsonBoolean.
nextJsonNull() - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonParser
Returns the next JsonNull.
nextJsonNumber() - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonParser
Returns the next JsonNumber.
nextJsonRecord() - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonParser
Returns the next JsonRecord.
nextJsonString() - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonParser
Returns the next JsonString.
nextJsonValue() - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonParser
Returns any JsonValue.
nextLine() - Method in class de.hpi.fgis.dude.util.csv.CSVReader
Returns the data of the next line or null, if the end of the data source was reached.
nextObject(Class<C>) - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonParser
Returns the next instance of the given type initialized with the current json.
nextRecordExists() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.AdaptiveSNM_Yan2007.YanIterator
Checks whether the record queue or the data source has a next element.
nextString() - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonParser
Returns the current textual value.
nextToken() - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonParser
Loads the next token from the stream.
NO_SIMILARITY_SET_VALUE - Static variable in class de.hpi.fgis.dude.util.data.DuDeObjectPair
This value will be returned, if no similarity was set.
NO_STATISTIC_VALUE - Static variable in class de.hpi.fgis.dude.postprocessor.StatisticComponent
 
NO_VOWELS_REGEX - Static variable in class de.hpi.fgis.dude.util.sorting.sortingkey.TextBasedSubkey
This regular expression can be used if vocals should be ignored.
normalize(double, double) - Method in class de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent
Returns a value normalized to 1 of the first given value, where the second value is the maximum.
notifyOfLatestComparisonResult(DuplicateCountSNM.ComparisonResult) - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.DuplicateCountSNM
Notifies the algorithm, whether the latest object pair has been categorized as a duplicate or a non-duplicate
notifyOfLatestComparisonResult(Lego.ComparisonResult) - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.Lego
Notifies the algorithm, whether the latest object pair has been categorized as a duplicate or a non-duplicate
NotSupportedStrategy<T1 extends JsonValue,T2 extends JsonValue> - Class in de.hpi.fgis.dude.similarityfunction.contentbased.calculationstrategy
NotSupportedStrategy will throw an IllegalArgumentException no matter which values were passed.
NotSupportedStrategy() - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.calculationstrategy.NotSupportedStrategy
 
NULL - Static variable in class de.hpi.fgis.dude.util.data.json.JsonNull
The Json null value.
NumberBasedSubkey - Class in de.hpi.fgis.dude.util.sorting.sortingkey
NumberBasedSubkey can be used for number-based sub-keys.
NumberBasedSubkey() - Constructor for class de.hpi.fgis.dude.util.sorting.sortingkey.NumberBasedSubkey
Internal constructor for Jsonable deserialization.
NumberBasedSubkey(String) - Constructor for class de.hpi.fgis.dude.util.sorting.sortingkey.NumberBasedSubkey
Initializes a NumberBasedSubkey instance that takes all digits within the value.
NumberBasedSubkey(String, int) - Constructor for class de.hpi.fgis.dude.util.sorting.sortingkey.NumberBasedSubkey
Initializes a NumberBasedSubkey instance that takes all digits within the value.
NumberBasedSubkey(String, int, int) - Constructor for class de.hpi.fgis.dude.util.sorting.sortingkey.NumberBasedSubkey
Initializes a NumberBasedSubkey instance that takes all digits within the value.
NumberBasedSubkey(String, Integer[]) - Constructor for class de.hpi.fgis.dude.util.sorting.sortingkey.NumberBasedSubkey
Initializes a NumberBasedSubkey instance that takes all digits within the value.

O

OBJECT_DATA_ATTRIBUTE_NAME - Static variable in class de.hpi.fgis.dude.util.data.DuDeObject
The attribute name of the object data within the DuDeObject's Json representation.
OBJECT_ID_ATTRIBUTE_NAME - Static variable in class de.hpi.fgis.dude.util.data.DuDeObjectId
The attribute name of the object id within the Json representation of this DuDeObject.
OBJECT_IDENTIFIER_ATTRIBUTE_NAME - Static variable in class de.hpi.fgis.dude.util.data.DuDeObject
The attribute name of the object identifier within the DuDeObject's Json representation.
of(Class<?>, BoundType...) - Static method in class de.hpi.fgis.dude.util.BoundType
Creates a BoundType around the given raw type with additional type parameters.
of(Class<?>, Class<?>, Class<?>...) - Static method in class de.hpi.fgis.dude.util.BoundType
Creates a BoundType around the given raw type with additional type parameters.
of(ParameterizedType) - Static method in class de.hpi.fgis.dude.util.BoundType
Creates a BoundType for the given ParameterizedType.
OracleDatabase - Class in de.hpi.fgis.dude.database.adapter
OracleDatabase encapsulates all the necessary information for establishing a connection to an Oracle database.
OracleDatabase(DBInfo) - Constructor for class de.hpi.fgis.dude.database.adapter.OracleDatabase
Initializes the OracleDatabase instance members and loads the settings provided by the parameter dbInfo.
OracleDatabase(InputStream) - Constructor for class de.hpi.fgis.dude.database.adapter.OracleDatabase
Initializes the OracleDatabase using the passed InputStream.
OracleDatabase(Properties) - Constructor for class de.hpi.fgis.dude.database.adapter.OracleDatabase
Initializes the OracleDatabase using the passed Properties.
OrderedPair<T extends Comparable<T>> - Class in de.hpi.fgis.dude.util
OrderedPair extends Pair in this way that both elements has to have the same type.
OrderedPair() - Constructor for class de.hpi.fgis.dude.util.OrderedPair
Initializes an empty pair.
OrderedPair(T, T) - Constructor for class de.hpi.fgis.dude.util.OrderedPair
Creates a OrderedPair instance with the passed objects.
outputSet() - Method in class de.hpi.fgis.dude.util.Experiment
Checks whether any output is set.
outputStream - Variable in class de.hpi.fgis.dude.output.AbstractDuDeOutput
The stream writer that is used for the output.
OverlapCoefficientFunction - Class in de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics
OverlapCoefficientFunction compares two DuDeObjects based on the Overlap Coefficient of the given attribute.
OverlapCoefficientFunction() - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.OverlapCoefficientFunction
Internal constructor for Jsonable deserialization.
OverlapCoefficientFunction(String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.OverlapCoefficientFunction
Initializes the OverlapCoefficientFunction with the default tokenizer.
OverlapCoefficientFunction(int, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.OverlapCoefficientFunction
Initializes the OverlapCoefficientFunction with the default tokenizer.
OverlapCoefficientFunction(InterfaceTokeniser, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.OverlapCoefficientFunction
Initializes the OverlapCoefficientFunction with the passed InterfaceTokeniser.
OverlapCoefficientFunction(InterfaceTokeniser, int, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.OverlapCoefficientFunction
Initializes the OverlapCoefficientFunction with the passed InterfaceTokeniser.

P

Pair<T1,T2> - Class in de.hpi.fgis.dude.util
A Pair is a container that stores two objects.
Pair() - Constructor for class de.hpi.fgis.dude.util.Pair
Initializes an empty pair.
Pair(T1, T2) - Constructor for class de.hpi.fgis.dude.util.Pair
Generates a pair of the given two elements.
pairCount - Variable in class de.hpi.fgis.dude.postprocessor.StatisticComponent
 
PARAM_NAME_DEFAULT_DIRECTORY - Static variable in class de.hpi.fgis.dude.util.GlobalConfig
The name of DuDe's default directory.
PARAM_NAME_DEFAULT_ENCODING - Static variable in class de.hpi.fgis.dude.util.GlobalConfig
The name of DuDe's default encoding.
PARAM_NAME_FORMATTED_JSON_OUTPUT - Static variable in class de.hpi.fgis.dude.util.GlobalConfig
Name of the parameter for specifying if the formatted Json shall be printed.
PARAM_NAME_MAXIMUM_MEMORY_USAGE - Static variable in class de.hpi.fgis.dude.util.GlobalConfig
Name of the parameter for specifying the maximum memory usage.
PARAM_NAME_MEMORY_CHECKER_STEP_SIZE - Static variable in class de.hpi.fgis.dude.util.GlobalConfig
Name of the parameter for specifying how many records can be loaded into memory when using the MemoryChecker.
PARAM_VALUE_DEFAULT_DIRECTORY - Static variable in class de.hpi.fgis.dude.util.GlobalConfig
The default directory where all files are stored during file-based processing.
PARAM_VALUE_DEFAULT_ENCODING - Static variable in class de.hpi.fgis.dude.util.GlobalConfig
The default encoding that is used internally.
PARAM_VALUE_FORMATTED_JSON_OUTPUT - Static variable in class de.hpi.fgis.dude.util.GlobalConfig
The default setting of the parameter that specifies if the Json written to the file is formatted or not.
PARAM_VALUE_MAXIMUM_MEMORY_USAGE - Static variable in class de.hpi.fgis.dude.util.GlobalConfig
The default maximum memory usage for sorting the data in percent.
PARAM_VALUE_MEMORY_CHECKER_STEP_SIZE - Static variable in class de.hpi.fgis.dude.util.GlobalConfig
The default setting of the parameter that specifies how many records can be loaded into memory when using the MemoryChecker.
parse(BibtexString, String) - Static method in class de.hpi.fgis.dude.util.bibtex.expander.BibtexPersonListParser
 
parse(BibtexFile, Reader) - Method in class de.hpi.fgis.dude.util.bibtex.parser.BibtexParser
Parses the input into BibtexParser.bibtexFile - don't forget to check BibtexParser.getExceptions() afterwards (if you don't use throwAllParseExceptions which you can configure in the constructor)...
ParseException - Exception in de.hpi.fgis.dude.util.bibtex.parser
This exception is raised if an error occurs while parsing using the BibtexParser.
ParseException(int, int, String, String) - Constructor for exception de.hpi.fgis.dude.util.bibtex.parser.ParseException
Initializes a ParseException.
PERSON_FIRST_NAME_ATTRIBUTE - Static variable in class de.hpi.fgis.dude.datasource.BibtexSource
The name of the attribute that stores the first name of a person.
PERSON_LAST_NAME_ATTRIBUTE - Static variable in class de.hpi.fgis.dude.datasource.BibtexSource
The name of the attribute that stores the last name of a person.
PERSON_LINEAGE_ATTRIBUTE - Static variable in class de.hpi.fgis.dude.datasource.BibtexSource
The name of the attribute that stores the lineage (like 'jr' or 'sr') of a person.
PERSON_PRE_LAST_NAME_ATTRIBUTE - Static variable in class de.hpi.fgis.dude.datasource.BibtexSource
The name of the attribute that stores the prefix of the person's last name.
PersonListExpander - Class in de.hpi.fgis.dude.util.bibtex.expander
This expander will convert author/editor field values into BibtexPersonList objects.
PersonListExpander(boolean, boolean) - Constructor for class de.hpi.fgis.dude.util.bibtex.expander.PersonListExpander
Equivalent to PersonListExpander(expandAuthors,expandEditors,true).
PersonListExpander(boolean, boolean, boolean) - Constructor for class de.hpi.fgis.dude.util.bibtex.expander.PersonListExpander
 
PersonListParserException - Exception in de.hpi.fgis.dude.util.bibtex.expander
 
PersonListParserException(String) - Constructor for exception de.hpi.fgis.dude.util.bibtex.expander.PersonListParserException
 
phi - Variable in class de.hpi.fgis.dude.algorithm.duplicatedetection.AdaptiveSNM_Yan2007.YanIterator
the distance threshold
PhoneNumberSimilarityFunction - Class in de.hpi.fgis.dude.similarityfunction.domainspecific.address
PhoneNumberSimilarityFunction compares two strings and treats them as phone numbers, allowing for some special normalization and comparison techniques.
PhoneNumberSimilarityFunction(String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.domainspecific.address.PhoneNumberSimilarityFunction
 
populateGraph(HashSet<WarshallTransitiveClosureGenerator.IntPair>) - Method in class de.hpi.fgis.dude.postprocessor.WarshallTransitiveClosureGenerator.GraphRepresentation
Populates the graph.
positions - Variable in class de.hpi.fgis.dude.util.sorting.sortingkey.TextBasedSubkey
Stores the position information.
PostGreSQLDatabase - Class in de.hpi.fgis.dude.database.adapter
PostGreSQLDatabase encapsulates all the necessary information for establishing a connection to a PostGreSQL database.
PostGreSQLDatabase(DBInfo) - Constructor for class de.hpi.fgis.dude.database.adapter.PostGreSQLDatabase
Initializes the PostGreSQLDatabase instance members and loads the settings provided by the parameter dbInfo.
PostGreSQLDatabase(InputStream) - Constructor for class de.hpi.fgis.dude.database.adapter.PostGreSQLDatabase
Initializes the PostGreSQLDatabase using the passed InputStream.
PostGreSQLDatabase(Properties) - Constructor for class de.hpi.fgis.dude.database.adapter.PostGreSQLDatabase
Initializes the PostGreSQLDatabase using the passed Properties.
preprocessData() - Method in class de.hpi.fgis.dude.algorithm.AbstractDuplicateDetection
Preprocesses the data.
preprocessData(Iterable<DataSource>) - Method in class de.hpi.fgis.dude.algorithm.AbstractRecordLinkage
Preprocesses the data.
preprocessData() - Method in class de.hpi.fgis.dude.algorithm.SortingDuplicateDetection
 
preprocessData(Iterable<DataSource>) - Method in class de.hpi.fgis.dude.algorithm.SortingRecordLinkage
 
Preprocessor - Interface in de.hpi.fgis.dude.preprocessor
Preprocessor is an interface that can be used for gathering statistics of the data within the extraction phase.
Primitive<T> - Class in de.hpi.fgis.dude.util.data.json.auto
Json serialization for basic types including primitive types and boxing classes as well as Strings.
Primitive(Class<T>) - Constructor for class de.hpi.fgis.dude.util.data.json.auto.Primitive
Initializes the primitive for given type
printBibtex(PrintWriter) - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexConcatenatedValue
 
printBibtex(PrintWriter) - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexEntry
 
printBibtex(PrintWriter) - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexFile
 
printBibtex(PrintWriter) - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexMacroDefinition
 
printBibtex(PrintWriter) - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexMacroReference
 
printBibtex(PrintWriter) - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexNode
Prints the node to the passed PrintWriter.
printBibtex(PrintWriter) - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexPerson
 
printBibtex(PrintWriter) - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexPersonList
 
printBibtex(PrintWriter) - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexPreamble
 
printBibtex(PrintWriter) - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexString
Prints the node to the passed PrintWriter.
printBibtex(PrintWriter) - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexToplevelComment
 
printDataEnabled() - Method in class de.hpi.fgis.dude.output.AbstractDuDeOutput
Checks whether printing the data is enable or not.
printFuzzyPair(DuDeObjectPair) - Method in class de.hpi.fgis.dude.util.Experiment
Writes the passed fuzzy DuDeObjectPair onto all added fuzzy DuDeOutputs.
printGraph() - Method in class de.hpi.fgis.dude.postprocessor.WarshallTransitiveClosureGenerator
Prints the current graph as a matrix on the standard output
printingCompleteIdentifierEnabled() - Method in class de.hpi.fgis.dude.output.CSVOutput
Checks whether printing the complete identifier is enabled.
printingDataEnabled() - Method in class de.hpi.fgis.dude.output.CSVOutput
Checks whether printing the data is enabled.
printPair(DuDeObjectPair) - Method in class de.hpi.fgis.dude.util.Experiment
Writes the passed DuDeObjectPair onto all added DuDeOutputs.
processGMD() - Method in class de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent
Processes the Generalized Merge Distance regarding the current configuration if it is activated.
processLine(String) - Method in class de.hpi.fgis.dude.util.GoldStandard
Returns a cluster of duplicates read from the given line
PseudoLexer - Class in de.hpi.fgis.dude.util.bibtex.parser
This is not a real lexer, since BibTex is such an insane format...
PseudoLexer(Reader) - Constructor for class de.hpi.fgis.dude.util.bibtex.parser.PseudoLexer
Initializes the PseudoLexer.
PseudoLexer.Token - Class in de.hpi.fgis.dude.util.bibtex.parser
The token class that is used within this lexer.
PseudoLexer.Token(int, String, int, int) - Constructor for class de.hpi.fgis.dude.util.bibtex.parser.PseudoLexer.Token
Creates a new Token.
put(String, JsonValue) - Method in class de.hpi.fgis.dude.util.data.json.JsonRecord
Puts the passed key and JsonValue into the JsonRecord.
put(String, boolean) - Method in class de.hpi.fgis.dude.util.data.json.JsonRecord
Puts the passed key and the corresponding JsonBoolean value into the JsonRecord.
put(String, int) - Method in class de.hpi.fgis.dude.util.data.json.JsonRecord
Puts the passed key and the corresponding JsonNumber value into the JsonRecord.
put(String, long) - Method in class de.hpi.fgis.dude.util.data.json.JsonRecord
Puts the passed key and the corresponding JsonNumber value into the JsonRecord.
put(String, double) - Method in class de.hpi.fgis.dude.util.data.json.JsonRecord
Puts the passed key and the corresponding JsonNumber value into the JsonRecord.
put(String, String) - Method in class de.hpi.fgis.dude.util.data.json.JsonRecord
Puts the passed key and the corresponding JsonString value into the JsonRecord.
putCollection(String, Collection<? extends JsonValue>) - Method in class de.hpi.fgis.dude.util.data.json.JsonRecord
Puts a JsonArray generated out of the passed Collection and its key to this JsonRecord.
putMap(String, Map<String, ? extends JsonValue>) - Method in class de.hpi.fgis.dude.util.data.json.JsonRecord
Puts a JsonRecord generated out of the passed Map and its key to this JsonRecord.
putNull(String) - Method in class de.hpi.fgis.dude.util.data.json.JsonRecord
Puts the passed key and JsonNull into the JsonRecord .

Q

quote(String) - Method in class de.hpi.fgis.dude.util.csv.CSVWriter
Quotes the passed String.

R

read(DuDeJsonParser<?>, Object) - Method in class de.hpi.fgis.dude.util.data.json.auto.ArrayJsonSerialization
 
read(DuDeJsonParser<?>) - Method in class de.hpi.fgis.dude.util.data.json.auto.AutoJsonSerialization
Creates an instance of the wrapped type and initializes the content with the json provided by the given DuDeJsonParser.
read(DuDeJsonParser<?>, Object) - Method in class de.hpi.fgis.dude.util.data.json.auto.AutoJsonSerialization
Creates an instance of the wrapped type and initializes the content with the json provided by the given DuDeJsonParser.
read(DuDeJsonParser<?>, Object) - Method in class de.hpi.fgis.dude.util.data.json.auto.CollectionJsonSerialization
 
read(DuDeJsonParser<?>, Object) - Method in class de.hpi.fgis.dude.util.data.json.auto.CompositeJsonSerialization
Creates an instance of the wrapped type and initializes the content with the json provided by the given DuDeJsonParser.
read(DuDeJsonParser<?>, Object) - Method in class de.hpi.fgis.dude.util.data.json.auto.EnumJsonSerialization
 
read(DuDeJsonParser<?>) - Method in class de.hpi.fgis.dude.util.data.json.auto.Primitive
Reads the next value from the parser and returns it.
readFields(DuDeJsonParser<?>, T) - Method in class de.hpi.fgis.dude.util.data.json.auto.AutoJsonSerialization
Reads all non-transient fields of the jsonable to the DuDeJsonParser.
readFields(DuDeJsonParser<?>, T) - Method in class de.hpi.fgis.dude.util.data.json.auto.CompositeJsonSerialization
 
readFields(DuDeJsonParser<?>, AutoJsonable) - Static method in class de.hpi.fgis.dude.util.data.json.JsonUtil
Reads all non-transient fields of the jsonable to the DuDeJsonParser.
readHeader() - Method in class de.hpi.fgis.dude.datasource.CSVSource
Checks whether interpreting the first line as header is enabled.
readSerializedType(DuDeJsonParser<?>, boolean, Class<?>) - Method in class de.hpi.fgis.dude.util.data.json.auto.AutoJsonSerialization
Parsing equivalent of AutoJsonSerialization.writeWithType(DuDeJsonGenerator, Object).
Since the type is not written in every case, this method also supports a non-destructive way for testing of type information.
If leaveRecordIntact is set and no type information has been detected, it appears as if the method has never been invoked.
However, if type information has been detected, it shall be removed from the stream and the parsed class shall be returned.
recordQueue - Variable in class de.hpi.fgis.dude.algorithm.duplicatedetection.AdaptiveSNM_Yan2007.YanIterator
Queue with already extracted records from the dataIterator that are not yet finally assigned to a block.
recordQueueIterator - Variable in class de.hpi.fgis.dude.algorithm.duplicatedetection.AdaptiveSNM_Yan2007.YanIterator
Iterator for the recordQueue
ReflectUtil - Class in de.hpi.fgis.dude.util
Convenience methods for reflective programming.
ReflectUtil() - Constructor for class de.hpi.fgis.dude.util.ReflectUtil
 
registerCleanable(Cleanable) - Method in class de.hpi.fgis.dude.util.AbstractCleanable
 
registerCleanable(Cleanable) - Method in interface de.hpi.fgis.dude.util.Cleanable
Registers a Cleanable instance.
registerCloseable(Closeable) - Method in class de.hpi.fgis.dude.util.AbstractCleanable
 
registerCloseable(Closeable) - Method in interface de.hpi.fgis.dude.util.Cleanable
Registers a Closeable instance.
registerStatement(Statement) - Method in class de.hpi.fgis.dude.database.DatabaseSource
Registers a Statement.
RelativeNumberDiffFunction - Class in de.hpi.fgis.dude.similarityfunction.contentbased.impl
This SimilarityFunction implementation checks the relative variation of the numbers of two DuDeObject attributes.
RelativeNumberDiffFunction() - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.RelativeNumberDiffFunction
Internal constructor for Jsonable deserialization.
RelativeNumberDiffFunction(String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.RelativeNumberDiffFunction
The constructor initializes the RelativeNumberDiffFunction.
RelativeNumberDiffFunction(int, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.RelativeNumberDiffFunction
The constructor initializes the RelativeNumberDiffFunction.
RelativeNumberDiffFunction(double, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.RelativeNumberDiffFunction
The constructor initializes the RelativeNumberDiffFunction.
RelativeNumberDiffFunction(double, int, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.RelativeNumberDiffFunction
The constructor initializes the RelativeNumberDiffFunction.
remove() - Method in class de.hpi.fgis.dude.algorithm.AbstractAlgorithm.AlgorithmIteratorWrapper
 
remove() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.DuplicateCountSNM.AdaptiveWindowSizeSNMIterator
 
remove() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.Lego.LegoIterator
This method is not implemented and will throw an UnsupportedOperationException.
remove() - Method in class de.hpi.fgis.dude.util.AbstractIterator
This method is not implemented and will throw an UnsupportedOperationException.
remove(String) - Method in class de.hpi.fgis.dude.util.DataSourceIdentifierManager
Removes the passed id from the set of already used identifiers.
removeEntry(BibtexAbstractEntry) - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexFile
Removes the passed entry from this file.
renameTo(String) - Method in class de.hpi.fgis.dude.util.data.storage.FileBasedStorage
Renames the underlying file.
reset() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.AdaptiveSNM_Yan2007
Resets the algorithm.
reset() - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Sets the attributes for TruePositives, FalsePositives, TruePositivesByComparison, FalsePositivesByComparison, TrueNegativesByComparison, FalseNegativesByComparison, PairCount and ComparisonCount to 0.
reset() - Method in class de.hpi.fgis.dude.postprocessor.WarshallTransitiveClosureGenerator
Resets the settings of the current WarshallClosureGenerator.
resetCallCount() - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.calculationstrategy.IgnoreStrategy
Resets the call count.
resetNotification() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.DuplicateCountSNM
Resets the last notification
resetNotification() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.Lego
Resets the last notification
resetOptionalColumns() - Method in class de.hpi.fgis.dude.output.CSVOutput
Resets the values of all optional columns using empty Strings.
resetOptionalStatisticEntries() - Method in class de.hpi.fgis.dude.output.statisticoutput.AbstractStatisticOutput
 
resetOptionalStatisticEntries() - Method in interface de.hpi.fgis.dude.output.statisticoutput.StatisticOutput
Resets the values of all optional labels using empty Strings.
resolveParamizedSuperclass(BoundType, ParameterizedType) - Static method in class de.hpi.fgis.dude.util.ReflectUtil
Resolves the ParameterizedType superclass for a given BoundType.
Example: class Pair<S, T> { }; class OrderedPair<X> extends Pair<X, X> {}.
If the actual type of X can be inferred for OrderedPair (hence BoundType), this method returns the actual bound for Pair.
resolveStandardMacro(String) - Static method in class de.hpi.fgis.dude.util.bibtex.data.BibtexStandardMacros
Returns the value corresponding to the passed standard macro name.
resolveType(BoundType, TypeVariable<?>) - Static method in class de.hpi.fgis.dude.util.ReflectUtil
Resolves the TypeVariable for a given BoundType.
Example: class Foo<T> { Collection<T> bar; }; class IntFoo extends Foo<Integer> {}.
The actual type of bar can be inferred for IntFoo and would result in a BoundType.of(Integer.class).
RestaurantExec - Class in de.hpi.fgis.dude.exec.duplicatedetection
This execution class runs the naive duplicate detection algorithm on the Restaurant data source.
RestaurantExec() - Constructor for class de.hpi.fgis.dude.exec.duplicatedetection.RestaurantExec
 
resultCluster - Variable in class de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent
Final ER result in cluster format.
retainUnresolvedMacroReferences() - Method in class de.hpi.fgis.dude.datasource.BibtexSource
Checks, whether unresolved macro references are retained.
rootIsSet() - Method in class de.hpi.fgis.dude.datasource.XMLSource
Checks whether a root element was set.
RSwoosh - Class in de.hpi.fgis.dude.algorithm.duplicatedetection
RSwoosh implements the RSwoosh duplicate detection (and merging) algorithm as described in the paper Swoosh: a generic approach for entity resolution.
RSwoosh() - Constructor for class de.hpi.fgis.dude.algorithm.duplicatedetection.RSwoosh
Initializes the RSwoosh algorithm with an instance of the DefaultMerger.
RSwoosh(Merger) - Constructor for class de.hpi.fgis.dude.algorithm.duplicatedetection.RSwoosh
Initializes the RSwoosh algorithm with the passed Merger.
RSwoosh.ComparisonResult - Enum in de.hpi.fgis.dude.algorithm.duplicatedetection
 
RSwooshExec - Class in de.hpi.fgis.dude.exec.duplicatedetection
This execution class runs the RSwoosh duplicate detection algorithm on the Restaurant data source.
RSwooshExec() - Constructor for class de.hpi.fgis.dude.exec.duplicatedetection.RSwooshExec
 
run() - Method in class de.hpi.fgis.dude.exec.duplicatedetection.AdaptiveSNMExec_Yan2007
Runs the example experiment for the algorithm AdaptiveSNME_Yan2007
run() - Method in class de.hpi.fgis.dude.exec.duplicatedetection.SortedBlocksExec
Runs the example experiment.
run() - Method in class de.hpi.fgis.dude.util.Experiment
Starts a run based on the previously configured thresholds.
run(double) - Method in class de.hpi.fgis.dude.util.Experiment
Starts a run based on the passed thresholds.
run(double, double) - Method in class de.hpi.fgis.dude.util.Experiment
Starts a run based on the passed thresholds.
run() - Method in class de.hpi.fgis.dude.util.MemoryCheckerTask
 

S

saveConfig(OutputStream) - Method in class de.hpi.fgis.dude.util.GlobalConfig
Persists the current configuration using the Properties format.
scan(char) - Method in class de.hpi.fgis.dude.util.bibtex.parser.PseudoLexer
 
scanAlternatives(char[], boolean) - Method in class de.hpi.fgis.dude.util.bibtex.parser.PseudoLexer
The return value is an index into alternatives.
scanBracketedString(StringBuilder, boolean) - Method in class de.hpi.fgis.dude.util.bibtex.parser.PseudoLexer
 
scanEntryTypeName() - Method in class de.hpi.fgis.dude.util.bibtex.parser.PseudoLexer
 
scanLiteral(char[], boolean, boolean) - Method in class de.hpi.fgis.dude.util.bibtex.parser.PseudoLexer
 
scanLiteral(char[], boolean, boolean, StringBuilder) - Method in class de.hpi.fgis.dude.util.bibtex.parser.PseudoLexer
The return value is an index into the termination set the result is appended in the resultTargetBuffer
scanQuotedString() - Method in class de.hpi.fgis.dude.util.bibtex.parser.PseudoLexer
 
scanTopLevelCommentOrAtOrEOF() - Method in class de.hpi.fgis.dude.util.bibtex.parser.PseudoLexer
If it's a top-level comment, result.choice will be 0, for @ 1, for EOF 2.
Schema - Class in de.hpi.fgis.dude.database.util
The Schema encapsulates all the information concerning a database table schema.
Schema(Collection<ColumnInfo>) - Constructor for class de.hpi.fgis.dude.database.util.Schema
Initializes a Schema using a given collection of ColumnInfo instances.
Schema(Connection, String, String) - Constructor for class de.hpi.fgis.dude.database.util.Schema
Initializes the Schema out of the passed table.
searchFor(String) - Method in class de.hpi.fgis.dude.util.data.json.JsonRecord
Returns the first occurrence of the passed attribute (using a recursive look-up).
SECOND_ELEMENT_ATTRIBUTE_NAME - Static variable in class de.hpi.fgis.dude.util.data.DuDeObjectPair
The second element's Json attribute name.
SECOND_ELEMENTS_DEFAULT_OBJECT_ID_ATTRIBUTE - Static variable in class de.hpi.fgis.dude.util.GoldStandard
The default attribute for extracting the second element's object id.
SECOND_OBJECT_ATTRIBUTE - Static variable in class de.hpi.fgis.dude.output.JsonOutput
The attribute name of the second object within each pair.
set(int, int) - Method in class de.hpi.fgis.dude.postprocessor.WarshallTransitiveClosureGenerator.AdjacencyList
 
set(int, int) - Method in class de.hpi.fgis.dude.postprocessor.WarshallTransitiveClosureGenerator.AdjacencyMatrix
 
set(int, int) - Method in class de.hpi.fgis.dude.postprocessor.WarshallTransitiveClosureGenerator.GraphRepresentation
Sets elements (i, j) in the matrix to true.
setAlgorithm(Algorithm) - Method in class de.hpi.fgis.dude.util.Experiment
Sets the algorithm of this Experiment.
setAttribute(DataSource, String...) - Method in class de.hpi.fgis.dude.util.sorting.sortingkey.AbstractSubkey
Sets the attribute for the passed DataSource.
setAttributeValue(JsonRecord, String, JsonValue) - Method in class de.hpi.fgis.dude.datasource.AbstractDataSource.AbstractDataSourceIterator
Sets a JsonValue instance to an attribute of the passed JsonRecord.
setBeginningTime() - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Deprecated. Replaced by StatisticComponent.setStartTime()
setCharBlockKey(int) - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.SortedBlocks
Set the number of characters of the sorting key that are used for defining the blocks.
setCheckMemory(boolean) - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Sets the boolean flag that indicates the activation status of memory checking.
setColumnCount(int) - Method in class de.hpi.fgis.dude.util.csv.CSVReader
Sets the column count.
setCommandLineArguments(String[]) - Method in class de.hpi.fgis.dude.util.GlobalConfig
Sets the global configuration based on the command-line arguments.
setCompareArrayArrayStrategy(CalculationStrategy<JsonArray, JsonArray>) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.ContentBasedSimilarityFunction
Sets a new strategy for comparing JsonArrays.
setCompareArrayAtomicStrategy(CalculationStrategy<JsonArray, JsonAtomic>) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.ContentBasedSimilarityFunction
Sets a new strategy for comparing JsonArrays and atomic values.
setCompareArrayRecordStrategy(CalculationStrategy<JsonArray, JsonRecord>) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.ContentBasedSimilarityFunction
Sets a new strategy for comparing JsonArrays and JsonRecords.
setCompareRecordAtomicStrategy(CalculationStrategy<JsonRecord, JsonAtomic>) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.ContentBasedSimilarityFunction
Sets a new strategy for comparing JsonRecords and atomic values.
setCompareRecordRecordStrategy(CalculationStrategy<JsonRecord, JsonRecord>) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.ContentBasedSimilarityFunction
Sets a new strategy for comparing JsonRecords.
setConfig(ExtendedStatisticComponent.Config) - Method in class de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent
Sets the cost configuration to use for GMD processing
setContent(BibtexAbstractValue) - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexPreamble
Sets the content.
setContent(String) - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexString
Sets the content.
setContent(String) - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexToplevelComment
Sets the content.
setCoreERAlgorithm(Algorithm) - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.Lego
Sets the entity resolution algorithm that is used to process blocks internally
setCrossProductStrategy(ContentBasedSimilarityFunction<?>) - Static method in class de.hpi.fgis.dude.algorithm.duplicatedetection.GSwoosh
Sets the strategy for comparing arrays of the passed ContentBasedSimilarityFunction to the CrossProductStrategy.
setCrossProductStrategy(ContentBasedSimilarityFunction<?>) - Static method in class de.hpi.fgis.dude.algorithm.duplicatedetection.RSwoosh
Sets the strategy for comparing array of the passed ContentBasedSimilarityFunction to the CrossProductStrategy.
setDatabaseName(String) - Method in class de.hpi.fgis.dude.database.util.DBInfo
Sets a new database.
setdCostFunc(AbstractSubstitutionCost) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.util.SmithWatermanDistance
sets the d(i,j) cost function used.
setDefaultAttribute(String...) - Method in class de.hpi.fgis.dude.util.sorting.sortingkey.AbstractSubkey
Sets the default attribute.
setDefaultEncoding(String) - Method in class de.hpi.fgis.dude.util.GlobalConfig
Sets DuDe's default encoding for file-based data-processing.
setDefaultSortingKey(SortingKey) - Method in class de.hpi.fgis.dude.algorithm.SortingRecordLinkage
Sets the default SortingKey.
setDuplicateInfo(DuDeObjectPair.DuplicateType) - Method in class de.hpi.fgis.dude.util.data.DuDeObjectPair
Sets the is-duplicate property.
setElements(T, T) - Method in class de.hpi.fgis.dude.util.OrderedPair
Resets the pair's elements.
setEndTime() - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Sets current time as finishing time for the runtime.
setEntryKey(String) - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexEntry
Sets the entryKey.
setEntryType(String) - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexEntry
Sets the entryType.
setEscapeCharacter(char) - Method in class de.hpi.fgis.dude.output.CSVOutput
Sets the escape character.
setEscapeCharacter(char) - Method in class de.hpi.fgis.dude.util.csv.CSVReader
Sets the escape character.
setEscapeCharacter(char) - Method in class de.hpi.fgis.dude.util.csv.CSVWriter
Sets the escape character.
setExtractedData() - Method in class de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent
Sets the size of the extracted data.
setExtractedData(Vector<DuDeObject>) - Method in class de.hpi.fgis.dude.util.GoldStandard
Sets the amount of records extracted (retrieved from Algorithm-Object).
setExtractedRecordCount(int) - Method in class de.hpi.fgis.dude.datasource.AbstractDataSource
Resets the extracted-record count, if the passed count is larger than the current one.
setField(String, BibtexAbstractValue) - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexEntry
Sets a field within this entry.
setFilename(String) - Method in class de.hpi.fgis.dude.util.GoldStandard
Sets the name of the file that contains the gold standard in cluster format.
setFinishingTime() - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Deprecated. Replaced by StatisticComponent.setEndTime()
setFirst(String) - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexPerson
Sets the first.
setFirstElement(T) - Method in class de.hpi.fgis.dude.util.OrderedPair
 
setFirstElement(T1) - Method in class de.hpi.fgis.dude.util.Pair
Sets the first element.
setFirstElementsObjectIdAttributes(String...) - Method in class de.hpi.fgis.dude.util.GoldStandard
Sets the attributes that store the object id of the pair's first element.
setFirstElementsSourceId(String) - Method in class de.hpi.fgis.dude.util.GoldStandard
Sets the source id of the pair's first element.
setFirstElementsSourceIdLiteral(String) - Method in class de.hpi.fgis.dude.util.GoldStandard
Sets the actual source id of the pair's first element.
setFixBlockSize(int) - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.SortedBlocks
Set the new fix block size.
setGapCost(float) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.util.SmithWatermanDistance
sets the gap cost for the distance function.
setGoldStandard(GoldStandard) - Method in class de.hpi.fgis.dude.util.Experiment
Sets the gold standard loader of this Experiment.
setHost(String) - Method in class de.hpi.fgis.dude.database.util.DBInfo
Sets the host of the currently used database system.
setIgnoredCharactersRegEx(String) - Method in class de.hpi.fgis.dude.util.sorting.sortingkey.TextBasedSubkey
Sets the regular expression which specifies the characters that will be ignored.
setIncreaseFactor(float) - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.DuplicateCountSNM
Sets the increase factor.
setIncreaseThreshold(float) - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.DuplicateCountSNM
Sets the increase threshold.
setInMemoryObjectThreshold(int) - Method in class de.hpi.fgis.dude.util.GlobalConfig
Sets the threshold for storing DuDeObjects in memory, if file-based processing is enabled.
setInMemoryObjectThreshold(String) - Method in class de.hpi.fgis.dude.util.GlobalConfig
Sets the in-memory object threshold based on the passed String.
setKey(String) - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexMacroDefinition
Sets the key.
setKey(String) - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexMacroReference
Sets the key.
setLast(String) - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexPerson
Sets the last.
setLeft(BibtexAbstractValue) - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexConcatenatedValue
Sets the left.
setLineage(String) - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexPerson
Sets the lineage.
setLineage(DuDeObjectPair.GeneratedBy) - Method in class de.hpi.fgis.dude.util.data.DuDeObjectPair
Sets the lineage property.
setLowerThreshold(double) - Method in class de.hpi.fgis.dude.util.Experiment
Sets the lower threshold for this experiment.
setMaxBlockSize(int) - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.SortedBlocks
Set the new maxmimum block size.
setMaximumMemoryUsage(double) - Method in class de.hpi.fgis.dude.util.GlobalConfig
Sets the maximum relative memory that is used for sorting the data.
setMaximumMemoryUsage(String) - Method in class de.hpi.fgis.dude.util.GlobalConfig
Sets the maximum relative memory usage based on the passed String.
setMaximumMemoryUsage(double) - Method in class de.hpi.fgis.dude.util.MemoryChecker
Sets the maximum memory usage in percent.
setMemoryCheckFrequency(long) - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Gets the frequency of memory checks.
setMerger(Merger) - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.Lego
Sets the merger that merges several DuDeObjects into one
setNotification(GSwoosh.ComparisonResult) - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.GSwoosh
Notifies the GSwoosh algorithm of the result of the last comparison.
setNotification(RSwoosh.ComparisonResult) - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.RSwoosh
Notifies the RSwoosh algorithm of the result of the last comparison.
setNrCharForBlocking(int) - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.NaiveBlockingAlgorithm
Set the number of characters of the sorting key that are used as blocking criterion.
setNumBaseRecords(int) - Method in enum de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent.Config
Setter for total number of base records, needed for computation of VI.
setOptionalColumn(String) - Method in class de.hpi.fgis.dude.output.CSVOutput
Sets a new optional column with no value.
setOptionalColumn(String, String) - Method in class de.hpi.fgis.dude.output.CSVOutput
Sets a new optional column with the passed value.
setOptionalStatisticEntry(String, String) - Method in class de.hpi.fgis.dude.output.statisticoutput.AbstractStatisticOutput
 
setOptionalStatisticEntry(String) - Method in class de.hpi.fgis.dude.output.statisticoutput.AbstractStatisticOutput
 
setOptionalStatisticEntry(String, String) - Method in interface de.hpi.fgis.dude.output.statisticoutput.StatisticOutput
Sets a new optional label with the passed value.
setOptionalStatisticEntry(String) - Method in interface de.hpi.fgis.dude.output.statisticoutput.StatisticOutput
Sets a new optional label with no value.
setOthers(boolean) - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexPerson
Sets the isAndOthers.
setPassword(String) - Method in class de.hpi.fgis.dude.database.util.DBInfo
Sets the password which is used for establishing the current database connection.
setPort(int) - Method in class de.hpi.fgis.dude.database.util.DBInfo
Sets the port of the currently used database system.
setPort(String) - Method in class de.hpi.fgis.dude.database.util.DBInfo
Sets the port of the currently used database system.
setPositions(Integer...) - Method in class de.hpi.fgis.dude.util.sorting.sortingkey.TextBasedSubkey
Sets the positions that shall be taken into account explicitly.
setPreLast(String) - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexPerson
Sets the preLast.
setProperty(String, Object) - Method in class de.hpi.fgis.dude.util.OrderedPair
Sets the passed property.
setQuoteCharacter(char) - Method in class de.hpi.fgis.dude.datasource.CSVSource
Sets the quote character.
setQuoteCharacter(char) - Method in class de.hpi.fgis.dude.output.CSVOutput
Sets the quote character.
setQuoteCharacter(char) - Method in class de.hpi.fgis.dude.util.csv.CSVReader
Sets the quote character.
setQuoteCharacter(char) - Method in class de.hpi.fgis.dude.util.csv.CSVWriter
Sets the quote character.
setRange(int) - Method in class de.hpi.fgis.dude.util.sorting.sortingkey.TextBasedSubkey
Sets the range that shall be taken into account.
setRange(int, int) - Method in class de.hpi.fgis.dude.util.sorting.sortingkey.TextBasedSubkey
Sets the range that shall be taken into account.
setRight(BibtexAbstractValue) - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexConcatenatedValue
Sets the right.
setSecondElement(T) - Method in class de.hpi.fgis.dude.util.OrderedPair
 
setSecondElement(T2) - Method in class de.hpi.fgis.dude.util.Pair
Sets the second element.
setSecondElementsObjectIdAttributes(String...) - Method in class de.hpi.fgis.dude.util.GoldStandard
Sets the attributes that store the object id of the pair's second element.
setSecondElementsSourceId(String) - Method in class de.hpi.fgis.dude.util.GoldStandard
Sets the source id of the pair's second element.
setSecondElementsSourceIdLiteral(String) - Method in class de.hpi.fgis.dude.util.GoldStandard
Sets the actual source id of the pair's second element.
setSeparator(char) - Method in class de.hpi.fgis.dude.output.CSVOutput
Sets the separator character.
setSeparator(char) - Method in class de.hpi.fgis.dude.util.csv.CSVReader
Sets the separator character.
setSeparator(char) - Method in class de.hpi.fgis.dude.util.csv.CSVWriter
Sets the separator character.
setSeparatorCharacter(char) - Method in class de.hpi.fgis.dude.datasource.CSVSource
Sets the separator character.
setSimilarity(double) - Method in class de.hpi.fgis.dude.util.data.DuDeObjectPair
Sets the similarity of this DuDeObjectPair.
setSimilarityFunction(SimilarityFunction) - Method in class de.hpi.fgis.dude.util.Experiment
Sets the internally used SimilarityFunction.
setSizeOverlap(int) - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.SortedBlocks
Sets the new overlap size.
setSortingKey(SortingKey) - Method in class de.hpi.fgis.dude.algorithm.SortingDuplicateDetection
Sets the SortingKey.
setSortingKey(SortingKey) - Method in class de.hpi.fgis.dude.util.sorting.sorter.AbstractDuDeObjectSorter
 
setSortingKey(SortingKey) - Method in interface de.hpi.fgis.dude.util.sorting.sorter.DuDeObjectSorter
Sets a new SortingKey.
setSortingKeys(Set<SortingKey>) - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.Lego
Defines the blocking criteria that shall be used for blocking
setSourceIdLiteral(String) - Method in class de.hpi.fgis.dude.util.GoldStandard
Sets the actual source id of both elements.
setSplitToken(String) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.impl.TFIDFSimilarityFunction
Sets the split token.
setSQLSchema(String) - Method in class de.hpi.fgis.dude.database.util.DBInfo
Set a new sqlSchema.
setStartTime() - Method in class de.hpi.fgis.dude.postprocessor.StatisticComponent
Sets the current time as starting time for the runtime and initiates memory monitoring.
setStatistics(StatisticComponent) - Method in class de.hpi.fgis.dude.output.statisticoutput.AbstractStatisticOutput
 
setStatistics(StatisticComponent) - Method in interface de.hpi.fgis.dude.output.statisticoutput.StatisticOutput
Sets the current statistic component that is used by the output.
setThreshold(float) - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.AdaptiveSNM_Yan2007
Set the threshold.
setThreshold(double) - Method in class de.hpi.fgis.dude.util.Experiment
Sets the threshold for this experiment.
setThresholds(double, double) - Method in class de.hpi.fgis.dude.util.Experiment
Sets the thresholds for this experiment.
setUpperThreshold(double) - Method in class de.hpi.fgis.dude.util.Experiment
Sets the thresholds for this experiment.
setUser(String) - Method in class de.hpi.fgis.dude.database.util.DBInfo
Sets the user name which is used for establishing the current database connection.
setValidationState(SimilarityFunction.SimilarityValidationState) - Method in class de.hpi.fgis.dude.similarityfunction.AbstractSimilarityFunction
Sets the validation state.
setValue(BibtexAbstractValue) - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexMacroDefinition
Sets the value.
setWhereFilter(String) - Method in class de.hpi.fgis.dude.database.DatabaseSource
Enables the where clause with the passed query extension.
setWindowSize(int) - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.DuplicateCountSNM
Sets the window Size.
setWindowSize(int) - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.SortedNeighborhoodMethod
Sets the window size.
setWorkingDirectory(String) - Method in class de.hpi.fgis.dude.util.GlobalConfig
Sets DuDe's default directory for file-based data-processing.
SIMILARITY_ATTRIBUTE - Static variable in class de.hpi.fgis.dude.output.JsonOutput
The attribute name of the similarity that belongs to each pair
SIMILARITY_PROPERTY - Static variable in class de.hpi.fgis.dude.util.data.DuDeObjectPair
The name of the similarity property.
SimilarityFunction - Interface in de.hpi.fgis.dude.similarityfunction
SimilarityFunction is used to determine the similarity of two DuDeObject's.
SimilarityFunction.SimilarityValidationState - Enum in de.hpi.fgis.dude.similarityfunction
SimilarityValidationState is a descriptor whether two values could be used for similarity calculation or not.
similarityFunctionSet() - Method in class de.hpi.fgis.dude.util.Experiment
Checks whether a SimilarityFunction was set.
SimmetricsFunction<T extends SimmetricsFunction<T,M>,M extends uk.ac.shef.wit.simmetrics.similaritymetrics.InterfaceStringMetric> - Class in de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics
SimmetricsFunction is a skeleton class providing the common functionality of all Simmetric similarity functions.
SimmetricsFunction() - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.SimmetricsFunction
Internal constructor for Jsonable deserialization.
SimmetricsFunction(M, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.SimmetricsFunction
Initializes the SimmetricsFunction with the passed metric and the default values.
SimmetricsFunction(M, int, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.SimmetricsFunction
Initializes the SimmetricsFunction with the passed metric and the default values.
SimpleStatisticOutput - Class in de.hpi.fgis.dude.output.statisticoutput
SimpleStatisticOutput prints the statistics in a simple, formatted fashion.
SimpleStatisticOutput() - Constructor for class de.hpi.fgis.dude.output.statisticoutput.SimpleStatisticOutput
Internal constructor for Jsonable deserialization.
SimpleStatisticOutput(File) - Constructor for class de.hpi.fgis.dude.output.statisticoutput.SimpleStatisticOutput
Initializes a SimpleStatisticOutput with no statistics.
SimpleStatisticOutput(File, StatisticComponent) - Constructor for class de.hpi.fgis.dude.output.statisticoutput.SimpleStatisticOutput
Initializes the SimpleStatisticOutput.
SimpleStatisticOutput(OutputStream) - Constructor for class de.hpi.fgis.dude.output.statisticoutput.SimpleStatisticOutput
Initializes a SimpleStatisticOutput with no statistics.
SimpleStatisticOutput(OutputStream, StatisticComponent) - Constructor for class de.hpi.fgis.dude.output.statisticoutput.SimpleStatisticOutput
Initializes the SimpleStatisticOutput.
SimpleTextOutput - Class in de.hpi.fgis.dude.output
SimpleTextOutput writes the passed DuDeObject pair to an OutputStream line by line.
SimpleTextOutput(OutputStream) - Constructor for class de.hpi.fgis.dude.output.SimpleTextOutput
Initializes a new SimpleTextOutput with the passed parameters.
SimpleTextOutput(OutputStream, String) - Constructor for class de.hpi.fgis.dude.output.SimpleTextOutput
Initializes a new SimpleTextOutput with the passed parameters.
SimpleTextOutput(OutputStream, String, String) - Constructor for class de.hpi.fgis.dude.output.SimpleTextOutput
Initializes a new SimpleTextOutput with the passed parameters.
SimpleTextOutput(File) - Constructor for class de.hpi.fgis.dude.output.SimpleTextOutput
Initializes a new SimpleTextOutput with the passed parameters.
SimpleTextOutput(File, String) - Constructor for class de.hpi.fgis.dude.output.SimpleTextOutput
Initializes a new SimpleTextOutput with the passed parameters.
SimpleTextOutput(File, String, String) - Constructor for class de.hpi.fgis.dude.output.SimpleTextOutput
Initializes a new SimpleTextOutput with the passed parameters.
SimpleTextOutput() - Constructor for class de.hpi.fgis.dude.output.SimpleTextOutput
Internal constructor for Jsonable deserialization.
size() - Method in class de.hpi.fgis.dude.database.util.Schema
Returns the schema's column count.
size() - Method in class de.hpi.fgis.dude.postprocessor.NaiveTransitiveClosureGenerator.TransitiveClosure
 
size() - Method in class de.hpi.fgis.dude.util.data.json.JsonBoolean
Returns 1 since JsonBoolean is an atomic value.
size() - Method in class de.hpi.fgis.dude.util.data.json.JsonNull
Returns 0 since JsonRecord.JsonNull does not have any value.
size() - Method in class de.hpi.fgis.dude.util.data.json.JsonNumber
Returns 1 since JsonNumber is an atomic value.
size() - Method in class de.hpi.fgis.dude.util.data.json.JsonString
Returns 1 since JsonString is an atomic value.
size() - Method in interface de.hpi.fgis.dude.util.data.json.JsonValue
The size of the current instance.
size() - Method in class de.hpi.fgis.dude.util.data.storage.FileBasedStorage
 
size() - Method in class de.hpi.fgis.dude.util.data.storage.InMemoryStorage
 
size() - Method in class de.hpi.fgis.dude.util.data.storage.InputStreamReadable
 
size() - Method in interface de.hpi.fgis.dude.util.data.storage.JsonReadable
Returns the number of instances that are contained.
size() - Method in class de.hpi.fgis.dude.util.GoldStandard
Returns the size of the gold standard.
skipFieldName(String) - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonParser
Safely skips the current token if it is a field name with the specified value.
skipToken() - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonParser
Skips the current token.
skipToken(JsonToken) - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonParser
Safely skips the current token if it is of the given type.
skipWhiteSpace() - Method in class de.hpi.fgis.dude.util.bibtex.expander.BibtexPersonListParser.StringIterator
 
skipWhitespace() - Method in class de.hpi.fgis.dude.util.bibtex.parser.PseudoLexer
 
SmithWatermanDistance - Class in de.hpi.fgis.dude.similarityfunction.contentbased.util
SmithWatermanDistance implements the Smith-Waterman distance.
SmithWatermanDistance() - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.util.SmithWatermanDistance
constructor - default (empty).
SmithWatermanDistance(float) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.util.SmithWatermanDistance
constructor.
SmithWatermanDistance(float, AbstractSubstitutionCost) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.util.SmithWatermanDistance
constructor.
SmithWatermanDistance(AbstractSubstitutionCost) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.util.SmithWatermanDistance
constructor.
SmithWatermanFunction - Class in de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics
SmithWatermanFunction compares two DuDeObjects based on the Smith Waterman Distance of the given attribute.
SmithWatermanFunction() - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.SmithWatermanFunction
Internal constructor for Jsonable deserialization.
SmithWatermanFunction(String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.SmithWatermanFunction
Initializes the SmithWatermanFunction.
SmithWatermanFunction(int, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.SmithWatermanFunction
Initializes the SmithWatermanFunction.
SmithWatermanFunction(float, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.SmithWatermanFunction
Initializes the SmithWatermanFunction.
SmithWatermanFunction(float, int, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.SmithWatermanFunction
Initializes the SmithWatermanFunction.
SmithWatermanFunction(AbstractSubstitutionCost, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.SmithWatermanFunction
Initializes the SmithWatermanFunction.
SmithWatermanFunction(AbstractSubstitutionCost, int, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.SmithWatermanFunction
Initializes the SmithWatermanFunction.
SmithWatermanFunction(float, AbstractSubstitutionCost, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.SmithWatermanFunction
Initializes the SmithWatermanFunction.
SmithWatermanFunction(float, AbstractSubstitutionCost, int, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.SmithWatermanFunction
Initializes the SmithWatermanFunction.
SmithWatermanGotohFunction - Class in de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics
SmithWatermanGotohFunction compares two DuDeObjects based on the Smith Waterman Gotoh Distance of the given attribute.
SmithWatermanGotohFunction() - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.SmithWatermanGotohFunction
Internal constructor for Jsonable deserialization.
SmithWatermanGotohFunction(String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.SmithWatermanGotohFunction
Initializes the SmithWatermanGotohFunction.
SmithWatermanGotohFunction(int, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.SmithWatermanGotohFunction
Initializes the SmithWatermanGotohFunction.
SmithWatermanGotohFunction(AbstractAffineGapCost, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.SmithWatermanGotohFunction
Initializes the SmithWatermanGotohFunction.
SmithWatermanGotohFunction(AbstractAffineGapCost, int, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.SmithWatermanGotohFunction
Initializes the SmithWatermanGotohFunction.
SmithWatermanGotohFunction(AbstractSubstitutionCost, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.SmithWatermanGotohFunction
Initializes the SmithWatermanGotohFunction.
SmithWatermanGotohFunction(AbstractSubstitutionCost, int, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.SmithWatermanGotohFunction
Initializes the SmithWatermanGotohFunction.
SmithWatermanGotohFunction(AbstractAffineGapCost, AbstractSubstitutionCost, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.SmithWatermanGotohFunction
Initializes the SmithWatermanGotohFunction.
SmithWatermanGotohFunction(AbstractAffineGapCost, AbstractSubstitutionCost, int, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.simmetrics.SmithWatermanGotohFunction
Initializes the SmithWatermanGotohFunction.
SortedBlocks - Class in de.hpi.fgis.dude.algorithm.duplicatedetection
SortedBlocks combines blocking and the SNM method.
SortedBlocks(SortedBlocks.AlgorithmVariant, SortingKey, int) - Constructor for class de.hpi.fgis.dude.algorithm.duplicatedetection.SortedBlocks
Initializes a SortedBlocks instance using fixed size blocks with the passed windows size.
SortedBlocks.AlgorithmVariant - Enum in de.hpi.fgis.dude.algorithm.duplicatedetection
This enumeration collects the possible SortedBlocks variants.
SortedBlocksExec - Class in de.hpi.fgis.dude.exec.duplicatedetection
Example experiment for the SortedBlocks algorithm.
SortedBlocksExec() - Constructor for class de.hpi.fgis.dude.exec.duplicatedetection.SortedBlocksExec
 
SortedDataFile - Class in de.hpi.fgis.dude.util.sorting.sorter
SortedDataFile encapsulates the functionality that is needed for the TwoPhaseMultiWayMergeSorter in phase two.
SortedDataFile(FileBasedStorage<DuDeObject>) - Constructor for class de.hpi.fgis.dude.util.sorting.sorter.SortedDataFile
Initializes a SortedDataFile and loads the first element.
SortedNeighborhoodMethod - Class in de.hpi.fgis.dude.algorithm.duplicatedetection
SortedNeighborhoodMethod is a simple Sorted-Neighborhood Method implementation without allowing multiple runs.
SortedNeighborhoodMethod() - Constructor for class de.hpi.fgis.dude.algorithm.duplicatedetection.SortedNeighborhoodMethod
For serialization
SortedNeighborhoodMethod(SortingKey) - Constructor for class de.hpi.fgis.dude.algorithm.duplicatedetection.SortedNeighborhoodMethod
Initializes a SortedNeighborhoodMethod instance with the passed SortingKey the SortedNeighborhoodMethod.DEFAULT_WINDOW_SIZE.
SortedNeighborhoodMethod(SortingKey, int) - Constructor for class de.hpi.fgis.dude.algorithm.duplicatedetection.SortedNeighborhoodMethod
Initializes a SortedNeighborhoodMethod instance with the passed SortingKey and a window size.
SortedNeighborhoodMethod.SortedNeighborhoodMethodIterator - Class in de.hpi.fgis.dude.algorithm.duplicatedetection
SortedNeighborhoodMethod.SortedNeighborhoodMethodIterator implements the behavior of a simple Sorted-Neighborhood-Method implementation.
SortedNeighborhoodMethod.SortedNeighborhoodMethodIterator(int, Iterator<DuDeObject>) - Constructor for class de.hpi.fgis.dude.algorithm.duplicatedetection.SortedNeighborhoodMethod.SortedNeighborhoodMethodIterator
Initializes a SNM iterator with the given window size and data iterator.
SortedNeighborhoodMethodExec - Class in de.hpi.fgis.dude.exec.duplicatedetection
The execution class for the SortedNeighborhoodMethod algorithm.
SortedNeighborhoodMethodExec() - Constructor for class de.hpi.fgis.dude.exec.duplicatedetection.SortedNeighborhoodMethodExec
 
SortingDuplicateDetection - Class in de.hpi.fgis.dude.algorithm
SortingDuplicateDetection implements the preprocessing phase were the data is sorted based on a given SortingKey.
SortingDuplicateDetection() - Constructor for class de.hpi.fgis.dude.algorithm.SortingDuplicateDetection
For serialization
SortingDuplicateDetection(SortingKey) - Constructor for class de.hpi.fgis.dude.algorithm.SortingDuplicateDetection
Initializes the SortingDuplicateDetection with the passed SortingKey.
SortingKey - Class in de.hpi.fgis.dude.util.sorting.sortingkey
The SortingKey collects different sub-keys and compares different DuDeObject based on these sub-keys.
SortingKey() - Constructor for class de.hpi.fgis.dude.util.sorting.sortingkey.SortingKey
Initializes a SortingKey with no sub-key(s).
SortingKey(Subkey...) - Constructor for class de.hpi.fgis.dude.util.sorting.sortingkey.SortingKey
Initializes a SortingKey instance.
sortingKey_iterator - Variable in class de.hpi.fgis.dude.algorithm.duplicatedetection.AdaptiveSNM_Yan2007.YanIterator
Sorting key for the records
SortingRecordLinkage - Class in de.hpi.fgis.dude.algorithm
SortingRecordLinkage implements the preprocessing phase were the data is sorted based on one or several SortingKeys.
SortingRecordLinkage() - Constructor for class de.hpi.fgis.dude.algorithm.SortingRecordLinkage
Initializes the SortingRecordLinkage with no default SortingKey.
SortingRecordLinkage(SortingKey) - Constructor for class de.hpi.fgis.dude.algorithm.SortingRecordLinkage
Initializes the SortingRecordLinkage with the passed default SortingKey.
SoundEx - Class in de.hpi.fgis.dude.similarityfunction.contentbased.util
SoundEx implements a phonetic algorithm for indexing names by sound.
SoundEx() - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.util.SoundEx
 
SoundExFunction - Class in de.hpi.fgis.dude.similarityfunction.contentbased.impl
SoundExFunction compares two DuDeObjects based on the SoundEx values of the given attribute.
SoundExFunction() - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.SoundExFunction
Internal constructor for Jsonable deserialization.
SoundExFunction(String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.SoundExFunction
Initializes a SoundExFunction with the passed default attribute.
SoundExFunction(int, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.SoundExFunction
Initializes a SoundExFunction with the passed default attribute.
SOURCE_ID_ATTRIBUTE_NAME - Static variable in class de.hpi.fgis.dude.util.data.DuDeObjectId
The attribute name of the source id within the Json representation of this DuDeObject.
sourceIdMappingEnabled() - Method in class de.hpi.fgis.dude.util.GoldStandard
Checks whether source-id mapping is enabled.
StableMarriageStrategy - Class in de.hpi.fgis.dude.similarityfunction.contentbased.calculationstrategy
StableMarriageStrategy implements the Stable-Marriage algorithm.
StableMarriageStrategy() - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.calculationstrategy.StableMarriageStrategy
 
startDate - Variable in class de.hpi.fgis.dude.postprocessor.StatisticComponent
 
startTime - Variable in class de.hpi.fgis.dude.postprocessor.StatisticComponent
 
StatisticComponent - Class in de.hpi.fgis.dude.postprocessor
StatisticComponent provides functionality for gathering statistics concerning the recall, precision and f-measure.
StatisticComponent(Algorithm) - Constructor for class de.hpi.fgis.dude.postprocessor.StatisticComponent
Initializes a StatisticComponent with no gold standard.
StatisticComponent(GoldStandard, Algorithm) - Constructor for class de.hpi.fgis.dude.postprocessor.StatisticComponent
Initializes a StatisticComponent using the passed DuDeObjectPairs as real duplicates.
StatisticComponent() - Constructor for class de.hpi.fgis.dude.postprocessor.StatisticComponent
Internal constructor for Jsonable deserialization.
StatisticOutput - Interface in de.hpi.fgis.dude.output.statisticoutput
StatisticOutput offers all methods needed to write out the statistics provided by a StatisticComponent instance.
statisticOutputSet() - Method in class de.hpi.fgis.dude.util.Experiment
Checks whether any StatisticOutput instance is set.
statisticsEnabled() - Method in class de.hpi.fgis.dude.util.Experiment
Checks whether gathering statistics is enabled.
step() - Method in class de.hpi.fgis.dude.util.bibtex.expander.BibtexPersonListParser.StringIterator
 
step() - Method in class de.hpi.fgis.dude.util.bibtex.parser.LookAheadReader
Jumps to the next character.
STR_DATABASE_HOST - Static variable in class de.hpi.fgis.dude.database.util.DBInfo
Parameter name of the database host.
STR_DATABASE_NAME - Static variable in class de.hpi.fgis.dude.database.util.DBInfo
Parameter name of the database name.
STR_DATABASE_PASSWORD - Static variable in class de.hpi.fgis.dude.database.util.DBInfo
Parameter name of the database password.
STR_DATABASE_PORT - Static variable in class de.hpi.fgis.dude.database.util.DBInfo
Parameter name of the database port.
STR_DATABASE_SCHEMA - Static variable in class de.hpi.fgis.dude.database.util.DBInfo
Parameter name of the database schema.
STR_DATABASE_USER - Static variable in class de.hpi.fgis.dude.database.util.DBInfo
Parameter name of the database user.
StreetSimilarityFunction - Class in de.hpi.fgis.dude.similarityfunction.domainspecific.address
StreetSimilarityFunction compares two strings and treats them as street names, allowing for some special normalization and comparison techniques.
StreetSimilarityFunction(String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.domainspecific.address.StreetSimilarityFunction
 
StringSimilarity - Interface in de.hpi.fgis.dude.similarityfunction
StringSimilarity is an interface for comparing Strings.
Subkey - Interface in de.hpi.fgis.dude.util.sorting.sortingkey
Subkey is an interface that is used within the SortingKey implementation.

T

task - Variable in class de.hpi.fgis.dude.postprocessor.StatisticComponent
 
TextBasedSubkey - Class in de.hpi.fgis.dude.util.sorting.sortingkey
The class TextBasedSubkey provides the functionality for generating sub-keys based on String values.
TextBasedSubkey() - Constructor for class de.hpi.fgis.dude.util.sorting.sortingkey.TextBasedSubkey
An empty constructor for supporting Jsonable.fromJson(DuDeJsonParser).
TextBasedSubkey(String) - Constructor for class de.hpi.fgis.dude.util.sorting.sortingkey.TextBasedSubkey
Initializes this subkey with the passed attribute name.
TextBasedSubkey(String, String) - Constructor for class de.hpi.fgis.dude.util.sorting.sortingkey.TextBasedSubkey
Initializes this subkey with the passed attribute name and the specified ignoredCharacters regular expression.
TFIDFExperimentExec - Class in de.hpi.fgis.dude.exec.duplicatedetection
Executes a run using a large data file with the SortedNeighborhoodMethod and preprocessing for the tf-idf comparator.
TFIDFExperimentExec() - Constructor for class de.hpi.fgis.dude.exec.duplicatedetection.TFIDFExperimentExec
 
TFIDFSimilarityFunction - Class in de.hpi.fgis.dude.similarityfunction.contentbased.impl
TFIDFSimilarityFunction compares two DuDeObjects based on the classic tf-idf metric.
TFIDFSimilarityFunction() - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.TFIDFSimilarityFunction
Internal constructor for Jsonable deserialization.
TFIDFSimilarityFunction(String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.TFIDFSimilarityFunction
Initializes a TFIDFSimilarityFunction object for the passed attribute.
TFIDFSimilarityFunction(int, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.TFIDFSimilarityFunction
Initializes a TFIDFSimilarityFunction object for the passed attribute.
TFIDFSimilarityFunction(DocumentFrequencyPreprocessor, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.TFIDFSimilarityFunction
Initializes a TFIDFSimilarityFunction object for the passed attribute.
TFIDFSimilarityFunction(DocumentFrequencyPreprocessor, int, String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.contentbased.impl.TFIDFSimilarityFunction
Initializes a TFIDFSimilarityFunction object for the passed attribute.
throwExpansionException(String) - Method in class de.hpi.fgis.dude.util.bibtex.expander.AbstractExpander
Call this whenever you want to throw an ExpansionException.
throwExpansionException(Exception) - Method in class de.hpi.fgis.dude.util.bibtex.expander.AbstractExpander
Call this whenever you want to throw an ExpansionException.
tillEnd - Variable in class de.hpi.fgis.dude.util.sorting.sortingkey.TextBasedSubkey
The property stores the information whether the suffix shall is requested.
timer - Variable in class de.hpi.fgis.dude.postprocessor.StatisticComponent
 
TitleSimilarityFunction - Class in de.hpi.fgis.dude.similarityfunction.domainspecific.address
TitleSimilarityFunction compares two strings and treats them as person's titles, allowing for some special normalization and comparison techniques.
TitleSimilarityFunction(String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.domainspecific.address.TitleSimilarityFunction
 
toJson(DuDeJsonGenerator) - Method in class de.hpi.fgis.dude.algorithm.AbstractRecordLinkage
 
toJson(DuDeJsonGenerator) - Method in class de.hpi.fgis.dude.datasource.BibtexSource
 
toJson(DuDeJsonGenerator) - Method in class de.hpi.fgis.dude.datasource.CSVSource
 
toJson(DuDeJsonGenerator) - Method in class de.hpi.fgis.dude.datasource.JSONSource
 
toJson(DuDeJsonGenerator) - Method in class de.hpi.fgis.dude.datasource.XMLSource
 
toJson(DuDeJsonGenerator) - Method in class de.hpi.fgis.dude.output.AbstractDuDeOutput
 
toJson(DuDeJsonGenerator) - Method in class de.hpi.fgis.dude.output.CSVOutput
 
toJson(DuDeJsonGenerator) - Method in class de.hpi.fgis.dude.output.statisticoutput.CSVStatisticOutput
 
toJson(DuDeJsonGenerator) - Method in class de.hpi.fgis.dude.output.statisticoutput.SimpleStatisticOutput
 
toJson(DuDeJsonGenerator) - Method in class de.hpi.fgis.dude.util.data.DuDeObject
 
toJson(DuDeJsonGenerator) - Method in class de.hpi.fgis.dude.util.data.DuDeObjectId
 
toJson(DuDeJsonGenerator) - Method in class de.hpi.fgis.dude.util.data.DuDeObjectPair
 
toJson(DuDeJsonGenerator) - Method in class de.hpi.fgis.dude.util.data.json.JsonArray
 
toJson(DuDeJsonGenerator) - Method in class de.hpi.fgis.dude.util.data.json.JsonBoolean
 
toJson(DuDeJsonGenerator) - Method in class de.hpi.fgis.dude.util.data.json.JsonNull
 
toJson(DuDeJsonGenerator) - Method in class de.hpi.fgis.dude.util.data.json.JsonNumber
 
toJson(DuDeJsonGenerator) - Method in class de.hpi.fgis.dude.util.data.json.JsonRecord
 
toJson(DuDeJsonGenerator) - Method in class de.hpi.fgis.dude.util.data.json.JsonString
 
toJson(AutoJsonable, boolean) - Static method in class de.hpi.fgis.dude.util.data.json.JsonUtil
Serializes a Jsonable and returns the resulting json string.
toJson(AutoJsonable) - Static method in class de.hpi.fgis.dude.util.data.json.JsonUtil
Serializes a Jsonable and returns the resulting json string.
toJson(DuDeJsonGenerator) - Method in interface de.hpi.fgis.dude.util.data.Jsonable
Generates the Json code using the passed DuDeJsonGenerator.
toJson(DuDeJsonGenerator) - Method in class de.hpi.fgis.dude.util.Experiment
 
tokeniser - Variable in class de.hpi.fgis.dude.similarityfunction.contentbased.util.MongeElkanSimilarity
private tokeniser for tokenisation of the query strings.
toString() - Method in class de.hpi.fgis.dude.datasource.AbstractDataSource
 
toString() - Method in class de.hpi.fgis.dude.output.statisticoutput.SimpleStatisticOutput
 
toString() - Method in class de.hpi.fgis.dude.postprocessor.WarshallTransitiveClosureGenerator.IntPair
 
toString() - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.impl.AbsoluteNumberDiffFunction
 
toString() - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.impl.RelativeNumberDiffFunction
 
toString() - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.impl.TFIDFSimilarityFunction
 
toString() - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexNode
 
toString() - Method in class de.hpi.fgis.dude.util.BoundType
 
toString() - Method in class de.hpi.fgis.dude.util.data.DuDeObject
Converts the DuDeObject to its Json representation.
toString() - Method in class de.hpi.fgis.dude.util.data.DuDeObjectId
 
toString() - Method in class de.hpi.fgis.dude.util.data.DuDeObjectPair
 
toString() - Method in class de.hpi.fgis.dude.util.data.json.JsonBoolean
 
toString() - Method in class de.hpi.fgis.dude.util.data.json.JsonNull
Returns the "null" String value.
toString() - Method in class de.hpi.fgis.dude.util.data.json.JsonNumber
 
toString() - Method in class de.hpi.fgis.dude.util.data.json.JsonString
Returns the actual String.
toString() - Method in class de.hpi.fgis.dude.util.Experiment
 
toString() - Method in class de.hpi.fgis.dude.util.GoldStandard
 
toString() - Method in class de.hpi.fgis.dude.util.MemoryChecker
 
toString() - Method in class de.hpi.fgis.dude.util.Pair
 
toString() - Method in class de.hpi.fgis.dude.util.sorting.sortingkey.AbstractSubkey
 
toString() - Method in class de.hpi.fgis.dude.util.sorting.sortingkey.SortingKey
 
transitiveClosuresEnabled() - Method in class de.hpi.fgis.dude.util.Experiment
Checks whether the usage of a transitive closure is enabled.
TRUE - Static variable in class de.hpi.fgis.dude.util.data.json.JsonBoolean
A Json-convertible true.
trueNegativesByComparison - Variable in class de.hpi.fgis.dude.postprocessor.StatisticComponent
 
truePositives - Variable in class de.hpi.fgis.dude.postprocessor.StatisticComponent
 
truePositivesByComparison - Variable in class de.hpi.fgis.dude.postprocessor.StatisticComponent
 
TwoPhaseMultiWayMergeSorter - Class in de.hpi.fgis.dude.util.sorting.sorter
TwoPhaseMultiWayMergeSorter implements a file-based sorting using the Two-Phase Multi-Way Merge-Sort algorithm (TPMMS).
TwoPhaseMultiWayMergeSorter() - Constructor for class de.hpi.fgis.dude.util.sorting.sorter.TwoPhaseMultiWayMergeSorter
Initializes a TwoPhaseMultiWayMergeSorter with no SortingKey.
TwoPhaseMultiWayMergeSorter(SortingKey) - Constructor for class de.hpi.fgis.dude.util.sorting.sorter.TwoPhaseMultiWayMergeSorter
Initializes a TwoPhaseMultiWayMergeSorter with the passed SortingKey.
TYPE_ATTRIBUTE - Static variable in class de.hpi.fgis.dude.datasource.BibtexSource
The name of the attribute that stores the type.

U

undefineField(String) - Method in class de.hpi.fgis.dude.util.bibtex.data.BibtexEntry
Removes the field from this entry.
unlockInstance() - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.DuplicateCountSNM
Unlocks the instance.
unregisterDataSources() - Method in class de.hpi.fgis.dude.algorithm.AbstractDuplicateDetection
 
unregisterDataSources() - Method in class de.hpi.fgis.dude.algorithm.AbstractRecordLinkage
 
unregisterDataSources() - Method in interface de.hpi.fgis.dude.algorithm.Algorithm
Unregisters all DataSources.

V

valueOf(String) - Static method in enum de.hpi.fgis.dude.algorithm.duplicatedetection.AdaptiveSNM_Yan2007.AlgorithmVariant
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum de.hpi.fgis.dude.algorithm.duplicatedetection.DuplicateCountSNM.AdaptionMode
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum de.hpi.fgis.dude.algorithm.duplicatedetection.DuplicateCountSNM.ComparisonResult
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum de.hpi.fgis.dude.algorithm.duplicatedetection.GSwoosh.ComparisonResult
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum de.hpi.fgis.dude.algorithm.duplicatedetection.Lego.ComparisonResult
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum de.hpi.fgis.dude.algorithm.duplicatedetection.RSwoosh.ComparisonResult
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum de.hpi.fgis.dude.algorithm.duplicatedetection.SortedBlocks.AlgorithmVariant
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent.Config
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum de.hpi.fgis.dude.similarityfunction.SimilarityFunction.SimilarityValidationState
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum de.hpi.fgis.dude.util.data.DuDeObjectPair.DuplicateType
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum de.hpi.fgis.dude.util.data.DuDeObjectPair.GeneratedBy
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum de.hpi.fgis.dude.util.data.json.JsonValue.JsonType
Returns the enum constant of this type with the specified name.
values() - Static method in enum de.hpi.fgis.dude.algorithm.duplicatedetection.AdaptiveSNM_Yan2007.AlgorithmVariant
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum de.hpi.fgis.dude.algorithm.duplicatedetection.DuplicateCountSNM.AdaptionMode
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum de.hpi.fgis.dude.algorithm.duplicatedetection.DuplicateCountSNM.ComparisonResult
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum de.hpi.fgis.dude.algorithm.duplicatedetection.GSwoosh.ComparisonResult
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum de.hpi.fgis.dude.algorithm.duplicatedetection.Lego.ComparisonResult
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum de.hpi.fgis.dude.algorithm.duplicatedetection.RSwoosh.ComparisonResult
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum de.hpi.fgis.dude.algorithm.duplicatedetection.SortedBlocks.AlgorithmVariant
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum de.hpi.fgis.dude.postprocessor.ExtendedStatisticComponent.Config
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum de.hpi.fgis.dude.similarityfunction.SimilarityFunction.SimilarityValidationState
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum de.hpi.fgis.dude.util.data.DuDeObjectPair.DuplicateType
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum de.hpi.fgis.dude.util.data.DuDeObjectPair.GeneratedBy
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum de.hpi.fgis.dude.util.data.json.JsonValue.JsonType
Returns an array containing the constants of this enum type, in the order they are declared.

W

WarshallTransitiveClosureGenerator - Class in de.hpi.fgis.dude.postprocessor
WarshallTransitiveClosureGenerator implements the Warshall algorithm to calculate the transitive closure.
WarshallTransitiveClosureGenerator() - Constructor for class de.hpi.fgis.dude.postprocessor.WarshallTransitiveClosureGenerator
 
WarshallTransitiveClosureGenerator.AdjacencyList - Class in de.hpi.fgis.dude.postprocessor
WarshallTransitiveClosureGenerator.AdjacencyList is the adjacency list representation of the added pairs.
WarshallTransitiveClosureGenerator.AdjacencyList(int) - Constructor for class de.hpi.fgis.dude.postprocessor.WarshallTransitiveClosureGenerator.AdjacencyList
Constructor of WarshallTransitiveClosureGenerator.AdjacencyList
WarshallTransitiveClosureGenerator.AdjacencyMatrix - Class in de.hpi.fgis.dude.postprocessor
WarshallTransitiveClosureGenerator.AdjacencyMatrix is the matrix representation of the added pairs.
WarshallTransitiveClosureGenerator.AdjacencyMatrix(int) - Constructor for class de.hpi.fgis.dude.postprocessor.WarshallTransitiveClosureGenerator.AdjacencyMatrix
Constructor of WarshallTransitiveClosureGenerator.AdjacencyMatrix
WarshallTransitiveClosureGenerator.GraphRepresentation - Class in de.hpi.fgis.dude.postprocessor
WarshallTransitiveClosureGenerator.GraphRepresentation is an interface that should be implemented by all classes representing a graph of duplicates.
WarshallTransitiveClosureGenerator.GraphRepresentation() - Constructor for class de.hpi.fgis.dude.postprocessor.WarshallTransitiveClosureGenerator.GraphRepresentation
 
WarshallTransitiveClosureGenerator.IntPair - Class in de.hpi.fgis.dude.postprocessor
WarshallTransitiveClosureGenerator.IntPair is used to create a pair of integer values.
WarshallTransitiveClosureGenerator.IntPair(int, int) - Constructor for class de.hpi.fgis.dude.postprocessor.WarshallTransitiveClosureGenerator.IntPair
Constructor of class IntPair
WarshallTransitiveClosureGenerator.TransitiveClosureIterator - Class in de.hpi.fgis.dude.postprocessor
WarshallTransitiveClosureGenerator.TransitiveClosureIterator is used to iterate over all pairs collected or generated by the WarshallTransitiveClosureGenerator.
WarshallTransitiveClosureGenerator.TransitiveClosureIterator(WarshallTransitiveClosureGenerator.GraphRepresentation) - Constructor for class de.hpi.fgis.dude.postprocessor.WarshallTransitiveClosureGenerator.TransitiveClosureIterator
Constructor of WarshallTransitiveClosureGenerator.TransitiveClosureIterator
windowSize(int) - Method in class de.hpi.fgis.dude.algorithm.duplicatedetection.DuplicateCountSNM.AdaptiveWindowSizeSNMBuilder
Sets the size of the window.
withData() - Method in class de.hpi.fgis.dude.output.AbstractDuDeOutput
 
withData() - Method in class de.hpi.fgis.dude.output.CSVOutput
 
withData() - Method in interface de.hpi.fgis.dude.output.DuDeOutput
Enables printing the data.
withData() - Method in class de.hpi.fgis.dude.output.MultipleOutput
 
withFirstElement(String, String...) - Method in class de.hpi.fgis.dude.util.GoldStandard
Fluent method for GoldStandard.setFirstElementsSourceId(String) and GoldStandard.setFirstElementsObjectIdAttributes(String...).
withHeader() - Method in class de.hpi.fgis.dude.datasource.CSVSource
By calling this method the extractor will interpret the first line as a header.
withHeader() - Method in class de.hpi.fgis.dude.output.CSVOutput
Writing the header before writing the first pair is enabled.
withIdAttributes(String...) - Method in class de.hpi.fgis.dude.datasource.AbstractDataSource
Sets the id attributes.
withoutData() - Method in class de.hpi.fgis.dude.output.AbstractDuDeOutput
 
withoutData() - Method in class de.hpi.fgis.dude.output.CSVOutput
 
withoutData() - Method in interface de.hpi.fgis.dude.output.DuDeOutput
Disables printing the data.
withoutData() - Method in class de.hpi.fgis.dude.output.MultipleOutput
 
withoutHeader() - Method in class de.hpi.fgis.dude.output.CSVOutput
Writing the header before writing the first pair is disabled.
withQuoteCharacter(char) - Method in class de.hpi.fgis.dude.datasource.CSVSource
Set the quote character that is used in the file.
withSecondElement(String, String...) - Method in class de.hpi.fgis.dude.util.GoldStandard
Fluent method for GoldStandard.setSecondElementsSourceId(String) and GoldStandard.setSecondElementsObjectIdAttributes(String...).
withSeparatorCharacter(char) - Method in class de.hpi.fgis.dude.datasource.CSVSource
Sets the separator character that is used in the file.
withSplitToken(String) - Method in class de.hpi.fgis.dude.similarityfunction.contentbased.impl.TFIDFSimilarityFunction
Sets the split token and returns the current instance.
withWhereFilter(String) - Method in class de.hpi.fgis.dude.database.DatabaseSource
Sets the where clause for the data extraction.
write(DuDeObjectPair) - Method in class de.hpi.fgis.dude.output.AbstractDuDeOutput
 
write(DuDeObjectPair) - Method in class de.hpi.fgis.dude.output.CSVOutput
Writes the Ids of the DuDeObjects their similarity value and specified optional values into the file.
write(DuDeObjectPair) - Method in interface de.hpi.fgis.dude.output.DuDeOutput
Writes the DuDeObjectPair onto an stream.
write(DuDeObjectPair) - Method in class de.hpi.fgis.dude.output.JsonOutput
 
write(DuDeObjectPair) - Method in class de.hpi.fgis.dude.output.MultipleOutput
 
write(DuDeObjectPair) - Method in class de.hpi.fgis.dude.output.SimpleTextOutput
Writes the DuDeObject pair line by line.
write(String) - Method in class de.hpi.fgis.dude.util.csv.CSVWriter
Writes the String into the output.
write(String...) - Method in class de.hpi.fgis.dude.util.csv.CSVWriter
Converts the passed data and writes the CSV-formatted String into the output.
write(DuDeJsonGenerator, Object) - Method in class de.hpi.fgis.dude.util.data.json.auto.ArrayJsonSerialization
 
write(DuDeJsonGenerator, T) - Method in class de.hpi.fgis.dude.util.data.json.auto.AutoJsonSerialization
Writes the given instance to the specified DuDeJsonGenerator.
write(DuDeJsonGenerator, C) - Method in class de.hpi.fgis.dude.util.data.json.auto.CollectionJsonSerialization
 
write(DuDeJsonGenerator, T) - Method in class de.hpi.fgis.dude.util.data.json.auto.CompositeJsonSerialization
Writes the given instance to the specified DuDeJsonGenerator.
write(DuDeJsonGenerator, E) - Method in class de.hpi.fgis.dude.util.data.json.auto.EnumJsonSerialization
 
write(T, DuDeJsonGenerator) - Method in class de.hpi.fgis.dude.util.data.json.auto.Primitive
Writes the given primitive to the json stream.
writeArrayEnd() - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonGenerator
Writes an array end into the stream.
writeArrayRecordEntry(String, Collection<? extends Jsonable>) - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonGenerator
Writes a new collection record entry into the stream
writeArrayStart() - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonGenerator
Writes an array start into the stream.
writeBoolean(boolean) - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonGenerator
Writes a raw boolean into the stream.
writeBooleanRecordEntry(String, boolean) - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonGenerator
Writes a new boolean record entry into the stream
writeClusterFile(String) - Method in class de.hpi.fgis.dude.util.GoldStandard
Writes the cluster form into a file in the following format: [objID1,sourceID1];[objID2,sourceID2]; Where each line represents a cluster of duplicates
writeDuplicatePairsFile(String) - Method in class de.hpi.fgis.dude.util.GoldStandard
Writes the cluster form into a file in the following format: [objID1,sourceID1];[objID2,sourceID2]; Where each line represents a cluster of duplicates
writeDuplicatesOnly(DuDeObjectPair) - Method in class de.hpi.fgis.dude.output.AbstractDuDeOutput
DuDeOutput#writeDuplicatesOnly(DuDeObjectPair)
writeDuplicatesOnly(DuDeObjectPair) - Method in class de.hpi.fgis.dude.output.CSVOutput
 
writeDuplicatesOnly(DuDeObjectPair) - Method in interface de.hpi.fgis.dude.output.DuDeOutput
Writes the passed DuDeObjectPair onto the stream, if it is flagged as a duplicate.
writeDuplicatesOnly(DuDeObjectPair) - Method in class de.hpi.fgis.dude.output.MultipleOutput
 
writeFields(DuDeJsonGenerator, Object) - Method in class de.hpi.fgis.dude.util.data.json.auto.AutoJsonSerialization
Writes all non-transient fields of the jsonable to the DuDeJsonGenerator.
writeFields(DuDeJsonGenerator, Object) - Method in class de.hpi.fgis.dude.util.data.json.auto.CompositeJsonSerialization
 
writeFields(DuDeJsonGenerator, AutoJsonable) - Static method in class de.hpi.fgis.dude.util.data.json.JsonUtil
Writes all non-transient fields of the jsonable to the DuDeJsonGenerator.
writeHeader() - Method in class de.hpi.fgis.dude.output.CSVOutput
Writes the header into the output.
writeHeader() - Method in class de.hpi.fgis.dude.output.statisticoutput.CSVStatisticOutput
Checks whether the header information will be written into the output.
writeJsonArray(JsonArray) - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonGenerator
Writes the passed JsonArray.
writeJsonBoolean(JsonBoolean) - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonGenerator
Writes the passed JsonBoolean.
writeJsonNull() - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonGenerator
Writes a JsonNull value to the stream.
writeJsonNumber(JsonNumber) - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonGenerator
Writes the passed JsonNumber.
writeJsonRecord(JsonRecord) - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonGenerator
Writes the passed JsonRecord.
writeJsonString(JsonString) - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonGenerator
Writes the passed JsonString.
writeJsonValue(JsonValue) - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonGenerator
Writes the passed JsonValue.
writeln(String) - Method in class de.hpi.fgis.dude.util.csv.CSVWriter
Adds a line break to the String and writes it into the output.
writelnToStream(String) - Method in class de.hpi.fgis.dude.output.AbstractDuDeOutput
Writes the passed string followed by a newline into the OutputStream.
writeNumber(Number) - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonGenerator
Writes a raw string into the stream.
writeNumberRecordEntry(String, Number) - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonGenerator
Writes a new number record entry into the stream
writeRecord(AutoJsonable) - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonGenerator
Writes a new record entry into the stream
writeRecordEnd() - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonGenerator
Writes a record end into the stream.
writeRecordEntry(String, JsonValue) - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonGenerator
Writes a new record entry into the stream
writeRecordEntry(String, Jsonable) - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonGenerator
Writes a new record entry into the stream
writeRecordEntry(String, AutoJsonable) - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonGenerator
Writes a new record entry into the stream
writeRecordFieldName(String) - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonGenerator
Writes a new field name into the stream.
writeRecordStart() - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonGenerator
Writes a record start into the stream.
writeStatistics() - Method in class de.hpi.fgis.dude.output.statisticoutput.AbstractStatisticOutput
 
writeStatistics() - Method in class de.hpi.fgis.dude.output.statisticoutput.CSVStatisticOutput
 
writeStatistics() - Method in class de.hpi.fgis.dude.output.statisticoutput.SimpleStatisticOutput
 
writeStatistics() - Method in interface de.hpi.fgis.dude.output.statisticoutput.StatisticOutput
Writes the stored statistics.
writeString(String) - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonGenerator
Writes a raw string into the stream.
writeStringRecordEntry(String, String) - Method in class de.hpi.fgis.dude.util.data.json.DuDeJsonGenerator
Writes a new string record entry into the stream
writeToStream(String) - Method in class de.hpi.fgis.dude.output.AbstractDuDeOutput
Writes the passed string into the OutputStream.
writeWithType(DuDeJsonGenerator, T) - Method in class de.hpi.fgis.dude.util.data.json.auto.AutoJsonSerialization
Writes the given value with additional type information.
This method should be used when the actual type of a value cannot be inferred from the declaration.
The typical scenario is the usage of polymorphism without secondary information which would allow to infer the type.
writeWithType(DuDeJsonGenerator, C) - Method in class de.hpi.fgis.dude.util.data.json.auto.CollectionJsonSerialization
 

X

XMLSource - Class in de.hpi.fgis.dude.datasource
XMLSource represents *.xml files.
XMLSource() - Constructor for class de.hpi.fgis.dude.datasource.XMLSource
Internal constructor for Jsonable deserialization.
XMLSource(String, File) - Constructor for class de.hpi.fgis.dude.datasource.XMLSource
Initializes a XMLSource that converts all elements in the first XML layer into JsonRecords.
XMLSource(String, File, String) - Constructor for class de.hpi.fgis.dude.datasource.XMLSource
Initializes a XMLSource that converts all direct child elements of the given root into DuDeObjects.
XMLSource.XMLSourceIterator - Class in de.hpi.fgis.dude.datasource
XMLSourceIterator is used for generating DuDeObjects out of XMLSources.
XMLSource.XMLSourceIterator(XMLSource) - Constructor for class de.hpi.fgis.dude.datasource.XMLSource.XMLSourceIterator
Initializes a XMLSourceIterator using the passed XMLSource.

Z

ZIPSimilarityFunction - Class in de.hpi.fgis.dude.similarityfunction.domainspecific.address
ZIPSimilarityFunction compares two strings and treats them as ZIP codes, allowing for some special normalization and comparison techniques.
ZIPSimilarityFunction(String...) - Constructor for class de.hpi.fgis.dude.similarityfunction.domainspecific.address.ZIPSimilarityFunction
 

A B C D E F G H I J K L M N O P Q R S T U V W X Z

Copyright © 2011 Hasso Plattner Institute - Chair of Information Systems. All Rights Reserved.