java.lang.Object
  de.hpi.fgis.dude.preprocessor.DocumentFrequencyPreprocessor
public class DocumentFrequencyPreprocessor
The DocumentFrequencyPreprocessor collects frequencies of terms within an attribute value. Each value from the considered attribute is regarded as a document.
See Also: TFIDFSimilarityFunction
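The idea of treating each attribute value as a document can be sketched as follows. This is a minimal, self-contained illustration and not the DuDe implementation: the class `DocumentFrequencySketch` and its methods `analyzeValue`, `getDocumentFrequency`, and `getDocumentCount` are hypothetical names, and the tokenization (lower-casing, splitting on whitespace) and the rule that a term counts once per value are assumptions.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

// Hypothetical stand-in, not part of DuDe: counts in how many attribute
// values ("documents") each term occurs.
class DocumentFrequencySketch {
    private final Map<String, Integer> documentFrequency = new HashMap<>();
    private int documentCount = 0;

    // Treats one attribute value as one document.
    // Assumption: terms are lower-cased and split on whitespace.
    void analyzeValue(String value) {
        if (value == null || value.trim().isEmpty()) {
            return; // nothing to count for missing or blank values
        }
        documentCount++;
        Set<String> distinctTerms = new HashSet<>();
        for (String term : value.trim().toLowerCase(Locale.ROOT).split("\\s+")) {
            distinctTerms.add(term);
        }
        // Assumption: a term counts once per document, however often it repeats.
        for (String term : distinctTerms) {
            documentFrequency.merge(term, 1, Integer::sum);
        }
    }

    // Discards everything gathered so far.
    void clearData() {
        documentFrequency.clear();
        documentCount = 0;
    }

    // Number of analyzed values that contain the term (0 if unseen).
    int getDocumentFrequency(String term) {
        return documentFrequency.getOrDefault(term.toLowerCase(Locale.ROOT), 0);
    }

    // Number of non-blank values analyzed so far.
    int getDocumentCount() {
        return documentCount;
    }

    public static void main(String[] args) {
        DocumentFrequencySketch sketch = new DocumentFrequencySketch();
        sketch.analyzeValue("data cleaning");
        sketch.analyzeValue("data data fusion");
        System.out.println(sketch.getDocumentFrequency("data"));   // 2
        System.out.println(sketch.getDocumentFrequency("fusion")); // 1
        System.out.println(sketch.getDocumentCount());             // 2
    }
}
```

In the sketch, "data" appears in both values and therefore has a document frequency of 2, even though it is repeated within the second value.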
Constructor Summary |
---|
DocumentFrequencyPreprocessor(String attrName): Initializes a DocumentFrequencyPreprocessor object for the passed attribute. |
Method Summary | |
---|---|
void | analyzeDuDeObject(DuDeObject data): Retrieves the value frequencies within the considered attribute and adds them to the total document frequency of the terms. |
void | clearData(): Clears statistics that were already gathered. |
void | finish(): This method is called after finishing the data extraction process. |
double | getInverseDocumentFrequency(String term): Retrieves the inverse document frequency of the passed term. |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public DocumentFrequencyPreprocessor(String attrName)
Initializes a DocumentFrequencyPreprocessor object for the passed attribute.
Parameters: attrName - The attribute on which the document frequencies are calculated.
Method Detail |
---|
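public void analyzeDuDeObject(DuDeObject data)
Retrieves the value frequencies within the considered attribute and adds them to the total document frequency of the terms.
Specified by: analyzeDuDeObject in interface Preprocessor
Parameters: data - The DuDeObject that shall be analyzed.
public void clearData()
Clears statistics that were already gathered.
Specified by: clearData in interface Preprocessor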
public void finish()
This method is called after finishing the data extraction process.
Specified by: finish in interface Preprocessor
public double getInverseDocumentFrequency(String term)
Retrieves the inverse document frequency of the passed term.
Parameters: term - The considered term
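This page does not state which formula the method applies. As an assumption about what "inverse document frequency" usually denotes, a common textbook definition is:

```latex
\mathrm{idf}(t) = \log \frac{N}{\mathrm{df}(t)}
```

Here $N$ is the number of analyzed attribute values (documents) and $\mathrm{df}(t)$ is the number of those values that contain the term $t$. Under this definition a term that occurs in every value gets a score of 0, and rarer terms get higher scores. Smoothed variants exist, and the DuDe implementation may differ from this form.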