public abstract class LucenePFEBase extends LuceneFeatureExtractorBase
| Modifier and Type | Field and Description |
|---|---|
protected String |
fieldOfTheMoment |
protected int |
kngramUseTopK |
protected boolean |
ngramBinaryFeatureValuesCombos |
protected int |
ngramUseTopK1 |
protected int |
ngramUseTopK2 |
static String |
PARAM_NGRAM_BINARY_FEATURE_VALUES_COMBO
Whether features should be marked with binary (occurs, doesn't occur in this document pair)
values, versus the document count of the feature.
|
static String |
PARAM_NGRAM_USE_TOP_K_VIEW1
Use this number of most frequent ngrams from View 1's.
|
static String |
PARAM_NGRAM_USE_TOP_K_VIEW2
Use this number of most frequent ngrams from View 2's.
|
protected de.tudarmstadt.ukp.dkpro.core.api.frequency.util.FrequencyDistribution<String> |
topKSetView1 |
protected de.tudarmstadt.ukp.dkpro.core.api.frequency.util.FrequencyDistribution<String> |
topKSetView2 |
protected int |
topNOfTheMoment |
forceRereadFromIndex, LUCENE_NGRAM_FIELD, luceneDir, PARAM_SOURCE_LOCATIONdfStore, filterPartialStopwordMatches, ngramFreqThreshold, ngramLowerCase, ngramMaxN, ngramMinN, ngramStopwordsFile, ngramUseTopK, PARAM_FILTER_PARTIAL_STOPWORD_MATCHES, PARAM_NGRAM_FREQ_THRESHOLD, PARAM_NGRAM_LOWER_CASE, PARAM_NGRAM_MAX_N, PARAM_NGRAM_MIN_N, PARAM_NGRAM_STOPWORDS_FILE, PARAM_NGRAM_USE_TOP_K, PARAM_TF_IDF_CALCULATION, prefix, stopwords, tfIdfCalculation, topKSetfeatureExtractorName, PARAM_UNIQUE_EXTRACTOR_NAME| Constructor and Description |
|---|
LucenePFEBase() |
| Modifier and Type | Method and Description |
|---|---|
protected Set<Feature> |
addToFeatureArray(de.tudarmstadt.ukp.dkpro.core.api.frequency.util.FrequencyDistribution<String> viewNgrams,
de.tudarmstadt.ukp.dkpro.core.api.frequency.util.FrequencyDistribution<String> topKSet,
Set<Feature> features) |
boolean |
initialize(org.apache.uima.resource.ResourceSpecifier aSpecifier,
Map<String,Object> aAdditionalParams) |
getFieldName, getTopN, getTopNgrams, logSelectionProcess, passesScreeninggetFeaturePrefixafterResourcesInitialized, getLogger, getResourceNamedestroy, getCasManager, getMetaData, getRelativePathResolver, getResourceManager, getUimaContext, getUimaContextAdmin, setLogger, setMetaDataclone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, waitgetMetaCollectorClassesprotected int kngramUseTopK
public static final String PARAM_NGRAM_USE_TOP_K_VIEW1
protected int ngramUseTopK1
public static final String PARAM_NGRAM_USE_TOP_K_VIEW2
protected int ngramUseTopK2
public static final String PARAM_NGRAM_BINARY_FEATURE_VALUES_COMBO
protected boolean ngramBinaryFeatureValuesCombos
protected de.tudarmstadt.ukp.dkpro.core.api.frequency.util.FrequencyDistribution<String> topKSetView1
protected de.tudarmstadt.ukp.dkpro.core.api.frequency.util.FrequencyDistribution<String> topKSetView2
protected String fieldOfTheMoment
protected int topNOfTheMoment
public boolean initialize(org.apache.uima.resource.ResourceSpecifier aSpecifier,
Map<String,Object> aAdditionalParams)
throws org.apache.uima.resource.ResourceInitializationException
initialize in interface org.apache.uima.resource.Resourceinitialize in class NGramFeatureExtractorBaseorg.apache.uima.resource.ResourceInitializationExceptionCopyright © 2013–2018 Ubiquitous Knowledge Processing (UKP) Lab. All rights reserved.