public class LuceneNGramCPFE extends LuceneNGramPFE implements PairFeatureExtractor
LuceneNGramPFE.| Modifier and Type | Field and Description |
|---|---|
static String |
LUCENE_NGRAM_FIELDCOMBO |
protected int |
ngramMaxNCombo |
protected int |
ngramMinNCombo |
protected boolean |
ngramUseSymmetricalCombos |
protected int |
ngramUseTopKCombo |
static String |
PARAM_NGRAM_MAX_N_COMBO
Maximum token length of the combination
|
static String |
PARAM_NGRAM_MIN_N_COMBO
Minimum token length of the combination.
|
static String |
PARAM_NGRAM_SYMMETRY_COMBO
If true, both orderings of ngram combinations will be used.
|
static String |
PARAM_NGRAM_USE_TOP_K_COMBO
Use this number of most frequent combinations
|
protected de.tudarmstadt.ukp.dkpro.core.api.frequency.util.FrequencyDistribution<String> |
topKSetCombo |
LUCENE_NGRAM_FIELD1, LUCENE_NGRAM_FIELD2, markViewBlindNgramsWithLocalView, ngramBinaryFeatureValuesCombos, ngramMaxN1, ngramMaxN2, ngramMinN1, ngramMinN2, ngramUseTopK1, ngramUseTopK2, PARAM_MARK_VIEWBLIND_NGRAMS_WITH_LOCAL_VIEW, PARAM_NGRAM_BINARY_FEATURE_VALUES_COMBO, PARAM_NGRAM_MAX_N_VIEW1, PARAM_NGRAM_MAX_N_VIEW2, PARAM_NGRAM_MIN_N_VIEW1, PARAM_NGRAM_MIN_N_VIEW2, PARAM_NGRAM_USE_TOP_K_VIEW1, PARAM_NGRAM_USE_TOP_K_VIEW2, PARAM_USE_VIEW1_NGRAMS_AS_FEATURES, PARAM_USE_VIEW2_NGRAMS_AS_FEATURES, PARAM_USE_VIEWBLIND_NGRAMS_AS_FEATURES, topKSetView1, topKSetView2, useView1NgramsAsFeatures, useView2NgramsAsFeatures, useViewBlindNgramsAsFeaturesforceRereadFromIndex, LUCENE_NGRAM_FIELD, luceneDir, PARAM_SOURCE_LOCATIONdfStore, filterPartialStopwordMatches, ngramFreqThreshold, ngramLowerCase, ngramMaxN, ngramMinN, ngramStopwordsFile, ngramUseTopK, PARAM_FILTER_PARTIAL_STOPWORD_MATCHES, PARAM_NGRAM_FREQ_THRESHOLD, PARAM_NGRAM_LOWER_CASE, PARAM_NGRAM_MAX_N, PARAM_NGRAM_MIN_N, PARAM_NGRAM_STOPWORDS_FILE, PARAM_NGRAM_USE_TOP_K, PARAM_TF_IDF_CALCULATION, prefix, stopwords, tfIdfCalculation, topKSetfeatureExtractorName, PARAM_UNIQUE_EXTRACTOR_NAME| Constructor and Description |
|---|
LuceneNGramCPFE() |
| Modifier and Type | Method and Description |
|---|---|
Set<Feature> |
extract(org.apache.uima.jcas.JCas view1,
org.apache.uima.jcas.JCas view2) |
List<MetaCollectorConfiguration> |
getMetaCollectorClasses(Map<String,Object> parameterSettings) |
boolean |
initialize(org.apache.uima.resource.ResourceSpecifier aSpecifier,
Map<String,Object> aAdditionalParams) |
addToFeatureArray, getFeaturePrefix, getFieldName, getTopN, getTopNgrams, getTopNgramsView1, getTopNgramsView2, getViewNgramslogSelectionProcess, passesScreeningafterResourcesInitialized, getLogger, getResourceNamepublic static final String PARAM_NGRAM_MIN_N_COMBO
protected int ngramMinNCombo
public static final String PARAM_NGRAM_MAX_N_COMBO
protected int ngramMaxNCombo
public static final String PARAM_NGRAM_USE_TOP_K_COMBO
protected int ngramUseTopKCombo
public static final String PARAM_NGRAM_SYMMETRY_COMBO
protected boolean ngramUseSymmetricalCombos
public static final String LUCENE_NGRAM_FIELDCOMBO
protected de.tudarmstadt.ukp.dkpro.core.api.frequency.util.FrequencyDistribution<String> topKSetCombo
public boolean initialize(org.apache.uima.resource.ResourceSpecifier aSpecifier,
Map<String,Object> aAdditionalParams)
throws org.apache.uima.resource.ResourceInitializationException
initialize in interface org.apache.uima.resource.Resourceinitialize in class LuceneNGramPFEorg.apache.uima.resource.ResourceInitializationExceptionpublic List<MetaCollectorConfiguration> getMetaCollectorClasses(Map<String,Object> parameterSettings) throws org.apache.uima.resource.ResourceInitializationException
getMetaCollectorClasses in interface MetaDependentgetMetaCollectorClasses in class LuceneNGramPFEorg.apache.uima.resource.ResourceInitializationExceptionpublic Set<Feature> extract(org.apache.uima.jcas.JCas view1, org.apache.uima.jcas.JCas view2) throws org.dkpro.tc.api.exception.TextClassificationException
extract in interface PairFeatureExtractorextract in class LuceneNGramPFEorg.dkpro.tc.api.exception.TextClassificationExceptionCopyright © 2013–2018 Ubiquitous Knowledge Processing (UKP) Lab. All rights reserved.