public class ComboUtils extends Object
| Modifier and Type | Field and Description |
|---|---|
| static String | JOINT: This is the character for joining strings for combo ngrams. |
| Constructor and Description |
|---|
| ComboUtils() |
| Modifier and Type | Method and Description |
|---|---|
| static de.tudarmstadt.ukp.dkpro.core.api.frequency.util.FrequencyDistribution<String> | getCombinedNgrams(de.tudarmstadt.ukp.dkpro.core.api.frequency.util.FrequencyDistribution<String> document1NGrams, de.tudarmstadt.ukp.dkpro.core.api.frequency.util.FrequencyDistribution<String> document2NGrams, int minN, int maxN, boolean ngramUseSymmetricalCombos): Get combinations of ngrams from a pair of documents. |
| static de.tudarmstadt.ukp.dkpro.core.api.frequency.util.FrequencyDistribution<String> | getMultipleViewNgrams(List<org.apache.uima.jcas.JCas> jcases, org.apache.uima.jcas.tcas.Annotation preSetTarget, boolean ngramLowerCase, boolean filterPartialStopwords, int ngramMinN, int ngramMaxN, Set<String> stopwords) |
public static final String JOINT
public static de.tudarmstadt.ukp.dkpro.core.api.frequency.util.FrequencyDistribution<String> getCombinedNgrams(de.tudarmstadt.ukp.dkpro.core.api.frequency.util.FrequencyDistribution<String> document1NGrams, de.tudarmstadt.ukp.dkpro.core.api.frequency.util.FrequencyDistribution<String> document2NGrams, int minN, int maxN, boolean ngramUseSymmetricalCombos)

- document1NGrams - ngrams from document 1
- document2NGrams - ngrams from document 2
- minN - minimum size for a new combined ngram
- maxN - max size for a new combined ngram
- ngramUseSymmetricalCombos - whether or not to return view-neutral ngrams

public static de.tudarmstadt.ukp.dkpro.core.api.frequency.util.FrequencyDistribution<String> getMultipleViewNgrams(List<org.apache.uima.jcas.JCas> jcases, org.apache.uima.jcas.tcas.Annotation preSetTarget, boolean ngramLowerCase, boolean filterPartialStopwords, int ngramMinN, int ngramMaxN, Set<String> stopwords) throws org.dkpro.tc.api.exception.TextClassificationException
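
The parameter descriptions for getCombinedNgrams suggest a pairwise combination of the two documents' ngrams, filtered by the size of the combined ngram. The following is a minimal sketch of that idea, not the library's implementation. It uses plain `Map<String, Long>` in place of FrequencyDistribution so that it runs without DKPro on the classpath. Several details are assumptions: the values of `JOINT` and `TOKEN_GLUE` are hypothetical (this page does not state them), the combined count is taken to be the product of the two source counts, and "view-neutral" is interpreted as emitting each pair in both orders.

```java
import java.util.Map;
import java.util.TreeMap;

final class ComboNgramSketch {
    // Hypothetical values: this page does not state the actual JOINT string
    // or how the tokens inside a single ngram are separated.
    static final String JOINT = "+";
    static final String TOKEN_GLUE = "_";

    // Number of tokens in an ngram, assuming tokens are separated by TOKEN_GLUE.
    public static int size(String ngram) {
        return ngram.split(TOKEN_GLUE, -1).length;
    }

    // Pairs every ngram of doc1 with every ngram of doc2 and keeps the pairs
    // whose combined token count lies in [minN, maxN].
    public static Map<String, Long> combine(Map<String, Long> doc1, Map<String, Long> doc2,
            int minN, int maxN, boolean symmetrical) {
        Map<String, Long> combos = new TreeMap<>();
        for (Map.Entry<String, Long> e1 : doc1.entrySet()) {
            for (Map.Entry<String, Long> e2 : doc2.entrySet()) {
                int n = size(e1.getKey()) + size(e2.getKey());
                if (n < minN || n > maxN) {
                    continue;
                }
                // Assumption: the combined count is the product of both counts.
                long count = e1.getValue() * e2.getValue();
                combos.merge(e1.getKey() + JOINT + e2.getKey(), count, Long::sum);
                if (symmetrical) {
                    // Assumption: "view-neutral" means the reversed order is added too.
                    combos.merge(e2.getKey() + JOINT + e1.getKey(), count, Long::sum);
                }
            }
        }
        return combos;
    }

    public static void main(String[] args) {
        Map<String, Long> doc1 = Map.of("the", 2L, "the_cat", 1L);
        Map<String, Long> doc2 = Map.of("dog", 3L);
        // Only combined size 2 is allowed, so "the_cat" + "dog" (size 3) is dropped.
        System.out.println(combine(doc1, doc2, 2, 2, false));
        // Sizes 2 and 3 are allowed, and each pair is emitted in both orders.
        System.out.println(combine(doc1, doc2, 2, 3, true));
    }
}
```

With the sample data in `main`, the first call keeps a single entry, `the+dog` with count 6. The second call returns four entries because each accepted pair appears in both orders. When working against the real API, the two maps would be FrequencyDistribution<String> instances and the result would come from ComboUtils.getCombinedNgrams itself.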
Copyright © 2013–2018 Ubiquitous Knowledge Processing (UKP) Lab. All rights reserved.