Package apple.naturallanguage.enums
Class NLTokenUnit
- java.lang.Object
-
- apple.naturallanguage.enums.NLTokenUnit
-
public final class NLTokenUnit extends java.lang.ObjectNLTokenizer is a class used to automatically segment natural-language text. An instance of this class is created with a specific unit and assigned a string to tokenize, and clients can then obtain ranges for tokens in that string appropriate to the given unit. Units are defined by NLTokenUnit, which specifies the size of the units in a string to which tokenization or tagging applies, whether word, sentence, paragraph, or document.
-
-
Field Detail
-
Word
public static final long Word
Token units are at word or equivalent level- See Also:
- Constant Field Values
-
Sentence
public static final long Sentence
Token units are at sentence level- See Also:
- Constant Field Values
-
Paragraph
public static final long Paragraph
Token units are at paragraph level- See Also:
- Constant Field Values
-
Document
public static final long Document
Token unit is the entire string- See Also:
- Constant Field Values
-
-