Package de.danielnaber.jwordsplitter
Class GermanWordSplitter
java.lang.Object
de.danielnaber.jwordsplitter.AbstractWordSplitter
de.danielnaber.jwordsplitter.GermanWordSplitter
Split German compound words. Based on an embedded dictionary, or on an
external plain text dictionary.
-
Field Summary
FieldsModifier and TypeFieldDescriptionprivate GermanInterfixDisambiguator
private static final String
private static final Collection
<String> Interfixes = FugenelementeFields inherited from class de.danielnaber.jwordsplitter.AbstractWordSplitter
words
-
Constructor Summary
ConstructorsConstructorDescriptionGermanWordSplitter
(boolean hideInterfixCharacters) GermanWordSplitter
(boolean hideInterfixCharacters, File plainTextDict) GermanWordSplitter
(boolean hideInterfixCharacters, InputStream plainTextDict) GermanWordSplitter
(boolean hideInterfixCharacters, Set<String> words) -
Method Summary
Modifier and TypeMethodDescriptionprotected int
protected GermanInterfixDisambiguator
protected Collection
<String> Interfix elements in lowercase, e.g.getWordList
(InputStream stream) private void
init()
Methods inherited from class de.danielnaber.jwordsplitter.AbstractWordSplitter
addException, getAllSplits, getAllSplits, getSubWords, setExceptionFile, setMaximumWordLength, setMinimumWordLength, setStrictMode, splitWord, splitWord
-
Field Details
-
EXCEPTION_DICT
- See Also:
-
INTERFIXES
Interfixes = Fugenelemente -
disambiguator
-
-
Constructor Details
-
GermanWordSplitter
- Throws:
IOException
-
GermanWordSplitter
public GermanWordSplitter(boolean hideInterfixCharacters, InputStream plainTextDict) throws IOException - Throws:
IOException
-
GermanWordSplitter
- Throws:
IOException
-
GermanWordSplitter
- Throws:
IOException
- Since:
- 4.2
-
-
Method Details
-
init
- Throws:
IOException
-
getWordList
- Specified by:
getWordList
in classAbstractWordSplitter
- Throws:
IOException
-
getWordList
- Specified by:
getWordList
in classAbstractWordSplitter
-
getDisambiguator
- Specified by:
getDisambiguator
in classAbstractWordSplitter
-
getDefaultMinimumWordLength
protected int getDefaultMinimumWordLength()- Specified by:
getDefaultMinimumWordLength
in classAbstractWordSplitter
-
getInterfixCharacters
Description copied from class:AbstractWordSplitter
Interfix elements in lowercase, e.g. at least "s" for German.- Specified by:
getInterfixCharacters
in classAbstractWordSplitter
-