net.sf.zekr.engine.search.lucene
Class ZekrLuceneAnalyzer
java.lang.Object
org.apache.lucene.analysis.Analyzer
net.sf.zekr.engine.search.lucene.ZekrLuceneAnalyzer
public class ZekrLuceneAnalyzer
- extends org.apache.lucene.analysis.Analyzer
A wrapper class for analyzing text in different languages. When used with some European languages,
SnowballFilter is applied internally. It uses the following set of tokenizers and filters:
- StandardTokenizer
- StandardFilter
- LowerCaseFilter
- StopFilter
- RegexReplaceFilter
- SnowballFilter (used only if a Snowball filter is available for the language)
- Author:
- Mohsen Saboorian
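The order of the stages listed above can be illustrated with a minimal plain-Java sketch. It does not call Lucene or Zekr: the class name FilterChainSketch, the stop-word set, and the regex rule are hypothetical stand-ins chosen for illustration, and the StandardFilter and SnowballFilter stages are left out. It only shows how a token passes through tokenizing, lower-casing, stop-word removal, and regex replacement in sequence.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;
import java.util.regex.Pattern;

// Illustration only: mimics the stage order of the analyzer with JDK classes.
final class FilterChainSketch {
    // Hypothetical stop words; the analyzer's real stop list is not given on this page.
    private static final Set<String> STOP_WORDS =
            new HashSet<>(Arrays.asList("the", "a", "of"));

    // Hypothetical regex-replace rule: drop apostrophes inside a term.
    private static final Pattern REPLACE = Pattern.compile("'");

    static List<String> analyze(String text) {
        List<String> terms = new ArrayList<>();
        // Stage 1: tokenize (simplified stand-in for StandardTokenizer).
        for (String raw : text.split("[^\\p{L}\\p{N}']+")) {
            if (raw.isEmpty()) {
                continue; // split() may yield an empty leading element
            }
            // Stage 2: lower-case the term (LowerCaseFilter).
            String term = raw.toLowerCase(Locale.ROOT);
            // Stage 3: discard stop words (StopFilter).
            if (STOP_WORDS.contains(term)) {
                continue;
            }
            // Stage 4: apply the regex replacement (RegexReplaceFilter).
            term = REPLACE.matcher(term).replaceAll("");
            if (!term.isEmpty()) {
                terms.add(term);
            }
        }
        return terms;
    }

    public static void main(String[] args) {
        // prints [light, heavens]
        System.out.println(analyze("The Light of the Heavens"));
    }
}
```

Because the stop-word check runs after lower-casing, "The" and "the" are both removed. This is one reason the lower-casing stage precedes the stop filter in the list above.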
Method Summary
org.apache.lucene.analysis.TokenStream tokenStream(java.lang.String fieldName, java.io.Reader reader)
Methods inherited from class org.apache.lucene.analysis.Analyzer
getPositionIncrementGap, getPreviousTokenStream, reusableTokenStream, setPreviousTokenStream
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
QURAN_LANG_CODE
public static final java.lang.String QURAN_LANG_CODE
- This constant is to be used as the ZekrLuceneAnalyzer ID for instantiating a Quran (Arabic) Analyzer.
- See Also:
- Constant Field Values
ZekrLuceneAnalyzer
public ZekrLuceneAnalyzer(java.lang.String langCode,
java.lang.String name)
tokenStream
public org.apache.lucene.analysis.TokenStream tokenStream(java.lang.String fieldName,
java.io.Reader reader)
- Specified by:
- tokenStream in class org.apache.lucene.analysis.Analyzer