net.sf.zekr.engine.search.lucene
Class ZekrLuceneAnalyzer

java.lang.Object
  extended by org.apache.lucene.analysis.Analyzer
      extended by net.sf.zekr.engine.search.lucene.ZekrLuceneAnalyzer

public class ZekrLuceneAnalyzer
extends org.apache.lucene.analysis.Analyzer

A wrapper class for analyzing text in different languages. When used with some European languages, a SnowballFilter is applied internally. It uses the following tokenizer and filters, in this order:

  1. StandardTokenizer
  2. StandardFilter
  3. LowerCaseFilter
  4. StopFilter
  5. RegexReplaceFilter
  6. SnowballFilter (applied only if a Snowball filter is available for the language)
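
The order of the stages above can be illustrated with a minimal, stdlib-only sketch. It is not the Lucene or Zekr implementation: the class name FilterChainSketch, the method analyze, the stop-word set, the regular expression, and the toy stemmer are all hypothetical stand-ins chosen for illustration. StandardTokenizer is approximated by a simple split on non-alphanumeric characters, and the StandardFilter stage is skipped.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Set;
import java.util.function.UnaryOperator;
import java.util.regex.Pattern;

/** Hypothetical model of the documented stage order; not the real analyzer. */
public class FilterChainSketch {

    /**
     * Applies the stages in the documented order to each token.
     * Pass stemmer == null to model a language with no stemmer available.
     */
    public static List<String> analyze(String text,
                                       Set<String> stopWords,
                                       Pattern pattern,
                                       String replacement,
                                       UnaryOperator<String> stemmer) {
        List<String> tokens = new ArrayList<>();
        // Stage 1: crude stand-in for StandardTokenizer (split on non-alphanumerics).
        for (String raw : text.split("[^\\p{L}\\p{N}]+")) {
            if (raw.isEmpty()) {
                continue; // split() can yield an empty leading element
            }
            // Stage 3: lower-casing (stage 2, StandardFilter, is not modeled).
            String token = raw.toLowerCase(Locale.ROOT);
            // Stage 4: drop stop words.
            if (stopWords.contains(token)) {
                continue;
            }
            // Stage 5: regex replacement; discard tokens that become empty.
            token = pattern.matcher(token).replaceAll(replacement);
            if (token.isEmpty()) {
                continue;
            }
            // Stage 6: optional stemming, only when a stemmer is supplied.
            if (stemmer != null) {
                token = stemmer.apply(token);
            }
            tokens.add(token);
        }
        return tokens;
    }

    public static void main(String[] args) {
        Set<String> stopWords = Set.of("the");          // assumed stop list
        Pattern digits = Pattern.compile("\\d+");       // assumed pattern
        // Toy stemmer: strips one trailing "s". Real Snowball stemmers do far more.
        UnaryOperator<String> toyStemmer =
                t -> t.endsWith("s") ? t.substring(0, t.length() - 1) : t;

        String text = "The 2 Quick cats42 RUN";
        System.out.println(analyze(text, stopWords, digits, "", toyStemmer)); // prints [quick, cat, run]
        System.out.println(analyze(text, stopWords, digits, "", null));       // prints [quick, cats, run]
    }
}
```

Passing null for the stemmer mirrors the conditional last stage: the token stream is otherwise processed identically, and only the final normalization differs.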

Author:
Mohsen Saboorian

Field Summary
static java.lang.String QURAN_LANG_CODE
          The ZekrLuceneAnalyzer ID (language code) used to instantiate a Quran (Arabic) analyzer.
 
Constructor Summary
ZekrLuceneAnalyzer(java.lang.String langCode, java.lang.String name)
           
 
Method Summary
 org.apache.lucene.analysis.TokenStream tokenStream(java.lang.String fieldName, java.io.Reader reader)
           
 
Methods inherited from class org.apache.lucene.analysis.Analyzer
getPositionIncrementGap, getPreviousTokenStream, reusableTokenStream, setPreviousTokenStream
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

QURAN_LANG_CODE

public static final java.lang.String QURAN_LANG_CODE
The ZekrLuceneAnalyzer ID (language code) used to instantiate a Quran (Arabic) analyzer.

See Also:
Constant Field Values

Constructor Detail

ZekrLuceneAnalyzer

public ZekrLuceneAnalyzer(java.lang.String langCode,
                          java.lang.String name)

Method Detail

tokenStream

public org.apache.lucene.analysis.TokenStream tokenStream(java.lang.String fieldName,
                                                          java.io.Reader reader)
Specified by:
tokenStream in class org.apache.lucene.analysis.Analyzer