Class KeywordTokenizer

java.lang.Object
org.apache.lucene.util.AttributeSource
org.apache.lucene.analysis.TokenStream
org.apache.lucene.analysis.Tokenizer
org.apache.lucene.analysis.CharTokenizer
org.dlese.dpc.index.analysis.KeywordTokenizer
All Implemented Interfaces:
Closeable, AutoCloseable

public class KeywordTokenizer extends org.apache.lucene.analysis.CharTokenizer
Includes all characters as part of the token.
Author:
John Weatherley
  • Nested Class Summary

    Nested classes/interfaces inherited from class org.apache.lucene.util.AttributeSource

    org.apache.lucene.util.AttributeSource.AttributeFactory, org.apache.lucene.util.AttributeSource.State
  • Field Summary

    Fields inherited from class org.apache.lucene.analysis.Tokenizer

    input
  • Constructor Summary

    Constructors
    Constructor
    Description
    Constructor for the KeywordTokenizer object
  • Method Summary

    Modifier and Type
    Method
    Description
    protected boolean
    isTokenChar(char c)
    Accepts all characters.

    Methods inherited from class org.apache.lucene.analysis.CharTokenizer

    end, incrementToken, normalize, reset

    Methods inherited from class org.apache.lucene.analysis.Tokenizer

    close, correctOffset

    Methods inherited from class org.apache.lucene.analysis.TokenStream

    reset

    Methods inherited from class org.apache.lucene.util.AttributeSource

    addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, restoreState, toString

    Methods inherited from class java.lang.Object

    clone, finalize, getClass, notify, notifyAll, wait, wait, wait
  • Constructor Details

    • KeywordTokenizer

      public KeywordTokenizer(Reader in)
      Constructor for the KeywordTokenizer object
      Parameters:
      in - The Reader
  • Method Details

    • isTokenChar

      protected boolean isTokenChar(char c)
      Accepts all characters.
      Specified by:
      isTokenChar in class org.apache.lucene.analysis.CharTokenizer
      Parameters:
      c - The c
      Returns:
      true