edu.iastate.jtm
Class PorterStemmer

java.lang.Object
  extended by edu.iastate.jtm.PorterStemmer
All Implemented Interfaces:
Stemmer

public class PorterStemmer
extends java.lang.Object
implements Stemmer

PorterStemmer, implementing the Porter Stemming Algorithm The Stemmer class transforms a word into its root form. The input word can be provided a character at time (by calling add()), or at once by calling one of the various stem(something) methods.


Field Summary
 
Fields inherited from interface edu.iastate.jtm.Stemmer
ALLCAP, allCapPattern, ANYWORD, anyWordPattern, delimiters, FIRSTCAP, firstCapPattern, NOCAP, noCapPattern
 
Constructor Summary
PorterStemmer()
          Creates a new instance of PorterStemmer
 
Method Summary
static void main(java.lang.String[] args)
          Test program for demonstrating the Stemmer.
 java.lang.String stemSentence(java.lang.String sen)
          Stem a sentence.
 java.lang.String stemWord(java.lang.String word)
          Stem a lower-cased word.
 java.lang.String stemWord(java.lang.String word, int type)
          Stem a word.
 java.lang.String toString()
          After a word has been stemmed, it can be retrieved by toString(), or a reference to the internal buffer can be retrieved by getResultBuffer and getResultLength (which is generally more efficient.)
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait
 

Constructor Detail

PorterStemmer

public PorterStemmer()
Creates a new instance of PorterStemmer

Method Detail

toString

public java.lang.String toString()
After a word has been stemmed, it can be retrieved by toString(), or a reference to the internal buffer can be retrieved by getResultBuffer and getResultLength (which is generally more efficient.)

Overrides:
toString in class java.lang.Object
Returns:
String representation of the buffer

stemWord

public java.lang.String stemWord(java.lang.String word,
                                 int type)
Stem a word.

Specified by:
stemWord in interface Stemmer
Parameters:
word - the word to be stemmed
type - capitalization type of the word
Returns:
stemmed word

stemWord

public java.lang.String stemWord(java.lang.String word)
Stem a lower-cased word.

Specified by:
stemWord in interface Stemmer
Parameters:
word - the word to be stemmed
Returns:
stemmed word

stemSentence

public java.lang.String stemSentence(java.lang.String sen)
Stem a sentence.

Specified by:
stemSentence in interface Stemmer
Parameters:
sen - the sentence to be stemmed
Returns:
stemmed sentence

main

public static void main(java.lang.String[] args)
Test program for demonstrating the Stemmer.

Parameters:
args - sentences/words to be tested