Ticket #141 (closed defect: fixed)
API wrong? StandardAnalyzer really using FULL_ENGLISH_STOP_WORDS
| Reported by: | chris@… | Owned by: | somebody |
|---|---|---|---|
| Priority: | minor | Milestone: | |
| Component: | component1 | Version: | |
| Keywords: | Cc: |
Description
The api says it is using the ENGLISH_STOP_WORDS, but it seems to be using the FULL_ENGLISH_STOP_WORDS.
http://ferret.davebalmain.com/api/classes/Ferret/Analysis/StandardAnalyzer.html
I discovered this because I noticed that "will" is not being dropped from my queries, but it is in the ENGLISH_STOP_WORDS referenced in src/analysis/stopwords.c and lib/ferret/analysis/analyzers.rb
Ferret::Analysis::StandardAnalyzer?.new().token_stream(:title,'will').next
This points out something else that I found interesting: some words in the ENGLISH_STOP_WORDS are not in FULL_ENGLISH_STOP_WORDS (such as "will"). I haven't done my homework on this one, so maybe there is a reason for this, but thought I'd just point it out.
cheers
