Class: Ferret::Analysis::AsciiWhiteSpaceTokenizer
Summary
A WhiteSpaceTokenizer is a tokenizer that divides text at white-space. Adjacent sequences of non-WhiteSpace characters form tokens.
Example
"Dave's résumé, at http://www.davebalmain.com/ 1234"
=> ["Dave's", "résumé,", "at", "http://www.davebalmain.com", "1234"]
Public Class Methods
AsciiWhiteSpaceTokenizer.new() → tokenizer
Create a new AsciiWhiteSpaceTokenizer
/*
* call-seq:
* AsciiWhiteSpaceTokenizer.new() -> tokenizer
*
* Create a new AsciiWhiteSpaceTokenizer
*/
static VALUE
frb_a_whitespace_tokenizer_init(VALUE self, VALUE rstr)
{
return get_wrapped_ts(self, rstr, whitespace_tokenizer_new());
}