elasticsearch/docs/reference/analysis/tokenizers
Luca Cavanna 8efd08b019
Upgrade to Lucene 10 (#114741)
The most relevant ES changes that upgrading to Lucene 10 requires are:

- use the appropriate IOContext
- Scorer / ScorerSupplier breaking changes
- Regex automaton are no longer determinized by default
- minimize moved to test classes
- introduce Elasticsearch900Codec
- adjust slicing code according to the added support for intra-segment concurrency
- disable intra-segment concurrency in tests
- adjust accessor methods for many Lucene classes that became a record
- adapt to breaking changes in the analysis area

Co-authored-by: Christoph Büscher <christophbuescher@posteo.de>
Co-authored-by: Mayya Sharipova <mayya.sharipova@elastic.co>
Co-authored-by: ChrisHegarty <chegar999@gmail.com>
Co-authored-by: Brian Seeders <brian.seeders@elastic.co>
Co-authored-by: Armin Braun <me@obrown.io>
Co-authored-by: Panagiotis Bailis <pmpailis@gmail.com>
Co-authored-by: Benjamin Trent <4357155+benwtrent@users.noreply.github.com>
2024-10-21 13:38:23 +02:00
..
chargroup-tokenizer.asciidoc
classic-tokenizer.asciidoc
edgengram-tokenizer.asciidoc [Docs] Update edgengram-tokenizer.asciidoc (#79577) 2021-10-26 13:05:35 +02:00
keyword-tokenizer.asciidoc [DOCS] Fix double spaces (#71082) 2021-03-31 09:57:47 -04:00
letter-tokenizer.asciidoc
lowercase-tokenizer.asciidoc [DOCS] Fix double spaces (#71082) 2021-03-31 09:57:47 -04:00
ngram-tokenizer.asciidoc [DOCS] Fix double spaces (#71082) 2021-03-31 09:57:47 -04:00
pathhierarchy-tokenizer.asciidoc Upgrade to Lucene 10 (#114741) 2024-10-21 13:38:23 +02:00
pattern-tokenizer.asciidoc [DOCS] Fix double spaces (#71082) 2021-03-31 09:57:47 -04:00
simplepattern-tokenizer.asciidoc
simplepatternsplit-tokenizer.asciidoc
standard-tokenizer.asciidoc
thai-tokenizer.asciidoc
uaxurlemail-tokenizer.asciidoc
whitespace-tokenizer.asciidoc