elasticsearch/docs/reference/text-analysis/analysis-whitespace-tokeniz...

---
navigation_title: "Whitespace"
mapped_pages:
  - https://www.elastic.co/guide/en/elasticsearch/reference/current/analysis-whitespace-tokenizer.html
---

# Whitespace tokenizer [analysis-whitespace-tokenizer]


The `whitespace` tokenizer breaks text into terms whenever it encounters a whitespace character.


## Example output [_example_output_19]

```console
POST _analyze
{
  "tokenizer": "whitespace",
  "text": "The 2 QUICK Brown-Foxes jumped over the lazy dog's bone."
}
```

The above sentence would produce the following terms:

```text
[ The, 2, QUICK, Brown-Foxes, jumped, over, the, lazy, dog's, bone. ]
```


## Configuration [_configuration_22]

The `whitespace` tokenizer accepts the following parameters:

`max_token_length`
:   The maximum token length. If a token is seen that exceeds this length then it is split at `max_token_length` intervals. Defaults to `255`.
[docs] Migrate docs from AsciiDoc to Markdown (#123507) * delete asciidoc files * add migrated files * fix errors * Disable docs tests * Clarify release notes page titles * Revert "Clarify release notes page titles" This reverts commit 8be688648dcc9249943fbaabe37b0d58f87c09e8. * Comment out edternal URI images * Clean up query languages landing pages, link to conceptual docs * Add .md to url * Fixes inference processor nesting. --------- Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com> Co-authored-by: Liam Thompson <leemthompo@gmail.com> Co-authored-by: Martijn Laarman <Mpdreamz@gmail.com> Co-authored-by: István Zoltán Szabó <szabosteve@gmail.com> 2025-02-28 00:56:14 +08:00			`---`
			`navigation_title: "Whitespace"`
			`mapped_pages:`
			`- https://www.elastic.co/guide/en/elasticsearch/reference/current/analysis-whitespace-tokenizer.html`
			`---`

			`# Whitespace tokenizer [analysis-whitespace-tokenizer]`


			The `whitespace` tokenizer breaks text into terms whenever it encounters a whitespace character.


			`## Example output [_example_output_19]`

			```console
			`POST _analyze`
			`{`
			`"tokenizer": "whitespace",`
			`"text": "The 2 QUICK Brown-Foxes jumped over the lazy dog's bone."`
			`}`
			```

			`The above sentence would produce the following terms:`

			```text
			`[ The, 2, QUICK, Brown-Foxes, jumped, over, the, lazy, dog's, bone. ]`
			```


			`## Configuration [_configuration_22]`

			The `whitespace` tokenizer accepts the following parameters:

			`max_token_length`
			: The maximum token length. If a token is seen that exceeds this length then it is split at `max_token_length` intervals. Defaults to `255`.