Until now, we have extracted only a small number of fields from the binary files sent to the ingest attachment plugin:
* `content`,
* `title`,
* `author`,
* `keywords`,
* `date`,
* `content_type`,
* `content_length`,
* `language`.
Tika defines a list of additional standard properties that can also be extracted:
* `modified`,
* `format`,
* `identifier`,
* `contributor`,
* `coverage`,
* `modifier`,
* `creator_tool`,
* `publisher`,
* `relation`,
* `rights`,
* `source`,
* `type`,
* `description`,
* `print_date`,
* `metadata_date`,
* `latitude`,
* `longitude`,
* `altitude`,
* `rating`,
* `comments`.
This commit exposes those new fields.
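As a rough illustration of how a user might opt in to some of these fields, here is a minimal sketch that builds an ingest pipeline body using the attachment processor's `field` and `properties` options. It assumes the new fields are selectable through `properties` under the same names as listed above. The helper `build_pipeline`, the source field name `data`, and the description text are hypothetical and only serve the example.

```python
import json

# New field names copied from the list above. Whether each one is accepted
# verbatim by the `properties` option is an assumption of this sketch.
NEW_PROPERTIES = [
    "modified", "format", "identifier", "contributor", "coverage",
    "modifier", "creator_tool", "publisher", "relation", "rights",
    "source", "type", "description", "print_date", "metadata_date",
    "latitude", "longitude", "altitude", "rating", "comments",
]


def build_pipeline(properties, field="data"):
    """Return an ingest pipeline body (hypothetical helper).

    `field` names the document field holding the base64-encoded binary;
    "data" is just a placeholder chosen for this example.
    """
    return {
        "description": "Extract selected attachment properties",
        "processors": [
            {"attachment": {"field": field, "properties": list(properties)}}
        ],
    }


# Keep two of the original fields and add the geographic ones.
pipeline = build_pipeline(["content", "title", "latitude", "longitude", "altitude"])
print(json.dumps(pipeline, indent=2))
```

The resulting JSON would be sent as the body of a pipeline creation request. Restricting `properties` to the fields actually needed keeps the indexed documents small.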
Related to #22339.
Co-authored-by: Keith Massey <keith.massey@elastic.co>