elasticsearch

Commit Graph

Author	SHA1	Message	Date
István Zoltán Szabó	e45d7a942d	[DOCS] Expands feature processors property description and adds a link of conceptual docs (#68213 )	2021-02-02 14:48:43 +01:00
Valeriy Khakhutskyy	78368428b3	[ML] Add early stopping DFA configuration parameter (#68099 ) The PR adds early_stopping_enabled optional data frame analysis configuration parameter. The enhancement was already described in elastic/ml-cpp#1676 and so I mark it here as non-issue.	2021-02-01 11:41:28 +01:00
Dimitris Athanasiou	5c961c1c81	[ML] Expand regression/classification hyperparameters (#67950 ) Expands data frame analytics regression and classification analyses with the followin hyperparameters: - alpha - downsample_factor - eta_growth_rate_per_tree - max_optimization_rounds_per_hyperparameter - soft_tree_depth_limit - soft_tree_depth_tolerance	2021-01-26 12:56:41 +02:00
István Zoltán Szabó	addb5cbd3a	[DOCS] Adds custom feature processors description to PUT DFA API (#67424 ) Co-authored-by: Benjamin Trent <ben.w.trent@gmail.com>	2021-01-19 09:47:32 +01:00
Dimitris Athanasiou	7574013604	[ML] Remove DFA job states reindexing and analyzing from docs (#67658 ) These states do no longer exist as of #67423	2021-01-18 17:39:22 +02:00
Benjamin Trent	35f478b618	[ML] [DOCS] adding missing fields to the get trained models API docs (#67590 ) Adds missing fields description, inference_config, and input to the GET trained models API documentation	2021-01-15 13:20:53 -05:00
Benjamin Trent	24ebcc8c24	[ML] [DOCS] update find-structure reference docs (#67586 ) The text structure finder API documentation had many references to the "files". While this is one use of the API, the API now has a more generic name. This commit replaces many references to the word "file" to the more generic word "text".	2021-01-15 12:19:38 -05:00
István Zoltán Szabó	085a288af5	[DOCS] Adds hyperparameter metadata property to GET trained models API docs. (#67412 )	2021-01-13 13:49:51 +01:00
Lisa Cawley	401d302c69	[DOCS] Move find file structure to a new API endpoint (#67314 )	2021-01-12 11:59:45 -08:00
Benjamin Trent	af179ab2f5	[ML] move find file structure to a new API endpoint (#67123 ) This introduces a new `text-structure` plugin. This is the new home of the find file structure API. The old REST URL is still available but is deprecated. The new URL is: `_text_structure/find_structure`. All parameters and behavior are unchanged. Changes to the high-level REST client and docs will be in separate commit. related to: https://github.com/elastic/elasticsearch/issues/67001	2021-01-11 08:56:02 -05:00
Lisa Cawley	eff9dfc3a4	[DOCS] Clarify impact of delayed data in anomaly detection (#66816 ) Co-authored-by: Benjamin Trent <ben.w.trent@gmail.com>	2021-01-05 12:14:51 -08:00
István Zoltán Szabó	d3ad9fe632	[DOCS] Improves inference processor linking and docs (#66119 )	2021-01-05 09:42:06 +01:00
David Roberts	c5bef7f9a7	[ML] Deprecate anomaly detection post data endpoint (#66347 ) There is little evidence of this endpoint being used and there is quite a lot of code complexity associated with the various formats that can be used to upload data and the different errors that can occur when direct data upload is open to end users. In a future release we can make this endpoint internal so that only datafeeds can use it, and remove all the options and formats that are not used by datafeeds. End users will have to store their input data for anomaly detection in Elasticsearch indices (which we believe all do today) and use a datafeed to feed it to anomaly detection jobs.	2020-12-15 18:37:20 +00:00
Dimitris Athanasiou	3bed6661de	[ML] Add log_time to AD data_counts and decide current based on it (#66343 ) This commit is fixing a potential bug if we support anomaly detection results index rollover in the future. In particular, we determine the current `data_counts` by sorting on the latest record time. However, this is not correct if the job reverts to an older model snapshot. To fix this we add `log_time` to `data_counts` (similarly to `model_size_stats`) and sort on `log_time` to figure out the current counts for the job.	2020-12-15 19:09:13 +02:00
István Zoltán Szabó	bc989e4a86	[DOCS] Adds note about data_counts values to Revert snapshot API docs. (#66085 )	2020-12-09 10:47:51 +01:00
István Zoltán Szabó	3081cf4944	[DOCS] Adds empty snapshot_id description to revert snapshot API docs (#66036 )	2020-12-09 10:01:26 +01:00
David Kyle	22dadfd407	[ML] Docs and HRLC for datafeed runtime mappings (#65810 ) For the changes in #65606	2020-12-08 10:06:58 +00:00
David Roberts	49e492f313	[ML] Adding assignment_memory_basis to model_size_stats (#65561 ) At present the Java code makes a decision on whether to use current model memory or model memory limit to calculate how much memory a job requires to be assigned. The plan is to move this decision to the C++ code, which will report it via a new field in the model size stats. An additional change will be that once we have made the switch from using model memory limit to using current model memory we will never switch back, as this causes large fluctuations up and down in memory requirement which will be much more noticeable when autoscaling is in use. Although the only two options at present are model memory limit and current model memory, the new enum includes a third possibility, peak model memory. To switch to this now would be tricky, as there have been two bugs in the implementation of peak model memory which render its value unreliable in 7.x. However, in 8.x it might make sense to switch to using peak model memory instead of current model memory and it's much easier from a BWC perspective if the enum contains all the values from the start. Relates #63163	2020-12-03 17:18:08 +00:00
David Roberts	fc72b39a17	[ML] Adjusting soft_limit description (#65383 ) This PR adds detail to the explanation of the soft_limit memory_status in ML job stats. A consequence that was not mentioned before is that examples are not added to category definitions. Relates elastic/ml-cpp#1590	2020-11-24 09:35:07 +00:00
István Zoltán Szabó	a85fb5534a	[DOCS] Fixes typo in Aggregating data for faster performance. (#65354 )	2020-11-23 12:44:59 +01:00
István Zoltán Szabó	f1e54a63a1	[DOCS] Adds UI related limitation to configuring aggs docs (#65184 ) Co-authored-by: Lisa Cawley <lcawley@elastic.co>	2020-11-20 19:03:18 +01:00
István Zoltán Szabó	1e045da339	[DOCS] Makes the screenshot larger on the custom URLs page. (#65269 )	2020-11-20 09:29:39 +01:00
David Roberts	e4ce39845b	[ML] Add total ML memory to ML info (#65195 ) This change adds an extra piece of information, limits.total_ml_memory, to the ML info response. This returns the total amount of memory that ML is permitted to use for native processes across all ML nodes in the cluster. Some of this may already be in use; the value returned is total, not available ML memory.	2020-11-18 15:06:21 +00:00
Lisa Cawley	9fef6e7b7e	[DOCS] Adds new snapshot upgrade API (#65095 )	2020-11-16 09:48:07 -08:00
István Zoltán Szabó	95a0ed4304	[DOCS] Adds recommendation about when to use chunking_config in manual mode. (#65060 )	2020-11-16 16:12:07 +01:00
Benjamin Trent	33de89d94c	[ML] add new snapshot upgrader API for upgrading older snapshots (#64665 ) This new API provides a way for users to upgrade their own anomaly job model snapshots. To upgrade a snapshot the following is done: - Open a native process given the job id and the desired snapshot id - load the snapshot to the process - write the snapshot again from the native task (now updated via the native process) relates #64154	2020-11-12 10:45:56 -05:00
István Zoltán Szabó	db15c4d6b9	[DOCS] Adds scroll_size maximum value to datafeeds API docs (#64986 )	2020-11-12 15:53:53 +01:00
István Zoltán Szabó	9ed907bc75	[DOCS] Fixes example aggregation syntax in datafeed aggregations. (#64936 )	2020-11-11 16:33:36 +01:00
Lisa Cawley	919c79b745	[DOCS] Add custom feature processor example (#64681 )	2020-11-06 09:24:01 -08:00
James Rodewig	1ea83359bb	[DOCS] Fix case for 'Boolean' (#64299 )	2020-10-29 09:04:43 -04:00
István Zoltán Szabó	6093518f4a	[DOCS] Changes experimental flag to beta in DFA related docs (#63992 )	2020-10-26 17:02:46 +01:00
Lisa Cawley	a00c7a2b6c	[DOCS] Add tips for num_top_classes classification parameter (#63781 )	2020-10-21 09:27:13 -07:00
István Zoltán Szabó	9defe10616	[DOCS] Expands DFA evaluation API docs with the default set of metrics (#63971 )	2020-10-21 14:30:33 +02:00
Benjamin Trent	c1de07fa83	[ML] adding new flag exclude_generated that removes generated fields in GET config APIs (#63899 ) When exporting and cloning ml configurations in a cluster it can be frustrating to remove all the fields that were generated by the plugin. Especially as the number of these fields change from version to version. This flag, exclude_generated, allows the GET config APIs to return configurations with these generated fields removed. APIs supporting this flag: - GET _ml/anomaly_detection/<job_id> - GET _ml/datafeeds/<datafeed_id> - GET _ml/data_frame/analytics/<analytics_id> The following fields are not returned in the objects: - any field that is not user settable (e.g. version, create_time) - any field that is a calculated default value (e.g. datafeed chunking_config) - any field that is automatically set via another Elastic stack process (e.g. anomaly job custom_settings.created_by) relates to #63055	2020-10-20 11:28:29 -04:00
Dimitris Athanasiou	03ed7de6c1	[ML] Rename evaluation metric result fields to value (#63809 ) Renames data frame analytics _evaluate API results as follows: - per class accuracy renamed from `accuracy` to `value` - per class precision renamed from `precision` to `value` - per class recall renamed from `recall` to `value` - auc_roc `score` renamed to `value` for both outlier detection and classification	2020-10-20 10:30:50 +03:00
David Roberts	977a4ad3f9	[ML] Change docs test mute comment (#63866 ) The original comment mentioned issue #48583, but issue #48941 is specifically open for this mute. However, this is inappropriate, as the underlying reason the test cannot be unmuted is the same as for all the other tests skipped with the comment "Kibana sample data": issues #51572, #51576 and #51678. Closes #48941	2020-10-19 10:17:27 +01:00
Przemysław Witek	d9e7d88f08	[ML] Allow setting num_top_classes to a special value -1 (#63587 )	2020-10-13 13:14:17 +02:00
István Zoltán Szabó	e8930a44a4	[DOCS] Adds AUC ROC classification metric to the API examples (#63563 )	2020-10-13 11:03:20 +02:00
István Zoltán Szabó	b517d4d9b5	[DOCS] Adds huber and msle metrics to Evaluate API example calls (#63414 )	2020-10-08 17:05:04 +02:00
Przemysław Witek	b0019bd0a6	[ML] Validate that AucRoc has the data necessary to be calculated (#63302 )	2020-10-08 08:19:43 +02:00
lcawl	2177b46289	[DOCS] Fixes typo	2020-10-06 09:19:43 -07:00
Lisa Cawley	49ab8f8688	[DOCS] Add feature_importance_baseline to get trained model API (#63279 ) Co-authored-by: Benjamin Trent <ben.w.trent@gmail.com>	2020-10-06 07:56:55 -07:00
István Zoltán Szabó	de3ce8bc39	[DOCS] Adds delta and offset parameters to Evaluate DFA API docs (#63317 )	2020-10-06 16:06:35 +02:00
Lisa Cawley	51f9bf657d	[DOCS] Fix titles for ML APIs (#63152 )	2020-10-02 11:53:49 -07:00
István Zoltán Szabó	baffdd1ec0	[DOCS] Updates trained models API docs titles. (#63165 )	2020-10-02 10:15:14 -07:00
Benjamin Trent	7bd6e78dae	[ML] adding for_export flag for ml plugin GET resource APIs (#63092 ) This adds the new `for_export` flag to the following APIs: - GET _ml/anomaly_detection/<job_id> - GET _ml/datafeeds/<datafeed_id> - GET _ml/data_frame/analytics/<analytics_id> The flag is designed for cloning or exporting configuration objects to later be put into the same cluster or a separate cluster. The following fields are not returned in the objects: - any field that is not user settable (e.g. version, create_time) - any field that is a calculated default value (e.g. datafeed chunking_config) - any field that would effectively require changing to be of use (e.g. datafeed job_id) - any field that is automatically set via another Elastic stack process (e.g. anomaly job custom_settings.created_by) closes https://github.com/elastic/elasticsearch/issues/63055	2020-10-02 08:29:19 -04:00
Benjamin Trent	1084aaf18a	[ML] renames /inference apis to /trained_models (#63097 ) This commit renames all `inference` CRUD APIs to `trained_models`. This aligns with internal terminology, documentation, and use-cases.	2020-10-01 12:13:49 -04:00
Przemysław Witek	cd1a27f273	[ML] Implement AucRoc metric for classification (#60502 )	2020-09-30 08:56:23 +02:00
Lisa Cawley	e48eab95e9	[DOCS] Formatting fix in get trained model API (#62643 )	2020-09-21 08:19:37 -07:00
Benjamin Trent	a653a1cbb8	[ML] all multiple wildcard values for GET Calendars, Events, and DELETE forecasts (#62563 ) This commit adjusts the following APIs so now they not only support an `_all` case, but wildcard patterned Ids as well. - `GET _ml/calendars/<calendar_id>/events` - `GET _ml/calendars/<calendar_id>` - `GET _ml/anomaly_detectors/<job_id>/model_snapshots/<snapshot_id>` - `DELETE _ml/anomaly_detectors/<job_id>/_forecast/<forecast_id>`	2020-09-18 09:39:40 -04:00

1 2 3 4 5 ...

344 Commits