Rejected requests

Stack ECE ECK ECH Self-Managed

When Elasticsearch rejects a request, it stops the operation and returns an error with a 429 response code. Rejected requests are commonly caused by:

A depleted thread pool. A depleted search or write thread pool returns a TOO_MANY_REQUESTS error message.
A circuit breaker error.
High indexing pressure that exceeds the indexing_pressure.memory.limit.

Tip

If you're using Elastic Cloud Hosted, then you can use AutoOps to monitor your cluster. AutoOps significantly simplifies cluster management with performance recommendations, resource utilization visibility, and real-time issue detection with resolution paths. For more information, refer to Monitor with AutoOps.

Check rejected tasks

To check the number of rejected tasks for each thread pool, use the cat thread pool API. A high ratio of rejected to completed tasks, particularly in the search and write thread pools, means Elasticsearch regularly rejects requests.

 GET /_cat/thread_pool?v=true&h=id,name,queue,active,rejected,completed

write thread pool rejections frequently appear in the erring API and correlating log as EsRejectedExecutionException with either QueueResizingEsThreadPoolExecutor or queue capacity.

These errors are often related to backlogged tasks.

See this video for a walkthrough of troubleshooting threadpool rejections.

Check circuit breakers

To check the number of tripped circuit breakers, use the node stats API.

 GET /_nodes/stats/breaker

These statistics are cumulative from node startup. For more information, see circuit breaker errors.

See this video for a walkthrough of diagnosing circuit breaker errors.

Check indexing pressure

To check the number of indexing pressure rejections, use the node stats API.

 GET _nodes/stats?human&filter_path=nodes.*.indexing_pressure

These stats are cumulative from node startup.

Indexing pressure rejections appear as an EsRejectedExecutionException, and indicate that they were rejected due to combined_coordinating_and_primary, coordinating, primary, or replica.

These errors are often related to backlogged tasks, bulk index sizing, or the ingest target's refresh_interval setting.

Note

Stack 9.1.0 Serverless Another cause of indexing pressure rejections might be the use of the semantic_text field type, which can cause rejections when indexing large batches of documents if the batch may otherwise incur an Out of Memory (OOM) error.

See this video for a walkthrough of diagnosing indexing pressure rejections.

Prevent rejected requests

Fix high CPU and memory usage

If Elasticsearch regularly rejects requests and other tasks, your cluster likely has high CPU usage or high JVM memory pressure. For tips, see High CPU usage and High JVM memory pressure.

Fix for `semantic_text` ingestion issues

Stack 9.1.0 Serverless

When bulk indexing documents with the semantic_text field type, you may encounter rejections due to high memory usage during inference processing. These rejections will appear as an InferenceException in your cluster logs.

To resolve this issue:

Reduce the batch size of documents in your indexing requests.
If reducing batch size doesn't resolve the issue, then consider scaling up your machine resources.
Serverless Unavailable A last resort option is to adjust the indexing_pressure.memory.coordinating.limit cluster setting. The default value is 10% of the heap. Increasing this limit allows more memory to be used for coordinating operations before rejections occur.

Warning

This adjustment should only be considered after exhausting other options, as setting this value too high may risk Out of Memory (OOM) errors in your cluster. A cluster restart is required for this change to take effect.