Performing Time Series Analysis with Date Aggregation in Elasticsearch

Last Updated : 23 Jul, 2025

Time series analysis examines data collected over time, such as server logs, financial data, and IoT sensor readings. Elasticsearch, with its aggregation framework, is well suited to this kind of analysis. This article shows how to perform time series analysis using date aggregations in Elasticsearch, with worked examples and their outputs.

Introduction to Time Series Data and Elasticsearch

Time series data consists of sequences of data points indexed by time, often used to monitor and analyze trends over specific periods. Elasticsearch is a distributed, RESTful search and analytics engine capable of handling large volumes of time-stamped data. By leveraging its aggregation framework, we can efficiently perform various time-based analyses.

Setting Up Elasticsearch for Time Series Analysis

Before diving into aggregations, let's set up an index with sample time series data.

Creating an Index

We will create an index called server_metrics to store our time series data, which includes CPU usage metrics from different servers.

PUT /server_metrics
{
  "mappings": {
    "properties": {
      "timestamp": { "type": "date" },
      "cpu_usage": { "type": "float" },
      "server_id": { "type": "keyword" }
    }
  }
}

Ingesting Sample Data

Next, we'll ingest some sample data into the server_metrics index.

POST /server_metrics/_bulk
{ "index": {} }
{ "timestamp": "2023-05-01T01:00:00Z", "cpu_usage": 30.5, "server_id": "server1" }
{ "index": {} }
{ "timestamp": "2023-05-01T02:00:00Z", "cpu_usage": 45.3, "server_id": "server2" }
{ "index": {} }
{ "timestamp": "2023-05-01T03:00:00Z", "cpu_usage": 50.1, "server_id": "server1" }
{ "index": {} }
{ "timestamp": "2023-05-01T04:00:00Z", "cpu_usage": 75.0, "server_id": "server2" }
{ "index": {} }
{ "timestamp": "2023-05-01T05:00:00Z", "cpu_usage": 60.2, "server_id": "server1" }
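
The bulk request body is newline-delimited JSON: each document is preceded by an action line, and the body must end with a newline. As a minimal sketch, assuming the same five documents and using only the Python standard library, such a body could be assembled as shown here. The helper name build_bulk_body is hypothetical, and nothing is sent to a cluster.

```python
import json

# Sample documents copied from the bulk request above.
docs = [
    {"timestamp": "2023-05-01T01:00:00Z", "cpu_usage": 30.5, "server_id": "server1"},
    {"timestamp": "2023-05-01T02:00:00Z", "cpu_usage": 45.3, "server_id": "server2"},
    {"timestamp": "2023-05-01T03:00:00Z", "cpu_usage": 50.1, "server_id": "server1"},
    {"timestamp": "2023-05-01T04:00:00Z", "cpu_usage": 75.0, "server_id": "server2"},
    {"timestamp": "2023-05-01T05:00:00Z", "cpu_usage": 60.2, "server_id": "server1"},
]


def build_bulk_body(documents):
    """Return an NDJSON string: one action line plus one source line per document."""
    lines = []
    for doc in documents:
        # Empty action metadata: the target index is taken from the request URL.
        lines.append(json.dumps({"index": {}}))
        lines.append(json.dumps(doc))
    # The bulk API expects the final line to be terminated by a newline too.
    return "\n".join(lines) + "\n"


bulk_body = build_bulk_body(docs)
print(bulk_body, end="")
```

The resulting string can be sent as the body of POST /server_metrics/_bulk with the Content-Type header set to application/x-ndjson.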

Performing Date Aggregations

Elasticsearch provides several date aggregations for grouping and analyzing time series data. We will cover the most common ones: the date histogram, the date range, and nested (sub-)aggregations.

Date Histogram Aggregation

The date histogram aggregation groups data into buckets based on a specified interval (e.g., hourly, daily). This is useful for visualizing trends over time.

Example: Hourly Aggregation of CPU Usage

POST /server_metrics/_search
{
  "size": 0,
  "aggs": {
    "hourly_cpu_usage": {
      "date_histogram": {
        "field": "timestamp",
        "calendar_interval": "hour"
      },
      "aggs": {
        "average_cpu_usage": {
          "avg": {
            "field": "cpu_usage"
          }
        }
      }
    }
  }
}

Output:

{
  "aggregations": {
    "hourly_cpu_usage": {
      "buckets": [
        {
          "key_as_string": "2023-05-01T01:00:00.000Z",
          "key": 1682902800000,
          "doc_count": 1,
          "average_cpu_usage": {
            "value": 30.5
          }
        },
        {
          "key_as_string": "2023-05-01T02:00:00.000Z",
          "key": 1682906400000,
          "doc_count": 1,
          "average_cpu_usage": {
            "value": 45.3
          }
        },
        {
          "key_as_string": "2023-05-01T03:00:00.000Z",
          "key": 1682910000000,
          "doc_count": 1,
          "average_cpu_usage": {
            "value": 50.1
          }
        },
        {
          "key_as_string": "2023-05-01T04:00:00.000Z",
          "key": 1682913600000,
          "doc_count": 1,
          "average_cpu_usage": {
            "value": 75.0
          }
        },
        {
          "key_as_string": "2023-05-01T05:00:00.000Z",
          "key": 1682917200000,
          "doc_count": 1,
          "average_cpu_usage": {
            "value": 60.2
          }
        }
      ]
    }
  }
}

In this example, the CPU usage is aggregated hourly, and the average CPU usage for each hour is calculated.
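
To make the bucketing concrete, the following sketch reproduces the hourly grouping locally in Python. It illustrates the idea only and says nothing about how Elasticsearch implements it. The helper hourly_average is hypothetical, and timestamps are assumed to be in UTC. Each timestamp is floored to the start of its hour, and the bucket key is that instant in epoch milliseconds, matching the key field in the response.

```python
from collections import defaultdict
from datetime import datetime

# (timestamp, cpu_usage) pairs taken from the sample data.
samples = [
    ("2023-05-01T01:00:00Z", 30.5),
    ("2023-05-01T02:00:00Z", 45.3),
    ("2023-05-01T03:00:00Z", 50.1),
    ("2023-05-01T04:00:00Z", 75.0),
    ("2023-05-01T05:00:00Z", 60.2),
]


def hourly_average(rows):
    """Group readings by the hour they fall in and average each group."""
    buckets = defaultdict(list)
    for ts, cpu in rows:
        # Replace the "Z" suffix so older Python versions can parse the string.
        moment = datetime.fromisoformat(ts.replace("Z", "+00:00"))
        # Floor to the start of the hour; this is the bucket boundary.
        start = moment.replace(minute=0, second=0, microsecond=0)
        key_ms = int(start.timestamp()) * 1000
        buckets[key_ms].append(cpu)
    return {key: sum(vals) / len(vals) for key, vals in sorted(buckets.items())}


averages = hourly_average(samples)
for key, avg in averages.items():
    print(key, avg)
```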

Date Range Aggregation

The date range aggregation groups data into buckets based on specified date ranges. This is useful for comparing data across different time periods.

Example: Comparing CPU Usage in Different Time Ranges

POST /server_metrics/_search
{
  "size": 0,
  "aggs": {
    "cpu_usage_ranges": {
      "date_range": {
        "field": "timestamp",
        "ranges": [
          { "from": "2023-05-01T01:00:00Z", "to": "2023-05-01T03:00:00Z" },
          { "from": "2023-05-01T03:00:01Z", "to": "2023-05-01T05:00:00Z" }
        ]
      },
      "aggs": {
        "average_cpu_usage": {
          "avg": {
            "field": "cpu_usage"
          }
        }
      }
    }
  }
}

Output:

{
  "aggregations": {
    "cpu_usage_ranges": {
      "buckets": [
        {
          "key": "2023-05-01T01:00:00.000Z-2023-05-01T03:00:00.000Z",
          "from": 1682902800000,
          "to": 1682910000000,
          "doc_count": 2,
          "average_cpu_usage": {
            "value": 37.9
          }
        },
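        {
          "key": "2023-05-01T03:00:01.000Z-2023-05-01T05:00:00.000Z",
          "from": 1682910001000,
          "to": 1682917200000,
          "doc_count": 1,
          "average_cpu_usage": {
            "value": 75.0
          }
        }
      ]
    }
  }
}

This example compares CPU usage across two time ranges, with the average CPU usage calculated for each range. In a date range aggregation, from is inclusive and to is exclusive, so the documents stamped exactly 03:00:00 and 05:00:00 fall outside both ranges, and the second bucket contains only the 04:00:00 reading. To cover every document with contiguous buckets, start the second range at 2023-05-01T03:00:00Z and end it after the last timestamp.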
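
Because each range includes its from value and excludes its to value, documents that sit exactly on a boundary need care. The following sketch applies that half-open rule to the sample data locally in Python. It is an illustration under stated assumptions, not Elasticsearch code: range_buckets and parse are hypothetical helpers, and the range bounds are copied from the query above.

```python
from datetime import datetime


def parse(ts):
    """Parse an ISO 8601 UTC timestamp with a trailing "Z"."""
    return datetime.fromisoformat(ts.replace("Z", "+00:00"))


# (timestamp, cpu_usage) pairs taken from the sample data.
samples = [
    ("2023-05-01T01:00:00Z", 30.5),
    ("2023-05-01T02:00:00Z", 45.3),
    ("2023-05-01T03:00:00Z", 50.1),
    ("2023-05-01T04:00:00Z", 75.0),
    ("2023-05-01T05:00:00Z", 60.2),
]

# Range bounds copied from the date_range query.
ranges = [
    ("2023-05-01T01:00:00Z", "2023-05-01T03:00:00Z"),
    ("2023-05-01T03:00:01Z", "2023-05-01T05:00:00Z"),
]


def range_buckets(rows, bounds):
    """Count and average the readings in each half-open range [from, to)."""
    results = []
    for start, end in bounds:
        lo, hi = parse(start), parse(end)
        # Lower bound is inclusive, upper bound is exclusive.
        values = [cpu for ts, cpu in rows if lo <= parse(ts) < hi]
        avg = sum(values) / len(values) if values else None
        results.append({"from": start, "to": end, "doc_count": len(values), "avg": avg})
    return results


results = range_buckets(samples, ranges)
for bucket in results:
    print(bucket)
```

Running the same function with contiguous bounds, where one range's to equals the next range's from, assigns every boundary document to exactly one bucket.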

Nested Aggregations

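Nesting one aggregation inside another, also called a sub-aggregation, lets us break each bucket down further by additional criteria. This should not be confused with the nested aggregation type, which is used for fields mapped as nested. Here we nest a date histogram inside a terms aggregation to get hourly averages for each server.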

Example: Aggregating CPU Usage by Server and Hour

POST /server_metrics/_search
{
  "size": 0,
  "aggs": {
    "by_server": {
      "terms": {
        "field": "server_id"
      },
      "aggs": {
        "hourly_cpu_usage": {
          "date_histogram": {
            "field": "timestamp",
            "calendar_interval": "hour"
          },
          "aggs": {
            "average_cpu_usage": {
              "avg": {
                "field": "cpu_usage"
              }
            }
          }
        }
      }
    }
  }
}

Output:

{
  "aggregations": {
    "by_server": {
      "buckets": [
        {
          "key": "server1",
          "doc_count": 3,
          "hourly_cpu_usage": {
            "buckets": [
              {
                "key_as_string": "2023-05-01T01:00:00.000Z",
                "key": 1682902800000,
                "doc_count": 1,
                "average_cpu_usage": {
                  "value": 30.5
                }
              },
              {
                "key_as_string": "2023-05-01T03:00:00.000Z",
                "key": 1682910000000,
                "doc_count": 1,
                "average_cpu_usage": {
                  "value": 50.1
                }
              },
              {
                "key_as_string": "2023-05-01T05:00:00.000Z",
                "key": 1682917200000,
                "doc_count": 1,
                "average_cpu_usage": {
                  "value": 60.2
                }
              }
            ]
          }
        },
        {
          "key": "server2",
          "doc_count": 2,
          "hourly_cpu_usage": {
            "buckets": [
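              {
                "key_as_string": "2023-05-01T02:00:00.000Z",
                "key": 1682906400000,
                "doc_count": 1,
                "average_cpu_usage": {
                  "value": 45.3
                }
              },
              {
                "key_as_string": "2023-05-01T04:00:00.000Z",
                "key": 1682913600000,
                "doc_count": 1,
                "average_cpu_usage": {
                  "value": 75.0
                }
              }
            ]
          }
        }
      ]
    }
  }
}

Each server bucket contains its own hourly histogram, so the average CPU usage can be read per server and per hour.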

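The two-level grouping in the server-and-hour example can be sketched locally in Python as a dictionary of dictionaries: the outer level plays the role of the terms buckets and the inner level the hourly histogram. This is only an illustration. by_server_and_hour is a hypothetical helper, timestamps are assumed to be UTC, and servers are ordered by document count, largest first, to mirror the default ordering of a terms aggregation.

```python
from collections import defaultdict
from datetime import datetime

# (timestamp, cpu_usage, server_id) triples taken from the sample data.
samples = [
    ("2023-05-01T01:00:00Z", 30.5, "server1"),
    ("2023-05-01T02:00:00Z", 45.3, "server2"),
    ("2023-05-01T03:00:00Z", 50.1, "server1"),
    ("2023-05-01T04:00:00Z", 75.0, "server2"),
    ("2023-05-01T05:00:00Z", 60.2, "server1"),
]


def by_server_and_hour(rows):
    """Group readings by server, then by hour, and average each inner group."""
    tree = defaultdict(lambda: defaultdict(list))
    counts = defaultdict(int)
    for ts, cpu, server in rows:
        moment = datetime.fromisoformat(ts.replace("Z", "+00:00"))
        hour = moment.replace(minute=0, second=0, microsecond=0)
        tree[server][hour.isoformat()].append(cpu)
        counts[server] += 1
    # Outer buckets: most documents first, ties broken by server name.
    ordered = sorted(tree, key=lambda name: (-counts[name], name))
    return {
        server: {
            hour: sum(vals) / len(vals)
            for hour, vals in sorted(tree[server].items())
        }
        for server in ordered
    }


result = by_server_and_hour(samples)
for server, hours in result.items():
    for hour, avg in hours.items():
        print(server, hour, avg)
```
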
Conclusion

Date aggregation in Elasticsearch is a powerful tool for time series analysis. Date histograms, date ranges, and nested sub-aggregations let you analyze time series data at different granularities and break it down by additional fields. Whether you are analyzing server logs, monitoring IoT devices, or tracking financial data, these aggregations give you the flexibility to make sense of time-based data. With the examples covered in this guide, you should be able to perform time series analysis in Elasticsearch on your own data.

kumarsar29u2
