Introduction to the OpenTelemetry Collector

On this page

The OpenTelemetry Collector is a stand-alone service designed to collect, process, and export telemetry data such as logs, metrics, and traces. It provides a vendor-neutral way to manage this data, offering flexibility in configuration and deployment. The collector is extremely lightweight, and can run in almost any environment without significant infrastructure overhead.

Setting Up Observability with OpenTelemetry

Instrumentation

To begin, you need to instrument your code with OpenTelemetry client libraries. These libraries help generate telemetry signals such as logs, metrics, and traces from your application.

Data Collection and Processing

Once the telemetry data is generated, it can be exported directly to a backend or processed through the OpenTelemetry Collector. Using a collector helps offload the responsibility of data management from the application, making it easier to handle different data pipelines. To use the collector, after setting up your first collector instance, you’ll configure your application’s OpenTelemetry installation to send data to your collector, generally via the OpenTelemetry protocol or OTLP.

Deployment Options

The OpenTelemetry Collector can be deployed in multiple ways:

As an agent: Installed on the same host as the application reporting data. Generally one collector per application
As a standalone service with a gateway: Runs independently, receiving telemetry from multiple sources, possibly with a load balancer. There may be a load balancer between all applications and multiple collectors, but this is the simplest version

In larger deployments, a combination of agents and standalone services may be employed to manage scale.

Data Storage

The OpenTelemetry collector is fully stateless, it produces no dashboards and stores no data. It doesn’t even create an API endpoint to get status information. The collector is only useful when transmitting data, so your OpenTelemetry data needs somewhere to go. SaaS tools like Coralogix can receive your data, or you’ll need to set up your own datastore with something like Prometheus.

OpenTelemetry and the CNCF

The OpenTelemetry framework is part of the Cloud Native Computing Foundation (CNCF) and aims to standardize the handling of telemetry data. It provides a consistent interface to collect and export data across many programming languages. The collector is an important part of this mission, since a standardized proxy for all OpenTelemetry data helps different teams in different languages form a shared understanding of how their data is collected, processed, and transmitted.

Key Functions of the OpenTelemetry Collector

Collection: Collects telemetry data in various formats from multiple sources.
Processing: Applies transformations, such as sampling, batching, or removing sensitive data.
Exporting: Sends telemetry data to different backend systems.

Components of the OpenTelemetry Collector

Receivers:

These accept data into the collector. Common formats include OTLP, Jaeger, and Prometheus.
Processors:

Apply operations on the data, such as batching, retries, or adding metadata. Processors can also handle privacy-related tasks like removing personally identifiable information (PII). See our advanced guide on filtering data with the OpenTelemetry collector.
Exporters:

These send data to backends. You can send metrics to tools like Prometheus and traces to Jaeger or other supported systems.

Configuring the OpenTelemetry Collector

The configuration is managed via a YAML file, defining how receivers, processors, and exporters are connected in pipelines.

Example Configuration

Receivers

receivers:
  otlp:
    protocols:
      grpc:
      http:
    prometheus:
      config:
        scrape_configs:
          - job_name: 'otel-collector'
            scrape_interval: 5s
            static_configs:
              - targets: ['0.0.0.0:8888']

There are a large number of receivers for multiple formats of data coming in to an OpenTelemetry system. Note that, sadly, not all Prometheus data can be translated 1:1 into OTLP. The Prometheus receiver on Github is currently listed as a work in progress.

Processors

processors:
  batch:
    send_batch_size: 1000
    timeout: 10s
  memory_limiter:
    ballast_size_mib: 683
    limit_mib: 1500
    check_interval: 5s
  queued_retry:
    num_workers: 4
    retry_on_failure: true

Processors are the most diverse component within a collector, with a processor doing anything from removing sensitive data, batching data for transmission, or filtering unwanted information. These examples are all about how data will be batched, the max memory to use (after which data will be thrown out in this reporting cycle), and retries for sending data.

Exporters

exporters:
  prometheus:
    endpoint: "localhost:9090"
  jaeger:
    endpoint: "localhost:14250"

At their most basic, exporters will contain where the data is headed.

Pipelines

pipelines:
  traces:
    receivers: [jaeger, otlp]
    processors: [batch, memory_limiter]
    exporters: [jaeger]
  metrics:
    receivers: [otlp]
    processors: [batch]
    exporters: [prometheus]

Pipelines a path from receiver to exporter, and any data will go through the processors listed in order, left to right.

Backend Options for OpenTelemetry Metrics and Traces

After setting up the OpenTelemetry Collector, you will need tools to visualize and analyze the data. Prometheus is a popular choice for storing and querying metrics, while Grafana can visualize these metrics with dashboards. For tracing data, Jaeger is commonly used.

Prometheus: Ideal for collecting and storing time-series metrics.
Grafana: A visualization tool compatible with Prometheus metrics.
Jaeger: Focuses on distributed tracing for microservices-based applications.

Use Cases

The OpenTelemetry Collector is useful for:

Redacting sensitive data: Ensure compliance with privacy and security standards.
Batching telemetry data: Compress data and reduce the number of outgoing connections.
Customizing telemetry pipelines: Modify attributes, transform metrics, or sample traces to suit your observability needs.

FAQs

Do I have to deploy an OpenTelemetry collector?

You don’t have to use a collector to gather OpenTelemetry data. In the simplest example where you’re monitoring a single monolithic application, the difference without a collector would be at the time of reporting data: instead of traces, logs, and metrics going to a collector; data would travel directly to the storage endpoint. If you’re starting out with a demo project, or you’re only using OpenTelemetry for a very limited purpose (for example you’re only implementing OpenTelemetry to send a few key traces to Checkly), it may make sense not to use a collector at all.

What is the difference between the OpenTelemetry Agent and Collector?

There isn’t an ‘agent’ as such that’s part of the OpenTelemetry model. As mentioned above the collector may be deployed in an ‘agent’ pattern where the collector is running on the same host as the application, but the term ‘agent’ is a little overloaded in observibility and requires brief disambugation. The other part of observability sometimes called an ‘agent’ is a process running within an application that receives data on the system. For example, automatic instrumentation of Java applications is made possible by the standard javaagent jvm argument. Using an agent to observe your application will depend on your language library’s implementation.

The collector is not an ‘agent’ running within an application, it runs outside your application and collects and forwards data.