6/6 Support gzipped OTLP ingest requests by gareth-ellis · Pull Request #2139 · elastic/rally

gareth-ellis · 2026-05-29T14:11:17Z

Summary

Adds an optional gzip operation parameter that turns on Content-Encoding: gzip for OTLP ingest.

Records are pre-compressed at prepare-track time into a sibling .pbgz corpus file (each record independently gzipped). The benchmark hot path ships those bytes verbatim — no in-process compression, no decompress/recompress. Operations sharing a corpus can opt in independently; when a corpus has both gzip and non-gzip consumers, Rally generates both .pb and .pbgz.

Measured impact: ~12% throughput improvement against an ES cluster where coordinator-bytes backpressure is the bottleneck (gzip reduces decompressed body size accounted against the bucket).

Depends on #2138 — merge after #2138 (which in turn depends on #2135, #2136, #2137). Part 6 of 6, completes the OTLP ingest series.

Series

1/6 Don't crash on non-UTF-8 ApiError bodies #2134 — Don't crash on non-UTF-8 ApiError bodies
2/6 Add OTLP binary protobuf core IO and track preparation #2135 — Add OTLP binary protobuf core IO and track preparation
3/6 Add OtlpParamSource for OTLP corpora #2136 — Add OtlpParamSource for OTLP corpora
4/6 Add OtlpIngest runner with backpressure-aware retries #2137 — Add OtlpIngest runner
5/6 Parallelize OTLP corpus generation; download compressed .pb when available #2138 — Parallelize OTLP corpus generation
6/6 Support gzipped OTLP ingest requests #2139 (this PR) — Support gzipped OTLP ingest requests

Test plan

New tests: gzip flag toggles .pbgz path, body bytes are gzip-magic-prefixed, Content-Encoding header set correctly
All 321 OTLP tests green
pre-commit clean

🤖 Generated with Claude Code

The ApiError handler in execute_single() decodes `e.body`, `e.error`, and `e.info` as UTF-8 to build a human-readable error message. When the body is binary (e.g., binary protobuf returned by ES OTLP endpoints on 4xx/5xx), the strict decode raises UnicodeDecodeError, which crashes the worker mid-task. Switch the six decode() calls to use errors="replace" so undecodable bytes become U+FFFD instead of aborting the worker. No semantic change for valid UTF-8 (the common case). This is a latent bug independent of OTLP — any operation that surfaces a binary error body would have hit it.

Introduces OtlpProtobufFile in esrally/utils/io.py for reading/writing length-prefixed OTLP ExportMetricsServiceRequest protobufs, plus an offset sidecar to allow worker partitions to seek without scanning. Wires preparation into esrally/track/loader.py and esrally/track/track.py: - New OTLP document set fields (otlp_pb_size_in_bytes, etc.) - prepare_otlp_document_set tries to download a .pb from the corpus base URL, otherwise converts a local JSON corpus to .pb on disk. - set_absolute_data_path picks up the .pb when present. Adds OTLP protobuf bindings to pyproject.toml (opentelemetry-proto). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

OtlpParamSource streams length-prefixed protobuf records out of an OtlpProtobufFile, partitions them across workers using the offset sidecar, and surfaces percent_completed so the progress bar tracks real progress. Supports a "looped" mode that cycles the partition indefinitely for time-bound benchmarks. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

OtlpIngest POSTs serialized protobuf bytes to the OTLP metrics endpoint, disabling transport-level fast retries in favour of an explicit exponential-with-full-jitter backoff loop. 429/502/503/504 and connection errors are retried; non-retryable ApiErrors return a failure dict so the driver records the error without crashing the worker. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

…lable Adds two perf improvements for OTLP corpora that don't change the on-the-wire benchmark behaviour: 1. Parallel .pb generation. OtlpProtobufFile.create() now uses a ProcessPoolExecutor pipeline (worker count tunable via RALLY_OTLP_CONVERSION_WORKERS) so converting a multi-GB JSON corpus completes in minutes instead of hours. 2. Compressed .pb download. When the JSON corpus is published in a compressed archive, prepare-track first tries the matching compressed .pb (e.g. .pb.zst) from the corpus URL and decompresses locally — typically 2-4x less network bytes than the raw .pb. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

Adds a "gzip" operation parameter that turns on Content-Encoding: gzip for OTLP ingest. Records are pre-compressed at prepare-track time and stored in a sibling .pbgz corpus file, so the benchmark hot path ships the bytes verbatim with no in-process compression. Operations sharing a corpus can opt in independently — Rally generates both .pb and .pbgz when needed. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

gareth-ellis and others added 6 commits May 29, 2026 14:41

gareth-ellis requested a review from a team as a code owner May 29, 2026 14:11

gareth-ellis changed the base branch from otlp-pr5-perf to master May 29, 2026 14:32

gareth-ellis changed the title ~~Support gzipped OTLP ingest requests~~ 6/6 Support gzipped OTLP ingest requests May 29, 2026

gareth-ellis mentioned this pull request May 29, 2026

Add OTLP support to rally #2127

Draft

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

6/6 Support gzipped OTLP ingest requests#2139

6/6 Support gzipped OTLP ingest requests#2139
gareth-ellis wants to merge 6 commits into
masterfrom
otlp-pr6-gzip-wire

gareth-ellis commented May 29, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

gareth-ellis commented May 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Series

Test plan

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

gareth-ellis commented May 29, 2026 •

edited

Loading