Fast single node rate limiter implementing Token Bucket algorithm.
The goal is to provide dependable solution that JustWorks™ with a
focus on performance, correctness and ease of use. Bucket data is
stored using :atomics module. Bucket references are stored in
ETS and optionally cached as persistent terms.
-
lock-free and race-free with compare-and-swap operations
-
BlazingFast™ performance, see benchmarks section. Req/s go brrrrrr
-
monotonic timer for correct calculations
-
millisecond tick supporting wider range of parameters and preventing request starvation
-
automatic calculation of bucket parameters based on target rate and burst size (for fixed cost requests)
-
handy timeouts for retries
-
support for token "refunds" and variable cost requests
-
compile-time validation of arguments when possible
Add it to your list of dependencies in mix.exs and run mix deps.get:
def deps do
[
{:atomic_bucket, "~> 0.2"}
]
endYou must start AtomicBucket server for each bucket table you want to use - without it the library will not work. Add to your application:
children = [.., AtomicBucket, ..]This will once per hour clean buckets that haven't had requests in
the last 24 hours. See start_link/1 for info about available options.
For simple cases where requests have fixed cost use request/5 macro with desired
rate and burst parameters. When possible, call the macro with literal arguments
for better performance and compile-time validation. Module attributes are fine too.
require AtomicBucket
# Averate rate: 10 reqs/s with 3 burst requests.
case AtomicBucket.request(:mybucket, 1, 10, 3) do
{:allow, count, _ref} ->
# Request is allowed. May immediately attempt to make additional
# <count> calls.
{:deny, timeout, _ref} ->
# Request is denied. The bucket may have enough tokens in <timeout>
# milliseconds.
end
Bucket id can be any term.
AtomicBucket.request({:client, ip_addr}, 1, 10, 3)Cache bucket reference in :persistent_term for better performance.
Good fit for buckets with low churn. Best for fixed buckets like per-user-id
rate limits. See :persistent_term docs
for more info on the tradeoffs.
AtomicBucket.request(:mybucket, 1, 10, 3, persistent: true)Reuse bucket references in long running processes for top performance.
{:allow, _requests, bucket_ref} = AtomicBucket.request(:mybucket, 1, 10, 3)
AtomicBucket.request(:mybucket, 1, 10, 3, ref: bucket_ref)Use raw_request/5 macro to implement advanced features such as token "refunds"
or variable cost. It supports same options as request/5
# This would be 10 req/s with 2 burst requests in fixed cost scenario
{:allow, tokens, ref} = AtomicBucket.raw_request(:mybucket, 200, 1, 100)
# But the next request may have a different cost
AtomicBucket.raw_request(:mybucket, 200, 1, 150)
# Token "refund" is always allowed
AtomicBucket.raw_request(:mybucket, 200, 1, -100)To implement different retention policies start multiple servers and
use the table option of request/5 and raw_request/5. Bucket ids are
table-scoped and don't have to be globally unique.
The library is optimized for common cases where rate limiters are used. Extremely slow/fast rates and/or huge bursts may exceed the bucket storage limits (64 bits). In practice, most people wouldn't need these extreme parameters.
The library makes no effort to ensure that bucket parameters remain
stable across calls: the parameters are not stored at all! Using same bucket
with different parameters will result in silent bugs. This also applies to
mixing request/5 and raw_request/5 - it must be avoided. No guarantee
of future compatibility of the two is provided.
The library provides a benchmark measuring series of 1000 rate limit checks (see bench/benchmark.exs). It serves as an illustration of available options. In general, such benchmarks should be taken with a grain of salt:
-
they often use artificial conditions different from real life, thus devaluing the results
-
above certain point a rate limiter is "fast enough" for many purposes. For BEAM this probably means "ETS or faster". Any atomics-based library should be much faster than ETS and is likely the fastest you can get.
-
comparing results between differently implemented or completely different algorithms is neither 100% valid nor very useful. It's very easy to make a pointless benchmark measuring wrong things, simply because it's hard to make different solutions perform same work achieving same results.
-
Tradeoffs are important. It's better to look at the overall picture, rather than raw performance.
$ mix run bench/benchmark.exs
Operating System: Linux
CPU Information: 13th Gen Intel(R) Core(TM) i7-13700K
Number of Available Cores: 24
Available memory: 62.52 GB
Elixir 1.20.0
Erlang 29.0.1
JIT enabled: true
Benchmark suite executing with the following configuration:
warmup: 2 s
time: 5 s
memory time: 0 ns
reduction time: 0 ns
parallel: 1
inputs: none specified
Estimated total run time: 42 s
Excluding outliers: false
Benchmarking raw_request (literals, default opts) ...
Benchmarking raw_request (non-literals, default opts) ...
Benchmarking request (literals, default opts) ...
Benchmarking request (literals, persistent) ...
Benchmarking request (literals, reusing ref) ...
Benchmarking request (non-literals, default opts) ...
Calculating statistics...
Formatting results...
Name ips average deviation median 99th %
request (literals, reusing ref) 11.81 K 84.65 μs ±13.79% 82.21 μs 135.81 μs
request (literals, persistent) 8.42 K 118.71 μs ±11.83% 115.87 μs 193.22 μs
raw_request (literals, default opts) 5.31 K 188.18 μs ±12.08% 183.14 μs 308.21 μs
request (literals, default opts) 5.29 K 188.90 μs ±11.78% 184.10 μs 306.53 μs
raw_request (non-literals, default opts) 5.00 K 200.19 μs ±11.56% 195.08 μs 321.36 μs
request (non-literals, default opts) 4.27 K 234.02 μs ±10.90% 227.43 μs 353.50 μs
Comparison:
request (literals, reusing ref) 11.81 K
request (literals, persistent) 8.42 K - 1.40x slower +34.06 μs
raw_request (literals, default opts) 5.31 K - 2.22x slower +103.53 μs
request (literals, default opts) 5.29 K - 2.23x slower +104.26 μs
raw_request (non-literals, default opts) 5.00 K - 2.37x slower +115.55 μs
request (non-literals, default opts) 4.27 K - 2.76x slower +149.37 μs
Extended statistics:
Name minimum maximum sample size mode
request (literals, reusing ref) 77.47 μs 369.01 μs 58.94 K79.16 μs, 79.58 μs, 79.75
request (literals, persistent) 110.33 μs 378.21 μs 42.05 K 112.17 μs, 112.11 μs
raw_request (literals, default opts) 164.90 μs 464.82 μs 26.54 K 187.78 μs
request (literals, default opts) 164.81 μs 418.69 μs 26.44 K 186.95 μs, 187.26 μs
raw_request (non-literals, default opts) 176.23 μs 470.45 μs 24.95 K198.13 μs, 189.40 μs, 198
request (non-literals, default opts) 206.55 μs 434.20 μs 21.34 K227.04 μs, 224.74 μs, 227Copyright 2026 Andrey Tretyakov
The source code of the project is released under Apache License 2.0.
Check LICENSE file for more information.
