How a thread-safe token bucket rate limiter works

A token bucket limits request rates by refilling tokens at a fixed rate, up to a maximum capacity, and letting a request proceed only when enough tokens are available.

Explained by highlit

Intermediate · rate-limiting, concurrency, locking, algorithms

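import time
import threading


class TokenBucket:
    """Thread-safe token bucket that refills lazily at `rate` tokens per second."""

    def __init__(self, rate: float, capacity: int):
        if rate <= 0:
            # acquire() divides by rate, so zero or negative rates are rejected.
            raise ValueError("rate must be positive")
        self.rate = rate                # tokens added per second
        self.capacity = capacity        # maximum tokens the bucket can hold
        self._tokens = float(capacity)  # the bucket starts full
        self._last = time.monotonic()   # timestamp of the last refill
        self._lock = threading.Lock()

    def _refill(self) -> None:
        # Callers must hold self._lock.
        now = time.monotonic()
        elapsed = now - self._last
        # Credit the tokens earned since the last call, capped at capacity.
        self._tokens = min(self.capacity, self._tokens + elapsed * self.rate)
        self._last = now

    def acquire(self, tokens: int = 1) -> None:
        """Block until `tokens` tokens are available, then consume them."""
        if tokens > self.capacity:
            # The bucket can never hold this many, so waiting would never end.
            raise ValueError("requested tokens exceed bucket capacity")
        while True:
            with self._lock:
                self._refill()
                if self._tokens >= tokens:
                    self._tokens -= tokens
                    return
                # Time needed to earn the missing tokens at the refill rate.
                deficit = tokens - self._tokens
                wait = deficit / self.rate
            # Sleep outside the lock so other threads can make progress,
            # then loop and re-check: another caller may have taken tokens.
            time.sleep(wait)

    def try_acquire(self, tokens: int = 1) -> bool:
        """Consume `tokens` tokens if available; return False without waiting otherwise."""
        with self._lock:
            self._refill()
            if self._tokens >= tokens:
                self._tokens -= tokens
                return True
            return False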
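
To make the arithmetic in `_refill` and `acquire` concrete, here is a minimal sketch that pulls the two formulas out into pure functions and runs them on made-up numbers. The helper names `refill` and `wait_for` and the specific values (5 tokens per second, capacity 10, 0.5 tokens on hand) are assumptions for illustration and are not part of the class above.

```python
def refill(tokens: float, capacity: int, rate: float, elapsed: float) -> float:
    """Tokens held after `elapsed` seconds, capped at capacity (mirrors _refill)."""
    return min(capacity, tokens + elapsed * rate)


def wait_for(tokens: float, requested: int, rate: float) -> float:
    """Seconds to sleep until `requested` tokens are available (mirrors acquire)."""
    deficit = requested - tokens
    return max(0.0, deficit / rate)


# Hypothetical state: 5 tokens/second, capacity 10, bucket currently holds 0.5.
tokens = 0.5

# A caller asks for 3 tokens: the deficit is 2.5, so it sleeps 2.5 / 5 = 0.5 s.
wait = wait_for(tokens, requested=3, rate=5.0)

# After that sleep the lazy refill credits 0.5 s * 5 = 2.5 tokens.
after_sleep = refill(tokens, capacity=10, rate=5.0, elapsed=wait)

# A long idle period never overfills the bucket: the result is capped at 10.
idle_refill = refill(tokens, capacity=10, rate=5.0, elapsed=60.0)

print(wait, after_sleep, idle_refill)
```

The sleep is computed so that, if no other thread takes tokens in the meantime, the next pass through the loop finds exactly enough. The loop still re-checks, because a concurrent caller may have consumed tokens while this one slept.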

Three takeaways

1. Tracking elapsed time lets you compute refills lazily instead of running a background timer.
2. A lock around the read-modify-write of token state keeps concurrent callers correct.
3. Offering both blocking and non-blocking acquire methods covers backpressure and fail-fast use cases.
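
The second takeaway can be checked with a small experiment. The sketch below is a compact restatement of the non-blocking path, not the walkthrough's exact class: it adds a hypothetical `clock` parameter (an assumption, not in the original) so that time can be frozen and no refill occurs. Eight threads then compete for a bucket of 100 tokens, and the total number of successful acquisitions should equal the capacity.

```python
import threading
import time


class TokenBucket:
    """Compact variant of the walkthrough's class with an injectable clock."""

    def __init__(self, rate, capacity, clock=time.monotonic):
        self.rate = rate
        self.capacity = capacity
        self._clock = clock  # assumption: added here so the demo can freeze time
        self._tokens = float(capacity)
        self._last = clock()
        self._lock = threading.Lock()

    def try_acquire(self, tokens=1):
        with self._lock:
            # Lazy refill: credit tokens for the time since the last call.
            now = self._clock()
            self._tokens = min(
                self.capacity, self._tokens + (now - self._last) * self.rate
            )
            self._last = now
            if self._tokens >= tokens:
                self._tokens -= tokens
                return True
            return False


# Frozen clock: elapsed time is always zero, so the bucket never refills.
bucket = TokenBucket(rate=1.0, capacity=100, clock=lambda: 0.0)

results = []
results_lock = threading.Lock()


def worker():
    # Each thread makes 50 non-blocking attempts and counts its successes.
    wins = sum(bucket.try_acquire() for _ in range(50))
    with results_lock:
        results.append(wins)


threads = [threading.Thread(target=worker) for _ in range(8)]
for t in threads:
    t.start()
for t in threads:
    t.join()

granted = sum(results)
print(granted)  # 400 attempts against 100 tokens: exactly 100 succeed
```

Because the check and the decrement happen under one lock, no two threads can both spend the same token. The remaining 300 attempts fail fast with `False`, which is the behavior the third takeaway describes for callers that prefer to drop or defer work rather than wait.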
