Designing a Rate Limiter

Learn how to design a rate limiter to protect APIs from abuse, covering token bucket, leaky bucket, and sliding window algorithms.

srikanthtelkalapally888@gmail.com

March 14, 2026

A rate limiter controls the number of requests a client can make to an API in a given time window.

Why Rate Limiting?

Token Bucket

- Bucket holds up to N tokens
- One token is added every T seconds
- Each request consumes 1 token
- Requests are rejected if the bucket is empty
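
The token bucket rules can be sketched in a few lines of Python. This is a minimal single-process sketch, not a production implementation: the class name `TokenBucket`, the `allow` method, and the injectable `clock` parameter are assumptions made for illustration, and tokens are refilled fractionally on each call rather than by a background timer.

```python
import time


class TokenBucket:
    """Token bucket: holds up to `capacity` tokens, one added per interval."""

    def __init__(self, capacity, refill_interval, clock=time.monotonic):
        self.capacity = capacity                # N: maximum tokens in the bucket
        self.refill_interval = refill_interval  # T: seconds per added token
        self.clock = clock                      # hypothetical hook, eases testing
        self.tokens = float(capacity)           # start with a full bucket
        self.last_refill = clock()

    def allow(self):
        """Return True if the request may proceed, False if it is rejected."""
        now = self.clock()
        # Credit the tokens earned since the last call, capped at capacity.
        earned = (now - self.last_refill) / self.refill_interval
        self.tokens = min(self.capacity, self.tokens + earned)
        self.last_refill = now
        if self.tokens >= 1:
            self.tokens -= 1  # each request consumes one token
            return True
        return False          # bucket is empty: reject
```

With `TokenBucket(capacity=10, refill_interval=0.5)`, a client could send 10 requests at once and then roughly 2 per second afterwards, which is why this algorithm tolerates bursts.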

Leaky Bucket

- Requests queue up in a fixed-size bucket
- Queued requests are processed at a fixed rate
- Excess requests are dropped when the queue is full
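
A queue-based leaky bucket might look like the following sketch. The names `LeakyBucket`, `submit`, and `drain` are hypothetical, and the sketch assumes a separate worker calls `drain()` at least once per `leak_interval` to pick up the requests that are due for processing.

```python
import time
from collections import deque


class LeakyBucket:
    """Queue-based leaky bucket with a fixed drain rate."""

    def __init__(self, capacity, leak_interval, clock=time.monotonic):
        self.capacity = capacity            # maximum number of queued requests
        self.leak_interval = leak_interval  # seconds between processed requests
        self.clock = clock                  # hypothetical hook, eases testing
        self.queue = deque()
        self.last_leak = clock()

    def submit(self, request):
        """Queue the request, or drop it (return False) if the bucket is full."""
        if len(self.queue) >= self.capacity:
            return False
        self.queue.append(request)
        return True

    def drain(self):
        """Return the requests that became due since the last drain."""
        due = int((self.clock() - self.last_leak) / self.leak_interval)
        if due <= 0:
            return []
        # Advance by whole intervals so the drain rate stays fixed.
        self.last_leak += due * self.leak_interval
        count = min(due, len(self.queue))
        return [self.queue.popleft() for _ in range(count)]
```

Unlike the token bucket, the output rate here never exceeds one request per `leak_interval` as long as the worker drains on schedule, so bursts are smoothed rather than passed through.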

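Sliding Window

Divides time into small buckets and counts requests over a rolling window, so the limit always applies to the most recent interval rather than to a fixed calendar minute.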

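key = "rate:{user_id}:{timestamp_minute}"   # one counter per user per minute
count = redis.INCR(key)                     # INCR returns the new value
if count == 1: redis.EXPIRE(key, 60)        # set the TTL once, when the key is created
if count > limit: reject()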
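
A single per-minute key counts each calendar minute separately, so a client can send up to twice the limit in a short span that straddles a minute boundary. The sketch below shows one way to get the rolling behaviour described above by keeping several small buckets per client and summing only those still inside the window. It is an in-memory illustration under stated assumptions: `SlidingWindowCounter`, `allow`, `bucket_size`, and `clock` are hypothetical names, and a Redis-backed version would store the buckets as separate keys or hash fields instead of a Python dict.

```python
import time
from collections import defaultdict


class SlidingWindowCounter:
    """Counts requests in small buckets and sums those inside the window."""

    def __init__(self, limit, window=60.0, bucket_size=1.0, clock=time.time):
        self.limit = limit              # maximum requests per window
        self.window = window            # window length in seconds
        self.bucket_size = bucket_size  # bucket length in seconds
        self.clock = clock              # hypothetical hook, eases testing
        self.buckets = defaultdict(dict)  # client_id -> {bucket_index: count}

    def allow(self, client_id):
        """Return True if the client is under its limit for the window."""
        now = self.clock()
        current = int(now // self.bucket_size)
        oldest = int((now - self.window) // self.bucket_size) + 1
        counts = self.buckets[client_id]
        # Drop buckets that have slid out of the window.
        for index in [i for i in counts if i < oldest]:
            del counts[index]
        if sum(counts.values()) >= self.limit:
            return False
        counts[current] = counts.get(current, 0) + 1
        return True
```

Smaller buckets track the true rolling window more closely at the cost of more counters per client; with `bucket_size` equal to `window` the sketch degenerates into the fixed-window counter.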

Client → API Gateway (Rate Limiter)
             ↓
           Redis
             ↓
       Microservices

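For multi-region deployments, note that a single Redis Cluster shards counters across its nodes but does not by itself share them between regions. Either enforce limits per region, or synchronize counters across regions and accept that the global limit is approximate while updates propagate.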

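Rate limiters are critical for API stability. Token bucket suits bursty traffic, since a full bucket lets a client send up to N requests at once; sliding window is the better choice for precision, because the limit always applies to the most recent window.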