Rate Limiter
Design a distributed rate limiting system that controls the rate of requests a client can send to an API within a given time window.
These requirements and scale numbers are for reference. The AI reviewer will evaluate your design based on the functional requirements and assumptions you define.
🎯 Key Requirements
- •Enforce per-user / per-API request limits with a chosen algorithm (e.g., token bucket)
- •Stay consistent across multiple application servers behind a load balancer
- •Return standard rate-limit headers (X-RateLimit-Remaining, Retry-After)
📐 Scale Expectations
- •~10K requests per second across all clients
- •Tens of thousands of distinct clients
- •Decision overhead under 10ms per request
Community Submissions (1)