Lightweight Token Factory proxy with caching and rate-limiting.
A serverless OpenAI-compatible proxy in front of Nebius Token Factory. Adds prompt caching, per-key rate limits, and per-user accounting. Drop in via env var.