~/projects/llm-integration-api

LLM Integration API

LLM access treated as a production dependency, not a demo call.

portfolio-readybackend · ai

PythonFastAPIRedisSSEPrometheusDocker

A backend service that sits between applications and LLM providers, adding the operational layer those calls usually lack: caching, rate limiting, streaming, failure isolation and observability.

Problem

Direct LLM calls from applications lack operational control — no caching, no rate limiting, no failure isolation and no visibility into cost or latency.

Solution

A proxy API with Redis caching, per-client rate limiting, SSE streaming, circuit-breaker behavior on provider failures, classifier model versioning and Prometheus metrics for latency and usage.

Senior signal

Designs for the failure and cost modes of an external dependency before the happy path — the difference between a demo and something operable.

repository

back to projects back to home · selected systems