edge-serve-ab
This project is a FastAPI + ONNX Runtime microservice that serves image inference with A/B routing, optional shadow traffic, token-bucket rate limiting, payload size guards, and end-to-end observability (Prometheus metrics, Grafana dashboards, structured logs, request tracing).
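
To make two of the listed features concrete, here is a minimal, standard-library-only sketch of token-bucket rate limiting and deterministic A/B routing. It is an illustration under stated assumptions, not this project's code: the names `TokenBucket`, `choose_variant`, `rate`, `capacity`, and `b_fraction` are hypothetical, and the actual service may key limits and variants differently.

```python
import hashlib
import time


class TokenBucket:
    """Hypothetical token bucket: up to `capacity` tokens, refilled at `rate` per second."""

    def __init__(self, rate: float, capacity: float, clock=time.monotonic):
        self.rate = rate
        self.capacity = capacity
        self.clock = clock            # injectable so the bucket can be tested without sleeping
        self.tokens = capacity        # start full, which permits an initial burst
        self.updated = clock()

    def allow(self, cost: float = 1.0) -> bool:
        """Refill based on elapsed time, then spend `cost` tokens if available."""
        now = self.clock()
        elapsed = now - self.updated
        self.updated = now
        self.tokens = min(self.capacity, self.tokens + elapsed * self.rate)
        if self.tokens >= cost:
            self.tokens -= cost
            return True
        return False


def choose_variant(request_id: str, b_fraction: float = 0.1) -> str:
    """Hypothetical A/B router: hash the request ID into one of 10,000 slots.

    The same ID always lands in the same slot, so routing is sticky
    without storing any per-request state.
    """
    digest = hashlib.sha256(request_id.encode("utf-8")).digest()
    slot = int.from_bytes(digest[:8], "big") % 10_000
    return "B" if slot < b_fraction * 10_000 else "A"


if __name__ == "__main__":
    limiter = TokenBucket(rate=5.0, capacity=10.0)
    for i in range(12):
        rid = f"req-{i}"
        if limiter.allow():
            print(rid, "-> variant", choose_variant(rid, b_fraction=0.2))
        else:
            print(rid, "-> rejected (rate limit)")
```

In a FastAPI service, a check like `limiter.allow()` would typically run in middleware or a dependency before inference, and a rejected request would usually receive HTTP 429. Hashing a stable identifier, rather than drawing a random number per request, keeps a given client on the same model variant across retries.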