e5-embedding-ray-serve
Public
Production-grade, scalable embedding API server using the SentenceTransformers model "intfloat/multilingual-e5-base", powered by Ray Serve for multi-GPU orchestration, with Prometheus and Grafana monitoring.
Created: 2025-07-11T10:12:48
Updated: 2025-07-14T00:12:05
Stars: 1
Stars Increase: 0