GenerativeAIExamples
PublicGenerative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
gpu-accelerationlarge-language-modelsllmllm-inferencemicroservicenemoragretrieval-augmented-generationtensorrttriton-inference-server
Creat:2023-10-19T21:46:31
Update:2025-03-27T10:42:28
3.3K
Stars
5
Stars Increase