triton-ft-api
Publictutorial on how to deploy a scalable autoregressive causal language model transformer using nvidia triton server
Creat:2022-12-01T05:44:23
Update:2023-06-30T04:54:08
5
Stars
0
Stars Increase
tutorial on how to deploy a scalable autoregressive causal language model transformer using nvidia triton server