llm-acceleration

Public

Final project for the EdgeAI course at NYCU, focused on accelerating Llama-3.2-3B-Instruct inference on a single NVIDIA T4 GPU.

Created: 2025-06-02T23:23:38
Updated: 2025-06-08T00:40:35
Stars: 0
Stars increase: 0