HomeAI Tutorial

llm-quantisation-performance-study

Public

Code and data accompanying the article "The impact of quantising a small open source LLM". This repository explores how quantisation affects performance, VRAM usage, and inference speed in Qwen3 1.7B.

Creat2025-07-03T11:57:05
Update2025-09-19T19:57:34
2
Stars
0
Stars Increase

Related projects