HomeAI Tutorial

cold-compress

Public

Cold Compress is a hackable, lightweight, and open-source toolkit for creating and benchmarking cache compression methods built on top of GPT-Fast, a simple, PyTorch-native generation codebase.

Creat2024-05-21T02:39:19
Update2025-03-17T22:21:12
https://www.answer.ai/posts/2024-08-01-cold-compress.html
146
Stars
0
Stars Increase