Q-LLM

Public

This is the official repo of "QuickLLaMA: Query-aware Inference Acceleration for Large Language Models"

Created: 2024-06-11T15:45:03
Updated: 2025-01-19T20:04:48
https://arxiv.org/abs/2406.07528
Stars: 49
Stars Increase: 0