AIbase

tinyllama-coreml-ios18-quantization

Public

Quantize TinyLlama-1.1B-Chat from PyTorch to CoreML (float16, int8, int4) for efficient on-device inference on iOS 18+.
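
To give a rough sense of what the int8 and int4 targets trade away, here is a minimal sketch of symmetric linear weight quantization in plain Python. It is not the repository's conversion code and does not use the CoreML tooling. The function names and the sample weights are hypothetical, chosen only for illustration.

```python
def quantize_symmetric(weights, n_bits):
    """Map floats to signed integers in [-q_max, q_max] using one shared scale."""
    q_max = 2 ** (n_bits - 1) - 1          # 127 for int8, 7 for int4
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / q_max if max_abs > 0 else 1.0
    quantized = [max(-q_max, min(q_max, round(w / scale))) for w in weights]
    return quantized, scale


def dequantize(quantized, scale):
    """Recover approximate float values from the integer codes."""
    return [q * scale for q in quantized]


def max_error(weights, n_bits):
    """Largest absolute round-trip error for the given bit width."""
    quantized, scale = quantize_symmetric(weights, n_bits)
    restored = dequantize(quantized, scale)
    return max(abs(w - r) for w, r in zip(weights, restored))


# Hypothetical weight values, chosen only for illustration.
weights = [0.42, -1.30, 0.07, 0.88, -0.51]

for bits in (8, 4):
    print(f"int{bits}: max round-trip error = {max_error(weights, bits):.4f}")
```

Fewer bits mean a coarser grid of representable values, so int4 yields a smaller model at the cost of a larger round-trip error than int8. Real pipelines typically use per-channel or per-block scales rather than the single per-tensor scale assumed here.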

Created: 2025-05-19T00:26:39
Updated: 2025-06-17T09:50:50
Stars: 1
Stars increase: 0