Quantization-in-Depth

Public

Dive into advanced quantization techniques. Learn to implement and customize linear quantization functions, measure quantization error, and compress model weights using PyTorch for efficient and accessible AI models.

2-bit-weights 8-bit-compression advanced-quantization ai-optimization asymmetric-quantization linear-quantization machine-learning model-compression per-channel-granularity per-group-granularity

Creat：2024-05-14T21:24:17

Update：2024-06-27T02:20:00

https://www.deeplearning.ai/short-courses/quantization-in-depth/

Stars

Stars Increase

Related projects

N8n

Hot

Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.

161403

5年前

+680today

Stable Diffusion Webui

Hot

Stable Diffusion web UI

158833

1年前

+73today

Transformers

Hot

bert

? Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

153615

2年前

+136today

Frp

Hot

expose

A fast reverse proxy to help you expose a local server behind a NAT or firewall to the internet.

101486

1年前

+118today

Supabase

Hot

The open source Firebase alternative. Supabase gives you a dedicated Postgres database to build your web, mobile, and AI applications.

94339

1年前

+193today

Playwright

Hot

automation

Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API.

80014

1年前

+98today

PaddleOCR

Hot

ai4science

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 80+ languages.

65993

8个月前

+167today

LLaMA Factory

Hot

agent

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

63700

1年前

+147today

Gitea

Hot

bitbucket

Git with a cup of tea! Painless self-hosted all-in-one software development service, including Git hosting, code review, team collaboration, package registry and CI/CD

52405

1年前

+74today

Unsloth

Hot

deepseek

Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! ?

49117

1年前

+115today

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

GEO Brand Visibility

AI Visibility Audit

AI Search Visibility Checker

GEO Ranking Monitor

AI Conversation Insight

GEO Promotion Link Detection

GEO Ranking Optimization System

GEO Ranking Optimization

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

LLM API Hub

AI Models Finder

Model Providers

LLM Leaderboard

LLM API Proxy Checker

Compare LLMs

LLM Cost Calculator

LLM Arena

AI Model Compatibility Checker

AI Deployment Calculator