flux-fp8-api

Public

Flux diffusion model implementation that uses quantized fp8 matmuls, with the remaining layers using faster half-precision accumulation; roughly 2x faster on consumer devices.

diffusion fast-inference flux fp8 pytorch quantization

Created: 2024-08-06T04:04:25

Updated: 2025-03-26T17:43:11

287

Stars

Related projects

Stable Diffusion Webui

Hot

Stable Diffusion web UI

158833

1 year ago

+73 today

Comfyui

Hot

ComfyUI docker images for use in GPU cloud and local environments. Includes AI-Dock base for authentication and improved user experience.

96213

1 year ago

+526 today

Gpt4all

ai-chat

GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.

76955

1 year ago

+7 today

Vllm

Hot

amd

A high-throughput and memory-efficient inference and serving engine for LLMs

64909

2 years ago

+236 today

Whisper.Cpp

Hot

inference

Port of OpenAI's Whisper model in C/C++

44989

1 year ago

+81 today

ColossalAI

Making large AI models cheaper, faster and more accessible

41288

1 year ago

+6 today

DeepSpeed

billion-parameters

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

40940

1 year ago

+29 today

Ray

Hot

data-science

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

40214

1 year ago

+57 today

LocalAI

Hot

The free, open-source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more model architectures. Features: text, audio, video and image generation, voice cloning, and distributed P2P inference.

39967

1 year ago

+190 today

ChatTTS

agent

A generative speech model for daily dialogue.

38294

1 year ago

+31 today
