Antidote

Public

This is the unofficial re-implementation of "Antidote: Post-fine-tuning Safety Alignment for Large Language Models against Harmful Fine-tuning Attack" (ICML2025)

alignment fine-tuning harmful language large llm model safety

Creat：2024-04-11T08:31:45

Update：2025-07-27T15:57:11

https://openreview.net/pdf?id=Arepl4R86m

Stars

Stars Increase

Related projects

LLaMA Factory

Hot

agent

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

63700

1年前

+147today

Unsloth

Hot

deepseek

Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! ?

49117

1年前

+115today

Pandas

alignment

Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

47277

1年前

+15today

Llama_index

Hot

agents

LlamaIndex is the leading framework for building LLM-powered agents over your data.

45704

1年前

+65today

Insightface

age-estimation

State-of-the-art 2D and 3D Face Analysis Project

27257

1年前

+31today

LLaVA

chatbot

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

24131

1年前

+23today

CosyVoice

Hot

audio-generation

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

17548

1年前

+55today

Awesome Multimodal Large Language Models

chain-of-thought

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

16892

1年前

+44today

Parlant

ai-agents

Control GenAI interactions with power, precision, and consistency using LLM-native Conversation Design paradigms

16632

1年前

+31today

Nni

automated-machine-learning

An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.

14308

5年前

+3today

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

GEO Brand Visibility

AI Visibility Audit

AI Search Visibility Checker

GEO Ranking Monitor

AI Conversation Insight

GEO Promotion Link Detection

GEO Ranking Optimization System

GEO Ranking Optimization

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

LLM API Hub

AI Models Finder

Model Providers

LLM Leaderboard

LLM API Proxy Checker

Compare LLMs

LLM Cost Calculator

LLM Arena

AI Model Compatibility Checker

AI Deployment Calculator