Alpaca-LoRA-RLHF-PyTorch

Public

A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Alpaca architecture. Basically ChatGPT but with Alpaca

alpaca chatgpt deepspeed finetune gpt llama llm lora peft ppo

作成時間：2023-04-18T14:03:08

更新時間：2024-12-24T11:41:43

Stars

Stars Increase

関連プロジェクト

NextChat

Gpt_academic

academic

为GPT/GLM等LLM大语言模型提供实用化交互接口，特别优化论文阅读/润色/写作体验，模块化设计，支持自定义快捷按钮&函数插件，支持Python和C++等项目剖析&自译解功能，PDF/LaTex论文翻译&总结功能，支持并行问询多种LLM模型，支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。

68303

1个月前

+23today

Openai Cookbook

Hot

chatgpt

Examples and guides for using the OpenAI API

63526

1个月前

+52today

Lobe Chat

Hot

? Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / DeepSeek / Qwen), Knowledge Base (file upload / knowledge management / RAG ), Multi-Modals (Plugins/Artifacts) and Thinking. One-click FREE deployment of your private ChatGPT/ Claude / DeepSeek application.

59446

2个月前

+118today

Open Interpreter

chatgpt

A natural language interface for computers

59196

1个月前

+8today

Awesome Chatgpt Prompts Zh

chat-gpt

ChatGPT 中文调教指南。各种场景使用指南。学习怎么让它听你的话。

54839

1年前

+28today

ChatGPT

Hot

? ChatGPT Desktop Application (Mac, Windows and Linux)

53722

2个月前

+53679today

LLMs From Scratch

Hot

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

46948

8个月前

+46948today

Autogen

Hot

agentic

A programming framework for agentic AI ? PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour

43815

1年前

+155today

JeecgBoot

activiti

?「AI 低代码平台」前后端分离 SpringBoot 2.x/3.x，SpringCloud，Ant Design&Vue3，Mybatis，Shiro！强大的代码生成器让前后端代码一键生成，无需写任何代码! 引领AI低代码开发模式 AI生成->OnlineCoding->代码生成->手工MERGE，帮助Java项目解决80%重复工作，让开发更关注业务，提高开发效率、节省成本，同时又不失灵活性

42496

1个月前

+31today

AIニュース

AIデイリー

AIタイムライン

Alハードウェアです

最新事例

画像コレクション

ビデオコレクション

オーディオコレクション

コンテンツコレクション

最新チュートリアル

AIプロダクトランキング

AIトラフィック成長ランキング

AIトラフィック減少ランキング

AI週間ランキング

アメリカ合衆国

中国

インド

ブラジル

画像生成

パーソナルアシスタント

キャラクター生成

ビデオ生成

AIプロジェクトランキング

AIプロジェクト成長ランキング

AI開発者ランキング

AI組織ランキング

Deepseek

TTS

LLM

ChatGPT

概要

Alpaca-LoRA-RLHF-PyTorch

関連プロジェクト

NextChat

Gpt_academic

Openai Cookbook

Lobe Chat

Open Interpreter

Awesome Chatgpt Prompts Zh

ChatGPT

LLMs From Scratch

Autogen

JeecgBoot