Alpaca-LoRA-RLHF-PyTorch

Public

A full pipeline for fine-tuning the Alpaca LLM with LoRA and RLHF on consumer hardware. It implements RLHF (Reinforcement Learning from Human Feedback) on top of the Alpaca architecture: essentially ChatGPT, but built on Alpaca.

alpaca chatgpt deepspeed finetune gpt llama llm lora peft ppo

Created: 2023-04-18T14:03:08

Updated: 2024-12-24T11:41:43

Related projects

Langchain

Hot

Elixir implementation of a LangChain style framework that lets Elixir projects integrate with and leverage LLMs.

117414

1 year ago

+98 today

Generative Ai For Beginners

Hot

21 Lessons, Get Started Building with Generative AI: https://microsoft.github.io/generative-ai-for-beginners/

100385

8 months ago

+140 today

NextChat

LLMs From Scratch

Hot

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

75635

1 year ago

+96 today

Gpt_academic

academic

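Provides a practical interactive interface for LLMs such as GPT and GLM, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins; project analysis and self-explanation for Python, C++, and other codebases; PDF/LaTeX paper translation and summarization; parallel querying of multiple LLMs; and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, moss, and others.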

69350

6 months ago

+7 today

Openai Cookbook

chatgpt

Examples and guides for using the OpenAI API

68579

6 months ago

+27 today

Lobe Chat

Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers (OpenAI / Claude 3 / Gemini / Ollama / DeepSeek / Qwen), Knowledge Base (file upload / knowledge management / RAG), Multi-Modals (Plugins/Artifacts) and Thinking. One-click FREE deployment of your private ChatGPT / Claude / DeepSeek application.

66928

7 months ago

+31 today

Open Interpreter

chatgpt

A natural language interface for computers

60657

6 months ago

+6 today

Awesome Chatgpt Prompts Zh

chat-gpt

A Chinese-language guide to prompting ChatGPT. Usage guides for a variety of scenarios. Learn how to make it do what you ask.

56554

1 year ago

+33 today

ChatGPT

ChatGPT Desktop Application (Mac, Windows and Linux)

54187

7 months ago

+7 today
