Alpaca-LoRA-RLHF-PyTorch

Public

A full pipeline for fine-tuning the Alpaca LLM with LoRA and RLHF on consumer hardware. It implements RLHF (Reinforcement Learning from Human Feedback) on top of the Alpaca architecture: essentially ChatGPT, but built on Alpaca.

alpaca chatgpt deepspeed finetune gpt llama llm lora peft ppo

Created: 2023-04-18T14:03:08

Updated: 2024-12-24T11:41:43

Related Projects

Generative Ai For Beginners

Hot

21 Lessons, Get Started Building with Generative AI: https://microsoft.github.io/generative-ai-for-beginners/

85509

4 months ago

+59 today

NextChat

Gpt_academic

academic

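Provides a practical interactive interface for LLMs such as GPT and GLM, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins; project analysis and self-explanation for Python, C++, and other codebases; PDF/LaTeX paper translation and summarization; parallel querying of multiple LLMs; and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, moss, and others.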

68821

3 months ago

+5 today

Openai Cookbook

chatgpt

Examples and guides for using the OpenAI API

64854

3 months ago

+32 today

Lobe Chat

Lobe Chat - an open-source, modern-design AI chat framework. Supports multiple AI providers (OpenAI / Claude 3 / Gemini / Ollama / DeepSeek / Qwen), Knowledge Base (file upload / knowledge management / RAG), Multi-Modals (Plugins/Artifacts) and Thinking. One-click FREE deployment of your private ChatGPT / Claude / DeepSeek application.

62731

4 months ago

+24 today

Open Interpreter

chatgpt

A natural language interface for computers

59741

3 months ago

+10 today

Awesome Chatgpt Prompts Zh

chat-gpt

A Chinese-language guide to prompting ChatGPT. Usage guides for a variety of scenarios. Learn how to get it to do what you ask.

55490

1 year ago

+8 today

LLMs From Scratch

Hot

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

55142

9 months ago

+1984 today

ChatGPT

ChatGPT Desktop Application (Mac, Windows and Linux)

53861

3 months ago

+7 today

Autogen

Hot

agentic

A programming framework for agentic AI. PyPi: autogen-agentchat. Discord: https://aka.ms/autogen-discord. Office Hour: https://aka.ms/autogen-officehour

46359

1 year ago

+57 today
