Build-a-LLM-model-from-scratch-simple

Public

LLM pipeline: data→tokenizer→attention→GPT train/eval→instruction FT→sampling. Reproducible, clean configs, RTX-4060 defaults, ready for AMP/LoRA/DDP.

attention-mechanism bytes causal-attention ddp gpt instruction-tuning llm lora next-token-prediction peft

Creat：2025-08-15T00:09:39

Update：2025-08-26T10:59:13

Stars

Stars Increase

Related projects

Annotated_deep_learning_paper_implementations

Hot

attention

??? 60+ Implementations/tutorials of deep learning papers with side-by-side notes ?; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), ? reinforcement learning (ppo, dqn), capsnet, distillation, ... ?

64723

1年前

+52today

Vit Pytorch

artificial-intelligence

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

24612

1年前

+24today

Numpy Ml

attention

Machine learning, in numpy

16210

1年前

+1today

Leedl Tutorial

bert

《李宏毅深度学习教程》（李宏毅老师推荐?，苹果书?），PDF下载地址：https://github.com/datawhalechina/leedl-tutorial/releases

16083

1年前

+11today

Nlp Tutorial

attention

Natural Language Processing Tutorial for Deep Learning Researchers

14800

1年前

+3today

RWKV LM

attention-mechanism

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RNN and transformer - great performance, linear time, constant space (no kv-cache), fast training, infinite ctx_len, and free sentence embedding.

14211

1年前

+11today

External Attention Pytorch

attention

? Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.???

12125

1年前

+5today

Attention Is All You Need Pytorch

attention

A PyTorch implementation of the Transformer model in "Attention is All You Need".

9543

1年前

+2today

PaLM Rlhf Pytorch

artificial-intelligence

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

7870

1年前

-3today

Dowhy

bayesian-networks

DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphical models and potential outcomes frameworks.

7850

1年前

+5today

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

GEO Brand Visibility

AI Visibility Audit

AI Search Visibility Checker

GEO Ranking Monitor

AI Conversation Insight

GEO Promotion Link Detection

Website AI Friendliness Detection

GEO Ranking Optimization System

GEO Ranking Optimization

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

LLM API Hub

AI Models Finder

Model Providers

LLM Leaderboard

LLM API Proxy Checker

Compare LLMs

LLM Cost Calculator

LLM Arena

AI Model Compatibility Checker

AI Deployment Calculator

Build-a-LLM-model-from-scratch-simple

Related projects

Annotated_deep_learning_paper_implementations

Vit Pytorch

Numpy Ml

Leedl Tutorial

Nlp Tutorial

RWKV LM

External Attention Pytorch

Attention Is All You Need Pytorch

PaLM Rlhf Pytorch

Dowhy