Machine-Learning-and-Language-Model
Public
This project explores GPT-2 and Llama models through pre-training, fine-tuning, and Chain-of-Thought (CoT) prompting. It includes memory-efficient optimizations (SGD, LoRA, BAdam) and evaluations on math reasoning datasets (GSM8K, NumGLUE, SimulEq, SVAMP).
Created: 2025-01-04T00:15:17
Updated: 2025-01-15T15:12:34
https://github.com/Ledzy/MDS5210-24fall
Stars: 2
Stars Increase: 0
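As a rough illustration of the memory-efficient fine-tuning mentioned in the description, below is a minimal LoRA sketch using the Hugging Face `peft` library. This is an assumption-laden example, not the repository's actual training code; the base model (`gpt2`), the targeted module (`c_attn`), and all hyperparameters are illustrative choices.

```python
# Hypothetical sketch: LoRA fine-tuning setup for GPT-2 with Hugging Face PEFT.
# Not the repository's actual code; model name and hyperparameters are illustrative.
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

model_name = "gpt2"  # assumed base model; the project also covers Llama variants
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

# LoRA injects low-rank adapter matrices into selected projections, so only a
# small fraction of parameters is trained (the memory-efficient part).
lora_config = LoraConfig(
    r=8,                        # rank of the low-rank update
    lora_alpha=16,              # scaling factor for the adapter output
    target_modules=["c_attn"],  # GPT-2's fused QKV projection
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # typically well under 1% of weights are trainable
```

The wrapped model can then be passed to any standard causal-LM training loop (e.g. the `transformers` Trainer) on a math dataset such as GSM8K; only the adapter weights receive gradient updates.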