frad
A from-scratch PyTorch LLM implementing a Sparse Mixture-of-Experts (MoE) architecture with Top-2 gating. It integrates modern Llama-3 components (RMSNorm, SwiGLU, RoPE, GQA) and a custom Byte-Level BPE tokenizer, and is pre-trained on a curated corpus of existential and dark philosophical literature.
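
To illustrate the headline feature, here is a minimal sketch of a Top-2 gated sparse MoE layer with SwiGLU experts. This is not the repository's actual code; names and hyperparameters (`d_model`, `d_ff`, `n_experts`, `Top2MoE`, `SwiGLUExpert`) are assumptions for illustration.

```python
# Minimal sketch (not frad's actual implementation) of a sparse MoE layer
# with Top-2 gating and SwiGLU experts. All names/sizes here are illustrative.
import torch
import torch.nn as nn
import torch.nn.functional as F


class SwiGLUExpert(nn.Module):
    """One feed-forward expert using the SwiGLU activation (as in Llama-3)."""

    def __init__(self, d_model: int, d_ff: int):
        super().__init__()
        self.w_gate = nn.Linear(d_model, d_ff, bias=False)
        self.w_up = nn.Linear(d_model, d_ff, bias=False)
        self.w_down = nn.Linear(d_ff, d_model, bias=False)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # SwiGLU: silu(x W_gate) * (x W_up), projected back to d_model.
        return self.w_down(F.silu(self.w_gate(x)) * self.w_up(x))


class Top2MoE(nn.Module):
    """Sparse MoE layer: each token is routed to its top-2 experts."""

    def __init__(self, d_model: int, d_ff: int, n_experts: int):
        super().__init__()
        self.router = nn.Linear(d_model, n_experts, bias=False)
        self.experts = nn.ModuleList(
            [SwiGLUExpert(d_model, d_ff) for _ in range(n_experts)]
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        batch, seq, d_model = x.shape
        flat = x.reshape(-1, d_model)              # (tokens, d_model)
        logits = self.router(flat)                 # (tokens, n_experts)
        weights, indices = logits.topk(2, dim=-1)  # top-2 experts per token
        weights = weights.softmax(dim=-1)          # renormalize the two gates
        out = torch.zeros_like(flat)
        for e, expert in enumerate(self.experts):
            # Which (token, slot) pairs selected expert e?
            token_idx, slot_idx = (indices == e).nonzero(as_tuple=True)
            if token_idx.numel() == 0:
                continue
            gate = weights[token_idx, slot_idx].unsqueeze(-1)
            out[token_idx] += gate * expert(flat[token_idx])
        return out.reshape(batch, seq, d_model)


if __name__ == "__main__":
    moe = Top2MoE(d_model=64, d_ff=256, n_experts=4)
    y = moe(torch.randn(2, 16, 64))
    print(y.shape)  # torch.Size([2, 16, 64])
```

In practice, Top-2 routing is usually paired with a load-balancing auxiliary loss so that tokens spread across experts rather than collapsing onto a few; the sketch above omits that for brevity.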