python-longnet

Public

Tools and experiments with the LongNet model

attention-is-all-you-need attention-mechanism longnet python pytorch

Creat：2023-07-12T16:46:44

Update：2024-02-20T02:01:24

https://www.youtube.com/watch?v=nC2nU9j9DVQ

Stars

Stars Increase

Related projects

Annotated_deep_learning_paper_implementations

Hot

attention

??? 60+ Implementations/tutorials of deep learning papers with side-by-side notes ?; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), ? reinforcement learning (ppo, dqn), capsnet, distillation, ... ?

63553

6个月前

+59today

Meilisearch

A lightning-fast search engine API bringing AI-powered hybrid search to your sites and applications.

53655

6个月前

+13today

LocalAI

:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P inference

35830

6个月前

+22today

Vit Pytorch

artificial-intelligence

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

24145

6个月前

+6today

Numpy Ml

attention

Machine learning, in numpy

16160

6个月前

Leedl Tutorial

bert

《李宏毅深度学习教程》（李宏毅老师推荐?，苹果书?），PDF下载地址：https://github.com/datawhalechina/leedl-tutorial/releases

15862

6个月前

+8today

Nlp Tutorial

attention

Natural Language Processing Tutorial for Deep Learning Researchers

14761

1年前

+2today

RWKV LM

attention-mechanism

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RNN and transformer - great performance, linear time, constant space (no kv-cache), fast training, infinite ctx_len, and free sentence embedding.

14023

6个月前

+3today

External Attention Pytorch

attention

? Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.???

12078

6个月前

H2ogpt

Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/

11931

6个月前

+1today

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

AI Models Finder

LLM Leaderboard

Model Providers

Submit Your Model

Compare LLMs

LLM Cost Calculator

LLM Arena

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

GEO Services

AI Search Visibility Checker

AI Model Compatibility Checker

AI Dataset Collection

Intelligent Document Recognition

python-longnet

Related projects

Annotated_deep_learning_paper_implementations

Meilisearch

LocalAI

Vit Pytorch

Numpy Ml

Leedl Tutorial

Nlp Tutorial

RWKV LM

External Attention Pytorch

H2ogpt

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

AI Models Finder

LLM Leaderboard

Model Providers

Submit Your Model

Compare LLMs

LLM Cost Calculator

LLM Arena

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

GEO Services​

AI Search Visibility Checker

AI Model Compatibility Checker

AI Dataset Collection

Intelligent Document Recognition

python-longnet

Related projects

Annotated_deep_learning_paper_implementations

Meilisearch

LocalAI

Vit Pytorch

Numpy Ml

Leedl Tutorial

Nlp Tutorial

RWKV LM

External Attention Pytorch

H2ogpt

GEO Services