AIBase
Home
AI NEWS
AI Tools
GEO & AEO
MCP
AI Models
AI Marketplace
EN

AI Products

View More
Megatron-LM

Megatron-LM

Continuous research on training Transformer models at scale.

Language Model
9.5k

Models

View More

Bert 1.3b

retrieva-jp

B

Transformer encoder pretrained based on Megatron-LM, specifically designed for Japanese scenarios

Natural Language ProcessingTransformersTransformersMultiple Languages
retrieva-jp
56
15

Bloom Tiny Random

Muennighoff

B

This is a small GPT-2-like model designed for testing the conversion functionality between Megatron-LM and transformers, primarily used for integration testing and debugging scripts

Natural Language ProcessingTransformersTransformersEnglish
Muennighoff
127
0

Bigscience Small Testing

bigscience

B

This is a small GPT-2-like model designed for testing the conversion between Megatron-LM and transformers, primarily used for integration testing and debugging scripts.

Natural Language ProcessingTransformersTransformersEnglish
bigscience
18.7k
4

Bert Large Swedish Cased

AI-Nordics

B

A Swedish Bert Large model implemented based on the Megatron-LM framework, containing 340 million parameters, pre-trained on 85GB of Swedish text

Natural Language ProcessingTransformersTransformersOther
AI-Nordics
734
11
AIBase
Empowering the future, your artificial intelligence solution think tank
English简体中文繁體中文にほんご
FirendLinks:
AI Newsletters AI ToolsMCP ServersAI NewsAIBaseLLM LeaderboardAI Ranking
© 2026AIBase
Business CooperationSite Map