ByteDance Releases a New Open-Source Long Text Processing Model Seed-OSS-36B

AIbase基地

Published inAI News · 4 min read · Aug 21, 2025

Recently, the Seed team under ByteDance has released the latest open-source large language model, Seed-OSS-36B, on the AI code sharing platform Hugging Face. This new model focuses on advanced reasoning and developer-friendliness. Its main feature is supporting input text processing of up to 512,000 tokens, far exceeding the products of American tech companies such as OpenAI and Anthropic.

The Seed-OSS-36B series includes three main variants: Seed-OSS-36B-Base (with synthetic data), Seed-OSS-36B-Base (without synthetic data), and Seed-OSS-36B-Instruct. The synthetic data version performs better in standard benchmark tests and is suitable for general use, while the version without synthetic data provides a more pure foundation for research. Seed-OSS-36B-Instruct is focused on task execution and instruction following, and has been fine-tuned to optimize performance.

All models are licensed under the Apache-2.0 license, which means researchers and developers can freely use, modify, and redistribute these models without paying any licensing fees to ByteDance. This marks another important advancement for Chinese companies in the field of open-source models and also provides more possibilities for international applications.

The design and core features of Seed-OSS-36B include 3.6 billion parameters, a 64-layer architecture, and a vocabulary size of 155,000 tokens. The model's long text processing capabilities and reasoning budget settings allow developers to adjust the depth of model reasoning based on the complexity of tasks. In addition, the model has demonstrated excellent performance in multiple benchmark tests, achieving industry-leading results in mathematical and programming tasks.

The Seed team also pays special attention to the accessibility of the model. Users can deploy it through Hugging Face Transformers and support 4-bit and 8-bit quantization formats to reduce memory requirements. In addition, the team provides scripts for inference, prompt customization, and tool integration, further lowering the operational threshold for small teams.

By providing high-performance and flexible deployment open models, the Seed team of ByteDance brings new choices for enterprises, researchers, and developers.

huggingface:https://huggingface.co/collections/ByteDance-Seed/seed-oss-68a609f4201e788db05b5dcd

Key Points:
🌟 The Seed-OSS-36B model supports input of up to 512,000 tokens, surpassing competitors.
💡 The model comes in versions with and without synthetic data to meet different user needs.
🔧 All models are available for free and support various deployment and integration options, making them easy for developers to use.

5 Billion Yuan B+ Round Financing Sets New Record! Yin Qi Joins Step Star as Chairman, Betting on AI + Terminal

Shanghai's large model unicorn, Step Star, has completed a super 5 billion yuan B+ round financing, setting a record for the largest single investment in the large model sector in China over the past year. The funds will be used for R&D of the base model and promotion of the "AI + Terminal" strategy. Yin Qi, founder of Megvii Technology, joined and was appointed chairman, responsible for company strategy development.

New Breakthrough in Domestic Computing Power! Mooley × Silicon Flow Achieve Efficient Inference of DeepSeek V3 671B on MTT S5000, Single Card Performance Approaching International Top Standards

Domestic AI chips and large models have achieved significant progress in collaborative optimization. Moore Threads and Silicon-based Flow successfully adapted the 671B-parameter DeepSeek V3 model to the domestic GPU MTT S5000. Using FP8 low-precision inference, they achieved over 4000 tokens/sec prefill and over 1000 tokens/sec decoding throughput, nearing international high-end AI accelerator performance.....

10B-Parameter Small Nuclear Bomb: Stepwise Star Open-Source Step3-VL-10B Performance Challenges 200B Large Models

The Stepwise Star open-source multimodal vision-language model Step3-VL-10B excels in multiple benchmark tests with only 10B parameters, solving the problem of insufficient intelligence in small models. The model achieves the best performance in its scale in visual perception, logical reasoning, and math competitions, even surpassing open-source and closed-source flagship models that are 10 to 20 times larger in size.

AI Daily: ByteDance Launches Cog 2.0; Alibaba Launches AIGC Design Platform Wuli; Step星辰 Launches AI Desktop Companion Windows Version

Welcome to the [AI Daily] section! This is your guide to exploring the world of artificial intelligence every day. Every day, we present you with the latest content in the AI field, focusing on developers, helping you understand technical trends and innovative AI product applications. Click to learn more about new AI products: https://app.aibase.com/zh1. ByteDance launched a new AIAgent platform, Cog 2.0, with the AgentSkills feature attracting attention. ByteDance launched a new AIAgent platform, Cog 2.0, through Age

ByteDance Launches New AI Agent Platform 'Cousins' 2.0 Agent Skills Function Attracts Attention

The ByteDance AI platform 'Cousins' upgrades to version 2.0, evolving from a simple Q&A model into a comprehensive platform with long-term planning, in-depth office work, and cloud collaboration capabilities. The core new feature, Agent Skills, integrates scenario practice with tools deeply. For example, in marketing copywriting, it can call professional frameworks and integrate research tools for quality inspection.

NVIDIA Launches PersonaPlex-7B-v1: A Full-Duplex Black Tech That Redefines Real-Time Voice Interaction

NVIDIA launches PersonaPlex-7B-v1, a voice dialogue model that breaks the traditional 'one-question-one-answer' pattern, enabling more natural human-like conversations. It uses a single Transformer architecture for direct speech understanding and generation, eliminating the need for traditional ASR, LLM, and TTS processing.....

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

AI Models Finder

LLM Leaderboard

Model Providers

Compare LLMs

LLM Cost Calculator

LLM Arena

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

GEO Brand Visibility

AI Brand Monitoring Tool

AI Search Visibility Checker

GEO Promotion Link Detection

GEO Services​

AI Model Compatibility Checker

AI Deployment Calculator

ByteDance Releases a New Open-Source Long Text Processing Model Seed-OSS-36B

AIbase基地

This article is from AIbase Daily

AI News Recommendations

Allen AI Launches SERA: Build a Private Programming AI Assistant for $400

Industrial AI Startup CVector Secures $5 Million Seed Funding to Build a Nervous System for Large-scale Industry

5 Billion Yuan B+ Round Financing Sets New Record! Yin Qi Joins Step Star as Chairman, Betting on AI + Terminal

New Breakthrough in Domestic Computing Power! Mooley × Silicon Flow Achieve Efficient Inference of DeepSeek V3 671B on MTT S5000, Single Card Performance Approaching International Top Standards

Liquid AI Releases 1.2B Inference Model: Less than 1GB of Memory, Can Run on Mobile Edge Devices

10B-Parameter Small Nuclear Bomb: Stepwise Star Open-Source Step3-VL-10B Performance Challenges 200B Large Models

AI Daily: ByteDance Launches Cog 2.0; Alibaba Launches AIGC Design Platform Wuli; Step星辰 Launches AI Desktop Companion Windows Version

New Benchmark for 30B Specifications! Zhipu AI Open Sources GLM-4.7-Flash, Outperforming Alibaba and OpenAI in Multiple Tests

ByteDance Launches New AI Agent Platform 'Cousins' 2.0 Agent Skills Function Attracts Attention

NVIDIA Launches PersonaPlex-7B-v1: A Full-Duplex Black Tech That Redefines Real-Time Voice Interaction

AI News Recommendations

Allen AI Launches SERA: Build a Private Programming AI Assistant for $400

Industrial AI Startup CVector Secures $5 Million Seed Funding to Build a Nervous System for Large-scale Industry

5 Billion Yuan B+ Round Financing Sets New Record! Yin Qi Joins Step Star as Chairman, Betting on AI + Terminal

New Breakthrough in Domestic Computing Power! Mooley × Silicon Flow Achieve Efficient Inference of DeepSeek V3 671B on MTT S5000, Single Card Performance Approaching International Top Standards

Liquid AI Releases 1.2B Inference Model: Less than 1GB of Memory, Can Run on Mobile Edge Devices

10B-Parameter Small Nuclear Bomb: Stepwise Star Open-Source Step3-VL-10B Performance Challenges 200B Large Models

AI Daily: ByteDance Launches Cog 2.0; Alibaba Launches AIGC Design Platform Wuli; Step星辰 Launches AI Desktop Companion Windows Version

New Benchmark for 30B Specifications! Zhipu AI Open Sources GLM-4.7-Flash, Outperforming Alibaba and OpenAI in Multiple Tests

ByteDance Launches New AI Agent Platform 'Cousins' 2.0 Agent Skills Function Attracts Attention

NVIDIA Launches PersonaPlex-7B-v1: A Full-Duplex Black Tech That Redefines Real-Time Voice Interaction

GEO Services