Drop-Upcycling

Public

[ICLR'25] Drop-Upcycling: Training Sparse Mixture of Experts with Partial Re-initialization

Created: 2025-02-17T02:43:42
Updated: 2025-09-27T15:02:03
https://arxiv.org/abs/2502.19261
Stars: 20
Stars Increase: 0