TokenFormer
Public[ICLR2025 Spotlight?] Official Implementation of TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters
Creat:2024-10-30T01:43:02
Update:2025-03-25T15:10:10
https://arxiv.org/abs/2410.23168
567
Stars
0
Stars Increase