memory-transformer-xl
PublicA variant of Transformer-XL where the memory is updated not with a queue, but with attention
Creat:2020-07-10T09:59:07
Update:2025-02-22T21:33:09
49
Stars
0
Stars Increase
A variant of Transformer-XL where the memory is updated not with a queue, but with attention