memory-transformer-xl
PublicA variant of Transformer-XL where the memory is updated not with a queue, but with attention
Erstellungszeit:2020-07-10T09:59:07
Aktualisierungszeit:2025-02-22T21:33:09
49
Stars
0
Stars Increase
A variant of Transformer-XL where the memory is updated not with a queue, but with attention