Home

stolen-attention

Public

All code used in my Master's Thesis: Stolen Attention in Transformers. Includes a PyTorchLightning implementation of a transformer and relevant training scripts. Checkpoints and data sources not included.

Creat2024-01-18T02:42:05
Update2024-06-06T09:12:15
0
Stars
0
Stars Increase