TUPE
Transformer with Untied Positional Encoding (TUPE). Code for the paper "Rethinking Positional Encoding in Language Pre-training". Improves existing models such as BERT.
Created: 2020-06-24T10:30:16
Updated: 2024-10-30T15:42:23
Stars: 251
Stars Increase: 0