
beats-conformer-bart-audio-captioner

Public

PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Supervision, and LLM Mix-up Augmentation"

Created: 2024-01-05T05:33:29
Updated: 2025-01-13T08:32:41
Stars: 37
Stars increase: 0
