ADIFF: Explaining audio difference using natural language
Soham Deshmukh, Shuo Han, Rita Singh, Bhiksha Raj
Two new datasets were created; a prefix-tuning baseline and ADIFF, which uses a cross-projection module and position captioning, were compared; ADIFF showed significant improvements via objective and human evaluation.
10.02.2025 07:07
๐ 2
๐ 1
๐ฌ 0
๐ 0
Great opportunity to work with amazing set of people!
09.12.2024 21:41
๐ 3
๐ 0
๐ฌ 0
๐ 0
Hi @jonathanleroux.bsky.social, could you please add me to the list as well? Thank you in advance!
09.12.2024 02:43
๐ 0
๐ 0
๐ฌ 0
๐ 0