Skip Navigation

Audio Flamingo 3 - Fully Open Large Audio Language Models

research.nvidia.com Audio Flamingo 3

We present Audio Flamingo 3 (AF3), a fully open state-of-the-art (SOTA) large audio-language model that advances reasoning and understanding across speech, sound, and music. AF3 introduces: (i) AF-Whisper, a unified audio encoder trained using a novel strategy for joint representation learning acros...

13

You're viewing a single thread.

13 comments
13 comments