Microsoft’s VASA-1 can deepfake a person with one photo and one audio track

arstechnica.com

YouTube videos of 6K celebrities helped train AI model to animate photos in real time.

cross-posted from: https://sh.itjust.works/post/18066953

On Tuesday, Microsoft Research Asia unveiled VASA-1, an AI model that can create a synchronized animated video of a person talking or singing from a single photo and an existing audio track. In the future, it could power virtual avatars that render locally and don't require video feeds—or allow anyone with similar tools to take a photo of a person found online and make them appear to say whatever they want.
