Microsoft unveils VASA-1: generate videos with one photo & audio clip

Microsoft has unveiled VASA-1, an advanced AI model that can generate hyper-realistic videos of talking human faces with just a photo and audio clip.

As this example shows, the videos show accurate lip-synching, facial expressions and natural head movements making them seem very real.

An interesting development, but beware: deepfakes are becoming increasingly convincing. How will you recognize them soon?

For now, this is only a demonstration; there is no product or release plan yet.

Take a leap forward in your marketing AI transformation every week

Every Friday, we bring you the latest insights, news and real-world examples on the impact of AI in the marketing world. Whether you want to improve your marketing efficiency, increase customer engagement, sharpen your marketing strategy or digitally transform your business, "Marketing AI Friday" is your weekly guide.