8 months ago

K R Prajwal∗ (prajwal.k@research.iiti.ac.in), IIIT, Hyderabad, India; Vinay P. Namboodiri (vpn22@bath.ac.uk), University of Bath, England; Rudrabha Mukhopadhyay∗ (radrabha.m@research.iiti.ac.in), IIIT, Hyderabad, India; C V Jawahar

Abstract

In this work, we investigate the problem of lip-syncing a talking face video of an arbitrary identity to match a target speech segment. Current works excel at producing accurate lip movements on a static image or videos of specific people seen during the training phase. However, they fail to accurately morph the lip movements of arbitrary identities in dynamic, unconstrained talking face videos, resulting in significant parts of the video being out-of-sync with the new audio. We identify key reasons pertaining to this and hence resolve them by learning from a powerful lip-sync discriminator. Next, we propose new, rigorous evaluation benchmarks and metrics to accurately measure lip synchronization in unconstrained videos. Extensive quantitative evaluations on our challenging benchmarks show that the lip-sync accuracy of the videos generated by our Wav2Lip model is almost as good as real synced videos. We provide a demo video clearly showing the substantial impact of our Wav2Lip model and evaluation benchmarks on our website: cvit.iiit.ac.in/research/projects/cvit-projects/a-lip-sync-expert-is-all-you-need-for-speech-to-lip-generation-in-the-wild. The code and models are released at this GitHub repository: github.com/Rudrabha/Wav2Lip. You can also try out the interactive demo at this link: bhaasha.iiit.ac.in/lipsync.


Video Generation

Audio and Speech Processing

A Lip Sync Expert Is All You Need for Speech to Lip Generation In The Wild
