Wav2Lip-HD: Improving Wav2Lip to achieve High-Fidelity Videos

This repository contains code for achieving high-fidelity lip-syncing in videos, using the Wav2Lip algorithm for lip-syncing and the Real-ESRGAN algorithm for super-resolution. The combination of these two algorithms allows for the creation of lip-synced videos that are both highly accurate and visually stunning.

Algorithm

The algorithm for achieving high-fidelity lip-syncing with Wav2Lip and Real-ESRGAN can be summarized as follows:

The input video and audio are given to Wav2Lip algorithm.
Python script is written to extract frames from the video generated by wav2lip.
Frames are provided to Real-ESRGAN algorithm to improve quality.
Then, the high-quality frames are converted to video using ffmpeg, along with the original audio.
The result is a high-quality lip-syncing video.
The specific steps for running this algorithm are described in the Testing Model section of this README.

Testing Model

To test the "Wav2Lip-HD" model, follow these steps:

Clone this repository and install requirements using following command (Make sure, Python and CUDA are already installed):
```
git clone https://github.com/saifhassan/Wav2Lip-HD.git
cd Wav2Lip-HD
pip install -r requirements.txt
```
Downloading weights

Model	Directory	Download Link
Wav2Lip	checkpoints/	Link
ESRGAN	experiments/001_ESRGAN_x4_f64b23_custom16k_500k_B16G1_wandb/models/	Link
Face_Detection	face_detection/detection/sfd/	Link
Real-ESRGAN	Real-ESRGAN/gfpgan/weights/	Link
Real-ESRGAN	Real-ESRGAN/weights/	Link

Put input video to input_videos directory and input audio to input_audios directory.
Open run_final.sh file and modify following parameters:

filename=kennedy (just video file name without extension)

input_audio=input_audios/ai.wav (audio filename with extension)
Execute run_final.sh using following command:
```
bash run_final.sh
```
Outputs

output_videos_wav2lip directory contains video output generated by wav2lip algorithm.
frames_wav2lip directory contains frames extracted from video (generated by wav2lip algorithm).
frames_hd directory contains frames after performing super-resolution using Real-ESRGAN algorithm.
output_videos_hd directory contains final high quality video output generated by Wav2Lip-HD.

Results

The results produced by Wav2Lip-HD are in two forms, one is frames and other is videos. Both are shared below:

Example output frames

Frame by Wav2Lip	Optimized Frame

Example output videos

Video by Wav2Lip	Optimized Video
kennedy_low.mp4	kennedy_hd.mp4
mona_low.mp4	mona_hd-2.mp4

Acknowledgements

We would like to thank the following repositories and libraries for their contributions to our work:

The Wav2Lip repository, which is the core model of our algorithm that performs lip-sync.
The face-parsing.PyTorch repository, which provides us with a model for face segmentation.
The Real-ESRGAN repository, which provides the super resolution component for our algorithm.
ffmpeg, which we use for converting frames to video.

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
Real-ESRGAN		Real-ESRGAN
__pycache__		__pycache__
basicsr		basicsr
checkpoints		checkpoints
examples		examples
experiments/001_ESRGAN_x4_f64b23_custom16k_500k_B16G1_wandb/models		experiments/001_ESRGAN_x4_f64b23_custom16k_500k_B16G1_wandb/models
face_detection		face_detection
face_parsing		face_parsing
input_audios		input_audios
input_videos		input_videos
output_videos_hd		output_videos_hd
output_videos_wav2lip		output_videos_wav2lip
results		results
tb_logger		tb_logger
temp		temp
wav2lip_models		wav2lip_models
LICENSE		LICENSE
README.md		README.md
audio.py		audio.py
download_models.py		download_models.py
hparams.py		hparams.py
inference.py		inference.py
requirements.txt		requirements.txt
resizeframes.py		resizeframes.py
run_final.sh		run_final.sh
train.py		train.py
train_basicsr.yml		train_basicsr.yml
video2frames.py		video2frames.py
wav2lipHD.yaml		wav2lipHD.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Wav2Lip-HD: Improving Wav2Lip to achieve High-Fidelity Videos

Algorithm

Testing Model

Results

Example output frames

Example output videos

Acknowledgements

About

Releases

Packages

Languages

License

prometheus-alien/Wav2Lip-HD

Folders and files

Latest commit

History

Repository files navigation

Wav2Lip-HD: Improving Wav2Lip to achieve High-Fidelity Videos

Algorithm

Testing Model

Results

Example output frames

Example output videos

Acknowledgements

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages