JoyHallo: Digital human model for Mandarin
One-shot Audio-driven 3D Talking Head Synthesis via Generative Prior, CVPRW 2024
DoyenTalker uses deep learning techniques to generate personalized avatar videos that speak user-provided text in a specified voice. The system utilizes Coqui TTS for text-to-speech generation, along with various face rendering and animation techniques to create a video where the given avatar articulates the speech.
[ECCV 2024] Dyadic Interaction Modeling for Social Behavior Generation
[ECCV 2024] ScanTalk: 3D Talking Heads from Unregistered Scans
[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"
Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Wav2Lip UHQ extension for Automatic1111
PyTorch implementation of the paper "Emotionally Enhanced Talking Face Generation" (ICCVW'23 and ACM-MMW'23)
[CVPR 2023] Implementation of "Identity-Preserving Talking Face Generation With Landmark and Appearance Priors"
Call the SadTalker API through ModelScope with a single line of code
Using a single image and just 10 seconds of sample audio, our project enables you to create a video where it appears as if you're speaking the desired text.
A curated list of resources on audio-driven talking face generation