Realistic Speech-Driven Talking Video Generation with Personalized Pose
In this work, we propose a method to transform a speaker’s speech information into a target character’s talking video; the method makes mouth-shape synchronization, expression, and body posture more realistic in the synthesized speaker video. This is a challenging task because changes of mo...
| Main Authors: | Xu Zhang, Liguo Weng |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | Wiley, 2020-01-01 |
| Series: | Complexity |
| Online Access: | http://dx.doi.org/10.1155/2020/6629634 |
Similar Items
- SPPNet: Single-Person Human Parsing and Pose Estimation in RGB Videos
  by: Aditi Verma, et al.
  Published: (2025-03-01)
- Directive Speech Act Performed by Male and Female Speakers’ in Ted Talk Language Learning Videos
  by: Donni Husnan, et al.
  Published: (2025-02-01)
- Benchmarking the First Realistic Dataset for Speech Separation
  by: Rawad MELHEM, et al.
  Published: (2025-07-01)
- AMT-Net: Adversarial Motion Transfer Network With Disentangled Shape and Pose for Realistic Image Animation
  by: Nega Asebe Teka, et al.
  Published: (2025-01-01)
- Cross-Domain Person Re-Identification Based on Multi-Branch Pose-Guided Occlusion Generation
  by: Pengnan Liu, et al.
  Published: (2025-01-01)