Follow
Se Jin Park
Title
Cited by
Cited by
Year
Synctalkface: Talking face generation with precise lip-syncing via audio-lip memory
SJ Park, M Kim, J Hong, J Choi, YM Ro
Proceedings of the AAAI Conference on Artificial Intelligence 36 (2), 2062-2070, 2022
522022
Multi-modality associative bridging through memory: Speech sound recollected from face video
M Kim, J Hong, SJ Park, YM Ro
Proceedings of the IEEE/CVF International Conference on Computer Vision, 296-306, 2021
382021
Cromm-vsr: Cross-modal memory augmented visual speech recognition
M Kim, J Hong, SJ Park, YM Ro
IEEE Transactions on Multimedia 24, 4342-4355, 2021
262021
Speech reconstruction with reminiscent sound via visual voice memory
J Hong, M Kim, SJ Park, YM Ro
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 3654-3667, 2021
182021
Av2av: Direct audio-visual speech to audio-visual speech translation with unified audio-visual speech representation
J Choi, SJ Park, M Kim, YM Ro
arXiv preprint arXiv:2312.02512, 2023
22023
Test-time adaptation for real image denoising via meta-transfer learning
A Gunawan, MA Nugroho, SJ Park
arXiv preprint arXiv:2207.02066, 2022
22022
Multilingual visual speech recognition with a single model by learning with discrete visual speech units
M Kim, JH Yeo, J Choi, SJ Park, YM Ro
arXiv preprint arXiv:2401.09802, 2024
12024
DF-3DFace: One-to-Many Speech Synchronized 3D Face Animation with Diffusion
SJ Park, J Hong, M Kim, YM Ro
arXiv preprint arXiv:2310.05934, 2023
12023
Text-Driven Talking Face Synthesis by Reprogramming Audio-Driven Models
J Choi, M Kim, SJ Park, YM Ro
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
2024
Persona Extraction Through Semantic Similarity for Emotional Support Conversation Generation
S Han, SJ Park, CW Kim, YM Ro
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
2024
Intuitive Multilingual Audio-Visual Speech Recognition with a Single-Trained Model
J Hong, SJ Park, YM Ro
arXiv preprint arXiv:2310.14946, 2023
2023
Multilingual Visual Speech Recognition with a Single Model using Visual Speech Unit
M Kim, J Yeo, J Choi, SJ Park, YM Ro
2023
Reprogramming Audio-driven Talking Face Synthesis into Text-driven
J Choi, M Kim, SJ Park, YM Ro
arXiv preprint arXiv:2306.16003, 2023
2023
Exploring Phonetic Context in Lip Movement for Authentic Talking Face Generation
SJ Park, M Kim, J Choi, YM Ro
arXiv preprint arXiv:2305.19556, 2023
2023
The system can't perform the operation now. Try again later.
Articles 1–14