‪Se Jin Park‬ - ‪Google Scholar‬

Get my own profile

Cited by

	All	Since 2019
Citations	140	140
h-index	4	4
i10-index	4	4

0

80

40

20212022202320243 29 77 31

Co-authors

Yong Man RoProfessor of Electrical Engineering, KAISTVerified email at kaist.ac.kr
Minsu KimKorea Advanced Institute of Science and TechnologyVerified email at kaist.ac.kr
Joanna HongPh.D. at Korea Advanced Institute of Science and TechnologyVerified email at kaist.ac.kr
Jeongsoo ChoiKAISTVerified email at kaist.ac.kr
Jeong Hun YeoKorea Advanced Institute of Science and TechnologyVerified email at kaist.ac.kr

Se Jin Park

Se Jin Park

Korea Advanced Institute of Science and Technology (KAIST)

Verified email at kaist.ac.kr - Homepage

multimodal learning image/video generation speech processing


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Synctalkface: Talking face generation with precise lip-syncing via audio-lip memory SJ Park, M Kim, J Hong, J Choi, YM Ro Proceedings of the AAAI Conference on Artificial Intelligence 36 (2), 2062-2070, 2022	52	2022
Multi-modality associative bridging through memory: Speech sound recollected from face video M Kim, J Hong, SJ Park, YM Ro Proceedings of the IEEE/CVF International Conference on Computer Vision, 296-306, 2021	38	2021
Cromm-vsr: Cross-modal memory augmented visual speech recognition M Kim, J Hong, SJ Park, YM Ro IEEE Transactions on Multimedia 24, 4342-4355, 2021	26	2021
Speech reconstruction with reminiscent sound via visual voice memory J Hong, M Kim, SJ Park, YM Ro IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 3654-3667, 2021	18	2021
Av2av: Direct audio-visual speech to audio-visual speech translation with unified audio-visual speech representation J Choi, SJ Park, M Kim, YM Ro arXiv preprint arXiv:2312.02512, 2023	2	2023
Test-time adaptation for real image denoising via meta-transfer learning A Gunawan, MA Nugroho, SJ Park arXiv preprint arXiv:2207.02066, 2022	2	2022
Multilingual visual speech recognition with a single model by learning with discrete visual speech units M Kim, JH Yeo, J Choi, SJ Park, YM Ro arXiv preprint arXiv:2401.09802, 2024	1	2024
DF-3DFace: One-to-Many Speech Synchronized 3D Face Animation with Diffusion SJ Park, J Hong, M Kim, YM Ro arXiv preprint arXiv:2310.05934, 2023	1	2023
Text-Driven Talking Face Synthesis by Reprogramming Audio-Driven Models J Choi, M Kim, SJ Park, YM Ro ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024		2024
Persona Extraction Through Semantic Similarity for Emotional Support Conversation Generation S Han, SJ Park, CW Kim, YM Ro ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024		2024
Intuitive Multilingual Audio-Visual Speech Recognition with a Single-Trained Model J Hong, SJ Park, YM Ro arXiv preprint arXiv:2310.14946, 2023		2023
Multilingual Visual Speech Recognition with a Single Model using Visual Speech Unit M Kim, J Yeo, J Choi, SJ Park, YM Ro		2023
Reprogramming Audio-driven Talking Face Synthesis into Text-driven J Choi, M Kim, SJ Park, YM Ro arXiv preprint arXiv:2306.16003, 2023		2023
Exploring Phonetic Context in Lip Movement for Authentic Talking Face Generation SJ Park, M Kim, J Choi, YM Ro arXiv preprint arXiv:2305.19556, 2023		2023

The system can't perform the operation now. Try again later.

Articles 1–14