GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting Y Chen, Z Chen, C Zhang, F Wang, X Yang, Y Wang, Z Cai, L Yang, H Liu, ... IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2024, 2024 | 90 | 2024 |
IT3D: Improved Text-to-3D Generation with Explicit View Synthesis Y Chen, C Zhang, X Yang, Z Cai, G Yu, L Yang, G Lin AAAI Conference on Artificial Intelligence (AAAI 2024), 2024 | 47 | 2024 |
Trrnet: Tiered relation reasoning for compositional visual question answering X Yang, G Lin, F Lv, F Liu Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020 | 31 | 2020 |
Learn to Optimize Denoising Scores: A Unified and Improved Diffusion Prior for 3D Generation X Yang, Y Chen, C Chen, C Zhang, Y Xu, X Yang, F Liu, G Lin European Conference on Computer Vision (ECCV) 2024, 2024 | 16* | 2024 |
Self-Training Vision Language BERTs with a Unified Conditional Model X Yang, F Lv, F Liu, G Lin IEEE Transactions on Circuits and Systems for Video Technology, 2023 | 15 | 2023 |
Sculpt3D: Multi-View Consistent Text-to-3D Generation with Sparse 3D Prior C Cheng, X Yang, F Yang, C Feng, Z FU, CS Foo, G Lin, F Liu IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2024, 2024 | 9 | 2024 |
Effective End-to-End Vision Language Pretraining with Semantic Visual Loss X Yang, F Liu, G Lin IEEE Transactions on Multimedia, 2023 | 4 | 2023 |
Diverse and Stable 2D Diffusion Guided Text to 3D Generation with Noise Recalibration X Yang, F Liu, Y Xu, H Su, Q Wu, G Lin AAAI Conference on Artificial Intelligence (AAAI 2024), 2024 | 3 | 2024 |
Learning language to symbol and language to vision mapping for visual grounding S He, X Yang, G Lin Image and Vision Computing 122, 104451, 2022 | 3 | 2022 |
Deco: Decoupled human-centered diffusion video editing with motion consistency X Zhong, X Huang, X Yang, G Lin, Q Wu European Conference on Computer Vision, 352-370, 2025 | 2 | 2025 |
Magic-boost: Boost 3d generation with mutli-view conditioned diffusion F Yang, J Zhang, Y Shi, B Chen, C Zhang, H Zhang, X Yang, J Feng, ... arXiv preprint arXiv:2404.06429, 2024 | 2 | 2024 |
Neural Logic Vision Language Explainer X Yang, F Liu, G Lin IEEE Transactions on Multimedia, 2023 | 2 | 2023 |
Text-to-Image Rectified Flow as Plug-and-Play Priors X Yang, C Chen, X Yang, F Liu, G Lin arXiv preprint arXiv:2406.03293, 2024 | 1 | 2024 |
Neural radiance selector: Find the best 2D representations of 3D data for CLIP based 3D tasks X Yang, F Liu, G Lin Knowledge-Based Systems, 112002, 2024 | | 2024 |