Holistic++ Scene Understanding: Single-view 3D Holistic Scene Parsing and Human Pose Estimation with Human-Object Interaction and Physical Commonsense | Y Chen, S Huang, T Yuan, S Qi, Y Zhu, SC Zhu | Proceedings of the IEEE International Conference on Computer Vision, 8648-8657, 2019 | 134 | 2019 |
3D-VisTA: Pre-trained Transformer for 3D Vision and Text Alignment | Z Zhu, X Ma, Y Chen, Z Deng, S Huang, Q Li | Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 99 | 2023 |
Closed Loop Neural-Symbolic Learning via Integrating Neural Perception, Grammar Parsing, and Symbolic Reasoning | Q Li, S Huang, Y Hong, Y Chen, YN Wu, SC Zhu | International Conference on Machine Learning, 5884-5894, 2020 | 97 | 2020 |
Inferring shared attention in social scene videos | L Fan, Y Chen, P Wei, W Wang, SC Zhu | Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2018 | 95 | 2018 |
Humanise: Language-conditioned human motion generation in 3d scenes | Z Wang, Y Chen, T Liu, Y Zhu, W Liang, S Huang | Advances in Neural Information Processing Systems 35, 14959-14971, 2022 | 92 | 2022 |
Full-body articulated human-object interaction | N Jiang, T Liu, Z Cao, J Cui, Z Zhang, Y Chen, H Wang, Y Zhu, S Huang | Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 52* | 2023 |
LEMMA: A Multi-view Dataset for LEarning Multi-agent Multi-task Activities | B Jia, Y Chen, S Huang, Y Zhu, SC Zhu | European Conference on Computer Vision, 767-786, 2020 | 52 | 2020 |
YouRefIt: Embodied Reference Understanding with Language and Gesture | Y Chen, Q Li, D Kong, YL Kei, SC Zhu, T Gao, Y Zhu, S Huang | Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 42 | 2021 |
PerspectiveNet: 3D Object Detection from a Single RGB Image via Perspective Points | S Huang, Y Chen, T Yuan, S Qi, Y Zhu, SC Zhu | Advances in Neural Information Processing Systems, 8905-8917, 2019 | 40 | 2019 |
Sceneverse: Scaling 3d vision-language learning for grounded scene understanding | B Jia, Y Chen, H Yu, Y Wang, X Niu, T Liu, Q Li, S Huang | European Conference on Computer Vision, 289-310, 2025 | 39 | 2025 |
PartAfford: Part-level Affordance Discovery from 3D Objects | C Xu, Y Chen, H Wang, SC Zhu, Y Zhu, S Huang | arXiv preprint arXiv:2202.13519, 2022 | 28 | 2022 |
Scaling up dynamic human-scene interaction modeling | N Jiang, Z Zhang, H Li, X Ma, Z Wang, Y Chen, T Liu, Y Zhu, S Huang | Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 27 | 2024 |
Detecting Human-Object Contact in Images | Y Chen, SK Dwivedi, MJ Black, D Tzionas | Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 25 | 2023 |
Move as You Say Interact as You Can: Language-guided Human Motion Generation with Scene Affordance | Z Wang, Y Chen, B Jia, P Li, J Zhang, J Zhang, T Liu, Y Zhu, W Liang, ... | Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 23 | 2024 |
Unifying 3d vision-language understanding via promptable queries | Z Zhu, Z Zhang, X Ma, X Niu, Y Chen, B Jia, Z Deng, S Huang, Q Li | European Conference on Computer Vision, 188-206, 2025 | 14 | 2025 |
Single-view 3d scene reconstruction with high-fidelity shape and texture | Y Chen, J Ni, N Jiang, Y Zhang, Y Zhu, S Huang | 2024 International Conference on 3D Vision (3DV), 1456-1467, 2024 | 11 | 2024 |
Top-Down Attention in End-to-End Spoken Language Understanding | Y Chen, W Lu, A Mottini, LE Li, J Droppo, Z Du, B Zeng | ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 11 | 2021 |
Task-oriented sequential grounding in 3d scenes | Z Zhang, Z Zhu, P Li, T Liu, X Ma, Y Chen, B Jia, S Huang, Q Li | arXiv preprint arXiv:2408.04034, 2024 | 4 | 2024 |
Autonomous character-scene interaction synthesis from text instruction | N Jiang, Z He, Z Wang, H Li, Y Chen, S Huang, Y Zhu | SIGGRAPH Asia 2024 Conference Papers, 1-11, 2024 | 3 | 2024 |
Phyrecon: Physically plausible neural scene reconstruction | J Ni, Y Chen, B Jing, N Jiang, B Wang, B Dai, P Li, Y Zhu, SC Zhu, ... | arXiv preprint arXiv:2404.16666, 2024 | 3 | 2024 |