Grounded SAM: Assembling open-world models for diverse visual tasks T Ren, S Liu, A Zeng, J Lin, K Li, H Cao, J Chen, X Huang, Y Chen, F Yan, ... arXiv preprint arXiv:2401.14159, 2024 | 21 | 2024 |
Revisiting scene text recognition: A data perspective Q Jiang, J Wang, D Peng, C Liu, L Jin Proceedings of the IEEE/CVF international conference on computer vision …, 2023 | 10 | 2023 |
T-Rex: Counting by Visual Prompting Q Jiang, F Li, T Ren, S Liu, Z Zeng, K Yu, L Zhang arXiv preprint arXiv:2311.13596, 2023 | 4 | 2023 |
Visual In-Context Prompting F Li, Q Jiang, H Zhang, T Ren, S Liu, X Zou, H Xu, H Li, C Li, J Yang, ... arXiv preprint arXiv:2311.13601, 2023 | 2 | 2023 |
T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy Q Jiang, F Li, Z Zeng, T Ren, S Liu, L Zhang arXiv preprint arXiv:2403.14610, 2024 | | 2024 |