PandaLM: An automatic evaluation benchmark for LLM instruction tuning optimization Y Wang, Z Yu, Z Zeng, L Yang, C Wang, H Chen, C Jiang, R Xie, J Wang, ... arXiv preprint arXiv:2306.05087, 2023 | 70 | 2023 |
Similarity learning for cover song identification using cross-similarity matrices of multi-level deep sequences C Jiang, D Yang, X Chen ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 13 | 2020 |
Hallucination Augmented Contrastive Learning for Multimodal Large Language Model C Jiang, H Xu, M Dong, J Chen, W Ye, M Yan, Q Ye, J Zhang, F Huang, ... arXiv:2312.06968, 2023 | 12 | 2023 |
Learn a robust representation for cover song identification via aggregating local and global music temporal context C Jiang, D Yang, X Chen 2020 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2020 | 5 | 2020 |
Exploiting Pseudo Image Captions for Multimodal Summarization C Jiang, R Xie, W Ye, J Sun, S Zhang Findings of the Association for Computational Linguistics: ACL 2023, 161–175, 2023 | 4 | 2023 |
PandaLM: Reproducible and automated language model assessment Y Wang, Z Yu, Z Zeng, L Yang, Q Heng, C Wang, H Chen, ... | 4 | 2023 |
TRIPS: Efficient Vision-and-Language Pre-training with Text-Relevant Image Patch Selection C Jiang, H Xu, C Li, M Yan, W Ye, S Zhang, B Bi, S Huang Proceedings of the 2022 Conference on Empirical Methods in Natural Language …, 2022 | 4 | 2022 |
Vision Language Pre-training by Contrastive Learning with Cross-Modal Similarity Regulation C Jiang, W Ye, H Xu, S Zhang, J Zhang, F Huang Proceedings of the 61st Annual Meeting of the Association for Computational …, 2023 | 3 | 2023 |
COPA: Efficient Vision-Language Pre-training through Collaborative Object- and Patch-Text Alignment C Jiang, H Xu, W Ye, Q Ye, C Li, M Yan, B Bi, S Zhang, F Huang, J Zhang Proceedings of the 31st ACM International Conference on Multimedia, 4480-4491, 2023 | 2 | 2023 |
BUS: Efficient and Effective Vision-language Pre-training with Bottom-Up Patch Summarization C Jiang, H Xu, W Ye, Q Ye, C Li, M Yan, B Bi, S Zhang, F Huang, S Huang Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 2 | 2023 |
Hal-Eval: A Universal and Fine-grained Hallucination Evaluation Framework for Large Vision Language Models C Jiang, W Ye, M Dong, H Jia, H Xu, M Yan, J Zhang, S Zhang arXiv preprint arXiv:2402.15721, 2024 | 1 | 2024 |
TiMix: Text-aware Image Mixing for Effective Vision-Language Pre-training C Jiang, W Ye, H Xu, Q Ye, M Yan, J Zhang, S Zhang arXiv:2312.08846, 2023 | 1 | 2023 |
Efficient Vision-and-Language Pre-training with Text-Relevant Image Patch Selection W Ye, C Jiang, H Xu, C Ye, C Li, M Yan, S Zhang, S Huang, F Huang arXiv preprint arXiv:2403.07883, 2024 | | 2024 |