Multi-modal knowledge graph construction and application: A survey X Zhu, Z Li, X Wang, X Jiang, P Sun, X Wang, Y Xiao, NJ Yuan IEEE Transactions on Knowledge and Data Engineering 36 (2), 715-735, 2022 | 198 | 2022 |
Shifting more attention to visual backbone: Query-modulated refinement networks for end-to-end visual grounding J Ye, J Tian, M Yan, X Yang, X Wang, J Zhang, L He, X Lin Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 75 | 2022 |
CAT-MNER: multimodal named entity recognition with knowledge-refined cross-modal attention X Wang, J Ye, Z Li, J Tian, Y Jiang, M Yan, J Zhang, Y Xiao 2022 IEEE international conference on multimedia and expo (ICME), 1-6, 2022 | 47 | 2022 |
Bayes embedding (bem) refining representation by integrating knowledge graphs and behavior-specific networks Y Ye, X Wang, J Yao, K Jia, J Zhou, Y Xiao, H Yang Proceedings of the 28th ACM international conference on information and …, 2019 | 36 | 2019 |
Infiagent-dabench: Evaluating agents on data analysis tasks X Hu, Z Zhao, S Wei, Z Chai, Q Ma, G Wang, X Wang, J Su, J Xu, M Zhu, ... arXiv preprint arXiv:2401.05507, 2024 | 30 | 2024 |
WikiDiverse: a multimodal entity linking dataset with diversified contextual topics and entity types X Wang, J Tian, M Gui, Z Li, R Wang, M Yan, L Chen, Y Xiao Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022 | 21 | 2022 |
PromptMNER: Prompt-Based Entity-Related Visual Clue Extraction and Integration for Multimodal Named Entity Recognition X Wang, J Tian, M Gui, Z Li, J Ye, M Yan, Y Xiao International Conference on Database Systems for Advanced Applications, 297-305, 2022 | 20 | 2022 |
Agree: Aligning cross-modal entities for image-text retrieval upon vision-language pre-trained models X Wang, L Li, Z Li, X Wang, X Zhu, C Wang, J Huang, Y Xiao Proceedings of the Sixteenth ACM International Conference on Web Search and …, 2023 | 13 | 2023 |
Multi-task entity linking with supervision from a taxonomy X Wang, L Chen, W Zhu, Y Ni, G Xie, D Yang, Y Xiao Knowledge and Information Systems 65 (10), 4335-4358, 2023 | 7 | 2023 |
An Expert is Worth One Token: Synergizing Multiple Expert LLMs as Generalist via Expert Token Routing Z Chai, G Wang, J Su, T Zhang, X Huang, X Wang, J Xu, J Yuan, H Yang, ... arXiv preprint arXiv:2403.16854, 2024 | 5 | 2024 |
FullStack Bench: Evaluating LLMs as Full Stack Coder S Liu, H Zhu, J Liu, S Xin, A Li, R Long, L Chen, J Yang, J Xia, ZY Peng, ... arXiv preprint arXiv:2412.00535, 2024 | 4 | 2024 |
Merging LoRAs like Playing LEGO: Pushing the Modularity of LoRA to Extremes Through Rank-Wise Clustering Z Zhao, T Shen, D Zhu, Z Li, J Su, X Wang, K Kuang, F Wu arXiv preprint arXiv:2409.16167, 2024 | 3 | 2024 |
Uniqrnet: Unifying referring expression grounding and segmentation with qrnet J Ye, J Tian, M Yan, H Xu, Q Ye, Y Shi, X Yang, X Wang, J Zhang, L He, ... ACM Transactions on Multimedia Computing, Communications and Applications 20 …, 2024 | 2 | 2024 |
OVEL: Large Language Model as Memory Manager for Online Video Entity Linking H Zhao, X Wang, S Chen, Z Li, X Zheng, Y Xiao arXiv preprint arXiv:2403.01411, 2024 | 2 | 2024 |
Flickr30K-CFQ: A Compact and Fragmented Query Dataset for Text-image Retrieval H Liu, Y Song, X Wang, X Zhu, Z Li, W Song, T Li International Conference on Database Systems for Advanced Applications, 419-434, 2024 | 1 | 2024 |
CONSTRUCTURE: Benchmarking CONcept STRUCTUre REasoning for Multimodal Large Language Models Z Zha, X Zhu, Y Xu, C Huang, J Liu, Z Li, X Wang, Y Xiao, B Yang, X Xu Findings of the Association for Computational Linguistics: EMNLP 2024, 4954-4968, 2024 | | 2024 |
BabelBench: An Omni Benchmark for Code-Driven Analysis of Multimodal and Multistructured Data X Wang, Q Cui, Y Tao, Y Wang, Z Chai, X Han, B Liu, J Yuan, J Su, ... arXiv preprint arXiv:2410.00773, 2024 | | 2024 |