Follow
Xuwu Wang
Xuwu Wang
ByteDance
Verified email at bytedance.com
Title
Cited by
Cited by
Year
Multi-modal knowledge graph construction and application: A survey
X Zhu, Z Li, X Wang, X Jiang, P Sun, X Wang, Y Xiao, NJ Yuan
IEEE Transactions on Knowledge and Data Engineering 36 (2), 715-735, 2022
1982022
Shifting more attention to visual backbone: Query-modulated refinement networks for end-to-end visual grounding
J Ye, J Tian, M Yan, X Yang, X Wang, J Zhang, L He, X Lin
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
752022
CAT-MNER: multimodal named entity recognition with knowledge-refined cross-modal attention
X Wang, J Ye, Z Li, J Tian, Y Jiang, M Yan, J Zhang, Y Xiao
2022 IEEE international conference on multimedia and expo (ICME), 1-6, 2022
472022
Bayes embedding (bem) refining representation by integrating knowledge graphs and behavior-specific networks
Y Ye, X Wang, J Yao, K Jia, J Zhou, Y Xiao, H Yang
Proceedings of the 28th ACM international conference on information and …, 2019
362019
Infiagent-dabench: Evaluating agents on data analysis tasks
X Hu, Z Zhao, S Wei, Z Chai, Q Ma, G Wang, X Wang, J Su, J Xu, M Zhu, ...
arXiv preprint arXiv:2401.05507, 2024
302024
WikiDiverse: a multimodal entity linking dataset with diversified contextual topics and entity types
X Wang, J Tian, M Gui, Z Li, R Wang, M Yan, L Chen, Y Xiao
Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022
212022
PromptMNER: Prompt-Based Entity-Related Visual Clue Extraction and Integration for Multimodal Named Entity Recognition
X Wang, J Tian, M Gui, Z Li, J Ye, M Yan, Y Xiao
International Conference on Database Systems for Advanced Applications, 297-305, 2022
202022
Agree: Aligning cross-modal entities for image-text retrieval upon vision-language pre-trained models
X Wang, L Li, Z Li, X Wang, X Zhu, C Wang, J Huang, Y Xiao
Proceedings of the Sixteenth ACM International Conference on Web Search and …, 2023
132023
Multi-task entity linking with supervision from a taxonomy
X Wang, L Chen, W Zhu, Y Ni, G Xie, D Yang, Y Xiao
Knowledge and Information Systems 65 (10), 4335-4358, 2023
72023
An Expert is Worth One Token: Synergizing Multiple Expert LLMs as Generalist via Expert Token Routing
Z Chai, G Wang, J Su, T Zhang, X Huang, X Wang, J Xu, J Yuan, H Yang, ...
arXiv preprint arXiv:2403.16854, 2024
52024
FullStack Bench: Evaluating LLMs as Full Stack Coder
S Liu, H Zhu, J Liu, S Xin, A Li, R Long, L Chen, J Yang, J Xia, ZY Peng, ...
arXiv preprint arXiv:2412.00535, 2024
42024
Merging LoRAs like Playing LEGO: Pushing the Modularity of LoRA to Extremes Through Rank-Wise Clustering
Z Zhao, T Shen, D Zhu, Z Li, J Su, X Wang, K Kuang, F Wu
arXiv preprint arXiv:2409.16167, 2024
32024
Uniqrnet: Unifying referring expression grounding and segmentation with qrnet
J Ye, J Tian, M Yan, H Xu, Q Ye, Y Shi, X Yang, X Wang, J Zhang, L He, ...
ACM Transactions on Multimedia Computing, Communications and Applications 20 …, 2024
22024
OVEL: Large Language Model as Memory Manager for Online Video Entity Linking
H Zhao, X Wang, S Chen, Z Li, X Zheng, Y Xiao
arXiv preprint arXiv:2403.01411, 2024
22024
Flickr30K-CFQ: A Compact and Fragmented Query Dataset for Text-image Retrieval
H Liu, Y Song, X Wang, X Zhu, Z Li, W Song, T Li
International Conference on Database Systems for Advanced Applications, 419-434, 2024
12024
CONSTRUCTURE: Benchmarking CONcept STRUCTUre REasoning for Multimodal Large Language Models
Z Zha, X Zhu, Y Xu, C Huang, J Liu, Z Li, X Wang, Y Xiao, B Yang, X Xu
Findings of the Association for Computational Linguistics: EMNLP 2024, 4954-4968, 2024
2024
BabelBench: An Omni Benchmark for Code-Driven Analysis of Multimodal and Multistructured Data
X Wang, Q Cui, Y Tao, Y Wang, Z Chai, X Han, B Liu, J Yuan, J Su, ...
arXiv preprint arXiv:2410.00773, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–17