Flm-101b: An open llm and how to train it with $100 k budget X Li, Y Yao, X Jiang, X Fang, X Meng, S Fan, P Han, J Li, L Du, B Qin, ... arXiv preprint arXiv:2309.03852, 2023 | 22 | 2023 |
Not all layers of llms are necessary during inference S Fan, X Jiang, X Li, X Meng, P Han, S Shang, A Sun, Y Wang, Z Wang arXiv preprint arXiv:2403.02181, 2024 | 21 | 2024 |
Route search and planning: A survey K Li, X Rao, XB Pang, L Chen, S Fan Big data research 26, 100246, 2021 | 15 | 2021 |
Empmff: A multi-factor sequence fusion framework for empathetic response generation X Pang, Y Wang, S Fan, L Chen, S Shang, P Han Proceedings of the ACM Web Conference 2023, 1754-1764, 2023 | 6 | 2023 |
Interactive Information Extraction by Semantic Information Graph. S Fan, Y Wang, J Li, Z Zhang, S Shang, P Han IJCAI, 4100-4106, 2022 | 6 | 2022 |
Few-shot relation extraction towards special interests S Fan, B Zhang, S Zhou, M Wang, K Li Big Data Research 26, 100273, 2021 | 6 | 2021 |
SpikeLM: Towards General Spike-Driven Language Modeling via Elastic Bi-Spiking Mechanisms X Xing, Z Zhang, Z Ni, S Xiao, Y Ju, S Fan, Y Wang, J Zhang, G Li arXiv preprint arXiv:2406.03287, 2024 | 4 | 2024 |
UaMC: user-augmented conversation recommendation via multi-modal graph learning and context mining S Fan, Y Wang, X Pang, L Chen, P Han, S Shang World Wide Web 26 (6), 4109-4129, 2023 | 3 | 2023 |
Mitigating Training Imbalance in LLM Fine-Tuning via Selective Parameter Merging Y Ju, Z Ni, X Xing, Z Zeng, S Fan, Z Zhang arXiv preprint arXiv:2410.03743, 2024 | | 2024 |
NanoLM: An Affordable LLM Study Benchmark via Accurate Loss Prediction Across Scales S Fan, X Huang, X Fang, Y Yao, X Li, Z Ni, X Jiang, X Meng, P Han, ... | | |