publications

publications by categories in reversed chronological order. generated by jekyll-scholar.

The latest publication list can be found in my profile in Google Scholar.

2025

  1. arXiv
    LaoBench: A Large-Scale Multidimensional Lao Benchmark for Large Language Models
    J Gao, R Xuan, Z Kang, and 6 more authors
    arXiv preprint arXiv:2511.11334, Nov 2025
  2. arXiv
    Beyond Multiple Choice: Verifiable OpenQA for Robust Vision-Language RFT
    Y Liu, H Li, H Xu, and 8 more authors
    arXiv preprint arXiv:2511.17405, Nov 2025
  3. arXiv
    BIRD-INTERACT: Re-imagining Text-to-SQL Evaluation for Large Language Models via Lens of Dynamic Interactions
    Nan Huo, Xiaohan Xu, Jinyang Li, and 8 more authors
    arXiv preprint arXiv:2510.05318, Oct 2025
  4. arXiv
    FlagEval Findings Report: A Preliminary Evaluation of Large Reasoning Models on Automatically Verifiable Textual and Visual Questions
    Bowen Qin, C Yue, F Yin, and 7 more authors
    arXiv preprint arXiv:2509.17177, Sep 2025
  5. ACL
    FlagEvalMM: A Flexible Framework for Comprehensive Multimodal Model Evaluation
    Z He, Y Liu, J Zheng, and 5 more authors
    In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL), Aug 2025
  6. ACL
    FlagEval-Arena: A Side-by-Side Comparative Evaluation Platform for Large Language Models and Text-Driven AIGC
    JS Zheng, R Xuan, Bowen Qin, and 4 more authors
    In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL), Aug 2025
  7. arXiv
    Beyond Solving Math Quiz: Evaluating the Ability of Large Reasoning Models to Ask for Information
    Y Huang, Bowen Qin, C Huang, and 3 more authors
    arXiv preprint arXiv:2508.11252, Aug 2025
  8. arXiv
    SWE-SQL: Illuminating LLM Pathways to Solve User SQL Issues in Real-World Applications
    Jinyang Li, Xiaolong Li, Reynold Cheng, and 2 more authors
    arXiv preprint arXiv:2506.18951, Jun 2025
  9. arXiv
    Micro-Act: Mitigate Knowledge Conflict in Question Answering via Actionable Self-Reasoning
    Nan Huo, Jinyang Li, Bowen Qin, and 5 more authors
    arXiv preprint arXiv:2506.05278, Jun 2025
  10. arXiv
    The Price of a Second Thought: On the Evaluation of Reasoning Efficiency in Large Language Models
    Siqi Fan, Bowen Qin, Peng Han, and 3 more authors
    arXiv preprint arXiv:2505.22017, May 2025
  11. AAAI
    Legend: Leveraging Representation Engineering to Annotate Safety Margin for Preference Datasets
    Duanyu Feng, Bowen Qin, Chen Huang, and 3 more authors
    In Proceedings of the AAAI Conference on Artificial Intelligence, 2025

2024

  1. arXiv
    Towards understanding the influence of reward margin on preference model performance
    Bowen Qin, Duanyu Feng, and Xi Yang
    arXiv preprint arXiv:2404.04932, 2024
  2. arXiv
    Towards analyzing and understanding the limitations of DPO: A theoretical perspective
    Duanyu Feng, Bowen Qin, Chen Huang, and 2 more authors
    arXiv preprint arXiv:2404.04626, 2024
  3. ACL
    SHARE: An SLM-based Hierarchical Action CorREction Assistant for Text-to-SQL
    Ge Qu, Jinyang Li, Bowen Qin, and 4 more authors
    In Association for Computational Linguistics (ACL Findings), 2024
  4. ACL
    Before generation, align it! A novel and effective strategy for mitigating hallucinations in text-to-sql generation
    Ge Qu, Jinyang Li, Bowen Li, and 4 more authors
    In Association for Computational Linguistics (ACL Findings), 2024

2023

  1. arXiv
    HanFei-1.0: China’s First Large-Scale Legal Model
    Wanwei He, Jiabao Wen, Lei Zhang, and 7 more authors
    arXiv preprint, 2023
  2. AAAI
    Graphix-t5: Mixing pre-trained transformers with graph-aware layers for text-to-sql parsing
    Jinyang Li, Binyuan Hui, Reynold Cheng, and 7 more authors
    In Proceedings of the AAAI Conference on Artificial Intelligence, 2023
  3. arXiv
    FLM-101B: An open LLM and how to train it with $100k budget
    Xiang Li, Yiqun Yao, Xin Jiang, and 8 more authors
    arXiv preprint arXiv:2309.03852, 2023
  4. NeurIPS
    Can LLM already serve as a database interface? A big bench for large-scale database grounded text-to-sqls
    Jinyang Li, Binyuan Hui, Ge Qu, and 8 more authors
    In Advances in Neural Information Processing Systems, 2023

2022

  1. KBS
    Sdcup: Schema dependency-enhanced curriculum pre-training for table semantic parsing
    Bowen Qin, Lihan Wang, Binyuan Hui, and 5 more authors
    Knowledge-Based Systems, 2022
  2. ACL
    S^2 SQL: Injecting Syntax to Question-Schema Interaction Graph Encoder for Text-to-SQL Parsers
    Binyuan Hui, Ruiying Geng, Lihan Wang, and 4 more authors
    In Association for Computational Linguistics (ACL Findings), 2022
  3. COLING
    SUN: Exploring intrinsic uncertainties in text-to-SQL parsers
    Bowen Qin, Lihan Wang, Binyuan Hui, and 7 more authors
    In Proceedings of the 29th International Conference on Computational Linguistics, 2022
  4. SIGKDD
    Proton: Probing schema linking information from pre-trained language models for text-to-sql parsing
    Lihan Wang, Bowen Qin, Binyuan Hui, and 8 more authors
    In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2022
  5. arXiv
    A survey on text-to-sql parsing: Concepts, methods, and future directions
    Bowen Qin, Binyuan Hui, Lihan Wang, and 8 more authors
    arXiv preprint arXiv:2208.13629, 2022

2021

  1. IJFS
    FR–KDE: a hybrid fuzzy rule-based information fusion method with its application in biomedical classification
    Xingjian Song, Bowen Qin, and Fuyuan Xiao
    International Journal of Fuzzy Systems, 2021
  2. AAAI
    Exploring auxiliary reasoning tasks for task-oriented dialog systems with meta cooperative learning
    Bowen Qin, Min Yang, Lidong Bing, and 3 more authors
    In Proceedings of the AAAI Conference on Artificial Intelligence, 2021
  3. InfoSci
    A fuzzy preference-based Dempster-Shafer evidence theory for decision fusion
    Chaosheng Zhu, Bowen Qin, Fuyuan Xiao, and 2 more authors
    Information Sciences, 2021

2019

  1. IJDSN
    An improved method to determine basic probability assignment with interval number and its application in classification
    Bowen Qin and Fuyuan Xiao
    International Journal of Distributed Sensor Networks, 2019

2018

  1. Sensors
    A Weighted Combination Method for Conflicting Evidence in Multi-Sensor Data Fusion
    Fuyuan Xiao and Bowen Qin
    Sensors, 2018
  2. IEEE Access
    A Non-Parametric Method to Determine Basic Probability Assignment Based on Kernel Density Estimation
    Bowen Qin and Fuyuan Xiao
    IEEE Access, 2018