Table Reasoning
[EMNLP 2025] RoT: Enhancing Table Reasoning with Iterative Row-Wise Traversals
[EMNLP 2025 Findings] MULTITAT: Benchmarking Multilingual Table-and-Text Question Answering
[ACL 2025 Findings] SCITAT: A Question Answering Benchmark for Scientific Tables and Text Covering Diverse Reasoning Types
[FCS] A Survey of Table Reasoning with Large Language Models
[preprint] FLEXTAF: Enhancing Table Reasoning with Flexible Tabular Formats
[ACL 2024] Enhancing Numerical Reasoning with the Guidance of Reliable Reasoning Processes
Evaluation
[ICLR 2026] FinSearchComp: Towards a Realistic, Expert-Level Evaluation of Financial Search and Reasoning
- Liang Hu, Jianpeng Jiao, Jiashuo Liu, Yanle Ren, Zhoufutu Wen, Kaiyuan Zhang, Xuanliang Zhang, Xiang Gao, Tianci He, Fei Hu, Yali Liao, Zaiyuan Wang, Chenghao Yang, Qianyu Yang, Mingren Yin, Zhiyuan Zeng, Ge Zhang, Xinyi Zhang, Xiying Zhao, Zhenwei Zhu, Hongseok Namkoong, Wenhao Huang, Yuwen Tang (Core contributors, $\alpha$-$\beta$ order.)
- Paper | Data
[ICLR 2026] DiscoX: Benchmarking Discourse-Level Translation in Expert Domains
- Xiying Zhao, Zhoufutu Wen, Zhixuan Chen, Jingzhe Ding, Jianpeng Jiao, Shuai Li, Xi Li, Danni Liang, Shengda Long, Qianqian Liu, Xianbo Wu, Hongwan Gao, Xiang Gao, Liang Hu, Jiashuo Liu, Mengyun Liu, Weiran Shi, Chenghao Yang, Qianyu Yang, Xuanliang Zhang, Ge Zhang, Wenhao Huang, Yuwen Tang
- Paper | Data
[preprint] MME-CC: A Challenging Multi-Modal Evaluation Benchmark of Cognitive Capacity
Mechanistic Interpretability
[preprint] How Do Language Models Understand Tables? A Mechanistic Analysis of Cell Location
- Xuanliang Zhang, Dingzirui Wang, Keyan Xu, Qingfu Zhu, Wanxiang Che
- Paper
[ICLR 2026] Bounds of Chain-of-Thought Robustness: Reasoning Steps, Embed Norms, and Beyond
[preprint] Multi-Layer Attention is the Amplifier of Demonstration Effectiveness
- Dingzirui Wang, Xuanliang Zhang, Keyan Xu, Qingfu Zhu, Wanxiang Che, Yang Deng
- Paper
[preprint] Learning-to-Context Slope: Evaluating In-Context Learning Effectiveness Beyond Performance Illusions
- Dingzirui Wang, Xuanliang Zhang, Keyan Xu, Qingfu Zhu, Wanxiang Che, Yang Deng
- Paper
Text-to-SQL
[EMNLP 2025 Findings] DAC: Decomposed Automation Correction for Text-to-SQL
[ACL 2025 Demo] Abacus-SQL: A Text-to-SQL System Empowering Cross-Domain and Open-Domain Database Retrieval
[COLING 2025] MURRE: Multi-Hop Table Retrieval with Removal for Open-Domain Text-to-SQL
[EMNLP 2024 Findings] Improving Demonstration Diversity by Human-Free Fusing for Text-to-SQL
Prompt Engineering
[preprint] V-SYNTHESIS: Task-Agnostic Synthesis of Consistent and Diverse In-Context Demonstrations from Scratch via V-Entropy
- Dingzirui Wang, Xuanliang Zhang, Keyan Xu, Qingfu Zhu, Wanxiang Che, Yang Deng
- Paper
[preprint] Format-Adapter: Improving Reasoning Capability of LLMs by Adapting Suitable Format
- Dingzirui Wang, Xuanliang Zhang, Rongyu Cao, Longxu Dou, Xianzhen Luo, Yingwei Ma, Qingfu Zhu, Wanxiang Che, Binhua Li, Fei Huang, Yongbin Li
- Paper
[preprint] In-Context Transfer Learning: Demonstration Synthesis by Transferring Similar Tasks
Others
[preprint] A Survey on Latent Reasoning
- Rui-Jie Zhu, Tianhao Peng, Tianhao Cheng, Xingwei Qu, Jinfa Huang, Dawei Zhu, Hao Wang, Kaiwen Xue, Xuanliang Zhang, Yong Shan, Tianle Cai, Taylor Kergan, Assel Kembay, Andrew Smith, Chenghua Lin, Binh Nguyen, Yuqi Pan, Yuhong Chou, Zefan Cai, Zhenhe Wu, Yongchi Zhao, Tianyu Liu, Jian Yang, Wangchunshu Zhou, Chujie Zheng, Chongxuan Li, Yuyin Zhou, Zhoujun Li, Zhaoxiang Zhang, Jiaheng Liu, Ge Zhang, Wenhao Huang, Jason Eshraghian
- Paper | Code
[AAAI 2024] Semantic-Guided Generative Image Augmentation Method with Diffusion Models for Image Classification
- Bohan Li, Xiao Xu, Xinghao Wang, Yutai Hou, Yunlong Feng, Feng Wang, Xuanliang Zhang, Qingfu Zhu, Wanxiang Che
- Paper