Selected Publications
"Reinforcement Learning on Pre-Training Data", arXiv:2509.19249 (2025), (Technical Report).
"Grove MoE: Towards Efficient and Superior MoE LLMs with Adjugate Experts", arXiv:2508.07785 (2025).
"On-Policy Optimization with Group Equivalent Preference for Multi-Programming Language Understanding", The Thirty-Ninth Annual Conference on Neural Information Processing Systems (NeurIPS 2025).
"ToTRL: Unlock LLM Tree-of-Thoughts Reasoning Potential through Puzzles Solving", arXiv:2505.12717 (2025).
"Efficient OpAmp Adaptation for Zoom Attention to Golden Contexts", The 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025 Main).
"Divergent Thoughts toward One Goal: LLM-based Multi-Agent Collaboration System for Electronic Design Automation", The 2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics (NAACL 2025 Main).
"Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks", The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024 Main).
"ChatEDA: A Large Language Model Powered Autonomous Agent for EDA", IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), vol. 43, no. 10, pp. 3184–3197, 2024.
"p-Laplacian Adaptation for Generative Pre-trained Vision-Language Models", The 38th Annual AAAI Conference on Artificial Intelligence (AAAI 2024 Oral).