LITP Spring Symposium: Fine-Tuning Small Reasoning Models for Quantum Field Theory
Talk, The 11th LITP Spring Symposium: Theoretical Physics and AI, Room 340, West Hall
Despite the growing application of Large Language Models (LLMs) to theoretical physics, there is little academic exploration into how domain-specific physics reasoning ability develops while training these models. To investigate this, we perform the first academic fine-tuning study of small (7B-parameter) reasoning models dedicated specifically to theoretical physics. Selecting Quantum Field Theory (QFT) as our primary domain, we developed a robust data generation pipeline to create synthetic problems and adapt existing human-authored problems for training, generating over 2,500 synthetic problems alongside a curated collection of human-adapted problems. We conduct both Reinforcement Learning (RL) and Supervised Fine-Tuning (SFT) experiments, benchmark performance gains and generalization to other physics domains, and analyze how reasoning errors evolve during training.
