Loop invariant generation: a hybrid framework of reasoning optimised LLMs and SMT solvers

dc.contributor.authorKumar, Dhruv
dc.date.accessioned2025-08-14T10:09:10Z
dc.date.available2025-08-14T10:09:10Z
dc.date.issued2025-08
dc.description.abstractLoop invariants are essential for proving the correctness of programs with loops. Developing loop invariants is challenging, and fully automatic synthesis cannot be guaranteed for arbitrary programs. Some approaches have been proposed to synthesize loop invariants using symbolic techniques and more recently using neural approaches. These approaches are able to correctly synthesize loop invariants only for subsets of standard benchmarks. In this work, we investigate whether modern, reasoning-optimized large language models can do better. We integrate OpenAI's O1, O1-mini, and O3-mini into a tightly coupled generate-and-check pipeline with the Z3 SMT solver, using solver counterexamples to iteratively guide invariant refinement. We use Code2Inv benchmark, which provides C programs along with their formal preconditions and postconditions. On this benchmark of 133 tasks, our framework achieves 100% coverage (133 out of 133), outperforming the previous best of 107 out of 133, while requiring only 1-2 model proposals per instance and 14-55 seconds of wall-clock time. These results demonstrate that LLMs possess latent logical reasoning capabilities which can help automate loop invariant synthesis. While our experiments target C-specific programs, this approach should be generalizable to other imperative languages.en_US
dc.identifier.urihttps://arxiv.org/abs/2508.00419
dc.identifier.urihttps://dspace.bits-pilani.ac.in/handle/123456789/19198
dc.language.isoenen_US
dc.subjectComputer Scienceen_US
dc.subjectLoop invariant synthesisen_US
dc.subjectSymbolic reasoningen_US
dc.subjectInvariant refinementen_US
dc.subjectLanguage model reasoningen_US
dc.titleLoop invariant generation: a hybrid framework of reasoning optimised LLMs and SMT solversen_US
dc.typePreprinten_US

Files