Autoformalization for Agda via Fine-tuning Large Language Models
| dc.contributor.author | Huang, Pei | |
| dc.contributor.department | Chalmers tekniska högskola / Institutionen för data och informationsteknik | sv |
| dc.contributor.department | Chalmers University of Technology / Department of Computer Science and Engineering | en |
| dc.contributor.examiner | Coquand, Thierry | |
| dc.contributor.supervisor | Ranta, Aarne | |
| dc.contributor.supervisor | Coquand, Thierry | |
| dc.date.accessioned | 2026-01-09T09:58:34Z | |
| dc.date.issued | 2025 | |
| dc.date.submitted | | |
| dc.description.abstract | Autoformalization, the translation of informal mathematical statements into fully formal, machine-checkable code, has recently attracted interest thanks to advances in large language models (LLMs). However, progress has been hindered by the lack of high-quality parallel corpora pairing natural language with formal code, especially for less popular systems such as Agda. We address this gap by introducing SMAD (Synthetic Multilanguage Autoformalization Dataset), a 400K-entry, 4-to-3 parallel corpus covering four formal languages (Dedukti, Agda, Coq, Lean) and three natural languages (English, French, Swedish), generated via the Informath project. We fine-tune the open-source Qwen2.5-7B-Instruct model on SMAD and achieve substantial gains: BLEU-4 improves from 32.90 to 76.16, and the Agda syntax error rate falls from 98.43% to under 8%. We further explore joint training across multiple formal and natural languages, demonstrating that multilingual and multi-formal-language regimes yield notable improvements in the low-resource Agda setting. Our work establishes the first LLM-based Agda autoformalization system and provides systematic insights into model scaling, multilinguality, and dataset construction for future research. | |
| dc.identifier.coursecode | DATX05 | |
| dc.identifier.uri | http://hdl.handle.net/20.500.12380/310852 | |
| dc.language.iso | eng | |
| dc.setspec.uppsok | Technology | |
| dc.subject | Autoformalization | |
| dc.subject | Multilingual Formalization | |
| dc.subject | Fine-tuning | |
| dc.subject | LoRA | |
| dc.subject | Large Language Models | |
| dc.subject | Qwen2.5-7B-Instruct | |
| dc.subject | Agda | |
| dc.subject | Formal Proof Assistants | |
| dc.subject | Synthetic Multilanguage Autoformalization Dataset (SMAD) | |
| dc.subject | GF | |
| dc.title | Autoformalization for Agda via Fine-tuning Large Language Models | |
| dc.type.degree | Examensarbete för masterexamen | sv |
| dc.type.degree | Master's Thesis | en |
| dc.type.uppsok | H | |
| local.programme | Data science and AI (MPDSC), MSc |

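The abstract and subject fields name a fine-tuning setup (LoRA adapters on Qwen2.5-7B-Instruct over SMAD informal/formal pairs). As background for that recipe only, the sketch below shows how such a run could be wired up with the Hugging Face transformers, peft, and datasets libraries; every hyperparameter, the file name smad_agda_train.jsonl, and the prompt format are illustrative assumptions, not values taken from the thesis.

```python
# Minimal, illustrative sketch of LoRA fine-tuning for autoformalization.
# All hyperparameters, paths, and the prompt format are assumptions for
# demonstration only, not the configuration used in the thesis.
from datasets import load_dataset
from peft import LoraConfig, get_peft_model
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          Trainer, TrainingArguments)

MODEL = "Qwen/Qwen2.5-7B-Instruct"

tokenizer = AutoTokenizer.from_pretrained(MODEL)
model = AutoModelForCausalLM.from_pretrained(MODEL)

# Wrap the base model with low-rank adapters; only these small matrices
# are trained, while the 7B base weights stay frozen.
lora = LoraConfig(r=16, lora_alpha=32, lora_dropout=0.05,
                  target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
                  task_type="CAUSAL_LM")
model = get_peft_model(model, lora)

# Hypothetical JSONL file with "informal" (English statement) and
# "formal" (Agda code) fields, one pair per line.
data = load_dataset("json", data_files="smad_agda_train.jsonl")["train"]

def to_features(example):
    # Simple prompt -> completion format; the thesis prompt may differ.
    text = (f"Translate to Agda:\n{example['informal']}\n### Agda:\n"
            f"{example['formal']}{tokenizer.eos_token}")
    enc = tokenizer(text, truncation=True, max_length=1024)
    enc["labels"] = enc["input_ids"].copy()
    return enc

data = data.map(to_features, remove_columns=data.column_names)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="qwen-agda-lora",
                           per_device_train_batch_size=1,
                           num_train_epochs=1,
                           learning_rate=2e-4),
    train_dataset=data,
)
trainer.train()
```

With batch size 1 and no padding the default collator suffices; a larger batch would additionally need a padding-aware data collator.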