MinT: Boosting Generalization in Mathematical Reasoning via Multi-View Fine-Tuning

Liang, Zhenwen; Yu, Dian; Pan, Xiaoman; Yao, Wenlin; Zeng, Qingkai; Zhang, Xiangliang; Yu, Dong

Computer Science > Artificial Intelligence

arXiv:2307.07951v1 (cs)

[Submitted on 16 Jul 2023]

Title:MinT: Boosting Generalization in Mathematical Reasoning via Multi-View Fine-Tuning

Authors:Zhenwen Liang, Dian Yu, Xiaoman Pan, Wenlin Yao, Qingkai Zeng, Xiangliang Zhang, Dong Yu

View PDF

Abstract:Reasoning in mathematical domains remains a significant challenge for relatively small language models (LMs). Many current methods focus on specializing LMs in mathematical reasoning and rely heavily on knowledge distillation from powerful but inefficient large LMs (LLMs). In this work, we explore a new direction that avoids over-reliance on LLM teachers, introducing a multi-view fine-tuning method that efficiently exploits existing mathematical problem datasets with diverse annotation styles. Our approach uniquely considers the various annotation formats as different "views" and leverages them in training the model. By postpending distinct instructions to input questions, models can learn to generate solutions in diverse formats in a flexible manner. Experimental results show that our strategy enables a LLaMA-7B model to outperform prior approaches that utilize knowledge distillation, as well as carefully established baselines. Additionally, the proposed method grants the models promising generalization ability across various views and datasets, and the capability to learn from inaccurate or incomplete noisy data. We hope our multi-view training paradigm could inspire future studies in other machine reasoning domains.

Subjects:	Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2307.07951 [cs.AI]
	(or arXiv:2307.07951v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2307.07951

Submission history

From: Zhenwen Liang [view email]
[v1] Sun, 16 Jul 2023 05:41:53 UTC (148 KB)

Computer Science > Artificial Intelligence

Title:MinT: Boosting Generalization in Mathematical Reasoning via Multi-View Fine-Tuning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:MinT: Boosting Generalization in Mathematical Reasoning via Multi-View Fine-Tuning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators