An Empirical Study on Challenging Math Problem Solving with GPT-4

Yiran Wu; Feiran Jia; Shaokun Zhang; Hangyu Li; Erkang (Eric) Zhu; Yue Wang; Yin Tat Lee; Richard Peng; Qingyun Wu; Chi Wang

An Empirical Study on Challenging Math Problem Solving with GPT-4

Yiran Wu ,
Feiran Jia ,
Shaokun Zhang ,
Hangyu Li ,
Erkang (Eric) Zhu ,
Yue Wang ,
Yin Tat Lee ,
Richard Peng ,
Qingyun Wu ,
Chi Wang

June 2023

arXiv

Download BibTex

Employing Large Language Models (LLMs) to address mathematical problems is an intriguing research endeavor, considering the abundance of math problems expressed in natural language across numerous science and engineering fields. While several prior works have investigated solving elementary mathematics using LLMs, this work explores the frontier of using GPT-4 for solving more complex and challenging math problems. We evaluate various ways of using GPT-4. Some of them are adapted from existing work, and one is MathChat, a conversational problem-solving framework newly proposed in this work. We perform the evaluation on difficult high school competition problems from the MATH dataset, which shows the advantage of the proposed conversational approach.

Publication Downloads

FLAML: A Fast Library for AutoML and Tuning

December 15, 2020

FLAML is a Python library designed to automatically produce accurate machine learning models with low computational cost. It frees users from selecting learners and hyperparameters for each learner. FLAML is powered by a new, cost-effective hyperparameter optimization and learner selection method invented by Microsoft Research.

Download Data