AI as a Tool for Mathematics, Computer Science, and Machine Learning

About

Why This Workshop

Modern AI systems can accelerate real research, but using them effectively remains nontrivial. This workshop will develop a community resource of workflows to drive progress at the AI–Math/CS/ML interface — especially in machine learning, optimization, statistics, algorithms, and adjacent areas.

While some researchers focus on how AI may be able to work on research independently, others are eager to share how AI is able to help humans in their research. However, it is often difficult to extract and reproduce the specific, frequently complex, workflows and “tricks” used by these researchers. As the degree of usefulness of generative AI depends greatly on the workflow, many scientists who currently only use AI for basic tasks, can substantially benefit from access to these methods. Our workshop aims to help researchers at ICML – especially in machine learning, optimization, statistics, algorithms, and adjacent areas – to more effectively integrate AI tools in their core research workflow. The workshop will cover:

AI-Assisted Workflows

Iterative verification loops, failure modes and how to detect them, prompting patterns that improve correctness, decomposition and self-critique, multi-agent strategies, and when to switch from informal to formal reasoning.

Tool-Augmented Reasoning

Integrating LLMs with computation (code, symbolic algebra, numerics), literature navigation, and proof assistants (e.g., Lean) to reduce hallucinations.

Research Acceleration

Using AI for derivations, counterexample search, and experiment design — with an emphasis on methods that transfer across subfields.

Invited Speakers

Sergei Gukov

Caltech · American Institute of Mathematics

Sergei Gukov is the John D. MacArthur Professor of Theoretical Physics and Mathematics at Caltech, Director of the Merkin Center for Pure and Applied Mathematics, and Consulting Director of the American Institute of Mathematics. His research interests include mathematics and machine learning, quantum topology, gauge theory, and knot and 3-manifold invariants.

Website

Remy Degenne

University of Lille · Inria

Remy Degenne is a tenured researcher in the Scool team at the Inria centre at the University of Lille. He works on sequential machine learning, especially bandit theory, and is interested in online and reinforcement learning, statistics, and optimization. He is also a maintainer of Mathlib for the Lean theorem prover.

Website

Damek Davis

University of Pennsylvania

Damek Davis is an Associate Professor in Wharton's Department of Statistics and Data Science. His research interests are optimization and machine learning, and he also works on AI for mathematics. He is currently an associate editor at Mathematical Programming and Foundations of Computational Mathematics.

Website

Diyi Yang

Stanford University

Diyi Yang is an Assistant Professor in the Computer Science Department at Stanford University, affiliated with the Stanford NLP Group, Stanford HCI Group, SAIL, and Stanford HAI. Her research focuses on socially aware NLP, large language models, and human-AI interaction, with an emphasis on human-centered AI systems.

Website

Niao He

ETH Zurich

Niao He is an Associate Professor in the Department of Computer Science at ETH Zurich and leads the Optimization & Decision Intelligence Group. Her research lies at the interface of optimization and machine learning, with a focus on algorithmic and theoretical foundations for principled, scalable, and trustworthy decision intelligence.

Website

Mehtaab Sawhney

Columbia University / OpenAI

Mehtaab Sawhney is a Clay Research Fellow and a tenure-track assistant professor at Columbia University. His research interests are broadly within combinatorics, probability, analytic number theory, and theoretical computer science.

Website

Debaters

Confirmed Debaters

Niloofar Mireshghallah

CMU / humans&

Niloofar Mireshghallah is a Member of Technical Staff at humans&. Beginning Fall 2026, she will join Carnegie Mellon University's Engineering & Public Policy Department and Language Technologies Institute as an Assistant Professor. Her research spans privacy, natural language processing, AI for science, LLM reasoning, and the societal implications of machine learning.

Website

Ludwig Schmidt

Stanford University

Ludwig Schmidt is an Assistant Professor in Stanford University's Computer Science Department and Stanford Data Science. His research focuses on the foundations of machine learning, often with an emphasis on datasets, multimodality, reliable generalization, and language models. He is also a member of technical staff at Anthropic and LAION.

Website

Schedule

Full-Day Program

All times are local time in Seoul, South Korea (KST (UTC+9))

08:20–08:30

Opening remarks

08:30–09:05

Talk: Sergei Gukov

Caltech · American Institute of Mathematics

09:05–09:40

Talk: Remy Degenne

University of Lille · Inria

09:40–10:00

Coffee break

10:00–10:35

Talk: Damek Davis

University of Pennsylvania

10:35–11:10

Talk: Niao He

ETH Zurich

11:10–11:50

Get lunch

11:50–13:15

Poster session (bring your own lunch)

13:15–13:50

Talk: Diyi Yang

Stanford University

13:50–14:25

Oral presentations and Q&A

4 orals, 7 minutes each, followed by 7 minutes of Q&A

14:25–15:00

Talk: Mehtaab Sawhney

Columbia University / OpenAI

15:00–15:20

Coffee break

15:20–15:30

Short break / buffer

15:30–16:30

Structured Debate: Niloofar Mireshghallah vs. Ludwig Schmidt

16:30–17:00

Post-debate perspectives

Structured Debate

Motion and Format

The workshop will feature a structured debate on the motion:

"Even with superhuman research AI, society will continue to financially support / pay humans doing research"

The structured debate follows a format with strict timing:

Opening speeches of 4 minutes each, presenting the main arguments
Crossfire of 2 minutes with alternating questions (10 sec) and answers (20 sec)
Rebuttal and new arguments with 3 minutes each
Second crossfire round of 2 minutes
Another round of rebuttal with 3 minutes each
Closing speech summarizing the structured debate, 3 minutes each

After the structured debate, questions from the audience will follow, moderated by the organizers.

Call for Papers

Contribute Your Work

Submissions are now closed.

We welcome submissions that highlight workflows using AI for machine learning, math, and computer science research more generally. Your contribution should illustrate—in an accessible way for a non-expert—how a simple workflow has proven to be useful in solving a cognitive research task (e.g., time-saving, energy-saving, result-strengthening, etc.).

The workflow should be reproducible by ML researchers within a few hours and with academic-level financial resources. Therefore, the workflow should either involve simple prompting-based strategies, or more sophisticated strategies where the submitters provide a package/agent/repository that can be readily integrated into a chat interface or an API call. All accepted submissions will be made publicly accessible, creating a shared repository of AI-assisted research workflows.

We also welcome submissions that illustrate interesting failure modes, to improve the community's understanding of the limitations of AI assistance.

We focus on the following tasks where AI can help, and challenges associated with AI-assisted research:

AI-assisted research problem formulation
AI-assisted experiment design for ML research
Solving mathematical research problems with AI assistance
Formalization and verification workflows for research, e.g., formalizing proofs in Lean or other proof assistants, especially as it is relevant for machine learning
Automation of iterative loops
Other tasks that are integral to an AI-assisted research workflow

The focus is on how researchers can become more productive, more rigorous, and do better research with the help of AI, and less on autonomous AI research, which is the focus of another exciting workshop: AI4Math.

In order to maintain focus, our workshop will not consider tasks that are not research-centered, such as simple writing, basic literature search (aka “deep research”), slide and poster creation, and pure software engineering (e.g., installing packages, version compatibility, etc.). There are already many resources available for such tasks.

What to include in your submission:

Explain the cognitive task that appeared in your research, for which AI has either significantly saved your time, or improved the results that you could obtain otherwise. Estimate the time-savings or performance improvements.
Describe the AI workflow you used in a way that is reproducible and usable by other researchers in a few hours.
- This may include specific prompting techniques, code to call the API (once or repeatedly), a detailed description of how you set up the agentic frameworks, etc.
- If your workflow relies on web-based prompting, share the exact prompt and ideally the exact transcripts of your conversation.
- If your workflow relies on an API-based interaction and/or agentic research, provide a link to a repository with your code.
- If applicable, discuss failure modes and what you learned from them.
Ensure that submissions are as close as possible to the working workflow itself. We aim to convert as many submissions as possible into workflows to test their performance. Make sure that the workflow is reproducible, including a README explaining how to install packages, and include any code that is not publicly available.
Explain how you verified the correctness of the results.

The contributions will be evaluated according to accessibility, reproducibility and correctness.

Reciprocal reviewer. Each submission nominates one author as a reciprocal reviewer. We anticipate around 3 papers per reviewer, with the review window running between the submission and notification dates below. A track record of peer-reviewed publications at major ML/AI venues (ICML, NeurIPS, ICLR, ACL, etc.) or comparable journals is preferred, and responsible reviewing is an important consideration.

Submission Format

Main paper: up to 4 pages, excluding references; supplementary or appendix material may be included after the references in the same submitted PDF (e.g., detailed walkthroughs, screenshots, and conversation transcripts).
Must follow ICML 2026 format
Indicate preferred presentation: computer demo or poster
Submission portal closed

Policies

Non-archival: papers published at other venues are welcome
Double-blind review: submissions must be anonymized
Accepted papers will be presented as demos or posters; select papers may be invited for contributed talks
Review criteria: accessibility, reproducibility and correctness

Important Dates

Submission deadline: May 13, 2026
Notification of acceptance: May 31, 2026
Workshop date: July 10th, 2026

All deadlines are Anywhere on Earth (AoE)

Submissions closed

Venue & Attendance

Logistics

Location

COEX Convention & Exhibition Center
Seoul, South Korea

Part of ICML 2026 (July 6–11, 2026)

Workshop Dates

July 10th, 2026
(ICML 2026 workshop days)

Full-day, in-person

Registration

Workshop attendance requires ICML 2026 registration.

A workshop-only registration is sufficient. Please register through the main conference website.

Organizers

Workshop Committee

Mikhail Belkin

UC San Diego

HDSI Endowed Chair Professor in Artificial Intelligence at UC San Diego. His research spans artificial intelligence, machine learning, and high-dimensional statistics.

Website

Sébastien Bubeck

OpenAI

Member of Technical Staff at OpenAI. His work includes large language models, convex optimization, online algorithms, and adversarial robustness.

Website

Edgar Dobriban

University of Pennsylvania

Associate Professor of Statistics and Computer & Information Science at Penn. His research sits at the interface of statistics, machine learning, and AI.

Website

Dmitriy Drusvyatskiy

UC San Diego

Professor at the Halicioglu Data Science Institute at UC San Diego. His research interests are in optimization, high-dimensional statistics, machine learning, and AI.

Website

Ravi Vakil

Stanford University

Robert Grimmett Professor of Mathematics at Stanford University and President of the American Mathematical Society. He works in algebraic geometry.

Website

Fanny Yang

ETH Zürich

Assistant Professor of Computer Science at ETH Zurich. She works on high-dimensional and robust machine learning.

Website

Volunteers

Get Involved

Call for volunteers: our workshop aims to serve the needs of the community, and be community-driven. If you are interested in volunteering, please contact us. Sign up to help review contributions, build a platform to share workflows and make them easily accessible and searchable, or support the workshop in other ways.

Interested in volunteering? Sign up here.

Federico Di Gennaro

ETH Zürich

Federico Di Gennaro is a PhD student at ETH Zürich, advised by Prof. Fanny Yang. His research interests include statistical learning and trustworthy ML.

Website

Sunay Joshi

University of Pennsylvania

Sunay Joshi is a PhD student at the University of Pennsylvania, advised by Prof. Edgar Dobriban and Prof. Hamed Hassani. His research interests include conformal prediction and uncertainty quantification for AI.

Erik Wang

Stanford University

Erik Wang is a PhD student at Stanford University, working with Prof. Tengyu Ma. His research interests include language model reasoning, self-improvement, and scientific discovery.

Qingsong Wang

UC San Diego

Qingsong Wang is a postdoctoral researcher at UC San Diego, working with Prof. Mikhail Belkin and Prof. Yusu Wang. His research interests include diffusion and flow-matching generative models, data geometry, and representation learning.

Website

Tao Wang

University of Pennsylvania

Tao Wang is a PhD student in Statistics and Data Science at the University of Pennsylvania, advised by Prof. Edgar Dobriban. His research interests include uncertainty quantification, optimal transport, and LLM post-training.

Scholar