Optimizing generative AI by backpropagating language model feedback

Mert Yuksekgonul et al. Nature. 2025 Mar;639(8055):609-616.
doi: 10.1038/s41586-025-08661-4. Epub 2025 Mar 19.

Abstract

Recent breakthroughs in artificial intelligence (AI) are increasingly driven by systems orchestrating multiple large language models (LLMs) and other specialized tools, such as search engines and simulators. So far, these systems are primarily handcrafted by domain experts and tweaked through heuristics rather than being automatically optimized, presenting a substantial challenge to accelerating progress. The development of artificial neural networks faced a similar challenge until backpropagation and automatic differentiation transformed the field by making optimization turnkey. Analogously, here we introduce TextGrad, a versatile framework that performs optimization by backpropagating LLM-generated feedback to improve AI systems. By leveraging natural language feedback to critique and suggest improvements to any part of a system, from prompts to outputs such as molecules or treatment plans, TextGrad enables the automatic optimization of generative AI systems across diverse tasks. We demonstrate TextGrad's generality and effectiveness through studies in solving PhD-level science problems, optimizing plans for radiotherapy treatments, designing molecules with specific properties, coding, and optimizing agentic systems. TextGrad empowers scientists and engineers to easily develop impactful generative AI systems.
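
The backpropagation analogy can be made concrete with a short sketch of a single "textual gradient" step. The Python below is an illustrative sketch only, not the authors' implementation: llm_call and textual_gradient_step are hypothetical names introduced here, and llm_call stands in for any chat-completion API. The idea it demonstrates is the one described in the abstract: run the system with its current prompt, ask an LLM to critique the output (the critique plays the role of a gradient), and apply that critique to update the prompt (the parameter being tuned).

# Minimal sketch of one "textual gradient" step, assuming a generic LLM API.
# `llm_call` is a hypothetical placeholder, not part of the published framework.

def llm_call(prompt: str) -> str:
    """Hypothetical helper: send `prompt` to a chat model and return its reply."""
    raise NotImplementedError("Connect this to an LLM API of your choice.")


def textual_gradient_step(system_prompt: str, question: str, reference: str) -> str:
    """Return an improved system prompt after one feedback/update cycle."""
    # Forward pass: answer the question with the current prompt.
    answer = llm_call(f"{system_prompt}\n\nQuestion: {question}")

    # Backward pass: natural-language feedback on the output serves as the gradient.
    feedback = llm_call(
        "Given the question, a reference answer, and the model's answer below, "
        "explain concretely how the system prompt that produced the model's "
        "answer should change so the answer improves.\n"
        f"Question: {question}\nReference: {reference}\nModel answer: {answer}"
    )

    # Update step: rewrite the prompt in the direction suggested by the feedback.
    return llm_call(
        f"Current system prompt:\n{system_prompt}\n\nFeedback:\n{feedback}\n\n"
        "Rewrite the system prompt so it addresses the feedback. "
        "Return only the new prompt."
    )

Iterating this step over a batch of examples, and propagating feedback through every LLM call and tool in a multi-step pipeline rather than a single prompt, is the textual analogue of gradient descent with backpropagation that the paper generalizes.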

Conflict of interest statement

Competing interests: The authors declare no competing interests.
