The Supervised Learning Phase (Critique and Revision)
Self-Refine: Iterative Refinement with Self-Feedback, Aman Madaan, Niket Tandon, Prakhar Gupta, Skyler Hallinan, Luyu Gao, Sarah Wiegreffe, Uri Alon, Nouha Dziri, Shrimai Prabhumoye, Yiming Yang, Shashank Gupta, Bodhisattwa Prasad Majumder, Katherine Hermann, Sean Welleck, Amir Yazdanbakhsh, Peter Clark, 2023. arXiv preprint arXiv:2303.17651. DOI: 10.48550/arXiv.2303.17651 - This paper presents a general framework for iterative self-correction in LLMs, which provides a broader conceptual understanding of the critique and revision loop used in Constitutional AI.