Training language models to follow instructions with human feedback

Authors: Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright et al.

Year: 2022

Computer scienceLanguage modelSet (abstract data type)Simple (philosophy)Reinforcement learning

4,260

Citations

2022

Published

Authors