Training language models to follow instructions with human feedback

Authors: Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright et al.

Year: 2022

Computer scienceLanguage modelSet (abstract data type)Simple (philosophy)Reinforcement learning

4,260
Citations
2022
Published
10
Authors
Read PDFDOI