Description

[2307.04964] Secrets of RLHF in Large Language Models Part I: PPO

community

  • @jonas.kaiser
  • @dblp