research engineer at openai, currently on the post-training team. previously on the reinforcement learning team.
i’ve worked on the finetuning api, webgpt, chatgpt, chatgpt with browsing, gpt-4
research engineer at openai, currently on the post-training team. previously on the reinforcement learning team.
i’ve worked on the finetuning api, webgpt, chatgpt, chatgpt with browsing, gpt-4