ChatGPT is a language model trained to follow instructions in a prompt and provide a detailed response. It is available for free during the research preview. [1] It is a sibling model to InstructGPT, which is also trained to follow instructions and provide a response. [1] ChatGPT is being released to get user feedback and learn about its strengths and weaknesses. [1]

AFAIK they fed it tons of data and then used reinforcement learning with human feedback (RLHF). They explain some of it here:
You were using the GPT-3.5 Playground the entire time. ChatGPT is this:
Today’s research release of ChatGPT is the latest step in OpenAI’s iterative deployment of increasingly safe and useful AI systems. Many lessons from deployment of earlier models like GPT-3 and…
ChatGPT is fine-tuned from GPT-3.5, a language model trained to produce text. ChatGPT was optimized for dialogue by using Reinforcement Learning with Human Feedback (RLHF) – a method that uses…
GPT-3. Our GPT-3 models can understand and generate natural language. We offer four main models with different levels of power suitable for different tasks. Davinci is the most capable model,…
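
For anyone wondering what the "human feedback" part of RLHF can look like in practice, here is a rough sketch. It assumes the pairwise-comparison setup described publicly for InstructGPT, where a reward model scores two candidate responses and is trained so that the one a human labeler preferred scores higher. None of this comes from the snippets above, and the function name and the example scores are made up for illustration; it is not OpenAI's actual code.

```python
import math


def preference_loss(reward_chosen: float, reward_rejected: float) -> float:
    """Pairwise preference loss: -log(sigmoid(reward_chosen - reward_rejected)).

    Hypothetical helper for illustration only. The loss is small when the
    response the labeler preferred gets the higher reward score.
    """
    margin = reward_chosen - reward_rejected
    # -log(sigmoid(margin)) equals softplus(-margin); this form avoids overflow
    # for large positive or negative margins.
    return max(-margin, 0.0) + math.log1p(math.exp(-abs(margin)))


if __name__ == "__main__":
    # Made-up scores: the reward model agrees with the labeler in the first
    # call and disagrees in the second, so the second loss is larger.
    print(preference_loss(2.0, 0.0))
    print(preference_loss(0.0, 2.0))
```

In a real pipeline the scores would come from a neural reward model and the loss would be averaged over many labeled comparisons; the trained reward model is then used as the signal for the reinforcement learning step.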
