DEV Community

Cover image for DeepSeek vs ChatGPT: which one is the best model?
Cloud Native Engineer
Cloud Native Engineer

Posted on • Originally published at seangoedecke.com

DeepSeek vs ChatGPT: which one is the best model?

DeepSeek-R1 is primed to disrupt the entire AI industry.

Look at what happened to the stock market!

But how does it differ from OpenAI's model?

  • OpenAI: generates chains-of-thought data using a normal model and fine-tunes it for reasoning
  • DeepSeek: uses reinforcement learning to train its model for reasoning without generating large amounts of data

Benefits:

  • DeepSeek's approach can reason better than the original model as it generates new chains of thought

Limitations:

  • DeepSeek's approach is restricted to chains-of-thought that can be verified mechanistically, mainly useful for coding and mathematics

Transfer Learning in Reasoning Models:

  • There is ongoing research to see if reasoning models trained on one domain can be effective in other domains

Read more at: https://www.seangoedecke.com/deepseek-r1/?utm_source=tldrnewsletter

Top comments (0)