Claude outperforms in coding and agentic tasks. I asked about LLM as a chat model. It’s still in benchmarks at the top, and still the most popular one, by far.
Even with Claude, the difference isn’t big and it’s the only one that managed to surpass it in benchmarks, so… still - at the bottom? You sure about that?
At the bottom? Is there a single LLM that has surpassed ChatGPT?
EDIT: I missed Gemini 3.1 pro
Still, saying “Everyone else catched up and it’s at the bottom now” is seriously out of touch thing to say
Claude
Claude outperforms in coding and agentic tasks. I asked about LLM as a chat model. It’s still in benchmarks at the top, and still the most popular one, by far.
Even with Claude, the difference isn’t big and it’s the only one that managed to surpass it in benchmarks, so… still - at the bottom? You sure about that?
The context was about coding specifically tho
I was directly questioning the “at the bottom” phrase, not the entire context. Context matters, or something
That being said:
Source