• REDACTED@infosec.pub
    10 hours ago

    At the bottom? Is there a single LLM that has surpassed ChatGPT?

    EDIT: I missed Gemini 3.1 Pro.

    Still, saying “Everyone else caught up and it’s at the bottom now” is a seriously out-of-touch thing to say.

      • REDACTED@infosec.pub
        12 hours ago

        Claude outperforms it in coding and agentic tasks. I asked about LLMs as chat models. ChatGPT is still at the top of the benchmarks, and still the most popular one by far.

        Even with Claude, the difference isn’t big, and it’s the only one that has managed to surpass ChatGPT in benchmarks, so… still, at the bottom? Are you sure about that?