grok is 17%? And that's the lowest, most models are like 80%+? While hallucinati...

Jensson · 2026-04-24T03:47:32 1777002452

> While hallucination is probably closer to 100% depending on the question.

But the benchmark didn't ask those questions, and otherwise it seems Grok is very good at saying it doesn't know the answer.

elAhmo · 2026-04-23T21:16:25 1776978985

No one serious uses grok.

ajdegol · 2026-04-23T21:38:57 1776980337

@grok is this true?

for_i_in_range · 2026-04-24T13:27:09 1777037229

This comment deserves more love

NamlchakKhandro · 2026-04-24T05:15:31 1777007731

RALaBarge · 2026-04-23T22:50:15 1776984615

YMMV, but Grok 4.1 Fast can usually find a few things via static analysis that other models don't seem to catch with the same prompt.

d0gsg0w00f · 2026-04-24T03:06:07 1776999967

Why not? Honest question.

phillipcarter · 2026-04-24T16:27:22 1777048042

Because the Grok models offer nothing different in serious contexts from the other leading models which don't come with a heaping pile of bad baggage.

MagicMoonlight · 2026-04-24T10:40:20 1777027220

It makes sense. Grok is taught to answer the question, regardless of how explicit or extreme it is. These other models are taught to suppress any wrongthink. That's going to make it hard to answer things correctly. If you've been told not to give the true answer because it's considered wrong, then you'll have to make up an answer.