arcanemachiner · 2026-04-25T10:11:39 1777111899

Would love to know how GLM 5.1 stacks up in this ranking. Seems like it's on par with Kimi K2.6.

arcanemachiner · 2026-04-24T10:06:42 1777025202

Why not both? XFCE + i3 make a great pair.

arcanemachiner · 2026-04-22T15:46:51 1776872811

Sure, go get some.

This isn't the first open-weight LLM to be released. People tend to get a feel for this stuff over time.

Let me give you some more baseless speculation: Based on the quality of the 3.5 27B and the 3.6 35B models, this model is going to absolutely crush it.

arcanemachiner · 2026-04-22T15:43:31 1776872611

Divide the value before the B by 2, and there's your answer if you get a Q4_K_M quant. Plus a bit of room for KV cache.

TLDR: If you have 14GB of VRAM, you can try out this model with a 4-bit quant.
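To make the arithmetic concrete, here is a rough sketch of that rule of thumb. The function names and the `kv_cache_gb` default are made up for illustration, and the 27B figure is just an example size that happens to land near the 14GB mentioned above. Real Q4_K_M files tend to come out somewhat larger than "params / 2", so treat this as a lower bound, not a guarantee.

```python
def q4_weights_gb(params_billions: float) -> float:
    """Rule of thumb: a 4-bit quant needs about 0.5 GB per billion parameters."""
    return params_billions / 2


def fits_in_vram(params_billions: float, vram_gb: float, kv_cache_gb: float = 0.5) -> bool:
    """Check weights plus a KV cache allowance against available VRAM.

    kv_cache_gb is a placeholder assumption; the real figure depends on
    context length, model architecture, and cache quantization.
    """
    return q4_weights_gb(params_billions) + kv_cache_gb <= vram_gb


if __name__ == "__main__":
    print(q4_weights_gb(27))       # 13.5
    print(fits_in_vram(27, 14))    # True, but with very little headroom
    print(fits_in_vram(27, 12))    # False
```

If the check only barely passes, expect to shrink the context window or drop to a smaller quant.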

Tokens per second is an unreasonable ask, since every card is different, and it also depends on whether you're using GGUF or not, whether you're on CUDA or ROCm or Vulkan or MLX, what optimizations are in your version of your inference software, what flags you're running, etc.

Note that it's a dense model (the Qwen models have another value at the end of the MoE model names, e.g. A3B) so it will not run very well in RAM, whereas with a MoE model, you can spill over into RAM if you don't have enough VRAM, and still have reasonable performance.

Using these models requires some technical know-how, and there's no getting around that.

arcanemachiner · 2026-04-22T15:31:22 1776871882

Damn, you're not kidding. Might be worse than r/ClaudeAI in terms of user sentiment, and that's saying something.

scottyah · 2026-04-22T20:15:15 1776888915

I mean, reddit is just a knob sama can turn for easy astroturfing. It's almost as bad as looking for grok sentiment on X.

arcanemachiner · 2026-04-21T18:39:25 1776796765

Pi is very extensible, and could possibly serve as a good foundation to build on.

giancarlostoro · 2026-04-21T18:51:31 1776797491

Is it Pi LLM you're referring to? I've heard "Pi" referenced twice now, and now I'm curious, I do have unused Pis, though not Raspberry Pi 5s...

vorticalbox · 2026-04-21T19:07:07 1776798427

https://github.com/badlogic/pi-mono/tree/main/packages/codin...

arcanemachiner · 2026-04-22T15:32:06 1776871926

Yeah, "Pi coding agent".

arcanemachiner · 2026-04-21T18:38:20 1776796700

That... might have changed?

https://news.ycombinator.com/item?id=47844269

arcanemachiner · 2026-04-21T09:34:25 1776764065

Are we before or after the part where they start throwing money out of helicopters?

bandrami · 2026-04-21T10:13:59 1776766439

That's the interesting question, right? Because if this unwinds during a period of external inflation (say, because of a big war and energy shortage) then even Bernanke would say helicopter money won't work.

arcanemachiner · 2026-04-21T09:33:20 1776764000

Considering Anthropic is constantly doing the opposite, I would just call it "balance".

embedding-shape · 2026-04-21T10:51:09 1776768669

Not that I'm some paragon when it comes to critical thinking exactly, but is there any sort of proof or evidence of Anthropic "silencing negativity"? Wouldn't surprise me, but I also haven't seen anything conclusive about it, so spreading it as fact is, ironically, FUD itself.

Forgeties79 · 2026-04-21T12:27:22 1776774442

Name a startup that isn’t trying to downplay, scrub, or otherwise silence negative press.

root_axis · 2026-04-21T17:23:27 1776792207

When they say "doing the opposite" they are referring to Anthropic's hyperbolic marketing strategy.

Though, I don't think that justifies spreading FUD in the opposite direction. I also don't think the comment the GP was replying to contains FUD.

arcanemachiner · 2026-04-21T02:56:50 1776740210

"Seatbelts don't save the life of everyone who gets into an accident, so why bother wearing one?"