He does have a point about fees. It's not really surprising that a fee structure designed for chatbots doesn't make sense when applied to long-running tasks and agents. But that's a problem an increase in prices can solve.
Doubtless some people will reduce usage as a result. But Ed seems to find the idea that a 10-person developer team might spend 80K a year on tokens ridiculous. I don't understand this. Has he seen how much developers are paid? If you get a 20% productivity boost from coding agents, then that's two developers' worth of output for 80K - very good value.
Where things could go wrong is in comparison to cheaper models. If it's 5K a year for Qwen, and it's 2/3 as good, will you pay 75K extra for Opus? Perhaps not.
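To make the arithmetic concrete, here's a rough back-of-envelope sketch in Python. The $150K fully loaded developer cost is my own assumption, not a figure from the article:

    # Back-of-envelope ROI check. The developer cost is an assumed,
    # illustrative figure; adjust it for your own market.
    team_size = 10
    dev_cost = 150_000      # assumed fully loaded cost per developer per year
    boost = 0.20            # the claimed productivity boost
    token_bill = 80_000     # the figure from the article

    value_of_boost = team_size * dev_cost * boost
    print(f"boost worth ~${value_of_boost:,.0f}/yr vs ${token_bill:,} in tokens")
    # boost worth ~$300,000/yr vs $80,000 in tokens -> comfortably positive ROI

    # The cheaper-model question: Qwen at $5K/yr delivering 2/3 of the boost.
    qwen_value = value_of_boost * 2 / 3
    extra_value = value_of_boost - qwen_value   # what Opus adds over Qwen
    extra_cost = token_bill - 5_000
    print(f"Opus adds ~${extra_value:,.0f} of value for ${extra_cost:,} extra")
    # ~$100,000 of extra value for $75,000 extra spend: a much closer call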
I think that team is better off with a junior developer. This alleged “20% productivity boost”, even if it exists, is individual. At the team level, it will be largely offset by people having to review 20% more code.
Obviously, in some cases a junior developer is a better investment, if it's a straight-up choice.
Actually, I think it'll be rare for a manager to be choosing between a junior developer and a coding assistant, since each is going to benefit the team in very different ways, and it'll often be obvious which you need.
What I mean is that at the price levels in the article, the coding agent still has a realistic chance of positive ROI. People will pay for things with positive ROI.
The problem is that LLM cost is more or less the same for generating a fixed amount of code, or it will converge to that soon. But developer costs vary wildly based on seniority and geographical location. Sure, some Silicon Valley architect will always be more expensive than any LLM bill he incurs. But a mid-tier dev at an outsourcing shop, or a cheap local shop overseas, using the same LLM for the same tasks at the same token costs? Eh, it can go either way, really.
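The flip point the parent is gesturing at is easy to compute. A minimal sketch, reusing the (assumed) figures from upthread:

    # Break-even: the token bill is roughly fixed, but the value of a 20%
    # boost scales with salary. All figures are illustrative assumptions.
    token_bill_per_dev = 8_000   # $80K spread across a 10-person team
    boost = 0.20

    break_even_cost = token_bill_per_dev / boost
    print(f"break-even fully loaded cost: ${break_even_cost:,.0f}/yr")
    # $40,000/yr: above that, the tokens pay for themselves; below it
    # (plausible for a cheap overseas shop), the same bill is a net loss.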
Well, we have cool projects like CollapseOS, but the problem is that there is so much undocumented silicon out there that can't be used without massive effort. I know several "gold scrappers", and it's such a shame that they trash great classic chips just to get back a bit of metal. So much effort went into making those chips, and it's a shame that many can't be reused. While a lack of cheap electricity prevents open designs from being reused, there is an even bigger world of undocumented chips being trashed as well.
He's producing semiconductors with a 1000 nm (one micron) feature size. This kind of tech was cutting-edge in the mid-80s. You might be able to produce a 32KB memory chip with it.
It would be difficult to break into the RAM business with that sort of product, as most of the demand these days is for higher capacities.
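For what it's worth, the 32KB figure roughly checks out under a common rule of thumb. Everything in this sketch (the cell-size constant, die area, and array fraction) is an assumption, not data about his process:

    # Rough capacity estimate for SRAM at a 1-micron node, using the rule
    # of thumb that a 6T SRAM cell occupies on the order of ~120 F^2.
    F = 1.0                   # feature size in microns (1000 nm)
    cell_area = 120 * F**2    # ~120 um^2 per bit (assumed constant)
    die_area = 50e6           # assumed ~50 mm^2 die, expressed in um^2
    array_fraction = 0.6      # assumed share of the die used by the cell array

    bits = die_area * array_fraction / cell_area
    print(f"~{bits / 8 / 1024:.0f} KB")   # ~31 KB, in line with the 32KB claim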
I don't see the OP implying that anyone should trust the government. He's simply stating that it's expected the NSA would ignore the supply-chain-risk designation, and that it's unexpected that we'd find out about it. If anything, the comment seems to imply a lack of trust in government.
Try setting up one laundry which charges by the hour and washes clothes really, really slowly, and another which washes clothes at normal speed, at cost plus some margin similar to your competitors'.
The one which maximizes ROI will not be the one you rigged to cost more and take longer.
Directionally, tokens are not equivalent to "time spent processing your query", but rather a measure of the effort/resources expended to process it.
So a more germane analogy would be:
What if you set up a laundry which charges you based on the amount of laundry detergent used to clean your clothes?
Sounds fair.
But then, what if the top engineers at the laundry offered an "auto-dispenser" that uses extremely advanced algorithms to apply just the right optimal amount of detergent for each wash?
Sounds like value-added for the customer.
... but now you end up with a system where the laundry's management team has strong incentives to influence how liberally the auto-dispenser will "spend" to give you "best results".
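The analogy maps directly onto per-token billing. A toy sketch (all numbers invented) of why revenue scales with a knob the seller controls:

    # Toy model: with per-token pricing, revenue per request scales linearly
    # with how "liberally" the provider-tuned system spends tokens.
    price_per_mtok = 15.0   # assumed $ per million output tokens
    tokens_terse = 2_000    # tokens a terse, optimal answer would take
    verbosity = 3.0         # the "auto-dispenser" knob, set by the seller

    revenue = price_per_mtok * tokens_terse * verbosity / 1e6
    print(f"revenue per request: ${revenue:.3f}")   # triples with the knob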
Wow, that is terrible. In my memory GPT-2 was more interesting than that. I remember thinking it could pass a Turing test, but that output is barely better than a Markov chain's.
The article is about two models, with 2B and 4B parameters respectively. Both are dense models. The 2B version will certainly use less power than qwen3-coder-next.
The models are quite good. They aren't just a tech demo.