Hacker Newsnew | past | comments | ask | show | jobs | submit | ting0's commentslogin

It's not clear to me how or why this works, and how it compares to just using md files in my project. For something like this, we really need benchmarks.

Hey man, if it works it works. There's a reason everyone is creating AI tools. We're all buying them. I'm still waiting for someone to make a world-class cli harness that can replace Claude Code but solves the memory and design problem. Web design is still a nightmare with LLMs.

Could you elaborate on the web design point? I find them excellent at it personally and it’s where I most often get value out of them

thanks for saying this. one more reason people are hating on SaaS so much is that the UI is dry and for many, unusable. on the other hand, the cool AI agents have UI which is fun but again, unusable.

fun and usable can co-exist and we are trying the best to prove that. also, we have an amazing designer who never worked at big tech and has no accolades, but man got taste.


Cline. Works as a CLI and VSCode plugin.

It's terrible, don't waste your time like I did. That bench means nothing by the way.

> It’s my experience that models that perform very well on benchmarks do not necessarily perform well in real life

Well, yeah... Like Opus 4.5, 4.6, 4.7. Top of the benchmarks and yet it's a pile of crap at the moment and has been for months.


This is nonsense. The real reason is because the US companies are scamming the public, as per usual.

They don't make sense, they're a lie that these AI companies keep spamming using bots so that useful idiots perpetuate it, so that they can keep draining us of money. Straight out of the Anthropic handbook. They've always been cheap to run. I wouldn't be surprised if Anthropic is running for <$1 for 1M/tok.

Well this sucks. It's funny that Blizzard with its vast empire of wealth can't even compete with TurtleWoW.

Don't waste your time. I've been down this road and unless you've got business connections or a LOT of marketing money, it is not worth attempting.

No, but the AI labs would love to frame it this way so they can continue to nerf models and increase prices while they use the cheap, highly performant, highly powerful models internally to replace all of your businesses.

Sure is looking that way. What can't Claude do at this point?

I'm an AI engineer with a computer science and some actual AI background. I am trying to make Claude good motivation letters for applying to jobs. It currently scores a 6 out of 10. I'm much better still. And it has access to all the relevant parts of my psychology degree and data about writing good motivation letters.

All I can say is: the motivation letters don't look like they're written by AI anymore.


List is massive. Anything novel? It'll fuck up without extreme handholding. Anything for which the components arent solved public published problems? It'll fuck it up.

Basically, claude can solve issues for you where it requires the implementation of existing code or a combination of existing patterns, but novel it cannot do.


> What can't Claude do at this point?

Writing maintainable code that scales.


The Twitter example is a bad one. Elon Musk, at the time, was making hundreds-of-millions through crypto-market manipulation on Twitter. At that time he realized that having control over the entire Twitter platform would unlock many billions of dollars worth of profit opportunity. Attention is the most valuable and powerful currency in this world. Not only for manipulating markets, but also for political propaganda. The information we consume literally shapes the world. So yes, it was a 4D chess move.

Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: