Even before this release the tools (for me: Claude Code and Gemini for other stu...

bgirard · 2025-12-17T17:05:25 1765991125

Why wouldn't you switch? The cost to switch is near zero for me. Some tools have built in model selectors. Direct CLI/IDE plug-ins practically the same UI.

azuanrb · 2025-12-17T17:34:00 1765992840

Not OP, but I feel the same way. Cost is just one of the factor. I'm used to Claude Code UX, my CLAUDE.md works well with my workflow too. Unless there's any significant improvement, changing to new models every few months is going to hurt me more.

bgirard · 2025-12-17T18:10:18 1765995018

I used to think this way. But I moved to AGENTS.md. Now I use the different UI as a mental context separation. Codex is working on Feature A, Gemini on feature B, Claude on Feature C. It has become a feature.

rolisz · 2025-12-17T20:19:09 1766002749

You're assuming that different models need the same stuff in AGENTS.md

In my experience, to get the best performance out of different models, they need slightly different prompting.

NamlchakKhandro · 2025-12-17T23:39:33 1766014773

just switch to Opencode and stop locking yourself into a particular providers way of doing things.

There's a plugin for everything that mimics anything the others are doing

azuanrb · 2025-12-18T08:14:39 1766045679

Being open does not magically make everything better. People are willing to pay for Claude Code for many valid reasons. You are also assuming I have never used OpenCode, which is incorrect. Claude is simply my preference.

I see all of these tools as IDEs. Whether someone locks into VS Code, JetBrains, Neovim, or Sublime Text comes down to personal preference. Everyone works differently, and that is completely fine.

NamlchakKhandro · 2025-12-26T09:56:39 1766742999

I use claude on opencode.

I'm not sure you even understand what opencode is.

Gasp0de · 2025-12-18T17:07:44 1766077664

Does that mean that you also don't switch to newer Anthropic models? Because they would change similarly, wouldn't they?

nevir · 2025-12-17T19:52:31 1766001151

I think a big part of the switching cost is the cost of learning a different model's nuances. Having good intuition for what works/doesn't, how to write effective prompts, etc.

Maybe someday future models will all behave similarly given the same prompt, but we're not quite there yet

NamlchakKhandro · 2025-12-17T23:38:23 1766014703

Because some people are restricted by company policy to only use providers with which they have a legally binding agreement to not use their chats as training data.

theLiminator · 2025-12-17T16:52:56 1765990376

For me, the last wave of models finally started delivering on their agentic coding promises.

orourke · 2025-12-17T17:33:21 1765992801

This has been my experience exactly. Even over just the last few weeks I’ve noticed a dramatic drop in having to undo what the agents have done.

nprateem · 2025-12-17T17:00:30 1765990830

But for me the previous models were routinely wrong time wasters that overall added no speed increase taking the lottery of whether they'd be correct into account.

catigula · 2025-12-17T17:42:58 1765993378

Correct. Opus 4.5 'solved' software engineering. What more do I need? Businesses need uncapped intelligence, and that is a very high bar. Individuals often don't.

gaigalas · 2025-12-17T18:21:02 1765995662

If Opus is one-size-fits-all, then why Claude keeps the other series? (rethorical).

Opus and Sonnet are slower than Haiku. For lots of less sophisticated tasks, you benefit from the speed.

All vendors do this. You need smaller models that you can rapid-fire for lots of other reasons than vibe coding.

Personally, I actually use more smaller models than the sophisticated ones. Lots of small automations.

dimitri-vs · 2025-12-18T02:06:25 1766023585

Yes, all the major CLIs (Claude Code, Codex, etc) and many agentic applications use a large model main agent with task delegation to small model sub-agent. For example in CC using Opus4.5 it will delegate an Explore task to a Haiku/Sonnet subagent or multiple subagents.

gaigalas · 2025-12-18T08:34:11 1766046851

The agent interfaces are for human interaction. Some tasks can be fully unattended though. For those, I find smaller models more capable due to their speed.

Think beyond interfaces. I'm talking about rapid-firing hundreds of small agents and having zero human interaction with them. The feedback is deterministic (non agentic) and automated too.

esperent · 2025-12-18T14:57:06 1766069826

> What more do I need?

Much cheaper price and much faster token generation.

At least, that's what I need. I stopped using Anthropic because for their $20 a month offering, I get rate limited constantly, but for Gemini $20/month I've never even once hit a limit.

calflegal · 2025-12-17T17:09:01 1765991341

I asked a similar question yesterday:

https://news.ycombinator.com/item?id=46290797

alex1138 · 2025-12-17T18:52:08 1765997528

I just can't stop thinking though about the vulnerability of training data

You say good enough. Great, but what if I as a malicious person were to just make a bunch of internet pages containing things that are blatantly wrong, to trick LLMs?

calflegal · 2025-12-17T18:58:48 1765997928

The internet has already tried this, for about a few decades. The garbage is in the corpus; it gets weighted as such

floundy · 2025-12-18T02:09:24 1766023764

>a bunch of internet pages containing things that are blatantly wrong

So Reddit?

I’d imagine the AI companies have all the “pre AI internet” data they scraped very carefully catalogued.