The cost seems deceptively low right now because those AI companies are fighting for a monopoly, but in reality the cost is huge – not only in capital, but also in trust, privacy, and environmental terms.
If the concern is inference cost: we do have open-weight models that are getting more powerful, and hardware that can run small-ish models cheaply. I run agents using small local models on my MacBook.
Aider (CLI) and continue.dev (VS Code plugin) can both run against a local (or local-network) Ollama. The qwen-coder models are pretty good and getting better; qwen3-coder is in the ballpark of Sonnet 3.5 for code synthesis, albeit slower on my hardware.
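For reference, pointing Aider at a local Ollama instance looks roughly like this (the model tag and port are assumptions – use `ollama list` to see what you actually have pulled):

```shell
# Assumes Ollama is serving on its default port (11434) and that a
# qwen coder model has already been pulled, e.g.: ollama pull qwen2.5-coder:32b
export OLLAMA_API_BASE=http://127.0.0.1:11434

# Aider addresses Ollama-hosted models with the ollama_chat/ prefix.
aider --model ollama_chat/qwen2.5-coder:32b
```

For a machine elsewhere on your LAN, point OLLAMA_API_BASE at that host instead of 127.0.0.1.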
For comparable quality you need a relatively big model, which only works if you have around 64GB of RAM or more. The latest OpenAI local models (https://openai.com/index/introducing-gpt-oss/), for example, are really good, but you probably want the 120b variant to get results anywhere near their best cloud models, and that requires, I think, 80GB+. If you don't have that much, you can try the DeepSeek models, which are known for being ultra-efficient and runnable on "normal" computers, if you don't mind the politics of using them (and there are many similar models now!), but I haven't tried enough of the others to comment.
On my MacBook M1 Pro I can run the gpt-oss-20b model without issues, and quite fast.
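If you want to try it yourself, Ollama makes this a two-liner (the model tag below is an assumption – check the Ollama library for the exact name):

```shell
# Pull the ~20B open-weight model and give it a one-off prompt.
# On a 16GB machine this may swap; 32GB+ is comfortable.
ollama pull gpt-oss:20b
ollama run gpt-oss:20b "Write a quicksort in Python."
```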
I had pretty mixed experiences with the 20B version of GPT-OSS: sometimes it would just start looping in the thinking block, and for certain questions no sampler parameters seemed to make any difference.
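For anyone wanting to experiment with those sampler parameters, Ollama exposes them in the `options` field of its generate API. A sketch (the parameter values are guesses to try against looping, not known-good settings, and it assumes a local server with the model pulled):

```shell
# repeat_penalty discourages repeated tokens; num_predict caps output
# length so a looping thinking block can't run forever.
curl -s http://127.0.0.1:11434/api/generate -d '{
  "model": "gpt-oss:20b",
  "prompt": "Explain the borrow checker in one paragraph.",
  "stream": false,
  "options": {
    "temperature": 0.7,
    "top_p": 0.9,
    "repeat_penalty": 1.2,
    "num_predict": 2048
  }
}'
```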
That said, Qwen3 and Qwen3 Coder are both pretty nice. Also ERNIE 4.5, if the benchmarks are to be trusted, but I mostly run Ollama instead of vLLM now, so I can't test it at the moment (apparently llama.cpp added support for it recently, though).
The models by Mistral might also be worth a look, and I personally thought the EuroLLM project was nice too, but MoE models feel far more palatable on limited hardware.
Neither seems able to compete directly with Sonnet 4 or Gemini 2.5 Pro; you'd need far better hardware to come close.
Hmm, well. So I need a 64GB MBP to run the AI tools, and another machine (likely running Linux) to run the system under development, since we're going all local. Well, doable.
Not sure why parent is being downvoted here. Even without getting into whether it's possible for technology to be apolitical, many AI companies have explicitly political goals.
For example, OpenAI's charter is "to ensure that artificial general intelligence benefits all of humanity". They go on to list more specific political goals downstream from that: https://openai.com/charter/
I care far more about the noise and air pollution that x.ai is causing in Memphis (ruining lives) than the environmental impact of the industry as a whole.
6% YoY growth in domestic electricity demand is frankly nothing compared to the capacity that developing economies are building out for things other than AI.
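To put that growth rate in perspective, 6% year over year compounds to roughly 1.8x demand over a decade – a quick back-of-the-envelope check:

```shell
# 6% YoY growth compounded over 10 years: 1.06^10
awk 'BEGIN { printf "%.2f\n", 1.06^10 }'
```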