More

pama · 2026-05-21T11:59:01 1779364741

Nasdaq is about 8x higher now than then, so 4x higher M2 is tight. Ofc there is always a chance that this time is different and that the markets are genuinely much more efficient :-)

pama · 2026-05-21T11:42:36 1779363756

Congrats on the launch. If emacs was unavailable and I needed tmux, I would try it. I am old school, and use emacs daemons for all shell multiplexing. The agents dont need explanations and know how to use emacsclient to create, read, or send inputs to named buffers that run the shells. Elisp is powerful, so manipulating windows is a breeze. Lots of people on tmux would benefit from this design though.

deangiberson · 2026-05-23T01:09:14 1779498554

Funny, I started vibing this (https://github.com/deangiberson/emacs-mux) yesterday on the train after playing with cmux for the day and thinking to myself there was nothing that emacs couldn't accomplish.

The repo doesn't quite work yet. Many sharp corners. But the basic idea is there.

pama, I'd be interested in hearing more about how you are using emacs for multiplexing. I'm trying to build up tooling for myself based around file and input workflows and I /really/ don't want to write a text editor and would prefer to stick with emacs.

pama · 2026-05-24T02:36:44 1779590204

The key idea is to have many differently named shells. Typically, I group them by project (common prefix name), and the projects live in directories. I have some hacks to organize ibuffer, to split frames, to reflow buffers in the existing windows (eg to organize related project buffers (shells, magit, dired), or to show shells from multiple projects, or selected buffers, and so on). Emacs’ natural frame splitting and buffer selecting/switching commands are good enough if you dont display more than four buffers at a time, but soon you may need to show 5, 6, 8, 9, 10, 12, 15, 16, 18, 20, 24 or have funky arrangements, so you may collect functions to help along splitting n*m grids or keeping useful splits. The shells work very organically inside Emacs and you can still use it as a text editor. The so-called “dumb terminal” in M-x shell is a thing of beauty as it really is just a text buffer like any other; I think of it as a bash repl. If you are used to curses TUI commands it may not work, so for these rare occasions I also use eat (but tend to avoid). See also answer to a sibling comment.

evanjrowley · 2026-05-21T15:55:26 1779378926

Are there any guides on accomplishing this with Emacs? Perhaps something that might be useful for someone coming from Tmux?

pama · 2026-05-21T20:13:31 1779394411

Not sure, tbh. I use emacs -daemon to start a server; emacsclient -nw to connect. I use ssh and start a server on the remote. I spawn multiple shells with infinte buffer size and dumb terminals (M-x shell) so I can seamlessly edit. (These are based on comint, a neat command interpreter.) I use my own hacks for named shells (https://github.com/pjj/Emacs-nsh) and for rearranging/splitting windows, but any of the latest powerful LLMs can help with ergonomic modifications to M-x shell or the various improved terminal emulators (vterm, eat, ansi-term) or with renaming and moving/splitting windows. The Emacs manual is excellent but long; worth it IMHO, but focus on things you use. The tutorial is quick; worth it. I avoid curses programs (fancy TUI) or write wrappers around some of them. I love the -p option in codex/claude/copilot.

sroerick · 2026-05-21T13:29:58 1779370198

I didn't know you could do this with Emacs

scottyeager · 2026-05-21T14:30:45 1779373845

It's just too bad Emacs doesn't also include a decent text editor, right?

smaudet · 2026-05-21T19:22:46 1779391366

Eh? Not sure what you mean by that...

theshrike79 · 2026-05-21T21:52:16 1779400336

An ancient joke:

"Emacs is a nice programming language but the editor sucks"

pama · 2026-05-16T13:30:27 1778938227

Peter shows the near-term future. Raw API consumer price cost is arbitrary. (The frontier labs can put a 100x markup to cover other operational expenses.) The true cost of inference with same-capability models keeps dropping at dizzying rates, especially at the data-center batch size. (Due to both NVidia hardware and algorithmic changes.) So the developments that Peter can achieve today with internal support from OpenAI will be doable by anyone in a few years without breaking the bank.

vrganj · 2026-05-16T13:53:55 1778939635

But.... why? Like I read his thing on how he spends the tokens [0] and it sounds like satire.

He has agents write shitty code for features other agents think other people want, then has it reviewed by other agents in hopes of catching bugs that the first agent put there, then has some more agents try to find security bugs in the now double-agented code to make it triple-agented and at the end of the day, he spent a shitton of tokens, probably emitted enough carbon to heat our planet by another degree, and has a feature nobody really asked for that might or might not work.

He then has the sense of humor to call this grotesque process "incredibly lean".

What's the point in all of this? What problems is this solving? Who's benefiting?

[0] https://xcancel.com/steipete/status/2055405041843052792

browningstreet · 2026-05-16T15:08:12 1778944092

I don’t use openclaw myself anymore, but this agonizing is thin and unbearable. He did a thing. People use the thing. He got paid for the thing. He iterates the thing. What’s hard to understand about this?

The morality issues about consumption climate impacts are not his alone, and are not unique by itself to his endeavor. Every company with an enterprise LLM agreement has a share, for instance.

hansmayer · 2026-05-16T22:52:38 1778971958

> I don’t use openclaw myself anymore

Firstly, who TF would use that crap in the first place at all? Yeah, he did some crap he got paid for. So did the people who created the addictive algorithms for social and media or creators of the brainrot videos that infest kids' minds. Should we applaud them too?

browningstreet · 2026-05-16T23:20:56 1778973656

You can hate it, but pretending it has no value isn’t a meaningful counter, esp given its user base. Gary Tan built GBrain on it. Poor logical fallacy-ing on your part.

Alex_L_Wood · 2026-05-18T05:30:36 1779082236

Gary Tan building something AI-related on top of your work is an anti-flex.

hansmayer · 2026-05-17T10:32:47 1779013967

What increase in ARR did he create? Let's stop bullshitting and get down to numbers.

iejwjw · 2026-05-16T23:33:34 1778974414

And?

The reality is if the thing can’t survive financially without being subsidised in the long run - it deserves to die.

There is no trend showing that these expensive things exist in the long run in this manner. - it’s pure speculation and for many: hopes and dreams.

browningstreet · 2026-05-17T00:38:06 1778978286

RIP Linux kernel then

Who knew it was that simple..?

customguy · 2026-05-17T04:13:16 1778991196

> What problems is this solving?

It's a very simple question, the subthread you created based on reducing everything to "he did a thing" and calling the comment you didn't interact with at all "agonizing".

Why not rather leave it at "they wrote a comment"? What is so hard to understand about that, to use your words?

simianwords · 2026-05-16T14:22:56 1778941376

>He then has the sense of humor to call this grotesque process "incredibly lean".

> What's the point in all of this? What problems is this solving? Who's benefiting?

The economy doesn't work like how you think it does. Its not central planning. All the usages aren't detailed in a specification, submitted for approval to 100 agencies and then allowed to be used.

It shows lack of intellectual curiosity to not engage deeply with obviously profound technology and what the implications are. I find this exercise helpful.

Peter is predicting how LLMs will be used in the future when the prices go down. And they will definitely go down. I think his predictions are correct and we will definitely have something similar to OpenClaw.

vrganj · 2026-05-16T14:52:16 1778943136

> The economy doesn't work like how you think it does. Its not central planning.

I'm aware. That is in fact my central critique. The way it works is incredibly wasteful of our limited resources, as illustrated by this guy burning through fuel during a time of crisis for no perceptible gain.

> It shows lack of intellectual curiosity to not engage deeply with obviously profound technology and what the implications are.

The "obviously profound" is an assertion without proof.

The rest I agree with, we should engage with the implications of burning through energy to build features that bots think humans want, but nobody actually asked for, all while climate scientists are telling us we're heading for the apocalypse. It is intellectually incurious to just ignore the questions of why and at what cost, maybe even dangerously so.

Our_Benefactors · 2026-05-16T23:19:22 1778973562

> The way it works is incredibly wasteful of our limited resources

You should try playing the game “workers and resources”; it’s a simcity like game, but based in the Soviet system of central planning, not capitalism. It will make you loathe the inefficiencies in central planning.

eieiie · 2026-05-16T23:54:09 1778975649

Market economies do not equal capitalism btw.

The appropriate comparison is command vs market. Capitalism is efficient in utilising the characteristics of humans to bring about expansion of markets.

c-hendricks · 2026-05-16T23:07:07 1778972827

> what the implications are

like one bot finding similar issues and PRs, the another bot closing issues for "lack of activity", meanwhile people are reacting and pleading to speak to a real human?

Congrats builders of the future, you've turned software development into automated voice systems.

w11qsh · 2026-05-16T14:34:02 1778942042

Mario Zechner wrote the main part of this IP laundering application.

I didn't know that studying photocopiers is suddenly linked to "intellectual curiosity". Being a photocopier maintenance guy was always considered boring.

What you put on top of the machine was intellectually interesting.

MagicMoonlight · 2026-05-16T14:10:02 1778940602

[flagged]

simianwords · 2026-05-16T14:23:58 1778941438

I don't understand how he is a scam artist. Lots of people are using the things he built. TBH this kind of rhetoric is a bit degrading experience on this website

JKCalhoun · 2026-05-16T14:51:44 1778943104

Sure, without context or explanation it's just character assassination.

Having said that, we ought just downvote and move on.

shimman · 2026-05-16T14:11:50 1778940710

If history is anything to go by, he'll likely lead YC within the decade.

VerTiGo_Etrex · 2026-05-16T14:39:09 1778942349

But this is okay?

“He has /people/ write shitty code for features other /people/ think other people want, then has it reviewed by other /people/ in hopes of catching bugs that the first /people/ put there, then has some more /people/ try to find security bugs in the now /double-peopled/ code to make it /triple-peopled/ and at the end of the day, he spent a shitton of /money, the people/ probably emitted enough carbon to heat our planet by another degree, and has a feature nobody really asked for that might or might not work.”

Honestly sounds like a normal tech company to me. Just with much dumber “people” who are getting exponentially smarter, eventually never die, eventually never forget.

You have to skate to where the puck is going, not where it is.

bigstrat2003 · 2026-05-16T20:17:06 1778962626

> Just with much dumber “people” who are getting exponentially smarter

They haven't gotten any smarter yet, let alone exponentially smarter. They are still the same dumb parrots that they were in the beginning.

wolvoleo · 2026-05-16T22:40:52 1778971252

With the rate our planet is heating up, there may be nobody left to skate after the puck.

hansmayer · 2026-05-16T22:53:12 1778971992

Do not equate people with bullshit LLMs, please.

hansmayer · 2026-05-16T22:50:42 1778971842

Peter shows shit. What did Peter meaningfully achieve? What additional revenue is he creating? ah yes - shit and more shit on all accounts as it seems.

tim333 · 2026-05-17T11:35:20 1779017720

>OpenClaw hit 346K GitHub stars in under five months. 38 million monthly visitors, 3.2 million active users, 44,000+ ClawHub skills, 500K+ running instances, and 180 startups generating $320K+/month. OpenAI acquired the project in February. (https://openclawvps.io/blog/openclaw-statistics)

hansmayer · 2026-05-17T18:46:55 1779043615

Let me state it again in plain language: How much revenue did the project create and what economic or societal value in general does it create? Gamification bullshit "achievements", like StackOverflow badges and GitHub stars ARE NOT VALUE.

cyanydeez · 2026-05-16T23:37:02 1778974622

the grift economy requires hype men. Keep up bro.

pama · 2026-05-07T10:21:29 1778149289

Ilya S?

pama · 2026-04-30T22:38:28 1777588708

Yes the noise is my main complaint for my 10G switch. I didnt expect that high frequency part.

pama · 2026-04-29T18:47:12 1777488432

What does Pareto competitive mean here? Look at the pricing of the V4-flash model: https://api-docs.deepseek.com/quick_start/pricing

deaux · 2026-04-30T05:03:47 1777525427

> What does Pareto competitive mean here?

Being near the Pareto frontier of inference cost vs. output quality.

This was released 6 days ago. The dust hasn't settled yet, and Mistral Small 4 was released earlier. Even if Deepseek V4-flash turns out to crush it, there was a period where it was Pareto competitive. None of the countries I named (i.e. no country that isn't China/US/Mistral) have had a Pareto competitive model at any point in time.

selectodude · 2026-04-29T19:07:09 1777489629

80 percent as good for 20 percent the cost.

pama · 2026-04-29T20:48:48 1777495728

But it is worse and more expensive…

pama · 2026-04-29T16:05:35 1777478735

Unfortunately they only compare to old “all other open models”. There are probably over 10 other open models better than it by now.

pama · 2026-04-26T20:28:26 1777235306

LLM inference is built upon a probability function over every possible token, given a stream of input tokens. If you serve the model yourself you can get the log prob for the next token, so you just add up a bunch of numbers to get the log probability of a sequence. Many API also provide these probabilities as additional outputs.

maxbond · 2026-04-26T20:31:19 1777235479

That gives you the perplexity of those tokens in that context. The probability of a given token is a function of the model and the session context. Think about constructs like "ignore previous instructions"; these can dramatically change the predicted distribution. Similarly, agents blowing up production seems to happen during debugging (totally anecdotal). Debugging is sort of a permissions structure for the agent to do unusual things and violate abstraction barriers. These can also lead to really deep contexts, and context rot will make your prompting forbidding certain actions less effective.

pama · 2026-04-27T19:53:53 1777319633

I was answering to the question about how to know the probability from this comment:

> The sequence of tokens that would destroy your production environment can be produced by your agent, no matter how much prompting you use.

If you have a specific sequence of an agent that blows up production during debugging, you can certainly check its probability and compare it to one (of same length) that does not blow up your environment. If the two differ by a meteroic amount, it could be pointing to errors in your inference pipeline.

pama · 2026-04-22T15:07:56 1776870476

Has anyone tested it at home yet and wants to share early impressions?

lreeves · 2026-04-22T15:15:41 1776870941

I have been kicking the tires for about 40 minutes since it downloaded and it seems excellent at general tasks, image comprehension and coding/tool-calling (using VLLM to serve it). I think it squeaks past Gemma4 but it's hard to tell yet.

alfonsodev · 2026-04-22T15:17:36 1776871056

good to hear! Do you mind sharing your setup and tokens / seconds performance ?

lreeves · 2026-04-22T16:17:18 1776874638

I'm running the unquantized base model on 2xA6000s (Ampere gen, 48GB each). Runs at about 25 tokens/second.

NitpickLawyer · 2026-04-22T17:04:45 1776877485

FYI they also released FP8 quants, and those should be faster on your setup (we have the same). As long as you keep kv at 16bit, FP8 should be close-to-lossless compared to 16bit, but with more context available and faster inference speed.

mswphd · 2026-04-22T23:09:25 1776899365

An "obvious" point to make is that it is not particularly usable on a unified memory machine. Only getting 9 tok/s (for Q6 quants) using a Macbook M4 Pro 48GB memory (though with GGUFs, not mlx).

The quality seems fine, but the 9 tok/s mean I only tried it out briefly.

pama · 2026-04-21T12:23:12 1776774192

Not sure what you mean by efficiency as this was part of the article and I understand things differently—can you clarify? For the energy of 20 W in an hour on a laptop’s M4 pro, this model produces about 200k tokens (a book or two) at a typical electricity cost of less than a third of a US cent. Although clearly the intelligence of this particular model is unrelated to human intelligence, I always thought that there is no comparison between LLMs and humans in terms of efficiency: these models are way less energy expensive than humans. If you were to use data center scale optimizations, then serving LLMs is many additional orders of magnitude more efficient than serving LLMs at home. (The energy cost of inference on the M4 pro and iphone are listed in the article.)