News
Would anybody here be interested in a "mistake postmortem" discussion group? " Less Wrong
1+ hour, 23+ min ago (232+ words) I recently made a dumb (in retrospect) mistake that set me back a lot. Feeling upset and regretful, I spoke to an older family member who reassured me, "yeah, unfortunately there's no way around it; we have to experience these…...
Thoughts on Likelihood of Existential Risks by Misaligned AIs " Less Wrong
14+ hour, 33+ min ago (304+ words) The implication of this is that it is very hard to have one concrete AI risk argument I can read and respond to. It is difficult to form opinions on AI safety when most experts are in great disagreement about…...
How I think developers of frontier AI systems and regulators ought to act in the face of existential AI risk " Less Wrong
15+ hour, 4+ min ago (1088+ words) In a recent podcast episode published July 20, 2025, Anthropic co-founder Ben Mann is asked (at 48: 43) "What are the odds that we align AI correctly and actually solve this problem?" In his answer, Ben references the following part of Anthropic's March 8, 2023 blog…...
Why should AI be moral? " Less Wrong
14+ hour, 42+ min ago (1067+ words) In outline, the moral skeptic's challenge goes: To respond, one must either refute the skeptical hypothesis or identify an extra-moral reason to accept morality. Without a response, one's acceptance of morality is unjustified. This position threatens to be reflectively destabilizing…...
World-modeling the US vs. Anthropic Standoff on Claude Fable " Less Wrong
17+ hour, 22+ min ago (872+ words) I spent the last two days doing a deep dive in forecasting outcomes of the US forcing Anthropic to take down Claude Fable. I did this for two reasons...
AI Safety Ecosystem Research notes " Less Wrong
19+ hour, 5+ min ago (463+ words) These are some personal notes taken and later dressed up a bit to make into a post. Dunno how much value is here for people already familiar with the AI Safety Ecosystem. I believe MATS will be publishing the results…...
A brief list of ways AI safety efforts could be net negative " Less Wrong
21+ hour, 14+ min ago (245+ words) I'm not aware of a good list of downside risks for AI safety broadly[1], so I decided to make one. This is not intended to be fully comprehensive, these are just the ones that I personally take seriously[2][3]: (This list…...
Online >> real life for spreading ideas " Less Wrong
21+ hour, 42+ min ago (1001+ words) I believe that internet culture influences real culture much more than the other way around. This is quite hard to prove, but I often see an idea start to come up in real-life conversations where I've seen it appear on…...
Typical Minds Aren't " Less Wrong
22+ hour, 15+ min ago (308+ words) We all know the typical mind fallacy'the bias where we assume that other people's minds are much like our own. It happens because most of our evidence for what minds are like comes from experiencing what our own mind is…...
Adversarial Proposal Design in Asset Futarchy " Less Wrong
1+ day, 1+ hour ago (652+ words) Asset futarchy is hardest to attack when conditional prices stay tightly coupled to a proposal's real causal effect on ASSET value. The proposal strategies below work by loosening that coupling. A proposer promises value-creating work, but treats delivery as the…...