Search Results


lesswrong.com > posts > J8L3HoPYAj52x9gCA > would-anybody-here-be-interested-in-a-mistake-postmortem

Would anybody here be interested in a "mistake postmortem" discussion group? - LessWrong

1+ hour, 23+ min ago (232+ words) I recently made a dumb (in retrospect) mistake that set me back a lot. Feeling upset and regretful, I spoke to an older family member who reassured me, "yeah, unfortunately there's no way around it; we have to experience these…

lesswrong.com > posts > ECZjz2g5tKpuHnYxu > thoughts-on-likelihood-of-existential-risks-by-misaligned

Thoughts on Likelihood of Existential Risks by Misaligned AIs - LessWrong

14+ hour, 33+ min ago (304+ words) The implication of this is that it is very hard to have one concrete AI risk argument I can read and respond to. It is difficult to form opinions on AI safety when most experts are in great disagreement about…


lesswrong.com > posts > qJJyDLrEcKkAf7bnC > how-i-think-developers-of-frontier-ai-systems-and-regulators

How I think developers of frontier AI systems and regulators ought to act in the face of existential AI risk - LessWrong

15+ hour, 4+ min ago (1088+ words) In a recent podcast episode published July 20, 2025, Anthropic co-founder Ben Mann is asked (at 48:43) "What are the odds that we align AI correctly and actually solve this problem?" In his answer, Ben references the following part of Anthropic's March 8, 2023 blog…


lesswrong.com > posts > dyyEo6nYcYr9XNXLC > why-should-ai-be-moral

Why should AI be moral? - LessWrong

14+ hour, 42+ min ago (1067+ words) In outline, the moral skeptic's challenge goes: To respond, one must either refute the skeptical hypothesis or identify an extra-moral reason to accept morality. Without a response, one's acceptance of morality is unjustified. This position threatens to be reflectively destabilizing…


lesswrong.com > posts > zhRe3tdBpsZbGCdDK > world-modeling-the-us-vs-anthropic-standoff-on-claude-fable

World-modeling the US vs. Anthropic Standoff on Claude Fable - LessWrong

17+ hour, 22+ min ago (872+ words) I spent the last two days doing a deep dive in forecasting outcomes of the US forcing Anthropic to take down Claude Fable. I did this for two reasons...


lesswrong.com > posts > wWX9ecM5Q7TycpKyX > ai-safety-ecosystem-research-notes

AI Safety Ecosystem Research notes - LessWrong

19+ hour, 5+ min ago (463+ words) These are some personal notes taken and later dressed up a bit to make into a post. Dunno how much value is here for people already familiar with the AI Safety Ecosystem. I believe MATS will be publishing the results…


lesswrong.com > posts > sAfMCpWLfkHqF5Gix > a-brief-list-of-ways-ai-safety-efforts-could-be-net-negative

A brief list of ways AI safety efforts could be net negative - LessWrong

21+ hour, 14+ min ago (245+ words) I'm not aware of a good list of downside risks for AI safety broadly[1], so I decided to make one. This is not intended to be fully comprehensive, these are just the ones that I personally take seriously[2][3]: (This list…


lesswrong.com > posts > Digfnw7yDo8vSDxZW > online-greater-than-greater-than-real-life-for-spreading

Online >> real life for spreading ideas - LessWrong

21+ hour, 42+ min ago (1001+ words) I believe that internet culture influences real culture much more than the other way around. This is quite hard to prove, but I often see an idea start to come up in real-life conversations where I've seen it appear on…


lesswrong.com > posts > GvHmvDaJ2CPoJpLjt > typical-minds-aren-t

Typical Minds Aren't - LessWrong

22+ hour, 15+ min ago (308+ words) We all know the typical mind fallacy: the bias where we assume that other people's minds are much like our own. It happens because most of our evidence for what minds are like comes from experiencing what our own mind is…


lesswrong.com > posts > oAKsuX5XpPxFSEoHM > adversarial-proposal-design-in-asset-futarchy

Adversarial Proposal Design in Asset Futarchy - LessWrong

1+ day, 1+ hour ago (652+ words) Asset futarchy is hardest to attack when conditional prices stay tightly coupled to a proposal's real causal effect on ASSET value. The proposal strategies below work by loosening that coupling. A proposer promises value-creating work, but treats delivery as the…
