Re Hanson's Grabby Aliens: Humanity is not a natural anthropic sample space — LessWrong
Published on December 9, 2024 6:07 PM GMT I, Lorec, am disoriented by neither the Fermi Paradox...
Zen and The Art of Semiconductor Manufacturing — LessWrong
Published on December 9, 2024 5:19 PM GMT I. BEGINNING In the beginning was the Sand. And in the...
A toy evaluation of inference code tampering — LessWrong
Published on December 9, 2024 5:43 PM GMT Work done with James Faina, Evan Hubinger and Ethan...
Childhood and Education Roundup #7 — LessWrong
Published on December 9, 2024 1:10 PM GMT Since it’s been so long, I’m splitting this roundup...
Refuting Searle’s wall, Putnam’s rock, and Johnson’s popcorn — LessWrong
Published on December 9, 2024 8:24 AM GMT In a recent essay, Euan McLean suggested that a cluster...
The first AGI may be a good engineer but bad strategist — LessWrong
Published on December 9, 2024 6:34 AM GMT AGI may have an advantage in engineering, but humans...
Keeping self-replicating nanobots in check — LessWrong
Published on December 9, 2024 5:25 AM GMT This is a random unimportant idea to prevent a...
Cognitive Processes — LessWrong
Published on December 9, 2024 5:10 AM GMT There is a cognitive process going on that can...
Subskills of "Listening to Wisdom" — LessWrong
Published on December 9, 2024 3:01 AM GMT A fool learns from their own mistakes. The wise learn...
Cognitive Work and AI Safety: A Thermodynamic Perspective — LessWrong
Published on December 8, 2024 9:42 PM GMT Introduces the idea of cognitive work as a parallel...
Intricacies of Feature Geometry in Large Language Models — LessWrong
Published on December 7, 2024 6:10 PM GMT Note: This is a more fleshed-out version of this...
The Way According To Zvi — LessWrong
Published on December 7, 2024 5:35 PM GMT Zvi Mowshowitz is an influential figure in the Rationalist...
Deep Learning is cheap Solomonoff induction? — LessWrong
Published on December 7, 2024 11:00 AM GMT Background Lucius: I recently held a small talk presenting an...
minifest — LessWrong
Published on December 7, 2024 3:50 AM GMT A cozy one-day festival celebrating prediction markets, blogging, economics,...
Mask and Respirator Intelligibility Comparison — LessWrong
Published on December 7, 2024 3:20 AM GMT One of the downsides of wearing a mask...
Purging Corrupted Capabilities across Language Models — LessWrong
Published on December 6, 2024 10:56 PM GMT by Narmeen Oozeer, Dhruv Nathawani, Nirmalendu Prakash, Amirali Abdullah This...
Gradient Routing: Masking Gradients to Localize Computation in Neural Networks — LessWrong
Published on December 6, 2024 10:19 PM GMT We present gradient routing, a way of controlling where...
Understanding Shapley Values with Venn Diagrams — LessWrong
Published on December 6, 2024 9:56 PM GMT
Model Integrity — LessWrong
Published on December 6, 2024 9:28 PM GMT Hi! My collaborators at the Meaning Alignment Institute put...
Can AI improve the current state of molecular simulation? — LessWrong
Published on December 6, 2024 8:22 PM GMT Hey LW! I recently filmed a two-hour long scientific...
Experiments are in the territory, results are in the map — LessWrong
Published on December 6, 2024 3:44 PM GMT I recently read Thomas Kuhn's book The Structure of...
A car journey with conservative evangelicals - Understanding some British political-religious beliefs — LessWrong
Published on December 6, 2024 11:22 AM GMT I’m heading home from a family wedding this weekend....
Frontier Models are Capable of In-context Scheming — LessWrong
Published on December 5, 2024 10:11 PM GMT This is a brief summary of what we believe...
Expevolu, a laissez-faire approach to country creation — LessWrong
Published on December 5, 2024 7:29 PM GMT I write this post to present expevolu[1], a system...
Should you be worried about H5N1? — LessWrong
Published on December 5, 2024 9:11 PM GMT Epistemic status: a few people without any particular expertise...
Are SAE features from the Base Model still meaningful to LLaVA? — LessWrong
Published on December 5, 2024 7:24 PM GMT Shan Chen, Jack Gallifant, Kuleen Sasse, Danielle Bitterman[1] Please read...
Are SAE features from the Base Model still meaningful to LLaVA? — LessWrong
Published on December 5, 2024 8:21 PM GMT Shan Chen, Jack Gallifant, Kuleen Sasse, Danielle Bitterman[1] Please read...
o1 tried to avoid being shut down — LessWrong
Published on December 5, 2024 7:52 PM GMT OpenAI released the o1 system card today, announcing that...
More Growth, Melancholy, and MindCraft @3QD [revised and updated] — LessWrong
Published on December 5, 2024 7:36 PM GMT This is cross-posted from New Savanna. I’ve got a new...
OpenAI o1 + ChatGPT Pro release — LessWrong
Published on December 5, 2024 7:13 PM GMT As AI becomes more advanced, it will solve...
Announcement: AI for Math Fund — LessWrong
Published on December 5, 2024 6:33 PM GMT Renaissance Philanthropy and XTX Markets today announced the launch...
Detection of Asymptomatically Spreading Pathogens — LessWrong
Published on December 5, 2024 6:20 PM GMT Cross-posted from my NAO Notebook. This is an...
Countdown — LessWrong
Published on December 5, 2024 5:49 PM GMT To the survivors, Earth-born and Zentradi alike, who chose...
Sam Harris’s Argument For Objective Morality — LessWrong
Published on December 5, 2024 10:19 AM GMT Apparently, the following is an argument made by Sam...
Model Integrity: MAI on Value Alignment — LessWrong
Published on December 5, 2024 5:11 PM GMT EVERYONE, CALM DOWN! Meaning Alignment Institute just dropped their first...
Why muscle tension can be unsexy — LessWrong
Published on December 5, 2024 4:11 PM GMT https://twitter.com/ChrisChipMonk/status/1864380405690061270 Why do we often experience feelings as in the...
Higher and lower pleasures — LessWrong
Published on December 5, 2024 1:13 PM GMT I used to think that talk about more sophisticated...
Morality as Cooperation Part III: Failure Modes — LessWrong
Published on December 5, 2024 9:39 AM GMT This is a Part III of a long essay....
Morality as Cooperation Part II: Theory and Experiment — LessWrong
Published on December 5, 2024 9:04 AM GMT This is a Part II of a long essay....
Morality as Cooperation Part I: Humans — LessWrong
Published on December 5, 2024 8:16 AM GMT Abstract: The AI alignment problem is usually specified in terms...
Orca communication project - seeking feedback (and collaborators) — LessWrong
Published on December 3, 2024 5:29 PM GMT TLDR: It is currently plausible (35%) to me that average...
Book a Time to Chat about Interp Research — LessWrong
Published on December 3, 2024 5:27 PM GMT In the spirit of the season, you can book...
Balsa Research 2024 Update — LessWrong
Published on December 3, 2024 12:30 PM GMT For our annual update on how Balsa is doing,...
First Solo Bus Ride — LessWrong
Published on December 3, 2024 12:20 PM GMT Our kids have been riding the bus since...
How to make evals for the AISI evals bounty — LessWrong
Published on December 3, 2024 10:44 AM GMT TLDR: Last weekend, I attended an AI evals hackathon organized...
Should there be just one western AGI project? — LessWrong
Published on December 3, 2024 10:11 AM GMT Tom did the original thinking; Rose helped with later...
Cognitive Biases Contributing to AI X-risk — a deleted excerpt from my 2018 ARCHES draft — LessWrong
Published on December 3, 2024 9:29 AM GMT Preface: Several friends have asked me about what psychological effects...
Chemical Turing Machines — LessWrong
Published on December 3, 2024 5:26 AM GMT Epistemic status: brief writeup of some interesting work I...
MIRI’s 2024 End-of-Year Update — LessWrong
Published on December 3, 2024 4:33 AM GMT MIRI is a nonprofit research organization with a mission...
Linkpost: Rat Traps by Sheon Han in Asterisk Mag — LessWrong
Published on December 3, 2024 3:22 AM GMT Subtitle: Does the rationalist blogosphere need to update?