Were there any ancient rationalists? — LessWrong
Published on May 3, 2024 6:26 PM GMTI've recently read some cool posts on rationality through...
Key takeaways from our EA and alignment research surveys — LessWrong
Published on May 3, 2024 6:10 PM GMTMany thanks to Spencer Greenberg, Lucius Caviola, Josh Lewis,...
"AI Safety for Fleshy Humans" an AI Safety explainer by Nicky Case — LessWrong
Published on May 3, 2024 6:10 PM GMTNicky Case, of "The Evolution of Trust" and "We...
Nicky Case AI Safety explainer [Crosspost] — LessWrong
Published on May 3, 2024 5:48 PM GMTNicky Case makes applets for explaining concepts. You may...
AI Safety for Fleshy Humans — LessWrong
Published on May 3, 2024 5:43 PM GMTThis is an accessible introduction to AI Safety, written...
LLM+Planners hybridisation for friendly AGI — LessWrong
Published on May 3, 2024 8:40 AM GMTEvery LLM in existence is a blackbox, and alignment...
AI Clarity: An Initial Research Agenda — LessWrong
Published on May 3, 2024 1:54 PM GMTThis is cross-posted from our website: https://www.convergenceanalysis.org/publications/ai-clarity-an-initial-research-agendaExecutive Summary Transformative AI...
Apply to ESPR & PAIR, Rationality and AI Camps for Ages 16-21 — LessWrong
Published on May 3, 2024 12:36 PM GMTTLDR – Apply now to ESPR and PAIR. ESPR welcomes students...
On precise out-of-context steering — LessWrong
Published on May 3, 2024 9:41 AM GMTMeta: I'm writing this in the spirit of sharing...
Mechanistic Interpretability Workshop Happening at ICML 2024! — LessWrong
Published on May 3, 2024 1:18 AM GMTAnnouncing the first academic Mechanistic Interpretability workshop, held at...
[Linkpost] Silver Bulletin: For most people, politics is about fitting in — LessWrong
Published on May 1, 2024 6:12 PM GMTNate Silver tries to answer the question: "How do...
Shane Legg's necessary properties for every AGI Safety plan — LessWrong
Published on May 1, 2024 5:15 PM GMTI've been going through the FAR AI videos from...
KAN: Kolmogorov-Arnold Networks — LessWrong
Published on May 1, 2024 4:50 PM GMTAbstract:Inspired by the Kolmogorov-Arnold representation theorem, we propose Kolmogorov-Arnold...
Manifund Q1 Retro: Learnings from impact certs — LessWrong
Published on May 1, 2024 4:48 PM GMTDiscuss
Take SCIFs, it’s dangerous to go alone — LessWrong
Published on May 1, 2024 8:02 AM GMTCoauthored by Dmitrii Volkov1.mjx-chtml {display: inline-block; line-height: 0; text-indent:...
ACX Covid Origins Post convinced readers — LessWrong
Published on May 1, 2024 1:06 PM GMTACX recently posted about the Rootclaim Covid origins debate,...
LessWrong Community Weekend 2024, open for applications — LessWrong
Published on May 1, 2024 10:18 AM GMTMain event pageFriday 13th September- Monday 16th September 2024...
AXRP Episode 30 - AI Security with Jeffrey Ladish — LessWrong
Published on May 1, 2024 2:50 AM GMTYouTube link Top labs use various forms of “safety...
Neuro/BCI/WBE for Safe AI Workshop — LessWrong
Published on May 1, 2024 12:46 AM GMTIf you're working on neurotechnology for safe AI, including...
AGI: Cryptography, Security & Multipolar Scenarios Workshop — LessWrong
Published on May 1, 2024 12:42 AM GMTIf you're working at the intersection between cryptogrpahy, secuity...
Super additivity of consciousness — LessWrong
Published on April 29, 2024 3:41 PM GMTIn “Freedom under naturalistic dualism” I have carefully argued...
Ironing Out the Squiggles — LessWrong
Published on April 29, 2024 4:13 PM GMTAdversarial Examples: A Problem The apparent successes of the...
AISC9 has ended and there will be an AISC10 — LessWrong
Published on April 29, 2024 10:53 AM GMTThe 9th AI Safety Camp (AISC9) just ended, and...
Open-Source AI: A Regulatory Review — LessWrong
Published on April 29, 2024 10:10 AM GMTCross-posted on the EA Forum. This article is part of...
Big-endian is better than little-endian — LessWrong
Published on April 29, 2024 2:30 AM GMTThis is a response to the post We Write...
San Francisco ACX Meetup “First Saturday” — LessWrong
Published on April 29, 2024 1:57 AM GMTDate: Saturday, May 4th, 2024Time: 1 pm – 3...
The Prop-room and Stage Cognitive Architecture — LessWrong
Published on April 29, 2024 12:48 AM GMTThis is a post on a novel cognitive architecture...
How are Simulators and Agents related? — LessWrong
Published on April 29, 2024 12:22 AM GMTIn this post, I will provide some speculative reasoning...
Extended Embodiment — LessWrong
Published on April 29, 2024 12:18 AM GMTI find that an especially illustrative thought experiment regarding...
Referential Containment — LessWrong
Published on April 29, 2024 12:16 AM GMTThis is an idea I am toying around with...
Release of UN's draft related to the governance of AI (a summary of the Simon Institute's response) — LessWrong
Published on April 27, 2024 6:34 PM GMTI just spent a couple of hours trying to...
Mercy to the Machine: Thoughts & Rights — LessWrong
Published on April 27, 2024 4:36 PM GMTAbstract: First [1)], a suggested general method of determining, for...
Constructability: AI safety via Pull Request — LessWrong
Published on April 27, 2024 4:04 PM GMTCharbel-Raphaël Segerie and Épiphanie Gédéon contributed equally to this...
So What's Up With PUFAs Chemically? — LessWrong
Published on April 27, 2024 1:32 PM GMTThis is exploratory investigation of a new-ish hypothesis, it...
Link: Let's Think Dot by Dot: Hidden Computation in Transformer Language Models by Jacob Pfau, William Merrill & Samuel R. Bowman — LessWrong
Published on April 27, 2024 1:22 PM GMTOne consideration that is pretty important for AI safety...
Two Vernor Vinge Book Reviews — LessWrong
Published on April 27, 2024 12:14 PM GMTVernor Vinge is a legendary and recently deceased sci-fi...
Refusal in LLMs is mediated by a single direction — LessWrong
Published on April 27, 2024 11:13 AM GMTThis work was produced as part of Neel Nanda's...
WSJ: Thinking doesn't have to feel so hard — LessWrong
Published on April 27, 2024 10:14 AM GMTNew tricks and tools can make cognitively demanding tasks...
Plausibility of Getting Early Warning Shots because AIs can't coordinate? — LessWrong
Published on April 27, 2024 8:02 AM GMTI don't know if this is a well known...
Suerposition is not "just" neuron polysemanticity — LessWrong
Published on April 26, 2024 11:22 PM GMTTL;DR: In this post, I distinguish between two related...
"Why I Write" by George Orwell (1946) — LessWrong
Published on April 25, 2024 4:02 PM GMTPeople have been posting great essays so that they're...
Cybersecurity of Frontier AI Models — LessWrong
Published on April 25, 2024 2:51 PM GMTThis article is part of a series of ~10...
The first future and the best future — LessWrong
Published on April 25, 2024 6:40 AM GMTIt seems to me worth trying to slow down...
NIH Cancer Myths Myths — LessWrong
Published on April 25, 2024 5:43 AM GMTThe NIH has a page called Cancer Myths and...
social lemon markets — LessWrong
Published on April 25, 2024 2:18 AM GMT I refuse to join any club that would...
Bayesian inference without priors — LessWrong
Published on April 24, 2024 11:50 PM GMTEpistemic status: party trick Why remove the prior One...
The Inner Ring by C. S. Lewis — LessWrong
Published on April 24, 2024 10:48 PM GMTNote: In @Nathan Young's words "It seems like great...
This is Water by David Foster Wallace — LessWrong
Published on April 24, 2024 9:21 PM GMTNote: It seems like great essays should go here...
Is being a trans woman +20 IQ? — LessWrong
Published on April 24, 2024 8:04 PM GMTWarning: This post might be depressing to read for...
Betadine oral rinses for covid and other viral infections — LessWrong
Published on April 24, 2024 5:50 PM GMTBefore we get started, this is your quarterly reminder...