Bookmarks (679)

  • screenshot

    Were there any ancient rationalists? — LessWrong

    Published on May 3, 2024 6:26 PM GMTI've recently read some cool posts on rationality through...

  • screenshot

    Key takeaways from our EA and alignment research surveys — LessWrong

    Published on May 3, 2024 6:10 PM GMTMany thanks to Spencer Greenberg, Lucius Caviola, Josh Lewis,...

  • screenshot

    "AI Safety for Fleshy Humans" an AI Safety explainer by Nicky Case — LessWrong

    Published on May 3, 2024 6:10 PM GMTNicky Case, of "The Evolution of Trust" and "We...

  • screenshot

    Nicky Case AI Safety explainer [Crosspost] — LessWrong

    Published on May 3, 2024 5:48 PM GMTNicky Case makes applets for explaining concepts. You may...

  • screenshot

    AI Safety for Fleshy Humans — LessWrong

    Published on May 3, 2024 5:43 PM GMTThis is an accessible introduction to AI Safety, written...

  • screenshot

    LLM+Planners hybridisation for friendly AGI — LessWrong

    Published on May 3, 2024 8:40 AM GMTEvery LLM in existence is a blackbox, and alignment...

  • screenshot

    AI Clarity: An Initial Research Agenda — LessWrong

    Published on May 3, 2024 1:54 PM GMTThis is cross-posted from our website: https://www.convergenceanalysis.org/publications/ai-clarity-an-initial-research-agendaExecutive Summary Transformative AI...

  • screenshot

    Apply to ESPR & PAIR, Rationality and AI Camps for Ages 16-21 — LessWrong

    Published on May 3, 2024 12:36 PM GMTTLDR – Apply now to ESPR and PAIR. ESPR welcomes students...

  • screenshot

    On precise out-of-context steering — LessWrong

    Published on May 3, 2024 9:41 AM GMTMeta: I'm writing this in the spirit of sharing...

  • screenshot

    Mechanistic Interpretability Workshop Happening at ICML 2024! — LessWrong

    Published on May 3, 2024 1:18 AM GMTAnnouncing the first academic Mechanistic Interpretability workshop, held at...

  • screenshot

    [Linkpost] Silver Bulletin: For most people, politics is about fitting in — LessWrong

    Published on May 1, 2024 6:12 PM GMTNate Silver tries to answer the question: "How do...

  • screenshot

    Shane Legg's necessary properties for every AGI Safety plan — LessWrong

    Published on May 1, 2024 5:15 PM GMTI've been going through the FAR AI videos from...

  • screenshot

    KAN: Kolmogorov-Arnold Networks — LessWrong

    Published on May 1, 2024 4:50 PM GMTAbstract:Inspired by the Kolmogorov-Arnold representation theorem, we propose Kolmogorov-Arnold...

  • screenshot

    Take SCIFs, it’s dangerous to go alone — LessWrong

    Published on May 1, 2024 8:02 AM GMTCoauthored by Dmitrii Volkov1.mjx-chtml {display: inline-block; line-height: 0; text-indent:...

  • screenshot

    ACX Covid Origins Post convinced readers — LessWrong

    Published on May 1, 2024 1:06 PM GMTACX recently posted about the Rootclaim Covid origins debate,...

  • screenshot

    LessWrong Community Weekend 2024, open for applications — LessWrong

    Published on May 1, 2024 10:18 AM GMTMain event pageFriday 13th September- Monday 16th September 2024...

  • screenshot

    AXRP Episode 30 - AI Security with Jeffrey Ladish — LessWrong

    Published on May 1, 2024 2:50 AM GMTYouTube link Top labs use various forms of “safety...

  • screenshot

    Neuro/BCI/WBE for Safe AI Workshop — LessWrong

    Published on May 1, 2024 12:46 AM GMTIf you're working on neurotechnology for safe AI, including...

  • screenshot

    AGI: Cryptography, Security & Multipolar Scenarios Workshop — LessWrong

    Published on May 1, 2024 12:42 AM GMTIf you're working at the intersection between cryptogrpahy, secuity...

  • screenshot

    Super additivity of consciousness — LessWrong

    Published on April 29, 2024 3:41 PM GMTIn “Freedom under naturalistic dualism” I have carefully argued...

  • screenshot

    Ironing Out the Squiggles — LessWrong

    Published on April 29, 2024 4:13 PM GMTAdversarial Examples: A Problem The apparent successes of the...

  • screenshot

    AISC9 has ended and there will be an AISC10 — LessWrong

    Published on April 29, 2024 10:53 AM GMTThe 9th AI Safety Camp (AISC9) just ended, and...

  • screenshot

    Open-Source AI: A Regulatory Review — LessWrong

    Published on April 29, 2024 10:10 AM GMTCross-posted on the EA Forum. This article is part of...

  • screenshot

    Big-endian is better than little-endian — LessWrong

    Published on April 29, 2024 2:30 AM GMTThis is a response to the post We Write...

  • screenshot

    San Francisco ACX Meetup “First Saturday” — LessWrong

    Published on April 29, 2024 1:57 AM GMTDate: Saturday, May 4th, 2024Time: 1 pm – 3...

  • screenshot

    The Prop-room and Stage Cognitive Architecture — LessWrong

    Published on April 29, 2024 12:48 AM GMTThis is a post on a novel cognitive architecture...

  • screenshot

    How are Simulators and Agents related? — LessWrong

    Published on April 29, 2024 12:22 AM GMTIn this post, I will provide some speculative reasoning...

  • screenshot

    Extended Embodiment — LessWrong

    Published on April 29, 2024 12:18 AM GMTI find that an especially illustrative thought experiment regarding...

  • screenshot

    Referential Containment — LessWrong

    Published on April 29, 2024 12:16 AM GMTThis is an idea I am toying around with...

  • screenshot

    Release of UN's draft related to the governance of AI (a summary of the Simon Institute's response) — LessWrong

    Published on April 27, 2024 6:34 PM GMTI just spent a couple of hours trying to...

  • screenshot

    Mercy to the Machine: Thoughts & Rights — LessWrong

    Published on April 27, 2024 4:36 PM GMTAbstract: First [1)], a suggested general method of determining, for...

  • screenshot

    Constructability: AI safety via Pull Request — LessWrong

    Published on April 27, 2024 4:04 PM GMTCharbel-Raphaël Segerie and Épiphanie Gédéon contributed equally to this...

  • screenshot

    So What's Up With PUFAs Chemically? — LessWrong

    Published on April 27, 2024 1:32 PM GMTThis is exploratory investigation of a new-ish hypothesis, it...

  • screenshot

    Link: Let's Think Dot by Dot: Hidden Computation in Transformer Language Models by Jacob Pfau, William Merrill & Samuel R. Bowman — LessWrong

    Published on April 27, 2024 1:22 PM GMTOne consideration that is pretty important for AI safety...

  • screenshot

    Two Vernor Vinge Book Reviews — LessWrong

    Published on April 27, 2024 12:14 PM GMTVernor Vinge is a legendary and recently deceased sci-fi...

  • screenshot

    Refusal in LLMs is mediated by a single direction — LessWrong

    Published on April 27, 2024 11:13 AM GMTThis work was produced as part of Neel Nanda's...

  • screenshot

    WSJ: Thinking doesn't have to feel so hard — LessWrong

    Published on April 27, 2024 10:14 AM GMTNew tricks and tools can make cognitively demanding tasks...

  • screenshot

    Plausibility of Getting Early Warning Shots because AIs can't coordinate? — LessWrong

    Published on April 27, 2024 8:02 AM GMTI don't know if this is a well known...

  • screenshot

    Suerposition is not "just" neuron polysemanticity — LessWrong

    Published on April 26, 2024 11:22 PM GMTTL;DR: In this post, I distinguish between two related...

  • screenshot

    "Why I Write" by George Orwell (1946) — LessWrong

    Published on April 25, 2024 4:02 PM GMTPeople have been posting great essays so that they're...

  • screenshot

    Cybersecurity of Frontier AI Models — LessWrong

    Published on April 25, 2024 2:51 PM GMTThis article is part of a series of ~10...

  • screenshot

    The first future and the best future — LessWrong

    Published on April 25, 2024 6:40 AM GMTIt seems to me worth trying to slow down...

  • screenshot

    NIH Cancer Myths Myths — LessWrong

    Published on April 25, 2024 5:43 AM GMTThe NIH has a page called Cancer Myths and...

  • screenshot

    social lemon markets — LessWrong

    Published on April 25, 2024 2:18 AM GMT I refuse to join any club that would...

  • screenshot

    Bayesian inference without priors — LessWrong

    Published on April 24, 2024 11:50 PM GMTEpistemic status: party trick Why remove the prior One...

  • screenshot

    The Inner Ring by C. S. Lewis — LessWrong

    Published on April 24, 2024 10:48 PM GMTNote: In @Nathan Young's words "It seems like great...

  • screenshot

    This is Water by David Foster Wallace — LessWrong

    Published on April 24, 2024 9:21 PM GMTNote: It seems like great essays should go here...

  • screenshot

    Is being a trans woman +20 IQ? — LessWrong

    Published on April 24, 2024 8:04 PM GMTWarning: This post might be depressing to read for...

  • screenshot

    Betadine oral rinses for covid and other viral infections — LessWrong

    Published on April 24, 2024 5:50 PM GMTBefore we get started, this is your quarterly reminder...