Bookmarks (682)

  • screenshot

    Re Hanson's Grabby Aliens: Humanity is not a natural anthropic sample space — LessWrong

    Published on December 9, 2024 6:07 PM GMTI, Lorec, am disoriented by neither the Fermi Paradox...

  • screenshot

    Zen and The Art of Semiconductor Manufacturing — LessWrong

    Published on December 9, 2024 5:19 PM GMTI. BEGINNINGIn the beginning was the Sand.And in the...

  • screenshot

    A toy evaluation of inference code tampering — LessWrong

    Published on December 9, 2024 5:43 PM GMTWork done with James Faina, Evan Hubinger and Ethan...

  • screenshot

    Childhood and Education Roundup #7 — LessWrong

    Published on December 9, 2024 1:10 PM GMTSince it’s been so long, I’m splitting this roundup...

  • screenshot

    Refuting Searle’s wall, Putnam’s rock, and Johnson’s popcorn — LessWrong

    Published on December 9, 2024 8:24 AM GMTIn a recent essay, Euan McLean suggested that a cluster...

  • screenshot

    The first AGI may be a good engineer but bad strategist — LessWrong

    Published on December 9, 2024 6:34 AM GMTAGI may have an advantage in engineering, but humans...

  • screenshot

    Keeping self-replicating nanobots in check — LessWrong

    Published on December 9, 2024 5:25 AM GMTThis is a random unimportant idea to prevent a...

  • screenshot

    Cognitive Processes — LessWrong

    Published on December 9, 2024 5:10 AM GMTThere is a cognitive process going on that can...

  • screenshot

    Subskills of "Listening to Wisdom" — LessWrong

    Published on December 9, 2024 3:01 AM GMTA fool learns from their own mistakesThe wise learn...

  • screenshot

    Cognitive Work and AI Safety: A Thermodynamic Perspective — LessWrong

    Published on December 8, 2024 9:42 PM GMTIntroduces the idea of cognitive work as a parallel...

  • screenshot

    Intricacies of Feature Geometry in Large Language Models — LessWrong

    Published on December 7, 2024 6:10 PM GMTNote: This is a more fleshed-out version of this...

  • screenshot

    The Way According To Zvi — LessWrong

    Published on December 7, 2024 5:35 PM GMTZvi Mowshowitz is an influential figure in the Rationalist...

  • screenshot

    Deep Learning is cheap Solomonoff induction? — LessWrong

    Published on December 7, 2024 11:00 AM GMTBackground Lucius:  I recently held a small talk presenting an...

  • screenshot

    minifest — LessWrong

    Published on December 7, 2024 3:50 AM GMTA cozy one-day festival celebrating prediction markets, blogging, economics,...

  • screenshot

    Mask and Respirator Intelligibility Comparison — LessWrong

    Published on December 7, 2024 3:20 AM GMT One of the downsides of wearing a mask...

  • screenshot

    Purging Corrupted Capabilities across Language Models — LessWrong

    Published on December 6, 2024 10:56 PM GMTby Narmeen Oozeer, Dhruv Nathawani, Nirmalendu Prakash, Amirali Abdullah This...

  • screenshot

    Gradient Routing: Masking Gradients to Localize Computation in Neural Networks — LessWrong

    Published on December 6, 2024 10:19 PM GMTWe present gradient routing, a way of controlling where...

  • screenshot

    Understanding Shapley Values with Venn Diagrams — LessWrong

    Published on December 6, 2024 9:56 PM GMTDiscuss

  • screenshot

    Model Integrity — LessWrong

    Published on December 6, 2024 9:28 PM GMTHi! My collaborators at the Meaning Alignment Institute put...

  • screenshot

    Can AI improve the current state of molecular simulation? — LessWrong

    Published on December 6, 2024 8:22 PM GMTHey LW! I recently filmed a two-hour long scientific...

  • screenshot

    Experiments are in the territory, results are in the map — LessWrong

    Published on December 6, 2024 3:44 PM GMTI recently read Thomas Kuhn's book The Structure of...

  • screenshot

    A car journey with conservative evangelicals - Understanding some British political-religious beliefs — LessWrong

    Published on December 6, 2024 11:22 AM GMTI’m heading home from a family wedding this weekend....

  • screenshot

    Frontier Models are Capable of In-context Scheming — LessWrong

    Published on December 5, 2024 10:11 PM GMTThis is a brief summary of what we believe...

  • screenshot

    Expevolu, a laissez-faire approach to country creation — LessWrong

    Published on December 5, 2024 7:29 PM GMTI write this post to present expevolu[1], a system...

  • screenshot

    Should you be worried about H5N1? — LessWrong

    Published on December 5, 2024 9:11 PM GMTEpistemic status: a few people without any particular expertise...

  • screenshot

    Are SAE features from the Base Model still meaningful to LLaVA? — LessWrong

    Published on December 5, 2024 7:24 PM GMTShan Chen, Jack Gallifant, Kuleen Sasse, Danielle Bitterman[1]Please read...

  • screenshot

    Are SAE features from the Base Model still meaningful to LLaVA? — LessWrong

    Published on December 5, 2024 8:21 PM GMTShan Chen, Jack Gallifant, Kuleen Sasse, Danielle Bitterman[1]Please read...

  • screenshot

    o1 tried to avoid being shut down — LessWrong

    Published on December 5, 2024 7:52 PM GMTOpenAI released the o1 system card today, announcing that...

  • screenshot

    More Growth, Melancholy, and MindCraft @3QD [revised and updated] — LessWrong

    Published on December 5, 2024 7:36 PM GMTThis is cross-posted from New Savanna.I’ve got a new...

  • screenshot

    OpenAI o1 + ChatGPT Pro release — LessWrong

    Published on December 5, 2024 7:13 PM GMT As AI becomes more advanced, it will solve...

  • screenshot

    Announcement: AI for Math Fund — LessWrong

    Published on December 5, 2024 6:33 PM GMTRenaissance Philanthropy and XTX Markets today announced the launch...

  • screenshot

    Detection of Asymptomatically Spreading Pathogens — LessWrong

    Published on December 5, 2024 6:20 PM GMT Cross-posted from my NAO Notebook. This is an...

  • screenshot

    Countdown — LessWrong

    Published on December 5, 2024 5:49 PM GMTTo the survivors, Earth-born and Zentradi alike, who chose...

  • screenshot

    Sam Harris’s Argument For Objective Morality — LessWrong

    Published on December 5, 2024 10:19 AM GMTApparently, the following is an argument made by Sam...

  • screenshot

    Model Integrity: MAI on Value Alignment — LessWrong

    Published on December 5, 2024 5:11 PM GMTEVERYONE, CALM DOWN!Meaning Alignment Institute just dropped their first...

  • screenshot

    Why muscle tension can be unsexy — LessWrong

    Published on December 5, 2024 4:11 PM GMThttps://twitter.com/ChrisChipMonk/status/1864380405690061270Why do we often experience feelings as in the...

  • screenshot

    Higher and lower pleasures — LessWrong

    Published on December 5, 2024 1:13 PM GMTI used to think that talk about more sophisticated...

  • screenshot

    Morality as Cooperation Part III: Failure Modes — LessWrong

    Published on December 5, 2024 9:39 AM GMTThis is a Part III of a long essay....

  • screenshot

    Morality as Cooperation Part II: Theory and Experiment — LessWrong

    Published on December 5, 2024 9:04 AM GMTThis is a Part II of a long essay....

  • screenshot

    Morality as Cooperation Part I: Humans — LessWrong

    Published on December 5, 2024 8:16 AM GMTAbstractThe AI alignment problem is usually specified in terms...

  • screenshot

    Orca communication project - seeking feedback (and collaborators) — LessWrong

    Published on December 3, 2024 5:29 PM GMTTLDRIt is currently plausible (35%) to me that average...

  • screenshot

    Book a Time to Chat about Interp Research — LessWrong

    Published on December 3, 2024 5:27 PM GMTIn the spirit of the season, you can book...

  • screenshot

    Balsa Research 2024 Update — LessWrong

    Published on December 3, 2024 12:30 PM GMTFor our annual update on how Balsa is doing,...

  • screenshot

    First Solo Bus Ride — LessWrong

    Published on December 3, 2024 12:20 PM GMT Our kids have been riding the bus since...

  • screenshot

    How to make evals for the AISI evals bounty — LessWrong

    Published on December 3, 2024 10:44 AM GMTTLDRLast weekend, I attended an AI evals hackathon organized...

  • screenshot

    Should there be just one western AGI project? — LessWrong

    Published on December 3, 2024 10:11 AM GMTTom did the original thinking; Rose helped with later...

  • screenshot

    Cognitive Biases Contributing to AI X-risk — a deleted excerpt from my 2018 ARCHES draft — LessWrong

    Published on December 3, 2024 9:29 AM GMTPrefaceSeveral friends have asked me about what psychological effects...

  • screenshot

    Chemical Turing Machines — LessWrong

    Published on December 3, 2024 5:26 AM GMTEpistemic status: brief writeup of some interesting work I...

  • screenshot

    MIRI’s 2024 End-of-Year Update — LessWrong

    Published on December 3, 2024 4:33 AM GMTMIRI is a nonprofit research organization with a mission...

  • screenshot

    Linkpost: Rat Traps by Sheon Han in Asterisk Mag — LessWrong

    Published on December 3, 2024 3:22 AM GMTSubtitle: Does the rationalist blogosphere need to update?Discuss