Bookmarks (610)

  • screenshot

    Generalized Stat Mech: The Boltzmann Approach — LessWrong

    Published on April 12, 2024 5:47 PM GMT Context: There's a common intuition that the tools and frames...

  • screenshot

    AISN #33: Reassessing AI and Biorisk Plus, Consolidation in the Corporate AI Landscape, and National Investments in AI — LessWrong

    Published on April 12, 2024 4:10 PM GMT Welcome to the AI Safety Newsletter by the Center...

  • screenshot

    UDT1.01: Plannable and Unplanned Observations (3/10) — LessWrong

    Published on April 12, 2024 5:24 AM GMT The Omnipresence of Unplanned Observations Time to introduce some...

  • screenshot

    Report: Evaluating an AI Chip Registration Policy — LessWrong

    Published on April 12, 2024 4:39 AM GMT As part of our Governance Recommendations Research Program, Convergence Analysis is...

  • screenshot

    Interference Issues — LessWrong

    Published on April 12, 2024 2:30 AM GMT I've been working on building an electronic harp...

  • screenshot

    A D&D.Sci Dodecalogue — LessWrong

    Published on April 12, 2024 1:10 AM GMT Below is some advice on making D&D.Sci scenarios. I’m...

  • screenshot

    Upcoming unambiguously good tech possibilities? (Like eg indoor plumbing) — LessWrong

    Published on April 11, 2024 11:14 PM GMT I recently asked about the glorious AI future but...

  • screenshot

    Leave No Context Behind - A Comment — LessWrong

    Published on April 11, 2024 10:50 PM GMT Leave No Context Behind: Efficient Infinite Context Transformers with...

  • screenshot

    AXRP Episode 27 - AI Control with Buck Shlegeris and Ryan Greenblatt — LessWrong

    Published on April 11, 2024 9:30 PM GMT A lot of work to prevent AI existential...

  • screenshot

    ChatGPT defines 10 concrete terms: generically, for 5- and 11-year-olds, and for a scientist — LessWrong

    Published on April 11, 2024 8:27 PM GMT This is cross-posted from New Savanna. The difference between concrete...

  • screenshot

    RTFB: On the New Proposed CAIP AI Bill — LessWrong

    Published on April 10, 2024 6:30 PM GMT A New Bill Offer Has Arrived Center for AI...

  • screenshot

    Thinking harder doesn’t work — LessWrong

    Published on April 10, 2024 6:00 PM GMT I’ve always been an extremely left-brainy guy. Few things give...

  • screenshot

    Scaling Laws and Superposition — LessWrong

    Published on April 10, 2024 3:36 PM GMT Summary: Using results from scaling laws, this short note argues...

  • screenshot

    Responsible Advanced Artificial Intelligence Act — LessWrong

    Published on April 10, 2024 2:35 PM GMT Earlier today (4/9), the Center for AI Policy released...

  • screenshot

    Apply to the Pivotal Research Fellowship (AI Safety & Biosecurity) — LessWrong

    Published on April 10, 2024 12:08 PM GMT The Swiss Existential Risk Initiative (CHERI) is now called Pivotal...

  • screenshot

    How I select alignment research projects — LessWrong

    Published on April 10, 2024 4:33 AM GMT Youtube Video. Recently, I was interviewed by Henry Sleight and...

  • screenshot

    How to accelerate recovery from sleep debt with biohacking? — LessWrong

    Published on April 10, 2024 1:27 AM GMT I have at least 40 hours of sleep debt...

  • screenshot

    Ophiology (or, how the Mamba architecture works) — LessWrong

    Published on April 9, 2024 7:31 PM GMT The following post was made as part of Danielle's...

  • screenshot

    Apply to LASR Labs: a London-based technical AI safety research programme — LessWrong

    Published on April 9, 2024 5:34 PM GMT TLDR; apply by April 24th to join a 12-week programme...

  • screenshot

    "Decentralized Autonomous Education" - Call for Reviewers (Seeds of Science) — LessWrong

    Published on April 9, 2024 2:39 PM GMT Abstract: We propose a novel model for teaching and learning...

  • screenshot

    How We Picture Bayesian Agents — LessWrong

    Published on April 8, 2024 6:12 PM GMT I think that when most people picture a Bayesian...

  • screenshot

    Analyzing the moral value of unaligned AIs — LessWrong

    Published on April 8, 2024 6:04 PM GMT

  • screenshot

    Investigating the role of agency in AI x-risk — LessWrong

    Published on April 8, 2024 3:12 PM GMT

  • screenshot

    Measuring Learned Optimization in Small Transformer Models — LessWrong

    Published on April 8, 2024 2:41 PM GMT This is original, independent research carried out in March...

  • screenshot

    Gated Attention Blocks: Preliminary Progress toward Removing Attention Head Superposition — LessWrong

    Published on April 8, 2024 11:14 AM GMT This work represents progress on removing attention head superposition....

  • screenshot

    Math-to-English Cheat Sheet — LessWrong

    Published on April 8, 2024 9:19 AM GMT Say you've learnt math in your native language which...

  • screenshot

    Normalizing Sparse Autoencoders — LessWrong

    Published on April 8, 2024 6:17 AM GMT TL;DR: Sparse autoencoders (SAEs) presents us a promising direction towards...

  • screenshot

    What does it take to transfer the knowledge to action? — LessWrong

    Published on April 8, 2024 6:23 AM GMT This is a quite personal question. Feel free to...

  • screenshot

    A Dozen Ways to Get More Dakka — LessWrong

    Published on April 8, 2024 4:45 AM GMT As the dictum goes, “If it helps but doesn’t solve...

  • screenshot

    On hiatus — LessWrong

    Published on April 7, 2024 4:21 PM GMT So, looks like the publication of my book didn't...

  • screenshot

    The Poker Theory of Poker Night — LessWrong

    Published on April 7, 2024 9:47 AM GMT Link to my own article. I removed the explanation...

  • screenshot

    on the dollar-yen exchange rate — LessWrong

    Published on April 7, 2024 4:49 AM GMT Recently, the yen-dollar exchange rate hit a 34-year low....

  • screenshot

    Conflict in Posthuman Literature — LessWrong

    Published on April 6, 2024 10:26 PM GMT Grant Snider created this comic (which became a meme): Richard...

  • screenshot

    "Fractal Strategy" workshop report — LessWrong

    Published on April 6, 2024 9:26 PM GMT I just ran a workshop teaching the rationality concepts...

  • screenshot

    The 2nd Demographic Transition — LessWrong

    Published on April 6, 2024 2:10 PM GMT Birth rates in the developed world are below replacement...

  • screenshot

    My intellectual journey to (dis)solve the hard problem of consciousness — LessWrong

    Published on April 6, 2024 9:32 AM GMT Epistemological status: At least a fun journey. I wanted...

  • screenshot

    Measuring Predictability of Persona Evaluations — LessWrong

    Published on April 6, 2024 8:46 AM GMT This work was done by Thee Ho as part...

  • screenshot

    Privacy and writing — LessWrong

    Published on April 6, 2024 8:20 AM GMT Epistemic status: N=1 I've always written several thousand words a...

  • screenshot

    Koan: divining alien datastructures from RAM activations — LessWrong

    Published on April 5, 2024 6:04 PM GMT [Metadata: crossposted from https://tsvibt.blogspot.com/2024/04/koan-divining-alien-datastructures-from.html.] Exploring the ruins of an...

  • screenshot

    On the 2nd CWT with Jonathan Haidt — LessWrong

    Published on April 5, 2024 5:30 PM GMT It was clear within the first ten minutes this...

  • screenshot

    End-to-end hacking with language models — LessWrong

    Published on April 5, 2024 3:06 PM GMT Cross-posted from https://tchauvin.com/end-to-end-hacking-with-language-models Produced as part of the SERI ML...

  • screenshot

    Movie posters — LessWrong

    Published on April 5, 2024 6:30 AM GMT Life involves anticipations. Hopes, dreads, lookings forward. Looking forward...

  • screenshot

    New social credit formalizations — LessWrong

    Published on April 5, 2024 6:30 AM GMT Here are some classic ways humans can get some...

  • screenshot

    Partial value takeover without world takeover — LessWrong

    Published on April 5, 2024 6:20 AM GMT People around me are very interested in AI taking...

  • screenshot

    On Complexity Science — LessWrong

    Published on April 5, 2024 2:24 AM GMT I have a long and confused love-hate relationship with...

  • screenshot

    New report: A review of the empirical evidence for existential risk from AI via misaligned power-seeking — LessWrong

    Published on April 4, 2024 11:41 PM GMT Visiting researcher Rose Hadshar recently published a review of...

  • screenshot

    Quick evidence review of bulking & cutting — LessWrong

    Published on April 4, 2024 9:43 PM GMT Epistemic status: fairly fast non-comprehensive literature review by a...

  • screenshot

    LLMs for Alignment Research: a safety priority? — LessWrong

    Published on April 4, 2024 8:03 PM GMT A recent short story by Gabriel Mukobi illustrates a...