~www_lesswrong_com | Bookmarks (706)
-
How Close We Are to a Complete List of Imprinted Genes — LessWrong
Published on April 19, 2025 6:37 PM GMTThis post summarizes some of the research I have...
-
AI, Alignment & the Art of Relationship Design — LessWrong
Published on April 19, 2025 12:47 AM GMTWe don’t always know what we’re looking for until...
-
Novel Idea Generation in LLMs: Judgment as Bottleneck — LessWrong
Published on April 19, 2025 3:37 PM GMTIn the face of any hard problem—reversing climate change,...
-
Why Should I Assume CCP AGI is Worse Than USG AGI? — LessWrong
Published on April 19, 2025 2:47 PM GMTThough, given my doomerism, I think the natsec framing...
-
An Introduction to SAEs and their Variants for Mech Interp — LessWrong
Published on April 19, 2025 2:09 PM GMTI aim to cover a lot of ground, but...
-
AI Advances and Detection Strategy — LessWrong
Published on April 19, 2025 11:40 AM GMT Cross-posted from my NAO Notebook. This is an...
-
Emotional Theory for a Technical Manual on How Not to Freeze Completely — LessWrong
Published on April 19, 2025 9:12 AM GMTThe ambulance screeched to a halt with the flair...
-
SecureDrop review — LessWrong
Published on April 19, 2025 4:29 AM GMTThis is a living document. Crosspost below may not...
-
o3 Will Use Its Tools For You — LessWrong
Published on April 18, 2025 9:20 PM GMTOpenAI has finally introduced us to the full o3...
-
AI Control Methods Literature Review — LessWrong
Published on April 18, 2025 9:15 PM GMTAI Control is a subfield of AI Safety research...
-
Announcing Progress Conference 2025 — LessWrong
Published on April 17, 2025 5:12 PM GMTLast fall the Roots of Progress Institute hosted the...
-
Host Keys and SSHing to EC2 — LessWrong
Published on April 17, 2025 3:10 PM GMT I do a lot of work on EC2,...
-
AI #112: Release the Everything — LessWrong
Published on April 17, 2025 3:10 PM GMTOpenAI has upgraded its entire suite of models. By...
-
How worker co-ops can help restore social trust — LessWrong
Published on April 17, 2025 2:14 PM GMTThe US is experiencing a great decline in trust....
-
On AI personhood — LessWrong
Published on April 17, 2025 12:31 PM GMTIt seems to me the question of consciousness of...
-
8 PRIME SKILLS An analisis — LessWrong
Published on April 17, 2025 11:36 AM GMTWhat is this about?With some parameters we have thus...
-
8 PRIME SKILLS - A simplified construction from MaxEnt Informational Efficiency in 4 questions — LessWrong
Published on April 17, 2025 11:04 AM GMTWhat is this about?We often experience complex things (like...
-
Understanding and overcoming AGI apathy — LessWrong
Published on April 17, 2025 1:04 AM GMTCrossposting from my substack.Note for LW: This post is...
-
ALLFED emergency appeal: Help us raise $800,000 to avoid cutting half of programs — LessWrong
Published on April 16, 2025 9:47 PM GMTSUMMARY: ALLFED is making an emergency appeal here due to...
-
Prodromes and Biomarkers in Chronic Disease — LessWrong
Published on April 16, 2025 9:30 PM GMTMidjourneyThanks to Renaissance Philanthropy for their support of my...
-
To be legible, evidence of misalignment probably has to be behavioral — LessWrong
Published on April 15, 2025 6:14 PM GMTOne key hope for mitigating risk from misalignment is...
-
AISN #51: AI Frontiers — LessWrong
Published on April 15, 2025 4:01 PM GMTWelcome to the AI Safety Newsletter by the Center...
-
Surprising LLM reasoning failures make me think we still need qualitative breakthroughs for AGI — LessWrong
Published on April 15, 2025 3:56 PM GMTIntroductionWriting this post puts me in a weird epistemic...
-
OpenAI #13: Altman at TED and OpenAI Cutting Corners on Safety Testing — LessWrong
Published on April 15, 2025 3:30 PM GMTThree big OpenAI news items this week were the...