~www_lesswrong_com | Bookmarks (682)
-
DeepMind: Frontier Safety Framework — LessWrong
Published on May 17, 2024 5:30 PM GMTDeepMind's RSP is here. Excerpt from the blogpost:Today, we...
-
Identifying Functionally Important Features with End-to-End Sparse Dictionary Learning — LessWrong
Published on May 17, 2024 4:25 PM GMTA short summary of the paper is presented below.This...
-
AISafety.com – Resources for AI Safety — LessWrong
Published on May 17, 2024 3:57 PM GMTThere are many resources for those who wish to...
-
My Hammer Time Final Exam — LessWrong
Published on May 17, 2024 9:28 AM GMTEpistemic Status: I thought about and wrote each paragraph...
-
Is There Really a Child Penalty in the Long Run? — LessWrong
Published on May 17, 2024 11:56 AM GMTA couple of weeks ago three European economists published...
-
Is there a place to find the most cited LW articles of all time? — LessWrong
Published on May 17, 2024 1:20 AM GMTI expect it would be useful when developing an...
-
D&D.Sci (Easy Mode): On The Construction Of Impossible Structures — LessWrong
Published on May 17, 2024 12:25 AM GMTThis is a D&D.Sci scenario: a puzzle where players...
-
To an LLM, everything looks like a logic puzzle — LessWrong
Published on May 16, 2024 10:21 PM GMTI keep seeing this meme doing the rounds where...
-
AI Safety Institute's Inspect hello world example for AI evals — LessWrong
Published on May 16, 2024 8:47 PM GMTSharing my detailed walk-through on using the UK AI...
-
Feeling (instrumentally) Rational — LessWrong
Published on May 16, 2024 6:56 PM GMTContra this post from the SequencesIn Eliezer's sequence post,...
-
How is GPT-4o Related to GPT-4? — LessWrong
Published on May 15, 2024 6:33 PM GMTGPT-4o both has a new tokenizer and was trained...
-
[Linkpost] Please don't take Lumina's anticavity probiotic — LessWrong
Published on May 15, 2024 6:03 PM GMTI suspect some number of LWers have taken or...
-
Was Partisanship Good for the Environmental Movement? — LessWrong
Published on May 15, 2024 5:30 PM GMTThis is the third in a sequence of posts...
-
Quantized vs. continuous nature of qualia — LessWrong
Published on May 15, 2024 12:52 PM GMTThis question is not very well-posed, but I've done...
-
How to be a messy thinker — LessWrong
Published on May 15, 2024 11:57 AM GMTCrossposted from my blog: https://invertedpassion.com/how-to-be-a-messy-thinker/I love thinking about thinking....
-
Embedded Whistle Synth — LessWrong
Published on May 15, 2024 2:50 AM GMT A few years ago I ported my whistle...
-
Catastrophic Goodhart in RL with KL penalty — LessWrong
Published on May 15, 2024 12:58 AM GMTTLDR: In the last two posts, we showed that...
-
Ilya Sutskever and Jan Leike resign from OpenAI — LessWrong
Published on May 15, 2024 12:45 AM GMTIlya Sutskever and Jan Leike have resigned. They led...
-
my note system — LessWrong
Published on May 15, 2024 12:20 AM GMTI've been told that my number of blog posts...
-
MIRI's May 2024 Newsletter — LessWrong
Published on May 15, 2024 12:13 AM GMTMIRI updates:MIRI is shutting down the Visible Thoughts Project.We...
-
GPT-4o is out — LessWrong
Published on May 13, 2024 6:33 PM GMTOpenAI just announced an improved LLM called GPT-4o.From their...
-
Somerville Porchfest Thoughts — LessWrong
Published on May 13, 2024 5:20 PM GMT This Saturday was Porchfest in Somerville, an annual...
-
Branding AI Safety Groups: A Field Guide — LessWrong
Published on May 13, 2024 5:17 PM GMTThis article is the first in a series I plan to...
-
Against Student Debt Cancellation From All Sides of the Political Compass — LessWrong
Published on May 13, 2024 2:55 PM GMTA stance against student debt cancellation doesn’t rely on...