Bookmarks (696)

screenshot

Index of rationalist groups in the Bay July 2024 — LessWrong

lesswrong.com

screenshot

1

screenshot

End Single Family Zoning by Overturning Euclid V Ambler — LessWrong

lesswrong.com

screenshot

1

screenshot

Common Uses of "Acceptance" — LessWrong

lesswrong.com

screenshot

1

screenshot

Universal Basic Income and Poverty — LessWrong

lesswrong.com

screenshot

1

screenshot

A Solomonoff Inductor Walks Into a Bar: Schelling Points for Communication — LessWrong

lesswrong.com

screenshot

1

screenshot

What does a Gambler's Verity world look like? — LessWrong

lesswrong.com

screenshot

1

screenshot

Pacing Outside the Box: RNNs Learn to Plan in Sokoban — LessWrong

lesswrong.com

screenshot

1

screenshot

Does robustness improve with scale? — LessWrong

lesswrong.com

screenshot

1

screenshot

Organisation for Program Equilibrium reading group — LessWrong

lesswrong.com

screenshot

1

screenshot

Constructing Benchmarks and Interventions for Combating Hallucinations in LLMs — LessWrong

lesswrong.com

screenshot

1

screenshot

"AI achieves silver-medal standard solving International Mathematical Olympiad problems" — LessWrong

lesswrong.com

screenshot

1

screenshot

AlphaProof: an LLM to auto-formalize + AlphaZero self-trained to prove mathematical statements in Lean — LessWrong

lesswrong.com

screenshot

1

screenshot

[Talk transcript] What “structure” is and why it matters — LessWrong

lesswrong.com

screenshot

1

screenshot

AI #74: GPT-4o Mini Me and Llama 3 — LessWrong

lesswrong.com

screenshot

1

screenshot

AI Constitutions are a tool to reduce societal scale risk — LessWrong

lesswrong.com

screenshot

1

screenshot

Determining the power of investors over Frontier AI Labs is strategically important to reduce x-risk — LessWrong

lesswrong.com

screenshot

1

screenshot

A framework for thinking about AI power-seeking — LessWrong

lesswrong.com

screenshot

1

screenshot

Llama Llama-3-405B? — LessWrong

lesswrong.com

screenshot

1

screenshot

AI Safety Memes Wiki — LessWrong

lesswrong.com

screenshot

1

screenshot

Unlearning via RMU is mostly shallow — LessWrong

lesswrong.com

screenshot

1

screenshot

Monthly Roundup #20: July 2024 — LessWrong

lesswrong.com

screenshot

1

screenshot

Confusing the metric for the meaning: Perhaps correlated attributes are "natural" — LessWrong

lesswrong.com

screenshot

1

screenshot

Ransomware Payments Should Require a Sin Tax — LessWrong

lesswrong.com

screenshot

1

screenshot

My covid-related beliefs and questions — LessWrong

lesswrong.com

screenshot

1

screenshot

Is there a Schelling point for group house room listings? — LessWrong

lesswrong.com

screenshot

1

screenshot

Room Available in Boston Group House — LessWrong

lesswrong.com

screenshot

1

screenshot

D&D.Sci Scenario Index — LessWrong

lesswrong.com

screenshot

1

screenshot

ML Safety Research Advice - GabeM — LessWrong

lesswrong.com

screenshot

1

screenshot

Trying to understand Hanson's Cultural Drift argument — LessWrong

lesswrong.com

screenshot

1

screenshot

Using an LLM perplexity filter to detect weight exfiltration — LessWrong

lesswrong.com

screenshot

1

screenshot

Would a scope-insensitive AGI be less likely to incapacitate humanity? — LessWrong

lesswrong.com

screenshot

1

screenshot

Holomorphic surjection theorem (Picard's little theorem) — LessWrong

lesswrong.com

screenshot

1

screenshot

aimless ace analyzes active amateur: a micro-aaaaalignment proposal — LessWrong

lesswrong.com

screenshot

1

screenshot

Pivotal Acts are easier than Alignment? — LessWrong

lesswrong.com

screenshot

1

screenshot

Introduction to Modern Dating: Strategic Dating Advice for beginners — LessWrong

lesswrong.com

screenshot

1

screenshot

Ball Sq Pathways — LessWrong

lesswrong.com

screenshot

1

screenshot

Freedom and Privacy of Thought Architectures — LessWrong

lesswrong.com

screenshot

1

screenshot

Why Georgism Lost Its Popularity — LessWrong

lesswrong.com

screenshot

screenshot

2

screenshot

A more systematic case for inner misalignment — LessWrong

lesswrong.com

screenshot

1

screenshot

Truth is Universal: Robust Detection of Lies in LLMs — LessWrong

lesswrong.com

screenshot

1

screenshot

Sustainability of Digital Life Form Societies — LessWrong

lesswrong.com

screenshot

1

screenshot

JumpReLU SAEs + Early Access to Gemma 2 SAEs — LessWrong

lesswrong.com

screenshot

1

screenshot

Romae Industriae — LessWrong

lesswrong.com

screenshot

1

screenshot

Have people given up on iterated distillation and amplification? — LessWrong

lesswrong.com

screenshot

1

screenshot

How do we know that "good research" is good? (aka "direct evaluation" vs "eigen-evaluation") — LessWrong

lesswrong.com

screenshot

1

screenshot

Linkpost: Surely you can be serious — LessWrong

lesswrong.com

screenshot

1

screenshot

My experience applying to MATS 6.0 — LessWrong

lesswrong.com

screenshot

1

screenshot

What are the actual arguments in favor of computationalism as a theory of identity? — LessWrong

lesswrong.com

screenshot

1

screenshot

Yet Another Critique of "Luxury Beliefs" — LessWrong

lesswrong.com

screenshot

1

screenshot

Individually incentivized safe Pareto improvements in open-source bargaining — LessWrong

lesswrong.com

screenshot

1