Search Results

WebNews

Please enter a web search for web results.

NewsWeb

lesswrong. com
lesswrong. com > posts > Ywp D58 CXkksj C4 EEe > the-ai-x-risk-lawsuit-waiting-to-happen

The AI x-risk lawsuit waiting to happen " Less Wrong

1+ hour, 46+ min ago (229+ words) There are laws against recklessly endangering people's lives, and laws against creating a public nuisance. But, at least in the USA, the bar for bringing a case on such grounds seems to be quite high. Several people I've talked to…...

lesswrong. com
lesswrong. com > posts > f5 DKLs Ts RRhbip H4r > llm-assistant-personas-seem-increasingly-incoherent-some

llm assistant personas seem increasingly incoherent (some subjective observations) " Less Wrong

1+ hour, 43+ min ago (1262+ words) (This was originally going to be a "quick take" but then it got a bit long. Just FYI.) There's this weird trend I perceive with the personas of LLM assistants over time. It feels like they're getting less "coherent" in…...

lesswrong. com
lesswrong. com > posts > Fuau Qjjb TCS5 QFLk8 > not-a-paper-frontier-lab-ceos-are-capable-of-in-context

Not a Paper: "Frontier Lab CEOs are Capable of In-Context Scheming" " Less Wrong

2+ hour, 36+ min ago (453+ words) (Fragments from a research paper that will never be written) The frontier AI developers are becoming increasingly powerful and wealthy, significantly increasing their potential for risks. One concern is that of executive misalignment: when the CEO has different incentives and…...

lesswrong. com
lesswrong. com > posts > awh Ds Bna GJdh Kz2i E > notes-on-transformer-consciousness

Notes on Transformer Consciousness " Less Wrong

5+ hour, 36+ min ago (12+ words) Assuming transformers can have conscious experience, what would that experience be like? "...

lesswrong. com
lesswrong. com > posts > 8 Lk57 Fow6o8w WKhp7 > causal-inference-diary-skiing-causes-snow

Causal inference diary: skiing causes snow " Less Wrong

7+ hour, 15+ min ago (1059+ words) I've been playing with causal inference lately, as one does. [1] I was thinking of writing a more formal sequence about how to do causal discovery and...

lesswrong. com
lesswrong. com > posts > PH5b52q Wrmps3q76p > is-ai-welfare-work-puntable

Is AI welfare work puntable? " Less Wrong

8+ hour, 19+ min ago (716+ words) Arguably, AI welfare work'is relatively non-urgent and can be left'until after the intelligence explosion, since it is hard to make progress on and not needed to avoid AI takeover or authoritarian lock-in. Here is a basic case for why we…...

lesswrong. com
lesswrong. com > posts > xdk ZSe SQN4b A8hmrb > the-problem-in-the-nerd-sniping-xkcd-comic

The Problem in the "Nerd Sniping" xkcd Comic " Less Wrong

8+ hour, 56+ min ago (857+ words) A few days ago I saw this comic reposted, and I thought: wait! Unlike every prior time I have seen this comic, I actually know how to solve this now!...

lesswrong. com
lesswrong. com > posts > k A2pf6 DFq ZCxomp7i > comment-on-forecasting-is-way-overrated-and-we-should-stop

Comment on "Forecasting is Way Overrated, and We Should Stop Funding It" " Less Wrong

9+ hour, 20+ min ago (549+ words) Originally posted as a'comment on'this post. Reposting for visibility and since it is lengthy enough to be a standalone post. I plan to post a more comprehensive update in future describing FRI's impact and theory of change in more detail....

lesswrong. com
lesswrong. com > posts > MYTy E3jfd Svr WLFM7 > strategy-matters-when-someone-implements-it-astra-is

Strategy matters when someone implements it. Astra is cultivating people to do both. " Less Wrong

9+ hour, 38+ min ago (306+ words) TL; DR. AI safety needs more people who understand the field and its gaps deeply enough to own problems end-to-end, found new projects and organizations, and shape the threat models that the rest of the field runs on. Astra is…...

lesswrong. com
lesswrong. com > posts > SJdj Lg5z Sqrb2k Mc7 > exploding-note-apply-to-mentor-secure-program-synthesis

[exploding note] Apply to Mentor Secure Program Synthesis Fellowship by May 5th " Less Wrong

10+ hour, 46+ min ago (25+ words) Apart and the secure program synthesis community are launching a fellowship! A ton of you reading this would make great mentors" but the mentor appli...