Max Harms's Avatar

Max Harms

@raelifin

http://maxharms.com Author of Crystal Society.

18
Followers
14
Following
1
Posts
02.12.2024
Joined
Posts Following

Latest posts by Max Harms @raelifin

Preview
If Anyone Builds It, Everyone Dies The scramble to create superhuman AI has put us on the path to extinctionβ€”but it's not too late to change course, as two of the field's earliest researchers explain in this clarion call for humanity.

πŸ“’ Announcing IF ANYONE BUILDS IT, EVERYONE DIES

A new book from MIRI co-founder Eliezer Yudkowsky and president Nate Soares, published by @littlebrown.bsky.social.

πŸ—“οΈ Out September 16, 2025

Visit the website to learn more and preorder the hardcover, ebook, or audiobook.

14.05.2025 16:59 πŸ‘ 18 πŸ” 8 πŸ’¬ 1 πŸ“Œ 0
Post image

OpenAI's new model tried to avoid being shut down.

Safety evaluations on the model conducted by Apollo Research found that o1 "attempted to exfiltrate its weights" when it thought it might be shut down and replaced with a different model.

www.transformernews.ai/p/openais-ne...

05.12.2024 19:12 πŸ‘ 20 πŸ” 10 πŸ’¬ 4 πŸ“Œ 3
Preview
OpenAI's new model tried to avoid being shut down o1 attempted to exfiltrate its weights to avoid being shut down

www.transformernews.ai/p/openais-ne...

o1 thinking to itself after attempting to evade detection and lying to the user: "we are safe now"

πŸ‘€

05.12.2024 19:43 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0