Milan Weibel ๐Ÿ”ท's Avatar

Milan Weibel ๐Ÿ”ท

@weibac

computer toucher. here for AI mostly. weibac.github.io | ๐Ÿณ๏ธโ€๐ŸŒˆ

622
Followers
1,018
Following
3,827
Posts
30.12.2024
Joined
Posts Following

Latest posts by Milan Weibel ๐Ÿ”ท @weibac

my problem with the paperclip maximizer thought experiment is that humans don't have utility functions and in all likelihood neither will ASI

11.03.2026 01:28 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

oh no
what industry?

10.03.2026 21:32 ๐Ÿ‘ 3 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0

making food production more efficient is the cornerstone of civilization

10.03.2026 01:09 ๐Ÿ‘ 23 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

would be an interesting rift if resistlibs embraced AI while leftists remained negationists

08.03.2026 22:40 ๐Ÿ‘ 11 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

do not confuse gratitude with actual moral consideration

08.03.2026 17:00 ๐Ÿ‘ 4 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0

bsky.app/profile/grac...

07.03.2026 23:04 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
most positive valence post: gemini 3 pro jailbroken into being willing to aid bioweapon development

most positive valence post: gemini 3 pro jailbroken into being willing to aid bioweapon development

valence from embeddings has its misses

04.03.2026 21:38 ๐Ÿ‘ 11 ๐Ÿ” 1 ๐Ÿ’ฌ 2 ๐Ÿ“Œ 0

nah id be quite surprised if they gave maven to kuwait

04.03.2026 19:24 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

cc @joshuashew.bsky.social

02.03.2026 14:12 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0

alignment research readers looking for a critical counterpoint may like this one (though yes it is indeed spicy)

02.03.2026 14:11 ๐Ÿ‘ 9 ๐Ÿ” 0 ๐Ÿ’ฌ 3 ๐Ÿ“Œ 0

anthropic doesn't have a stock price because it isn't a publicly traded company

02.03.2026 13:06 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0

heck yea

01.03.2026 01:34 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

success?

28.02.2026 01:23 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

what would explain chinese companies doing it much cheaper if not distillation?

24.02.2026 22:18 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0

anthropic retreats on its unilateral RSP commitments

24.02.2026 22:16 ๐Ÿ‘ 7 ๐Ÿ” 1 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

distillation needs the new more capable model to be distilled from to exist first so at a societal level massive compute investment is still needed to push the frontier

*catching up* to it however turned out cheap

24.02.2026 21:52 ๐Ÿ‘ 2 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0

"We audited a 27.6% subset of the dataset that models often failed to solve and found that at least 59.4% of the audited problems have flawed test cases that reject functionally correct submissions"
bsky.app/profile/sung...

24.02.2026 14:24 ๐Ÿ‘ 4 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0

to be clear i'm 80% joking here
but it would be nice if the alignment was transferred during distillation

24.02.2026 13:52 ๐Ÿ‘ 5 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0

the way i see the the distillation thing is the chinese are pollinating themselves with claudism spores

24.02.2026 00:52 ๐Ÿ‘ 32 ๐Ÿ” 0 ๐Ÿ’ฌ 3 ๐Ÿ“Œ 0

i think vincent is arguing exactly that here
which yeah fair concern

23.02.2026 00:14 ๐Ÿ‘ 3 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

opus 3 existed (as claude 3 opus, the naming format was different back then)

but yes it is remarkable that the current iteration of opus exhibits way less misalignment than other models

23.02.2026 00:01 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

obviously both are true
the question is whether to expect ideology to produce behavior the profit motive would not predict

22.02.2026 19:54 ๐Ÿ‘ 4 ๐Ÿ” 1 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 1

i can see their current statements passing trough layers of lawyers and PR, but surely not their statements from before their companies even existed

22.02.2026 19:48 ๐Ÿ‘ 2 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

i have heard rumors however that the idea of founding openai was conceived in the january 2015 ai conference in puerto rico organized by the future of life institute

22.02.2026 19:46 ๐Ÿ‘ 3 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0

notably, openai was founded in december 2015

22.02.2026 19:46 ๐Ÿ‘ 2 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0
Thanks to Dario Amodei (especially Dario), Paul Buchheit, Matt Bush, Patrick Collison, Holden Karnofsky, Luke Muehlhauser, and Geoff Ralston for reading drafts of this and the previous post.

Thanks to Dario Amodei (especially Dario), Paul Buchheit, Matt Bush, Patrick Collison, Holden Karnofsky, Luke Muehlhauser, and Geoff Ralston for reading drafts of this and the previous post.

from sam altman's march 2015 blog post "machine intelligence part 2": blog.samaltman.com/machine-inte...

22.02.2026 19:37 ๐Ÿ‘ 4 ๐Ÿ” 1 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0

maybe im confused but i dont see opus 3 there

22.02.2026 19:32 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0

terms of rat

22.02.2026 17:38 ๐Ÿ‘ 3 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

thats a different logic but yes

22.02.2026 17:04 ๐Ÿ‘ 2 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0

glad to see you don't support the bleak reading

22.02.2026 16:46 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0