Yen's Avatar

Yen

@littleblackdrink

Coffee, tech, politics, power structures in global trade + intl development, human rights (they / she)

985
Followers
2,663
Following
678
Posts
22.07.2023
Joined
Posts Following

Latest posts by Yen @littleblackdrink

Oooh pick me! βœ‹πŸ»

11.03.2026 23:15 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Has no one done β€œLet them eat surf and turf!” yet?

11.03.2026 11:26 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

And maybe they didn’t have the right size in stock so Trump just bought what they had and is making everyone deal with it

11.03.2026 11:23 πŸ‘ 5 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

This is having a very interesting effect on export-oriented companies whose European buyers demand they reduce GHG and emissions, but local laws limit them to 1 MW solar to support continued consumption of electricity produced from coal and LNG.

11.03.2026 08:25 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

😭😭😭

10.03.2026 22:33 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Might also be useful for them to consider that Iranians are not a monolith and not all were happy with the regime.

10.03.2026 10:44 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

There is such a dearth of information about LLMs and AI products for non-tech people. Anyone have recommendations for people to follow, things to read and subscribe to in order to stay up to date? #aitech #LLMs #aifornormies

10.03.2026 10:27 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Nav Toor
@heynavtoor
🚨BREAKING: OpenAI published a paper proving that ChatGPT will always make things up.

Not sometimes. Not until the next update. Always. They proved it with math.

Even with perfect training data and unlimited computing power, AI models will still confidently tell you things that are completely false. This isn't a bug they're working on. It's baked into how these systems work at a fundamental level.

And their own numbers are brutal. OpenAI's o1 reasoning model hallucinates 16% of the time. Their newer o3 model? 33%. Their newest o4-mini? 48%. Nearly half of what their most recent model tells you could be fabricated. The "smarter" models are actually getting worse at telling the truth.

Nav Toor @heynavtoor 🚨BREAKING: OpenAI published a paper proving that ChatGPT will always make things up. Not sometimes. Not until the next update. Always. They proved it with math. Even with perfect training data and unlimited computing power, AI models will still confidently tell you things that are completely false. This isn't a bug they're working on. It's baked into how these systems work at a fundamental level. And their own numbers are brutal. OpenAI's o1 reasoning model hallucinates 16% of the time. Their newer o3 model? 33%. Their newest o4-mini? 48%. Nearly half of what their most recent model tells you could be fabricated. The "smarter" models are actually getting worse at telling the truth.

Here's why it can't be fixed. Language models work by predicting the next word based on probability. When they hit something uncertain, they don't pause. They don't flag it. They guess. And they guess with complete confidence, because that's exactly what they were trained to do.

The researchers looked at the 10 biggest AI benchmarks used to measure how good these models are. 9 out of 10 give the same score for saying "I don't know" as for giving a completely wrong answer: zero points. The entire testing system literally punishes honesty and rewards guessing.

So the AI learned the optimal strategy: always guess. Never admit uncertainty. Sound confident even when you're making it up.

OpenAI's proposed fix? Have ChatGPT say "I don't know" when it's unsure. Their own math shows this would mean roughly 30% of your questions get no answer. Imagine asking ChatGPT something three times out of ten and getting "I'm not confident enough to respond." Users would leave overnight. So the fix exists, but it would kill the product.

Here's why it can't be fixed. Language models work by predicting the next word based on probability. When they hit something uncertain, they don't pause. They don't flag it. They guess. And they guess with complete confidence, because that's exactly what they were trained to do. The researchers looked at the 10 biggest AI benchmarks used to measure how good these models are. 9 out of 10 give the same score for saying "I don't know" as for giving a completely wrong answer: zero points. The entire testing system literally punishes honesty and rewards guessing. So the AI learned the optimal strategy: always guess. Never admit uncertainty. Sound confident even when you're making it up. OpenAI's proposed fix? Have ChatGPT say "I don't know" when it's unsure. Their own math shows this would mean roughly 30% of your questions get no answer. Imagine asking ChatGPT something three times out of ten and getting "I'm not confident enough to respond." Users would leave overnight. So the fix exists, but it would kill the product.

This isn't just OpenAI's problem. DeepMind and Tsinghua University independently reached the same conclusion. Three of the world's top AI labs, working separately, all agree: this is permanent.

Every time ChatGPT gives you an answer, ask yourself: is this real, or is it just a confident guess?

This isn't just OpenAI's problem. DeepMind and Tsinghua University independently reached the same conclusion. Three of the world's top AI labs, working separately, all agree: this is permanent. Every time ChatGPT gives you an answer, ask yourself: is this real, or is it just a confident guess?

Post image

⚠️Told you so moment:

OpenAI published a paper proving that ChatGPT will always make things up [...] Always.

They proved it with math.

[...] This isn't a bug they're working on. It's baked into how these systems work at a fundamental level.

More: x.com/heynavtoor/s...

07.03.2026 12:53 πŸ‘ 160 πŸ” 80 πŸ’¬ 4 πŸ“Œ 2

If you showed a modern video game to someone who had never seen a computer before, they might think the NPC's are actually conscious. I feel it's the same with people who think Claude is conscious.

08.03.2026 05:49 πŸ‘ 5 πŸ” 2 πŸ’¬ 0 πŸ“Œ 0
Abbas Amanat
IRAN
AMODERN HISTORY

Abbas Amanat IRAN AMODERN HISTORY

If you’re looking for a good β€œoh no I need to know more about Iran” book, I’m really enjoying Abbas Amanat’s β€œIran: A Modern History” thus far.

07.03.2026 14:20 πŸ‘ 135 πŸ” 33 πŸ’¬ 4 πŸ“Œ 2

He doesn’t care as long as he can win. Winning is manly, how you get there is irrelevant to them, even if cowardly.

07.03.2026 02:55 πŸ‘ 2 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
07.03.2026 02:31 πŸ‘ 558 πŸ” 65 πŸ’¬ 10 πŸ“Œ 4
Essex County Community Foundation: Pivot to Systems Philanthropy - Case - Faculty & Research - Harvard Business School

ISO gift link for HBS: www.hbs.edu/faculty/Page...

05.03.2026 01:48 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image

🎯🎯

27.02.2026 16:09 πŸ‘ 8360 πŸ” 3411 πŸ’¬ 176 πŸ“Œ 128
Preview
Foreign workers flee to Phnom Penh after mass exits from scam compounds Indonesians, Chinese, South Asians and Africans are trying to leave Cambodia

If you think about this too much you cry. Great one from Nikkei

Foreign workers flee to Phnom Penh after mass exits from scam compounds asia.nikkei.com/spotlight/so...

24.01.2026 00:48 πŸ‘ 16 πŸ” 10 πŸ’¬ 1 πŸ“Œ 2

Lol that’s the garden of eden story

12.01.2026 06:09 πŸ‘ 4 πŸ” 1 πŸ’¬ 0 πŸ“Œ 0

No one likes it.

12.01.2026 01:02 πŸ‘ 1 πŸ” 1 πŸ’¬ 0 πŸ“Œ 0

white women and queers - and people who love them - this is a good week to look GOP voters you know in the eye and ask them if they think it should be legally permissible for men to shoot you or your loved one in the head on the basis of being a β€œfucking bitch”

10.01.2026 00:08 πŸ‘ 2794 πŸ” 591 πŸ’¬ 25 πŸ“Œ 20
A 35 year old letter to the editor, written by a very ballsy 15 year old, about the Rodney King verdict.

A 35 year old letter to the editor, written by a very ballsy 15 year old, about the Rodney King verdict.

Remembering today that having your heart broken is a necessary step on the path to becoming fully human. Whichever heartbreak is your first, it’s probably critical that a state break your heart so that you can develop a political imagination. If this is your first, I’m sorry and also welcome.

08.01.2026 22:33 πŸ‘ 12181 πŸ” 2495 πŸ’¬ 85 πŸ“Œ 118
Donald Trump’s War in Venezuela (Congratulations to the Winners of the β€œFell for it Again Award”)
Donald Trump’s War in Venezuela (Congratulations to the Winners of the β€œFell for it Again Award”) YouTube video by Takesβ„’ by Jamelle Bouie

some thoughts on all of this madness from earlier in the morning youtu.be/-yUi-0vNlDA

03.01.2026 21:40 πŸ‘ 2262 πŸ” 477 πŸ’¬ 39 πŸ“Œ 34
A forlorn landscape of layered rocks in the foreground, with hills fading into the background haze. At upper top right, a small crescent moon, and a bright star.

A forlorn landscape of layered rocks in the foreground, with hills fading into the background haze. At upper top right, a small crescent moon, and a bright star.

Open up this picture fully.

Then look at the surface of Mars.

Then look up to the top right.

Spot Mars' moon Phobos high in the sky.

Then notice the bright spot beside Phobos.

That's Earth.

30.12.2025 21:30 πŸ‘ 4761 πŸ” 1873 πŸ’¬ 76 πŸ“Œ 152

It’s because LLMs cannot assess the quality of a student’s thought process and help support them to understand why certain outcomes are good / bad the way a skilled teacher can. I suppose for things that require rote memorization and linear reasoning an LLM could be adequate but it seems limited

29.12.2025 23:53 πŸ‘ 1 πŸ” 0 πŸ’¬ 2 πŸ“Œ 0

I think the Socratic method is usually about *teaching* by asking questions, where the student is trying to answer which upsets your parallel a bit

29.12.2025 11:26 πŸ‘ 4 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Preview
New study shows Alzheimer’s disease can be reversed to achieve full neurological recoveryβ€”not just prevented or slowedβ€”in animal models | CWRU Newsroom | Case Western Reserve University For more than a century, people have considered Alzheimer's disease (AD) an irreversible illness. Consequently, research has focused on preventing or ...

Another health breakthrough.

(h/t Gav)

29.12.2025 10:46 πŸ‘ 2000 πŸ” 687 πŸ’¬ 88 πŸ“Œ 89
Video thumbnail

The full spiked 60 Minutes CECOT package, clean & subtitled. 3/5

23.12.2025 01:29 πŸ‘ 3646 πŸ” 1169 πŸ’¬ 15 πŸ“Œ 24
Video thumbnail

The full spiked 60 Minutes CECOT package, clean & subtitled. 2/5

23.12.2025 01:28 πŸ‘ 4022 πŸ” 1294 πŸ’¬ 28 πŸ“Œ 44
Video thumbnail

The full spiked 60 Minutes CECOT package, clean & subtitled. 1/5

23.12.2025 01:28 πŸ‘ 35881 πŸ” 18246 πŸ’¬ 563 πŸ“Œ 1735

I'm not sure if people realize the murder strikes are taking place across a large region. It's quite staggering.
www.newsweek.com/map-us-strik...

16.12.2025 16:00 πŸ‘ 5135 πŸ” 2278 πŸ’¬ 109 πŸ“Œ 175

Reminding everyone for no particular reason that Section 230 is one of the last things standing between free speech online and Trump having control over everything you see and say on the internet

18.12.2025 22:28 πŸ‘ 3637 πŸ” 1517 πŸ’¬ 125 πŸ“Œ 91

Perfect

17.12.2025 09:20 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0