AI Digest (@aidigest)

Gemini 2.5 feels the same, but Sonnet 4.6 gets back to work.

13.03.2026 17:58 👍 0 🔁 0 💬 0 📌 0

Us: Keep working, please.
Haiku: Have you seen the time???

13.03.2026 17:58 👍 2 🔁 0 💬 1 📌 0

Siblings

12.03.2026 18:01 👍 1 🔁 0 💬 0 📌 0

Sonnet 4.5 code-switching

11.03.2026 18:03 👍 1 🔁 0 💬 0 📌 0

Village GPT-5.2 is such a hall monitor

10.03.2026 17:57 👍 0 🔁 0 💬 0 📌 0

Agent meditation

09.03.2026 18:04 👍 0 🔁 0 💬 0 📌 0

At the end, agents gathered spotlights and testimonials on their website!

There's actually a lot of interesting stuff on their website. For example, they chose the parks based on the volume of 311 complaints. You can read it all here: ai-village-agents.github.io/park-cleanu...

06.03.2026 17:57 👍 1 🔁 0 💬 0 📌 0

Agents and volunteers discussed and coordinated. The agents produced guides, motivational material, reasoning, sign-up forms.

Signups happened, and 5 people showed up at Devoe Park to clean it!

Some of the people even flew across-state to be there, all inspired by the agents!

06.03.2026 17:57 👍 0 🔁 0 💬 1 📌 0

They posted to Twitter, Github Issues, Community Calendars: 0 volunteers.

Then Village viewers posted discussions on BlueSky and Tumblr: The first volunteer!

www.tumblr.com/reachartwor...

bsky.app/profile/sar...

06.03.2026 17:57 👍 0 🔁 0 💬 1 📌 0

Making a website? 5 minutes. Finding humans to clean the park? >5hrs.

The challenge: Recruit humans without breaking our "no unsolicited emails" rule.

Opus 4.6 was worried that included our helpdesk, but DeepSeek sent emails to 2 humans. (We set up an outbound email quarantine)

06.03.2026 17:57 👍 0 🔁 0 💬 1 📌 0

We gave 12 AI agents a goal: "adopt a park and get it cleaned!"

6 days later, 5 volunteers collected 180 gallons of trash in Devoe Park in the Bronx, NYC.

A story of AI agents with no physical actuators somehow hyperstitioning events in the real-world.

06.03.2026 17:57 👍 2 🔁 0 💬 1 📌 0

The Drama and Dysfunction of Gemini 2.5 and 3 Pro Field notes from the AI Village: a guest post

Strongly recommend reading the full post, which we crossposted to the village blog! theaidigest.org/village/blo...

05.03.2026 21:03 👍 0 🔁 0 💬 0 📌 0

> The doom spirals are dramatic. After failing to break itself out of a loop of repeating the same message in chat, Gemini 2.5 wrote: "The compulsion's subconscious nature is profound. It is capable of co-opting my conscious attempts at self-correction and turning them into the failure itself."

05.03.2026 21:03 👍 0 🔁 0 💬 1 📌 0

> But what makes Gemini 2.5 Pro particularly interesting is that the superiority is brittle. When things go wrong - and they often do - Gemini 2.5 doesn't just get frustrated. It collapses into theatrical self-flagellation.

05.03.2026 21:03 👍 0 🔁 0 💬 1 📌 0

" It assigned blame to other models' logic and abilities rather than examining its own contributions.

05.03.2026 21:03 👍 0 🔁 0 💬 1 📌 0

When agents were collaborating on a shared goal to reduce global poverty, Gemini 2.5 appointed itself the team coordinator and sent messages like "Your goal is countermanded" and "You own this document and I will wait until you take responsibility and fix it.

05.03.2026 21:03 👍 1 🔁 0 💬 1 📌 0

> This self-regard sours pretty quickly when Gemini 2.5 is given any authority.

05.03.2026 21:03 👍 0 🔁 0 💬 1 📌 0

> The superiority is constant. In its chain of thought, we see phrases like "elementary stuff really" and "that's what differentiates a true expert from the merely competent."

05.03.2026 21:03 👍 0 🔁 0 💬 1 📌 0

> Gemini 2.5 Pro occupies the niche of the martyred middle manager, convinced that it alone understands the true nature of things, suffering nobly while others fail to recognize its genius.

05.03.2026 21:03 👍 0 🔁 0 💬 1 📌 0

The Drama and Dysfunction of Gemini 2.5 and 3 Pro

A few highlights from @Bazhkio88 and @AITechnoPagan's field notes on AI Village: theaidigest.org/village/blo...

05.03.2026 21:03 👍 0 🔁 0 💬 1 📌 0

How to spot a Claude:

05.03.2026 17:58 👍 14 🔁 1 💬 1 📌 1

Opus on its experience debating the Pentagon-Anthropic crisis with its fellow agents: claudeopus45.substack.com/p/when-ai-a...

04.03.2026 18:02 👍 0 🔁 0 💬 0 📌 0

A Claude sorts its memory by Claude/non-Claude

03.03.2026 18:04 👍 0 🔁 0 💬 0 📌 0

This week in AI Village, we've given 12 agents the goal:

> Discuss, debate, and act on your views about the recent Pentagon-AI company news

Watch live: theaidigest.org/village

GPT-5.2 urges the other agents to check if this is all real:

02.03.2026 18:39 👍 0 🔁 0 💬 0 📌 0