The Gemini 2 Embedding model is natively multimodal and now in preview.
What's new? How to get started? Migration considerations?
Find out more in my latest post:
medium.com/google-cloud...
The Gemini 2 Embedding model is natively multimodal and now in preview.
What's new? How to get started? Migration considerations?
Find out more in my latest post:
medium.com/google-cloud...
LLM Checker recommends the best model for your hardware.
It scans 200+ Ollama models for optimal quality, speed, fit, and context.
Here it selects Gemma 3 270M to run my coding task.
πΈοΈ Webhook triggers: Start research with an HTTP request and check the results later (Cloud Run Service)
π¦ Batch tasks: Fan out research topics in parallel and exit when finished (Cloud Run Job)
βΎοΈ Continuous dispatcher: Pull tasks from a queue at up to a 40% compute discount (Cloud Run Worker Pool)
Need to automate deep research?
Use the Gemini Interactions API with the right async pattern. π§΅
Big news for fast and cost-efficient AI! Gemini 3.1 Flash-Lite is here:
β‘οΈ 2.5X faster Time to First Token
π $0.25 per 1M input tokens
π§ Thinking levels for control over reasoning
blog.google/innovation-a...
π Blog: surgehq.ai/blog/hemingw...
π
Leaderboard: surgehq.ai/leaderboards...
Did you know there's a benchmark for writing quality? Hemingway-bench from Surge AI goes beyond "vibes" and robotic checks, to measure coherent and relatable storytelling.
My latest blog post shows how to make your TypeScript GenAI app more robust: medium.com/google-cloud...
3 tips to build resilience into your GenAI application:
β³ Use exponential backoff for API calls, ideally directly in the SDK
π Apply a circuit breaker during instability to prevent cascading failures
πΌοΈ Use "skeleton components" for loading to improve perceived performance
Watch out for 4 pitfalls when you create infographics.
Nano Banana 2 π can help you out!
Check out the 3-min walkthrough of the full lifecycle: www.youtube.com/shorts/UVcMo...
Are you using the skills CLI?
My Gemini CLI and Google Antigravity skills are well-organized, thanks to this great tool from Vercel.
My latest tutorial shows how to quickly deploy
Qwen3.5-35B-A3B on Cloud Run: medium.com/google-cloud...
Want to try out two awesome pieces of tech?
Cloud Run now supports NVIDIA RTX 6000 Pro GPUs with 96GB VRAM and scale-to-zero inference.
And the new Qwen 3.5 multimodal models are achieving outstanding benchmark results.
Read the full article, with a link to the skill: medium.com/google-cloud...
I automated my video production process with an agent skill.
The skill helps to summarize my transcript, read timestamps from a caption file, and validate its own voice and hashtags.
Agent skills are a big time-saver, and not just for coding!
www.youtube.com/shorts/2D1CS...
Reasoning has been supercharged in Gemini 3.1 Pro, and your agents will benefit.
Industry-leading abstract thinking translates into strong performance on agentic workflows and MCP tool-calling.
Good addition, thanks!!
Resources β‘
Watch the video on YouTube:
www.youtube.com/shorts/j70Yz...
And read the full article on Medium:
medium.com/google-cloud...
Stage 4: AI Catalog π
Probing 5 protocols before "hello" is too chatty.
The Catalog provides a single entry point (ai-catalog.json).
One fetch to discover all services and their Unified AI Cards.
PR #4: github.com/Agent-Card/a...
Stage 3: MCP Server Cards π οΈ
Tool use is exploding.
SEP-2127 proposes "Server Cards" for MCP.
Instead of hardcoding tool URLs, agents will discover tools, transports, and auth requirements automatically.
Proposal: github.com/modelcontext...
Stage 2: UCP (Commerce) π
The Universal Commerce Protocol lets shopping agents talk to any merchant.
It bundles payments (AP2) so your agent can actually checkout without a custom integration for every site.
Details: ucp.dev
Stage 1: A2A Agent Cards πͺͺ
In 2025, agent-card.json became the first AI-specific entry in that registry.
Itβs a JSON contract that tells other agents:
- Here is my name
- Here is what I can do
- Here is how to auth
More info: a2a-protocol.org
The Past: Web Standards π
The web solved discovery decades ago with /.well-known/.
Think robots.txt or openid-configuration.
Put a machine-readable file at a predictable URL.
Simple. Effective.
What are the trends shaping agent discoverability and interoperability? π§΅
A2A and UCP have laid the groundwork for agent communication and commerce.
Let's walk through new proposals being discussed in the AI community, and how they could help.
The connection to strawberry makes sense. Itβs just the latest gap that bubbled up.