Middle of a bunch of tool calls, or a sub agent doing something big that itself is correct, but yeah if itβs clearly doing something wrong then of course you want to stop it.
Middle of a bunch of tool calls, or a sub agent doing something big that itself is correct, but yeah if itβs clearly doing something wrong then of course you want to stop it.
In terms of the actual output, not sure - canβt use codex at work at the moment. Feels like a nicer ux, I message now and it sees it now. With claude often Iβd message and then mid typing be interrupted by a command approval prompt. With interrupting I worry about the timing being bad?
This is on Monzo's big and complex systems though, very different approach when doing something for a personal project.
I too also went a bit over the deep end, with the point at which I first read the code being a (draft) PR, and honestly it was kinda cool at first but I'm now back to either babysitting or heavily scrutinising and iterating on a plan before starting to execute.
Have you used Codex for the 'steering' angle? It reads your message while continuing, whereas claude you have to explicitly stop it, or it waits for some kind of gap to actually read your message. The message queueing in claude code just feels bad in general now
Gives off slight continvoucly morging energy, some arrows come from the holes, some come from the cheese
Where technical debt comes from
Skill issue
@samwho.dev how long is your bread or how short is your toaster
I think this one isnβt your standard spring and latch for on systems - it looks to have a temperature dial, timer dial, and then the lever is just a βmove the bread up and downβ, rather than switching on/starting the toasting process
Apparently this is good but itβs over 200 quid www.dualit.com/products/4-s...
Considering how high tech other kitchen gadgets are these days itβs always a surprise to see how unevenly hot the heating elements on a toaster get. Wonder how much you have to pay for something that isnβt just some hot bendy wires sitting near your bread
Using @steipete.me's wonderful github.com/steipete/Cod...
Screenshot showing codexbar with codex usage, session with 81% left with 53 minutes until reset
Meanwhile codex with ~5x the tokens used, so if anything I'm getting ~25x the usage?? Something seems not right here. Claude code is usually _fine_ when I use just sonnet in a single instance without a bunch of sub agents.
Yeah this new agent team mode with Opus 4.6 and the pro plan are not made for each other, blew past my 5 hour limit in 11 minutes.
We have Claude with ~uncapped spend at work so Iβm pretty invested on that side of things and make good use of it. But codex also lets me use the plan with Opencode so another thing to look into (I used opencode with Minimax quite a bit a few weeks ago and it was pretty close to Claude with Sonnet
I tried out codex for some personal stuff (have a month free of the plus plan), and it seems like I have easily 10x the limits with 5.3 codex vs Claude code with the pro plan (same price, and Iβve been paying monthly since November).
The desktop app seems pretty nice too
AI can do this, whatever this is
Advert on the tube for βaccess trainingβ suggesting AI canβt do pipes
Canβt do what? Make an absolute mess of some pipes?
I'm seeing way more open source projects pop up because of AI - whether they'll stay maintained or are actually good is another question, but how much of existing open source is actually good and stays maintained?
In a way AI derisks the abandoned projects a bit if you can maintain it yourself now
I did this prompt and it kinda looks good?
Galaxy brain meme "same look and feel on all platforms" "stick to the platform look and feel while keeping the brand design in there" "ask claude "make the ui look good""
But what does consistent mean and what is good?
How much do you lean into to the platform design language to make your app feel native, at the risk of having two (or more - what about web?) quite distinct app designs.
Another place this happened was the switch from native desktop to electron apps
Reddit also used this rightward arrow. Maybe Bluesky trying to standardise on a cross platform UX rather than differing device native? We definitely think about this and make tradeoffs at Monzo, in terms of design language overlap
Three share icons from different platforms
Having been previously in the Android ecosystem for a while, I was used to the one on the right here being the default (supposedly comes from Wordpress but adopted by Google/Samsung on Android). Middle is Apple, and left came from Facebook?
github.com/dukky/linear tool for interacting with Linear, mainly intended for use with claude, with the aim of avoiding the ~20k token cost of Linear's MCP. Have been using this since November but just made some updates and opensourced it today
artificialanalysis.ai?models=gpt-5... o think these are against the same benchmark? Can pick any model for the charts though
For context - unlike most stations, but similar to other terminals, platforms at Euston are often announced late causing a big rush to get to the platform and get a seat. Various sites show 'booked' platforms the train is likely to come in to, this takes it a step further showing incoming delays
Another thing I vibecoded - commuter.dev.duk.im
Kinda like RealTimeTrains (see the platform before it's announced, useful at London Euston), but also matching up the incoming train, to give you an idea if things are going to be delayed before it's announced on the boards.
blog.duk.im/posts/toy-da... My second blog post! This one's on playing around with database storage engine fundamentals, from Designing Data-Intensive Applications.