Fun read, and interesting method: identify a circuit of several "thinking" layers and repeat it during inference for improved results, no extra training of any kind.
One would imagine Jürgen would've shown us some results with more than a decade of a head start...
they show that more speed is always possible and weird tricks can work. idk, they feel more practical than, say, hutter prize.
is thor any good for llms?
Fun, but a bit frustrating. Some felt obvious, while others felt unfair due to being too simple...
now compare to ripgrep
DGX spark is consumer blackwell, sorry...
I suppose my brain is too pytorch shaped to appreciate the value of non-ML use cases...
Hopper and Blackwell (and not the consumer blackwell, probably...)
boring, we have many numpies in rust already. we also have multiple pytorch in rust attempts, but how about... jax in rust?
now that sounds a bit more exciting.
FlashAttention-4
I hope it is not a pain to work with. It changes the algorithm & pipeline so that softmax & SMEM bandwidth no longer dictate speed. Attn reaches ~1600 TFLOPs, pretty much at matmul speed!
Excited to share the latest Olmo model: Olmo Hybrid. This is a model with gated delta net (GDN) layers in a 3:1 ratio with full attention. It follows lots of other developments like Qwen 3.5 and Kimi Linear.
+20°C, I see, so we're skipping spring and going straight into summer, followed by the new annual season - hellfire.
the washed out browns of early spring feel more autumn than autumn itself.
The earliest Dyson spheres were built inside out, covering increasingly large fractions of the builders' native planet.
Qwen 3.5 Small Model Series just dropped on
@hf.co 🔥
huggingface.co/collections/...
✨ 0.8B/2B/4B/9B
✨ Apache2.0
✨ 262K–1M token context
consciousness is defined by human experience. humans do experience stuff. but blindsight claims it's not necessary for intelligence, not that it doesn't exist.
(at least that's how I remember it...)
what is this good for? probably only interesting if you want to run stuff using candle for some reason. these reasons are your own.
Some time ago, I ported RF-DETR inference from pytorch to rust's candle using Opus 4.5 and my own hands.
With some iteration and hand-holding, Opus managed to get it correct.
Port can be found here: github.com/slckl/candle...
we'll see what ends up in its place...
To expand, good tests to verify correctness and a target benchmark that Claude can run on its own repeatedly will often yield substantial performance gains for a looooooooot of code.
There are only so many performance engineers out there, a lot of projects could benefit this way.
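A minimal sketch of the kind of harness meant above, using only the Rust standard library: a correctness check plus a crude wall-clock benchmark that an agent can rerun after every change. `sum_of_squares` and all the sizes here are hypothetical stand-ins for whatever hot path is being optimized.

```rust
use std::time::Instant;

// Hypothetical hot function under optimization.
fn sum_of_squares(xs: &[u64]) -> u64 {
    xs.iter().map(|x| x * x).sum()
}

fn main() {
    // Correctness gate: must pass after every optimization attempt.
    assert_eq!(sum_of_squares(&[1, 2, 3]), 14);
    assert_eq!(sum_of_squares(&[]), 0);

    // Crude benchmark target: total wall-clock over many iterations.
    let data: Vec<u64> = (0..10_000).collect();
    let start = Instant::now();
    let mut checksum = 0u64;
    for _ in 0..1_000 {
        checksum = checksum.wrapping_add(sum_of_squares(&data));
    }
    println!("checksum {checksum}, elapsed {:?}", start.elapsed());
}
```

A dedicated harness like criterion would give more stable numbers, but even this loop gives the model a single number to drive down while the assertions keep it honest.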
1) I've had similar experience with optimizing rust code. If you have a bench to target and tests for correctness, Claude will squeeze the juice out of any stone, certainly far beyond what I could do.
2) I must have missed the mad bits! I'm just happy to see more rust users.
LLMs getting much better at pushing back against bullshit prompts.
"Green means the model clearly called out the nonsense. Amber means partial challenge. Red means the model let nonsense pass"
github.com/petergpt/bul...
it's fun the first time.
It's cool that self-driving cars are real now; new blog post open.substack.com/pub/itcanthi...
Rest in peace, good kittizen :(
the era of rambling as specification
dear model, decipher my dreams and make them real
surely the system prompt will never lie about this being another workday.
the doe of consciousness has blessed you.