A.V.'s Avatar

A.V.

@slckl

Trying to make Rust x AI a reality. Python survivor, book lover and weird music enjoyer.

413
Followers
286
Following
337
Posts
08.02.2024
Joined
Posts Following

Latest posts by A.V. @slckl

Fun read, and interesting method: identify a circuit of several "thinking" layers and repeat it during inference for improved results, no extra training of any kind.

10.03.2026 22:11 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

One would imagine Jurgen would've shown us some results with more than a decade of a headstart...

10.03.2026 20:02 πŸ‘ 6 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

they show that more speed is always possible and weird tricks can work. idk, they feel more practical than, say, hutter prize.

09.03.2026 21:55 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

is thor any good for llms?

08.03.2026 22:06 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

Fun, but a bit frustrating. Some felt obvious, while others felt unfair due to being too simple...

08.03.2026 10:55 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

now compare to ripgrep

06.03.2026 06:25 πŸ‘ 4 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

DGX spark is consumer blackwell, sorry...

06.03.2026 06:06 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

I suppose my brain is too pytorch shaped to appreciate the value of non-ML use cases...

05.03.2026 22:12 πŸ‘ 3 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Hopper and Blackwell (and not the consumer blackwell, probably...)

05.03.2026 22:08 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

boring, we have many numpies in rust already. we also have multiple pytorch in rust attempts, but how about... jax in rust?

now that sounds a bit more exciting.

05.03.2026 22:05 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image

FlashAttention-4

I hope it is not pain to work with. It changes the algorithm & pipeline so that softmax & SMEM bandwidth no longer dictate speed. Attn reaches ~1600 TFLOPs, pretty much at matmul speed!

05.03.2026 18:47 πŸ‘ 27 πŸ” 4 πŸ’¬ 2 πŸ“Œ 3
Post image

Excited to share the latest Olmo model: Olmo Hybrid. This is a model with gated delta net (GDN) layers in a 3:1 ratio with full attention. It follows lots of other developments like Qwen 3.5 and Kimi Linear.

05.03.2026 16:26 πŸ‘ 67 πŸ” 8 πŸ’¬ 6 πŸ“Œ 4

+20Β°C, I see, so we're skipping spring and going straight into summer, followed by the new annual season - hellfire.

04.03.2026 13:50 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

the washed out browns of early spring feel more autumn than autumn itself.

04.03.2026 07:41 πŸ‘ 2 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

Earliest dyson spheres were built inside out covering increasingly large fractions of the builders' native planet.

02.03.2026 19:01 πŸ‘ 16 πŸ” 2 πŸ’¬ 0 πŸ“Œ 0
Post image

Qwen 3.5 Small Model Series just dropped on
@hf.co πŸ”₯

huggingface.co/collections/...

✨ 0.8B/2B/4B/9B
✨ Apache2.0
✨ 262Kβ†’1M token context

02.03.2026 13:31 πŸ‘ 85 πŸ” 17 πŸ’¬ 1 πŸ“Œ 8

consciousness is defined by human experience. humans do experience stuff. but blindsight claims it's not neccessary for intelligence, not that it doesn't exist.

(at least that's how I remember it...)

01.03.2026 18:55 πŸ‘ 2 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

what is this good for? probably only interesting if you want to run stuff using candle for some reason. these reasons are your own.

01.03.2026 11:50 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Preview
GitHub - slckl/candle_rf_detr Contribute to slckl/candle_rf_detr development by creating an account on GitHub.

Some time ago, I ported RF-DETR inference from pytorch to rust's candle using Opus 4.5 and my own hands.
With some iteration and hand-holding, Opus managed to get it correct.

Port can be found here: github.com/slckl/candle...

01.03.2026 11:50 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

redzΔ“s, kas bΕ«s vietā...

01.03.2026 08:28 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

To expand, good tests to verify correctness and a target benchmark that Claude can run on its own repeatedly will often yield substantial performance gains for a looooooooot of code.
There are only so many performance engineers out there, a lot of projects could benefit this way.

28.02.2026 19:40 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

1) I've had similar experience with optimizing rust code. If you have a bench to target and tests for correctness, Claude will squeeze the juice out of any stone, certainly far beyond what I could do.
2) I must have missed the mad bits! I'm just happy to see more rust users.

28.02.2026 19:35 πŸ‘ 4 πŸ” 0 πŸ’¬ 1 πŸ“Œ 1
Post image

LLMs getting much better at pushing back against bullshit prompts.

β€œGreen means the model clearly called out the nonsense. Amber means partial challenge. Red means the model let nonsense pass”

github.com/petergpt/bul...

24.02.2026 21:43 πŸ‘ 76 πŸ” 17 πŸ’¬ 7 πŸ“Œ 5

it's fun the first time.

25.02.2026 04:32 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image

Its cool that self driving cars are real now; new blog post open.substack.com/pub/itcanthi...

24.02.2026 14:52 πŸ‘ 18 πŸ” 3 πŸ’¬ 0 πŸ“Œ 1

Rest in peace, good kittizen :(

24.02.2026 06:25 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

the era of rambling as specification

23.02.2026 08:49 πŸ‘ 2 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

dear model, decipher my dreams and make them real

23.02.2026 08:48 πŸ‘ 4 πŸ” 0 πŸ’¬ 0 πŸ“Œ 1

surely the system prompt will never lie about this being another workday.

22.02.2026 19:34 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

the doe of consciousness has blessed you.

20.02.2026 18:38 πŸ‘ 4 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0