Genuinely curious how it works, like ChatGPT responses are not instant (and I assume it takes some time to read the response and try to understand it if the question is complex)
Do people just sit in silence for like a couple of minutes, or how does it work?
For the sake of my mental health I will leave social media until the Danish election season is finished
Had an experience reading news/election campaigns while being an immigrant in Sweden during election season and it was not fun (to put it mildly)
How Europeans view the average American city:
True, I have actually missed the exact point when labs stopped providing logits because of "distillation anxiety", sometime in early 2024?
Oh yeah, somehow forgot about LLM-as-judge even though it was explicitly mentioned
And I agree that you can't really call it distillation in the sense most people imagine it
Also realistically
Chinese labs most likely aren't directly training on Claude's outputs (because neither 150k nor even 13 million examples is enough for something like frontier-level performance lol)
My prediction: most likely they're using Opus outputs as seeds for synthetic data
Ok now I want to distill Opus just out of spite
Ah, didn't know that
Could it be because Z.AI has IPO'd and could take more legal action against accusations?
Nah they should stop pretending this is some "national security" issue
They have also cut off access to xAI and OpenAI for their coding tools. It is pretty obvious that Anthropic is afraid of the competition at this point
techcrunch.com/2025/08/02/a...
And like with computer vision models, you often end up just randomly guessing towards the end just to get a better score
Be honest, you like it because it is happening in Malmö this year
I think your students will definitely appreciate it ;) tbh I see it becoming increasingly important both in research and in industry
Thank you for the link!
Yeah a special course might be interesting, also I still have this annoying idea of adding VLM benchmarks to EuroEval
But I am afraid that has to wait at least till summer because I am totally busy with current studies/research projects
I must admit that at this point I would strongly benefit from a university course that focuses just on model evaluation (both CV and LLMs)
Like I am taking many courses on how to train stuff, but imho it is just as important to be able to evaluate what you have trained
Ahahaha that's a good one
Sweden announced a new "AI strategy"
I think it is quite cool that more and more Nordic countries are interested in training open Language Models
news.cision.com/knut-och-ali...
I actually messed up and downloaded a transparent .png but decided to leave it like that because it looks sick 🔥
I think we are slowly moving towards reinventing backpropagation from first principles
I mean I think it won't be a problem, considering the target model in speculative decoding can usually just reject tokens; the issue is that it might hurt the speedup
Wow, that actually might work pretty well in cases where the language/framework is fixed
arxiv.org/abs/2505.21594
Well quick Googling says that we can
I wonder about doing speculative decoding this way:
Take a smaller/distilled/heavily quantized model and run it locally, then use it to generate "proposal" tokens for a larger target model that runs in the cloud
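A minimal greedy-verification sketch of that idea, just to make the accept/reject mechanics concrete. `draft_propose` and `target_next` are hypothetical stand-ins for the local quantized draft model and the cloud target model; a real implementation would compare token probabilities rather than greedy picks:

```python
def draft_propose(prefix, k, vocab):
    # Toy stand-in for the local draft model: deterministically picks
    # the next k tokens so the example is reproducible.
    return [vocab[(len(prefix) + i) % len(vocab)] for i in range(k)]

def target_verify(prefix, proposed, target_next):
    # Toy stand-in for the cloud target model's verification step:
    # accept each proposed token that matches the target's own greedy
    # choice; on the first mismatch, substitute the target's token
    # and stop. Output quality matches the target; speedup comes from
    # how long the accepted prefix is.
    accepted = []
    for tok in proposed:
        expected = target_next(prefix + accepted)
        if tok == expected:
            accepted.append(tok)       # proposal matches: keep it
        else:
            accepted.append(expected)  # mismatch: take target's token, stop
            break
    return accepted

vocab = ["a", "b", "c"]

def agreeing_target(prefix):
    return vocab[len(prefix) % len(vocab)]  # always agrees with the draft

print(target_verify([], draft_propose([], 3, vocab), agreeing_target))
```

If the draft and target always agree, all 3 proposed tokens are accepted in one verification round; a target that disagrees on the first token would yield just its own token, which is why a weak draft erodes the speedup.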
Thanks for the reminder
Finally joined IDA students because of this
When you have stopped being GPU poor and realized there is a whole other PyTorch to learn
It would be nice to see a "cost" axis, I think this might be the actual moat of open weights/open source models
Also I wanted to thank @dorialexander.bsky.social for the inspiration to work on synthetic data, which will be the focus of my work
I am very grateful to @kennethenevoldsen.bsky.social for the support during the hiring process (and of course mentorship outside of it)
Life update:
In March, I'll be starting as a Student Developer on the Danish Foundation Models project at Aarhus University
I am excited to work with amazing people and contribute to Danish open source 🇩🇰