sankalp (dejavucoder) (@dejavucoder)

Alex L. Zhang | A Meticulous Guide to Advances in Deep Learning Efficiency over the Years A very long and thorough guide how deep learning algorithms, hardware, libraries, compilers, and more have become more efficient.

bookmarking here to read this soon
alexzhang13.github.io/blog/2024/ef...

09.01.2025 20:21 👍 0 🔁 0 💬 0 📌 0

The state of post-training in 2025 A re-record of my NeurIPS tutorial on language modeling (plus some added content).

The state of post-training in 2025: a tutorial on modern post-training
A re-record of my NeurIPS tutorial on language modeling (plus some added content on the high level state of things)
Blog + extra context: https://buff.ly/424VvLm
YouTube: https://buff.ly/40808l5
Slides: https://buff.ly/404jGa9

08.01.2025 15:38 👍 80 🔁 17 💬 4 📌 0

The Evolution of AI-assisted coding features and developer interaction patterns Yes, I agree that's a fancy title. There have been several developments over the last 7 years in the AI-assisted coding arena. We have gone from simple autoc...

new blog post

Evolution of AI-assited coding features and developer interaction patterns. I go through the history of progression of ai-assisted coding features, talk about how we interact with them and a Gears analogy control vs speed tradeoff

sankalp.bearblog.dev/evolution-of...

21.12.2024 19:54 👍 1 🔁 0 💬 0 📌 1

First slide deck for NeurIPS is done -- an overview of how I view post-training for applications.
A higher level summary on the key decisions along the way of scoping a problem, choosing a base model, optimization algorithm, etc. (+some thoughts on OpenAI's RL Finetuning).

https://buff.ly/3ZpY5IR

09.12.2024 19:04 👍 34 🔁 4 💬 1 📌 0

agent orchestrator more like agent pimp

04.12.2024 17:09 👍 0 🔁 0 💬 0 📌 0

will check this out for synthetic data creation and evals

04.12.2024 17:08 👍 0 🔁 0 💬 0 📌 0

OpenAI's o1 using "search" was a PSYOP How to understand OpenAI's o1 models as really just one wacky, wonderful, long chain of thought

New post! OpenAI's o1 using "search" was a PSYOP.
How to understand OpenAI's o1 models as really just one wacky, wonderful, long chain of thought.

A fun one trying to communicate intuitions for what large scale RL training does to LLMs. Much more to explore here in 2025!

04.12.2024 15:33 👍 38 🔁 6 💬 3 📌 0

Wow, this is such a useful resource of industry LLM applications! And filtering via search/tags is so responsive. I was thinking of compiling something like this over the holidays (ala applied-ml) but thanks to @strickvl.bsky.social I can spend the time reading instead ♥️

zenml.io/llmops-datab...

03.12.2024 01:54 👍 50 🔁 4 💬 3 📌 0

lmao

28.11.2024 09:47 👍 2 🔁 0 💬 0 📌 0

this is kinda nice

26.11.2024 14:54 👍 1 🔁 0 💬 0 📌 0

same haha. planning to spend some time on bluesky to check ai discussions and meet mutuals who are more active here

26.11.2024 14:49 👍 2 🔁 0 💬 0 📌 0

hello sir

26.11.2024 14:45 👍 1 🔁 0 💬 1 📌 0

we are planning to read this blog blog.dottxt.co

26.11.2024 14:43 👍 2 🔁 0 💬 0 📌 0

hello world

26.11.2024 14:41 👍 2 🔁 0 💬 1 📌 0

sankalp (dejavucoder)

Latest posts by sankalp (dejavucoder) @dejavucoder