Trending
sankalp (dejavucoder)'s Avatar

sankalp (dejavucoder)

@dejavucoder

into applied ai + product engg interested in all things ai and distributed systems

109
Followers
54
Following
10
Posts
26.11.2024
Joined
Posts Following

Latest posts by sankalp (dejavucoder) @dejavucoder

Preview
Alex L. Zhang | A Meticulous Guide to Advances in Deep Learning Efficiency over the Years A very long and thorough guide how deep learning algorithms, hardware, libraries, compilers, and more have become more efficient.

bookmarking here to read this soon
alexzhang13.github.io/blog/2024/ef...

09.01.2025 20:21 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Preview
The state of post-training in 2025 A re-record of my NeurIPS tutorial on language modeling (plus some added content).

The state of post-training in 2025: a tutorial on modern post-training
A re-record of my NeurIPS tutorial on language modeling (plus some added content on the high level state of things)
Blog + extra context: https://buff.ly/424VvLm
YouTube: https://buff.ly/40808l5
Slides: https://buff.ly/404jGa9

08.01.2025 15:38 ๐Ÿ‘ 80 ๐Ÿ” 17 ๐Ÿ’ฌ 4 ๐Ÿ“Œ 0
Preview
The Evolution of AI-assisted coding features and developer interaction patterns Yes, I agree that's a fancy title. There have been several developments over the last 7 years in the AI-assisted coding arena. We have gone from simple autoc...

new blog post

Evolution of AI-assited coding features and developer interaction patterns. I go through the history of progression of ai-assisted coding features, talk about how we interact with them and a Gears analogy control vs speed tradeoff

sankalp.bearblog.dev/evolution-of...

21.12.2024 19:54 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 1
Post image

First slide deck for NeurIPS is done -- an overview of how I view post-training for applications.
A higher level summary on the key decisions along the way of scoping a problem, choosing a base model, optimization algorithm, etc. (+some thoughts on OpenAI's RL Finetuning).

https://buff.ly/3ZpY5IR

09.12.2024 19:04 ๐Ÿ‘ 34 ๐Ÿ” 4 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0

agent orchestrator more like agent pimp

04.12.2024 17:09 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

will check this out for synthetic data creation and evals

04.12.2024 17:08 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Preview
OpenAI's o1 using "search" was a PSYOP How to understand OpenAI's o1 models as really just one wacky, wonderful, long chain of thought

New post! OpenAI's o1 using "search" was a PSYOP.
How to understand OpenAI's o1 models as really just one wacky, wonderful, long chain of thought.

A fun one trying to communicate intuitions for what large scale RL training does to LLMs. Much more to explore here in 2025!

04.12.2024 15:33 ๐Ÿ‘ 38 ๐Ÿ” 6 ๐Ÿ’ฌ 3 ๐Ÿ“Œ 0

Wow, this is such a useful resource of industry LLM applications! And filtering via search/tags is so responsive. I was thinking of compiling something like this over the holidays (ala applied-ml) but thanks to @strickvl.bsky.social I can spend the time reading instead โ™ฅ๏ธ

zenml.io/llmops-datab...

03.12.2024 01:54 ๐Ÿ‘ 50 ๐Ÿ” 4 ๐Ÿ’ฌ 3 ๐Ÿ“Œ 0

lmao

28.11.2024 09:47 ๐Ÿ‘ 2 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

this is kinda nice

26.11.2024 14:54 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

same haha. planning to spend some time on bluesky to check ai discussions and meet mutuals who are more active here

26.11.2024 14:49 ๐Ÿ‘ 2 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

hello sir

26.11.2024 14:45 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0
Post image

we are planning to read this blog blog.dottxt.co

26.11.2024 14:43 ๐Ÿ‘ 2 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

hello world

26.11.2024 14:41 ๐Ÿ‘ 2 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0