Oleksii Kuchaiev's Avatar

Oleksii Kuchaiev

@kuchaev

AI model alignment @ NVIDIA

83
Followers
46
Following
8
Posts
14.11.2024
Joined
Posts Following

Latest posts by Oleksii Kuchaiev @kuchaev

I’ll be in Singapore attending ICLR2025. Looking forward to chatting in person about model post-training, alignment and reasoning! ✈️🇸🇬

21.04.2025 22:45 👍 2 🔁 0 💬 0 📌 0
Preview
Nemotron-H - a nvidia Collection Mamba-Transformer hybrid models

New base models from NVIDIA - Nemotron-H: mamba-transformer hybrids are now on @hf.co hub huggingface.co/collections/...

14.04.2025 18:46 👍 1 🔁 0 💬 0 📌 0
Post image

New paper from our team. An inference-time scaling approach which can boost non-math benchmarks such as Arena-Hard of existing models. We get Arena-Hard of 92.7 for 70B model. As of 5 Mar 2025, surpassing o1-preview-2024-09- 12 (90.4) and DS-R1 (92.3). arxiv.org/pdf/2503.04378

07.03.2025 18:42 👍 2 🔁 0 💬 0 📌 0
Preview
GTC AI Conference 2025 Experience In Person and Online.

My favorite AI conference, GTC, is coming back to San Jose, California on March 17-21! Join us and thousands of other developers and innovators. This link gives you 25% off your conference pass www.nvidia.com/gtc/?ncid=GT...

04.03.2025 20:50 👍 1 🔁 0 💬 0 📌 0
Post image

Our team put together a unified mathematical framework to analyze popular model alignment algorithms. “Reward-aware Preference Optimization: A Unified Mathematical Framework
for Model Alignment” arxiv.org/pdf/2502.00203.

04.02.2025 17:25 👍 3 🔁 0 💬 0 📌 0

pretty sure Apple’s Tim Cook pledged publicly (on twitter) that they’ll donate to LA fires support and recovery efforts

15.01.2025 16:40 👍 3 🔁 0 💬 1 📌 0

“winer takes all” is also the most dangerous scenario from safety perspective. open source ecosystem is a great antidote to monopoly or duopoly scenarios.

29.11.2024 22:37 👍 6 🔁 0 💬 0 📌 0

this year timing and conference both were great. thank you!

17.11.2024 21:13 👍 2 🔁 0 💬 0 📌 0