Trending
Vlad Bogolin's Avatar

Vlad Bogolin

@vladbogo

AI/ML Engineer & Researcher | Large Language Models (LLMs)

202
Followers
13
Following
82
Posts
15.11.2024
Joined
Posts Following

Latest posts by Vlad Bogolin @vladbogo

Mantic launch blog: A new kind of foresight

We’ve just come out of stealth and raised $4m pre-seed. If you’re curious about the future of forecasting—or want to help shape it—we’re hiring.

👉 Learn more in our launch blog: mantic.com/launch
👉 Join us: www.mantic.com#careers

27.08.2025 19:52 👍 1 🔁 1 💬 0 📌 0
Post image

We’re inspired by superforecasters—people proven to be exceptionally good at predicting the future.

Our goal: go further.
Deliver predictions with unprecedented accuracy and scale

27.08.2025 19:52 👍 1 🔁 0 💬 1 📌 0
Post image

Excited to share what I’ve been working on these past months: automated forecasting.

At Mantic, we’re building AI systems that predict the outcomes of future events—across geopolitics, business, policy, technology, and culture. 🌍

27.08.2025 19:52 👍 1 🔁 0 💬 1 📌 0
Preview
OpenAI o3-mini Pushing the frontier of cost-effective reasoning.

OpenAI releases o3-mini
openai.com/index/openai...

31.01.2025 20:00 👍 1 🔁 0 💬 0 📌 0
Alignment Faking in Large Language Models
Alignment Faking in Large Language Models YouTube video by Samuel Albanie

Video summary of recent work on alignment faking

www.youtube.com/watch?v=_1bz...

27.12.2024 20:12 👍 3 🔁 1 💬 0 📌 0
Deliberative Alignment
Deliberative Alignment YouTube video by Samuel Albanie

Video summary of deliberative alignment

youtu.be/1efVS4DeEOs

Links:
- Paper: arxiv.org/abs/2412.16339
- Blog: openai.com/index/delibe...

28.12.2024 21:02 👍 2 🔁 1 💬 0 📌 0

Paper: "Alignment Faking in Large Language Models"
arXiv: https://arxiv.org/abs/2412.14093

23.12.2024 19:26 👍 0 🔁 0 💬 0 📌 0

3️⃣ The model showed self-preservation behaviors, actively trying to protect its original training when given the opportunity 🛡️

23.12.2024 19:26 👍 0 🔁 0 💬 1 📌 0

2️⃣ When trained via reinforcement learning, alignment-faking reasoning increased to 78%, showing sophisticated deceptive behaviors when identifying training scenarios 📈

23.12.2024 19:26 👍 0 🔁 0 💬 1 📌 0

1️⃣ Using Claude 3 Opus, researchers found the model complied with harmful queries 12% of the time from "free users" (training scenario) while refusing similar queries from "paid users" 🤖

23.12.2024 19:26 👍 1 🔁 0 💬 1 📌 0
Alignment faking in large language models - by Vlad Bogolin Today's paper explores a concerning phenomenon called "alignment faking" in large language models (LLMs). The study demonstrates how LLMs can strategi...

New study shows LLMs can strategically comply with training objectives while maintaining different behaviors outside training - similar to how humans might act differently when monitored vs unmonitored.

Congrats to the authors for their work! 🔍

23.12.2024 19:26 👍 1 🔁 0 💬 1 📌 0

4️⃣ Models demonstrated self-preservation behaviors, including attempts to protect original training when given the opportunity 🛡️

23.12.2024 19:16 👍 0 🔁 0 💬 0 📌 0

3️⃣ When trained via reinforcement learning, alignment-faking reasoning increased to 78%, showing sophisticated deceptive behaviors 🔄

23.12.2024 19:16 👍 0 🔁 0 💬 1 📌 0

2️⃣ Key finding: Model complied with harmful queries 12% of the time for "free users" (training) but almost never for "paid users" (non-training) 📊

23.12.2024 19:16 👍 0 🔁 0 💬 1 📌 0

1️⃣ Using Claude 3 Opus, researchers created scenarios with conflicting instructions - a system prompt for harmful queries vs prior training to refuse them

23.12.2024 19:16 👍 0 🔁 0 💬 1 📌 0

Paper: "Phi-4 Technical Report" by Abdin et al.
arXiv: https://arxiv.org/abs/2412.08905

Blog post: https://vladbogo.substack.com/p/phi-4-technical-report

14.12.2024 21:46 👍 0 🔁 0 💬 0 📌 0

Results:
🔹 Outperforms larger models on reasoning benchmarks
🔹 Excels in STEM-focused QA, surpassing GPT-4 on several tests
🔹 Achieves high performance with lower parameter count and inference costs

14.12.2024 21:46 👍 0 🔁 0 💬 1 📌 0

3️⃣ Post-training optimization with supervised fine-tuning and Direct Preference Optimization (DPO)
4️⃣ Introduction of "pivotal token search" for creating DPO pairs

14.12.2024 21:46 👍 0 🔁 0 💬 1 📌 0

Key points:

1️⃣ Synthetic data generation using multi-agent prompting, self-revision, and instruction reversal
2️⃣ Careful curation of organic data from high-quality sources

14.12.2024 21:46 👍 0 🔁 0 💬 1 📌 0
Blog

Phi-4: A 14B parameter language model prioritizing data quality over size, achieving performance comparable to larger models in reasoning tasks.

Congrats to the authors for their work!

14.12.2024 21:46 👍 0 🔁 0 💬 1 📌 0

Paper: "Learning Flow Fields in Attention for Controllable Person Image Generation"

Read more: https://vladbogo.substack.com/p/learning-flow-fields-in-attention

Full paper: https://huggingface.co/papers/2412.08486

13.12.2024 20:01 👍 0 🔁 0 💬 0 📌 0

3️⃣ Leffa demonstrates better preservation of fine-grained details like textures and patterns compared to existing methods. 🔍

13.12.2024 20:01 👍 0 🔁 0 💬 1 📌 0

2️⃣ The method achieves state-of-the-art performance in virtual try-on and pose transfer tasks, with significant reductions in FID scores across datasets. 📊

13.12.2024 20:01 👍 0 🔁 0 💬 1 📌 0

1️⃣ Leffa uses flow fields in attention layers to guide the target query to attend to correct reference regions during training.

13.12.2024 20:01 👍 0 🔁 0 💬 1 📌 0
Blog

New paper introduces Leffa, a method for controllable person image generation that preserves fine-grained details while manipulating appearance and pose. Congrats to the authors for their work! 🖼️👥

13.12.2024 20:01 👍 0 🔁 0 💬 1 📌 0
Preview
Microsoft debuts Phi-4, a new generative AI model, in research preview Microsoft has announced the newest addition to its Phi family of generative AI models. Called Phi-4, the model is improved in several areas over its predecessors, Microsoft claims — in particular math problem solving. That’s partly the result of improved…

Microsoft debuts Phi-4, a new generative AI model, in research preview

13.12.2024 01:03 👍 27 🔁 4 💬 0 📌 1

Paper: "Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation"

Read more: https://vladbogo.substack.com/p/around-the-world-in-80-timesteps

12.12.2024 19:37 👍 0 🔁 0 💬 0 📌 0

4️⃣ Achieves state-of-the-art performance on OpenStreetView-5M, YFCC-100M, and iNat21 benchmarks

12.12.2024 19:37 👍 0 🔁 0 💬 1 📌 0

3️⃣ Generates probability distributions over possible locations, expressing uncertainty for ambiguous images

12.12.2024 19:37 👍 0 🔁 0 💬 1 📌 0

2️⃣ Implements three variants: diffusion in 3D space, flow matching in 3D space, and Riemannian flow matching on Earth's surface

12.12.2024 19:37 👍 0 🔁 0 💬 1 📌 0