Jose Arjona-Medina (@arjonamedina)

(5/5) Perhaps I'm overlooking something-open to your insights 😀

01.03.2025 19:30 👍 0 🔁 0 💬 0 📌 0

(4/5) However, what we see in DeepSeek's formulation is that 1) there is no sequence of actions, and 2) the reference policy remains the same.

I don't see the credit assignment mechanism from future rewards to current actions in this formulation, which is the key factor in RL.

01.03.2025 19:30 👍 0 🔁 0 💬 1 📌 0

(3/5) Because this objective is difficult to compute (there are two distributions from two different policies involved) a constrain on how far these two policies can be is included so we can just "ignore" this mismatch.

01.03.2025 19:30 👍 0 🔁 0 💬 1 📌 0

(2/5) In TRPO and PPO, you maximize the expected sum of advantages over a sequence of actions as a way to optimize policy improvement steps.

01.03.2025 19:30 👍 0 🔁 0 💬 1 📌 0

I still uncertain about the RL aspect in DeepSeek.

To me it looks like a clever way of applaying a PPO-like clipping within a supervised framework, constrained by a fixed reference model. Althought some parts in its formulation are very similar to PPO, I wouldn't describe it as RL. (1/5)🧵

01.03.2025 19:30 👍 2 🔁 0 💬 1 📌 0

Any recommendation for a drug discovery related confernece in Europe this year? Im all ears 😄

10.01.2025 07:58 👍 2 🔁 0 💬 0 📌 0

*generalize

29.12.2024 11:09 👍 2 🔁 0 💬 0 📌 0

softmax is not enough (for sharp out-of-distribution) A key property of reasoning systems is the ability to make sharp decisions on their input data. For contemporary AI systems, a key carrier of sharp behaviour is the softmax function, with its capabili...

Why your transformer-based net does not generalized from small molecules to peptides? Well, here one of the reasons: arxiv.org/abs/2410.01104

Once you know the root of the problem, you can find nice solutions 😉
In our case, a very simple regularization term did the job.

29.12.2024 11:07 👍 16 🔁 4 💬 1 📌 0

Great book. I also enjoyed it a lot.

26.12.2024 22:17 👍 1 🔁 0 💬 0 📌 0

Thanks! We introduce inductive bias at different levels. For instance, we refined the centrality encoder to implicitly capture atom hybridization. We have other examples in the paper, and many others that we hope to publish soon 😉

02.12.2024 11:14 👍 0 🔁 0 💬 1 📌 0

I'm making a list of AI for Science researchers on bluesky — let me know if I missed you / if you'd like to join!

go.bsky.app/AcP9Lix

10.11.2024 00:11 👍 247 🔁 91 💬 160 📌 5

Same here! Thanks :)

16.11.2024 19:21 👍 1 🔁 0 💬 1 📌 0

Thanks a lot!

16.11.2024 19:13 👍 0 🔁 0 💬 0 📌 0

Analysis of Atom-level pretraining with Quantum Mechanics (QM) data for Graph Neural Networks Molecular property models Despite the rapid and significant advancements in deep learning for Quantitative Structure-Activity Relationship (QSAR) models, the challenge of learning robust molecular representations that effectiv...

I'm working on improving molecular GNN models by adding inductive bias and integrating different data modalities. We recently presented part of our work in a workshop at ICML last July: arxiv.org/abs/2405.14837

16.11.2024 19:11 👍 1 🔁 0 💬 1 📌 0

Thanks a lot!

16.11.2024 18:47 👍 1 🔁 0 💬 0 📌 0

A starter pack for anyone interested in AI & drug discovery :)

go.bsky.app/AgYHc8j

15.11.2024 00:29 👍 53 🔁 15 💬 29 📌 2

Hi, just arrived here. Nice to see this starter pack. Im also doing some research in drug design and I would love to be inluded there as well. Thanks!

16.11.2024 18:39 👍 0 🔁 0 💬 2 📌 0

Follow leading researchers, practitioners, and thought leaders exploring the intersection of #AI, machine learning, data science, and scientific research. 🦋

➕We are just getting started, please send me others who should be added

go.bsky.app/JeFdryY #ML

15.11.2024 16:11 👍 106 🔁 41 💬 44 📌 5

Great initiative! I'd love to be there as well

16.11.2024 18:32 👍 1 🔁 0 💬 1 📌 0

Jose Arjona-Medina

Latest posts by Jose Arjona-Medina @arjonamedina