Pietro Novelli

@pienovelli

Physicist, working on machine learning for dynamical systems | reinforcement learning | machine learning for science | transfer learning for atomistic potentials | statistical learning theory & optimization.

22 Followers · 26 Following · 8 Posts · Joined 25.11.2024

Latest posts by Pietro Novelli @pienovelli

For the past four years, I’ve been working on a topic that’s both fascinating and challenging to explain. In this post, I’ve tried to present The Operator Way — a paradigm for understanding dynamical processes — in plain, approachable terms.

pietronvll.github.io/the-operator...

09.01.2025 17:19 👍 7 🔁 0 💬 1 📌 1
By the time I finished working on this paper, I had more research questions than when I started. I take this fertility of ideas as a very good sign 😃. If you’re in Vancouver, consider checking it out. I’ll be at the West Ballroom A-D from 16:30 to 19:30, poster #6907

12.12.2024 17:19 👍 2 🔁 0 💬 0 📌 0

To add some flesh around this core idea, we developed a neat theoretical foundation that combines conditional mean embeddings and policy mirror descent. This foundation ultimately leads to sample complexity results, highlighting the interplay between exploration and exploitation.

12.12.2024 17:19 👍 2 🔁 0 💬 1 📌 0
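
As a rough illustration of the policy mirror descent ingredient mentioned in the post above, here is a minimal tabular sketch of the standard KL-regularized update, pi_new(a|s) ∝ pi(a|s) · exp(eta · Q(s, a)). It is not the paper's algorithm: the toy transition tensor `P`, the rewards `R`, the step size `eta`, and the helper names `evaluate_q` and `pmd_step` are all assumptions made for this example, and the model is taken as known purely to keep the sketch short.

```python
import numpy as np

def evaluate_q(P, R, pi, gamma):
    """Exact Q-function of policy pi in a small, fully known MDP (an assumption here).

    P: transitions, shape (S, A, S); R: rewards, shape (S, A); pi: policy, shape (S, A).
    """
    S, _ = R.shape
    P_pi = np.einsum("sa,sat->st", pi, P)      # state-to-state transitions under pi
    r_pi = np.sum(pi * R, axis=1)              # expected one-step reward under pi
    V = np.linalg.solve(np.eye(S) - gamma * P_pi, r_pi)
    return R + gamma * P @ V                   # Q(s, a) = r(s, a) + gamma * E[V(s')]

def pmd_step(pi, Q, eta):
    """One mirror descent step with the KL divergence: a multiplicative-weights update."""
    logits = np.log(pi) + eta * Q
    logits -= logits.max(axis=1, keepdims=True)  # shift for numerical stability
    new_pi = np.exp(logits)
    return new_pi / new_pi.sum(axis=1, keepdims=True)

# Hypothetical 2-state, 2-action MDP, chosen only for illustration.
P = np.array([[[0.9, 0.1], [0.2, 0.8]],
              [[0.7, 0.3], [0.1, 0.9]]])
R = np.array([[0.0, 1.0],
              [0.5, 2.0]])
gamma, eta = 0.9, 1.0

pi = np.full((2, 2), 0.5)                      # start from the uniform policy
for _ in range(50):
    pi = pmd_step(pi, evaluate_q(P, R, pi, gamma), eta)
```

In this toy problem the second action has the higher reward in both states and leads more often to the better state, so the iterates concentrate on it.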

The return is a (conditional) expected value, and we realized that there are now mature ML tools to model such expected values directly, avoiding the solution of intermediate and more difficult problems.

12.12.2024 17:19 👍 2 🔁 0 💬 1 📌 0
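
To make "modeling a conditional expected value directly" concrete, here is a minimal sketch that estimates E[Y | X = x] with kernel ridge regression, the same linear-algebra recipe that underlies empirical conditional mean embeddings. It is only an illustration under stated assumptions, not the method of the paper: the Gaussian kernel, the `lengthscale` and `reg` values, the synthetic data, and the function names are all hypothetical choices.

```python
import numpy as np

def gaussian_kernel(A, B, lengthscale=0.5):
    """Gaussian (RBF) kernel matrix between the rows of A and the rows of B."""
    sq = np.sum(A**2, axis=1)[:, None] + np.sum(B**2, axis=1)[None, :] - 2 * A @ B.T
    return np.exp(-np.maximum(sq, 0.0) / (2 * lengthscale**2))

def fit_conditional_mean(X, Y, reg=1e-3, lengthscale=0.5):
    """Return a function x -> estimate of E[Y | X = x] via kernel ridge regression."""
    n = X.shape[0]
    K = gaussian_kernel(X, X, lengthscale)
    alpha = np.linalg.solve(K + n * reg * np.eye(n), Y)  # regularized linear solve

    def predict(X_new):
        return gaussian_kernel(X_new, X, lengthscale) @ alpha

    return predict

# Synthetic data (an assumption): Y = sin(3 X) + noise, so E[Y | X = x] = sin(3 x).
rng = np.random.default_rng(0)
X = rng.uniform(-1, 1, size=(200, 1))
Y = np.sin(3 * X[:, 0]) + 0.1 * rng.standard_normal(200)

predict = fit_conditional_mean(X, Y)
estimate = predict(np.array([[0.3]]))[0]       # should be close to sin(0.9)
```

The point of the sketch is that the expected value is regressed directly from samples, without first estimating the full conditional distribution of Y given X.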

So, what’s all this fuss about? Reinforcement learning, in essence, is an optimization problem: we want to maximize returns.

12.12.2024 17:19 👍 2 🔁 0 💬 1 📌 0
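
For readers who want the optimization problem from the post above spelled out, one common way to write it is the discounted-return objective below. The notation (policy $\pi$, discount $\gamma$, reward $r$, states $s_t$, actions $a_t$) is the standard textbook one and is an assumption here, not taken from the post.

```latex
\max_{\pi} \; J(\pi),
\qquad
J(\pi) = \mathbb{E}_{\pi}\!\left[ \sum_{t=0}^{\infty} \gamma^{t}\, r(s_t, a_t) \right],
\qquad
\gamma \in [0, 1).
```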
Operator World Models for Reinforcement Learning
Policy Mirror Descent (PMD) is a powerful and theoretically sound methodology for sequential decision-making. However, it is not directly applicable to Reinforcement Learning (RL) due to the inaccessi...

This quote neatly encapsulates the core of our paper, “Operator World Models for Reinforcement Learning”, which we’re presenting today at @NeurIPS. arxiv.org/abs/2406.19861

12.12.2024 17:19 👍 2 🔁 0 💬 1 📌 0

In his book “The Nature of Statistical Learning Theory”, V. Vapnik wrote:
“When solving a given problem, try to avoid a more general problem as an intermediate step”

12.12.2024 17:19 👍 8 🔁 3 💬 1 📌 0

Come check it out!!

10.12.2024 02:42 👍 2 🔁 1 💬 0 📌 0