Jan Peters's Avatar

Jan Peters

@jan-peters

#RobotLearning Professor (#MachineLearning #Robotics) at @ias-tudarmstadt.bsky.social of @tuda.bsky.social @dfki.bsky.social @hessianai.bsky.social

2,185
Followers
1,387
Following
18
Posts
14.11.2024
Joined
Posts Following

Latest posts by Jan Peters @jan-peters

Video thumbnail

Proud to share our latest work, accepted at @iclr-conf.bsky.social 2026: APPLE! ๐ŸŽ

TL;DR: APPLE is a novel reinforcement learning framework for solving active perception problems.

#ICLR2026 #Robotics #MachineLearning #ActivePerception #RL
@ias-tudarmstadt.bsky.social

12.02.2026 12:32 ๐Ÿ‘ 14 ๐Ÿ” 3 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0
Video thumbnail

๐Ÿงต Accepted at @iclr-conf.bsky.social!

Target networks stabilize bootstrapping in RL ๐Ÿ›ก๏ธ
But induce slow-moving targets ๐Ÿข

Online networks adapt fast โšก
But can diverge with function approximation ๐Ÿ’ฅ

๐— ๐—œ๐—ก๐—ง๐—ข ๐ŸŒฟ uses the online network ๐—ผ๐—ป๐—น๐˜† ๐—ถ๐—ณ ๐—ถ๐˜ ๐—ฐ๐—ฎ๐—ป โ€” yielding faster ๐˜ข๐˜ฏ๐˜ฅ more stable RL.

Hereโ€™s how ๐Ÿ‘‡

11.02.2026 17:02 ๐Ÿ‘ 10 ๐Ÿ” 3 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0
Post image

Should we use a target network in deep value-based RL?๐Ÿค”

The answer has always been YES or NO, as there are pros and cons.

@iclr-conf.bsky.social, I will present iS-QN, a method that lies in between this binary view, collecting the pros while reducing the cons๐Ÿš€

05.02.2026 16:37 ๐Ÿ‘ 21 ๐Ÿ” 4 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 1
Post image

๐ŸฅณOur paper "Floating-Base Deep Lagrangian Networks (FeLaN)" has been accepted to #ICRA2026.

FeLaN: a grey-box approach for physically consistent SysID of floating-base robots (humanoids, quadrupeds).

๐Ÿ“„ arxiv.org/abs/2510.17270
๐Ÿ’ป Soon!
๐ŸŒ schulze18.github.io/felan_website/

03.02.2026 16:29 ๐Ÿ‘ 10 ๐Ÿ” 3 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0
Preview
NVIDIA Awards up to $60,000 Research Fellowships to PhD Students The Graduate Fellowship Program announced the latest awards of up to $60,000 each to 10 Ph.D. students involved in research that spans all areas of computing innovation.

I'm super excited to have been named an #NVIDIA Graduate Fellowship Finalist! ๐Ÿ’š

Huge thanks to my supervisor @jan-peters.bsky.social and all my collaborators.

Can't wait to join the NVIDIA Seattle Robotics Lab for my internship next summer! ๐Ÿค–

blogs.nvidia.com/blog/graduat...

13.12.2025 16:26 ๐Ÿ‘ 5 ๐Ÿ” 3 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

๐ŸŽ‰ Really excited, our paper "XQC: Well-conditioned Optimization Accelerates Deep Reinforcement Learning" has been accepted at #ICLR2026.

If you are interested in reinforcement learning, sample-efficiency, compute-efficiency go check it out. See you in Rio!

03.02.2026 10:33 ๐Ÿ‘ 10 ๐Ÿ” 3 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

Just graduated PhD student #45 ... Niklas Wilhelm Funk did an outstanding job defending a doctoral dissertation on Learning Robotic Manipulation through Vision, Touch, and Spatially Grounded Representations! Major insights on many different aspects of manipulation...

09.12.2025 13:38 ๐Ÿ‘ 4 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

Just graduated PhD student #45 ... Niklas Wilhelm Funk did an outstanding job defending a doctoral dissertation on Learning Robotic Manipulation through Vision, Touch, and Spatially Grounded Representations! Major insights on many different aspects of manipulation...

09.12.2025 12:05 ๐Ÿ‘ 4 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

@timschneider94.bsky.social will present "Analysing the Interplay of Vision and Touch for Dexterous Insertion Tasks" by Janis Lenz, Tim Schneider, Theo Gruner, @daniel-palenicek.bsky.social, and
@jan-peters.bsky.social

๐Ÿ—“๏ธ 13.06, 16:30 - 19:30
๐Ÿ“ Poster 100
See bsky.app/profile/tims...

13.06.2025 11:06 ๐Ÿ‘ 3 ๐Ÿ” 1 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Bluesky

Our work introduces a geometrically-aware approach that brings motion planning into Bayesian goal inferenceโ€”an early but promising direction.

With @anindex.bsky.social , Theo Gruner, @joemwatson.bsky.social , @georgiachal.bsky.social & @jan-peters.bsky.social

13.06.2025 11:18 ๐Ÿ‘ 2 ๐Ÿ” 1 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0

@kay-pompetzki.bsky.social will present "Geometrically-Aware Goal Inference: Leveraging Motion Planning as Inference" by Kay Pompetzki, @anindex.bsky.social, Theo Gruner, @georgiachal.bsky.social, and @jan-peters.bsky.social

๐Ÿ—“๏ธ 13.06, 16:30 - 19:30
๐Ÿ“ Poster 86
See bsky.app/profile/kay-...

13.06.2025 11:21 ๐Ÿ‘ 4 ๐Ÿ” 1 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Preview
Massively Scaling Explicit Policy-conditioned Value Functions We introduce a scaling strategy for Explicit Policy-Conditioned Value Functions (EPVFs) that significantly improves performance on challenging continuous-control tasks. EPVFs learn a value function V(...

๐Ÿฆพ By combining EPVFs with massive parallelism and careful regularization, we close the gap with state-of-the-art DRL in complex environments.
๐Ÿ”— Full paper: arxiv.org/abs/2502.11949
โœจ Finally, many thanks to @jan-peters.bsky.social and
@ias-tudarmstadt.bsky.social for the support!

13.06.2025 11:50 ๐Ÿ‘ 4 ๐Ÿ” 1 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

We will also present "Scaling Off-Policy Reinforcement Learning with Batch and Weight Normalization" by @daniel-palenicek.bsky.social, Florian Vogt, @joemwatson.bsky.social, and @jan-peters.bsky.social.

๐Ÿ—“๏ธ 13.06, 16:30 - 19:30
๐Ÿ“ Poster 50
See bsky.app/profile/did:...

13.06.2025 14:01 ๐Ÿ‘ 5 ๐Ÿ” 1 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

Or come to my talk @ International Symposium on Adaptive Motion of Animals and Machines and LokoAssist Symposium (AMAM) on Friday at TU Darmstadt

Thanks to @ias-tudarmstadt.bsky.social, @jan-peters.bsky.social

02.07.2025 18:19 ๐Ÿ‘ 5 ๐Ÿ” 1 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

๐ŸŽค Very excited to give a talk @cohereforai.bsky.social next week Friday ๐ŸŽค

I will be presenting the research I have been working on for the last 2 years with Carlo D'Eramo, @jan-peters.bsky.social, and many more collaborators!

11.07.2025 16:17 ๐Ÿ‘ 4 ๐Ÿ” 1 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0
Video thumbnail

๐ŸŽ‰Our paper "Context-Aware Deep Lagrangian Networks for MPC" was accepted at #IROS2025!

We present CaDeLaC: adaptive physics-consistent robot control via online SysID + MPC.

Paper: arxiv.org/abs/2506.15249

Big thanks to @ias-tudarmstadt.bsky.social @jan-peters.bsky.social @olegarenz.bsky.social

18.07.2025 13:28 ๐Ÿ‘ 7 ๐Ÿ” 2 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0
Preview
GitHub - TimSchneider42/franky: High-Level Control Library for Franka Robots with Python and C++ Support High-Level Control Library for Franka Robots with Python and C++ Support - GitHub - TimSchneider42/franky: High-Level Control Library for Franka Robots with Python and C++ Support

Pushing for #icra but still missing real robot experiments? ๐Ÿ˜ฐ
Skip the ROS headaches โ€” get your Franka robot running in minutes with franky! ๐Ÿฆพ
Super beginner-friendly, Pythonic, and fast to set up.
๐Ÿ”— github.com/TimSchneider...
@ias-tudarmstadt.bsky.social @jan-peters.bsky.social

31.07.2025 17:09 ๐Ÿ‘ 6 ๐Ÿ” 2 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0
Post image

๐Ÿš€ New preprint! Introducing XQCโ€” a simple, well-conditioned actor-critic that achieves SOTA sample efficiency in #RL
โœ… ~4.5ร— fewer parameters than SimbaV2
โœ… Scales to vision-based RL
๐Ÿ‘‰ arxiv.org/pdf/2509.25174

Thanks to Florian Vogt @joemwatson.bsky.social @jan-peters.bsky.social

02.10.2025 15:48 ๐Ÿ‘ 7 ๐Ÿ” 2 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 1
Preview
GitHub - Schulze18/cadelac: Context-Aware Deep Lagrangian Networks for Model Predictive Control (CaDeLaC) Context-Aware Deep Lagrangian Networks for Model Predictive Control (CaDeLaC) - Schulze18/cadelac

Warming up for #IROS2025 ๐Ÿ”ฅ

Weโ€™re releasing CaDeLaC as open source!
- Training pipeline
- Integration with acados for adaptive physics-consistent MPC
- Simulation + real Franka torque control

๐Ÿ’ป github.com/Schulze18/ca...

@ias-tudarmstadt.bsky.social @jan-peters.bsky.social @olegarenz.bsky.social

14.10.2025 12:40 ๐Ÿ‘ 7 ๐Ÿ” 2 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0
Post image

๐Ÿ‡ฐ๐Ÿ‡ท Conferences are about finally meeting your collaborators from all around the world!

Check out our work on Embodiment Scaling Laws @CoRL2025
We investigate cross-embodiment learning as the next axis of scaling for truly generalist policies ๐Ÿ“ˆ

๐Ÿ”— All details: embodiment-scaling-laws.github.io

30.09.2025 08:10 ๐Ÿ‘ 9 ๐Ÿ” 2 ๐Ÿ’ฌ 2 ๐Ÿ“Œ 0
Post image

As usual, @ewrl18.bsky.social was a wonderful experience.

I had the pleasure of presenting my research as a Contributed Talk ๐ŸŽ‰

Special thanks to the organizers for making it happen!

19.09.2025 16:08 ๐Ÿ‘ 8 ๐Ÿ” 2 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0

Today, @theovincent.bsky.social l will present "Eau De Q-Network: Adaptive Distillation of Neural Networks in Deep Reinforcement Learning" by Thรฉo Vincent, @jan-peters.bsky.social, and Carlo D'Eramo.
๐Ÿ—“๏ธ 12.06, 16:30 - 19:30
๐Ÿ“ Poster 28
See bsky.app/profile/theo...

12.06.2025 14:56 ๐Ÿ‘ 5 ๐Ÿ” 1 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

@timschneider94.bsky.social will present "Active Perception for Tactile Sensing: A Task-Agnostic Attention-Based Approach" by Tim Schneider, Cristiana de Farias, Roberto Calandra, Liming Chen, and @jan-peters.bsky.social
๐Ÿ—“๏ธ 12.06, 16:30 - 19:30
๐Ÿ“ Poster 105
See bsky.app/profile/tims...

12.06.2025 14:57 ๐Ÿ‘ 3 ๐Ÿ” 1 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

๐Ÿš€ Interested in Neuro-Symbolic Learning and attending #ICRA2025? ๐Ÿง ๐Ÿค–

Do not miss Leon Keller presenting โ€œNeuro-Symbolic Imitation Learning: Discovering Symbolic Abstractions for Skill Learningโ€.

Joint work of Honda Research Institute EU and @jan-peters.bsky.social (@ias-tudarmstadt.bsky.social).

19.05.2025 08:51 ๐Ÿ‘ 11 ๐Ÿ” 2 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0

Thanks to my co-authors Florian Vogt, @joemwatson.bsky.social @jan-peters.bsky.social

@hessianai.bsky.social @ias-tudarmstadt.bsky.social @dfki.bsky.social @cs-tudarmstadt.bsky.social
#RL #ML #AI

23.05.2025 12:50 ๐Ÿ‘ 4 ๐Ÿ” 1 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

Work conducted @ias-tudarmstadt.bsky.social headed by @jan-peters.bsky.social @cs-tudarmstadt.bsky.social @tuda.bsky.social

Many thanks to all #LocoMuJoCo Contributors!

18.04.2025 22:47 ๐Ÿ‘ 3 ๐Ÿ” 1 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image Post image Post image

First German Robotics Conference starting in Messezentrum Nuremberg! 400+ delegates plus RoboCup next door! @jan-peters.bsky.social @tuda.bsky.social

13.03.2025 12:17 ๐Ÿ‘ 11 ๐Ÿ” 2 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

Many thanks to my colleagues and collaborators: Daniel Palenicek, ลukasz Antczak, @jan-peters.bsky.social and most importantly Jonathan Kinzel (@ibims1jfk.bsky.social), who interned at MAB Robotics and did the experiments.
Also thanks to MAB Robotics for providing the hardware and constant support!

18.03.2025 22:24 ๐Ÿ‘ 4 ๐Ÿ” 1 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

We build on the efficient CrossQ DRL algorithm and combine it with two control architectures โ€” Joint Target Prediction for agile maneuvers and Central Pattern Generators for stable, natural gaits โ€” to train locomotion policies directly on the HoneyBadger quadruped robot from MAB Robotics.

18.03.2025 22:24 ๐Ÿ‘ 5 ๐Ÿ” 1 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0
Gait in Eight

Intrigued? Check out the paper and videos here: nico-bohlinger.github.io/gait_in_eigh...

18.03.2025 22:24 ๐Ÿ‘ 3 ๐Ÿ” 1 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0