Proud to share our latest work, accepted at @iclr-conf.bsky.social 2026: APPLE! ๐
TL;DR: APPLE is a novel reinforcement learning framework for solving active perception problems.
#ICLR2026 #Robotics #MachineLearning #ActivePerception #RL
@ias-tudarmstadt.bsky.social
12.02.2026 12:32
๐ 14
๐ 3
๐ฌ 1
๐ 0
๐งต Accepted at @iclr-conf.bsky.social!
Target networks stabilize bootstrapping in RL ๐ก๏ธ
But induce slow-moving targets ๐ข
Online networks adapt fast โก
But can diverge with function approximation ๐ฅ
๐ ๐๐ก๐ง๐ข ๐ฟ uses the online network ๐ผ๐ป๐น๐ ๐ถ๐ณ ๐ถ๐ ๐ฐ๐ฎ๐ป โ yielding faster ๐ข๐ฏ๐ฅ more stable RL.
Hereโs how ๐
11.02.2026 17:02
๐ 10
๐ 3
๐ฌ 1
๐ 0
Should we use a target network in deep value-based RL?๐ค
The answer has always been YES or NO, as there are pros and cons.
@iclr-conf.bsky.social, I will present iS-QN, a method that lies in between this binary view, collecting the pros while reducing the cons๐
05.02.2026 16:37
๐ 21
๐ 4
๐ฌ 1
๐ 1
๐ฅณOur paper "Floating-Base Deep Lagrangian Networks (FeLaN)" has been accepted to #ICRA2026.
FeLaN: a grey-box approach for physically consistent SysID of floating-base robots (humanoids, quadrupeds).
๐ arxiv.org/abs/2510.17270
๐ป Soon!
๐ schulze18.github.io/felan_website/
03.02.2026 16:29
๐ 10
๐ 3
๐ฌ 1
๐ 0
NVIDIA Awards up to $60,000 Research Fellowships to PhD Students
The Graduate Fellowship Program announced the latest awards of up to $60,000 each to 10 Ph.D. students involved in research that spans all areas of computing innovation.
I'm super excited to have been named an #NVIDIA Graduate Fellowship Finalist! ๐
Huge thanks to my supervisor @jan-peters.bsky.social and all my collaborators.
Can't wait to join the NVIDIA Seattle Robotics Lab for my internship next summer! ๐ค
blogs.nvidia.com/blog/graduat...
13.12.2025 16:26
๐ 5
๐ 3
๐ฌ 0
๐ 0
๐ Really excited, our paper "XQC: Well-conditioned Optimization Accelerates Deep Reinforcement Learning" has been accepted at #ICLR2026.
If you are interested in reinforcement learning, sample-efficiency, compute-efficiency go check it out. See you in Rio!
03.02.2026 10:33
๐ 10
๐ 3
๐ฌ 0
๐ 0
Just graduated PhD student #45 ... Niklas Wilhelm Funk did an outstanding job defending a doctoral dissertation on Learning Robotic Manipulation through Vision, Touch, and Spatially Grounded Representations! Major insights on many different aspects of manipulation...
09.12.2025 13:38
๐ 4
๐ 0
๐ฌ 0
๐ 0
Just graduated PhD student #45 ... Niklas Wilhelm Funk did an outstanding job defending a doctoral dissertation on Learning Robotic Manipulation through Vision, Touch, and Spatially Grounded Representations! Major insights on many different aspects of manipulation...
09.12.2025 12:05
๐ 4
๐ 0
๐ฌ 0
๐ 0
@timschneider94.bsky.social will present "Analysing the Interplay of Vision and Touch for Dexterous Insertion Tasks" by Janis Lenz, Tim Schneider, Theo Gruner, @daniel-palenicek.bsky.social, and
@jan-peters.bsky.social
๐๏ธ 13.06, 16:30 - 19:30
๐ Poster 100
See bsky.app/profile/tims...
13.06.2025 11:06
๐ 3
๐ 1
๐ฌ 0
๐ 0
Bluesky
Our work introduces a geometrically-aware approach that brings motion planning into Bayesian goal inferenceโan early but promising direction.
With @anindex.bsky.social , Theo Gruner, @joemwatson.bsky.social , @georgiachal.bsky.social & @jan-peters.bsky.social
13.06.2025 11:18
๐ 2
๐ 1
๐ฌ 1
๐ 0
@kay-pompetzki.bsky.social will present "Geometrically-Aware Goal Inference: Leveraging Motion Planning as Inference" by Kay Pompetzki, @anindex.bsky.social, Theo Gruner, @georgiachal.bsky.social, and @jan-peters.bsky.social
๐๏ธ 13.06, 16:30 - 19:30
๐ Poster 86
See bsky.app/profile/kay-...
13.06.2025 11:21
๐ 4
๐ 1
๐ฌ 0
๐ 0
Massively Scaling Explicit Policy-conditioned Value Functions
We introduce a scaling strategy for Explicit Policy-Conditioned Value Functions (EPVFs) that significantly improves performance on challenging continuous-control tasks. EPVFs learn a value function V(...
๐ฆพ By combining EPVFs with massive parallelism and careful regularization, we close the gap with state-of-the-art DRL in complex environments.
๐ Full paper: arxiv.org/abs/2502.11949
โจ Finally, many thanks to @jan-peters.bsky.social and
@ias-tudarmstadt.bsky.social for the support!
13.06.2025 11:50
๐ 4
๐ 1
๐ฌ 0
๐ 0
We will also present "Scaling Off-Policy Reinforcement Learning with Batch and Weight Normalization" by @daniel-palenicek.bsky.social, Florian Vogt, @joemwatson.bsky.social, and @jan-peters.bsky.social.
๐๏ธ 13.06, 16:30 - 19:30
๐ Poster 50
See bsky.app/profile/did:...
13.06.2025 14:01
๐ 5
๐ 1
๐ฌ 0
๐ 0
Or come to my talk @ International Symposium on Adaptive Motion of Animals and Machines and LokoAssist Symposium (AMAM) on Friday at TU Darmstadt
Thanks to @ias-tudarmstadt.bsky.social, @jan-peters.bsky.social
02.07.2025 18:19
๐ 5
๐ 1
๐ฌ 0
๐ 0
๐ค Very excited to give a talk @cohereforai.bsky.social next week Friday ๐ค
I will be presenting the research I have been working on for the last 2 years with Carlo D'Eramo, @jan-peters.bsky.social, and many more collaborators!
11.07.2025 16:17
๐ 4
๐ 1
๐ฌ 1
๐ 0
๐Our paper "Context-Aware Deep Lagrangian Networks for MPC" was accepted at #IROS2025!
We present CaDeLaC: adaptive physics-consistent robot control via online SysID + MPC.
Paper: arxiv.org/abs/2506.15249
Big thanks to @ias-tudarmstadt.bsky.social @jan-peters.bsky.social @olegarenz.bsky.social
18.07.2025 13:28
๐ 7
๐ 2
๐ฌ 1
๐ 0
GitHub - TimSchneider42/franky: High-Level Control Library for Franka Robots with Python and C++ Support
High-Level Control Library for Franka Robots with Python and C++ Support - GitHub - TimSchneider42/franky: High-Level Control Library for Franka Robots with Python and C++ Support
Pushing for #icra but still missing real robot experiments? ๐ฐ
Skip the ROS headaches โ get your Franka robot running in minutes with franky! ๐ฆพ
Super beginner-friendly, Pythonic, and fast to set up.
๐ github.com/TimSchneider...
@ias-tudarmstadt.bsky.social @jan-peters.bsky.social
31.07.2025 17:09
๐ 6
๐ 2
๐ฌ 1
๐ 0
๐ New preprint! Introducing XQCโ a simple, well-conditioned actor-critic that achieves SOTA sample efficiency in #RL
โ
~4.5ร fewer parameters than SimbaV2
โ
Scales to vision-based RL
๐ arxiv.org/pdf/2509.25174
Thanks to Florian Vogt @joemwatson.bsky.social @jan-peters.bsky.social
02.10.2025 15:48
๐ 7
๐ 2
๐ฌ 1
๐ 1
GitHub - Schulze18/cadelac: Context-Aware Deep Lagrangian Networks for Model Predictive Control (CaDeLaC)
Context-Aware Deep Lagrangian Networks for Model Predictive Control (CaDeLaC) - Schulze18/cadelac
Warming up for #IROS2025 ๐ฅ
Weโre releasing CaDeLaC as open source!
- Training pipeline
- Integration with acados for adaptive physics-consistent MPC
- Simulation + real Franka torque control
๐ป github.com/Schulze18/ca...
@ias-tudarmstadt.bsky.social @jan-peters.bsky.social @olegarenz.bsky.social
14.10.2025 12:40
๐ 7
๐ 2
๐ฌ 1
๐ 0
๐ฐ๐ท Conferences are about finally meeting your collaborators from all around the world!
Check out our work on Embodiment Scaling Laws @CoRL2025
We investigate cross-embodiment learning as the next axis of scaling for truly generalist policies ๐
๐ All details: embodiment-scaling-laws.github.io
30.09.2025 08:10
๐ 9
๐ 2
๐ฌ 2
๐ 0
As usual, @ewrl18.bsky.social was a wonderful experience.
I had the pleasure of presenting my research as a Contributed Talk ๐
Special thanks to the organizers for making it happen!
19.09.2025 16:08
๐ 8
๐ 2
๐ฌ 1
๐ 0
Today, @theovincent.bsky.social l will present "Eau De Q-Network: Adaptive Distillation of Neural Networks in Deep Reinforcement Learning" by Thรฉo Vincent, @jan-peters.bsky.social, and Carlo D'Eramo.
๐๏ธ 12.06, 16:30 - 19:30
๐ Poster 28
See bsky.app/profile/theo...
12.06.2025 14:56
๐ 5
๐ 1
๐ฌ 0
๐ 0
@timschneider94.bsky.social will present "Active Perception for Tactile Sensing: A Task-Agnostic Attention-Based Approach" by Tim Schneider, Cristiana de Farias, Roberto Calandra, Liming Chen, and @jan-peters.bsky.social
๐๏ธ 12.06, 16:30 - 19:30
๐ Poster 105
See bsky.app/profile/tims...
12.06.2025 14:57
๐ 3
๐ 1
๐ฌ 0
๐ 0
๐ Interested in Neuro-Symbolic Learning and attending #ICRA2025? ๐ง ๐ค
Do not miss Leon Keller presenting โNeuro-Symbolic Imitation Learning: Discovering Symbolic Abstractions for Skill Learningโ.
Joint work of Honda Research Institute EU and @jan-peters.bsky.social (@ias-tudarmstadt.bsky.social).
19.05.2025 08:51
๐ 11
๐ 2
๐ฌ 1
๐ 0
Thanks to my co-authors Florian Vogt, @joemwatson.bsky.social @jan-peters.bsky.social
@hessianai.bsky.social @ias-tudarmstadt.bsky.social @dfki.bsky.social @cs-tudarmstadt.bsky.social
#RL #ML #AI
23.05.2025 12:50
๐ 4
๐ 1
๐ฌ 0
๐ 0
Work conducted @ias-tudarmstadt.bsky.social headed by @jan-peters.bsky.social @cs-tudarmstadt.bsky.social @tuda.bsky.social
Many thanks to all #LocoMuJoCo Contributors!
18.04.2025 22:47
๐ 3
๐ 1
๐ฌ 0
๐ 0
First German Robotics Conference starting in Messezentrum Nuremberg! 400+ delegates plus RoboCup next door! @jan-peters.bsky.social @tuda.bsky.social
13.03.2025 12:17
๐ 11
๐ 2
๐ฌ 0
๐ 0
Many thanks to my colleagues and collaborators: Daniel Palenicek, ลukasz Antczak, @jan-peters.bsky.social and most importantly Jonathan Kinzel (@ibims1jfk.bsky.social), who interned at MAB Robotics and did the experiments.
Also thanks to MAB Robotics for providing the hardware and constant support!
18.03.2025 22:24
๐ 4
๐ 1
๐ฌ 0
๐ 0
We build on the efficient CrossQ DRL algorithm and combine it with two control architectures โ Joint Target Prediction for agile maneuvers and Central Pattern Generators for stable, natural gaits โ to train locomotion policies directly on the HoneyBadger quadruped robot from MAB Robotics.
18.03.2025 22:24
๐ 5
๐ 1
๐ฌ 1
๐ 0
Gait in Eight
Intrigued? Check out the paper and videos here: nico-bohlinger.github.io/gait_in_eigh...
18.03.2025 22:24
๐ 3
๐ 1
๐ฌ 1
๐ 0