Trending
Anh-Quan Pham's Avatar

Anh-Quan Pham

@anhquanpham

I read MS Robotics at UPenn GRASP Lab RL + Robot Learning https://anhquanpham.github.io/

25
Followers
289
Following
21
Posts
28.11.2024
Joined
Posts Following

Latest posts by Anh-Quan Pham @anhquanpham

Many thanks to @marcelhussing.bsky.social, Shubhankar Patankar, and advisors @danisbassett.bsky.social, @jmendezm.bsky.social, @ericeaton.bsky.social for the collaboration and guidance that made this work possible 🦾.
🧡9/9

19.12.2025 18:22 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image

πŸ”‘Result 3: the model learns meaningful compositional structure.
Attention & intervention analyses reveal structured dependencies. The learned task graph differs from prior hand-designed architectures & better reflects which components matter for action & reward prediction.
🧡8/9

19.12.2025 18:09 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image

πŸ”‘Result 2: iterative compositional generation solves almost all tasks over time.
Over refinement rounds, our model yields successful trajectories for nearly every task, outperforming monolithic generation and providing a strong foundation for downstream policy learning.
🧡7/9

19.12.2025 18:07 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image

πŸ”‘Result 1: compositional generation of unseen tasks enables strong performance & improves with iteration.
Policies trained on synthetic data from our model outperform monolithic & standard DiT baselines, and quickly surpass multitask RL baselines without any new real data.
🧡6/9

19.12.2025 18:05 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image

Starting from data on ~22% of tasks, we iteratively generate data for unseen combinations, evaluate via offline RL & add datasets that yield strong policies to the next iteration training set.
Component-local updates prevent cross-task corruption, mitigating model collapse.
🧡5/9

19.12.2025 18:03 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image

Key design choice: semantic, compositional tokenization.
Each transition is tokenized by task components, not arbitrary patches. Each observation component has its own encoder and decoder, so synthetic data only updates the parts involved in that task, not the entire model.
🧡4/9

19.12.2025 18:01 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

We model tasks as a functionally compositional graph with state components, action, reward, and terminal as nodes.
Rather than hard-coding, a diffusion transformer learns this graph. Attention enables info exchange between components, capturing structure directly from data.
🧡3/9

19.12.2025 18:01 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image

To ground this idea, we use CompoSuite, where manipulation tasks are defined by composing a robot, object, obstacle and objective, yielding 256 tasks with shared components but distinct solutions. Observations include symbolic robot state, object, obstacle, and goal poses.
🧡2/9

19.12.2025 18:00 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

Most prior work uses generative models to upsample data within a single task.
We ask a different question:
πŸ‘‰ Can we exploit the compositional structure of manipulation tasks to generate data for unseen task combinations using conditional generative models?
🧡1/9

19.12.2025 17:58 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Video thumbnail

πŸ€– Robotic manipulation tasks grow combinatorially, but data collection still scales linearly.
Is there a better way to obtain expert datasets at scale?πŸ€”
Excited to share our latest work, Iterative Compositional Data Generation for Robot Control.
πŸ“„ doi.org/10.48550/arXiv.2512.10891
πŸ§΅πŸ‘‡

19.12.2025 17:56 πŸ‘ 22 πŸ” 5 πŸ’¬ 1 πŸ“Œ 2

Anh-Quan Pham, Marcel Hussing, Shubhankar P. Patankar, Dani S. Bassett, Jorge Mendez-Mendez, Eric Eaton
Iterative Compositional Data Generation for Robot Control
https://arxiv.org/abs/2512.10891

12.12.2025 05:39 πŸ‘ 0 πŸ” 1 πŸ’¬ 0 πŸ“Œ 0

Anh-Quan Pham, Kabir Ram Puri, Shreyas Raorane
SBAMP: Sampling Based Adaptive Motion Planning
https://arxiv.org/abs/2511.12022

18.11.2025 10:13 πŸ‘ 0 πŸ” 1 πŸ’¬ 0 πŸ“Œ 0
Post image

it appears your low-hanging-fruit phd project wasn't so risk free after all, mr. bond

31.07.2025 14:50 πŸ‘ 83 πŸ” 3 πŸ’¬ 1 πŸ“Œ 1

Did you get a chance to try the Kaya Toast too? Btw I spent 5 months doing an RL research internship there last year and made a list of must-visit spots for when my parents visited. Happy to share the list if you’re interested/have time 🫑

20.04.2025 07:55 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image

One of the courses I'm taking this sem revolves around this book, & I love it so far. It really provides new perspectives & approaches to understanding which robotics problems are solved and which aren't (my background is in RL so forgive me if the content is already common knowledge to people).

07.04.2025 05:03 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

True. I imagine a "reviewer mentor" just means triple the work, including read your assigned papers, mentees' assigned papers AND mentees' reviews to give feedback.

26.03.2025 23:11 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Do you mind sharing whaf were the obvious reason? I think it's a very good format to follow

26.03.2025 05:04 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

😭 (r we pretending that we didnt know?)
I remember there was one video in which they admitted that most shots take between 1-10 trials I think (the one where they dropped a basketball from an airplane took 2 iirc). It's still a surprisingly low number of trials compared to the avr person doing it.

22.03.2025 14:56 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

Except the fact that Dude Perfect admitted they only posted their perfect shots.

22.03.2025 13:13 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

Many people just didn't

16.12.2024 20:36 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Reminds me of the time I heard about people implementing progress bars to make users "feel" the process is faster. I think these cases can be referred to as examples of consumer behavior bias, things that are there to make people feel good :)
I haven't try o1 pro so no idea about its performance yet

16.12.2024 16:44 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Preview
My Jibo Is Dying and It's Breaking My Heart Jibo is a robot, but that doesn't make his digital dementia any less painful.

I had similar thoughts when reading about Jibo's shutdown
Cloud-based makes sense for keeping costs down, especially when rolling new updates. Really hope they maintain a local version or a way to keep things running too, since robots can bring much more emotional bonds than other old brick hardware

15.12.2024 06:15 πŸ‘ 2 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image
14.12.2024 17:30 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Curious to see the learning process πŸ‘€

08.12.2024 23:14 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0