Trending

#deepRL

Latest posts tagged with #deepRL on Bluesky

Latest Top
Trending

Posts tagged #deepRL

Post image

✨Two single author papers accepted to ICLR 2026!✨

Truly excited to present these results at #ICLR2026 !

@iclr-conf.bsky.social #ICLR26 #DeepRL #ICLR #ReinforcementLearning

0 0 0 0
Video

If you are interested in reinforcement learning or reinforcement learning training of language models, this might spark your interest!

✨See my new paper on scaling, capacity and complexity of reinforcement learning published at #AAAI2026 ! @aaai.org

#AAAI #AAAI26 #ReinforcementLearning #DeepRL

1 0 0 0
Post image

✨Excited to present these results at #AAAI2026 !

📜It is a must read if you are interested in reinforcement learning 📜 @aaai.org

Paper: Principled Analysis of Deep Reinforcement Learning Evaluation and Design Paradigms

#ReinforcementLearning #AAAI #AAAI26 #DeepRL

3 0 0 0
Stigmergy‑Inspired Deep RL Boosts Multi‑Robot Coordination

Stigmergy‑Inspired Deep RL Boosts Multi‑Robot Coordination

A new Stigmergic Multi‑Agent Deep RL framework (S‑MADRL) uses virtual pheromone fields for coordination. Simulations with up to eight robots showed better efficiency than MADDPG and MAPPO. Read more: getnews.me/stigmergy-inspired-deep-... #stigmergy #deeprl

0 0 0 0
Deep RL Tackles Close‑Enough Traveling Salesman Problem

Deep RL Tackles Close‑Enough Traveling Salesman Problem

Researchers introduced UD3RL, a dual‑decoder reinforcement‑learning model for the close‑enough traveling salesman problem, beating classic heuristics in tour quality and speed. Read more: getnews.me/deep-rl-tackles-close-en... #closeenoughtsp #deeprl

0 0 0 0
Hybrid DRL and Bounded ES Improves Control of Variable Systems

Hybrid DRL and Bounded ES Improves Control of Variable Systems

A hybrid controller blending deep reinforcement learning with bounded extremum seeking showed faster set‑point convergence and resilience in a time‑varying particle‑accelerator component. getnews.me/hybrid-drl-and-bounded-e... #hybridcontrol #deeprl

0 0 0 0
Multi‑Actor Multi‑Critic Deep RL Outperforms State‑of‑the‑Art on MuJoCo

Multi‑Actor Multi‑Critic Deep RL Outperforms State‑of‑the‑Art on MuJoCo

MAMC adds multiple actors and critics to deep deterministic RL, surpassing prior models on MuJoCo with higher rewards and faster convergence; the code is released on GitHub. Read more: getnews.me/multi-actor-multi-critic... #deeprl #mujoco #mamac

0 0 0 0
Survey of Deep Reinforcement Learning Approaches for Bipedal Robots

Survey of Deep Reinforcement Learning Approaches for Bipedal Robots

Survey classifies reinforcement learning for bipedal robots into end-to-end and hierarchical frameworks, urging unified, efficient designs. Updated Sep 27 2025, 17 pages. Read more: getnews.me/survey-of-deep-reinforce... #deeprl #bipedalrobots

0 0 0 0
XQC Improves Sample Efficiency in Deep Reinforcement Learning

XQC Improves Sample Efficiency in Deep Reinforcement Learning

XQC adds batch-norm, weight-norm and a distributional loss to Soft Actor-Critic, cutting critic condition numbers and improving sample efficiency on benchmarks. Submitted September 2025. Read more: getnews.me/xqc-improves-sample-effi... #deeprl #xqc #rl

0 0 0 0
Deep RL Improves Dynamic After‑sales Time Slot Scheduling

Deep RL Improves Dynamic After‑sales Time Slot Scheduling

Researchers propose attention‑based deep RL (ADRL‑RE) and scenario planning for after‑sales slot scheduling; ADRL‑RE outperforms rule‑based baselines, while SBP needs less compute. Read more: getnews.me/deep-rl-improves-dynamic... #deeprl #scheduling

0 0 0 0
Gradient Eligibility Traces Boost Deep Reinforcement Learning

Gradient Eligibility Traces Boost Deep Reinforcement Learning

A new study expands the projected Bellman error framework with multistep λ‑return eligibility traces, showing gradient‑based methods outperform PPO on MuJoCo and MinAtar tasks. Read more: getnews.me/gradient-eligibility-tra... #deeprl #eligibilitytraces

0 0 0 0
Deep RL Direct Gate Control Improves Buck Converter Speed

Deep RL Direct Gate Control Improves Buck Converter Speed

A deep‑reinforcement‑learning controller that directly drives the gate of a buck converter showed faster settling and reduced overshoot in simulations versus traditional PWM control. getnews.me/deep-rl-direct-gate-cont... #buckconverter #deeprl

0 0 0 0
Deep Reinforcement Learning Boosts Lift and Cuts Drag on 3D Wing

Deep Reinforcement Learning Boosts Lift and Cuts Drag on 3D Wing

Deep reinforcement learning control raised lift by 79% and cut drag 65% on an SD7003 wing at Re 60 000, boosting aerodynamic efficiency about 408%, in a high‑Reynolds‑number test. Read more: getnews.me/deep-reinforcement-learn... #deeprl #aerodynamics

0 0 0 0
La revolución de la IA: Aprendizaje por refuerzo desde cero hasta la Inteligencia Artificial General
La revolución de la IA: Aprendizaje por refuerzo desde cero hasta la Inteligencia Artificial General YouTube video by En la mente de la máquina, Inteligencia Artificial

La Revolución de la IA ya está aquí: del #ReinforcementLearning desde cero hasta la #AGI.
Q-learning, SARSA, LLMs y agentes que aprenden a pensar 🤖✨

📺 Mira el video completo 👉 youtu.be/R6MvIB7DHLU

#InteligenciaArtificial #MachineLearning #DeepRL

1 0 0 0

Being unable to scale #DeepRL to solve diverse, complex tasks with large distribution changes has been holding back the #RL community. In this work, we demonstrate that with the right architecture and optimization adjustments, agents can maintain plasticity for large networks.

4 0 0 0

We tackled this challenge with behavioral experiments in mice, Bayesian theory, and #DeepRL. Using a novel change-detection task, we show how mice and networks adapt on the first trial from a context change by inferring both context and meaning—without trial and error.

0 0 1 0
Post image

If you are interested in large language models see my paper below on how we can uncover the biases learned by these models.

Link: neurips2023-enlsp.github.io/papers/paper...

#ReinforcementLearning #FoundationModels #DeepRL #DeepReinforcementLearning #ResponsibleAI #AIBias #LLMs #LanguageModels

2 0 0 0
Preview
a person is playing a game of go on a table ALT: a person is playing a game of go on a table

🔍 The History of Reinforcement Learning (Updated for 2025)

From Thorndike’s cat puzzle box 🐱📦 to DeepMind’s AlphaGo 🤖🏆 to DeepSeek-R1 —how did RL become a key AI breakthrough?

📖 Read the full history:
👉 researchdatapod.com/history-rein...

#AI #ReinforcementLearning #DeepSeek #DeepRL #history

1 0 0 0
Preview
GitHub - EzgiKorkmaz/adversarial-reinforcement-learning: Reading list for adversarial perspective and robustness in deep reinforcement learning. Reading list for adversarial perspective and robustness in deep reinforcement learning. - EzgiKorkmaz/adversarial-reinforcement-learning

If you are interested in deep reinforcement learning, I will share this repo here:

Link: github.com/EzgiKorkmaz/...

#ReinforcementLearning #SafeAI #Adversarial #Robust #DeepRL #robustRL #LanguageModels #AdversarialRL #AISafety #ExplainableAI #TrustworthyAI #ResponsibleAI #DeepReinforcementLearning

5 0 0 0
Post image

I am teaching a class on #FoundationalModels for #robotics and Scaling #DeepRL algorithms. This class expands on last year's class and my generalist robotics policies tutorial and code. I plan to share the lectures and code assignments. Starting with the first lectures below.

21 6 1 0
Post image

A recent paper I wrote introduces foundational analysis on deep reinforcement learning decision making and representations learnt by it.

Link: proceedings.mlr.press/v235/korkmaz...

#ReinforcementLearning #ICLR2025 #ACL2025 #NAACL2025 #NeurIPS2024 #ICML2025 #DeepRL #DeepReinforcementLearning

14 2 0 0
Post image

I wrote a recent survey about deep reinforcement learning. The paper is a compact guide to understand some of the key concepts in reinforcement learning.

Link: arxiv.org/pdf/2401.023...

#ReinforcementLearning #ICLR2025 #ACL2025 #NAACL2025 #NeurIPS2024 #ICML2025 #DeepRL #DeepReinforcementLearning

39 9 1 0
Preview
NeurIPS2024 Related RL papers | Notion Deep RL papers

#NeurIPS2024 wrapped up last week. I put together a curated reading list for #DeepRL and #reinforcementlearning work. (represents my interests).

Talks and workshops:
third-crowd-c77.notion.site/NeurIPS2024-...

Curated reading list
fracturedplane.notion.site/NeurIPS2024-...

#Holidayreading

70 14 0 0
Post image Post image

We show that reducing churn by regularizing out-of-batch data reduces these chain effects and results in improved sample efficiency and scaling. #deepRL #reinforcementlearning

1 0 1 0
Post image

Training #deepRL agents has always been a tricky and unstable process. What is the cause of these instabilities? We study the coupling effects of policy training and value estimation and find a chain effect of the value and policy churn in popular DRL agents.

16 2 2 0

If you are curious about deep reinforcement learning find the compact highlights of my recent papers in this new short piece:

#NeurIPS2024 @neuripsconf.bsky.social #NeurIPS24
#reinforcementlearning #AIsafety #AISecurity #ResponsibleAI #TrustworthyAI #RobustAI #DeepRL

bsky.app/profile/ezgi...

2 1 0 0
Preview
GitHub - mohmdelsayed/streaming-drl: Deep reinforcement learning without experience replay, target networks, or batch updates. Deep reinforcement learning without experience replay, target networks, or batch updates. - mohmdelsayed/streaming-drl

Cool #DeepRL paper from University of Alberta:

"Deep reinforcement learning without experience replay, target networks, or batch updates"

Deep RL networks in the streaming setting without replay buffers thanks to signal normalization & step-size bounding 🤯

📄Paper: openreview.net/pdf?id=yqQJG...

2 0 0 0
Post image Post image

We show that reducing churn by regularizing out-of-batch data reduces these chain effects and results in improved sample efficiency and scaling. #deepRL #reinforcementlearning

1 0 1 0
Preview
Огляд досягнень у галузі глибокого навчання з підкріпленням | TheTransmitted Глибоке навчання з підкріпленням (Deep RL) поєднує в собі навчання з підкріпленням і глибоке навчання, демонструючи безпрецедентний успіх у вирішенні складних завдань, які колись вважалися недосяжними...

Глибоке навчання з підкріпленням (Deep RL) поєднує в собі навчання з підкріпленням і глибоке навчання, демонструючи безпрецедентний успіх у вирішенні складних завдань, які колись вважалися недосяжними для машин.

#AI #DeepRL #ML #ШІ

1 0 0 0

Now that #CreativeProblemSolving is in the limelight, our AIGenC model (🖋️@corinacatarau1 @EAlonso20) may interest you.
Compatible with heat.


#creativity #generalisation #deeprl #ai #reinforcementlearning #HierarchicalRepresentations, #graphs #transfer

arxiv.org/abs/2205.09738

0 0 0 0