#deepRL

@ezgikorkmaz.bsky.social

2 weeks ago

✨Two single author papers accepted to ICLR 2026!✨

Truly excited to present these results at #ICLR2026 !

@iclr-conf.bsky.social #ICLR26 #DeepRL #ICLR #ReinforcementLearning

0 0 0 0

Ezgi Korkmaz

@ezgikorkmaz.bsky.social

1 month ago

If you are interested in reinforcement learning or reinforcement learning training of language models, this might spark your interest!

✨See my new paper on scaling, capacity and complexity of reinforcement learning published at #AAAI2026 ! @aaai.org

#AAAI #AAAI26 #ReinforcementLearning #DeepRL

1 0 0 0

Ezgi Korkmaz

@ezgikorkmaz.bsky.social

1 month ago

✨Excited to present these results at #AAAI2026 !

📜It is a must read if you are interested in reinforcement learning 📜 @aaai.org

Paper: Principled Analysis of Deep Reinforcement Learning Evaluation and Design Paradigms

#ReinforcementLearning #AAAI #AAAI26 #DeepRL

3 0 0 0

GetNews.me

@getnews-me.bsky.social

5 months ago

Stigmergy‑Inspired Deep RL Boosts Multi‑Robot Coordination

A new Stigmergic Multi‑Agent Deep RL framework (S‑MADRL) uses virtual pheromone fields for coordination. Simulations with up to eight robots showed better efficiency than MADDPG and MAPPO. Read more: getnews.me/stigmergy-inspired-deep-... #stigmergy #deeprl

0 0 0 0

GetNews.me

@getnews-me.bsky.social

5 months ago

Deep RL Tackles Close‑Enough Traveling Salesman Problem

Researchers introduced UD3RL, a dual‑decoder reinforcement‑learning model for the close‑enough traveling salesman problem, beating classic heuristics in tour quality and speed. Read more: getnews.me/deep-rl-tackles-close-en... #closeenoughtsp #deeprl

0 0 0 0

GetNews.me

@getnews-me.bsky.social

5 months ago

Hybrid DRL and Bounded ES Improves Control of Variable Systems

A hybrid controller blending deep reinforcement learning with bounded extremum seeking showed faster set‑point convergence and resilience in a time‑varying particle‑accelerator component. getnews.me/hybrid-drl-and-bounded-e... #hybridcontrol #deeprl

0 0 0 0

GetNews.me

@getnews-me.bsky.social

5 months ago

Multi‑Actor Multi‑Critic Deep RL Outperforms State‑of‑the‑Art on MuJoCo

MAMC adds multiple actors and critics to deep deterministic RL, surpassing prior models on MuJoCo with higher rewards and faster convergence; the code is released on GitHub. Read more: getnews.me/multi-actor-multi-critic... #deeprl #mujoco #mamac

0 0 0 0

GetNews.me

@getnews-me.bsky.social

5 months ago

Survey of Deep Reinforcement Learning Approaches for Bipedal Robots

Survey classifies reinforcement learning for bipedal robots into end-to-end and hierarchical frameworks, urging unified, efficient designs. Updated Sep 27 2025, 17 pages. Read more: getnews.me/survey-of-deep-reinforce... #deeprl #bipedalrobots

0 0 0 0

GetNews.me

@getnews-me.bsky.social

5 months ago

XQC Improves Sample Efficiency in Deep Reinforcement Learning

XQC adds batch-norm, weight-norm and a distributional loss to Soft Actor-Critic, cutting critic condition numbers and improving sample efficiency on benchmarks. Submitted September 2025. Read more: getnews.me/xqc-improves-sample-effi... #deeprl #xqc #rl

0 0 0 0

GetNews.me

@getnews-me.bsky.social

5 months ago

Deep RL Improves Dynamic After‑sales Time Slot Scheduling

Researchers propose attention‑based deep RL (ADRL‑RE) and scenario planning for after‑sales slot scheduling; ADRL‑RE outperforms rule‑based baselines, while SBP needs less compute. Read more: getnews.me/deep-rl-improves-dynamic... #deeprl #scheduling

0 0 0 0

GetNews.me

@getnews-me.bsky.social

5 months ago

Gradient Eligibility Traces Boost Deep Reinforcement Learning

A new study expands the projected Bellman error framework with multistep λ‑return eligibility traces, showing gradient‑based methods outperform PPO on MuJoCo and MinAtar tasks. Read more: getnews.me/gradient-eligibility-tra... #deeprl #eligibilitytraces

0 0 0 0

GetNews.me

@getnews-me.bsky.social

5 months ago

Deep RL Direct Gate Control Improves Buck Converter Speed

A deep‑reinforcement‑learning controller that directly drives the gate of a buck converter showed faster settling and reduced overshoot in simulations versus traditional PWM control. getnews.me/deep-rl-direct-gate-cont... #buckconverter #deeprl

0 0 0 0

GetNews.me

@getnews-me.bsky.social

5 months ago

Deep Reinforcement Learning Boosts Lift and Cuts Drag on 3D Wing

Deep reinforcement learning control raised lift by 79% and cut drag 65% on an SD7003 wing at Re 60 000, boosting aerodynamic efficiency about 408%, in a high‑Reynolds‑number test. Read more: getnews.me/deep-reinforcement-learn... #deeprl #aerodynamics

0 0 0 0

En la mente de la máquina

@elmdlm.bsky.social

6 months ago

La revolución de la IA: Aprendizaje por refuerzo desde cero hasta la Inteligencia Artificial General YouTube video by En la mente de la máquina, Inteligencia Artificial

La Revolución de la IA ya está aquí: del #ReinforcementLearning desde cero hasta la #AGI.
Q-learning, SARSA, LLMs y agentes que aprenden a pensar 🤖✨

📺 Mira el video completo 👉 youtu.be/R6MvIB7DHLU

#InteligenciaArtificial #MachineLearning #DeepRL

1 0 0 0

Glen Berseth

@glenberseth.bsky.social

8 months ago

Being unable to scale #DeepRL to solve diverse, complex tasks with large distribution changes has been holding back the #RL community. In this work, we demonstrate that with the right architecture and optimization adjustments, agents can maintain plasticity for large networks.

4 0 0 0

Jonathan Kadmon

@kadmonj.bsky.social

1 year ago

We tackled this challenge with behavioral experiments in mice, Bayesian theory, and #DeepRL. Using a novel change-detection task, we show how mice and networks adapt on the first trial from a context change by inferring both context and meaning—without trial and error.

0 0 1 0

Ezgi Korkmaz

@ezgikorkmaz.bsky.social

1 year ago

If you are interested in large language models see my paper below on how we can uncover the biases learned by these models.

Link: neurips2023-enlsp.github.io/papers/paper...

#ReinforcementLearning #FoundationModels #DeepRL #DeepReinforcementLearning #ResponsibleAI #AIBias #LLMs #LanguageModels

2 0 0 0

The Research Scientist Pod

@scientistpod.bsky.social

1 year ago

a person is playing a game of go on a table ALT: a person is playing a game of go on a table

🔍 The History of Reinforcement Learning (Updated for 2025)

From Thorndike’s cat puzzle box 🐱📦 to DeepMind’s AlphaGo 🤖🏆 to DeepSeek-R1 —how did RL become a key AI breakthrough?

📖 Read the full history:
👉 researchdatapod.com/history-rein...

#AI #ReinforcementLearning #DeepSeek #DeepRL #history

1 0 0 0

Ezgi Korkmaz

@ezgikorkmaz.bsky.social

1 year ago

GitHub - EzgiKorkmaz/adversarial-reinforcement-learning: Reading list for adversarial perspective and robustness in deep reinforcement learning. Reading list for adversarial perspective and robustness in deep reinforcement learning. - EzgiKorkmaz/adversarial-reinforcement-learning

If you are interested in deep reinforcement learning, I will share this repo here:

Link: github.com/EzgiKorkmaz/...

#ReinforcementLearning #SafeAI #Adversarial #Robust #DeepRL #robustRL #LanguageModels #AdversarialRL #AISafety #ExplainableAI #TrustworthyAI #ResponsibleAI #DeepReinforcementLearning

5 0 0 0

Glen Berseth

@glenberseth.bsky.social

1 year ago

I am teaching a class on #FoundationalModels for #robotics and Scaling #DeepRL algorithms. This class expands on last year's class and my generalist robotics policies tutorial and code. I plan to share the lectures and code assignments. Starting with the first lectures below.

21 6 1 0

Ezgi Korkmaz

@ezgikorkmaz.bsky.social

1 year ago

A recent paper I wrote introduces foundational analysis on deep reinforcement learning decision making and representations learnt by it.

Link: proceedings.mlr.press/v235/korkmaz...

#ReinforcementLearning #ICLR2025 #ACL2025 #NAACL2025 #NeurIPS2024 #ICML2025 #DeepRL #DeepReinforcementLearning

14 2 0 0

Ezgi Korkmaz

@ezgikorkmaz.bsky.social

1 year ago

I wrote a recent survey about deep reinforcement learning. The paper is a compact guide to understand some of the key concepts in reinforcement learning.

Link: arxiv.org/pdf/2401.023...

#ReinforcementLearning #ICLR2025 #ACL2025 #NAACL2025 #NeurIPS2024 #ICML2025 #DeepRL #DeepReinforcementLearning

39 9 1 0

Glen Berseth

@glenberseth.bsky.social

1 year ago

#NeurIPS2024 wrapped up last week. I put together a curated reading list for #DeepRL and #reinforcementlearning work. (represents my interests).

Talks and workshops:
third-crowd-c77.notion.site/NeurIPS2024-...

Curated reading list
fracturedplane.notion.site/NeurIPS2024-...

#Holidayreading

70 14 0 0

Glen Berseth

@glenberseth.bsky.social

1 year ago

We show that reducing churn by regularizing out-of-batch data reduces these chain effects and results in improved sample efficiency and scaling. #deepRL #reinforcementlearning

1 0 1 0

Glen Berseth

@glenberseth.bsky.social

1 year ago

Training #deepRL agents has always been a tricky and unstable process. What is the cause of these instabilities? We study the coupling effects of policy training and value estimation and find a chain effect of the value and policy churn in popular DRL agents.

16 2 2 0

Ezgi Korkmaz

@ezgikorkmaz.bsky.social

1 year ago

If you are curious about deep reinforcement learning find the compact highlights of my recent papers in this new short piece:

#NeurIPS2024 @neuripsconf.bsky.social #NeurIPS24
#reinforcementlearning #AIsafety #AISecurity #ResponsibleAI #TrustworthyAI #RobustAI #DeepRL

bsky.app/profile/ezgi...

2 1 0 0

Mattia Rigotti

@matrig.net

1 year ago

GitHub - mohmdelsayed/streaming-drl: Deep reinforcement learning without experience replay, target networks, or batch updates. Deep reinforcement learning without experience replay, target networks, or batch updates. - mohmdelsayed/streaming-drl

Cool #DeepRL paper from University of Alberta:

"Deep reinforcement learning without experience replay, target networks, or batch updates"

Deep RL networks in the streaming setting without replay buffers thanks to signal normalization & step-size bounding 🤯

📄Paper: openreview.net/pdf?id=yqQJG...

2 0 0 0

Glen Berseth

@glenberseth.bsky.social

1 year ago

We show that reducing churn by regularizing out-of-batch data reduces these chain effects and results in improved sample efficiency and scaling. #deepRL #reinforcementlearning

1 0 1 0

TheTransmitted

@thetransmitted.com

1 year ago

Огляд досягнень у галузі глибокого навчання з підкріпленням | TheTransmitted Глибоке навчання з підкріпленням (Deep RL) поєднує в собі навчання з підкріпленням і глибоке навчання, демонструючи безпрецедентний успіх у вирішенні складних завдань, які колись вважалися недосяжними...

Глибоке навчання з підкріпленням (Deep RL) поєднує в собі навчання з підкріпленням і глибоке навчання, демонструючи безпрецедентний успіх у вирішенні складних завдань, які колись вважалися недосяжними для машин.

#AI #DeepRL #ML #ШІ

1 0 0 0

Esther Mondragón

@e-mondragon.bsky.social

3 years ago

Now that #CreativeProblemSolving is in the limelight, our AIGenC model (🖋️@corinacatarau1 @EAlonso20) may interest you.
Compatible with heat.

#creativity #generalisation #deeprl #ai #reinforcementlearning #HierarchicalRepresentations, #graphs #transfer

arxiv.org/abs/2205.09738

0 0 0 0

Posts tagged #deepRL