TY - GEN
T1 - Deep Reinforcement Learning based Swarm Motion for Collision Avoidance via Self-configurable Potential Formation
AU - Soza, Marlon
AU - Herrera, Marco
AU - Camacho, Oscar
AU - Vásconez, Juan Pablo
AU - Andrade, Roberto
AU - Prado-Romo, Alvaro Javier
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - One of the main challenges in swarm robotics is modeling and controlling the collective dynamics arising from local interactions among many agents. Classical control methods struggle with the nonlinear, decentralized, and adaptive behaviors required for swarm coordination in dynamic environments. This work presents a Deep Reinforcement Learning (DRL)-based flocking control framework that enables emergent dynamics within an adjustable formation while ensuring collision avoidance. A Deep Deterministic Policy Gradient (DDPG) agent learns coordinated strategies directly from experience, avoiding explicit modeling of inter-agent dynamics. To represent a formation-adaptable model, a Potential Linked Nodes (PLN) structure modifies swarm kinematics to reach goals and avoid obstacles. The DRL-based model regulates swarm positioning via Artificial Potential Fields (APF), while the agent controls collective motion through linear and angular velocities. Experiments with different swarm sizes and environments assessed performance using flocking m etrics such as distance error and collisions, confirming s table transitions. Results validated that the DDPG agent achieves reliable flocking toward targets while avoiding obstacles.
AB - One of the main challenges in swarm robotics is modeling and controlling the collective dynamics arising from local interactions among many agents. Classical control methods struggle with the nonlinear, decentralized, and adaptive behaviors required for swarm coordination in dynamic environments. This work presents a Deep Reinforcement Learning (DRL)-based flocking control framework that enables emergent dynamics within an adjustable formation while ensuring collision avoidance. A Deep Deterministic Policy Gradient (DDPG) agent learns coordinated strategies directly from experience, avoiding explicit modeling of inter-agent dynamics. To represent a formation-adaptable model, a Potential Linked Nodes (PLN) structure modifies swarm kinematics to reach goals and avoid obstacles. The DRL-based model regulates swarm positioning via Artificial Potential Fields (APF), while the agent controls collective motion through linear and angular velocities. Experiments with different swarm sizes and environments assessed performance using flocking m etrics such as distance error and collisions, confirming s table transitions. Results validated that the DDPG agent achieves reliable flocking toward targets while avoiding obstacles.
KW - Deep reinforcement learning
KW - collision avoidance
KW - coordinate motion
KW - deterministic policy gradient
KW - potential fields
UR - https://www.scopus.com/pages/publications/105037992386
U2 - 10.1109/CHILECON66915.2025.11476358
DO - 10.1109/CHILECON66915.2025.11476358
M3 - Contribución a la conferencia
AN - SCOPUS:105037992386
T3 - Proceedings - IEEE CHILEAN Conference on Electrical, Electronics Engineering, Information and Communication Technologies, ChileCon
BT - 2025 IEEE CHILEAN Conference on Electrical, Electronics Engineering, Information and Communication Technologies, CHILECON 2025
A2 - Lefranc, Gaston
A2 - Cubillos, Claudio
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2025 IEEE CHILEAN Conference on Electrical, Electronics Engineering, Information and Communication Technologies, CHILECON 2025
Y2 - 28 October 2025 through 30 October 2025
ER -