Safe-Multi-Agent-Deep-Policy-Gradient-Optimization-Algorithm
This repository contains the code for a new Safe Multi-Agent Reinforcement Learning (MARL) algorithm. It combines deep policy gradients with a Lagrangian multiplier framework so that autonomous agents learn cooperative strategies that maximize reward while satisfying safety constraints.
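
As a rough illustration of how a Lagrangian multiplier can couple reward maximization with a safety constraint, here is a minimal sketch. It is not taken from this repository's code: the function names, the scalar `cost_limit`, and the learning rate `lr` are hypothetical, and it assumes the constraint has the common form "expected cost must not exceed a limit". The policy would ascend the penalised objective, while the multiplier is adjusted by projected dual ascent.

```python
# Illustrative sketch only; names and the constraint form are assumptions,
# not this repository's API.

def lagrangian_objective(reward_return, cost_return, lam, cost_limit):
    """Penalised objective for the policy: reward minus lam times the constraint violation."""
    return reward_return - lam * (cost_return - cost_limit)


def update_multiplier(lam, cost_return, cost_limit, lr=0.05):
    """Projected dual ascent: increase lam when cost exceeds the limit, clip at zero."""
    return max(0.0, lam + lr * (cost_return - cost_limit))
```

In a training loop of this kind, `update_multiplier` would typically be called after each batch of rollouts, so that repeated constraint violations make the penalty term weigh more heavily in the next policy-gradient step, and the penalty relaxes toward zero once the agents stay within the limit.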