Scalable multi-agent reinforcement learning for aggregation systems