Reinforcement learning for collective multi-agent decision making