Text this: A Multi-Agent Centralized Strategy Gradient Reinforcement Learning Algorithm Based on State Transition