Please follow this to implement DQN under the OpenAI gym environment.
(7 points) Define a RL_brain for DQN.
(2 points) Set the main loop
(1 point) Plot the reward along the timesteps.
(10 points) Please complete this PyG exercise