Learning Flappy Bird Game Using DQN
2017-07-03
Learning Flappy Bird Game Using DQN
The algorithm uses a CNN to produce the Q values under current states(image), the longer the time of the game the higher the reward is. For the future reward, I use a gamma, which is 0.9.