Loading...

How to Win Slot Machines - Intro to Deep Learning #13 - Siraj Raval - 深度學習 Deep Learning 公開課 - Cupoy

We'll learn how to solve the multi-armed bandit problem (maximizing success for a given slot machine...

AI共學社群

We'll learn how to solve the multi-armed bandit problem (maximizing success for a given slot machine) using a reinforcement learning technique called policy gradients. Code for this video: https://github.com/llSourcell/how_to_... Mike's winning code: https://github.com/xkortex/Siraj_Chat... Vishal's runner up code: https://github.com/erilyth/DeepLearni... this coding challenge was really close, so i'm also going to put code for 3rd place just this time (Eibriel): https://github.com/Eibriel/ice-cream-...