Win-Stay, Lose-Shift algorithm: choose an arm at random, and keep pulling it as long as it keeps paying off. If the arm doesn't pay off after a particular pull, then switch to the other one [...] Robbins proved in 1952 that it performs reliably better than chance. #BrianChristian

Comments

Popular posts from this blog

We humans are unhappy in large part because we are insatiable [...] Rather than feeling satisfied, we feel a bit bored, and in response to this boredom, we go on to form new, even grander desires. #WilliamBIrvine