Win-Stay, Lose-Shift algorithm: choose an arm at random, and keep pulling it as long as it keeps paying off. If the arm doesn't pay off after a particular pull, then switch to the other one [...] Robbins proved in 1952 that it performs reliably better than chance. #BrianChristian

Comments

Popular posts from this blog

We humans are unhappy in large part because we are insatiable [...] Rather than feeling satisfied, we feel a bit bored, and in response to this boredom, we go on to form new, even grander desires. #WilliamBIrvine

If a parent or carer is warm, consistent, attuned, steady and kind, the child will thrive. It will have confidence in itself and in the world. It will know how to love and will have the courage to start relationships, secure [and calmly complain when neglected] #TheSchoolOfLife