Win-Stay, Lose-Shift algorithm: choose an arm at random, and keep pulling it as long as it keeps paying off. If the arm doesn't pay off after a particular pull, then switch to the other one [...] Robbins proved in 1952 that it performs reliably better than chance. #BrianChristian
Win-Stay, Lose-Shift algorithm: choose an arm at random, and keep pulling it as long as it keeps paying off. If the arm doesn't pay off after a particular pull, then switch to the other one [...] Robbins proved in 1952 that it performs reliably better than chance. #BrianChristian
— English Quotes (@english_quotes) Sep 19, 2024
Comments
Post a Comment