Using Offline Reinforcement Learning to Trial Online Platform Interventions | by Daniel Miller | Nov, 2024

Offline reinforcement learning and simulation to strategize online engagement.

Picture by Fernando Freitas on Unsplash

Synopsis

This article extends our research into predicting engagement on online platforms using deep learning. We found that predicting user behaviour depended on having sufficient historical data, which was consistent with existing research in the area.

Non-paid platforms often seek to encourage and reward participation through badges, medals and online incentives. Although these can be effective, they often yield unintended behaviours and mixed results. Both Coursera and StackOverflow, for example, have witnessed “steering”: users often work to achieve a badge, then disengage from the platform. Although this boosts engagement in the short term, it is a limited strategy for converting minimally engaged users into long-term participants.

Example of the steering effect of online incentives. Referenced from Yanovsky, Hoernle and Gal’s study into incentive seeking on StackOverflow.

Trialling incentives can also come with risk. A Zooniverse project called Old Weather trialled a competitive ranking strategy, with detrimental results.