Using Offline Reinforcement Learning to Trial Online Platform Interventions | by Daniel Miller | Nov, 2024

Offline reinforcement learning and simulation to strategize online engagement.

Synopsis

This article extends our research into predicting engagement on online platforms using deep learning. We found that predicting user behaviour depended on having ample historical data, which was consistent with current research in the area.

Non-paid platforms often seek to encourage and reward participation through badges, medals and online incentives. Although these can be effective, they often yield unintended behaviours and mixed results. Both Coursera and StackOverflow, for example, have witnessed “steering”: users often work to achieve a badge and then disengage from the platform. Although this boosts engagement in the short term, it is a limited strategy for converting minimal users into long-term participants.

Example of the steering effect of online incentives. Referenced from Yanovsky, Hoernle and Gal’s study of incentive seeking on StackOverflow.

Trialling incentives can also carry risk. A Zooniverse project called Old Weather trialled a competitive ranking strategy, with detrimental results.
