Navigation auf uzh.ch

Suche

Department of Informatics Computation and Economics Research Group

Details for Talk on: 19.11.2019

  • Speaker: Craig Boutilier (Google AI)
  • Title: Reinforcement Learning in Recommender Systems: Some Challenges
  • Abstract: I'll present a brief overview of some recent work on reinforcement learning motivated by practical issues that arise in the application of RL to online, user-facing applications like recommender systems. These include (a) stochastic action sets; (b) long-term cumulative effects; and (c) combinatorial action spaces. With respect to (c) I will discuss SlateQ, a novel decomposition technique that allows value-based RL (e.g., Q-learning) in slate-based recommender to scale to commercial production systems, and briefly describe both small-scale simulation and a large-scale experiment with YouTube.  With respect to (b), I will briefly discuss advantage amplification, a temporal aggregation technique that allows for more effective RL in partially observable domains with low SNR, as often arise in recommender systems.

    Joint work with various collaborators.

Weiterführende Informationen

Title

Teaser text