Online learning approaches to optimize database join operations in PostgreSQL.
-
Updated
Sep 17, 2024 - C
Online learning approaches to optimize database join operations in PostgreSQL.
Network-Oriented Repurposing of Drugs Python Package
Another A/B test library
Contextual bandit algorithm called LinUCB / Linear Upper Confidence Bounds as proposed by Li, Langford and Schapire
Privacy-Preserving Bandits (MLSys'20)
Client that handles the administration of StreamingBandit online, or straight from your desktop. Setup and run streaming (contextual) bandit experiments in your browser.
Solutions to the Stanford CS:234 Reinforcement Learning 2022 course assignments.
A small collection of Bandit Algorithms (ETC, E-Greedy, Elimination, UCB, Exp3, LinearUCB, and Thompson Sampling)
Reinforcement learning
Solutions and figures for problems from Reinforcement Learning: An Introduction Sutton&Barto
Adversarial multi-armed bandit algorithms
Thompson Sampling Tutorial
Research project on automated A/B testing of software by evolutionary bandits.
Movie Recommendation using Cascading Bandits namely CascadeLinTS and CascadeLinUCB
This presentation contains very precise yet detailed explanation of concepts of a very interesting topic -- Reinforcement Learning.
Add a description, image, and links to the bandit-algorithm topic page so that developers can more easily learn about it.
To associate your repository with the bandit-algorithm topic, visit your repo's landing page and select "manage topics."