research ∙ 04/26/2023
    Thompson Sampling Regret Bounds for Contextual Bandits with sub-Gaussian rewards
In this work, we study the performance of the Thompson Sampling algorith...
          
research ∙ 07/18/2022
     
             
  
  
     