Combinatorial Multi-armed Bandits: Arm Selection via Group TestingArpan MukherjeeShashanka Ubaruet al.2025TMLR
Mean-based Best Arm Identification in Stochastic Bandits under Reward ContaminationArpan MukherjeeAli Tajeret al.2021NeurIPS 2021