Publications by Tuomas Sandholm

Conference

A Multiagent Path Search Algorithm for Large-Scale Coalition Structure Generation

2024 • Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS • 2024-May:2489-2491
Taguelmimt R, Aknine S, Boukredera D, Changder N, Sandholm T

Conference

CONFRONTING REWARD MODEL OVEROPTIMIZATION WITH CONSTRAINED RLHF

2024 • 12th International Conference on Learning Representations, ICLR 2024
Moskovitz T, Singh AK, Strouse DJ, Sandholm T, Salakhutdinov R, Dragan AD, McAleer S

Conference

Convergence of log(1/ϵ) for Gradient-Based Algorithms in Zero-Sum Games without the Condition Number: A Smoothed Analysis

2024 • Advances in Neural Information Processing Systems • 37:
Anagnostides I, Sandholm T

Conference

Efficient Size-based Hybrid Algorithm for Optimal Coalition Structure Generation

2024 • Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS • 2024-May:2492-2494
Taguelmimt R, Aknine S, Boukredera D, Changder N, Sandholm T

Conference

Efficient Φ-Regret Minimization with Low-Degree Swap Deviations in Extensive-Form Games

2024 • Advances in Neural Information Processing Systems • 37:
Zhang BH, Anagnostides I, Farina G, Sandholm T

Conference

GAME-THEORETIC ROBUST RL HANDLES TEMPORALLY-COUPLED PERTURBATIONS

2024 • 12th International Conference on Learning Representations, ICLR 2024
Liang Y, Sun Y, Zheng R, Liu X, Eysenbach B, Sandholm T, Huang F, McAleer S
Displaying 1 - 25 of 586