Publications by Tuomas Sandholm

Conference

CONFRONTING REWARD MODEL OVEROPTIMIZATION WITH CONSTRAINED RLHF

2024 • 12th International Conference on Learning Representations Iclr 2024
Moskovitz T, Singh AK, Strouse DJ, Sandholm T, Salakhutdinov R, Dragan AD, McAleer S

Preprint

Convergence of $\text{log}(1/\epsilon)$ for Gradient-Based Algorithms in Zero-Sum Games without the Condition Number: A Smoothed Analysis

2024
Anagnostides I, Sandholm T

Conference

Convergence of log(1/ϵ) for Gradient-Based Algorithms in Zero-Sum Games without the Condition Number: A Smoothed Analysis

2024 • Advances in Neural Information Processing Systems • 37:
Anagnostides I, Sandholm T

Preprint

Efficient $\Phi$-Regret Minimization with Low-Degree Swap Deviations in Extensive-Form Games

2024
Zhang BH, Anagnostides I, Farina G, Sandholm T

Conference

Efficient Size-based Hybrid Algorithm for Optimal Coalition Structure Generation

2024 • Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS • 2024-May:2492-2494
Taguelmimt R, Aknine S, Boukredera D, Changder N, Sandholm T

Conference

Efficient Φ-Regret Minimization with Low-Degree Swap Deviations in Extensive-Form Games

2024 • Advances in Neural Information Processing Systems • 37:
Zhang BH, Anagnostides I, Farina G, Sandholm T

Preprint

Exponential Lower Bounds on the Double Oracle Algorithm in Zero-Sum Games

2024
Zhang BH, Sandholm T

Conference

Exponential Lower Bounds on the Double Oracle Algorithm in Zero-Sum Games

2024 • IJCAI International Joint Conference on Artificial Intelligence • 3032-3039
Zhang BH, Sandholm T

Preprint

Faster Game Solving via Hyperparameter Schedules

2024
Zhang N, McAleer S, Sandholm T

Conference

Faster Optimal Coalition Structure Generation via Offline Coalition Selection and Graph-Based Search

2024 • IJCAI International Joint Conference on Artificial Intelligence • 238-248
Taguelmimt R, Aknine S, Boukredera D, Changder N, Sandholm T

Preprint

Faster Optimal Coalition Structure Generation via Offline Coalition Selection and Graph-Based Search

2024
Taguelmimt R, Aknine S, Boukredera D, Changder N, Sandholm T

Conference

GAME-THEORETIC ROBUST RL HANDLES TEMPORALLY-COUPLED PERTURBATIONS

2024 • 12th International Conference on Learning Representations Iclr 2024
Liang Y, Sun Y, Zheng R, Liu X, Eysenbach B, Sandholm T, Huang F, McAleer S

Conference

Hidden-Role Games: Equilibrium Concepts and Computation

2024 106-107
Carminati L, Zhang BH, Farina G, Gatti N, Sandholm T

Journal Article

How Much Data Is Sufficient to Learn High-Performing Algorithms?

2024 • Journal of the ACM • 71(5):
Balcan M-F, Deblasio D, Dick T, Kingsford C, Sandholm T, Vitercik E

Preprint

Imperfect-Recall Games: Equilibrium Concepts and Their Complexity

2024
Tewolde E, Zhang BH, Oesterheld C, Zampetakis M, Sandholm T, Goldberg PW, Conitzer V

Conference

Imperfect-Recall Games: Equilibrium Concepts and Their Complexity

2024 • IJCAI International Joint Conference on Artificial Intelligence • 2994-3004
Tewolde E, Zhang BH, Oesterheld C, Zampetakis M, Sandholm T, Goldberg P, Conitzer V

Journal Article

Learning to Branch: Generalization Guarantees and Limits of Data-Independent Discretization

2024 • Journal of the ACM • 71(2):
Balcan M-F, Dick T, Sandholm T, Vitercik E

Conference

MEDIATOR INTERPRETATION AND FASTER LEARNING ALGORITHMS FOR LINEAR CORRELATED EQUILIBRIA IN GENERAL EXTENSIVE-FORM GAMES

2024 • 12th International Conference on Learning Representations Iclr 2024
Zhang BH, Farina G, Sandholm T

Conference

Model-Free Preference Elicitation

2024 • IJCAI International Joint Conference on Artificial Intelligence • 3493-3503
Martin C, Boutilier C, Meshi O, Sandholm T

Preprint

New Sequence-Independent Lifting Techniques for Cutting Planes and When They Induce Facets

2024
Prasad S, Vitercik E, Balcan M-F, Sandholm T

Conference

On the Complexity of Computing Sparse Equilibria and Lower Bounds for No-Regret Learning in Games

2024 • Leibniz International Proceedings in Informatics
Anagnostides I, Kalavasis A, Sandholm T, Zampetakis M

Preprint

On the Outcome Equivalence of Extensive-Form and Behavioral Correlated Equilibria

2024
Zhang BH, Sandholm T

Conference

On the Outcome Equivalence of Extensive-Form and Behavioral Correlated Equilibria

2024 • Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence • 9969-9976
Zhang BH, Sandholm T

Journal Article

Optimistic Policy Gradient in Multi-Player Markov Games with a Single Controller: Convergence beyond the Minty Property

2024 • Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence • 9451-9459
Anagnostides I, Panageas I, Farina G, Sandholm T

Preprint

Scalable Mechanism Design for Multi-Agent Path Finding

2024
Friedrich P, Zhang Y, Curry M, Dierks L, McAleer S, Li J, Sandholm T, Seuken S

About Main page

Admissions Main page

Academics Main page

People Main page

Research Main page

Publications by Tuomas Sandholm

Conference

Preprint

Conference

Preprint

Conference

Conference

Preprint

Conference

Preprint

Conference

Preprint

Conference

Conference

Journal Article

Preprint

Conference

Journal Article

Conference

Conference

Preprint

Conference

Preprint

Conference

Journal Article

Preprint