| Characterizing Consensuses in Belief Flow Networks | Nicolas Schwind, Gauvain Bourgne, Katsumi Inoue |
| Comparing the Fairness of Recursively Balanced Picking Sequences | Karen Frilya Celine, Warut Suksompong, Sheung Man Yuen |
| Constrained Assumption-Based Argumentation Frameworks | Emanuele De Angelis, Fabio Fioravanti, Maria Chiara Meo, Alberto Pettorossi, Maurizio Proietti, Francesca Toni |
| Identifying Essential Rule Sets in Agent-Based Models Through Systematic Ablation: A Tumor Evolution Case Study | Nhung Duong, Đoàn Minh Lượng, Anh Do, Anh Truong, Ngoc Do, Nguyen Tran Nam Tien, Tuan Do |
| Multi-Objective Coverage via Constraint Active Search | Zakaria Shams Siam, Xuefeng Liu, Chong Liu |
| An Algebraic Structuring of Epistemic States for BDI Agents in Uncertain Environments | Charles A. N. Costa, Marlo Souza, Célia Ghedini Ralha |
| Interactive Bayesian Deception under Strategic Timing | Amitalok J. Budkuley, Arya Choudhuri |
| SOM: Structured Opponent Modeling for LLM-based Agents via Structural Causal Model | Shiyue Cao, Pei Xu, Likun Yang, Lei Cui, Xiaotang Chen, Kaiqi Huang |
| Cross-Domain Alignment with Fine Geometric Perception for Detail-Preserving Point Cloud Completion | Chen Huang, Haobo Ma, Yan Zhang, Chao Yang, Jianhua Song |
| Decisions.jl: Representing and Transforming Decision Problem Classes in Julia | Mel Krusniak, Ofer Dagan, Himanshu Gupta, Benjamin Kraske, Kyle Hollins Wray, Zachary Sunberg |
| Defection at First Sight: Learning Partner Selection in Optional Social Dilemmas without Prior Information | Benedict Russell, Chin-wing Leung, Paolo Turrini |
| Metric Hedonic Games on the Line | Merlin de la Haye, Pascal Lenzner, Farehe Soheil, Marcus Wunderlich |
| The Impossibility of Strategyproof Rank Aggregation | Manuel Eberl, Patrick Lederer |
| Robust Counterfactual Inference in Markov Decision Processes | Jessica Lally, Milad Kazemi, Nicola Paoletti |
| Surrogate-Augmented Deception in Reinforcement Learning (SAD-RL) | Joe Shymanski, Scott Nivison, Sandip Sen |
| Decentralized Value Systems Agreements | Arturo Hernandez-Sanchez, Natalia Criado, Stella Heras, Miguel Rebollo, Jose Such |
| Zero-Shot Coordination in Ad Hoc Teams with Generalized Policy Improvement and Difference Rewards | Rupal Nigam, Niket Parikh, Hamid Osooli, Mikihisa Yuasa, Jacob Heglund, Huy T Tran |
| OSIL: Learning Offline Safe Imitation Policies with Safety Inferred from Non-preferred Trajectories | Returaj Burnwal, Nirav Pravinbhai Bhatt, Balaraman Ravindran |
| Reasoning about Bias in Multi-Agent Systems Verification | Vadim Malvone, Chunyan Mu |
| Maximizing Index Diversity in Committee Elections | Paula Böhm, Robert Bredereck, Till Fluschnik |
| Output-Feedback Security Tracking for Robotic Systems against DoS Attacks Using Finite-Time Observers | Jingwen Fan, Xiaozheng Jin, Jia Fu |
| Health Facility Location in Ethiopia: Leveraging LLMs to Integrate Expert Knowledge into Algorithmic Planning | Yohai Trabelsi, Guojun Xiong, Fentabil Getnet, Stéphane Verguet, Milind Tambe |
| Balancing for Agent Decision Making through Argumentation | Liuwen Yu, Chenyang Cai, Leon Van der Torre, Réka Markovich |
| Influencing LLM Multi-Agent Dialogue via Policy-Parameterized Prompts | Hongbo Bo, Jingyu Hu, Weiru Liu |
| Catch Me If You Can: Finding the Source of Infections in Temporal Networks | Ben Bals, Michelle Döring, Nicolas Klodt, George Skretas |
| Utility-based soft masking for continual multi-objective reinforcement learning | Timon Deschamps, Rémy Chaput, Mathieu Guillermin, Laetitia Matignon |
| Proportional and Pareto-Optimal Allocation of Chores with Subsidy | Jugal Garg, Eklavya Sharma, Xiaowei Wu |
| Existence and Computation of Fair Allocations under Constraints | Siddharth Barman, Ioannis Caragiannis, Sudarshan Shyam |
| Average Unfairness in Routing Games | Pan-Yang Su, Arwa Alanqary, Bryce Ferguson, Manxi Wu, Alexandre M Bayen, S. Shankar Sastry |
| Practical approach to $2$-Euclidean Preferences | Michal Dvořák, Jan Pokorný, Dušan Knop, Martin Slávik |
| Teaching LLMs Naturally: Pedagogical Strategies for Interactive Knowledge Acquisition | Sabrina Patania, Luca Annese, Cansu Koyuturk, Dimitri Ognibene |
| Fairness Dynamics in Digital Economy Platforms with Biased Ratings | J. Martin Smit, Fernando P. Santos |
| DRAGON: LLM-Driven Decomposition and Reconstruction Agents for Large-Scale Combinatorial Optimization | Shengkai Chen, Zhiguang Cao, Jianan Zhou, Yaoxin Wu, Senthilnath Jayavelu, Zhuoyi Lin, Xiaoli Li, Shili Xiang |
| Computing Perfect Bayesian Equilibria, with Application to Empirical Game-Theoretic Analysis | Christine Konicki, Mithun Chakraborty, Michael P. Wellman |
| Networked Communication for Mean-Field Games with Function Approximation and Empirical Mean-Field Estimation | Patrick Benjamin, Alessandro Abate |
| Learning the Value Systems of Societies with Preference-based Multi-objective Reinforcement Learning | Andrés Holgado-Sánchez, Peter Vamplew, Richard Dazeley, Sascha Ossowski, Holger Billhardt |
| Why and Whom to Communicate? A Dual-Objective, Cost–Benefit Framework for Multi-Agent Communication | Yi-Yu Lin, Jiahao Zhang, Xiao-Jun Zeng |
| TAAM: Inductive Graph-Class Incremental Learning with Task-Aware Adaptive Modulation | Jingtao Liu, Xinming Zhang |
| Rethinking Priority Scheduling for Sequential Multi-Agent Decision Making in Stackelberg Games | Xiangyu Liu, Liang Zhang, Bo Jin, Ziqi Wei |
| $\mu$ACP: A Formal Calculus for Expressive, Resource-Constrained Agent Communication | Arnab, Indraveni |
| The Power of Information for Intermediate States in Contract Design | YiRui Zhang, Zhixuan Fang |
| IntentCUA: Learning Intent-level Representations for Skill Abstraction and Multi-Agent Planning in Computer-Use Agents | Seoyoung Lee, Seobin Yoon, Seongbeen Lee, Yoojung Chun, Dayoung Park, Doyeon Kim, Joo Yong Sim |
| Real-time Cohorting of Nursing Care into Bubbles | Jeffrey Keithley, Tinh Tran, Lucas Zach-Ryan, D M Hasibul Hasan, Brodie McCuen, Sriram Pemmaraju, Bijaya Adhikari |
| Towards Generalisable Imitation Learning Through Conditioned Transition Estimation and Online Behaviour Alignment | Nathan Gavenski, Matteo Leonetti, Odinaldo Rodrigues |
| CONSENT: A Negotiation Framework for Leveraging User Flexibility in Vehicle-to-Building Charging under Uncertainty | Rishav Sen, Fangqi Liu, Jose Paolo Talusan, Ava Pettet, Yoshinori Suzue, Mark Bailey, Ayan Mukhopadhyay, Abhishek Dubey |
| Single-Winner Voting on Matchings | Niclas Boehmer, Jessica Dierking |
| Towards A Unified Policy Abstraction Theory and Representation Learning Approach in Markov Decision Processes | Min Zhang, Hongyao Tang, Jianye Hao, Yan Zheng |
| Adjusted Winner: from Splitting to Selling | Robert Bredereck, Eyal Briman, Bin Sun, Nimrod Talmon |
| Beyond Scalar Welfare: Enforcing Identity-Aware Equity in Multi-Agent Reinforcement Learning | Nhat-Hoang P. Nguyen, Thinh Pham, Trung-Hoang Le, Sameer Alam, Vu N. Duong |
| A Verification Framework for Obstruction, Probability, and Time | Wissal Dahani, Jean Leneutre, Vadim Malvone, James Ortiz, Axel Oscar |
| General Flexible $f$-divergence for Challenging Offline RL Datasets with Low Stochasticity and Diverse Behavior Policies | Jianxun Wang, Grant Collier Forbes, Leonardo Villalobos-Arias, David L. Roberts |
| Boosting Offline MARL under Imbalanced Datasets via Compositional Diffusion Models | Lihe Li, Shenghe Hu, Bingxuan Lan, Yuqi Bian, Huan Zhang, ZhaoMing, Chongjie Zhang, Lei Yuan, Yang Yu |
| BeWater: Effective Protesters Navigate Watersheds in Street Networks | Guillaume Moinard, Matthieu Latapy |
| Everyone Contributes! Incentivizing Strategic Cooperation in Multi-LLM Systems via Sequential Public Goods Games | Yunhao Liang, Yuan Qu, Jingyuan Yang, Shaochong Lin, Zuo-Jun Shen |
| The Complexity of Strategic Behavior in Primary Elections | Colin Cleveland, Bart De Keijzer, Maria Polukarov |
| Lifted Forward Planning in Relational Factored Markov Decision Processes with Concurrent Actions | Florian Andreas Marwitz, Tanya Braun, Ralf Möller, Marcel Gehrke |
| Flow-Based Task Assignment for Large-Scale Online Multi-Agent Pickup and Delivery | Yue Zhang, Zhe Chen, Daniel Harabor, Pierre Le Bodic, Peter J. Stuckey |
| Rejecting Arguments Based on Doubt in Structured Bipolar Argumentation | Michael A. Müller, Srdjan Vesic, Bruno Yun |
| Extremely Large Collective Coalition Formation: Scalability | Neha G. Pusalkar, Julie A. Adams |
| Metric Distortion in Peer Selection | Javier Cembrano, Golnoosh Shahkarami |
| PortfoliQA: An Agentic RAG Framework for Knowledge Graph Question Answering via Structured Evidence Portfolios | Weina Zhang, Junsheng Huang, Zhongqin Bi, Dan Dai |
| Efficiently Computing Equilibria in Budget-Aggregation Games | Patrick Becker, Alexander Fries, Matthias Greger, Erel Segal-Halevi |
| Temporal Panel Selection in Ongoing Citizens’ Assemblies | Yusuf Hakan Kalayci, Evi Micha |
| Imperfect-Information Games on Quantum Computers: A Case Study in Skat | Ulrich Armbrüster, Stefan Edelkamp, Gabriel Maresch, Erik Schulze |
| Teaching an Old Dynamics New Tricks: Regularization-free Last-iterate Convergence in Zero-sum Games via BNN Dynamics | Tuo Zhang, Leonardo Stella |
| HyperAgent: Leveraging Hypergraphs for Topology Optimization in Multi-Agent Communication | Heng Zhang, Yuling Shi, Xiaodong Gu, Zijian Zhang, Haochen You, Lubin Gan, Yilei Yuan, Jin Huang |
| MA-SafeDiffuser: Safe Multi-Agent Planning with Diffusion Probabilistic Models | Kiran Ravish, Ankita Kushwaha, Preeti, Pawan Kumar |
| Disobedience in normative multi-agent systems | Marija Slavkovik, Liuwen Yu, Leon Van der Torre, Réka Markovich, Beishui Liao |
| Finite-State Controllers for (Hidden-Model) POMDPs using Deep Reinforcement Learning | David Hudák, Maris F. L. Galesloot, Martin Tappler, Martin Kurečka, Nils Jansen, Milan Ceska |
| VLM-ReG: Vision-Language Models Enhanced via Reward-Refined GRPO in Remote Sensing Reasoning | Yanzhong Zhou, Biaoxin Li, Zhangling Wang, Chao Wang, Wanpeng Zhang |
| Learning Truthful Mechanisms without Discretization | Yunxuan Ma, Steven Wang, Zhijian Duan, Yukun Cheng, Xiaotie Deng |
| Conformal Reachability for Safe Control in Unknown Environments | Xinhang Ma, Junlin Wu, Yiannis Kantaros, Yevgeniy Vorobeychik |
| Perception-Based Beliefs for POMDPs with Visual Observations | Miriam Schäfers, Merlijn Krale, Thiago D. Simão, Nils Jansen, Maximilian Weininger |
| A Multi-Robot Architecture for Continuous Planning and Execution using BDI Agents | Carlos Joel Tavares da Silva, Rafael Melo Santos, Rafael C. Cardoso, Célia Ghedini Ralha |
| A Semi-Decentralized Approach to Multiagent Control | Mahdi Al-Husseini, Mykel Kochenderfer, Kyle Hollins Wray |
| VEsNA-Pro: Exploiting BDI Agents with Propensities for Emergent Narrative | Andrea Gatti, Fabio Casale, Viviana Mascardi, Andrea Stucchi, Angelo Ferrando |
| Solving Repeated Games with Large Language Model | Naming Liu, Youzhi Zhang, Ying Wen |
| Information Contagion in Climate-Stressed SME Networks: An Agent-Based Simulation Study | Đoàn Minh Lượng, Nguyen Tu Uyen, Duong Thi Phuong Thao, Duong Thu Ngan, Phong Ho, Nhung Duong, Tuan Do |
| VGC-Bench: Towards Mastering Diverse Team Strategies in Competitive Pokémon | Cameron L. Angliss, Jiaxun Cui, Jiaheng Hu, Arrasy Rahman, Peter Stone |
| Active Evaluation of General Agents: Problem Definition and Comparison of Baseline Algorithms | Marc Lanctot, Kate Larson, Ian Gemp, Michael Kaisers |
| Optimizing Pool Testing for Epidemic Surveillance | Jack Heavey, Abhijin Adiga, Anil Vullikanti |
| Transparent and Accessible ABMs with FODD: Automatic Code from Formal ODD | Themis Dimitra Xanthopoulou, Andreas Prinz, Haakon Bøthun Lunde, Ivan Puga-Gonzalez, F. LeRon Shults |
| First-Order and Second-Order Model Counting Meet Stable Marriages, Stable Roommates, and Stable Diners | Václav Kůla, Jan Tóth, Yuanhong Wang, Yuyi Wang, Ondrej Kuzelka |
| Stable Matching: Dealing with Changes in Preferences | Rohith Reddy Gangam, Tung Mai, Nitya Raju, Vijay Vazirani |
| RocqStar: Leveraging Similarity-driven Retrieval and Agentic Systems for Rocq generation | Andrei Kozyrev, Nikita Khramov, Gleb Solovev, Anton Podkopaev |
| MARLIN: LLM-Guided Multi-Agent Reinforcement Learning with Murmuration Intelligence for Reservoir Management | Heming Fu, Shan Lin, Guojun Xiong |
| Control in Hedonic Games | Jiehua Chen, Jakob Guttmann, Merisa Mustajbašić, Sofia Simola |
| Advancing Multi-Agent RAG system with Minimalist Reinforcement Learning | Yihong Wu, Liheng Ma, Muzhi Li, Jiaming Zhou, Lei Ding, Jianye Hao, Ho-fung Leung, Irwin King, Yingxue Zhang, Jian-Yun Nie |
| CoopReflect: Towards Natural Language Communication for Cooperative Autonomous Driving via Multi-Agent Learning | Jiaxun Cui, Chen Tang, Jarrett Holtz, Janice Nguyen, Alessandro G Allievi, Hang Qiu, Peter Stone |
| The Role of Social Learning and Collective Norm Formation in Fostering Cooperation in LLM Multi-Agent Systems | Prateek Gupta, Qiankun Zhong, Hiromu Yakura, Thomas F. Eisenmann, Iyad Rahwan |
| Population synthesis with geographic coordinates | Jacopo Lenti, Lorenzo Costantini, Ariadna Fosch, Anna Monticelli, David Scala, Marco Pangallo |
| Interbank Lending Games | Jinyun Tong, Bart De Keijzer, Haoxiang Wang, Carmine Ventre |
| A Novel Framework for Uncertainty-Driven Adaptive Exploration | Leonidas Bakopoulos, Georgios Chalkiadakis |
| Diverse Mini-Batch Selection in Reinforcement Learning for Efficient Chemical Exploration in de novo Drug Design | Hampus Gummesson Svensson, Ola Engkvist, Jon Paul Janet, Christian Tyrchan, Morteza Haghir Chehreghani |
| QDepth-VLA: Quantized Depth Prediction as Auxiliary Supervision for Vision–Language–Action Models | Yixuan Li, Yuhui Chen, Mingcai Zhou, Haoran Li |
| MeCo: Enhancing LLM-Empowered Multi-Robot Collaboration via Similar Task Memoization | Baiqing Wang, Helei Cui, Bo Zhang, Xiaolong Zheng, Bin Guo, Zhiwen Yu |
| CRAwDAD: Causal Reasoning Augmentation with Dual-Agent Debate | Finn G. Vamosi, Nils Forkert |
| Mechanism Design for Efficient Task Allocation | Zifan Gong, Minming Li, Houyu Zhou |
| Beyond Self-Interest: Modeling Social-Oriented Motivation for Human-like Multi-Agent Interactions | Jingzhe Lin, Ceyao Zhang, Yaodong Yang, Yizhou Wang, Song-Chun Zhu, Fangwei Zhong |
| Learning Robust Markov Models for Safe Runtime Monitoring | Antonina Skurka, Luko van der Maas, Sebastian Junges, Hazem Torfah |
| Multi-Agent Cooperative Transportation: Optimal and Efficient Task Allocation and Path Finding | Ning Zhou, Nikolai W.F. Bode, Edmund R Hunt |
| ME-IGM: Individual-Global-Max in Maximum Entropy Multi-Agent Reinforcement Learning | Wentse Chen, Yuxuan Li, Shiyu Huang, Jiayu Chen, Jeff Schneider |
| EmoDebt: Bayesian-Optimized Emotional Intelligence for Strategic Agent-to-Agent Debt Recovery | Yunbo Long, Yuhan Liu, Liming Xu, Alexandra Brintrup |
| B3C: A Minimalist Approach to Offline Multi-Agent Reinforcement Learning | Woojun Kim, Katia P. Sycara |
| Probing Dec-POMDP Reasoning in Cooperative MARL | Kale-ab Tessera, Leonard Hinckeldey, Riccardo Zamboni, David Abel, Amos Storkey |
| Peach: Program Each Agent and Communicate Howsoever | Amit K. Chopra, Samuel H. Christie V, Munindar P. Singh |
| The Triad of Identity, Trust and Responsibility in Multi-Agent Systems | Jayati Deshmukh, Vahid Yazdanpanah, Sebastian Stein, Sarvapali D Ramchurn |
| Offline Multi-Agent Reinforcement Learning with Global Moderate Generalization | Yuanrui Duan, Wengang Zhou, Yufeng Shi, Xiancheng Gao, Lin Liu, Houqiang Li |
| Reasoning About Responsibility for Taking Risks | Maksim Gladyshev, Natasha Alechina, Mehdi Dastani, Dragan Doder |
| ATL*AS: An Automata-Theoretic Approach and Tool for the Verification of Strategic Abilities in Multi-Agent Systems | Sofia Garcia de Blas Garcia-Alcaide, Francesco Belardinelli |
| Complexity of (Non-)Convergence in Iterative Voting | Paul W. Goldberg, Marios Mavronicolas, Tomasz Wąs |
| Interpretable Failure Analysis in Multi-Agent Reinforcement Learning Systems | Risal Shahriar Shefin, Debashis Gupta, Thai Le, Sarra Alqahtani |
| Learning Bayesian Game Families, with Application to Mechanism Design | Madelyn Gatchel, Michael P. Wellman |
| Bandwidth-constrained Variational Message Encoding for Cooperative Multi-agent Reinforcement Learning | Wei Duan, Jie Lu, En Yu, Junyu Xuan |
| My Body, My Perceptions: A Shift from Computationalism to Embodied Cognition in BDI-agent-based Embedded Systems | Nilson Lazarin, Carlos Eduardo Pantoja, Jose Viterbo |
| AC-MASAC: An Attentive Curriculum Learning Framework for Heterogeneous UAV Swarm Coordination | Wanhao Liu, Junhong Dai, Yixuan Zhang, Shengyun Yin, Panshuo Li |
| Can Vibe Coding Beat Graduate CS Students? An LLM vs. Human Coding Tournament on Market-driven Strategic Planning | Panayiotis Danassis, Naman Goel |
| Learning from Delay Distributions: A New Representation for Delay-Aware Reinforcement Learning | Zhuoru Yu, Chenchen Fu, Hengkai Zhong, Wanyuan Wang, Weiwei Wu, Chun Jason Xue |
| Epistemic Modal Logic Meets Algebraic Model Counting | Daxin Liu, Vaishak Belle |
| Unifying Model-Free Efficiency and Model-Based Representations via Latent Dynamics | Jashaswimalya Acharjee, Balaraman Ravindran |
| Heterogeneous RBCs via deep multi-agent reinforcement learning | Federico Gabriele, Aldo Glielmo, Marco Taboga |
| Beyond Neighbor Influence: A Behavior-Driven Agent-Based Model of Silence | Ziqian Shi, Rosa M Benito |
| Exploring Relations among Fairness Notions in Discrete Fair Division | Jugal Garg, Eklavya Sharma |
| When is Offline Policy Selection Sample Efficient for Reinforcement Learning? | Vincent Liu, Prabhat Nagarajan, Andrew Patterson, Martha White |
| TEACH: Temporal Variance-Driven Curriculum for Reinforcement Learning | Gaurav Chaudhary, Laxmidhar Behera |
| Identifying the Source of Information Spread in Networks via Markov Chains | Yael Sabato, Amos Azaria, Noam Hazon |
| Heartbeat Synchronization in Large Multi-Agent Systems Using One-Way Communication | Federico Bergenti, Simone Dallospedale, Stefania Monica |
| A Unified Framework for Analyzing Meta-algorithms in Online Convex Optimization | Mohammad Pedramfar, Vaneet Aggarwal |
| Generating Fair Consensus Statements with Social Choice on Token-Level MDPs | Carter Blair, Kate Larson |
| Fairness over Equality: Correcting Social Incentives in Asymmetric Sequential Social Dilemmas | Alper Demir, Hüseyin Aydın, Kale-ab Tessera, David Abel, Stefano V. Albrecht |
| LLMEvalRec: An Agentic Framework for Simulating Users to Evaluate News Recommendation Systems | Yao Ma, Abhishek Tripathi, Samuel Louvan, Wei Liu, Murat Sensoy |
| HLSMAC: A New StarCraft Multi-Agent Challenge for High-Level Strategic Decision-Making | Xingxing Hong, Yungong Wang, Dexin Jin, Ye Yuan, Ximing Huang, Zijian Wu, Yirui Rao, Wenxin Li |
| A Three-Layer Reinforcement Learning-based Approach for Dynamic Task Allocation Under Multiple Task Resource Constraints | Shuo Wang, Yuzhen Zhang, Hao Su, Yazhou Hu, Pei Lv |
| Project Submission Games in Participatory Budgeting | Piotr Faliszewski, Łukasz Janeczko, Andrzej Kaczmarczyk, Grzegorz Lisowski, Grzegorz Pierczyński |
| Eliminating Inconsistencies among CP-Theory Qualitative Preferences | Erik Rauer, Samik Basu, Vasant G. Honavar, Jia Tao |
| EG-RAG: Retrieval-Augmented Generation with Evidence Graph for Reliable Multi-Document Reasoning | Seunggwan Hong, Junhyung Moon, Eunkyeong Lee, Jaehyoung Park, Hyunseung Choo |
| Quality-Diversity for Multi-Agent Reinforcement Learning | Hao Chen, Pengyi Li, Bin Zhang, Hu Fu, Zhiwei Xu, Ce Zhang, Xinyue Lu, Guoliang Fan |
| UNCAP: Uncertainty-Guided Neurosymbolic Planning Using Natural Language Communication for Cooperative Autonomous Vehicles | Neel P. Bhatt, Po-han Li, Kushagra Gupta, Rohan Siva, Daniel Milan, Alexander Todd Hogue, Sandeep P. Chinchali, David Fridovich-Keil, Zhangyang Wang, Ufuk Topcu |
| Modeling Human Behavior in a Strategic Network Game With Complex Group Dynamics | Jonathan Berry Skaggs, Jacob Crandall |
| Strength Change Explanations in Quantitative Argumentation | Timotheus Kampik, Xiang Yin, Nico Potyka, Francesca Toni |
| Modelling Customer Trajectories with Reinforcement Learning for Practical Retail Insights | Ken Ming Lee, Paul Barde, Maxime C. Cohen, Derek Nowrouzezahrai |
| Online Sensor Grouping via Multi-Agent Learning Automata: An Ising Model Perspective | Anis Yazidi, Marco Antonio Pinto-Orellana, Youcef Djenouri |
| Multi UAVs Preflight Planning in a Shared and Dynamic Airspace | Amath Sow, Mauricio Rodriguez Cesen, Fabíola Martins Campos de Oliveira, Mariusz Wzorek, Daniel de Leng, Mattias Tiger, Fredrik Heintz, Christian Esteve Rothenberg |
| Feasible Constraint Policy Optimization for Safe Reinforcement Learning | Luoyang Sun, Jiwen Jiang, Ning Yang, Rasul Tutunov, Haifeng Zhang, Jun Wang |
| Inference of Altruism and Intrinsic Rewards in Multi-Agent Systems | Victor Villin, Christos Dimitrakakis |
| The Multi-Agent Off-Switch Game | Akash Agrawal, Soroush Ebadian, Lewis Hammond |
| Fluid-Agent Reinforcement Learning | Shishir Sharma, Doina Precup, Theodore Perkins |
| Confounding Robust Continuous Control via Automatic Reward Shaping | Mateo Juliani, Mingxuan Li, Elias Bareinboim |
| Functional Multi-armed Bandit and the Best Function Identification Problems | Yuriy Dorn, Aleksandr Katrutsa, Ilgam Latypov, Anastasiia Soboleva |
| Cleaner Adversarial CAPTCHAs: Intelligent Targets and Precise Noise for Usable Security | Meir Litman, Chen Hajaj |
| Translating Latent State World Model Representations into Natural Language | Matthew Barker, Matteo Leonetti |
| Online Fair Division With Subsidy: When Do Envy-Free Allocations Exist, and at What Cost? | Pooja Ravi Kulkarni, Ruta Mehta, Vishnu V. Narayan, Tomasz Ponitka |
| Towards Foresighted AI Cooperators with LLM-driven Decision-Time Planning | Yuheng Jing, Kai Li, Bingyun Liu, Ziwen Zhang, Zhe Wu, Yifan Zhang, Junliang Xing, Jian Cheng |
| Greedy Routing Reachability Games | Pascal Lenzner, Paraskevi Machaira |
| Robust Information Design for Multi-Agent Systems with Complementarities: Smallest-Equilibrium Threshold Policies | Farzaneh Farhadi, Maria Chli |
| A General Incentives-Based Framework for Fairness in Multi-agent Resource Allocation | Ashwin Kumar, William Yeoh |
| Don’t Blind Your VLA: Aligning Visual Representations for OOD Generalization | Nikita Kachaev, Mikhail Kolosov, Daniil Zelezetsky, Alexey Kovalev, Aleksandr Panov |
| Scalable Knothe–Rosenblatt-like Heuristic Transportation Plans for Imaging Problems | Gennaro Auricchio, Min Lin, Lingxuan Zhou, Zhaori Guo, Zhongqi Cai |
| Adaptive Agents in Spatial Double-Auction Markets: Modeling the Emergence of Industrial Symbiosis | Matthieu Mastio, Benoit Gaudou, Paul Saves, Nicolas Verstaevel |
| Multi-Agent Pickup and Delivery with Heterogeneous Agents | Benedetta Flammini, Francesco Amigoni, Bruno Lacerda |
| Calibrated LRT Guidance for Offline Diffusion Policies | Ximan Sun, Xiang Cheng |
| A Generic Framework for Fair Consensus Clustering in Streams | Diptarka Chakraborty, Kushagra Chatterjee, Debarati Das, Tien-Long Nguyen |
| Fairness in Cooperative Multi-objective Multi-agent Reinforcement Learning using Expected Utility | Fares Chouaki, Aurélie Beynier, Nicolas Maudet, Paolo Viappiani |
| Kidney Exchange: Faster Parameterized Algorithms and Tighter Lower Bounds | Palash Dey, Aritra Banik, Abhishek Sahu, Sujoy Bhore |
| CentaurMD: Confidence-Aware Human–AI Decision Fusion for Multi-Label Disease Diagnosis via Label-Specific MoE | Youcheng Zhang, Hui Wang, Jiaqi Liu, Yao Zhang, Zhiwen Yu, Bin Guo |
| MEASE: Multi-agent Episodic Action Sequence Explanation | Khaing Phyo Wai, Minghong Geng, Shubham Pateria, Budhitama Subagdja, Ah-Hwee Tan |
| Neuro-symbolic Action Masking for Deep Reinforcement Learning | Shuai Han, Mehdi Dastani, Shihan Wang |
| LLM-augmented empirical game theoretic simulation for socio-ecological systems | Jennifer Shi, Christopher K. Frantz, Christian Kimmich, Saba Siddiki, Atrisha Sarkar |
| Solving Qualitative Multi-Objective Stochastic Games | Moritz Graf, Anthony Widjaja Lin, R Majumdar |
| MENSA: Leveraging Mental Simulation for In-Context Policy Improvement in LLM Agents | Chung-Che Chang, Erick Chandra, Jane Yung-jen Hsu, Yen-Ling Kuo |
| Truthful Reporting of Competence with Minimal Verification | Reshef Meir, Jonathan Wagner, Omer Ben-Porat |
| Robust Sequential Learning in Random Order Networks | William Guo, Edward Xiong, Jie Gao |
| Delayed Assignments in Online Non-Centroid Clustering with Stochastic Arrivals | Saar Cohen |
| TriBand-BEV: Real-Time LiDAR-Only 3D Pedestrian Detection via Height-Aware BEV and High-Resolution Feature Fusion | Mohammad Khoshkdahan, Alexey Vinel |
| A Radius-Sensitive Approximation Algorithm for Connected Submodular Maximization | Philip Cervenjak, Junhao Gan, Naonori Kakimura, Seeun William Umboh, Anthony Wirth |
| Dynamic Action Space Reinforcement Learning for Optimal Trading Execution | Pangjing Wu, Xiaodong Li |
| Multi-Objective Categorical Deep Q-Networks | Fares Chouaki, Aurélie Beynier, Nicolas Maudet, Paolo Viappiani |
| Peer-Aware Cost Estimation in Nonlinear General-Sum Dynamic Games for Mutual Learning and Intent Inference | Seyed Yousef Soltanian, Wenlong Zhang |
| Risk-aware Flow Tuning for Collective Emotion in Social Media via Multi-agent RL | Ruyi Wang, R L Hill, J. Michael Herrmann |
| Wallet ATL: Towards Reliable Smart Contract Verification | Angelo Ferrando, Blondelle Kana Zanlefack, Vadim Malvone |
| Planning Ahead with RSA: Efficient Signalling in Dynamic Environments by Projecting User Awareness across Future Timesteps | Anwesha Das, John Duff, Jörg Hoffmann, Vera Demberg |
| STO-RL: Offline RL under Sparse Rewards via LLM-Guided Subgoal Temporal Order | Chengyang Gu, Yuxin Pan, Hui Xiong, Yize Chen |
| Puzzle it Out: Local-to-Global World Model for Offline Multi-Agent Reinforcement Learning | Sijia Li, Xinran Li, Shibo Chen, Jun Zhang |
| The Agency Circuit: A Neuro-Symbolic Solution for Mitigating Policy Collapse in Reinforcement Learning | Mahnoor Shahid |
| Utility Aware Adaptive Privacy Budget Allocation for Streaming Multi-Agent Systems | Puspanjali Ghoshal, Ashok Singh Sairam |
| Universal Solvability for Robot Motion Planning on Graphs | Anubhav Dhar, Pranav Nyati, Tanishq Prasad, Ashlesha Hota, Sudeshna Kolay |
| Modifying Preferences and Capacities for Stability in Flow Networks: Algorithms and Complexity | Gergely Csáji, Kitti Varga |
| Human-Inspired Context-Selective Multimodal Memory for Social Robots | Hangyeol Kang, Slava Voloshynovskiy, Nadia Thalmann |
| A Hierarchical Approach with Crisis Mitigation for Multi-Robot Spatio-Temporal Restoration | Amel Nestor Docena, Alberto Quattrini Li |
| From User Preferences to Base Score Extraction Functions in Gradual Argumentation | Aniol Civit, Antonio Rago, Antonio Andriella, Guillem Alenyà, Francesca Toni |
| Modeling and Optimizing the Provisioning of Exhaustible Capabilities for Simultaneous Task Allocation and Scheduling | Jinwoo Park, Harish Ravichandar, Seth Hutchinson |
| End-to-End Decision-Focused Prediction in Dynamic Bike-Sharing Rebalancing | Pengbo Fu, Chongyang Wan, Yuan Luo |
| EFX Allocations Exist on Triangle-Free Multi-Graphs | Mahyar Afshinmehr, Arash Ashuri, Pouria Mahmoudkhan, Kurt Mehlhorn |
| Wisdom of the Machines: Exploring Collective Intelligence in LLM Crowds | Yashar Talebirad, Ali Parsaee, Vishwajeet Ohal, Amirhossein Nadiri, Csongor Szepesvari, Yash Mouje, Eden Redman |
| Majoritarian Assignment Rules | Felix Brandt, Haoyuan Chen, Chris Dong, Patrick Lederer, Alexander Schlenga |
| Modeling Dynamics under Random Delays in Reinforcement Learning | Bokai Ji, Guangxia Li, Yulong Shen |
| Synthesis and Evaluation of Long-term History-aware Medical Dialogue | Hebin Hu, Renke Dai, Ah-Hwee Tan, Yilin Kang |
| Stigmergic Swarming Agents for Fast Subgraph Isomorphism | H. Van Dyke Parunak |
| Ratio-Based Signaling for Source-Victim Separation in Swarm Fault Detection | Longyin Cui |
| From Actions to Words: Towards Abstractive-Textual Policy Summarization in RL | Sahar Admoni, Assaf Hallak, Yftah Ziser, Omer Ben-Porat, Ofra Amir |
| Contrastive explanations of BDI agents | Michael Winikoff |
| IMAS$^2$: Joint Agent Selection and Information-Theoretic Coordinated Perception In Dec-POMDPs | Chongyang Shi, Wesley A. Suttle, Michael Dorothy, Jie Fu |
| Scaling Multi-Agent Epistemic Planning through GNN-Derived Heuristics | Giovanni Briglia, Francesco Fabiano, Stefano Mariani |
| ReGMS: Retrieval-Grounded Multi-Agent Scenario Analysis for Climate Risk | Yun Wing Kiang, King Hang Lam |
| Coherent belief and opinion propagation produces more echo chambers | Hiro Kataoka, Jérôme Euzenat, Koji Hasebe |
| Algorithmic Collusion at Test Time: A Meta-game Design and Evaluation | Yuhong Luo, Daniel Schoepflin, Xintong Wang |
| RoboGPT-R1: Enhancing Robot Task Planning with Reinforcement Learning | Jinrui Liu, Bingyan Nie, Boyu Li, Yaran Chen, Yuze Wang, Shunsen He, Haoran Li |
| SAT: Sequential Agent Tuning for Coordinator-Free Plug-and-Play Multi-LLM Training with Monotonic Improvement Guarantees | Yi Xie, Yangyang Xu, Yi Fan, Bo Liu |
| Flexibility-Based Traffic Flow Optimisation in Lifelong Multi-Agent Path Finding | Peiqian Lin, Zhe Chen, David L. Dowe, Daniel Harabor |
| Axiomatic Foundations of Counterfactual Explanations | Leila Amgoud, Martin C. Cooper |
| The Reachability Objective in Multi-Agent Path Finding | Noy Gabay, Jonathan Morag, Ariel Felner, Roni Stern |
| Approximating Nash Equilibria in General-Sum Games via Meta-Learning | David Sychrovský, Christopher Solinas, Revan MacQueen, Kevin A. Wang, James R. Wright, Nathan R. Sturtevant, Michael Bowling |
| Heterogeneity in Multi-Agent Reinforcement Learning | Tianyi Hu, Zhiqiang Pu, Yuan Wang, Tenghai Qiu, Min Chen, Xin Yu |
| Towards Failure-Resilient Lifelong Learning Agents through Scene Graph-Guided Proactive Replanning | Che Rin Yu, Daewon Chae, Dabin Seo, Sangwon Lee, Hyeongwoo Im, Jinkyu Kim |
| PREFINE: Preference-based Implicit Reward and Cost Fine-tuning for Safety Alignment | Richa Verma, Bavish Kulur, Sanjay Chawla, Balaraman Ravindran |
| Equitable Core Imputations for Max-Flow, MST and $b$-Matching Games | Rohith Reddy Gangam, Naveen Garg, Parnian Shahkar, Vijay Vazirani |
| Enhancing Goal Inference via Correction Timing | Anjiabei Wang, Shuangge Wang, Tesca Fitzgerald |
| Robust Value Maximization in Challenge the Champ Tournaments with Probabilistic Outcomes | Umang Bhaskar, Juhi Chaudhary, Sushmita Gupta, Pallavi Jain, Sanjay Seetharaman |
| Maximin Shares with Lower Quotas | Hirota Kinoshita, Ayumi Igarashi |
| Multiagent System for Dynamic Multicriteria Traffic Routing in Urban Environments | Temirlan Kurbanov, Adéla Kubíková, Jiri Vokrinek, Franziska Klügl |
| Byzantine Fault Tolerance in Distributed Constraint Optimization Problems | Koji Noshiro, Koji Hasebe |
| Smooth Routing in Decaying Trees | Till Fluschnik, Amela Pucic, Malte Renken |
| Online Learning of Numeric Action Models for Planning | Argaman Mordoch, Yarin Benyamin, Shahaf S. Shperberg, Brendan Juba, Roni Stern |
| A Survey of Reinforcement Learning for Autonomous Air Combat: Current Progresses and Limitations | Alex Pierron, Thibault Lahire |
| Retrieval and Argumentation Enhanced Multi-Agent LLMs for Judgmental Forecasting | Deniz Gorur, Antonio Rago, Francesca Toni |
| Fair Coordination in Strategic Scheduling | Wei-Chen Lee, Martin Bullinger, Alessandro Abate, Michael J. Wooldridge |
| Automatically Benchmarking LLM Code Agents through Agent-driven Annotation and Evaluation | Lingyue Fu, Bolun Zhang, Hao Guan, Yaoming Zhu, Lin Qiu, Weiwen Liu, Xuezhi Cao, Xunliang Cai, Weinan Zhang, Yong Yu |
| GroupDebate: Enhancing the Efficiency of Multi-Agent Debate Using Group Discussion | Tongxuan Liu, Xingyu Wang, Weizhe Huang, Wenjiang Xu, Yuting Zeng, Lei Jiang, Hailong Yang, Jing Li |
| PIQL: Projective Implicit Q-Learning with Support Constraint for Offline Reinforcement Learning | Xinchen Han, Hossam Afifi, Michel Marot |
| Assessing VLM-Driven Semantic-Affordance Inference for Non-Humanoid Robot Morphologies | Rob Jones, Sabine Hauert, Raul Santos-Rodriguez |
| Multi-Agent Decision S4: Leveraging State Space Models for Offline Multi-Agent Reinforcement Learning | Ashmita Bhattacharya, Malyaban Bal |
| Learning Robust Policy for Multi-UAV Collision Avoidance via Compact Causal Feature | Zhun Fan, Gaofei Han, Che Lin, Wenji Li, Jie Xu, Jiafan Zhuang |
| Follow the STARs: Dynamic $\omega$-Regular Shielding of Learned Probabilistic Policies | Ashwani Anand, Satya Prakash Nayak, Ritam Raha, Anne-Kathrin Schmuck |
| KAN-Enhanced Graph Learning for Active Voltage Control in Dynamic Power Systems | Liqian Sun, Hang Xiao, Shuhan Qi, Huale Li, Jiajia Zhang, Xuan Wang |
| SESiL: Social, Evolutionary Supported Learning | Tianshu Zhao, Zinovi Rabinovich |
| BusEnv: A Multi-agent Reinforcement Learning Environment and Benchmark for Urban Public Transportation | WESLEY DA SILVA E SILVA, RICARDO RIOS, RAFAEL DA COSTA FONSECA, Sabarikirishwaran Ponnambalam, Léa Cassé, Marcos Vinícius dos Santos Ferreira, Albert Bifet, Tatiane Nogueira Rios |
| IPD: Boosting Sequential Policy with Imaginary Planning Distillation in Offline Reinforcement Learning | Yihao Qin, Yuanfei Wang, Hang Zhou, Peiran Liu, Hao Dong, Yiding Ji |
| Dynamically Increasing Agents Set-Size in Bayesian Multi-agent Multi-armed Bandits Framework | Mohammad ESSA Alsomali, Leandro Soriano Marcolino, Barry Porter, Roberto Rodrigues-Filho |
| GAPS: Global-Aware Prediction-driven Scheduling for Large-Scale LLM Inference | Zhengyu Liu, Fan Zhang, Fengzhe Zhang, Shuaikang Hou |
| Fair Allocation of Improvements: When Old Endowments Shape New Assignments | Noga Klein Elmalem, Rica Gonen, Erel Segal-Halevi |
| MORL4Water: A Modular Multi-Objective Reinforcement Learning Toolkit for Water Resource Management | Zuzanna Osika, Roxana Rădulescu, Jazmin Zatarain Salazar, Frans A Oliehoek, Pradeep K. Murukannaiah |
| Minimax and Preferential Almost-Stable Matchings | Frederik Glitzner, David Manlove |
| Global Convergence to Nash Equilibrium in Nonconvex General-Sum Games under the $n$-Sided PL Condition | Yutong Chao, Jalal Etesami |
| QD-MAPPER: A Quality Diversity Framework to Automatically Evaluate Multi-Agent Path Finding Algorithms in Diverse Maps | Cheng Qian, Yulun Zhang, Varun Bhatt, Matthew Christopher Fontaine, Stefanos Nikolaidis, Jiaoyang Li |
| Token-level Advantage Policy Optimization from Negative Feedback in Multi-Turn Agents | Xufeng Zhou, Linjing Li, Daniel Dajun Zeng |
| Outer Diversity of Structured Domains | Piotr Faliszewski, Krzysztof Sornat, Stanisław Szufa, Tomasz Wąs |
| Population size effects on strategic classification dynamics | Marta C. Couto, Flavia Barsotti, Fernando P. Santos |
| On the Fair Allocation to Asymmetric Agents with Binary XOS Valuations | Ziheng Chen, Bo Li, Zihan Luo, Jialin Zhang |
| Stable Marriage on Networks | Miao Li, Xinwei Song, Dengji Zhao |
| Stability in Online Assignment Games | Emile Martinez, Felipe Garrido-Lucero, Umberto Grandi |
| Synthesis of Safety Specifications for Probabilistic Systems | Gaspard Ohlmann, Edwin Hamel-de le Court, Francesco Belardinelli |
| Multi-Agent Model-Based Reinforcement Learning with Joint State-Action Learned Embeddings | Zhizun Wang, David Meger |
| Enhancing Vision-Language Model Training with Reinforcement Learning in Synthetic Worlds for Real-World Success | George Bredis, Stanislav Dereka, Viacheslav Sinii, Ruslan Rakhimov, Daniil Gavrilov |
| D³MAS: Decompose, Deduce, and Distribute for Enhanced Knowledge Sharing in Multi-Agent Systems | Heng Zhang, Yuling Shi, Xiaodong Gu, Haochen You, Zijian Zhang, Lubin Gan, Yilei Yuan, Jin Huang |
| On Angels and Demons: Strategic (De)Construction of Dynamic Models | Davide Catta, Rustam Galimullin, Munyque Mittelmann |
| Obnoxious Facility Location Problems: Strategyproof Mechanisms Optimizing $L_p$-Aggregated Utilities and Costs | Hau Chan, Jianan Lin, Chenhao Wang |
| General Dynamic Goal Recognition using Goal-Conditioned and Meta Reinforcement Learning | Osher Elhadad, Owen Morrissey, Reuth Mirsky |
| Feature-based Uncertainty Model for School Choice | Yao Zhang, Makoto Yokoo |
| Enabling User Agency in Scalable Content Recommendations with Large Language Models | Yucheng Li, Gerrit J.J. Van den Burg, Wei Liu, Zhunxuan Wang, Abhishek Tripathi, Murat Sensoy |
| Reconstructing Network Outbreaks under Group Surveillance | Ritwick Mishra, Abhijin Adiga, Anil Vullikanti |
| IntRec: Intent-based Retrieval with Contrastive Refinement | Pourya Shamsolmoali, Masoumeh Zareapoor, Eric Granger, Yue Lu |
| Parallelized Planning-Acting for Multi-Agent LLM Systems in Minecraft | Yaoru Li, Shunyu Liu, Tongya Zheng, Li Sun, Mingli Song |
| Computational Aspects of Plan-Dependent Model Equivalence: The Case of Knowing-How Bisimulations | Carlos Areces, Raul Fervari, Antonio Mondejar |
| ReAcTree: Hierarchical LLM Agent Trees with Control Flow for Long-Horizon Task Planning | Jae-Woo Choi, Hyungmin Kim, Hyobin Ong, Youngwoo Yoon, Minsu Jang, Dohyung Kim, Jaehong Kim |
| PyVRP$^{+}$: LLM-Driven Metacognitive Heuristic Evolution for Hybrid Genetic Search in Vehicle Routing Problems | Manuj Malik, Jianan Zhou, Shashank Reddy Chirra, Zhiguang Cao |
| Maximizing the Egalitarian Welfare in Friends and Enemies Games | Edith Elkind, Michele Flammini, Giovanna Varricchio |
| Relationships and Connections between Definitions of Metric Proportional Representation | Yusuf Hakan Kalayci, David Kempe |
| The Landscape of Almost Equitable Allocations | Hadi Hosseini, Vishwa Prakash HV, Aditi Sethia, Jatin Yadav |
| Fair Division under Laminar Matroid Constraints for Three Agents | Sarfaraz Equbal |
| Integrating Counterfactual Simulations with Language Models for Explaining Multi-Agent Behaviour | Balint Gyevnar, Christopher G. Lucas, Stefano V. Albrecht, Shay B Cohen |
| AltNet: Addressing the Plasticity-Stability Dilemma in Reinforcement Learning | Mansi Maheshwari, John C. Raisbeck, Bruno Castro da Silva |
| Necessary President in Elections with Parties | Katarína Cechlárová, Ildikó Schlotter |
| Clone-Robust Weights in Metric Spaces: Handling Redundancy Bias in Benchmark Aggregation | Damien Berriaud, Roger Wattenhofer |
| Developing Guidelines for Human-LLM Agent Teams: A Multi-Stakeholder Lens | Mireia Yurrita, Davide Dell'Anna, Pradeep K. Murukannaiah, Catholijn M Jonker, Pinar Yolum |
| DEpiABS: Differentiable Epidemic Agent-Based Simulator | Zhijian Gao, Shuxin Li, Bo An |
| On-line Learning in Tree MDPs by Treating Policies as Bandit Arms | Anvay Shah, Ramsundar Anandanarayanan, Sharayu Moharir, Shivaram Kalyanakrishnan |
| Dual-Enhanced Model-Based Policy Optimization: Dynamic Bias-Shift Tradeoff and Adaptive Bidirectional Rollout | Yuetian Wang, Dianxi Shi, Huanhuan Yang, Yuanze Wang, Shiming Song, Chunping Qiu |
| MAStitch: Unifying Local and Global Perspectives for Anomaly Detection in Multi-Agent Systems | Lior Waknin, Yarin Yerushalmi Levi, Ron Solomon, Jaidip Kotak, Amit Giloni, Chiara Picardi, Roman Vainshtein, Yuval Elovici, Asaf Shabtai |
| Reducing Overestimation by Measuring Critic Disagreement in Multi-Critics Architectures | Nitsan Soffair, Gilad Katz |
| Large Language Models for Designing Participatory Budgeting Rules | Nguyen T Thach, Xingchen Sha, Hau Chan |
| Distributed Quantum Gaussian Processes for Multi-Agent Systems | Meet Gandhi, George P. Kontoudis |
| Constrained Multi-Agent Reinforcement Learning with MAF-Net for Safe Trajectory Planning | Bizhao Pang, Mingcheng Zhang, Xinting Hu, Thinh Pham, Sameer Alam, Guglielmo Lulli |
| Timing the Message: Language-Based Notifications for Time-Critical Assistive Settings | Ya-Chuan Hsu, Jonathan DeCastro, Andrew Silva, Guy Rosman |
| Beyond Outcome-Based Imperfect-Recall: Higher-Resolution Abstractions for Imperfect-Information Games | Yanchang Fu, Qiyue Yin, Shengda Liu, Pei Xu, Kaiqi Huang |
| Reputation as a Solution to Cooperation Collapse in LLM-based MASs | Siyue Ren, Wanli Fu, Xinkun Zou, Chen Shen, Yi Cai, Chu Chen, Zhen Wang, Shuyue Hu |
| Sim2Sea: Sim-to-Real Policy Transfer for Maritime Vessel Navigation in Congested Waters | Xinyu Cui, Xuanfa Jin, Xue Yan, Yongcheng Zeng, Luoyang Sun, Wei Siying, Ruizhi Zhang, Jian Zhao, Haifeng Zhang, Jun Wang |
| Procedural Knowledge Improves Agentic LLM Workflows | Vincent Hsiao, Mak Roberts, Leslie N. Smith |
| Think Fast! Learning to Control Online Reasoning in Stochastic Environments | Matthew Budd, Bruno Lacerda, Nick Hawes |
| GLEAR: A Graph Logic-Enhanced RAG Framework for Legal QA | Jingyun Sun, Jiaming Tian, Jie Shi, Yixin Zhang, Wenxi Sheng, Yang Li |
| Robust autobidding for noisy conversion prediction models | Andrey Pudovikov, Khirianova Alexandra, Ekaterina Solodneva, Gleb Molodtsov, Aleksandr Katrutsa, Yuriy Dorn, Egor Samosvat |
| Hierarchical Reward Design from Language: Enhancing Alignment of Agent Behavior with Human Specifications | Zhiqin Qian, Ryan Diaz, Sangwon Seo, Vaibhav Unhelkar |
| Generalized Per-Agent Advantage Estimation for Multi-Agent Policy Optimization | Seongmin Kim, Giseung Park, Woojun Kim, Jiwon Jeon, Seungyul Han, Youngchul Sung |
| AutoMETA: A Multi-Agent LLM System for Autonomous Meta-Analysis | Kunhee Ryu, Keeheon Lee |
| Learning to maintain safety through expert demonstrations in settings with unknown constraints: A Q-learning perspective | George Papadopoulos, George Vouros |
| On Minimal Achievable Quotas in Multiwinner Voting | Patrick Becker, Fabian Frank |
| Truthful Reverse Auctions for Adaptive Selection via Contextual Multi-Armed Bandits | Pronoy Patra, Sankarshan Damle, Manisha Padala, Sujit Gujar |
| Participation Incentives in Online Cooperative Games | Haris Aziz, Yuhang Guo, Zhaohong Sun |
| UAM-MARL: Uncertainty-Aware Modality-Enhanced Multi-Agent Reinforcement Learning with LLM-Guided Graph Policies | Zichen Song, Weijia Li |
| Building Large-Scale Drone Defenses from Small-Team Strategies | Grant Douglas, Stephen Franklin, Claudia Szabo, Mingyu Guo |
| Theory of Mind Guided Strategy Adaptation for Zero-Shot Coordination | Andrew Ni, Simon Stepputtis, Stefanos Nikolaidis, Michael Lewis, Katia P. Sycara, Woojun Kim |
| Guiding Neuro-Symbolic Scenario Generation with Spatio-Temporal Logic | Lorenzo Bonin, Francesco Giacomarra, Luca Bortolussi, Jyotirmoy V. Deshmukh, Francesca Cairoli |
| MoralityGym: A Benchmark for Evaluating Hierarchical Moral Alignment in Sequential Decision-Making Agents | Simon Rosen, Siddarth Singh, Ebenezer Gelo, Helen Robertson, Ibrahim Suder, Victoria Williams, Benjamin Rosman, Geraud Nangue Tasse, Steven James |
| Beyond Vibe Decision Theory: Asymmetric Manipulation Vulnerabilities in LLM Multi-Agent Coordination | Sukanya Krishna, Tobin South |
| Bi-Level Policy Optimization with Nyström Hypergradients | Arjun Prakash, Naicheng He, Denizalp Goktas, Amy Greenwald |
| Placing Green Bridges Optimally, with Close-Range Habitats in Sparse Graphs | Christian Wallisch, Till Fluschnik, Leon Kellerhals |
| SIGMAS: Second-Order Interaction-based Grouping for Overlapping Multi-Agent Swarms | Minah Lee, Saibal Mukhopadhyay |
| Efficiently Computing Approximate Nash Equilibria in Multi-Adversarial Team Games | Prasanna Maddila, Régis Sabbadin, Meritxell Vinyals |
| The Facility Location Problem with Aleatory Agents | Gennaro Auricchio, Jie Zhang |
| Deception and Communication in Autonomous Multi-Agent Systems: An Experimental Study with Among Us | Maria Milkowski, Tim Weninger |
| Team of Rivals: Hierarchical Deep Reinforcement Learning and Behavior Cloning for Multiplayer Poker | Avishag Shapira, Ido Rom, Asaf Shabtai, Gilad Katz |
| DELL: Dual-Knowledge Enhanced LLMs for Precise Decision Making in Healthcare | Danying Mo, Chao Yu, Xuan Lin, Zhongqi Wu, Yuheng Luo, Chen Bai, Chaojin Chen |
| Learning Hierarchical Procedural Memory for LLM Agents through Bayesian Selection and Contrastive Refinement | Saman Forouzandeh, Wei Peng, Parham Moradi, Xinghuo Yu, Mahdi Jalili |
| SocraticAgent: An Autonomous Agent for Unlocking Latent Knowledge in LLMs | Yang Yan, Yu Lu, Renjun Xu, Zhenzhong Lan |
| Bons-AI: An Agent-Based Model to evaluate the behavior of bonsai grower according to different levels of communication and experience | Sara Saori Satake, Guilherme Henrique de Souza Nakahata, Claus Aranha |
| DR2: Revisiting Visual Reinforcement Learning from the Dimensional Analysis Perspective | Chuxiong Sun, Jinli Chen, Zehua Zang, Jiangmeng Li, Rui Wang, Changwen Zheng |
| Conflict-Based Search for Multi Agent Path Finding with Asynchronous Actions | Xuemian Wu, Shizhe Zhao, Zhongqiang Ren |
| Sample-Efficient Policy Space Response Oracles with Joint Experience Best Response | Ariyan Bighashdel, Thiago D. Simão, Frans A Oliehoek |
| Social Welfare Maximization in Approval-Based Committee Voting under Uncertainty | Haris Aziz, Yuhang Guo, Venkateswara Rao Kagita, Baharak Rastegari, Mashbat Suzuki |
| Robustness of Stable Matchings When Attributes and Salience Determine Preferences | Amit Ronen, S. S. Ravi, Sarit Kraus |
| Individual Rationality in Constrained Hedonic Games: Additively Separable and Fractional Preferences | Foivos Fioravantes, Harmender Gahlawat, Nikolaos Melissinos, Šimon Schierreich |
| Node-Level Federated Learning with Adaptive Personalized Aggregation for Spatio-Temporal Traffic Prediction | Xiaoying Tu, Ying Lin, Xingjian Lu, Yibing Wang, Bo Hu |
| LLM Performance Predictors: Learning When to Escalate in Hybrid Human-AI Moderation Systems | Or Bachar, Or Levi, Sardhendu Mishra, Adi Levi, Manpreet Singh Minhas, Justin Miller, Omer Ben-Porat, Eilon Sheetrit, Jonathan Morra |
| Agents of Diffusion: Enhancing Diffusion Language Models with Multi-Agent Reinforcement Learning for Structured Data Generation | Aja Khanal, Kaushik Tushar Ranade, Rishabh Agrawal, Kalyan Shankar Basu, Apurva Narayan |
| Alternating-Time Temporal Logic with Dependent Strategies | Jessica L. Newman, Enrico H. Gerding, Enrico Marchioni, Baharak Rastegari |
| Issues with measuring task complexity via random policies in robotic tasks | Reabetswe M. Nkhumise, Mohamed S. Talamali, Aditya Gilra |
| Pareto-guided Pipeline for Distilling Featherweight AI Agents in Mobile MOBA Games | Xionghui Yang, Bozhou Chen, Yunlong Lu, Yongyi Wang, Lingfeng Li, Lanxiao Huang, Lin Liu, Wenjun Wang, Meng Meng, Xia Lin, Wenxin Li |
| Graph-Conditioned Diffusion for Offline Multi-Agent Reinforcement Learning | Luis Manuel Pimentel, Minwoo Cho, Sean Charles Ye, James E. G. Pagan, Matthew Craig Gombolay |
| Altruism and Fair Objective in Mixed-Motive Markov games | Franck XU, Tayeb LEMLOUMA, Arnaud Braud, Jean-Marie Bonnin |
| Safe Reinforcement Learning via Recovery-based Shielding with Gaussian Process Dynamics Models | Alex Goodall, Francesco Belardinelli |
| Learning to Price: Interpretable Attribute-Level Models for Dynamic Markets | Srividhya Sethuraman, Chandra Shekar Lakshminarayanan |
| R-Debater: Retrieval-Augmented Debate Generation through Argumentative Memory | Maoyuan Li, Zhongsheng Wang, Haoyuan Li, Jiamou Liu |
| OWLViz: An Open-World Benchmark for Visual Question Answering | Thuy Nguyen, Dang Nguyen, Hoàng Nguyễn, Thuan Duc Luong, Franck Dernoncourt, Long Hoang Dang, Viet Dac Lai |
| Exploring Cognitive Bias Impact, Detection and Mitigation in Large Language Models | Ana Gutiérrez-Mandingorra, Stella Heras, Javier Palanca, Vicent Botti |
| Federated Gaussian Process Learning via Pseudo-Representations for Large-Scale Multi-Robot Systems | Sanket A Salunkhe, George P. Kontoudis |
| Multigranular Alignment via Linguistic Decomposition and Reward Optimization for Text to Image Diffusion | Hengrui Liu, Luming Jin, Meng-Fen Chiang |
| Grassroots Federation: Fair Democratic Governance at Scale | Nimrod Talmon, Ehud Shapiro |
| DiffVAS: Diffusion-Guided Visual Active Search in Partially Observable Environments | Anindya Sarkar, Srikumar Sastry, Aleksis Pirinen, Nathan Jacobs, Yevgeniy Vorobeychik |
| Scalable and Safe Multi-Agent Coordination with Reconstructed Level-k Monte Carlo Tree Search | Zhihao Lin, Lin Wu, Zhen Tian, Alessio Lomuscio, Jianglin Lan |