Contributors Forks Stargazers Issues

Updated on 2025.06.28

Usage instructions: here

Agent

Publish Date Title Authors PDF Code
2025-06-26 Whole-Body Conditioned Egocentric Video Prediction Yutong Bai et.al. 2506.21552 null
2025-06-26 PsyLite Technical Report Fangjun Ding et.al. 2506.21536 null
2025-06-26 Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge Boyu Gou et.al. 2506.21506 null
2025-06-26 From multi-allocations to allocations, with subadditive valuations Uriel Feige et.al. 2506.21493 null
2025-06-26 Ad-Hoc Human-AI Coordination Challenge Tin Dizdarević et.al. 2506.21490 null
2025-06-26 Reinforcement Learning for Optimal Control of Spin Magnetometers Logan W. Cooke et.al. 2506.21475 null
2025-06-26 Agent-RewardBench: Towards a Unified Benchmark for Reward Modeling across Perception, Planning, and Safety in Real-World Multimodal Agents Tianyi Men et.al. 2506.21252 null
2025-06-26 Dynamic Risk-Aware MPPI for Mobile Robots in Crowds via Efficient Monte Carlo Approximations Elia Trevisan et.al. 2506.21205 null
2025-06-26 Artificial Delegates Resolve Fairness Issues in Perpetual Voting with Partial Turnout Apurva Shah et.al. 2506.21186 null
2025-06-26 Performance improvement of spatial semantic segmentation with enriched audio features and agent-based error correction for DCASE 2025 Challenge Task 4 Jongyeon Park et.al. 2506.21174 null
2025-06-25 MMSearch-R1: Incentivizing LMMs to Search Jinming Wu et.al. 2506.20670 null
2025-06-25 The Decrypto Benchmark for Multi-Agent Reasoning and Theory of Mind Andrei Lupu et.al. 2506.20664 null
2025-06-25 Memento: Note-Taking for Your Future Self Chao Wan et.al. 2506.20642 null
2025-06-25 Towards Community-Driven Agents for Machine Learning Engineering Sijie Li et.al. 2506.20640 null
2025-06-25 Model Editing as a Double-Edged Sword: Steering Agent Ethical Behavior Toward Beneficence or Harm Baixiang Huang et.al. 2506.20606 null
2025-06-25 Fine-Tuning and Prompt Engineering of LLMs, for the Creation of Multi-Agent AI for Addressing Sustainable Protein Production Challenges Alexander D. Kalian et.al. 2506.20598 null
2025-06-25 An Explicit Solution for the Problem of Optimal Investment with Random Endowment Michael Donisch et.al. 2506.20506 null
2025-06-25 Engineering Sentience Konstantin Demin et.al. 2506.20504 null
2025-06-25 Opinion Dynamics with Highly Oscillating Opinions Víctor A. Vargas-Pérez et.al. 2506.20472 null
2025-06-25 An Agentic System for Rare Disease Diagnosis with Traceable Reasoning Weike Zhao et.al. 2506.20430 null
2025-06-24 JoyAgents-R1: Joint Evolution Dynamics for Versatile Multi-LLM Agents with Reinforcement Learning Ai Han et.al. 2506.19846 null
2025-06-24 MAM: Modular Multi-Agent Framework for Multi-Modal Medical Diagnosis via Role-Specialized Collaboration Yucheng Zhou et.al. 2506.19835 null
2025-06-24 Curating art exhibitions using machine learning Eurico Covas et.al. 2506.19813 null
2025-06-24 LLM-Based Social Simulations Require a Boundary Zengqing Wu et.al. 2506.19806 null
2025-06-24 Learning Task Belief Similarity with Latent Dynamics for Meta-Reinforcement Learning Menglong Zhang et.al. 2506.19785 null
2025-06-24 SAGE: Strategy-Adaptive Generation Engine for Query Rewriting Teng Wang et.al. 2506.19783 null
2025-06-24 A Survey of Multi-sensor Fusion Perception for Embodied AI: Background, Methods, Challenges and Prospects Shulan Ruan et.al. 2506.19769 null
2025-06-24 From Reproduction to Replication: Evaluating Research Agents with Progressive Code Masking Gyeongwon James Kim et.al. 2506.19724 null
2025-06-24 A Survey of LLM-Driven AI Agent Communication: Protocols, Security Risks, and Defense Countermeasures Dezhang Kong et.al. 2506.19676 null
2025-06-24 How trust networks shape students’ opinions about the proficiency of artificially intelligent assistants Yutong Bu et.al. 2506.19655 null
2025-06-23 Audit & Repair: An Agentic Framework for Consistent Story Visualization in Text-to-Image Diffusion Models Kiymet Akdemir et.al. 2506.18900 null
2025-06-23 Steering Conceptual Bias via Transformer Latent-Subspace Activation Vansh Sharma et.al. 2506.18887 null
2025-06-23 GRAND-SLAM: Local Optimization for Globally Consistent Large-Scale Multi-Agent Gaussian SLAM Annika Thomas et.al. 2506.18885 null
2025-06-23 Broad Validity of the First-Order Approach in Moral Hazard Eduardo Azevedo et.al. 2506.18873 null
2025-06-23 Offline Goal-Conditioned Reinforcement Learning with Projective Quasimetric Planning Anthony Kobanda et.al. 2506.18847 null
2025-06-23 Understanding Software Engineering Agents: A Study of Thought-Action-Result Trajectories Islem Bouzenia et.al. 2506.18824 null
2025-06-23 Multi-Agent Online Control with Adversarial Disturbances Anas Barakat et.al. 2506.18814 null
2025-06-23 Fair Allocation with Money: What is Your Objective? Noga Klein Elmalem et.al. 2506.18794 null
2025-06-23 TRIZ Agents: A Multi-Agent LLM Approach for TRIZ-Based Innovation Kamil Szczepanik et.al. 2506.18783 null
2025-06-23 Temporal Neural Cellular Automata: Application to modeling of contrast enhancement in breast MRI Daniel M. Lang et.al. 2506.18720 null
2025-06-20 VLN-R1: Vision-Language Navigation via Reinforcement Fine-Tuning Zhangyang Qi et.al. 2506.17221 null
2025-06-20 Long-term Traffic Simulation with Interleaved Autoregressive Motion and Scenario Generation Xiuyu Yang et.al. 2506.17213 null
2025-06-20 Dissecting the SWE-Bench Leaderboards: Profiling Submitters and Architectures of LLM- and Agent-Based Repair Systems Matias Martinez et.al. 2506.17208 null
2025-06-20 Towards AI Search Paradigm Yuchen Li et.al. 2506.17188 null
2025-06-20 Capturing Misalignment Pierfrancesco Guarino et.al. 2506.17176 null
2025-06-20 A Note on Proper Relational Structures Adam Bjorndahl et.al. 2506.17142 null
2025-06-20 When Can Model-Free Reinforcement Learning be Enough for Thinking? Josiah P. Hanna et.al. 2506.17124 null
2025-06-20 A general multi-stratum model for a nanofunctionalized releasing capsule: a computational study Elia Onofri et.al. 2506.17078 null
2025-06-20 Behavior Driven Development for 3D Games Fernando Pastor Ricós et.al. 2506.17057 null
2025-06-20 Scalable and Reliable Multi-agent Reinforcement Learning for Traffic Assignment Leizhen Wang et.al. 2506.17029 null
2025-06-20 Embodied Web Agents: Bridging Physical-Digital Realms for Integrated Agent Intelligence Yining Hong et.al. 2506.15677 null
2025-06-18 Leaky Thoughts: Large Reasoning Models Are Not Private Thinkers Tommaso Green et.al. 2506.15674 link
2025-06-18 SwarmAgentic: Towards Fully Automated Agentic System Generation via Swarm Intelligence Yao Zhang et.al. 2506.15672 null
2025-06-18 PhishDebate: An LLM-Based Multi-Agent Framework for Phishing Website Detection Wenhao Li et.al. 2506.15656 null
2025-06-18 FindingDory: A Benchmark to Evaluate Memory in Embodied Agents Karmesh Yadav et.al. 2506.15635 null
2025-06-18 The Effect of State Representation on LLM Agent Behavior in Dynamic Routing Games Lyle Goodyear et.al. 2506.15624 null
2025-06-18 Multi-Agent, Multi-Scale Systems with the Koopman Operator Craig Bakker et.al. 2506.15589 null
2025-06-18 Learning to flock in open space by avoiding collisions and staying together Martino Brambati et.al. 2506.15587 null
2025-06-18 Managing Complex Failure Analysis Workflows with LLM-based Reasoning and Acting Agents Aline Dobrovsky et.al. 2506.15567 null
2025-06-18 Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning Roger Creus Castanyer et.al. 2506.15544 link
2025-06-17 RobotSmith: Generative Robotic Tool Design for Acquisition of Complex Manipulation Skills Chunru Lin et.al. 2506.14763 null
2025-06-17 Swarm-STL: A Framework for Motion Planning in Large-Scale, Multi-Swarm Systems Shiyu Cheng et.al. 2506.14749 null
2025-06-17 AgentDistill: Training-Free Agent Distillation with Generalizable MCP Boxes Jiahao Qiu et.al. 2506.14728 null
2025-06-17 Linear Planar 3-SAT and Its Applications in Planning Victorien Desbois et.al. 2506.14713 null
2025-06-17 AGENTSAFE: Benchmarking the Safety of Embodied Agents on Hazardous Instructions Aishan Liu et.al. 2506.14697 null
2025-06-17 Factor-Graph-Based Passive Acoustic Navigation for Decentralized Cooperative Localization Using Bearing Elevation Depth Difference Kalliyan Velasco et.al. 2506.14690 null
2025-06-17 Unified Software Engineering agent as AI Software Engineer Leonhard Applis et.al. 2506.14683 null
2025-06-17 StreetLens: Enabling Human-Centered AI Agents for Neighborhood Assessment from Street View Imagery Jina Kim et.al. 2506.14670 null
2025-06-17 SENIOR: Efficient Query Selection and Preference-Guided Exploration in Preference-based Reinforcement Learning Hexian Ni et.al. 2506.14648 null
2025-06-17 GenerationPrograms: Fine-grained Attribution with Executable Programs David Wan et.al. 2506.14580 null
2025-06-16 MARCO: Hardware-Aware Neural Architecture Search for Edge Devices with Multi-Agent Reinforcement Learning and Conformal Prediction Filtering Arya Fayyazi et.al. 2506.13755 null
2025-06-16 PB $^2$ : Preference Space Exploration via Population-Based Methods in Preference-Based Reinforcement Learning Brahim Driss et.al. 2506.13741 null
2025-06-16 The Courage to Stop: Overcoming Sunk Cost Fallacy in Deep Reinforcement Learning Jiashun Liu et.al. 2506.13672 null
2025-06-16 We Should Identify and Mitigate Third-Party Safety Risks in MCP-Powered Agent Systems Junfeng Fang et.al. 2506.13666 link
2025-06-16 Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning Shulin Tian et.al. 2506.13654 null
2025-06-16 xbench: Tracking Agents Productivity Scaling with Profession-Aligned Real-World Evaluations Kaiyuan Chen et.al. 2506.13651 null
2025-06-16 Deceptive Path Planning: A Bayesian Game Approach Violetta Rostobaya et.al. 2506.13650 null
2025-06-16 CAMS: A CityGPT-Powered Agentic Framework for Urban Human Mobility Simulation Yuwei Du et.al. 2506.13599 null
2025-06-16 Agent Capability Negotiation and Binding Protocol (ACNBP) Ken Huang et.al. 2506.13590 link
2025-06-16 Non-exchangeable mean-field theory for adaptive weights: propagation of chaos and graphon sampling lemma Datong Zhou et.al. 2506.13587 null
2025-06-13 Affogato: Learning Open-Vocabulary Affordance Grounding with Automated Data Generation at Scale Junha Lee et.al. 2506.12009 null
2025-06-13 Upgrade or Switch: Do We Need a New Registry Architecture for the Internet of AI Agents? Ramesh Raskar et.al. 2506.12003 null
2025-06-13 Self-Regulating Cars: Automating Traffic Control in Free Flow Road Networks Ankit Bhardwaj et.al. 2506.11973 null
2025-06-13 Visual Pre-Training on Unlabeled Images using Reinforcement Learning Dibya Ghosh et.al. 2506.11967 null
2025-06-13 Automated Treatment Planning for Interstitial HDR Brachytherapy for Locally Advanced Cervical Cancer using Deep Reinforcement Learning Mohammadamin Moradi et.al. 2506.11957 null
2025-06-13 Secure API-Driven Research Automation to Accelerate Scientific Discovery Tyler J. Skluzacek et.al. 2506.11950 null
2025-06-13 Breaking Habits: On the Role of the Advantage Function in Learning Causal State Representations Miguel Suau et.al. 2506.11912 null
2025-06-13 Palpation Alters Auditory Pain Expressions with Gender-Specific Variations in Robopatients Chapa Sirithunge et.al. 2506.11906 null
2025-06-13 An Explainable AI Framework for Dynamic Resource Management in Vehicular Network Slicing Haochen Sun et.al. 2506.11882 null
2025-06-13 Your Ride, Your Rules: Psychology and Cognition Enabled Automated Driving Systems Zhipeng Bao et.al. 2506.11842 null
2025-06-12 AutoMind: Adaptive Knowledgeable Agent for Automated Data Science Yixin Ou et.al. 2506.10974 link
2025-06-12 Eye, Robot: Learning to Look to Act with a BC-RL Perception-Action Loop Justin Kerr et.al. 2506.10968 null
2025-06-12 SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks Lianghong Guo et.al. 2506.10954 link
2025-06-12 Build the web for agents, not agents for the web Xing Han Lù et.al. 2506.10953 null
2025-06-12 Monitoring Decomposition Attacks in LLMs with Lightweight Sequential Monitors Chen Yueh-Han et.al. 2506.10949 link
2025-06-12 Execution Guided Line-by-Line Code Generation Boaz Lavon et.al. 2506.10948 link
2025-06-12 Dynamic Epistemic Friction in Dialogue Timothy Obiso et.al. 2506.10934 null
2025-06-12 Agentic Semantic Control for Autonomous Wireless Space Networks: Extending Space-O-RAN with MCP-Driven Distributed Intelligence Eduardo Baena et.al. 2506.10925 null
2025-06-12 Prediction and control of geometry-induced nematic order in growing multicellular systems Lukas Hupe et.al. 2506.10867 null
2025-06-12 CIIR@LiveRAG 2025: Optimizing Multi-Agent Retrieval Augmented Generation through Self-Training Alireza Salemi et.al. 2506.10844 link
2025-06-11 Flipping Against All Odds: Reducing LLM Coin Flip Bias via Verbalized Rejection Sampling Tim Z. Xiao et.al. 2506.09998 null
2025-06-11 SRLAgent: Enhancing Self-Regulated Learning Skills through Gamification and LLM Assistance Wentao Ge et.al. 2506.09968 null
2025-06-11 The Sample Complexity of Online Strategic Decision Making with Information Asymmetry and Knowledge Transportability Jiachen Hu et.al. 2506.09940 null
2025-06-11 On the Linear Programming Model for Dynamic Stochastic Matching and Its Application on Pricing Junlin Chen et.al. 2506.09924 null
2025-06-11 PersonaLens: A Benchmark for Personalization Evaluation in Conversational AI Assistants Zheng Zhao et.al. 2506.09902 link
2025-06-11 “What are my options?”: Explaining RL Agents with Diverse Near-Optimal Alternatives (Extended) Noel Brindise et.al. 2506.09901 null
2025-06-11 OctoNav: Towards Generalist Embodied Navigation Chen Gao et.al. 2506.09839 null
2025-06-11 Automatic Treatment Planning using Reinforcement Learning for High-dose-rate Prostate Brachytherapy Tonghe Wang et.al. 2506.09805 null
2025-06-11 Delegations as Adaptive Representation Patterns: Rethinking Influence in Liquid Democracy Davide Grossi et.al. 2506.09789 null
2025-06-11 Intelligent Design 4.0: Paradigm Evolution Toward the Agentic AI Era Shuo Jiang et.al. 2506.09755 null
2025-06-10 ALE-Bench: A Benchmark for Long-Horizon Objective-Driven Algorithm Engineering Yuki Imajuku et.al. 2506.09050 link
2025-06-10 VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning Li Kang et.al. 2506.09049 null
2025-06-10 Agentic Neural Networks: Self-Evolving Multi-Agent Systems via Textual Backpropagation Xiaowen Ma et.al. 2506.09046 null
2025-06-10 The Decoupled Risk Landscape in Performative Prediction Javier Sanguino et.al. 2506.09044 null
2025-06-10 Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Scheduling System Yuan Guo et.al. 2506.08972 null
2025-06-10 Towards Robust Deep Reinforcement Learning against Environmental State Perturbation Chenxu Wang et.al. 2506.08961 null
2025-06-10 What Limits Virtual Agent Application? OmniBench: A Scalable Multi-Dimensional Benchmark for Essential Virtual Agent Capabilities Wendong Bu et.al. 2506.08933 null
2025-06-10 Enhancing generalizability of model discovery across parameter space with multi-experiment equation learning (ME-EQL) Maria-Veronica Ciocanel et.al. 2506.08916 link
2025-06-10 Intention-Conditioned Flow Occupancy Models Chongyi Zheng et.al. 2506.08902 link
2025-06-10 Pairwise similarity method for majority domination problem N. I. Shushko et.al. 2506.08886 null
2025-06-09 GUI-Reflection: Empowering Multimodal GUI Models with Self-Reflection Behavior Penghao Wu et.al. 2506.08012 null
2025-06-09 Dreamland: Controllable World Creation with Simulator and Generative Models Sicheng Mo et.al. 2506.08006 null
2025-06-09 Supporting Construction Worker Well-Being with a Multi-Agent Conversational AI System Fan Yang et.al. 2506.07997 null
2025-06-09 $τ^2$ -Bench: Evaluating Conversational Agents in a Dual-Control Environment Victor Barres et.al. 2506.07982 link
2025-06-09 Realistic Urban Traffic Generator using Decentralized Federated Learning for the SUMO simulator Alberto Bazán-Guillén et.al. 2506.07980 null
2025-06-10 Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction Junhong Shen et.al. 2506.07976 link
2025-06-09 HeuriGym: An Agentic Benchmark for LLM-Crafted Heuristics in Combinatorial Optimization Hongzheng Chen et.al. 2506.07972 link
2025-06-09 Diffusion of Responsibility in Collective Decision Making Pavel Naumov et.al. 2506.07935 null
2025-06-09 LUCIFER: Language Understanding and Context-Infused Framework for Exploration and Behavior Refinement Dimitris Panagopoulos et.al. 2506.07915 null
2025-06-09 A distributed motion planning approach to cooperative underwater acoustic source tracking and pursuit Andrea Tiranti et.al. 2506.07877 null
2025-06-06 PersonaAgent: When Large Language Model Agents Meet Personalization at Test Time Weizhi Zhang et.al. 2506.06254 null
2025-06-06 Longer Lists Yield Better Matchings Yuri Faenza et.al. 2506.06217 null
2025-06-06 Can Theoretical Physics Research Benefit from Language Agents? Sirui Lu et.al. 2506.06214 null
2025-06-06 A Theoretical Study of (Hyper) Self-Attention through the Lens of Interactions: Representation, Training, Generalization Muhammed Ustaomeroglu et.al. 2506.06179 null
2025-06-06 Does It Run and Is That Enough? Revisiting Text-to-Chart Generation with a Multi-Agent Approach James Ford et.al. 2506.06175 null
2025-06-06 The Lock-in Hypothesis: Stagnation by Algorithm Tianyi Alex Qiu et.al. 2506.06166 null
2025-06-06 (AI peers) are people learning from the same standpoint: Perception of AI characters in a Collaborative Science Investigation Eunhye Grace Ko et.al. 2506.06165 null
2025-06-06 Personalized Large Language Models Can Increase the Belief Accuracy of Social Networks Adiba Mahbub Proma et.al. 2506.06153 null
2025-06-06 CCLSTM: Coupled Convolutional Long-Short Term Memory Network for Occupancy Flow Forecasting Peter Lengyel et.al. 2506.06128 null
2025-06-06 Reinforcement Learning Optimization for Large-Scale Learning: An Efficient and User-Friendly Scaling Library Weixun Wang et.al. 2506.06122 null
2025-06-05 Time to Talk: LLM Agents for Asynchronous Group Communication in Mafia Games Niv Eckhaus et.al. 2506.05309 link
2025-06-05 ProRefine: Inference-time Prompt Refinement with Textual Feedback Deepak Pandita et.al. 2506.05305 null
2025-06-05 Control Tax: The Price of Keeping AI in Check Mikhail Terekhov et.al. 2506.05296 null
2025-06-05 A Smooth Sea Never Made a Skilled $\texttt{SAILOR}$ : Robust Imitation via Learning to Search Arnav Kumar Jain et.al. 2506.05294 link
2025-06-05 Tight analyses of first-order methods with error feedback Daniel Berg Thomsen et.al. 2506.05271 link
2025-06-06 Teaming in the AI Era: AI-Augmented Frameworks for Forming, Simulating, and Optimizing Human Teams Mohammed Almutairi et.al. 2506.05265 null
2025-06-05 Conservative classifiers do consistently well with improving agents: characterizing statistical and online learning Dravyansh Sharma et.al. 2506.05252 null
2025-06-05 Towards Language-Augmented Multi-Agent Deep Reinforcement Learning Maxime Toquebiau et.al. 2506.05236 null
2025-06-05 A Framework for Ethical Judgment of Smart City Applications Weichen Shi et.al. 2506.05172 null
2025-06-05 An emergence-oriented approach to cyclic pursuit Zhaozhan Yao et.al. 2506.05157 null
2025-06-04 OWMM-Agent: Open World Mobile Manipulation With Multi-modal Agentic Data Synthesis Junting Chen et.al. 2506.04217 link
2025-06-04 Thinking Beyond Visibility: A Near-Optimal Policy Framework for Locally Interdependent Multi-Agent MDPs Alex DeWeese et.al. 2506.04215 null
2025-06-04 TracLLM: A Generic Framework for Attributing Long Context LLMs Yanting Wang et.al. 2506.04202 link
2025-06-04 MACS: Multi-Agent Reinforcement Learning for Optimization of Crystal Structures Elena Zamaraeva et.al. 2506.04195 null
2025-06-04 SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models Yuhao Wu et.al. 2506.04180 null
2025-06-04 A primal-dual price-optimization method for computing equilibrium prices in mean-field games models Xu Wang et.al. 2506.04169 link
2025-06-04 Image Editing As Programs with Diffusion Models Yujia Hu et.al. 2506.04158 null
2025-06-05 macOSWorld: A Multilingual Interactive Benchmark for GUI Agents Pei Yang et.al. 2506.04135 link
2025-06-04 TRiSM for Agentic AI: A Review of Trust, Risk, and Security Management in LLM-based Agentic Multi-Agent Systems Shaina Raza et.al. 2506.04133 null
2025-06-04 CLAIM: An Intent-Driven Multi-Agent Framework for Analyzing Manipulation in Courtroom Dialogues Disha Sheshanarayana et.al. 2506.04131 null
2025-06-03 GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents Qianhui Wu et.al. 2506.03143 null
2025-06-03 Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning Yinjie Wang et.al. 2506.03136 link
2025-06-03 Designing Algorithmic Delegates: The Role of Indistinguishability in Human-AI Handoff Sophie Greenwood et.al. 2506.03102 null
2025-06-03 EgoVLM: Policy Optimization for Egocentric Video Understanding Ashwin Vinod et.al. 2506.03097 link
2025-06-03 DPO Learning with LLMs-Judge Signal for Computer Use Agents Man Luo et.al. 2506.03095 null
2025-06-03 Provable Reinforcement Learning from Human Feedback with an Unknown Link Function Qining Zhang et.al. 2506.03066 null
2025-06-03 MAEBE: Multi-Agent Emergent Behavior Framework Sinem Erisken et.al. 2506.03053 null
2025-06-03 EDEN: Entorhinal Driven Egocentric Navigation Toward Robotic Deployment Mikolaj Walczak et.al. 2506.03046 null
2025-06-03 Towards Analyzing and Understanding the Limitations of VAPO: A Theoretical Perspective Jintian Shao et.al. 2506.03038 null
2025-06-03 TestAgent: An Adaptive and Intelligent Expert for Human Assessment Junhao Yu et.al. 2506.03032 null
2025-05-30 Open CaptchaWorld: A Comprehensive Web-based Platform for Testing and Benchmarking Multimodal LLM Agents Yaxin Luo et.al. 2505.24878 null
2025-05-30 Agent-X: Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic Tasks Tajamul Ashraf et.al. 2505.24876 link
2025-05-30 VideoCAD: A Large-Scale Video Dataset for Learning UI Interactions and 3D Reasoning from CAD Software Brandon Man et.al. 2505.24838 link
2025-05-30 Draw ALL Your Imagine: A Holistic Benchmark and Agent Framework for Complex Instruction-based Image Generation Yucheng Zhou et.al. 2505.24787 link
2025-06-02 EXP-Bench: Can AI Conduct AI Research Experiments? Patrick Tser Jern Kon et.al. 2505.24785 link
2025-05-30 Emergent Dynamics of Active Systems on Curved Environments Euan D. Mackay et.al. 2505.24730 null
2025-05-30 CoRet: Improved Retriever for Code Editing Fabio Fehr et.al. 2505.24715 null
2025-05-30 Causal-aware Large Language Models: Enhancing Decision-Making Through Learning, Adapting and Acting Wei Chen et.al. 2505.24710 link
2025-05-30 Towards a unified user modeling language for engineering human centered AI systems Aaron Conrardy et.al. 2505.24697 null
2025-05-30 Multiple LLM Agents Debate for Equitable Cultural Alignment Dayeon Ki et.al. 2505.24671 link
2025-05-29 From Chat Logs to Collective Insights: Aggregative Question Answering Wentao Zhang et.al. 2505.23765 null
2025-05-29 ZeroGUI: Automating Online GUI Learning at Zero Human Cost Chenyu Yang et.al. 2505.23762 link
2025-05-29 ThinkGeo: Evaluating Tool-Augmented Agents for Remote Sensing Tasks Akashah Shabbir et.al. 2505.23752 link
2025-05-29 ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering Zexi Liu et.al. 2505.23723 link
2025-05-29 COBRA: Contextual Bandit Algorithm for Ensuring Truthful Strategic Agents Arun Verma et.al. 2505.23720 null
2025-05-29 From Connectivity to Autonomy: The Dawn of Self-Evolving Communication Systems Zeinab Nezami et.al. 2505.23710 null
2025-05-29 Data-to-Dashboard: Multi-Agent LLM Framework for Insightful Visualization in Enterprise Analytics Ran Zhang et.al. 2505.23695 link
2025-05-29 ROTATE: Regret-driven Open-ended Training for Ad Hoc Teamwork Caroline Wang et.al. 2505.23686 link
2025-05-29 GSO: Challenging Software Optimization Tasks for Evaluating SWE-Agents Manish Shetty et.al. 2505.23671 link
2025-05-29 Initial Luminally Deposited FGF4 Critically Influences Blastocyst Patterning Michael A. Ramirez-Sierra et.al. 2505.23650 null
2025-05-28 3DLLM-Mem: Long-Term Spatial-Temporal Memory for Embodied 3D Large Language Model Wenbo Hu et.al. 2505.22657 null
2025-05-28 Position: Uncertainty Quantification Needs Reassessment for Large-language Model Agents Michael Kirchhof et.al. 2505.22655 null
2025-05-28 WebDancer: Towards Autonomous Information Seeking Agency Jialong Wu et.al. 2505.22648 link
2025-05-29 FastTD3: Simple, Fast, and Capable Reinforcement Learning for Humanoid Control Younggyo Seo et.al. 2505.22642 null
2025-05-28 LabUtopia: High-Fidelity Simulation and Hierarchical Benchmark for Scientific Embodied Agents Rui Li et.al. 2505.22634 null
2025-05-28 HDDLGym: A Tool for Studying Multi-Agent Hierarchical Problems Defined in HDDL with OpenAI Gym Ngoc La et.al. 2505.22597 link
2025-05-28 GitGoodBench: A Novel Benchmark For Evaluating Agentic Performance On Git Tobias Lindenbauer et.al. 2505.22583 link
2025-05-29 Agent-UniRAG: A Trainable Open-Source LLM Agent Framework for Unified Retrieval-Augmented Generation Systems Hoang Pham et.al. 2505.22571 null
2025-05-28 Universal Visuo-Tactile Video Understanding for Embodied Interaction Yifan Xie et.al. 2505.22566 null
2025-05-28 Training RL Agents for Multi-Objective Network Defense Tasks Andres Molina-Markham et.al. 2505.22531 null
2025-05-27 Silence is Not Consensus: Disrupting Agreement Bias in Multi-Agent LLMs via Catfish Agent for Clinical Decision Making Yihan Wang et.al. 2505.21503 null
2025-05-27 AdInject: Real-World Black-Box Attacks on Web Agents via Advertising Delivery Haowei Wang et.al. 2505.21499 link
2025-05-27 Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers Wei Pang et.al. 2505.21497 link
2025-05-27 UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents Han Xiao et.al. 2505.21496 link
2025-05-27 Robust Hypothesis Generation: LLM-Automated Language Bias for Inductive Logic Programming Yang Yang et.al. 2505.21486 null
2025-05-27 Scaling External Knowledge Input Beyond Context Windows of LLMs via Multi-Agent Collaboration Zijun Liu et.al. 2505.21471 link
2025-05-27 Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO Muzhi Zhu et.al. 2505.21457 null
2025-05-27 Learning Individual Behavior in Agent-Based Models with Graph Diffusion Networks Francesco Cozzi et.al. 2505.21426 link
2025-05-27 GUARD:Dual-Agent based Backdoor Defense on Chain-of-Thought in Neural Code Generation Naizhu Jin et.al. 2505.21425 null
2025-05-27 Autonomous Multi-Modal LLM Agents for Treatment Planning in Focused Ultrasound Ablation Surgery Lina Zhao et.al. 2505.21418 null
2025-05-27 MineAnyBuild: Benchmarking Spatial Planning for Open-world AI Agents Ziming Wei et.al. 2505.20148 link
2025-05-26 Agentic 3D Scene Generation with Spatially Contextualized VLMs Xinhang Liu et.al. 2505.20129 null
2025-05-26 Iterative Self-Incentivization Empowers Large Language Models as Agentic Searchers Zhengliang Shi et.al. 2505.20128 link
2025-05-26 Agentic AI Process Observability: Discovering Behavioral Variability Fabiana Fournier et.al. 2505.20127 null
2025-05-26 Agents Require Metacognitive and Strategic Reasoning to Succeed in the Coming Labor Markets Simpson Zhang et.al. 2505.20120 null
2025-05-27 TrojanStego: Your Language Model Can Secretly Be A Steganographic Privacy Leaking Agent Dominik Meier et.al. 2505.20118 link
2025-05-26 MA-RAG: Multi-Agent Retrieval-Augmented Generation via Collaborative Chain-of-Thought Reasoning Thang Nguyen et.al. 2505.20096 null
2025-05-26 SwarmThinkers: Learning Physically Consistent Atomic KMC Transitions at Scale Qi Li et.al. 2505.20094 null
2025-05-26 REARANK: Reasoning Re-ranking Agent via Reinforcement Learning Le Zhang et.al. 2505.20046 link
2025-05-26 Training LLM-Based Agents with Synthetic Self-Reflected Trajectories and Partial Masking Yihan Chen et.al. 2505.20023 null
2025-05-23 Lost in the Haystack: Smaller Needles are More Difficult for LLMs to Find Owen Bianchi et.al. 2505.18148 null
2025-05-23 Stochastic agent-based Monte Carlo simulations for reaction-diffusion models, population dynamics, and epidemic spreading Mohamed Swailem et.al. 2505.18145 null
2025-05-23 Gaming Tool Preferences in Agentic LLMs Kazem Faghih et.al. 2505.18135 link
2025-05-23 ProgRM: Build Better GUI Agents with Progress Rewards Danyang Zhang et.al. 2505.18121 null
2025-05-23 Facility Location with Public Locations and Private Doubly-Peaked Costs Richard Cole et.al. 2505.18114 null
2025-05-23 ManuSearch: Democratizing Deep Search in Large Language Models with a Transparent and Open Multi-Agent Framework Lisheng Huang et.al. 2505.18105 link
2025-05-23 Planning without Search: Refining Frontier LLMs with Offline Goal-Conditioned RL Joey Hong et.al. 2505.18098 null
2025-05-23 Deep Video Discovery: Agentic Search with Tool Use for Long-form Video Understanding Xiaoyi Zhang et.al. 2505.18079 null
2025-05-23 Linear Mixture Distributionally Robust Markov Decision Processes Zhishuai Liu et.al. 2505.18044 null
2025-05-23 Towards Analyzing and Understanding the Limitations of VAPO: A Theoretical Perspective Jintian Shao et.al. 2505.17997 null
2025-05-22 SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding Haoning Wu et.al. 2505.17012 link
2025-05-22 X-MAS: Towards Building Multi-Agent Systems with Heterogeneous LLMs Rui Ye et.al. 2505.16997 link
2025-05-22 MASLab: A Unified and Comprehensive Codebase for LLM-based Multi-Agent Systems Rui Ye et.al. 2505.16988 link
2025-05-22 T1: A Tool-Oriented Conversational Dataset for Multi-Turn Agentic Planning Amartya Chakraborty et.al. 2505.16986 null
2025-05-22 Beyond Correlation: Towards Causal Large Language Model Agents in Biomedicine Adib Bazgir et.al. 2505.16982 null
2025-05-22 Know the Ropes: A Heuristic Strategy for LLM-based Multi-Agent System Design Zhenkun Li et.al. 2505.16979 null
2025-05-22 SWE-Dev: Evaluating and Training Autonomous Feature-Driven Software Development Yaxin Du et.al. 2505.16975 link
2025-05-22 Modeling Inequality in Complex Networks of Strategic Agents using Iterative Game-Theoretic Transactions Mayank Kejriwal et.al. 2505.16966 null
2025-05-22 Cracking Aegis: An Adversarial LLM-based Game for Raising Awareness of Vulnerabilities in Privacy Protection Jiaying Fu et.al. 2505.16954 null
2025-05-22 A Comprehensive Evaluation of Contemporary ML-Based Solvers for Combinatorial Optimization Shengyu Feng et.al. 2505.16952 null
2025-05-22 GUI-G1: Understanding R1-Zero-Like Training for Visual Grounding in GUI Agents Yuqi Zhou et.al. 2505.15810 link
2025-05-21 The Agentic Economy David M. Rothschild et.al. 2505.15799 null
2025-05-22 HCRMP: A LLM-Hinted Contextual Reinforcement Learning Framework for Autonomous Driving Zhiwen Chen et.al. 2505.15793 null
2025-05-21 Solving General-Utility Markov Decision Processes in the Single-Trial Regime with Online Planning Pedro P. Santos et.al. 2505.15782 null
2025-05-21 Alignment Under Pressure: The Case for Informed Adversaries When Evaluating LLM Defenses Xiaoxue Yang et.al. 2505.15738 link
2025-05-21 DEBATE, TRAIN, EVOLVE: Self Evolution of Language Model Reasoning Gaurav Srivastava et.al. 2505.15734 null
2025-05-21 Quantum Dots as Functional Nanosystems for Enhanced Biomedical Applications Pronama Biswas et.al. 2505.15705 null
2025-05-21 HAMF: A Hybrid Attention-Mamba Framework for Joint Scene Context Understanding and Future Motion Representation Learning Xiaodong Mei et.al. 2505.15703 null
2025-05-21 Average Reward Reinforcement Learning for Omega-Regular and Mean-Payoff Objectives Milad Kazemi et.al. 2505.15693 null
2025-05-21 From Grounding to Manipulation: Case Studies of Foundation Model Integration in Embodied Robotic Systems Xiuchao Sui et.al. 2505.15685 link
2025-05-20 NExT-Search: Rebuilding User Feedback Ecosystem for Generative AI Search Sunhao Dai et.al. 2505.14680 null
2025-05-20 ContextAgent: Context-Aware Proactive LLM Agents with Open-World Sensory Perceptions Bufang Yang et.al. 2505.14668 null
2025-05-20 AI Agents in the Electricity Market Game with Cryptocurrency Transactions: A Post-Terminator Analysis Microsoft Copilot et.al. 2505.14612 null
2025-05-20 Agent Context Protocols Enhance Collective Inference Devansh Bhardwaj et.al. 2505.14569 null
2025-05-20 Multi-agent Reinforcement Learning vs. Fixed-Time Control for Traffic Signal Optimization: A Simulation Study Saahil Mahato et.al. 2505.14544 link
2025-05-20 A Logic of General Attention Using Edge-Conditioned Event Models (Extended Version) Gaia Belardinelli et.al. 2505.14539 null
2025-05-20 Energy-Efficient Deep Reinforcement Learning with Spiking Transformers Mohammad Irfan Uddin et.al. 2505.14533 null
2025-05-20 BACON: A fully explainable AI model with graded logic for decision making problems Haishi Bai et.al. 2505.14510 null
2025-05-20 Design and Evaluation of a Microservices Cloud Framework for Online Travel Platforms Biman Barua et.al. 2505.14508 null
2025-05-20 Security of Distributed Gradient Descent Against Byzantine Agents Sribalaji C. Anand et.al. 2505.14473 null
2025-05-19 G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning Liang Chen et.al. 2505.13426 link
2025-05-20 A Dataless Reinforcement Learning Approach to Rounding Hyperplane Optimization for Max-Cut Gabriel Malikal et.al. 2505.13405 null
2025-05-19 Robin: A multi-agent system for automating scientific discovery Ali Essam Ghareeb et.al. 2505.13400 null
2025-05-19 Rethinking Stateful Tool Use in Multi-Turn Dialogues: Benchmarks and Challenges Hongru Wang et.al. 2505.13328 null
2025-05-19 Synthesis of Communication Policies for Multi-Agent Systems Robust to Communication Restrictions Saleh Soudijani et.al. 2505.13311 null
2025-05-19 TimeSeriesGym: A Scalable Benchmark for (Time Series) Machine Learning Engineering Agents Yifu Cai et.al. 2505.13291 link
2025-05-19 Hybrid Voting-Based Task Assignment in Modular Construction Scenarios Daniel Weiner et.al. 2505.13278 null
2025-05-19 From Automation to Autonomy: A Survey on Large Language Models in Scientific Discovery Tianshi Zheng et.al. 2505.13259 link
2025-05-19 Effective and Transparent RAG: Adaptive-Reward Reinforcement Learning for Decision Traceability Jingyi Ren et.al. 2505.13258 link
2025-05-19 Composing Dextrous Grasping and In-hand Manipulation via Scoring with a Reinforcement Learning Critic Lennart Röstel et.al. 2505.13253 null
2025-05-16 Automatic Reward Shaping from Confounded Offline Data Mingxuan Li et.al. 2505.11478 null
2025-05-16 Signal attenuation enables scalable decentralized multi-agent reinforcement learning over networks Wesley A Suttle et.al. 2505.11461 null
2025-05-16 Robust Equilibria in Shared Resource Allocation via Strengthening Border’s Theorem David X. Lin et.al. 2505.11431 null
2025-05-16 Can AI automatically analyze public opinion? A LLM agents-based agentic pipeline for timely public opinion analysis Jing Liu et.al. 2505.11401 null
2025-05-16 Dynam3D: Dynamic Layered 3D Tokens Empower VLM for Vision-and-Language Navigation Zihan Wang et.al. 2505.11383 link
2025-05-16 GuideBench: Benchmarking Domain-Oriented Guideline Following for LLM Agents Lingxiao Diao et.al. 2505.11368 null
2025-05-16 Long-Term Average Impulse Control with Mean Field Interactions K. L. Helmes et.al. 2505.11345 null
2025-05-16 Explaining Strategic Decisions in Multi-Agent Reinforcement Learning for Aerial Combat Tactics Ardian Selmonaj et.al. 2505.11311 null
2025-05-16 Diffusion Learning with Partial Agent Participation and Local Updates Elsa Rizk et.al. 2505.11307 null
2025-05-16 Meta-World+: An Improved, Standardized, RL Benchmark Reginald McLean et.al. 2505.11289 link
2025-05-15 Towards a Deeper Understanding of Reasoning Capabilities in Large Language Models Annie Wong et.al. 2505.10543 link
2025-05-15 Knowledge capture, adaptation and composition (KCAC): A framework for cross-task curriculum learning in robotic manipulation Xinrui Wang et.al. 2505.10522 null
2025-05-15 Fixing Incomplete Value Function Decomposition for Multi-Agent Reinforcement Learning Andrea Baisero et.al. 2505.10484 null
2025-05-15 Fine-tuning Diffusion Policies with Backpropagation Through Diffusion Timesteps Ningyuan Yang et.al. 2505.10482 null
2025-05-15 AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenge Ranjan Sapkota et.al. 2505.10468 null
2025-05-15 Bridging Theory and Perception in Fair Division: A Study on Comparative and Fair Share Notions Hadi Hosseini et.al. 2505.10433 null
2025-05-15 Aggregating Information and Preferences with Bounded-Size Deviations Qishen Han et.al. 2505.10388 null
2025-05-15 Multi-Agent Path Finding For Large Agents Is Intractable Artem Agafonov et.al. 2505.10387 null
2025-05-15 Plasticity as the Mirror of Empowerment David Abel et.al. 2505.10361 null
2025-05-15 Efficient Adaptation of Reinforcement Learning Agents to Sudden Environmental Change Jonathan Clifford Balloch et.al. 2505.10330 null
2025-05-14 Language Agents Mirror Human Causal Reasoning Biases. How Can We Help Them Think Like Scientists? Anthony GX-Chen et.al. 2505.09614 null
2025-05-14 WorldView-Bench: A Benchmark for Evaluating Global Cultural Perspectives in Large Language Models Abdullah Mushtaq et.al. 2505.09595 null
2025-05-14 Preserving Plasticity in Continual Learning with Adaptive Linearity Injection Seyed Roozbeh Razavi Rohani et.al. 2505.09486 null
2025-05-14 Streaming Multi-agent Pathfinding Mingkai Tang et.al. 2505.09472 link
2025-05-14 CXMArena: Unified Dataset to benchmark performance in realistic CXM Scenarios Raghav Garg et.al. 2505.09436 link
2025-05-15 Decentralized Nonlinear Model Predictive Control-Based Flock Navigation with Real-Time Obstacle Avoidance in Unknown Obstructed Environments Nuthasith Gerdpratoom et.al. 2505.09434 null
2025-05-14 Using Dopants as Agents to Probe Key Electronic Properties of Organic Semiconductors Artem Fediai et.al. 2505.09431 null
2025-05-14 Linear Search with Probabilistic Detection and Variable Speeds Jared Coleman et.al. 2505.09429 link
2025-05-15 SafePath: Conformal Prediction for Safe LLM-Based Autonomous Navigation Achref Doula et.al. 2505.09427 null
2025-05-14 The Influence of Human-inspired Agentic Sophistication in LLM-driven Strategic Reasoners Vince Trencsenyi et.al. 2505.09396 null
2025-05-14 Towards Autonomous UAV Visual Object Search in City Space: Benchmark and Agentic Methodology Yatai Ji et.al. 2505.08765 null
2025-05-13 Enhancing Software Development with Context-Aware Conversational Agents: A User Study on Developer Interactions with Chatbots Glaucia Melo et.al. 2505.08648 null
2025-05-13 TRAIL: Trace Reasoning and Agentic Issue Localization Darshan Deshpande et.al. 2505.08638 null
2025-05-13 Credit Assignment and Efficient Exploration based on Influence Scope in Multi-agent Reinforcement Learning Shuai Han et.al. 2505.08630 null
2025-05-13 OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning Zhaochen Su et.al. 2505.08617 link
2025-05-13 MC-Swarm: Minimal-Communication Multi-Agent Trajectory Planning and Deadlock Resolution for Quadrotor Swarm Yunwoo Lee et.al. 2505.08593 null
2025-05-14 Communication-Efficient Distributed Online Nonconvex Optimization with Time-Varying Constraints Kunpeng Zhang et.al. 2505.08592 null
2025-05-13 The Truth Becomes Clearer Through Debate! Multi-Agent Systems with Large Language Models Unmask Fake News Yuhan Liu et.al. 2505.08532 null
2025-05-13 Strategy-Augmented Planning for Large Language Models via Opponent Exploitation Shuai Xu et.al. 2505.08459 link
2025-05-13 Zero-Shot Sim-to-Real Reinforcement Learning for Fruit Harvesting Emlyn Williams et.al. 2505.08458 null
2025-05-12 Imagine, Verify, Execute: Memory-Guided Agentic Exploration with Vision-Language Models Seungjae Lee et.al. 2505.07815 null
2025-05-12 A Theoretical Framework for Explaining Reinforcement Learning with Shapley Values Daniel Beechey et.al. 2505.07797 link
2025-05-12 MLE-Dojo: Interactive Environments for Empowering LLM Agents in Machine Learning Engineering Rushi Qiang et.al. 2505.07782 link
2025-05-12 Multi-Agent Path Finding via Finite-Horizon Hierarchical Factorization Jiarui Li et.al. 2505.07779 null
2025-05-12 Agent RL Scaling Law: Agent RL with Spontaneous Code Execution for Mathematical Problem Solving Xinji Mai et.al. 2505.07773 link
2025-05-12 Emotion-Gradient Metacognitive RSI (Part I): Theoretical Foundations and Single-Agent Architecture Rintaro Ando et.al. 2505.07757 null
2025-05-13 VTutor for High-Impact Tutoring at Scale: Managing Engagement and Real-Time Multi-Screen Monitoring with P2P Connections Eason Chen et.al. 2505.07736 null
2025-05-13 Codifying Character Logic in Role-Playing Letian Peng et.al. 2505.07705 link
2025-05-12 Belief Injection for Epistemic Control in Linguistic State Space Sebastian Dumbrava et.al. 2505.07693 null
2025-05-12 Chronocept: Instilling a Sense of Time in Machines Krish Goel et.al. 2505.07637 link
2025-05-09 Robust Multi-Agent Decision-Making in Finite-Population Games Shinkyu Park et.al. 2505.06200 null
2025-05-09 Neuro-Symbolic Concepts Jiayuan Mao et.al. 2505.06191 null
2025-05-09 The Power of Matching for Online Fractional Hedonic Games Martin Bullinger et.al. 2505.06163 null
2025-05-09 Realistic Adversarial Attacks for Robustness Evaluation of Trajectory Prediction Models via Future State Perturbation Julian F. Schumann et.al. 2505.06134 link
2025-05-09 ELA-ZSON: Efficient Layout-Aware Zero-Shot Object Navigation Agent with Hierarchical Planning Jiawei Hou et.al. 2505.06131 null
2025-05-09 Oncolytic mechanisms and immunotherapeutic potential of Newcastle disease virus in cancer therapy Umar Ahmad et.al. 2505.06067 null
2025-05-09 Offline Multi-agent Reinforcement Learning via Score Decomposition Dan Qiao et.al. 2505.05968 null
2025-05-09 Learning Power Control Protocol for In-Factory 6G Subnetworks Uyoata E. Uyoata et.al. 2505.05967 null
2025-05-09 Cost-Effective, Low Latency Vector Search with Azure Cosmos DB Nitish Upreti et.al. 2505.05885 link
2025-05-09 Evolutionary ecology of words Reiji Suzuki et.al. 2505.05863 null
2025-05-08 RL-DAUNCE: Reinforcement Learning-Driven Data Assimilation with Uncertainty-Aware Constrained Ensembles Pouria Behnoudfar et.al. 2505.05452 null
2025-05-08 clem:todd: A Framework for the Systematic Benchmarking of LLM-Based Task-Oriented Dialogue System Realisations Chalamalasetti Kranti et.al. 2505.05445 null
2025-05-09 EcoAgent: An Efficient Edge-Cloud Collaborative Multi-Agent Framework for Mobile Automation Biao Yi et.al. 2505.05440 null
2025-05-08 Empowering Scientific Workflows with Federated Agents J. Gregory Pauloski et.al. 2505.05428 link
2025-05-08 Robustly optimal dynamics for active matter reservoir computing Mario U. Gaimann et.al. 2505.05420 null
2025-05-08 Weighted Envy-Freeness Revisited: Indivisible Resource and House Allocations Yuxi Liu et.al. 2505.05353 null
2025-05-08 Mapping User Trust in Vision Language Models: Research Landscape, Challenges, and Prospects Agnese Chiatti et.al. 2505.05318 null
2025-05-08 HEXGEN-TEXT2SQL: Optimizing LLM Inference Request Scheduling for Agentic Text-to-SQL Workflow You Peng et.al. 2505.05286 link
2025-05-09 Software Development Life Cycle Perspective: A Survey of Benchmarks for Code Large Language Models and Agents Kaixin Wang et.al. 2505.05283 null
2025-05-08 Enhancing Cooperative Multi-Agent Reinforcement Learning with State Modelling and Adversarial Exploration Andreas Kontogiannis et.al. 2505.05262 link
2025-05-07 Implicitly Aligning Humans and Autonomous Agents through Shared Task Abstractions Stéphane Aroca-Ouellette et.al. 2505.04579 link
2025-05-07 Optimal Deterministic Rendezvous in Labeled Lines Yann Bourreau et.al. 2505.04564 null
2025-05-07 Qualitative Analysis of $ω$ -Regular Objectives on Robust MDPs Ali Asadi et.al. 2505.04539 null
2025-05-07 Beyond Theorem Proving: Formulation, Framework and Benchmark for Formal Problem-Solving Qi Liu et.al. 2505.04528 null
2025-05-07 RLMiniStyler: Light-weight RL Style Agent for Arbitrary Sequential Neural Style Generation Jing Hu et.al. 2505.04424 link
2025-05-07 Consensus-Aware AV Behavior: Trade-offs Between Safety, Interaction, and Performance in Mixed Urban Traffic Mohammad Elayan et.al. 2505.04379 link
2025-05-07 Extending a Quantum Reinforcement Learning Exploration Policy with Flags to Connect Four Filipe Santos et.al. 2505.04371 null
2025-05-07 Benchmarking LLMs’ Swarm intelligence Kai Ruan et.al. 2505.04364 link
2025-05-07 Optimization Problem Solving Can Transition to Evolutionary Agentic Workflows Wenhao Li et.al. 2505.04354 null
2025-05-07 Resist Platform-Controlled AI Agents and Champion User-Centric Agent Advocates Sayash Kapoor et.al. 2505.04345 null
2025-05-06 Multi-Agent System for Comprehensive Soccer Understanding Jiayuan Rao et.al. 2505.03735 null
2025-05-06 WebGen-Bench: Evaluating LLMs on Generating Interactive and Functional Websites from Scratch Zimu Lu et.al. 2505.03733 link
2025-05-06 Critical habitat size of organisms diffusing with stochastic resetting Luiz Menon et.al. 2505.03727 null
2025-05-06 Meta-Optimization and Program Search using Language Models for Task and Motion Planning Denis Shcherba et.al. 2505.03725 null
2025-05-06 Accelerated Decentralized Constraint-Coupled Optimization: A Dual $^2$ Approach Jingwang Li et.al. 2505.03719 null
2025-05-06 Demonstrating ViSafe: Vision-enabled Safety for High-speed Detect and Avoid Parv Kapoor et.al. 2505.03694 null
2025-05-06 Location-Restricted Stable Matching Garret Castro et.al. 2505.03680 null
2025-05-06 CaRaFFusion: Improving 2D Semantic Segmentation with Camera-Radar Point Cloud Fusion and Zero-Shot Image Inpainting Huawei Sun et.al. 2505.03679 null
2025-05-06 Gap the (Theory of) Mind: Sharing Beliefs About Teammates’ Goals Boosts Collaboration Perception, Not Performance Yotam Amitai et.al. 2505.03674 null
2025-05-06 RoboOS: A Hierarchical Embodied Framework for Cross-Embodiment and Multi-Agent Collaboration Huajie Tan et.al. 2505.03673 link
2025-05-05 Scenethesis: A Language and Vision Agentic Framework for 3D Scene Generation Lu Ling et.al. 2505.02836 null
2025-05-05 AutoLibra: Agent Metric Induction from Open-Ended Feedback Hao Zhu et.al. 2505.02820 link
2025-05-05 Generating HomeAssistant Automations Using an LLM-based Chatbot Mathyas Giudici et.al. 2505.02802 null
2025-05-05 Recolorable Graph Exploration by an Oblivious Agent with Fewer Colors Shota Takahashi et.al. 2505.02789 null
2025-05-05 Brief Announcement: Minimizing Energy Solves Relative Majority with a Cubic Number of States in Population Protocols Tom-Lukas Breitkopf et.al. 2505.02785 null
2025-05-05 Merging plasmoids and nanojet-like ejections in a coronal current sheet Samrat Sen et.al. 2505.02733 null
2025-05-05 Graph Neural Network-Based Reinforcement Learning for Controlling Biological Networks: The GATTACA Framework Andrzej Mizera et.al. 2505.02712 link
2025-05-05 Technical Report: Evaluating Goal Drift in Language Model Agents Rauno Arike et.al. 2505.02709 null
2025-05-05 Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play Yemin Shi et.al. 2505.02707 link
2025-05-05 Exploring LLM-Powered Role and Action-Switching Pedagogical Agents for History Education in Virtual Reality Zihao Zhu et.al. 2505.02699 null
2025-05-02 Enhancing Diversity in Parallel Agents: A Maximum State Entropy Exploration Story Vincenzo De Paola et.al. 2505.01336 null
2025-05-02 Integration of Multi-Mode Preference into Home Energy Management System Using Deep Reinforcement Learning Mohammed Sumayli et.al. 2505.01332 null
2025-05-02 The Dance of the Sheared Eigenfunctions J. Oliveira-Cony et.al. 2505.01303 null
2025-05-02 Pattern formation using an intrinsic optimal control approach Tianhao Li et.al. 2505.01302 null
2025-05-02 Essential Workers at Risk: An Agent-Based Model (SAFE-ABM) with Bayesian Uncertainty Quantification Elizabeth B. Amona et.al. 2505.01243 null
2025-05-02 Bilateral Cognitive Security Games in Networked Control Systems under Stealthy Injection Attacks Anh Tung Nguyen et.al. 2505.01232 null
2025-05-02 Non-universal Impact of Cholesterol on Ionic Liquid-Membrane Interactions J. Gupta et.al. 2505.01230 null
2025-05-02 A Space-Time Trade-off for Fast Self-Stabilizing Leader Election in Population Protocols Henry Austin et.al. 2505.01210 null
2025-05-02 Explainable AI Based Diagnosis of Poisoning Attacks in Evolutionary Swarms Mehrdad Asadi et.al. 2505.01181 null
2025-05-02 Simulating Tertiary Educational Decision Dynamics: An Agent-Based Model for the Netherlands Jean-Paul Daemen et.al. 2505.01142 null
2025-05-01 Towards Autonomous Micromobility through Scalable Urban Simulation Wayne Wu et.al. 2505.00690 null
2025-05-01 Visual Test-time Scaling for GUI Agent Grounding Tiange Luo et.al. 2505.00684 link
2025-05-01 Rethinking Memory in AI: Taxonomy, Operations, Topics, and Future Directions Yiming Du et.al. 2505.00675 link
2025-05-01 A Finite-State Controller Based Offline Solver for Deterministic POMDPs Alex Schutz et.al. 2505.00596 link
2025-05-01 ParkDiffusion: Heterogeneous Multi-Agent Multi-Modal Trajectory Prediction for Automated Parking using Diffusion Models Jiarong Wei et.al. 2505.00586 null
2025-05-01 A continuum thermodynamic model of the influence of non-ionic surfactant on mass transfer from gas bubbles Dieter Bothe et.al. 2505.00581 null
2025-05-01 Directly Forecasting Belief for Reinforcement Learning with Delays Qingyuan Wu et.al. 2505.00546 link
2025-05-01 Emergence of Roles in Robotic Teams with Model Sharing and Limited Communication Ian O’Flynn et.al. 2505.00540 null
2025-05-01 Safety-Critical Traffic Simulation with Guided Latent Diffusion Model Mingxing Peng et.al. 2505.00515 null
2025-05-01 Variational OOD State Correction for Offline Reinforcement Learning Ke Jiang et.al. 2505.00503 null
2025-04-30 A Survey of Interactive Generative Video Jiwen Yu et.al. 2504.21853 null
2025-04-30 TRUST: An LLM-Based Dialogue System for Trauma Understanding and Structured Assessments Sichang Tu et.al. 2504.21851 null
2025-04-30 Characterizing AI Agents for Alignment and Governance Atoosa Kasirzadeh et.al. 2504.21848 null
2025-04-30 SWE-smith: Scaling Data for Software Engineering Agents John Yang et.al. 2504.21798 null
2025-04-30 WebThinker: Empowering Large Reasoning Models with Deep Research Capability Xiaoxi Li et.al. 2504.21776 link
2025-04-30 Is Intermediate Fusion All You Need for UAV-based Collaborative Perception? Jiuwu Hao et.al. 2504.21774 link
2025-04-30 LLM-based Interactive Imitation Learning for Robotic Manipulation Jonas Werner et.al. 2504.21769 link
2025-04-30 Asymptotic Analysis of Weighted Fair Division Pasin Manurangsi et.al. 2504.21728 null
2025-04-30 LLM-Empowered Embodied Agent for Memory-Augmented Task Planning in Household Robotics Marc Glocker et.al. 2504.21716 link
2025-04-30 Economic Inequality between Groups in an a priori Stratified Society Thiago Dias et.al. 2504.21703 null
2025-04-29 Toward Efficient Exploration by Large Language Model Agents Dilip Arumugam et.al. 2504.20997 null
2025-04-29 TesserAct: Learning 4D Embodied World Models Haoyu Zhen et.al. 2504.20995 null
2025-04-29 XPG-RL: Reinforcement Learning with Explainable Priority Guidance for Efficiency-Boosted Mechanical Search Yiting Zhang et.al. 2504.20969 null
2025-04-29 AegisLLM: Scaling Agentic Systems for Self-Reflective Defense in LLM Security Zikui Cai et.al. 2504.20965 link
2025-04-29 Opinion-Driven Decision-Making for Multi-Robot Navigation through Narrow Corridors Norah K. Alghamdi et.al. 2504.20947 null
2025-04-29 Improvements of Dark Experience Replay and Reservoir Sampling towards Better Balance between Consolidation and Plasticity Taisuke Kobayashi et.al. 2504.20932 null
2025-04-29 Exploiting inter-agent coupling information for efficient reinforcement learning of cooperative LQR Shahbaz P Qadri Syed et.al. 2504.20927 null
2025-04-29 Modeling AI-Human Collaboration as a Multi-Agent Adaptation Prothit Sen et.al. 2504.20903 link
2025-04-29 CBM-RAG: Demonstrating Enhanced Interpretability in Radiology Report Generation with Multi-Agent RAG and Concept Bottleneck Models Hasan Md Tusfiqur Alam et.al. 2504.20898 link
2025-04-29 Does Feedback Help in Bandits with Arm Erasures? Merve Karakas et.al. 2504.20894 null
2025-04-28 Towards Automated Scoping of AI for Social Good Projects Jacob Emmerson et.al. 2504.20010 null
2025-04-28 Simplified and Secure MCP Gateways for Enterprise AI Integration Ivo Brett et.al. 2504.19997 link
2025-04-28 TD-EVAL: Revisiting Task-Oriented Dialogue Evaluation by Combining Turn-Level Precision with Dialogue-Level Comparisons Emre Can Acikgoz et.al. 2504.19982 null
2025-04-28 On one generalization of stable allocations in a two-sided market Alexander V. Karzanov et.al. 2504.19978 null
2025-04-28 Securing Agentic AI: A Comprehensive Threat Model and Mitigation Framework for Generative AI Agents Vineeth Sai Narajala et.al. 2504.19956 null
2025-04-28 Securing GenAI Multi-Agent Systems Against Tool Squatting: A Zero Trust Registry-Based Approach Vineeth Sai Narajala et.al. 2504.19951 null
2025-04-28 Automated decision-making for dynamic task assignment at scale Riccardo Lo Bianco et.al. 2504.19933 link
2025-04-28 Can AI Agents Design and Implement Drug Discovery Pipelines? Khachik Smbatyan et.al. 2504.19912 null
2025-04-28 LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects Guangyi Liu et.al. 2504.19838 link
2025-04-28 PhenoAssistant: A Conversational Multi-Agent AI System for Automated Plant Phenotyping Feng Chen et.al. 2504.19818 link
2025-04-25 Instrumentation for Better Demonstrations: A Case Study Remko Proesmans et.al. 2504.18481 null
2025-04-25 Improved Dwell-times for Switched Nonlinear Systems using Memory Regression Extension Muzaffar Qureshi et.al. 2504.18457 null
2025-04-25 Generalization Guarantees for Multi-View Representation Learning and Application to Regularization via Gaussian Product Mixture Prior Milad Sefidgaran et.al. 2504.18455 null
2025-04-25 On monotone completion of risk markets: Limit results for incomplete risk markets Iman Khajepour et.al. 2504.18436 null
2025-04-25 LLMpatronous: Harnessing the Power of LLMs For Vulnerability Detection Rajesh Yarra et.al. 2504.18423 null
2025-04-25 Auto-SLURP: A Benchmark Dataset for Evaluating Multi-Agent Frameworks in Smart Personal Assistant Lei Shen et.al. 2504.18373 link
2025-04-25 Interpretable Affordance Detection on 3D Point Clouds with Probabilistic Prototypes Maximilian Xiling Li et.al. 2504.18355 null
2025-04-25 Revisiting Data Auditing in Large Vision-Language Models Hongyu Zhu et.al. 2504.18349 null
2025-04-25 Optimal Control of Sensor-Induced Illusions on Robotic Agents Lorenzo Medici et.al. 2504.18339 null
2025-04-25 Towards Adaptive Software Agents for Debugging Yacine Majdoub et.al. 2504.18316 null
2025-04-24 Robotic Task Ambiguity Resolution via Natural Language Interaction Eugenio Chisari et.al. 2504.17748 null
2025-04-24 Applied Sheaf Theory For Multi-agent Artificial Intelligence (Reinforcement Learning) Systems: A Prospectus Eric Schmid et.al. 2504.17700 null
2025-04-24 ‘The Boring and the Tedious’: Invisible Labour in India’s Gig-Economy Pratyay Suvarnapathaki et.al. 2504.17697 null
2025-04-24 Towards a HIPAA Compliant Agentic AI System in Healthcare Subash Neupane et.al. 2504.17669 null
2025-04-24 A Constraint Opinion Model Fabio Gadducci et.al. 2504.17605 null
2025-04-24 Mitigating xApp conflicts for efficient network slicing in 6G O-RAN: a graph convolutional-based attention network approach Sihem Bakri et.al. 2504.17590 null
2025-04-24 A Multi-Agent, Laxity-Based Aggregation Strategy for Cost-Effective Electric Vehicle Charging and Local Transformer Overload Prevention Kristoffer Christensen et.al. 2504.17575 null
2025-04-24 Cooperative Task Offloading through Asynchronous Deep Reinforcement Learning in Mobile Edge Computing for Future Networks Yuelin Liu et.al. 2504.17526 null
2025-04-24 Communication-Efficient Personalized Distributed Learning with Data and Node Heterogeneity Zhuojun Tian et.al. 2504.17520 null
2025-04-24 Plasticine: Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning Mingqi Yuan et.al. 2504.17490 null
2025-04-23 OptimAI: Optimization from Natural Language Using LLM-Powered AI Agents Raghav Thind et.al. 2504.16918 null
2025-04-23 Building A Secure Agentic AI Application Leveraging A2A Protocol Idan Habler et.al. 2504.16902 null
2025-04-23 Do Large Language Models know who did what to whom? Joseph M. Denning et.al. 2504.16884 null
2025-04-23 Hybrid Reinforcement Learning and Model Predictive Control for Adaptive Control of Hydrogen-Diesel Dual-Fuel Combustion Julian Bedei et.al. 2504.16875 null
2025-04-23 Monte Carlo Planning with Large Language Model for Text-Based Game Agents Zijing Shi et.al. 2504.16855 null
2025-04-23 Fair division of the replacement-units without an appraiser in urban renewal processes Noga Klein Elmalem et.al. 2504.16852 null
2025-04-23 MLOps Monitoring at Scale for Digital Platforms Yu Jeffrey Hu et.al. 2504.16789 null
2025-04-23 A Survey of AI Agent Protocols Yingxuan Yang et.al. 2504.16736 null
2025-04-24 DYNUS: Uncertainty-aware Trajectory Planner in Dynamic Unknown Environments Kota Kondo et.al. 2504.16734 null
2025-04-23 IRIS: Interactive Research Ideation System for Accelerating Scientific Discovery Aniketh Garikaparthi et.al. 2504.16728 link
2025-04-22 MR. Video: “MapReduce” is the Principle for Long Video Understanding Ziqi Pang et.al. 2504.16082 null
2025-04-22 LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities Thomas Schmied et.al. 2504.16078 null
2025-04-22 Guiding VLM Agents with Process Rewards at Inference Time for GUI Navigation Zhiyuan Hu et.al. 2504.16073 null
2025-04-22 ForesightNav: Learning Scene Imagination for Efficient Exploration Hardik Shah et.al. 2504.16062 link
2025-04-22 Reinforcement Learning and Metaheuristics for Feynman Integral Reduction Mao Zeng et.al. 2504.16045 null
2025-04-22 A Lagrangian Approach to Optimal Lotteries in Non-Convex Economies Chengfeng Shen et.al. 2504.15997 null
2025-04-22 Neuroadaptive Haptics: Comparing Reinforcement Learning from Explicit Ratings and Neural Signals for Adaptive XR Systems Lukas Gehrke et.al. 2504.15984 null
2025-04-22 Towards Test Generation from Task Description for Mobile Testing with Multi-modal Reasoning Hieu Huynh et.al. 2504.15917 link
2025-04-22 Learning the Spoofability of Limit Order Books With Interpretable Probabilistic Neural Networks Timothée Fabre et.al. 2504.15908 null
2025-04-22 A closer look at how large language models trust humans: patterns and biases Valeria Lerman et.al. 2504.15801 null
2025-04-21 Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs Chun-Hsiao Yeh et.al. 2504.15280 link
2025-04-21 Interpretable Locomotion Prediction in Construction Using a Memory-Driven LLM Agent With Chain-of-Thought Reasoning Ehsan Ahmadi et.al. 2504.15263 null
2025-04-21 FlowReasoner: Reinforcing Query-Level Meta-Agents Hongcheng Gao et.al. 2504.15257 link
2025-04-21 A Self-Improving Coding Agent Maxime Robeyns et.al. 2504.15228 null
2025-04-21 An experimental study of the influence of anonymous information on social media users Boleslaw K. Szymanski et.al. 2504.15215 null
2025-04-21 Fully Adaptive Stepsizes: Which System Benefit More – Centralized or Decentralized? Diyako Ghaderyan et.al. 2504.15196 null
2025-04-21 Behavioral Universe Network (BUN): A Behavioral Information-Based Framework for Complex Systems Wei Zhou et.al. 2504.15146 null
2025-04-21 Neural ATTF: A Scalable Solution to Lifelong Multi-Agent Path Planning Kushal Shah et.al. 2504.15130 null
2025-04-21 Contemplative Wisdom for Superalignment Ruben Laukkonen et.al. 2504.15125 null
2025-04-21 Fast-Slow Co-advancing Optimizer: Toward Harmonious Adversarial Training of GAN Lin Wang et.al. 2504.15099 null
2025-04-18 Science Hierarchography: Hierarchical Organization of Science Literature Muhan Gao et.al. 2504.13834 link
2025-04-18 LearnAct: Few-Shot Mobile GUI Agent with a Unified Demonstration Benchmark Guangyi Liu et.al. 2504.13805 null
2025-04-18 ChatNekoHacker: Real-Time Fan Engagement with Conversational Agents Takuya Sera et.al. 2504.13793 null
2025-04-21 BadApex: Backdoor Attack Based on Adaptive Optimization Mechanism of Black-box Large Language Models Zhengxian Wu et.al. 2504.13775 null
2025-04-18 $O(p \log d)$ Subgraph Isomorphism using Stigmergic Swarming Agents H. Van Dyke Parunak et.al. 2504.13722 null
2025-04-18 Stability of flocking in the reciprocal two-species Vicsek model: Effects of relative population, motility, and noise Aditya Kumar Dutta et.al. 2504.13709 null
2025-04-18 OpenDeception: Benchmarking and Investigating AI Deceptive Behaviors via Open-ended Interaction Simulation Yichen Wu et.al. 2504.13707 null
2025-04-18 Modelling Immunity in Agent-based Models Gray Manicom et.al. 2504.13706 null
2025-04-18 EyecareGPT: Boosting Comprehensive Ophthalmology Understanding with Tailored Dataset, Benchmark and Model Sijing Li et.al. 2504.13650 link
2025-04-18 Simulating Before Planning: Constructing Intrinsic User World Model for User-Tailored Dialogue Policy Planning Tao He et.al. 2504.13643 null
2025-04-17 Sleep-time Compute: Beyond Inference Scaling at Test-time Kevin Lin et.al. 2504.13171 link
2025-04-17 Exploring Expert Failures Improves LLM Agent Tuning Li-Cheng Lan et.al. 2504.13145 null
2025-04-17 Object-Driven Narrative in AR: A Scenario-Metaphor Framework with VLM Integration Yusi Sun et.al. 2504.13119 null
2025-04-17 Retrieval-Augmented Generation with Conflicting Evidence Han Wang et.al. 2504.13079 link
2025-04-17 InstructRAG: Leveraging Retrieval-Augmented Generation on Instruction Graphs for LLM-Based Task Planning Zheng Wang et.al. 2504.13032 null
2025-04-17 Why Ask One When You Can Ask $k$ ? Two-Stage Learning-to-Defer to a Set of Experts Yannis Montreuil et.al. 2504.12988 null
2025-04-17 QLLM: Do We Really Need a Mixing Network for Credit Assignment in Multi-Agent Reinforcement Learning? Zhouyang Jiang et.al. 2504.12961 null
2025-04-17 Are Retrials All You Need? Enhancing Large Language Model Reasoning Without Verbalized Feedback Nearchos Potamitis et.al. 2504.12951 null
2025-04-17 RL-PINNs: Reinforcement Learning-Driven Adaptive Sampling for Efficient Training of PINNs Zhenao Song et.al. 2504.12949 null
2025-04-18 Customizing Emotional Support: How Do Individuals Construct and Interact With LLM-Powered Chatbots Xi Zheng et.al. 2504.12943 null
2025-04-16 Adapting a World Model for Trajectory Following in a 3D Game Marko Tot et.al. 2504.12299 null
2025-04-16 Optimal flock formation induced by agent heterogeneity Arthur N. Montanari et.al. 2504.12297 link
2025-04-16 Advancing Arabic Speech Recognition Through Large-Scale Weakly Supervised Learning Mahmoud Salhab et.al. 2504.12254 null
2025-04-16 Data Assimilation for Robust UQ Within Agent-Based Simulation on HPC Systems Adam Spannaus et.al. 2504.12228 null
2025-04-16 Communication Optimization for Decentralized Learning atop Bandwidth-limited Edge Networks Tingyang Sun et.al. 2504.12210 null
2025-04-16 ARCeR: an Agentic RAG for the Automated Definition of Cyber Ranges Matteo Lupinacci et.al. 2504.12143 null
2025-04-16 Multilingual Contextualization of Large Language Models for Document-Level Machine Translation Miguel Moura Ramos et.al. 2504.12140 null
2025-04-16 The Social Learning Barrier Florian Brandl et.al. 2504.12136 null
2025-04-16 EmoACT: a Framework to Embed Emotions into Artificial Agents Based on Affect Control Theory Francesca Corrao et.al. 2504.12125 null
2025-04-16 Towards LLM Agents for Earth Observation Chia Hsiang Kao et.al. 2504.12110 null
2025-04-15 TextArena Leon Guertler et.al. 2504.11442 link
2025-04-15 Embodied World Models Emerge from Navigational Task in Open-Ended Environments Li Jin et.al. 2504.11419 null
2025-04-15 Cancer-Myth: Evaluating AI Chatbot on Patient Questions with False Presuppositions Wang Bill Zhu et.al. 2504.11373 link
2025-04-15 DataSentinel: A Game-Theoretic Detection of Prompt Injection Attacks Yupei Liu et.al. 2504.11358 link
2025-04-15 Learning to Be A Doctor: Searching for Effective Medical Agent Architectures Yangyang Zhuang et.al. 2504.11301 null
2025-04-15 Policy heterogeneity improves collective olfactory search in 3-D turbulence Lorenzo Piro et.al. 2504.11291 null
2025-04-15 The Obvious Invisible Threat: LLM-Powered GUI Agents’ Vulnerability to Fine-Print Injections Chaoran Chen et.al. 2504.11281 null
2025-04-15 Multi-Agent Reinforcement Learning for Greenhouse Gas Offset Credit Markets Liam Welsh et.al. 2504.11258 null
2025-04-16 UI-E2I-Synth: Advancing GUI Grounding with Large-Scale Instruction Synthesis Xinyi Liu et.al. 2504.11257 null
2025-04-15 A Rollout-Based Algorithm and Reward Function for Efficient Resource Allocation in Business Processes Jeroen Middelhuis et.al. 2504.11250 null
2025-04-14 The Price of Competitive Information Disclosure Siddhartha Banerjee et.al. 2504.10459 null
2025-04-15 GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents Xiaobo Xia et.al. 2504.10458 null
2025-04-14 RealWebAssist: A Benchmark for Long-Horizon Web Assistance with Real-World Users Suyu Ye et.al. 2504.10445 link
2025-04-14 Position Uncertainty in a Prisoner’s Dilemma Game : An Experiment Chowdhury Mohammad Sakib Anwar et.al. 2504.10441 null
2025-04-14 Silent Self-Stabilizing Ranking: Time Optimal and Space Efficient Petra Berenbrink et.al. 2504.10417 null
2025-04-14 Ctrl-Z: Controlling AI Agents via Resampling Aryan Bhatt et.al. 2504.10374 null
2025-04-14 Proteinoid spikes: from protocognitive to universal approximating agents Saksham Sharma et.al. 2504.10362 null
2025-04-14 Siamese Network with Dual Attention for EEG-Driven Social Learning: Bridging the Human-Robot Gap in Long-Tail Autonomous Driving Xiaoshan Zhou et.al. 2504.10296 null
2025-04-14 Characterizing LLM-driven Social Network: The Chirper.ai Case Yiming Zhu et.al. 2504.10286 null
2025-04-14 RealHarm: A Collection of Real-World Language Model Application Failures Pierre Le Jeune et.al. 2504.10277 link
2025-04-11 DocAgent: A Multi-Agent System for Automated Code Documentation Generation Dayu Yang et.al. 2504.08725 link
2025-04-11 SWE-PolyBench: A multi-language benchmark for repository level evaluation of coding agents Muhammad Shihab Rashid et.al. 2504.08703 link
2025-04-11 SeaView: Software Engineering Agent Visual Interface for Enhanced Workflow Timothy Bula et.al. 2504.08696 null
2025-04-11 TP-RAG: Benchmarking Retrieval-Augmented Large Language Model Agents for Spatiotemporal-Aware Travel Planning Hang Ni et.al. 2504.08694 null
2025-04-11 Voice Interaction With Conversational AI Could Facilitate Thoughtful Reflection and Substantive Revision in Writing Jiho Kim et.al. 2504.08687 null
2025-04-11 Do LLMs trust AI regulation? Emerging behaviour of game-theoretic LLM agents Alessio Buscemi et.al. 2504.08640 null
2025-04-11 Optimal selection of the most informative nodes for a noisy DeGroot model with stubborn agents Roberta Raineri et.al. 2504.08622 null
2025-04-11 MooseAgent: A LLM Based Multi-agent Framework for Automating Moose Simulation Tao Zhang et.al. 2504.08621 link
2025-04-11 Ready, Bid, Go! On-Demand Delivery Using Fleets of Drones with Unknown, Heterogeneous Energy Storage Constraints Mohamed S. Talamali et.al. 2504.08585 null
2025-04-11 FMLGS: Fast Multilevel Language Embedded Gaussians for Part-level Interactive Agents Xin Tan et.al. 2504.08581 null
2025-04-10 Fast Adaptation with Behavioral Foundation Models Harshit Sikchi et.al. 2504.07896 null
2025-04-10 Benchmarking Adversarial Robustness to Bias Elicitation in Large Language Models: Scalable Automated Assessment with LLM-as-a-Judge Riccardo Cantini et.al. 2504.07887 link
2025-04-11 An LLM-Driven Multi-Agent Debate System for Mendelian Diseases Xinyang Zhou et.al. 2504.07881 null
2025-04-10 Dual Engines of Thoughts: A Depth-Breadth Integration Framework for Open-Ended Analysis Fei-Hsuan Yu et.al. 2504.07872 null
2025-04-10 In itinere infections covertly undermine localized epidemic control in metapopulations Francesca Dilisante et.al. 2504.07849 null
2025-04-10 Anytime Single-Step MAPF Planning with Anytime PIBT Nayesha Gandotra et.al. 2504.07841 null
2025-04-10 Deceptive Automated Interpretability: Language Models Coordinating to Fool Oversight Systems Simon Lermen et.al. 2504.07831 null
2025-04-10 MOSAIC: Modeling Social AI for Content Dissemination and Regulation in Multi-Agent Simulations Genglin Liu et.al. 2504.07830 link
2025-04-10 Active Matter Flocking via Predictive Alignment Julian Giraldo-Barreto et.al. 2504.07778 null
2025-04-10 Synthesizing High-Quality Programming Tasks with LLM-based Expert and Student Agents Manh Hung Nguyen et.al. 2504.07655 null
2025-04-09 SkillWeaver: Web Agents can Self-Improve by Discovering and Honing Skills Boyuan Zheng et.al. 2504.07079 null
2025-04-09 A Unified Agentic Framework for Evaluating Conditional Image Generation Jifang Wang et.al. 2504.07046 link
2025-04-09 Review of Case-Based Reasoning for LLM Agents: Theoretical Foundations, Architectural Components, and Cognitive Integration Kostas Hatalis et.al. 2504.06943 null
2025-04-09 AI-Driven Consensus: Modeling Multi-Agent Networks with Long-Range Interactions through path-Laplacian Matrices Yusef Ahsini et.al. 2504.06894 link
2025-04-09 More connection, less community: network formation and local public goods provision Alastair Langtry et.al. 2504.06872 null
2025-04-09 Persona Dynamics: Unveiling the Impact of Personality Traits on Agents in Text-Based Games Seungwon Lim et.al. 2504.06868 link
2025-04-09 IAAO: Interactive Affordance Learning for Articulated Objects in 3D Environments Can Zhang et.al. 2504.06827 null
2025-04-09 Inducing Programmatic Skills for Agentic Tasks Zora Zhiruo Wang et.al. 2504.06821 link
2025-04-09 FamilyTool: A Multi-hop Personalized Tool Use Benchmark Yuxin Wang et.al. 2504.06766 link
2025-04-09 Adaptive Human-Robot Collaborative Missions using Hybrid Task Planning Gricel Vázquez et.al. 2504.06746 null
2025-04-08 FEABench: Evaluating Language Models on Multiphysics Reasoning Ability Nayantara Mudur et.al. 2504.06260 link
2025-04-08 The Work Capacity of Channels with Memory: Maximum Extractable Work in Percept-Action Loops Lukas J. Fiderer et.al. 2504.06209 null
2025-04-08 TxGemma: Efficient and Agentic LLMs for Therapeutics Eric Wang et.al. 2504.06196 null
2025-04-08 SkillFlow: Efficient Skill and Code Transfer Through Communication in Adapting AI Agents Pagkratios Tagkopoulos et.al. 2504.06188 null
2025-04-08 Linear Regulator-Based Synchronization of Positive Multi-Agent Systems Alba Gurpegui et.al. 2504.06169 null
2025-04-08 V-MAGE: A Game Evaluation Framework for Assessing Visual-Centric Capabilities in Multimodal Large Language Models Xiangxi Zheng et.al. 2504.06148 link
2025-04-08 Deploying Chatbots in Customer Service: Adoption Hurdles and Simple Remedies Evgeny Kagan et.al. 2504.06145 null
2025-04-08 A Multimedia Analytics Model for the Foundation Model Era Marcel Worring et.al. 2504.06138 null
2025-04-08 Decentralizing AI Memory: SHIMI, a Semantic Hierarchical Memory Index for Scalable Agent Reasoning Tooraj Helmi et.al. 2504.06135 null
2025-04-08 Accelerating Vehicle Routing via AI-Initialized Genetic Algorithms Ido Greenberg et.al. 2504.06126 null
2025-04-07 CREA: A Collaborative Multi-Agent Framework for Creative Content Generation with Diffusion Models Kavana Venkatesh et.al. 2504.05306 null
2025-04-07 How to evaluate control measures for LLM agents? A trajectory from today to superintelligence Tomek Korbak et.al. 2504.05259 null
2025-04-07 Rationalizing dynamic choices Henrique de Oliveira et.al. 2504.05251 null
2025-04-07 Reducing the Communication of Distributed Model Predictive Control: Autoencoders and Formation Control Torben Schiz et.al. 2504.05223 null
2025-04-07 DoCIA: An Online Document-Level Context Incorporation Agent for Speech Translation Xinglin Lyu et.al. 2504.05122 link
2025-04-07 AI for Climate Finance: Agentic Retrieval and Multi-Step Reasoning for Early Warning System Investments Saeid Ario Vaghefi et.al. 2504.05104 null
2025-04-07 AI-Driven Tactical Communications and Networking for Defense: A Survey and Emerging Trends Victor Monzon Baeza et.al. 2504.05071 null
2025-04-07 Debate Only When Necessary: Adaptive Multiagent Collaboration for Efficient LLM Reasoning Sugyeong Eo et.al. 2504.05047 null
2025-04-08 Attention-Augmented Inverse Reinforcement Learning with Graph Convolutions for Multi-Agent Task Allocation Huilin Yin et.al. 2504.05045 null
2025-04-07 Mixture-of-Personas Language Models for Population Simulation Ngoc Bui et.al. 2504.05019 null
2025-04-04 Bonsai: Interpretable Tree-Adaptive Grounded Reasoning Kate Sanders et.al. 2504.03640 null
2025-04-04 Epicast 2.0: A large-scale, demographically detailed, agent-based model for simulating respiratory pathogen spread in the United States Prescott C. Alexander et.al. 2504.03604 null
2025-04-04 APIGen-MT: Agentic Pipeline for Multi-Turn Data Generation via Simulated Agent-Human Interplay Akshara Prabhakar et.al. 2504.03601 null
2025-04-04 A Lower Bound on Conservative Elementary Object Systems Coverability Francesco Di Cosmo et.al. 2504.03591 null
2025-04-04 SynWorld: Virtual Scenario Synthesis for Agentic Action Knowledge Refinement Runnan Fang et.al. 2504.03561 link
2025-04-04 Agentic Knowledgeable Self-awareness Shuofei Qiao et.al. 2504.03553 link
2025-04-04 The Limits of “Fairness” of the Variational Generalized Nash Equilibrium Sophie Hall et.al. 2504.03540 null
2025-04-04 RANa: Retrieval-Augmented Navigation Gianluca Monaci et.al. 2504.03524 null
2025-04-04 Target Prediction Under Deceptive Switching Strategies via Outlier-Robust Filtering of Partially Observed Incomplete Trajectories Yiming Meng et.al. 2504.03502 null
2025-04-04 A stochastic volatility approximation for a tick-by-tick price model with mean-field interaction Paolo Dai Pra et.al. 2504.03445 null
2025-04-03 Unified World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets Chuning Zhu et.al. 2504.02792 null
2025-04-03 Sequential Binary Hypothesis Testing with Competing Agents under Information Asymmetry Aneesh Raghavan et.al. 2504.02743 null
2025-04-03 Responsible Development of Offensive AI Ryan Marinelli et.al. 2504.02701 link
2025-04-03 The Tension between Trust and Oversight in Long-term Relationships Peter Achim et.al. 2504.02696 null
2025-04-03 Handover and SINR-Aware Path Optimization in 5G-UAV mmWave Communication using DRL Achilles Kiwanuka Machumilane et.al. 2504.02688 null
2025-04-03 A Set-Theoretic Robust Control Approach for Linear Quadratic Games with Unknown Counterparts Francesco Bianchin et.al. 2504.02679 null
2025-04-03 Affordable AI Assistants with Knowledge Graph of Thoughts Maciej Besta et.al. 2504.02670 null
2025-04-03 SymDQN: Symbolic Knowledge and Reasoning in Neural Network-based Reinforcement Learning Ivo Amador et.al. 2504.02654 null
2025-04-04 Controlled Social Learning: Altruism vs. Bias Raghu Arghal et.al. 2504.02648 null
2025-04-03 Multi-Mission Tool Bench: Assessing the Robustness of LLM based Agents through Related and Dynamic Missions PeiJie Yu et.al. 2504.02623 link
2025-04-02 Graphon games and an idealized limit of large network games Motoki Otsuka et.al. 2504.01944 null
2025-04-02 Review, Refine, Repeat: Understanding Iterative Decoding of AI Agents with Dynamic Evaluation and Selection Souradip Chakraborty et.al. 2504.01931 null
2025-04-02 Gen-C: Populating Virtual Worlds with Generative Crowds Andreas Panayiotou et.al. 2504.01924 null
2025-04-02 Advancing AI-Scientist Understanding: Making LLM Think Like a Physicist with Interpretable Reasoning Yinggan Xu et.al. 2504.01911 null
2025-04-02 Interpreting Emergent Planning in Model-Free Reinforcement Learning Thomas Bush et.al. 2504.01871 null
2025-04-02 PaperBench: Evaluating AI’s Ability to Replicate AI Research Giulio Starace et.al. 2504.01848 link
2025-04-02 A Randomized Zeroth-Order Hierarchical Framework for Heterogeneous Federated Learning Yuyang Qiu et.al. 2504.01839 null
2025-04-02 Budget-Feasible Contracts Michal Feldman et.al. 2504.01773 null
2025-04-03 Beyond Non-Expert Demonstrations: Outcome-Driven Action Constraint for Offline Reinforcement Learning Ke Jiang et.al. 2504.01719 null
2025-04-02 Reasoning LLMs for User-Aware Multimodal Conversational Agents Hamed Rahimi et.al. 2504.01700 null
2025-03-31 RIG: Synergizing Reasoning and Imagination in End-to-End Generalist Policy Zhonghan Zhao et.al. 2503.24388 null
2025-03-31 Coordinating Distributed Energy Resources with Nodal Pricing in Distribution Networks: a Game-Theoretic Approach Eli Brock et.al. 2503.24342 null
2025-03-31 Fair Dynamic Spectrum Access via Fully Decentralized Multi-Agent Reinforcement Learning Yubo Zhang et.al. 2503.24296 null
2025-03-31 Value of Information-based Deceptive Path Planning Under Adversarial Interventions Wesley A. Suttle et.al. 2503.24284 null
2025-03-31 MaintainCoder: Maintainable Code Generation Under Dynamic Requirements Zhengren Wang et.al. 2503.24260 link
2025-03-31 PAARS: Persona Aligned Agentic Retail Shoppers Saab Mansour et.al. 2503.24228 null
2025-03-31 Agent-Based Simulations of Online Political Discussions: A Case Study on Elections in Germany Abdul Sittar et.al. 2503.24199 null
2025-03-31 Output Constraints as Attack Surface: Exploiting Structured Generation to Bypass LLM Safety Mechanisms Shuoming Zhang et.al. 2503.24191 null
2025-03-31 Navi-plus: Managing Ambiguous GUI Navigation Tasks with Follow-up Ziming Cheng et.al. 2503.24180 null
2025-03-31 Reinforcement Learning for Safe Autonomous Two Device Navigation of Cerebral Vessels in Mechanical Thrombectomy Harry Robertshaw et.al. 2503.24140 null
2025-03-28 Self-Evolving Multi-Agent Simulations for Realistic Clinical Interactions Mohammad Almansoori et.al. 2503.22678 null
2025-03-28 ActionStudio: A Lightweight Framework for Data and Training of Action Models Jianguo Zhang et.al. 2503.22673 link
2025-03-28 On the Mistaken Assumption of Interchangeable Deep Reinforcement Learning Implementations Rajdeep Singh Hundal et.al. 2503.22575 null
2025-03-28 SafeCast: Risk-Responsive Motion Forecasting for Autonomous Vehicles Haicheng Liao et.al. 2503.22541 null
2025-03-28 Unlocking LLM Repair Capabilities in Low-Resource Programming Languages Through Cross-Language Translation and Multi-Agent Refinement Wenqiang Luo et.al. 2503.22512 null
2025-03-28 Scenario Dreamer: Vectorized Latent Diffusion for Generating Driving Simulation Environments Luke Rowe et.al. 2503.22496 null
2025-03-28 WorkTeam: Constructing Workflows from Natural Language with Multi-Agents Hanchao Liu et.al. 2503.22473 null
2025-03-28 Evaluating LLM-based Agents for Multi-Turn Conversations: A Survey Shengyue Guan et.al. 2503.22458 null
2025-03-28 Scaling Laws of Scientific Discovery with AI and Robot Scientists Pengsong Zhang et.al. 2503.22444 null
2025-03-28 CoSIL: Software Issue Localization via LLM-Driven Code Repository Graph Searching Zhonghao Jiang et.al. 2503.22424 link
2025-03-27 MemInsight: Autonomous Memory Augmentation for LLM Agents Rana Salama et.al. 2503.21760 null
2025-03-27 GateLens: A Reasoning-Enhanced LLM Agent for Automotive Software Release Analytics Arsham Gholamzadeh Khoee et.al. 2503.21735 null
2025-03-27 Collab: Controlled Decoding using Mixture of Agents for LLM Alignment Souradip Chakraborty et.al. 2503.21720 null
2025-03-27 A tale of two goals: leveraging sequentiality in multi-goal scenarios Olivier Serris et.al. 2503.21677 null
2025-03-27 Cognitive Science-Inspired Evaluation of Core Capabilities for Object Understanding in AI Danaja Rutar et.al. 2503.21668 null
2025-03-27 UI-R1: Enhancing Action Prediction of GUI Agents by Reinforcement Learning Zhengxi Lu et.al. 2503.21620 link
2025-03-27 A Measure Based Generalizable Approach to Understandability Vikas Kushwaha et.al. 2503.21615 null
2025-03-27 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond Xiaoye Qu et.al. 2503.21614 link
2025-03-27 A Deep Reinforcement Learning-based Approach for Adaptive Handover Protocols Johannes Voigt et.al. 2503.21601 null
2025-03-27 debug-gym: A Text-Based Environment for Interactive Debugging Xingdi Yuan et.al. 2503.21557 null
2025-03-26 Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields Shijie Zhou et.al. 2503.20776 null
2025-03-26 Welfare and Cost Aggregation for Multi-Agent Control: When to Choose Which Social Cost Function, and Why? Ilia Shilov et.al. 2503.20772 null
2025-03-27 Beyond Believability: Accurate Human Behavior Simulation with Fine-Tuned LLMs Yuxuan Lu et.al. 2503.20749 null
2025-03-26 Prospect for measuring work statistics in quantum coherent systems Cheolhee Han et.al. 2503.20729 null
2025-03-26 Convergence Theory of Flexible ALADIN for Distributed Optimization Xu Du et.al. 2503.20716 null
2025-03-26 Graph-Enhanced Model-Free Reinforcement Learning Agents for Efficient Power Grid Topological Control Eloy Anguiano Batanero et.al. 2503.20688 null
2025-03-27 Flip Learning: Weakly Supervised Erase to Segment Nodules in Breast Ultrasound Yuhao Huang et.al. 2503.20685 null
2025-03-26 TAMA: A Human-AI Collaborative Thematic Analysis Framework Using Multi-Agent LLMs for Clinical Interviews Huimin Xu et.al. 2503.20666 null
2025-03-26 Agent-Based Analysis of the Impact of Near Real-Time Data and Smart Balancing on the Frequency Stability of Power Systems Johannes Lips et.al. 2503.20665 null
2025-03-26 State-Aware Perturbation Optimization for Robust Deep Reinforcement Learning Zongyuan Zhang et.al. 2503.20613 null
2025-03-25 Energetic advantages for quantum agents in online execution of complex strategies Jayne Thompson et.al. 2503.19896 null
2025-03-25 A Multi-Agent Framework Integrating Large Language Models and Generative AI for Accelerated Metamaterial Design Jie Tian et.al. 2503.19889 null
2025-03-25 Collaborative Satisfaction of Long-Term Spatial Constraints in Multi-Agent Systems: A Distributed Optimization Approach (extended version) Farhad Mehdifar et.al. 2503.19879 null
2025-03-25 Towards Online Multi-Modal Social Interaction Understanding Xinpeng Li et.al. 2503.19851 link
2025-03-25 FALCONEye: Finding Answers and Localizing Content in ONE-hour-long videos with multi-modal LLMs Carlos Plou et.al. 2503.19850 null
2025-03-25 Thinking agents for zero-shot generalization to qualitatively novel tasks Thomas Miconi et.al. 2503.19815 null
2025-03-25 Simulating Tracking Data to Advance Sports Analytics Research David Radke et.al. 2503.19809 link
2025-03-25 Inducing Personality in LLM-Based Honeypot Agents: Measuring the Effect on Human-Like Agenda Generation Lewis Newsham et.al. 2503.19752 null
2025-03-25 Writing as a testbed for open ended agents Sian Gooding et.al. 2503.19711 null
2025-03-25 Optimal Path Planning and Cost Minimization for a Drone Delivery System Via Model Predictive Control Muhammad Al-Zafar Khan et.al. 2503.19699 null
2025-03-24 AdaWorld: Learning Adaptable World Models with Latent Actions Shenyuan Gao et.al. 2503.18938 link
2025-03-24 AgentDropout: Dynamic Agent Elimination for Token-Efficient and High-Performance LLM-Based Multi-Agent Collaboration Zhexuan Wang et.al. 2503.18891 link
2025-03-24 Dynamics of Insect Paraintelligence: How a Mindless Colony of Ants Meaningfully Moves a Beetle Eldar Knar et.al. 2503.18858 null
2025-03-24 Self-Organizing Graph Reasoning Evolves into a Critical State for Continuous Discovery Through Structural-Semantic Dynamics Markus J. Buehler et.al. 2503.18852 null
2025-03-24 EconEvals: Benchmarks and Litmus Tests for LLM Agents in Unknown Environments Sara Fish et.al. 2503.18825 null
2025-03-24 Faster Heat Transfer Clarifies the Unexpected Twist in the Simultaneous Freezing of Hot versus Cold Water James D. Brownridge et.al. 2503.18820 null
2025-03-24 Learning Multi-Robot Coordination through Locality-Based Factorized Multi-Agent Actor-Critic Algorithm Chak Lam Shek et.al. 2503.18816 null
2025-03-24 Defeating Prompt Injections by Design Edoardo Debenedetti et.al. 2503.18813 null
2025-03-24 Simulation-Driven Balancing of Competitive Game Levels with Reinforcement Learning Florian Rupp et.al. 2503.18748 link
2025-03-24 Unsupervised Acquisition of Discrete Grammatical Categories David Ph. Shakouri et.al. 2503.18702 null
2025-03-21 HCAST: Human-Calibrated Autonomy Software Tasks David Rein et.al. 2503.17354 link
2025-03-21 CVE-Bench: A Benchmark for AI Agents’ Ability to Exploit Real-World Web Application Vulnerabilities Yuxuan Zhu et.al. 2503.17332 link
2025-03-21 LLM+MAP: Bimanual Robot Task Planning using Large Language Models and Planning Domain Definition Language Kun Chu et.al. 2503.17309 link
2025-03-21 Exploring the Temporal Dynamics of Facial Mimicry in Emotion Processing Using Action Units Meisam Jamshidi Seikavandi et.al. 2503.17306 null
2025-03-21 Coarsening in the Persistent Voter Model: analytical results R. G. de Almeida et.al. 2503.17295 null
2025-03-21 Curriculum RL meets Monte Carlo Planning: Optimization of a Real World Container Management Problem Abhijeet Pendyala et.al. 2503.17194 link
2025-03-21 Which2comm: An Efficient Collaborative Perception Framework for 3D Object Detection Duanrui Yu et.al. 2503.17175 null
2025-03-21 Leveraging Language Models for Out-of-Distribution Recovery in Reinforcement Learning Chan Kim et.al. 2503.17125 null
2025-03-21 Deterministic AI Agent Personality Expression through Standard Psychological Diagnostics J. M. Diederik Kruijssen et.al. 2503.17085 null
2025-03-21 Replay4NCL: An Efficient Memory Replay-based Methodology for Neuromorphic Continual Learning in Embedded AI Systems Mishal Fatima Minhas et.al. 2503.17061 null
2025-03-20 Survey on Evaluation of LLM-based Agents Asaf Yehudai et.al. 2503.16416 null
2025-03-20 Computing Lindahl Equilibrium for Public Goods with and without Funding Caps Christian Kroer et.al. 2503.16414 null
2025-03-20 RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints Yiran Qin et.al. 2503.16408 null
2025-03-20 Do Visual Imaginations Improve Vision-and-Language Navigation Agents? Akhil Perincherry et.al. 2503.16394 null
2025-03-20 JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse Muyao Li et.al. 2503.16365 null
2025-03-20 Issue2Test: Generating Reproducing Test Cases from Issue Reports Noor Nashid et.al. 2503.16320 null
2025-03-20 Characterizing the Convergence of Game Dynamics via Potentialness Martin Bichler et.al. 2503.16285 link
2025-03-20 Binary-Report Peer Prediction for Real-Valued Signal Spaces Rafael Frongillo et.al. 2503.16280 null
2025-03-20 AI Agents in Cryptoland: Practical Attacks and No Silver Bullet Atharv Singh Patlan et.al. 2503.16248 null
2025-03-20 Dispersion is (Almost) Optimal under (A)synchrony Ajay D. Kshemkalyani et.al. 2503.16216 null
2025-03-19 More Information is Not Always Better: Connections between Zero-Sum Local Nash Equilibria in Feedback and Open-Loop Information Patterns Kushagra Gupta et.al. 2503.15486 null
2025-03-19 SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks Yifei Zhou et.al. 2503.15478 link
2025-03-19 Energy-efficient Merging of Connected and Automated Vehicles using Control Barrier Functions Shreshta Rajakumar Deshpande et.al. 2503.15379 null
2025-03-19 Lyapunov-Based Graph Neural Networks for Adaptive Control of Multi-Agent Systems Brandon C. Fallin et.al. 2503.15360 null
2025-03-19 MAMM-Refine: A Recipe for Improving Faithfulness in Generation with Multi-Agent Collaboration David Wan et.al. 2503.15272 null
2025-03-19 Exploring Large Language Models for Word Games:Who is the Spy? Chentian Wei et.al. 2503.15235 link
2025-03-19 A Personalized Data-Driven Generative Model of Human Motion Angelo Di Porzio et.al. 2503.15225 null
2025-03-19 When Pigs Get Sick: Multi-Agent AI for Swine Disease Detection Tittaya Mairittha et.al. 2503.15204 null
2025-03-19 Learning Topology Actions for Power Grid Control: A Graph-Based Soft-Label Imitation Learning Approach Mohamed Hassouna et.al. 2503.15190 null
2025-03-19 Role-Selection Game in Block Production under Proposer-Builder Separation Yanzhen Li et.al. 2503.15184 null
2025-03-18 Gricean Norms as a Basis for Effective Collaboration Fardin Saad et.al. 2503.14484 link
2025-03-18 Don’t lie to your friends: Learning what you know from collaborative self-play Jacob Eisenstein et.al. 2503.14481 null
2025-03-18 EnvBench: A Benchmark for Automated Environment Setup Aleksandra Eliseeva et.al. 2503.14443 link
2025-03-18 PLAY2PROMPT: Zero-shot Tool Instruction Optimization for LLM Agents via Tool Play Wei Fang et.al. 2503.14432 null
2025-03-18 Decentralized RISE-based Control for Exponential Heterogeneous Multi-Agent Target Tracking of Second-Order Nonlinear Systems Cristian F. Nino et.al. 2503.14418 null
2025-03-18 Large Language Models for Virtual Human Gesture Selection Parisa Ghanad Torshizi et.al. 2503.14408 null
2025-03-18 Unified Analysis of Decentralized Gradient Descent: a Contraction Mapping Framework Erik G. Larsson et.al. 2503.14353 null
2025-03-18 MANTRA: Enhancing Automated Method-Level Refactoring with Contextual RAG and Multi-Agent LLM Collaboration Yisen Xu et.al. 2503.14340 null
2025-03-18 DARS: Dynamic Action Re-Sampling to Enhance Coding Agent Performance by Adaptive Tree Traversal Vaibhav Aggarwal et.al. 2503.14269 link
2025-03-18 Conversational Agents as Catalysts for Critical Thinking: Challenging Social Influence in Group Decision-making Soohwan Lee et.al. 2503.14263 null
2025-03-17 VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning Ye Liu et.al. 2503.13444 link
2025-03-17 A Comprehensive Survey on Multi-Agent Cooperative Decision-Making: Scenarios, Approaches, Challenges and Perspectives Weiqiang Jin et.al. 2503.13415 null
2025-03-17 Reward Adaptation Via Q-Manipulation Kevin Vora et.al. 2503.13414 null
2025-03-17 Toward Generative 6G Simulation: An Experimental Multi-Agent LLM and ns-3 Integration Farhad Rezazadeh et.al. 2503.13402 null
2025-03-17 MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research James Burgess et.al. 2503.13399 link
2025-03-17 Mixtures of ensembles: System separation and identification via optimal transport Filip Elvander et.al. 2503.13362 null
2025-03-17 Optimal intrinsic formation using exogenous systems Yueyue Xu et.al. 2503.13359 null
2025-03-17 Agents Play Thousands of 3D Video Games Zhongwen Xu et.al. 2503.13356 null
2025-03-17 Goal2Story: A Multi-Agent Fleet based on Privately Enabled sLLMs for Impacting Mapping on Requirements Elicitation Xinkai Zou et.al. 2503.13279 null
2025-03-17 Knowledge-Aware Iterative Retrieval for Multi-Agent Systems Seyoung Song et.al. 2503.13275 null
2025-03-14 Scaling the Automated Discovery of Quantum Circuits via Reinforcement Learning with Gadgets Jan Olle et.al. 2503.11638 null
2025-03-14 Essentials of the kinetic theory of multi-agent systems Nadia Loy et.al. 2503.11554 null
2025-03-14 Multi-robot coordination for connectivity recovery after unpredictable environment changes Yaroslav Marchukov et.al. 2503.11520 null
2025-03-14 Prompt Injection Detection and Mitigation via AI Multi-Agent NLP Frameworks Diego Gosmar et.al. 2503.11517 link
2025-03-14 Multi-agent coordination for on-demand data gathering with periodic information upload Yaroslav Marchukov et.al. 2503.11504 null
2025-03-14 Unicorn: A Universal and Collaborative Reinforcement Learning Approach Towards Generalizable Network-Wide Traffic Signal Control Yifeng Zhang et.al. 2503.11488 null
2025-03-14 Research Vision: Multi-Agent Path Planning for Cops And Robbers Via Reactive Synthesis William Fishell et.al. 2503.11475 null
2025-03-14 Dynamic Obstacle Avoidance with Bounded Rationality Adversarial Reinforcement Learning Jose-Luis Holgado-Alvarez et.al. 2503.11467 null
2025-03-14 Deep Learning Agents Trained For Avoidance Behave Like Hawks And Doves Aryaman Reddi et.al. 2503.11452 link
2025-03-14 Cerebrum (AIOS SDK): A Platform for Agent Development, Deployment, Distribution, and Discovery Balaji Rama et.al. 2503.11444 link
2025-03-13 UniGoal: Towards Universal Zero-shot Goal-oriented Navigation Hang Yin et.al. 2503.10630 null
2025-03-13 Uncertainty in Action: Confidence Elicitation in Embodied Agents Tianjiao Yu et.al. 2503.10628 null
2025-03-13 CoSTA $\ast$ : Cost-Sensitive Toolpath Agent for Multi-turn Image Editing Advait Gupta et.al. 2503.10613 link
2025-03-13 GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding Rui Hu et.al. 2503.10596 link
2025-03-13 The Lagrangian Method for Solving Constrained Markov Games Soham Das et.al. 2503.10561 null
2025-03-13 A large multi-agent system with noise both in position and control Giuseppe D’Onofrio et.al. 2503.10543 null
2025-03-13 Fair allocations with subadditive and XOS valuations Uriel Feige et.al. 2503.10513 null
2025-03-13 SySLLM: Generating Synthesized Policy Summaries for Reinforcement Learning Agents Using Large Language Models Sahar Admoni et.al. 2503.10509 null
2025-03-13 SortingEnv: An Extendable RL-Environment for an Industrial Sorting Process Tom Maus et.al. 2503.10466 null
2025-03-13 Compliant Control of Quadruped Robots for Assistive Load Carrying Nimesh Khandelwal et.al. 2503.10401 null
2025-03-12 Auspex: Building Threat Modeling Tradecraft into an Artificial Intelligence-based Copilot Andrew Crossman et.al. 2503.09586 null
2025-03-12 Plan-and-Act: Improving Planning of Agents for Long-Horizon Tasks Lutfi Eren Erdogan et.al. 2503.09572 null
2025-03-12 The turnpike control in stochastic multi-agent dynamics: a discrete-time approach with exponential integrators Fabio Cassini et.al. 2503.09549 null
2025-03-13 Large Language Models for Multi-Facility Location Mechanism Design Nguyen Thach et.al. 2503.09533 null
2025-03-12 PairVDN - Pair-wise Decomposed Value Functions Zak Buzzard et.al. 2503.09521 link
2025-03-12 RESTRAIN: Reinforcement Learning-Based Secure Framework for Trigger-Action IoT Environment Md Morshed Alam et.al. 2503.09513 null
2025-03-12 TRACE: Real-Time Multimodal Common Ground Tracking in Situated Collaborative Dialogues Hannah VanderHoeven et.al. 2503.09511 null
2025-03-12 ReMA: Learning to Meta-think for LLMs with Multi-Agent Reinforcement Learning Ziyu Wan et.al. 2503.09501 link
2025-03-12 SurgicalVLM-Agent: Towards an Interactive AI Co-Pilot for Pituitary Surgery Jiayuan Huang et.al. 2503.09474 null
2025-03-12 Hybrid Rendering for Multimodal Autonomous Driving: Merging Neural and Physics-Based Simulation Máté Tóth et.al. 2503.09464 null
2025-03-11 CoLMDriver: LLM-based Negotiation Benefits Cooperative Autonomous Driving Changxing Liu et.al. 2503.08683 link
2025-03-11 AgentOrca: A Dual-System Framework to Evaluate Language Agents on Operational Routine and Constraint Adherence Zekun Li et.al. 2503.08669 null
2025-03-11 EMMOE: A Comprehensive Benchmark for Embodied Mobile Manipulation in Open Environments Dongping Li et.al. 2503.08604 link
2025-03-11 GTR: Guided Thought Reinforcement Prevents Thought Collapse in RL-based VLM Agent Training Tong Wei et.al. 2503.08525 null
2025-03-11 ReviewAgents: Bridging the Gap Between Human and AI-Generated Paper Reviews Xian Gao et.al. 2503.08506 null
2025-03-11 Existence of Optimal Contracts for Principal-Agent Problem with Drift Control and Quadratic Effort Cost Xinfu Chen et.al. 2503.08503 null
2025-03-11 Hierarchical Multi Agent DRL for Soft Handovers Between Edge Clouds in Open RAN F. Giarrè et.al. 2503.08493 null
2025-03-11 Hybrid Deep Reinforcement Learning for Radio Tracer Localisation in Robotic-assisted Radioguided Surgery Hanyi Zhang et.al. 2503.08492 null
2025-03-11 Collaborative Dynamic 3D Scene Graphs for Open-Vocabulary Urban Scene Understanding Tim Steinke et.al. 2503.08474 null
2025-03-12 An Autonomous RL Agent Methodology for Dynamic Web UI Testing in a BDD Framework Ali Hassaan Mughal et.al. 2503.08464 null
2025-03-10 MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning Xiangru Tang et.al. 2503.07459 link
2025-03-10 LLMs syntactically adapt their language use to their conversational partner Florian Kandra et.al. 2503.07457 null
2025-03-10 Towards Safe Robot Foundation Models Maximilian Tölle et.al. 2503.07404 null
2025-03-10 Q-MARL: A quantum-inspired algorithm using neural message passing for large-scale multi-agent reinforcement learning Kha Vo et.al. 2503.07397 null
2025-03-10 AttentionSwarm: Reinforcement Learning with Attention Control Barier Function for Crazyflie Drones in Dynamic Environments Grik Tadevosyan et.al. 2503.07376 null
2025-03-10 Artificial Utopia: Simulation and Intelligent Agents for a Democratised Future Yannick Oswald et.al. 2503.07364 null
2025-03-10 Temporal Triplane Transformers as Occupancy World Models Haoran Xu et.al. 2503.07338 null
2025-03-10 Dynamic Path Navigation for Motion Agents with LLM Reasoning Yubo Zhao et.al. 2503.07323 null
2025-03-10 Experimental Exploration: Investigating Cooperative Interaction Behavior Between Humans and Large Language Model Agents Guanxuan Jiang et.al. 2503.07320 null
2025-03-10 Automated Movie Generation via Multi-Agent CoT Planning Weijia Wu et.al. 2503.07314 link
2025-03-07 On Almost Fair and Equitable Allocations of Indivisible Items for Non-monotone Valuations Vittorio Bilò et.al. 2503.05695 null
2025-03-07 A Survey of Large Language Model Empowered Agents for Recommendation and Search: Towards Next-Generation Information Retrieval Yu Zhang et.al. 2503.05659 link
2025-03-07 Symbolic Mixture-of-Experts: Adaptive Skill-based Routing for Heterogeneous Reasoning Justin Chih-Yao Chen et.al. 2503.05641 null
2025-03-07 InDRiVE: Intrinsic Disagreement based Reinforcement for Vehicle Exploration through Curiosity Driven Generalized World Model Feeza Khan Khanzada et.al. 2503.05573 null
2025-03-07 Tractable Representations for Convergent Approximation of Distributional HJB Equations Julie Alhosh et.al. 2503.05563 null
2025-03-07 ALMAGAL I. The ALMA evolutionary study of high-mass protocluster formation in the Galaxy. Presentation of the survey and early results S. Molinari et.al. 2503.05555 null
2025-03-07 Impoola: The Power of Average Pooling for Image-Based Deep Reinforcement Learning Raphael Trumpp et.al. 2503.05546 null
2025-03-07 The Society of HiveMind: Multi-Agent Optimization of Foundation Model Swarms to Unlock the Potential of Collective Intelligence Noah Mamie et.al. 2503.05473 null
2025-03-07 Game Theory in Formula 1: Multi-agent Physical and Strategical Interactions Giona Fienia et.al. 2503.05421 null
2025-03-07 First-passage-time statistics of active Brownian particles: A perturbative approach Yanis Baouche et.al. 2503.05401 null
2025-03-06 The Influence of Prior Discourse on Conversational Agent-Driven Decision-Making Stephen Pilli et.al. 2503.04692 null
2025-03-06 Quantifying the Reasoning Abilities of LLMs on Real-world Clinical Cases Pengcheng Qiu et.al. 2503.04691 null
2025-03-06 Multi-Agent Inverse Q-Learning from Demonstrations Nathaniel Haynam et.al. 2503.04679 null
2025-03-06 Data-Driven Distributed Optimization via Aggregative Tracking and Deep-Learning Riccardo Brumali et.al. 2503.04668 null
2025-03-06 Assessing the performance of compartmental and renewal models for learning $R_{t}$ using spatially heterogeneous epidemic simulations on real geographies Matthew Ghosh et.al. 2503.04648 null
2025-03-06 SurveyForge: On the Outline Heuristics, Memory-Driven Generation, and Multi-dimensional Evaluation for Automated Survey Writing Xiangchao Yan et.al. 2503.04629 link
2025-03-06 The Next Frontier of LLM Applications: Open Ecosystems and Hardware Synergy Xinyi Hou et.al. 2503.04596 null
2025-03-06 Advancing Solutions for the Three-Body Problem Through Physics-Informed Neural Networks Manuel Santos Pereira et.al. 2503.04585 null
2025-03-06 ToolFuzz – Automated Agent Tool Testing Ivan Milev et.al. 2503.04479 null
2025-03-06 From Idea to CAD: A Language Model-Driven Multi-Agent System for Collaborative Design Felix Ocker et.al. 2503.04417 null
2025-03-05 The MASK Benchmark: Disentangling Honesty From Accuracy in AI Systems Richard Ren et.al. 2503.03750 null
2025-03-05 CHOP: Mobile Operating Assistant with Constrained High-frequency Optimized Subtask Planning Yuqi Zhou et.al. 2503.03743 link
2025-03-05 A Practical Memory Injection Attack against LLM Agents Shen Dong et.al. 2503.03704 null
2025-03-05 MAS-GPT: Training LLMs to Build LLM-based Multi-Agent Systems Rui Ye et.al. 2503.03686 null
2025-03-05 Optimally Installing Strict Equilibria Jeremy McMahan et.al. 2503.03676 null
2025-03-05 Attentive Reasoning Queries: A Systematic Method for Optimizing Instruction-Following in Large Language Models Bar Karov et.al. 2503.03669 link
2025-03-05 A Generative Approach to High Fidelity 3D Reconstruction from Text Data Venkat Kumar R et.al. 2503.03664 null
2025-03-05 Motion Planning and Control with Unknown Nonlinear Dynamics through Predicted Reachability Zhiquan Zhang et.al. 2503.03633 null
2025-03-05 TeraSim: Uncovering Unknown Unsafe Events for Autonomous Vehicles through Generative Simulation Haowei Sun et.al. 2503.03629 link
2025-03-05 Benchmarking LLMs and LLM-based Agents in Practical Vulnerability Detection for Code Repositories Alperen Yildiz et.al. 2503.03586 null
2025-03-04 MuBlE: MuJoCo and Blender simulation Environment and Benchmark for Task Planning in Robot Manipulation Michal Nazarczuk et.al. 2503.02834 link
2025-03-04 Meta-Learning to Explore via Memory Density Feedback Kevin L. McKee et.al. 2503.02831 null
2025-03-04 Do Not Trust Licenses You See – Dataset Compliance Requires Massive-Scale AI-Powered Lifecycle Tracing Jaekyeom Kim et.al. 2503.02784 null
2025-03-04 Quantitative Resilience Modeling for Autonomous Cyber Defense Xavier Cadet et.al. 2503.02780 null
2025-03-04 From Metaphor to Mechanism: How LLMs Decode Traditional Chinese Medicine Symbolic Language for Modern Clinical Relevance Jiacheng Tang et.al. 2503.02760 null
2025-03-04 Consumption-portfolio choice with preferences for liquid assets Guohui Guan et.al. 2503.02697 null
2025-03-04 Federated Learning for Privacy-Preserving Feedforward Control in Multi-Agent Systems Jakob Weber et.al. 2503.02693 link
2025-03-04 FinArena: A Human-Agent Collaboration Framework for Financial Market Analysis and Forecasting Congluo Xu et.al. 2503.02692 null
2025-03-04 MPO: Boosting LLM Agents with Meta Plan Optimization Weimin Xiong et.al. 2503.02682 link
2025-03-04 Unique existence of solution and Hyers-Ulam stability for a new fractional differential quasi-variational inequality with Mittag-Leffler kernel and its applications Zeng-bao Wu et.al. 2503.02669 null
2025-02-28 Hybrid Team Tetris: A New Platform For Hybrid Multi-Agent, Multi-Human Teaming Kaleb Mcdowell et.al. 2502.21300 null
2025-02-28 Persuasion Should be Double-Blind: A Multi-Domain Dialogue Dataset With Faithfulness Based on Causal Theory of Mind Dingyi Zhang et.al. 2502.21297 null
2025-02-28 ReaLJam: Real-Time Human-AI Music Jamming with Reinforcement Learning-Tuned Transformers Alexander Scarlatos et.al. 2502.21267 null
2025-02-28 Towards Developing Ethical Reasoners: Integrating Probabilistic Reasoning and Decision-Making for Complex AI Systems Nijesh Upreti et.al. 2502.21250 null
2025-02-28 A Method of Selective Attention for Reservoir Based Agents Kevin McKee et.al. 2502.21229 null
2025-02-28 ARIES: Autonomous Reasoning with LLMs on Interactive Thought Graph Environments Pedro Gimenes et.al. 2502.21208 null
2025-03-03 Scalable Decision-Making in Stochastic Environments through Learned Temporal Abstraction Baiting Luo et.al. 2502.21186 link
2025-02-28 Reducing Reward Dependence in RL Through Adaptive Confidence Discounting Muhammed Yusuf Satici et.al. 2502.21181 null
2025-02-28 Autonomous Curriculum Design via Relative Entropy Based Task Modifications Muhammed Yusuf Satici et.al. 2502.21166 null
2025-02-28 Cryptis: Cryptographic Reasoning in Separation Logic Arthur Azevedo de Amorim et.al. 2502.21156 null
2025-02-27 Point Policy: Unifying Observations and Actions with Key Points for Robot Manipulation Siddhant Haldar et.al. 2502.20391 link
2025-02-27 Why Are Web AI Agents More Vulnerable Than Standalone LLMs? A Security Analysis Jeffrey Yang Fan Chiang et.al. 2502.20383 null
2025-02-27 Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers Shalev Lifshitz et.al. 2502.20379 null
2025-02-27 Multi-Agent Path Planning in Complex Environments using Gaussian Belief Propagation with Global Path Finding Jens Høigaard Jensen et.al. 2502.20369 link
2025-02-27 Bridging Legal Knowledge and AI: Retrieval-Augmented Generation with Vector Stores, Knowledge Graphs, and Hierarchical Non-negative Matrix Factorization Ryan C. Barron et.al. 2502.20364 link
2025-02-27 Trajectory-to-Action Pipeline (TAP): Automated Scenario Description Extraction for Autonomous Vehicle Behavior Comparison Aron Harder et.al. 2502.20353 null
2025-02-27 Improving the Efficiency of a Deep Reinforcement Learning-Based Power Management System for HPC Clusters Using Curriculum Learning Thomas Budiarjo et.al. 2502.20348 null
2025-02-27 Safety Representations for Safer Policy Learning Kaustubh Mani et.al. 2502.20341 null
2025-02-27 Deep Reinforcement Learning based Autonomous Decision-Making for Cooperative UAVs: A Search and Rescue Real World Application Thomas Hickling et.al. 2502.20326 null
2025-02-27 M^3Builder: A Multi-Agent System for Automated Machine Learning in Medical Imaging Jinghao Feng et.al. 2502.20301 null
2025-02-26 Can Language Models Falsify? Evaluating Algorithmic Reasoning with Counterexample Creation Shiven Sinha et.al. 2502.19414 link
2025-02-26 TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding Max Ku et.al. 2502.19400 null
2025-02-26 Hybrid Robot Learning for Automatic Robot Motion Planning in Manufacturing Siddharth Singh et.al. 2502.19340 null
2025-02-26 Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems Hao Peng et.al. 2502.19328 link
2025-02-26 CoopDETR: A Unified Cooperative Perception Framework for 3D Detection via Object Query Zhe Wang et.al. 2502.19313 null
2025-02-26 WOFOSTGym: A Crop Simulator for Learning Annual and Perennial Crop Management Strategies William Solow et.al. 2502.19308 link
2025-02-26 Agent-centric Information Access Evangelos Kanoulas et.al. 2502.19298 null
2025-02-26 CritiQ: Mining Data Quality Criteria from Human Preferences Honglin Guo et.al. 2502.19279 null
2025-02-26 EMT: A Visual Multi-Task Benchmark Dataset for Autonomous Driving in the Arab Gulf Region Nadya Abdel Madjid et.al. 2502.19260 link
2025-02-26 ProxyTransformation: Preshaping Point Cloud Manifold With Proxy Attention For 3D Visual Grounding Qihang Peng et.al. 2502.19247 null
2025-02-25 FRIDA to the Rescue! Analyzing Synthetic Data Effectiveness in Object-Based Common Sense Reasoning for Disaster Response Mollie Shichman et.al. 2502.18452 null
2025-02-25 MAPoRL: Multi-Agent Post-Co-Training for Collaborative Large Language Models with Reinforcement Learning Chanwoo Park et.al. 2502.18439 null
2025-02-25 ToMCAT: Theory-of-Mind for Cooperative Agents in Teams via Multiagent Diffusion Policies Pedro Sequeira et.al. 2502.18438 null
2025-02-25 CRESSim-MPM: A Material Point Method Library for Surgical Soft Body Simulation with Cutting and Suturing Yafei Ou et.al. 2502.18437 null
2025-02-25 AgentRM: Enhancing Agent Generalization with Reward Modeling Yu Xia et.al. 2502.18407 null
2025-02-25 Responsible AI Agents Deven R. Desai et.al. 2502.18359 null
2025-02-25 WebGames: Challenging General-Purpose Web-Browsing AI Agents George Thomas et.al. 2502.18356 link
2025-02-25 RefuteBench 2.0 – Agentic Benchmark for Dynamic Evaluation of LLM Responses to Refutation Instruction Jianhao Yan et.al. 2502.18308 null
2025-02-25 Smart and Efficient IoT-Based Irrigation System Design: Utilizing a Hybrid Agent-Based and System Dynamics Approach Taha Ahmadi Pargo et.al. 2502.18298 null
2025-02-25 A Competitive Posted-Price Mechanism for Online Budget-Feasible Auctions Andreas Charalampopoulos et.al. 2502.18265 null
2025-02-24 Event-Based Limit Order Book Simulation under a Neural Hawkes Process: Application in Market-Making Luca Lalor et.al. 2502.17417 null
2025-02-24 Distributed Coordination for Heterogeneous Non-Terrestrial Networks Jikang Deng et.al. 2502.17366 null
2025-02-24 Turning Conversations into Workflows: A Framework to Extract and Evaluate Dialog Workflows for Service AI Agents Prafulla Kumar Choubey et.al. 2502.17321 null
2025-02-24 Survey on Strategic Mining in Blockchain: A Reinforcement Learning Approach Jichen Li et.al. 2502.17307 null
2025-02-24 IGDA: Interactive Graph Discovery through Large Language Model Agents Alex Havrilla et.al. 2502.17189 null
2025-02-24 Teleology-Driven Affective Computing: A Causal Framework for Sustained Well-Being Bin Yin et.al. 2502.17172 null
2025-02-24 A Novel Multiple Access Scheme for Heterogeneous Wireless Communications using Symmetry-aware Continual Deep Reinforcement Learning Hamidreza Mazandarani et.al. 2502.17167 null
2025-02-24 Semantic-Aware Dynamic and Distributed Power Allocation: a Multi-UAV Area Coverage Use Case Hamidreza Mazandarani et.al. 2502.17120 null
2025-02-24 Mobile-Agent-V: Learning Mobile Device Operation Through Video-Guided Multi-Agent Collaboration Junyang Wang et.al. 2502.17110 null
2025-02-24 Generative Models in Decision Making: A Survey Yinchuan Li et.al. 2502.17100 null
2025-02-21 AutoToM: Automated Bayesian Inverse Planning and Model Discovery for Open-ended Theory of Mind Zhining Zhang et.al. 2502.15676 link
2025-02-21 Multi-Agent Architecture in Distributed Environment Control Systems: vision, challenges, and opportunities Natasha Astudillo et.al. 2502.15663 null
2025-02-21 Automating Curriculum Learning for Reinforcement Learning using a Skill-Based Bayesian Network Vincent Hsiao et.al. 2502.15662 null
2025-02-21 Superintelligent Agents Pose Catastrophic Risks: Can Scientist AI Offer a Safer Path? Yoshua Bengio et.al. 2502.15657 null
2025-02-21 A Simulation Pipeline to Facilitate Real-World Robotic Reinforcement Learning Applications Jefferson Silveira et.al. 2502.15649 null
2025-02-21 WorldCraft: Photo-Realistic 3D World Creation and Customization via LLM Agents Xinhang Liu et.al. 2502.15601 null
2025-02-21 SOTOPIA-Ω: Dynamic Strategy Injection Learning and Social Instrucion Following Evaluation for Social Agents Wenyuan Zhang et.al. 2502.15538 link
2025-02-21 Contract DesignUnderApproximate Best Responses Francesco Bacchiocchi et.al. 2502.15523 null
2025-02-21 SALSA-RL: Stability Analysis in the Latent Space of Actions for Reinforcement Learning Xuyang Li et.al. 2502.15512 null
2025-02-21 Construction and Evaluation of LLM-based agents for Semi-Autonomous penetration testing Masaya Kobayashi et.al. 2502.15506 null
2025-02-20 GATE: Graph-based Adaptive Tool Evolution Across Diverse Tasks Jianwen Luo et.al. 2502.14848 link
2025-02-20 Red-Teaming LLM Multi-Agent Systems via Communication Attacks Pengfei He et.al. 2502.14847 null
2025-02-20 Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation Yue Yang et.al. 2502.14846 null
2025-02-20 Learning from Reward-Free Offline Data: A Case for Planning with Latent Dynamics Models Vlad Sobal et.al. 2502.14819 null
2025-02-20 Optimizing Model Selection for Compound AI Systems Lingjiao Chen et.al. 2502.14815 link
2025-02-20 Byzantine Game Theory: Sun Tzus Boxes Andrei Constantinescu et.al. 2502.14812 null
2025-02-20 Planning, scheduling, and execution on the Moon: the CADRE technology demonstration mission Gregg Rabideau et.al. 2502.14803 null
2025-02-20 A Multi-Agent Perspective on Modern Information Retrieval Haya Nachimovsky et.al. 2502.14796 null
2025-02-20 Making Universal Policies Universal Niklas Höpner et.al. 2502.14777 link
2025-02-20 Tree-of-Debate: Multi-Persona Debate Trees Elicit Critical Thinking for Scientific Comparative Analysis Priyanka Kargupta et.al. 2502.14767 link
2025-02-19 Autellix: An Efficient Serving Engine for LLM Agents as General Programs Michael Luo et.al. 2502.13965 null
2025-02-19 LIDDIA: Language-based Intelligent Drug Discovery Agent Reza Averly et.al. 2502.13959 null
2025-02-19 RAG-Gym: Optimizing Reasoning and Search Agents with Process Supervision Guangzhi Xiong et.al. 2502.13957 null
2025-02-19 Qwen2.5-VL Technical Report Shuai Bai et.al. 2502.13923 null
2025-02-19 Exploring Personalized Health Support through Data-Driven, Theory-Guided LLMs: A Case Study in Sleep Health Xingbo Wang et.al. 2502.13920 link
2025-02-19 DataSciBench: An LLM Agent Benchmark for Data Science Dan Zhang et.al. 2502.13897 link
2025-02-19 NavigateDiff: Visual Predictors are Zero-Shot Navigation Assistants Yiran Qin et.al. 2502.13894 null
2025-02-19 Enhancing Cross-Domain Recommendations with Memory-Optimized LLM-Based User Agents Jiahao Liu et.al. 2502.13843 link
2025-02-19 ArtMentor: AI-Assisted Evaluation of Artworks to Explore Multimodal Large Language Models Capabilities Chanjin Zheng et.al. 2502.13832 link
2025-02-19 Learning to explore when mistakes are not allowed Charly Pecqueux-Guézénec et.al. 2502.13801 null
2025-02-18 AIDE: AI-Driven Exploration in the Space of Code Zhengyao Jiang et.al. 2502.13138 link
2025-02-18 Sleepless Nights, Sugary Days: Creating Synthetic Users with Health Conditions for Realistic Coaching Agent Interactions Taedong Yun et.al. 2502.13135 null
2025-02-18 Magma: A Foundation Model for Multimodal AI Agents Jianwei Yang et.al. 2502.13130 link
2025-02-18 Facilitating Long Context Understanding via Supervised Chain-of-Thought Reasoning Jingyang Lin et.al. 2502.13127 null
2025-02-18 Approximately Efficient Bilateral Trade with Samples Yuan Deng et.al. 2502.13122 null
2025-02-18 Text2World: Benchmarking Large Language Models for Symbolic World Model Generation Mengkang Hu et.al. 2502.13092 null
2025-02-18 Interactive Agents to Overcome Ambiguity in Software Engineering Sanidhya Vijayvargiya et.al. 2502.13069 link
2025-02-18 Improved Fine-Tuning of Large Multimodal Models for Hateful Meme Detection Jingbiao Mei et.al. 2502.13061 link
2025-02-18 AEIA-MN: Evaluating the Robustness of Multimodal LLM-Powered Mobile Agents Against Active Environmental Injection Attacks Yurun Chen et.al. 2502.13053 null
2025-02-18 Agentic Deep Graph Reasoning Yields Self-Organizing Knowledge Networks Markus J. Buehler et.al. 2502.13025 link
2025-02-17 HARBOR: Exploring Persona Dynamics in Multi-Agent Competition Kenan Jiang et.al. 2502.12149 null
2025-02-17 Scaling Autonomous Agents via Automatic Reward Modeling And Planning Zhenfang Chen et.al. 2502.12130 null
2025-02-17 A-MEM: Agentic Memory for LLM Agents Wujiang Xu et.al. 2502.12110 link
2025-02-17 Relational Norms for Human-AI Cooperation Brian D. Earp et.al. 2502.12102 null
2025-02-17 A Study on Leveraging Search and Self-Feedback for Agent Reasoning Karthikeyan K et.al. 2502.12094 null
2025-02-17 Can LLMs Simulate Social Media Engagement? A Study on Action-Guided Response Generation Zhongyi Qiu et.al. 2502.12073 null
2025-02-17 A survey about perceptions of mobility to inform an agent-based simulator of subjective modal choice Carole Adam et.al. 2502.12058 null
2025-02-17 Multi-agent coordination via communication partitions Wei-Chen Lee et.al. 2502.12042 null
2025-02-17 Machine Learning Should Maximize Welfare, Not (Only) Accuracy Nir Rosenfeld et.al. 2502.11981 null
2025-02-17 FitLight: Federated Imitation Learning for Plug-and-Play Autonomous Traffic Signal Control Yutong Ye et.al. 2502.11937 null
2025-02-14 Representation and Interpretation in Artificial and Natural Computing Luis A. Pineda et.al. 2502.10383 null
2025-02-14 Agentic Verification for Ambiguous Query Disambiguation Youngwon Lee et.al. 2502.10352 null
2025-02-14 Process Reward Models for LLM Agents: Practical Framework and Directions Sanjiban Choudhury et.al. 2502.10325 link
2025-02-14 Reinforcement Learning in Strategy-Based and Atari Games: A Review of Google DeepMinds Innovations Abdelrhman Shaheen et.al. 2502.10303 null
2025-02-14 Large Language Models and Synthetic Data for Monitoring Dataset Mentions in Research Papers Aivin V. Solatorio et.al. 2502.10263 link
2025-02-14 Learning to Solve the Min-Max Mixed-Shelves Picker-Routing Problem via Hierarchical and Parallel Decoding Laurin Luttmann et.al. 2502.10233 link
2025-02-14 A Multiagent Path Search Algorithm for Large-Scale Coalition Structure Generation Redha Taguelmimt et.al. 2502.10226 null
2025-02-14 Do Large Language Models Reason Causally Like Us? Even Better? Hanna M. Dettki et.al. 2502.10215 null
2025-02-14 Dynamic Reinforcement Learning for Actors Katsunari Shibata et.al. 2502.10200 null
2025-02-14 Reinforcement Learning based Constrained Optimal Control: an Interpretable Reward Design Jingjie Ni et.al. 2502.10187 null
2025-02-13 Do LLMs Recognize Your Preferences? Evaluating Personalized Preference Following in LLMs Siyan Zhao et.al. 2502.09597 link
2025-02-13 KIMAs: A Configurable Knowledge Integrated Multi-Agent System Zitao Li et.al. 2502.09596 null
2025-02-13 Rolling Ahead Diffusion for Traffic Scene Simulation Yunpeng Liu et.al. 2502.09587 null
2025-02-13 Learning to Coordinate with Experts Mohamad H. Danesh et.al. 2502.09583 link
2025-02-13 Polymind: Parallel Visual Diagramming with Large Language Models to Support Prewriting Through Microtasks Qian Wan et.al. 2502.09577 null
2025-02-13 MDCrow: Automating Molecular Dynamics Workflows with Large Language Models Quintina Campbell et.al. 2502.09565 link
2025-02-13 EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents Rui Yang et.al. 2502.09560 null
2025-02-13 Mind the Gap! Choice Independence in Using Multilingual LLMs for Persuasive Co-Writing Tasks in Different Languages Shreyan Biswas et.al. 2502.09532 null
2025-02-13 Exact Leader Estimation: A New Approach for Distributed Differentiation Rodrigo Aldana-Lopez et.al. 2502.09529 null
2025-02-13 Forward-backward Contention Resolution Schemes for Fair Rationing Will Ma et.al. 2502.09521 null
2025-02-12 Poly-Autoregressive Prediction for Modeling Interactions Neerja Thakkar et.al. 2502.08646 null
2025-02-12 Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs Mantas Mazeika et.al. 2502.08640 null
2025-02-12 SPeCtrum: A Grounded Framework for Multidimensional Identity Representation in LLM-Based Agent Keyeun Lee et.al. 2502.08599 link
2025-02-12 Learning in Markets with Heterogeneous Agents: Dynamics and Survival of Bayesian vs. No-Regret Learners David Easley et.al. 2502.08597 null
2025-02-12 Commercial LLM Agents Are Already Vulnerable to Simple Yet Dangerous Attacks Ang Li et.al. 2502.08586 null
2025-02-12 Statistically validated projection of bipartite signed networks Anna Gallo et.al. 2502.08567 null
2025-02-12 Human-Centric Foundation Models: Perception, Generation and Agentic Modeling Shixiang Tang et.al. 2502.08556 link
2025-02-12 Extreme vulnerability to intruder attacks destabilizes network dynamics Amirhossein Nazerian et.al. 2502.08552 null
2025-02-12 Faithful, Unfaithful or Ambiguous? Multi-Agent Debate with Initial Stance for Summary Evaluation Mahnaz Koupaee et.al. 2502.08514 link
2025-02-12 Resilient Quantized Consensus in Multi-Hop Relay Networks Liwei Yuan et.al. 2502.08455 null
2025-02-11 MAGELLAN: Metacognitive predictions of learning progress guide autotelic LLM agents in large goal spaces Loris Gaven et.al. 2502.07709 link
2025-02-11 Human Decision-making is Susceptible to AI-driven Manipulation Sahand Sabour et.al. 2502.07663 link
2025-02-11 Robust-Sorting and Applications to Ulam-Median Ragesh Jaiswal et.al. 2502.07653 null
2025-02-11 Distributed Value Decomposition Networks with Networked Agents Guilherme S. Varela et.al. 2502.07635 null
2025-02-11 Decision-Making Under Complete Uncertainty: You Will Regret Not Being Greedy Kristijan Atanasov et.al. 2502.07593 null
2025-02-11 DMWM: Dual-Mind World Model with Long-Term Imagination Lingyi Wang et.al. 2502.07591 null
2025-02-11 Pure $ε$ -equilibrium in random games Bary S. R. Pradelski et.al. 2502.07585 null
2025-02-11 Genetic evolution of a multi-generational population in the context of interstellar space travels – Part II: Phenotypic effects of gene expression Frédéric Marin et.al. 2502.07559 null
2025-02-11 Unsupervised Translation of Emergent Communication Ido Levy et.al. 2502.07552 null
2025-02-11 A Near-optimal, Scalable and Corruption-tolerant Framework for Stochastic Bandits: From Single-Agent to Multi-Agent and Beyond Zicheng Hu et.al. 2502.07514 null
2025-02-10 Visual Agentic AI for Spatial Reasoning with a Dynamic API Damiano Marsili et.al. 2502.06787 null
2025-02-10 Towards Internet-Scale Training For Agents Brandon Trabucco et.al. 2502.06776 null
2025-02-10 Distributed Constraint-Coupled Optimization: Harnessing ADMM-consensus for robustness Mohamed Abdelmouamin Messilem et.al. 2502.06763 null
2025-02-10 Incentivizing Desirable Effort Profiles in Strategic Classification: The Role of Causality and Uncertainty Valia Efthymiou et.al. 2502.06749 null
2025-02-10 Institutional Preferences in the Laboratory Qiankun Zhong et.al. 2502.06748 null
2025-02-10 Wandering around: A bioinspired approach to visual attention through object motion sensitivity Giulia D Angelo et.al. 2502.06747 link
2025-02-10 AgilePilot: DRL-Based Drone Agent for Real-Time Motion Planning in Dynamic Environments by Leveraging Object Detection Roohan Ahmed Khan et.al. 2502.06725 null
2025-02-10 Transfer Your Perspective: Controllable 3D Generation from Any Viewpoint in a Driving Scene Tai-Yu Pan et.al. 2502.06682 null
2025-02-10 Quantile Multi-Armed Bandits with 1-bit Feedback Ivan Lau et.al. 2502.06678 null
2025-02-10 Unbiased Evaluation of Large Language Models from a Causal Perspective Meilin Chen et.al. 2502.06655 null
2025-02-07 Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuray Yunhang Shen et.al. 2502.05177 link
2025-02-07 MELON: Indirect Prompt Injection Defense via Masked Re-execution and Tool Comparison Kaijie Zhu et.al. 2502.05174 link
2025-02-07 From Restless to Contextual: A Thresholding Bandit Approach to Improve Finite-horizon Performance Jiamin Xu et.al. 2502.05145 link
2025-02-07 Maximin Share Guarantees for Few Agents with Subadditive Valuations George Christodoulou et.al. 2502.05141 null
2025-02-07 Joint TITE-CRM for Dual Agent Dose Finding Studies Helen Barnett et.al. 2502.05072 null
2025-02-07 Exploring the Generalizability of Geomagnetic Navigation: A Deep Reinforcement Learning approach with Policy Distillation Wenqi Bai et.al. 2502.05069 null
2025-02-07 nvAgent: Automated Data Visualization from Natural Language via Collaborative Agent Workflow Geliang Ouyang et.al. 2502.05036 link
2025-02-07 Near-Optimal Online Learning for Multi-Agent Submodular Coordination: Tight Approximation and Communication Efficiency Qixin Zhang et.al. 2502.05028 null
2025-02-07 Seasonal Station-Keeping of Short Duration High Altitude Balloons using Deep Reinforcement Learning Tristan K. Schuler et.al. 2502.05014 null
2025-02-07 The Rising Threat to Emerging AI-Powered Search Engines Zeren Luo et.al. 2502.04951 null
2025-02-06 ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization Yinjie Wang et.al. 2502.04306 link
2025-02-06 Mutual Multilinearity of Nonequilibrium Network Currents Sara Dal Cengio et.al. 2502.04298 null
2025-02-06 DECAF: Learning to be Fair in Multi-agent Resource Allocation Ashwin Kumar et.al. 2502.04281 null
2025-02-06 Free Energy Risk Metrics for Systemically Safe AI: Gatekeeping Multi-Agent Study Michael Walters et.al. 2502.04249 null
2025-02-06 Multi-agent Architecture Search via Agentic Supernet Guibin Zhang et.al. 2502.04180 link
2025-02-06 Dense Fixed-Wing Swarming using Receding-Horizon NMPC Varun Madabushi et.al. 2502.04174 null
2025-02-06 Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement Learning Wesley A. Suttle et.al. 2502.04141 null
2025-02-06 Beyond the Final Layer: Hierarchical Query Fusion Transformer with Agent-Interpolation Initialization for 3D Instance Segmentation Jiahao Lu et.al. 2502.04139 null
2025-02-06 VTutor: An Open-Source SDK for Generative AI-Powered Animated Pedagogical Agents with Multi-Media Output Eason Chen et.al. 2502.04103 null
2025-02-06 Strategic Learning with Local Explanations as Feedback Kiet Q. H. Vo et.al. 2502.04058 null
2025-02-05 A Schema-Guided Reason-while-Retrieve framework for Reasoning on Scene Graphs with Large-Language-Models (LLMs) Yiye Chen et.al. 2502.03450 null
2025-02-05 Prediction of the Most Fire-Sensitive Point in Building Structures with Differentiable Agents for Thermal Simulators Yuan Xinjie et.al. 2502.03424 null
2025-02-05 Energy-Efficient Flying LoRa Gateways: A Multi-Agent Reinforcement Learning Approach Abdullahi Isa Ahmed et.al. 2502.03377 null
2025-02-05 Learning from Active Human Involvement through Proxy Value Propagation Zhenghao Peng et.al. 2502.03369 null
2025-02-05 PalimpChat: Declarative and Interactive AI analytics Chunwei Liu et.al. 2502.03368 null
2025-02-05 Inverse Mixed Strategy Games with Generative Trajectory Models Max Muchen Sun et.al. 2502.03356 null
2025-02-05 Implicit Communication in Human-Robot Collaborative Transport Elvin Yang et.al. 2502.03346 link
2025-02-05 Actions Speak Louder Than Words: Rate-Reward Trade-off in Markov Decision Processes Haotian Wu et.al. 2502.03335 null
2025-02-05 SymAgent: A Neural-Symbolic Self-Learning Agent Framework for Complex Reasoning over Knowledge Graphs Ben Liu et.al. 2502.03283 null
2025-02-05 Modeling and Optimization of Insulin Injection for Type-1 Diabetes Mellitus Management Rinrada Jadsadaphongphaibool et.al. 2502.03269 null
2025-02-04 QLASS: Boosting Language Agent Inference via Q-Guided Stepwise Search Zongyu Lin et.al. 2502.02584 link
2025-02-04 Decision Theoretic Foundations for Conformal Prediction: Optimal Uncertainty Quantification for Risk-Averse Agents Shayan Kiyani et.al. 2502.02561 null
2025-02-04 AAD-DCE: An Aggregated Multimodal Attention Mechanism for Early and Late Dynamic Contrast Enhanced Prostate MRI Synthesis Divya Bharti et.al. 2502.02555 link
2025-02-04 Uncertainty Quantification for Collaborative Object Detection Under Adversarial Attacks Huiqun Huang et.al. 2502.02537 null
2025-02-04 Adaptive Self-improvement LLM Agentic System for ML Library Development Genghan Zhang et.al. 2502.02534 link
2025-02-04 Multi-Agent Design: Optimizing Agents with Better Prompts and Topologies Han Zhou et.al. 2502.02533 null
2025-02-04 Why human-AI relationships need socioaffective alignment Hannah Rose Kirk et.al. 2502.02528 null
2025-02-04 The Cost Perspective of Liquid Democracy: Feasibility and Control Shiri Alouf-Heffetz et.al. 2502.02380 null
2025-02-04 Mirai: A Wearable Proactive AI “Inner-Voice” for Contextual Nudging Cathy Mengying Fang et.al. 2502.02370 null
2025-02-04 MAGNNET: Multi-Agent Graph Neural Network-based Efficient Task Allocation for Autonomous Vehicles with Deep Reinforcement Learning Lavanya Ratnabala et.al. 2502.02311 null
2025-01-31 Vintix: Action Model via In-Context Reinforcement Learning Andrey Polubarov et.al. 2501.19400 link
2025-01-31 Do LLMs Strategically Reveal, Conceal, and Infer Information? A Theoretical and Empirical Analysis in The Chameleon Game Mustafa O. Karabag et.al. 2501.19398 link
2025-01-31 Learning Contracts in Hierarchical Multi-Agent Systems Antoine Scheid et.al. 2501.19388 null
2025-01-31 The Physics and Metaphysics of Social Powers: Bridging Cognitive Processing and Social Dynamics, a New Perspective on Power through Active Inference Mahault Albarracin et.al. 2501.19368 null
2025-01-31 PixelWorld: Towards Perceiving Everything as Pixels Zhiheng Lyu et.al. 2501.19339 null
2025-01-31 MINDSTORES: Memory-Informed Neural Decision Synthesis for Task-Oriented Reinforcement in Embodied Systems Anirudh Chari et.al. 2501.19318 null
2025-01-31 Objective Metrics for Human-Subjects Evaluation in Explainable Reinforcement Learning Balint Gyevnar et.al. 2501.19256 null
2025-02-03 SHARPIE: A Modular Framework for Reinforcement Learning and Human-AI Interaction Experiments Hüseyin Aydın et.al. 2501.19245 link
2025-01-31 Multi-agent Multi-armed Bandit with Fully Heavy-tailed Dynamics Xingyu Wang et.al. 2501.19239 null
2025-01-31 A parallelizable variant of HCA* Sreenivasan Ganti et.al. 2501.19218 null
2025-01-30 Can we Retrieve Everything All at Once? ARM: An Alignment-Oriented LLM-based Retrieval Method Peter Baile Chen et.al. 2501.18539 null
2025-01-30 Design and Validation of Learning Aware HMI For Learning-Enabled Increasingly Autonomous Systems Parth Ganeriwala et.al. 2501.18506 null
2025-01-30 Graph Exploration with Edge Weight Estimates Matthias Gehnen et.al. 2501.18496 null
2025-01-30 Conversation Games and a Strategic View of the Turing Test Kaveh Aryan et.al. 2501.18455 null
2025-01-30 Stable Marriage: Loyalty vs. Competition Amit Ronen et.al. 2501.18442 null
2025-01-30 Gravity-Bench-v1: A Benchmark on Gravitational Physics Discovery for Agents Nolan Koblischke et.al. 2501.18411 null
2025-01-30 Leveraging LLM Agents for Automated Optimization Modeling for SASP Problems: A Graph-RAG based Approach Tianpeng Pan et.al. 2501.18320 null
2025-01-30 Model-Free RL Agents Demonstrate System 1-Like Intentionality Hal Ashton et.al. 2501.18299 null
2025-01-30 CueTip: An Interactive and Explainable Physics-aware Pool Assistant Sean Memery et.al. 2501.18291 null
2025-01-30 Economic Rationality under Specialization: Evidence of Decision Bias in AI Agents ShuiDe Wen et.al. 2501.18190 null
2025-01-29 From Sparse to Dense: Toddler-inspired Reward Transition in Goal-Oriented Reinforcement Learning Junseok Park et.al. 2501.17842 null
2025-01-29 A note on the Cucker-Smale model with time delay and communication failures Elisa Continelli et.al. 2501.17743 null
2025-01-29 RICoTA: Red-teaming of In-the-wild Conversation with Test Attempts Eujeong Choi et.al. 2501.17715 link
2025-01-29 Inferring Implicit Goals Across Differing Task Models Silvia Tulli et.al. 2501.17704 null
2025-01-29 CAMP in the Odyssey: Provably Robust Reinforcement Learning with Certified Radius Maximization Derui Wang et.al. 2501.17667 link
2025-01-29 Multi-Agent Path Finding Using Conflict-Based Search and Structural-Semantic Topometric Maps Scott Fredriksson et.al. 2501.17661 null
2025-01-29 Coalitional control: a bottom-up approach Filiberto Fele et.al. 2501.17614 null
2025-01-29 Coalitional model predictive control of an irrigation canal Filiberto Fele et.al. 2501.17561 null
2025-01-29 Is Conversational XAI All You Need? Human-AI Decision Making With a Conversational XAI Assistant Gaole He et.al. 2501.17546 link
2025-01-29 Sequential Learning of the Pareto Front for Multi-objective Bandits Elise Crépon et.al. 2501.17513 link
2025-01-28 Evidence on the Regularisation Properties of Maximum-Entropy Reinforcement Learning Rémy Hosseinkhan Boucher et.al. 2501.17115 null
2025-01-28 CRSet: Non-Interactive Verifiable Credential Revocation with Metadata Privacy for Issuers and Everyone Else Felix Hoops et.al. 2501.17089 null
2025-01-28 Learning Mean Field Control on Sparse Graphs Christian Fabian et.al. 2501.17079 null
2025-01-28 Induced Modularity and Community Detection for Functionally Interpretable Reinforcement Learning Anna Soligo et.al. 2501.17077 null
2025-01-28 Context is Key in Agent Security Lillian Tsai et.al. 2501.17070 null
2025-01-28 Revisit Mixture Models for Multi-Agent Simulation: Experimental Study within a Unified Framework Longzhong Lin et.al. 2501.17015 null
2025-01-28 Towards Open-Source and Modular Space Systems with ATMOS Pedro Roque et.al. 2501.16973 link
2025-01-28 Heterogeneity-aware Personalized Federated Learning via Adaptive Dual-Agent Reinforcement Learning Xi Chen et.al. 2501.16966 null
2025-01-28 ToolFactory: Automating Tool Generation by Leveraging LLM to Understand REST API Documentations Xinyi Ni et.al. 2501.16945 null
2025-01-28 Beyond Human Intervention: Algorithmic Collusion through Multi-Agent Learning Strategies Suzie Grondin et.al. 2501.16935 null
2025-01-27 LUCY: Linguistic Understanding and Control Yielding Early Stage of Her Heting Gao et.al. 2501.16327 link
2025-01-27 Privacy-aware Nash Equilibrium Synthesis with Partially Ordered LTL $_f$ Objectives Caleb Probine et.al. 2501.16307 null
2025-01-27 Multi-Agent Geospatial Copilots for Remote Sensing Workflows Chaehong Lee et.al. 2501.16254 null
2025-01-27 Will Systems of LLM Agents Cooperate: An Investigation into a Social Dilemma Richard Willis et.al. 2501.16173 link
2025-01-27 AI Agents for Computer Use: A Review of Instruction-based Computer Control, GUI Automation, and Operator Assistants Pascal J. Sager et.al. 2501.16150 null
2025-01-27 Quantifying the Self-Interest Level of Markov Social Dilemmas Richard Willis et.al. 2501.16138 null
2025-01-27 Multi-Agent Meta-Offline Reinforcement Learning for Timely UAV Path Planning and Data Collection Eslam Eldeeb et.al. 2501.16098 null
2025-01-27 Galaxy Era: Agent-based Simulation of Execution Tickets Pascal Stichler et.al. 2501.16090 link
2025-01-27 Value-oriented forecast reconciliation for renewables in electricity markets Honglin Wen et.al. 2501.16086 null
2025-01-27 Generating Spatial Synthetic Populations Using Wasserstein Generative Adversarial Network: A Case Study with EU-SILC Data for Helsinki and Thessaloniki Vanja Falck et.al. 2501.16080 null
2025-01-24 An Attentive Graph Agent for Topology-Adaptive Cyber Defence Ilya Orson Sandoval et.al. 2501.14700 link
2025-01-24 The Division of Surplus and the Burden of Proof Deniz Kattwinkel et.al. 2501.14686 null
2025-01-24 MedAgentBench: Dataset for Benchmarking LLMs as Agents in Medical Applications Yixing Jiang et.al. 2501.14654 link
2025-01-24 Whisper D-SGD: Correlated Noise Across Agents for Differentially Private Decentralized Learning Angelo Rodio et.al. 2501.14644 link
2025-01-24 Fair Division Beyond Monotone Valuations Siddharth Barman et.al. 2501.14609 null
2025-01-24 Hybrid Quantum-Classical Multi-Agent Pathfinding Thore Gerlach et.al. 2501.14568 null
2025-01-24 Reducing Action Space for Deep Reinforcement Learning via Causal Effect Estimation Wenzhang Liu et.al. 2501.14543 link
2025-01-24 Breaking the Pre-Planning Barrier: Real-Time Adaptive Coordination of Mission and Charging UAVs Using Graph Reinforcement Learning Yuhan Hu et.al. 2501.14488 null
2025-01-24 Avoiding Overfitting in Variable-Order Markov Models: a Cross-Validation Approach Valeria Secchini et.al. 2501.14476 null
2025-01-24 The Pseudo-Dimension of Contracts Paul Duetting et.al. 2501.14474 null
2025-01-23 GUI-Bee: Align GUI Action Grounding to Novel Environments via Autonomous Exploration Yue Fan et.al. 2501.13896 null
2025-01-23 Utilizing Evolution Strategies to Train Transformers in Reinforcement Learning Matyáš Lorenc et.al. 2501.13883 link
2025-01-23 Eye Gaze as a Signal for Conveying User Attention in Contextual AI Systems Ethan Wilson et.al. 2501.13878 null
2025-01-23 EICopilot: Search and Explore Enterprise Information over Large-scale Knowledge Graphs with LLM-driven Agents Yuhui Yun et.al. 2501.13746 null
2025-01-23 Scalable Safe Multi-Agent Reinforcement Learning for Multi-Agent System Haikuo Du et.al. 2501.13727 link
2025-01-23 A Non-Parametric Approach to Heterogeneity Analysis Avner Seror et.al. 2501.13721 null
2025-01-23 Revisiting Online Learning Approach to Inverse Linear Optimization: A Fenchel–Young Loss Perspective and Gap-Dependent Regret Analysis Shinsaku Sakaue et.al. 2501.13648 null
2025-01-23 WFCRL: A Multi-Agent Reinforcement Learning Benchmark for Wind Farm Control Claire Bizon Monroc et.al. 2501.13592 link
2025-01-23 Explainable AI-aided Feature Selection and Model Reduction for DRL-based V2X Resource Allocation Nasir Khan et.al. 2501.13552 null
2025-01-23 Towards a Theory of AI Personhood Francis Rhys Ward et.al. 2501.13533 null
2025-01-22 Boosting MCTS with Free Energy Minimization Mawaba Pascal Dao et.al. 2501.13083 null
2025-01-22 Refining Input Guardrails: Enhancing LLM-as-a-Judge Efficiency Through Chain-of-Thought Fine-Tuning and Alignment Melissa Kazemi Rad et.al. 2501.13080 null
2025-01-22 Evolution and The Knightian Blindspot of Machine Learning Joel Lehman et.al. 2501.13075 null
2025-01-22 Optimizing Return Distributions with Distributional Dynamic Programming Bernardo Ávila Pires et.al. 2501.13028 null
2025-01-22 The regret lower bound for communicating Markov Decision Processes Victor Boone et.al. 2501.13013 null
2025-01-22 MONA: Myopic Optimization with Non-myopic Approval Can Mitigate Multi-step Reward Hacking Sebastian Farquhar et.al. 2501.13011 null
2025-01-22 Constructive characterisations of the must-preorder for asynchrony Giovanni Bernardi et.al. 2501.13002 link
2025-01-22 An Offline Multi-Agent Reinforcement Learning Framework for Radio Resource Management Eslam Eldeeb et.al. 2501.12991 null
2025-01-22 Learning-based Distributed Model Predictive Control using Multi-Agent Bayesian Optimization Hossein Nejatbakhsh Esfahani et.al. 2501.12989 null
2025-01-22 Quantification of Ultrafast Nonlinear Photothermal and Photoacoustic Effects in Molecular Thin Films via Time-Domain Brillouin Scattering Valentin Cherruault et.al. 2501.12912 null
2025-01-21 Expertise elevates AI usage: experimental evidence comparing laypeople and professional artists Thomas F. Eisenmann et.al. 2501.12374 link
2025-01-21 UI-TARS: Pioneering Automated GUI Interaction with Native Agents Yujia Qin et.al. 2501.12326 link
2025-01-21 Transitions to synchronization in adaptive multilayer networks with higher-order interactions Richita Ghosh et.al. 2501.12301 null
2025-01-21 mmCooper: A Multi-agent Multi-stage Communication-efficient and Collaboration-robust Cooperative Perception Framework Bingyi Liu et.al. 2501.12263 null
2025-01-21 Multi-Agent Feedback Motion Planning using Probably Approximately Correct Nonlinear Model Predictive Control Mark Gonzales et.al. 2501.12234 null
2025-01-21 Empower Healthcare through a Self-Sovereign Identity Infrastructure for Secure Electronic Health Data Access Antonio López Martínez et.al. 2501.12229 null
2025-01-21 Convergence of time-delayed opinion dynamics with complex interaction types Lingling Yao et.al. 2501.12219 null
2025-01-21 RL-RC-DoT: A Block-level RL agent for Task-Aware Video Compression Uri Gadot et.al. 2501.12216 null
2025-01-21 Experience-replay Innovative Dynamics Tuo Zhang et.al. 2501.12199 null
2025-01-21 Opinion dynamics in bounded confidence models with manipulative agents: Moving the Overton window A. Bautista et.al. 2501.12198 null
2025-01-17 Agent4Edu: Generating Learner Response Data by Generative Agents for Intelligent Education Systems Weibo Gao et.al. 2501.10332 link
2025-01-17 Towards Human-Guided, Data-Centric LLM Co-Pilots Evgeny Saveliev et.al. 2501.10321 null
2025-01-17 Towards Preventing Overreliance on Task-Oriented Conversational AI Through Accountability Modeling Suvodip Dey et.al. 2501.10316 link
2025-01-17 Enhancing AI Transparency: XRL-Based Resource Management and RAN Slicing for 6G ORAN Architecture Suvidha Mhatre et.al. 2501.10292 null
2025-01-17 Evidence for the gravity-driven and magnetically-regularized gas flows feeding the massive protostellar cluster in Cep A Panigrahy Sandhyarani et.al. 2501.10280 null
2025-01-17 Grey-Box Fuzzing in Constrained Ultra-Large Systems: Lessons for SE Community Jiazhao Yu et.al. 2501.10269 null
2025-01-17 Deployment of an Aerial Multi-agent System for Automated Task Execution in Large-scale Underground Mining Environments Niklas Dahlquist et.al. 2501.10262 null
2025-01-17 Logarithmic Regret for Nonlinear Control James Wang et.al. 2501.10261 null
2025-01-17 Secure Semantic Communication With Homomorphic Encryption Rui Meng et.al. 2501.10182 null
2025-01-17 PaSa: An LLM Agent for Comprehensive Academic Paper Search Yichen He et.al. 2501.10120 link
2025-01-16 CyberMentor: AI Powered Learning Tool Platform to Address Diverse Student Needs in Cybersecurity Education Tianyu Wang et.al. 2501.09709 link
2025-01-16 The Goofus & Gallant Story Corpus for Practical Value Alignment Md Sultan Al Nahian et.al. 2501.09707 null
2025-01-16 Authenticated Delegation and Authorized AI Agents Tobin South et.al. 2501.09674 null
2025-01-16 NS-Gym: Open-Source Simulation Environments and Benchmarks for Non-Stationary Markov Decision Processes Nathaniel S. Keplinger et.al. 2501.09646 link
2025-01-16 Empowering Large Language Models in Wireless Communication: A Novel Dataset and Fine-Tuning Framework Yushen Lin et.al. 2501.09631 null
2025-01-16 A Multi-agent System for Hybrid Optimization Eric S. Fraga et.al. 2501.09563 null
2025-01-16 Solving the unsolvable: Translating case law in Hong Kong King-kui Sin et.al. 2501.09444 null
2025-01-16 ADAGE: A generic two-layer framework for adaptive agent based modelling Benjamin Patrick Evans et.al. 2501.09429 null
2025-01-16 AutoCBT: An Autonomous Multi-agent Framework for Cognitive Behavioral Therapy in Psychological Counseling Ancheng Xu et.al. 2501.09426 null
2025-01-16 Agent-Based Simulation of a Perpetual Futures Market Ramshreyas Rao et.al. 2501.09404 null
2025-01-15 Personality Modeling for Persuasion of Misinformation using AI Agent Qianmin Lou et.al. 2501.08985 null
2025-01-15 Physical AI Agents: Integrating Cognitive Intelligence with Real-World Action Fouad Bousetouane et.al. 2501.08944 null
2025-01-15 A Reinforcement Learning Approach to Quiet and Safe UAM Traffic Management Surya Murthy et.al. 2501.08941 null
2025-01-15 Disentangling Exploration of Large Language Models by Optimal Exploitation Tim Grams et.al. 2501.08925 null
2025-01-15 Leveraging Large Language Models as Knowledge-Driven Agents for Reliable Retrosynthesis Planning Qinyu Ma et.al. 2501.08897 link
2025-01-15 Silent Abandonment in Text-Based Contact Centers: Identifying, Quantifying, and Mitigating its Operational Impacts Antonio Castellanos et.al. 2501.08869 null
2025-01-15 The geometry of moral decision making Roland M. Friedrich et.al. 2501.08865 null
2025-01-15 On the Dominance of Truth-Telling in Gradual Mechanisms Wenqian Wang et.al. 2501.08802 null
2025-01-15 Networked Agents in the Dark: Team Value Learning under Partial Observability Guilherme S. Varela et.al. 2501.08778 null
2025-01-15 Leveraging LLM Agents for Translating Network Configurations Yunze Wei et.al. 2501.08760 null
2025-01-14 ADAM-1: AI and Bioinformatics for Alzheimer’s Detection and Microbiome-Clinical Data Integrations Ziyuan Huang et.al. 2501.08324 null
2025-01-14 Using Gamified Experiments to Tame Complexity: the case of the Schelling Model of Segregation Aleix Nicolás Olivé et.al. 2501.08280 null
2025-01-14 Addressing the sustainable AI trilemma: a case study on LLM agents and RAG Hui Wu et.al. 2501.08262 link
2025-01-14 Engineering LLM Powered Multi-agent Framework for Autonomous CloudOps Kannan Parthasarathy et.al. 2501.08243 null
2025-01-14 Dynamic Pricing in High-Speed Railways Using Multi-Agent Reinforcement Learning Enrique Adrian Villarrubia-Martin et.al. 2501.08234 null
2025-01-14 ASTRID – An Automated and Scalable TRIaD for the Evaluation of RAG-based Clinical Question Answering Systems Mohita Chowdhury et.al. 2501.08208 null
2025-01-14 An Elementary Microscopic Model of Sympatric Speciation Franco Bagnoli et.al. 2501.08130 null
2025-01-14 Hybrid Action Based Reinforcement Learning for Multi-Objective Compatible Autonomous Driving Guizhe Jin et.al. 2501.08096 null
2025-01-14 AgentPose: Progressive Distribution Alignment via Feature Agent for Human Pose Distillation Feng Zhang et.al. 2501.08088 null
2025-01-14 CuAsmRL: Optimizing GPU SASS Schedules via Deep Reinforcement Learning Guoliang He et.al. 2501.08071 link
2025-01-13 WebWalker: Benchmarking LLMs in Web Traversal Jialong Wu et.al. 2501.07572 link
2025-01-13 SafeSwarm: Decentralized Safe RL for the Swarm of Drones Landing in Dense Crowds Grik Tadevosyan et.al. 2501.07566 null
2025-01-13 SST-EM: Advanced Metrics for Evaluating Semantic, Spatial and Temporal Aspects in Video Editing Varun Biyyala et.al. 2501.07554 link
2025-01-13 Evaluating Agent-based Program Repair at Google Pat Rondon et.al. 2501.07531 null
2025-01-13 Improving DeFi Accessibility through Efficient Liquidity Provisioning with Deep Reinforcement Learning Haonan Xu et.al. 2501.07508 null
2025-01-13 How low-cost AI universal approximators reshape market efficiency Paolo Barucca et.al. 2501.07489 null
2025-01-13 SynthSoM: A synthetic intelligent multi-modal sensing-communication dataset for Synesthesia of Machines (SoM) Xiang Cheng et.al. 2501.07459 link
2025-01-13 Understanding and Benchmarking Artificial Intelligence: OpenAI’s o3 Is Not AGI Rolf Pfister et.al. 2501.07458 null
2025-01-13 Online inductive learning from answer sets for efficient reinforcement learning exploration Celeste Veronese et.al. 2501.07445 null
2025-01-13 Attention when you need Lokesh Boominathan et.al. 2501.07440 null
2025-01-10 PEACE: Empowering Geologic Map Holistic Understanding with MLLMs Yangyu Huang et.al. 2501.06184 null
2025-01-10 A Mixed-Integer Conic Program for the Multi-Agent Moving-Target Traveling Salesman Problem Allen George Philip et.al. 2501.06130 null
2025-01-10 Finite-Horizon Single-Pull Restless Bandits: An Efficient Index Policy For Scarce Resource Allocation Guojun Xiong et.al. 2501.06103 null
2025-01-10 Learning Flexible Heterogeneous Coordination with Capability-Aware Shared Hypernetworks Kevin Fu et.al. 2501.06058 link
2025-01-10 Investigating the Impact of Observation Space Design Choices On Training Reinforcement Learning Solutions for Spacecraft Problems Nathaniel Hamilton et.al. 2501.06016 null
2025-01-10 Enhanced Acoustic Beamforming with Sub-Aperture Angular Multiply and Sum – in vivo and in Human Demonstration Matthieu Toulemonde et.al. 2501.05837 null
2025-01-10 CognoSpeak: an automatic, remote assessment of early cognitive decline in real-world conversational speech Madhurananda Pahar et.al. 2501.05755 null
2025-01-10 Semantic Mapping in Indoor Embodied AI – A Comprehensive Survey and Future Directions Sonia Raychaudhuri et.al. 2501.05750 null
2025-01-10 How to Enable Effective Cooperation Between Humans and NLP Models: A Survey of Principles, Formalizations, and Beyond Chen Huang et.al. 2501.05714 null
2025-01-10 Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains Vighnesh Subramaniam et.al. 2501.05707 null
2025-01-09 Search-o1: Agentic Search-Enhanced Large Reasoning Models Xiaoxi Li et.al. 2501.05366 link
2025-01-09 Control of Overpopulated Tails in Kinetic Epidemic Models Mattia Zanella et.al. 2501.05365 null
2025-01-09 A Path Variant of the Explorer Director Game on Graphs Abigail Raz et.al. 2501.05364 null
2025-01-09 On Corrigibility and Alignment in Multi Agent Games Edmund Dable-Heath et.al. 2501.05360 null
2025-01-09 A learning agent-based approach to the characterization of open quantum systems Lorenzo Fioroni et.al. 2501.05350 null
2025-01-09 The Bakers and Millers Game with Restricted Locations Simon Krogmann et.al. 2501.05334 null
2025-01-09 Knowledge Transfer in Model-Based Reinforcement Learning Agents for Efficient Multi-Task Learning Dmytro Kuzmenko et.al. 2501.05329 null
2025-01-09 Contrast-Free Myocardial Scar Segmentation in Cine MRI using Motion and Texture Fusion Guang Yang et.al. 2501.05241 null
2025-01-09 CoDe: Communication Delay-Tolerant Multi-Agent Collaboration via Dual Alignment of Intent and Timeliness Shoucheng Song et.al. 2501.05207 null
2025-01-09 Emergence of human-like polarization among large language model agents Jinghua Piao et.al. 2501.05171 null
2025-01-08 RadGPT: Constructing 3D Image-Text Tumor Datasets Pedro R. A. S. Bassi et.al. 2501.04678 link
2025-01-08 InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection Yuhang Liu et.al. 2501.04575 link
2025-01-08 The importance of being discrete – An agent-based model for active nematics and more Mathieu Dedenon et.al. 2501.04559 null
2025-01-08 Approximately EFX and PO Allocations for Bivalued Chores Zehan Lin et.al. 2501.04550 null
2025-01-08 Cyber-Physical Steganography in Robotic Motion Control Ching-Chun Chang et.al. 2501.04541 null
2025-01-08 Safe Reinforcement Learning with Minimal Supervision Alexander Quessy et.al. 2501.04481 null
2025-01-08 Hybrid Artificial Intelligence Strategies for Drone Navigation Rubén San-Segundo et.al. 2501.04472 null
2025-01-08 A Digital Shadow for Modeling, Studying and Preventing Urban Crime Juan Palma-Borda et.al. 2501.04435 null
2025-01-08 User Simulation in the Era of Generative AI: User Modeling, Synthetic Data Generation, and System Evaluation Krisztian Balog et.al. 2501.04410 null
2025-01-08 Agent Laboratory: Using LLM Agents as Research Assistants Samuel Schmidgall et.al. 2501.04227 null
2025-01-07 Kinetic theory of decentralized learning for smart active matter Gerhard Jung et.al. 2501.03948 null
2025-01-07 Implicit Coordination using Active Epistemic Inference Lauren Bramblett et.al. 2501.03907 null
2025-01-07 Truthful mechanisms for linear bandit games with private contexts Yiting Hu et.al. 2501.03865 null
2025-01-07 Rendezfood: A Design Case Study of a Conversational Location-based Approach in Restaurants Philip Weber et.al. 2501.03862 null
2025-01-07 Run-and-tumble chemotaxis using reinforcement learning Ramesh Pramanik et.al. 2501.03687 null
2025-01-07 The Textbook of Tomorrow: Rethinking Course Material Interfacing in the Era of GPT Audrey Olson et.al. 2501.03618 null
2025-01-07 Distributed Observer for Descriptor Linear System: The Luenberger Observer Method Shuai Liu et.al. 2501.03564 null
2025-01-07 Rethinking Adversarial Attacks in Reinforcement Learning from Policy Distribution Perspective Tianyang Duan et.al. 2501.03562 null
2025-01-07 FgC2F-UDiff: Frequency-guided and Coarse-to-fine Unified Diffusion Model for Multi-modality Missing MRI Synthesis Xiaojiao Xiao et.al. 2501.03526 link
2025-01-07 A Unified Attack Detection Strategy for Multi-Agent Systems over Transient and Steady Stages Jinming Gao et.al. 2501.03496 null
2025-01-06 Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation Yuhui Zhang et.al. 2501.03225 link
2025-01-06 Turn-based Multi-Agent Reinforcement Learning Model Checking Dennis Gross et.al. 2501.03187 null
2025-01-06 Deep-Relative-Trust-Based Diffusion for Decentralized Deep Learning Muyun Li et.al. 2501.03162 null
2025-01-06 Large language models for artificial general intelligence (AGI): A survey of foundational principles and approaches Alhassan Mumuni et.al. 2501.03151 null
2025-01-06 Probably Correct Optimal Stable Matching for Two-Sided Markets Under Uncertainty Andreas Athanasopoulos et.al. 2501.03018 link
2025-01-06 Approximating N-Player Nash Equilibrium through Gradient Descent Dongge Wang et.al. 2501.03001 null
2025-01-06 CALM: Curiosity-Driven Auditing for Large Language Models Xiang Zheng et.al. 2501.02997 link
2025-01-06 CAMP: Collaborative Attention Model with Profiles for Vehicle Routing Problems Chuanbo Hua et.al. 2501.02977 link
2025-01-06 Revisiting Communication Efficiency in Multi-Agent Reinforcement Learning from the Dimensional Analysis Perspective Chuxiong Sun et.al. 2501.02888 null
2025-01-06 A Novel Vision Transformer for Camera-LiDAR Fusion based Traffic Object Segmentation Toomas Tahves et.al. 2501.02858 null
2025-01-03 QuArch: A Question-Answering Dataset for AI Agents in Computer Architecture Shvetank Prakash et.al. 2501.01892 null
2025-01-03 Multi-Agent Conversational Online Learning for Adaptive LLM Response Identification Xiangxiang Dai et.al. 2501.01849 link
2025-01-03 MoColl: Agent-Based Specific and General Model Collaboration for Image Captioning Pu Yang et.al. 2501.01834 null
2025-01-03 SDPO: Segment-Level Direct Preference Optimization for Social Agents Aobo Kong et.al. 2501.01821 link
2025-01-03 Distributed Framework Construction for Affine Formation Control Huiming Li et.al. 2501.01817 null
2025-01-03 Laparoscopic Scene Analysis for Intraoperative Visualisation of Gamma Probe Signals in Minimally Invasive Cancer Surgery Baoru Huang et.al. 2501.01752 null
2025-01-03 Proposing Hierarchical Goal-Conditioned Policy Planning in Multi-Goal Reinforcement Learning Gavin B. Rens et.al. 2501.01727 null
2025-01-03 AgentRefine: Enhancing Agent Generalization through Refinement Tuning Dayuan Fu et.al. 2501.01702 null
2025-01-03 The (Exact) Price of Cardinality for Indivisible Goods: A Parametric Perspective Alexander Lam et.al. 2501.01660 null
2025-01-03 PSYCHE: A Multi-faceted Patient Simulation Framework for Evaluation of Psychiatric Assessment Conversational Agents Jingoo Lee et.al. 2501.01594 null
2025-01-02 Optimal Strategy Revision in Population Games: A Mean Field Game Theory Perspective Julian Barreiro-Gomez et.al. 2501.01389 null
2025-01-02 PIMAEX: Multi-Agent Exploration through Peer Incentivization Michael Kölle et.al. 2501.01266 null
2025-01-02 Face-Human-Bench: A Comprehensive Benchmark of Face and Human Understanding for Multi-modal Assistants Lixiong Qin et.al. 2501.01243 null
2025-01-02 From Interaction to Attitude: Exploring the Impact of Human-AI Cooperation on Mental Illness Stigma Tianqi Song et.al. 2501.01220 null
2025-01-02 D-HAT: a Diatom-inspired structure for a Helmet concept Against Trauma Ludovico Musenich et.al. 2501.01211 null
2025-01-02 Harnessing Multi-Agent LLMs for Complex Engineering Problem-Solving: A Framework for Senior Design Projects Abdullah Mushtaq et.al. 2501.01205 null
2025-01-02 3D-LLaVA: Towards Generalist 3D LMMs with Omni Superpoint Transformer Jiajun Deng et.al. 2501.01163 null
2025-01-02 A3: Android Agent Arena for Mobile GUI Agents Yuxiang Chai et.al. 2501.01149 null
2025-01-02 Embodied AI-Enhanced Vehicular Networks: An Integrated Large Language Models and Reinforcement Learning Method Ruichen Zhang et.al. 2501.01141 null
2025-01-02 Communicating Unexpectedness for Out-of-Distribution Multi-Agent Reinforcement Learning Min Whoo Lee et.al. 2501.01140 null
2024-12-30 Distributed Mixture-of-Agents for Edge Inference with Large Language Models Purbesh Mitra et.al. 2412.21200 link
2024-12-30 Aviary: training language agents on challenging scientific tasks Siddharth Narayanan et.al. 2412.21154 link
2024-12-30 Training Software Engineering Agents and Verifiers with SWE-Gym Jiayi Pan et.al. 2412.21139 link
2024-12-30 Positional information trade-offs in boundary-driven reaction-diffusion systems Jonas Berx et.al. 2412.21113 null
2024-12-30 Exploring and Controlling Diversity in LLM-Agent Conversation KuanChao Chu et.al. 2412.21102 null
2024-12-30 Advances in Multi-agent Reinforcement Learning: Persistent Autonomy and Robot Learning Lab Report 2024 Reza Azadeh et.al. 2412.21088 null
2024-12-30 Privacy-Aware Multi-Device Cooperative Edge Inference with Distributed Resource Bidding Wenhao Zhuang et.al. 2412.21069 null
2024-12-30 Plancraft: an evaluation dataset for planning with LLM agents Gautier Dagan et.al. 2412.21033 link
2024-12-30 UnrealZoo: Enriching Photo-realistic Virtual Worlds for Embodied AI Fangwei Zhong et.al. 2412.20977 null
2024-12-31 SecBench: A Comprehensive Multi-Dimensional Benchmarking Dataset for LLMs in Cybersecurity Pengfei Jing et.al. 2412.20787 null
2024-12-27 Bottom-up robust modeling for the foraging behavior of Physarum polycephalum Damiano Reginato et.al. 2412.19790 null
2024-12-27 Fortran2CPP: Automating Fortran-to-C++ Migration using LLMs via Multi-Turn Dialogue and Dual-Agent Integration Le Chen et.al. 2412.19770 link
2024-12-27 Can Large Language Models Adapt to Other Agents In-Context? Matthew Riemer et.al. 2412.19726 null
2024-12-27 OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis Qiushi Sun et.al. 2412.19723 null
2024-12-27 The Value of Recall in Extensive-Form Games Ratip Emin Berker et.al. 2412.19659 null
2024-12-27 Xmodel-2 Technical Report Wang Qun et.al. 2412.19638 link
2024-12-27 Bidding Games on Markov Decision Processes with Quantitative Reachability Objectives Guy Avni et.al. 2412.19609 null
2024-12-27 Hindsight Planner: A Closed-Loop Few-Shot Planner for Embodied Instruction Following Yuxiao Yang et.al. 2412.19562 null
2024-12-27 Quantiles under ambiguity and risk sharing Peng Liu et.al. 2412.19546 null
2024-12-27 TARGA: Targeted Synthetic Data Generation for Practical Reasoning over Structured Data Xiang Huang et.al. 2412.19544 link
2024-12-24 Decentralized Intelligence in GameFi: Embodied AI Agents and the Convergence of DeFi and Virtual Ecosystems Fernando Jia et.al. 2412.18601 link
2024-12-24 Automated Code Review In Practice Umut Cihan et.al. 2412.18531 null
2024-12-24 Large Language Model guided Deep Reinforcement Learning for Decision Making in Autonomous Driving Hao Pang et.al. 2412.18511 null
2024-12-24 Calibrating the Subjective Mark Whitmeyer et.al. 2412.18486 null
2024-12-24 Multi-Agent Norm Perception and Induction in Distributed Healthcare Chao Li et.al. 2412.18454 null
2024-12-24 3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene Understanding Tatiana Zemskova et.al. 2412.18450 link
2024-12-24 GeAR: Graph-enhanced Agent for Retrieval-augmented Generation Zhili Shen et.al. 2412.18431 null
2024-12-24 Explainable Multi-Modal Data Exploration in Natural Language via LLM Agent Farhad Nooralahzadeh et.al. 2412.18428 link
2024-12-24 GUI Testing Arena: A Unified Benchmark for Advancing Autonomous GUI Testing Agent Kangjia Zhao et.al. 2412.18426 null
2024-12-24 Muse: A Multimodal Conversational Recommendation Dataset with Scenario-Grounded User Profiles Zihan Wang et.al. 2412.18416 null
2024-12-23 Observation Interference in Partially Observable Assistance Games Scott Emmons et.al. 2412.17797 null
2024-12-23 ResearchTown: Simulator of Human Research Community Haofei Yu et.al. 2412.17767 link
2024-12-23 Sensitivity Curve Maximization: Attacking Robust Aggregators in Distributed Learning Christian A. Schroth et.al. 2412.17740 null
2024-12-23 Robin Hood Reachability Bidding Games Shaull Almagor et.al. 2412.17718 null
2024-12-23 SMAC-Hard: Enabling Mixed Opponent Strategy Script and Self-play on SMAC Yue Deng et.al. 2412.17707 link
2024-12-23 Large Language Model Safety: A Holistic Survey Dan Shi et.al. 2412.17686 link
2024-12-23 Shape and Performance of Fastest Paths over Networks with Interacting Selfish Agents Marco Cogoni et.al. 2412.17665 null
2024-12-23 CoSurfGS:Collaborative 3D Surface Gaussian Splatting with Distributed Learning for Large Scene Reconstruction Yuanyuan Gao et.al. 2412.17612 null
2024-12-23 Fluid-Derived Lattices for Unbiased Modeling of Bacterial Colony Growth Bryan Verhoef et.al. 2412.17604 null
2024-12-23 PC Agent: While You Sleep, AI Works – A Cognitive Journey into Digital World Yanheng He et.al. 2412.17589 link
2024-12-20 Offline Reinforcement Learning for LLM Multi-Step Reasoning Huaijie Wang et.al. 2412.16145 link
2024-12-20 Data-Driven Mechanism Design: Jointly Eliciting Preferences and Information Dirk Bergemann et.al. 2412.16132 null
2024-12-20 Towards Interpretable Radiology Report Generation via Concept Bottlenecks using a Multi-Agentic RAG Hasan Md Tusfiqur Alam et.al. 2412.16086 link
2024-12-20 Active Flow Control for Bluff Body under High Reynolds Number Turbulent Flow Conditions Using Deep Reinforcement Learning Jingbo Chen et.al. 2412.15975 null
2024-12-20 The multilayer garbage disposal game Hsin-Lun Li et.al. 2412.15942 null
2024-12-20 Speedup Techniques for Switchable Temporal Plan Graph Optimization He Jiang et.al. 2412.15908 null
2024-12-20 Exploring the Effects of AI Nonverbal Emotional Cues on Human Decision Certainty in Moral Dilemmas Chenyi Zhang et.al. 2412.15834 null
2024-12-20 WebLLM: A High-Performance In-Browser LLM Inference Engine Charlie F. Ruan et.al. 2412.15803 link
2024-12-20 FTISS Adaptive Bearing-Only Formation Tracking Control with Unknown Disturbance Rejection Hong Liang Cheah et.al. 2412.15757 null
2024-12-20 Online Optimization Algorithms in Repeated Price Competition: Equilibrium Learning and Algorithmic Collusion Martin Bichler et.al. 2412.15707 null
2024-12-19 AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous Driving Shuo Xing et.al. 2412.15206 link
2024-12-19 Human-Humanoid Robots Cross-Embodiment Behavior-Skill Transfer Using Decomposed Adversarial Learning from Demonstration Junjia Liu et.al. 2412.15166 link
2024-12-19 Operationalising Rawlsian Ethics for Fairness in Norm-Learning Agents Jessica Woodgate et.al. 2412.15163 null
2024-12-19 Equal Merit Does Not Imply Equality: Discrimination at Equilibrium in a Hiring Market with Symmetric Agents Serafina Kamp et.al. 2412.15162 null
2024-12-19 Probabilistic Strategy Logic with Degrees of Observability Chunyan Mu et.al. 2412.15135 null
2024-12-19 From Nonequilibrium to Equilibrium: Insights from a Two-Population Occupation Model Jerome Garnier-Brun et.al. 2412.14996 null
2024-12-19 Dream to Manipulate: Compositional World Models Empowering Robot Imitation Learning with Imagination Leonardo Barcellona et.al. 2412.14957 null
2024-12-19 Long Time Behavior and Stabilization for Displacement Monotone Mean Field Games Marco Cirant et.al. 2412.14903 null
2024-12-19 Hierarchical Subspaces of Policies for Continual Offline Reinforcement Learning Anthony Kobanda et.al. 2412.14865 null
2024-12-19 Entropy Regularized Task Representation Learning for Offline Meta-Reinforcement Learning Mohammadreza nakhaei et.al. 2412.14834 link
2024-12-18 TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks Frank F. Xu et.al. 2412.14161 link
2024-12-18 Future Research Avenues for Artificial Intelligence in Digital Gaming: An Exploratory Report Markus Dablander et.al. 2412.14085 null
2024-12-18 A Computationally Grounded Framework for Cognitive Attitudes (extended version) Tiago de Lima et.al. 2412.14073 null
2024-12-18 Spatio-Temporal SIR Model of Pandemic Spread During Warfare with Optimal Dual-use Healthcare System Administration using Deep Reinforcement Learning Adi Shuchami et.al. 2412.14039 link
2024-12-18 Decentralized Convergence to Equilibrium Prices in Trading Networks Edwin Lock et.al. 2412.13972 null
2024-12-18 Threshold UCT: Cost-Constrained Monte Carlo Tree Search with Pareto Curves Martin Kurečka et.al. 2412.13962 null
2024-12-18 Harvesting energy from turbulent winds with Reinforcement Learning Lorenzo Basile et.al. 2412.13961 null
2024-12-18 Towards privacy-preserving cooperative control via encrypted distributed optimization Philipp Binfet et.al. 2412.13953 null
2024-12-18 Strategyproof Matching of Roommates and Rooms Hadi Hosseini et.al. 2412.13887 null
2024-12-18 Who Saves us From Risk? Altruists Promote Cooperation in a Public Investment Game Shen Zhang et.al. 2412.13816 null
2024-12-17 Proposer-Agent-Evaluator(PAE): Autonomous Skill Discovery For Foundation Model Internet Agents Yifei Zhou et.al. 2412.13194 null
2024-12-17 GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding Haoyi Jiang et.al. 2412.13193 link
2024-12-17 SafeAgentBench: A Benchmark for Safe Task Planning of Embodied LLM Agents Sheng Yin et.al. 2412.13178 link
2024-12-17 Practicable Black-box Evasion Attacks on Link Prediction in Dynamic Graphs – A Graph Sequential Embedding Method Jiate Li et.al. 2412.13134 link
2024-12-17 Contract-based Design and Verification of Multi-Agent Systems with Quantitative Temporal Requirements Rafael Dewes et.al. 2412.13114 null
2024-12-17 Active Reinforcement Learning Strategies for Offline Policy Improvement Ambedkar Dukkipati et.al. 2412.13106 null
2024-12-17 AI PERSONA: Towards Life-long Personalization of LLMs Tiannan Wang et.al. 2412.13103 null
2024-12-17 Reservoir Computing for Fast, Simplified Reinforcement Learning on Memory Tasks Kevin McKee et.al. 2412.13093 null
2024-12-17 Distributed Normal Map-based Stochastic Proximal Gradient Methods over Networks Kun Huang et.al. 2412.13054 null
2024-12-18 NAVCON: A Cognitively Inspired and Linguistically Grounded Corpus for Vision and Language Navigation Karan Wanchoo et.al. 2412.13026 null
2024-12-16 Revelations: A Decidable Class of POMDPs with Omega-Regular Objectives Marius Belly et.al. 2412.12063 link
2024-12-16 Virtual Agent-Based Communication Skills Training to Facilitate Health Persuasion Among Peers Farnaz Nouraei et.al. 2412.12061 null
2024-12-16 Learning to Navigate in Mazes with Novel Layouts using Abstract Top-down Maps Linfeng Zhao et.al. 2412.12024 null
2024-12-16 Agentic AI-Driven Technical Troubleshooting for Enterprise Systems: A Novel Weighted Retrieval-Augmented Generation Paradigm Rajat Khanda et.al. 2412.12006 null
2024-12-16 CP-Guard: Malicious Agent Detection and Defense in Collaborative Bird’s Eye View Perception Senkang Hu et.al. 2412.12000 null
2024-12-16 AlphaZero Neural Scaling and Zipf’s Law: a Tale of Board Games and Power Laws Oren Neumann et.al. 2412.11979 link
2024-12-16 Learning Human-Aware Robot Policies for Adaptive Assistance Jason Qin et.al. 2412.11913 null
2024-12-16 Reentrant phase behavior in binary topological flocks with nonreciprocal alignment Tian Tang et.al. 2412.11871 null
2024-12-16 The Black Ninjas and the Sniper: On Robustness of Population Protocols Benno Lossin et.al. 2412.11783 null
2024-12-16 Prediction of social dilemmas in networked populations via graph neural networks Huaiyu Tan et.al. 2412.11775 null
2024-12-13 Iris: Breaking GUI Complexity with Adaptive Focus and Self-Refining Zhiqi Ge et.al. 2412.10342 null
2024-12-13 Reciprocity in Interbank Markets Lutz Honvehlmann et.al. 2412.10329 null
2024-12-13 MeshA*: Efficient Path Planing With Motion Primitives Marat Agranovskiy et.al. 2412.10320 null
2024-12-13 BrushEdit: All-In-One Image Inpainting and Editing Yaowei Li et.al. 2412.10316 null
2024-12-13 Cultural Evolution of Cooperation among LLM Agents Aron Vallinder et.al. 2412.10270 null
2024-12-13 ROUTE: Robust Multitask Tuning and Collaboration for Text-to-SQL Yang Qin et.al. 2412.10138 link
2024-12-13 You Name It, I Run It: An LLM Agent to Execute Tests of Arbitrary Projects Islem Bouzenia et.al. 2412.10133 link
2024-12-13 Reward Machine Inference for Robotic Manipulation Mattijs Baert et.al. 2412.10096 null
2024-12-13 Heterogeneous Multi-Robot Graph Coverage with Proximity and Movement Constraints Dolev Mutzari et.al. 2412.10083 null
2024-12-13 Large Action Models: From Inception to Implementation Lu Wang et.al. 2412.10047 link
2024-12-12 GenEx: Generating an Explorable World Taiming Lu et.al. 2412.09624 null
2024-12-12 AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials Yiheng Xu et.al. 2412.09605 null
2024-12-12 DiverseAgentEntropy: Quantifying Black-Box LLM Uncertainty through Diverse Perspectives and Multi-Agent Interaction Yu Feng et.al. 2412.09572 null
2024-12-12 Can Modern LLMs Act as Agent Cores in Radiology~Environments? Qiaoyu Zheng et.al. 2412.09529 link
2024-12-12 Agent-based Video Trimming Lingfeng Yang et.al. 2412.09513 null
2024-12-12 Solving Multiagent Path Finding on Highly Centralized Networks Foivos Fioravantes et.al. 2412.09433 null
2024-12-12 From Intention To Implementation: Automating Biomedical Research via LLMs Yi Luo et.al. 2412.09429 null
2024-12-12 Reinforcement Learning Within the Classical Robotics Stack: A Case Study in Robot Soccer Adam Labiosa et.al. 2412.09417 null
2024-12-12 Uncommon Belief in Rationality Qi Shi et.al. 2412.09407 null
2024-12-12 Falcon-UI: Understanding GUI Before Following User Instructions Huawen Shen et.al. 2412.09362 null
2024-12-11 GPD-1: Generative Pre-training for Driving Zixun Xie et.al. 2412.08643 link
2024-12-11 Generative Semantic Communication: Architectures, Technologies, and Applications Jinke Ren et.al. 2412.08642 null
2024-12-11 RoomTour3D: Geometry-Aware Video-Instruction Tuning for Embodied Navigation Mingfei Han et.al. 2412.08591 null
2024-12-11 Automated Soap Opera Testing Directed by LLMs and Scenario Knowledge: Feasibility, Challenges, and Road Ahead Yanqi Su et.al. 2412.08581 null
2024-12-11 GenPlan: Generative sequence models as adaptive planners Akash Karthikeyan et.al. 2412.08565 link
2024-12-11 An End-to-End Collaborative Learning Approach for Connected Autonomous Vehicles in Occluded Scenarios Leandro Parada et.al. 2412.08562 null
2024-12-11 Exact Algorithms for Multiagent Path Finding with Communication Constraints on Tree-Like Structures Foivos Fioravantes et.al. 2412.08556 null
2024-12-11 Grimm: A Plug-and-Play Perturbation Rectifier for Graph Neural Networks Defending against Poisoning Attacks Ao Liu et.al. 2412.08555 null
2024-12-11 MaestroMotif: Skill Design from Artificial Intelligence Feedback Martin Klissarov et.al. 2412.08542 null
2024-12-11 Spatial segregation across travelling fronts in individual-based and continuum models for the growth of heterogeneous cell populations José A. Carrillo et.al. 2412.08535 null
2024-12-10 Balancing Mobility Behaviors to avoid Global epidemics from Local Outbreaks Pablo Valgañón et.al. 2412.07656 null
2024-12-10 Searching for Structure: Investigating Emergent Communication with Large Language Models Tom Kouwenhoven et.al. 2412.07646 null
2024-12-10 Offline Multi-Agent Reinforcement Learning via In-Sample Sequential Policy Optimization Zongkai Liu et.al. 2412.07639 link
2024-12-10 Swarm Behavior Cloning Jonas Nüßlein et.al. 2412.07617 null
2024-12-10 Modeling Speculative Trading Patterns in Token Markets: An Agent-Based Analysis with TokenLab Mengjue Wang et.al. 2412.07512 null
2024-12-10 ConfigX: Modular Configuration for Evolutionary Algorithms via Multitask Reinforcement Learning Hongshu Guo et.al. 2412.07507 null
2024-12-10 SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World Jiaqi Zhang et.al. 2412.07472 link
2024-12-10 Event-Triggered Memory Control for Interval Type-2 Fuzzy Heterogeneous Multi-Agent Systems Sen Kong et.al. 2412.07471 null
2024-12-10 Dynamic Ensemble Reasoning for LLM Experts Jinwu Hu et.al. 2412.07448 null
2024-12-10 ITPNet: Towards Instantaneous Trajectory Prediction for Autonomous Driving Rongqing Li et.al. 2412.07369 null
2024-12-09 Proactive Agents for Multi-Turn Text-to-Image Generation Under Uncertainty Meera Hahn et.al. 2412.06771 link
2024-12-09 AutoDCWorkflow: LLM-based Data Cleaning Workflow Auto-Generation and Benchmark Lan Li et.al. 2412.06724 link
2024-12-09 Asynchronous Agents with Perfect Recall: Model Reductions, Knowledge-Based Construction, and Model Checking for Coalitional Strategies Dilian Gurov et.al. 2412.06706 null
2024-12-09 Toward LLM-Agent-Based Modeling of Transportation Systems: A Conceptual Framework Tianming Liu et.al. 2412.06681 null
2024-12-09 Self-Interested Agents in Collaborative Learning: An Incentivized Adaptive Data-Centric Framework Nithia Vijayan et.al. 2412.06597 null
2024-12-09 Argentine ants regulate traffic flow with stopped individuals Ulrich Dobramysl et.al. 2412.06587 null
2024-12-09 Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation Egor Cherepanov et.al. 2412.06531 null
2024-12-09 EFX Allocations on Some Multi-graph Classes Umang Bhaskar et.al. 2412.06513 null
2024-12-09 The Fusion of Large Language Models and Formal Methods for Trustworthy AI Agents: A Roadmap Yedi Zhang et.al. 2412.06512 null
2024-12-09 Reasoning about Strategic Abilities in Stochastic Multi-agent Systems Yedi Zhang et.al. 2412.06509 null
2024-12-06 TeamCraft: A Benchmark for Multi-Modal Multi-Agent Systems in Minecraft Qian Long et.al. 2412.05255 link
2024-12-06 AI’s assigned gender affects human-AI cooperation Sepideh Bazazi et.al. 2412.05214 null
2024-12-06 SurgBox: Agent-Driven Operating Room Sandbox with Surgery Copilot Jinlin Wu et.al. 2412.05187 link
2024-12-06 Sense and Sensitivity: Evaluating the simulation of social dynamics via Large Language Models Da Ju et.al. 2412.05093 null
2024-12-06 Synchronization and desynchronization in ensembles of mobile agents E. M. Varvarin et.al. 2412.05040 null
2024-12-06 Frontier Models are Capable of In-context Scheming Alexander Meinke et.al. 2412.04984 null
2024-12-06 Putting the Iterative Training of Decision Trees to the Test on a Real-World Robotic Task Raphael C. Engelhardt et.al. 2412.04974 null
2024-12-06 Who Speaks Next? Multi-party AI Discussion Leveraging the Systematics of Turn-taking in Murder Mystery Games Ryota Nonomura et.al. 2412.04937 link
2024-12-06 Probing the contents of semantic representations from text, behavior, and brain data using the psychNorms metabase Zak Hussain et.al. 2412.04936 link
2024-12-06 PERCY: A Multimodal Dataset and Conversational System for Personalized and Emotionally Aware Human-Robot Interaction Mohammed Althubyani et.al. 2412.04908 null
2024-12-05 Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction Yiheng Xu et.al. 2412.04454 null
2024-12-05 GenMAC: Compositional Text-to-Video Generation with Multi-Agent Collaboration Kaiyi Huang et.al. 2412.04440 null
2024-12-05 Sub-diffraction Imaging of Carrier Dynamics in Halide Perovskite Semiconductors: Effects of Passivation, Morphology, and Ion Motion Madeleine D. Breshears et.al. 2412.04423 null
2024-12-05 Targeting the Core: A Simple and Effective Method to Attack RAG-based Agents via Direct LLM Manipulation Xuying Li et.al. 2412.04415 null
2024-12-05 EmbodiedOcc: Embodied 3D Occupancy Prediction for Vision-based Online Scene Understanding Yuqi Wu et.al. 2412.04380 link
2024-12-05 Intersection-Aware Assessment of EMS Accessibility in NYC: A Data-Driven Approach Haoran Su et.al. 2412.04369 null
2024-12-05 Finer Behavioral Foundation Models via Auto-Regressive Features and Advantage Weighting Edoardo Cetin et.al. 2412.04368 null
2024-12-05 Machine Theory of Mind for Autonomous Cyber-Defence Luke Swaby et.al. 2412.04367 null
2024-12-05 Reinforcement Learning for Freeway Lane-Change Regulation via Connected Vehicles Ke Sun et.al. 2412.04341 null
2024-12-05 Action Mapping for Reinforcement Learning in Continuous Environments with Constraints Mirco Theile et.al. 2412.04327 null
2024-12-04 Navigation World Models Amir Bar et.al. 2412.03572 null
2024-12-04 From Individual to Society: A Survey on Social Simulation Driven by Large Language Model-based Agents Xinyi Mou et.al. 2412.03563 link
2024-12-04 Categorize and randomize: a model of sequential stochastic choice Ester Sudano et.al. 2412.03554 null
2024-12-04 SPICE: Smart Projection Interface for Cooking Enhancement Vera Prohaska et.al. 2412.03551 link
2024-12-04 Risk-aware Classification via Uncertainty Quantification Murat Sensoy et.al. 2412.03391 null
2024-12-04 WiS Platform: Enhancing Evaluation of LLM-Based Multi-Agent Systems Through Game-Based Analysis Chengwei Hu et.al. 2412.03359 null
2024-12-04 AI-Driven Day-to-Day Route Choice Leizhen Wang et.al. 2412.03338 link
2024-12-04 Mean-field Concentration of Opinion Dynamics in Random Graphs Javiera Gutiérrez-Ramírez et.al. 2412.03207 null
2024-12-04 AffordDP: Generalizable Diffusion Policy with Transferable Affordance Shijie Wu et.al. 2412.03142 null
2024-12-04 ChatTS: Aligning Time Series with LLMs via Synthetic Data for Enhanced Understanding and Reasoning Zhe Xie et.al. 2412.03104 link
2024-12-03 Leveraging Tactile Sensing to Render both Haptic Feedback and Virtual Reality 3D Object Reconstruction in Robotic Telemanipulation Gabriele Giudici et.al. 2412.02644 null
2024-12-03 Mobile Cell-Free Massive MIMO with Multi-Agent Reinforcement Learning: A Scalable Framework Ziheng Liu et.al. 2412.02581 null
2024-12-03 Generating Critical Scenarios for Testing Automated Driving Systems Trung-Hieu Nguyen et.al. 2412.02574 link
2024-12-03 TAB-Fields: A Maximum Entropy Framework for Mission-Aware Adversarial Planning Gokul Puthumanaillam et.al. 2412.02570 link
2024-12-03 Defending Against Diverse Attacks in Federated Learning Through Consensus-Based Bi-Level Optimization Nicolás García Trillos et.al. 2412.02535 link
2024-12-03 General Resetting Theory for Group Avoidance Juhee Lee et.al. 2412.02524 null
2024-12-03 Resonance: Learning to Predict Social-Aware Pedestrian Trajectories as Co-Vibrations Conghao Wong et.al. 2412.02447 null
2024-12-03 A Multi-Agent Framework for Extensible Structured Text Generation in PLCs Donghao Yang et.al. 2412.02410 null
2024-12-03 Who Walks With You Matters: Perceiving Social Interactions with Groups for Pedestrian Trajectory Prediction Ziqian Zou et.al. 2412.02395 null
2024-12-03 Bio-inspired visual relative localization for large swarms of UAVs Martin Křížek et.al. 2412.02393 null
2024-11-29 EF1 Allocations for Identical Trilean and Separable Single-Peaked Valuations Umang Bhaskar et.al. 2411.19881 null
2024-11-29 Neuroplasticity and Psychedelics: a comprehensive examination of classic and non-classic compounds in pre and clinical models Claudio Agnorelli et.al. 2411.19840 null
2024-11-29 Advanced System Integration: Analyzing OpenAPI Chunking for Retrieval-Augmented Generation Robin D. Pesl et.al. 2411.19804 null
2024-11-29 CAREL: Instruction-guided reinforcement learning with cross-modal auxiliary objectives Armin Saghafian et.al. 2411.19787 link
2024-11-29 The 2024 Motile Active Matter Roadmap Gerhard Gompper et.al. 2411.19783 null
2024-11-29 HVAC-DPT: A Decision Pretrained Transformer for HVAC Control Anaïs Berkes et.al. 2411.19746 null
2024-11-29 Relative Representations of Latent Spaces enable Efficient Semantic Channel Equalization Tomás Hüttebräucker et.al. 2411.19719 null
2024-11-29 RMIO: A Model-Based MARL Framework for Scenarios with Observation Loss in Some Agents Shi Zifeng et.al. 2411.19639 null
2024-11-29 Build An Influential Bot In Social Media Simulations With Large Language Models Bailu Jin et.al. 2411.19635 null
2024-11-29 Solving Rubik’s Cube Without Tricky Sampling Yicheng Lin et.al. 2411.19583 null
2024-11-27 Proactive Gradient Conflict Mitigation in Multi-Task Learning: A Sparse Training Perspective Zhi Zhang et.al. 2411.18615 null
2024-11-27 Robust Offline Reinforcement Learning with Linearly Structured $f$ -Divergence Regularization Cheng Tang et.al. 2411.18612 null
2024-11-27 AdaVLN: Towards Visual Language Navigation in Continuous Indoor Environments with Moving Humans Dillon Loh et.al. 2411.18539 link
2024-11-27 Biswas-Chatterjee-Sen kinetic exchange opinion model for two connected groups Krzysztof Suchecki et.al. 2411.18527 null
2024-11-27 NeuroAI for AI Safety Patrick Mineault et.al. 2411.18526 null
2024-11-27 Collective decision making by embodied neural agents Nicolas Coucke et.al. 2411.18498 link
2024-11-27 Is my Meeting Summary Good? Estimating Quality with a Multi-LLM Evaluator Frederic Kirstein et.al. 2411.18444 null
2024-11-28 A Multi-Agent Dual Dialogue System to Support Mental Health Care Providers Onno P. Kampman et.al. 2411.18429 null
2024-11-27 Application of Soft Actor-Critic Algorithms in Optimizing Wastewater Treatment with Time Delays Integration Esmaeel Mohammadi et.al. 2411.18305 null
2024-11-27 InterHub: A Naturalistic Trajectory Dataset with Dense Interaction for Autonomous Driving Xiyan Jiang et.al. 2411.18302 link
2024-11-26 SketchAgent: Language-Driven Sequential Sketch Generation Yael Vinker et.al. 2411.17673 null
2024-11-26 MALMM: Multi-Agent Large Language Models for Zero-Shot Robotics Manipulation Harsh Singh et.al. 2411.17636 null
2024-11-26 Making History Readable Bipasha Banerjee et.al. 2411.17600 null
2024-11-26 Agentic AI for Improving Precision in Identifying Contributions to Sustainable Development Goals William A. Ingram et.al. 2411.17598 null
2024-11-26 Decision making in stochastic extensive form II: Stochastic extensive forms and games E. Emanuel Rapsch et.al. 2411.17587 null
2024-11-26 Multi-Objective Reinforcement Learning for Automated Resilient Cyber Defence Ross O’Driscoll et.al. 2411.17585 null
2024-11-26 Ensuring Safety in Target Pursuit Control: A CBF-Safe Reinforcement Learning Approach Yaosheng Deng et.al. 2411.17552 null
2024-11-26 ShowUI: One Vision-Language-Action Model for GUI Visual Agent Kevin Qinghong Lin et.al. 2411.17465 link
2024-11-26 Object-centric proto-symbolic behavioural reasoning from pixels Ruben van Bergen et.al. 2411.17438 link
2024-11-26 Joint Combinatorial Node Selection and Resource Allocations in the Lightning Network using Attention-based Reinforcement Learning Mahdi Salahshour et.al. 2411.17353 null
2024-11-25 Winning opinion: Following Your Friends’ Advice or That of Their Friends? Francisco J. Muñoz et.al. 2411.16671 null
2024-11-25 Barriers on the EDGE: A scalable CBF architecture over EDGE for safe aerial-ground multi-agent coordination Viswa Narayanan Sankaranarayanan et.al. 2411.16608 null
2024-11-25 Naive Algorithmic Collusion: When Do Bandit Learners Cooperate and When Do They Compete? Connor Douglas et.al. 2411.16574 null
2024-11-25 Continual Deep Reinforcement Learning with Task-Agnostic Policy Distillation Muhammad Burhan Hafez et.al. 2411.16532 link
2024-11-25 Reinforcement Learning for Bidding Strategy Optimization in Day-Ahead Energy Market Luca Di Persio et.al. 2411.16519 null
2024-11-25 Online Guidance Graph Optimization for Lifelong Multi-Agent Path Finding Hongzhi Zang et.al. 2411.16506 link
2024-11-25 Distributed Online Optimization with Stochastic Agent Availability Juliette Achddou et.al. 2411.16477 null
2024-11-25 Generating social networks with static and dynamic utility-maximization approaches Aldric Labarthe et.al. 2411.16464 link
2024-11-25 Characterized Diffusion Networks for Enhanced Autonomous Driving Trajectory Prediction Haoming Li et.al. 2411.16457 null
2024-11-25 TopV-Nav: Unlocking the Top-View Spatial Reasoning Potential of MLLM for Zero-shot Object Navigation Linqing Zhong et.al. 2411.16425 null
2024-11-22 RE-Bench: Evaluating frontier AI R&D capabilities of language model agents against human experts Hjalmar Wijk et.al. 2411.15114 link
2024-11-22 XGrammar: Flexible and Efficient Structured Generation Engine for Large Language Models Yixin Dong et.al. 2411.15100 null
2024-11-22 On Multi-Agent Inverse Reinforcement Learning Till Freihaut et.al. 2411.15046 null
2024-11-22 Safe Multi-Agent Reinforcement Learning with Convergence to Generalized Nash Equilibrium Zeyang Li et.al. 2411.15036 null
2024-11-22 On the Linear Speedup of Personalized Federated Reinforcement Learning with Shared Representations Guojun Xiong et.al. 2411.15014 null
2024-11-22 ScribeAgent: Towards Specialized Web Agents Using Production-Scale Workflow Data Junhong Shen et.al. 2411.15004 link
2024-11-22 Free Energy Projective Simulation (FEPS): Active inference with interpretability Joséphine Pazem et.al. 2411.14991 null
2024-11-22 BIP3D: Bridging 2D Images and 3D Perception for Embodied Intelligence Xuewu Lin et.al. 2411.14869 link
2024-11-22 Universal and Context-Independent Triggers for Precise Control of LLM Outputs Jiashuo Liang et.al. 2411.14738 null
2024-11-22 Enhancing Clinical Trial Patient Matching through Knowledge Augmentation with Multi-Agents Hanwen Shi et.al. 2411.14637 null
2024-11-21 Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models Yuhao Dong et.al. 2411.14432 link
2024-11-21 Multi-Agent Environments for Vehicle Routing Problems Ricardo Gama et.al. 2411.14411 link
2024-11-21 Resolving Multiple-Dynamic Model Uncertainty in Hypothesis-Driven Belief-MDPs Ofer Dagan et.al. 2411.14404 null
2024-11-21 SplatR : Experience Goal Visual Rearrangement with 3D Gaussian Splatting and Dense Feature Matching Arjun P S et.al. 2411.14322 link
2024-11-21 Q-CSM: Q-Learning-based Cognitive Service Management in Heterogeneous IoT Networks Kubra Duran et.al. 2411.14281 null
2024-11-21 Explainable Multi-Agent Reinforcement Learning for Extended Reality Codec Adaptation Pedro Enrique Iturria-Rivera et.al. 2411.14264 null
2024-11-21 Physics-Informed LLM-Agent for Automated Modulation Design in Power Electronics Systems Junhua Liu et.al. 2411.14214 null
2024-11-21 SPARKLE: A Unified Single-Loop Primal-Dual Framework for Decentralized Bilevel Optimization Shuchen Zhu et.al. 2411.14166 null
2024-11-21 Multi-terminal Strong Coordination subject to Secrecy Constraints Viswanathan Ramachandran et.al. 2411.14123 null
2024-11-21 Umbrella Reinforcement Learning – computationally efficient tool for hard non-linear problems Egor E. Nuzhin et.al. 2411.14117 link
2024-11-20 BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games Davide Paglieri et.al. 2411.13543 null
2024-11-20 Metacognition for Unknown Situations and Environments (MUSE) Rodolfo Valiente et.al. 2411.13537 null
2024-11-20 AdaptAgent: Adapting Multimodal Web Agents with Few-Shot Learning from Human Demonstrations Gaurav Verma et.al. 2411.13451 null
2024-11-20 Robust Monocular Visual Odometry using Curriculum Learning Assaf Lahiany et.al. 2411.13438 null
2024-11-20 A Survey On Enhancing Reinforcement Learning in Complex Environments: Insights from Human and LLM Feedback Alireza Rashidi Laleh et.al. 2411.13410 null
2024-11-20 Simulating Liquidity: Agent-Based Modeling of Illiquid Markets for Fractional Ownership Lars Fluri et.al. 2411.13381 null
2024-11-20 WHALES: A Multi-agent Scheduling Dataset for Enhanced Cooperation in Autonomous Driving Siwei Chen et.al. 2411.13340 link
2024-11-20 Revealed Information Laura Doval et.al. 2411.13293 null
2024-11-20 Transforming the Hybrid Cloud for Emerging AI Workloads Deming Chen et.al. 2411.13239 null
2024-11-20 Extremum and Nash Equilibrium Seeking with Delays and PDEs: Designs & Applications Tiago Roux Oliveira et.al. 2411.13234 null
2024-11-19 Reinforcement Learning, Collusion, and the Folk Theorem Galit Askenazi-Golan et.al. 2411.12725 null
2024-11-19 UBSoft: A Simulation Platform for Robotic Skill Learning in Unbounded Soft Environments Chunru Lin et.al. 2411.12711 null
2024-11-19 Weighted Envy Freeness With Limited Subsidies Noga Klein Elmalem et.al. 2411.12696 null
2024-11-19 Quasi-stability notions in two-sided matching models Nadia Guiñazú et.al. 2411.12533 null
2024-11-19 Coevolution of relationship-driven cooperation under recommendation protocol on multiplex networks Hongyu Yue et.al. 2411.12436 null
2024-11-19 Instrumentation of Software Systems with OpenTelemetry for Software Visualization Malte Hansen et.al. 2411.12380 null
2024-11-19 C $^{2}$ INet: Realizing Incremental Trajectory Prediction with Prior-Aware Continual Causal Intervention Xiaohe Li et.al. 2411.12313 null
2024-11-19 SNN-Based Online Learning of Concepts and Action Laws in an Open World Christel Grimaud et.al. 2411.12308 null
2024-11-19 Emergence of Implicit World Models from Mortal Agents Kazuya Horibe et.al. 2411.12304 null
2024-11-19 Could Humans Outshine AI in Visual Data Analysis? Ratanond Koonchanok et.al. 2411.12299 null
2024-11-18 Generative World Explorer Taiming Lu et.al. 2411.11844 null
2024-11-18 Reinterpreting Delay and Procrastination Conrad Kosowsky et.al. 2411.11828 null
2024-11-18 Competing Bandits in Decentralized Large Contextual Matching Markets Satush Parikh et.al. 2411.11794 null
2024-11-18 LLM-IE: A Python Package for Generative Information Extraction with Large Language Models Enshuo Hsu et.al. 2411.11779 null
2024-11-18 Mapping out the Space of Human Feedback for Reinforcement Learning: A Conceptual Framework Yannick Metz et.al. 2411.11761 null
2024-11-18 The Power of Many: Multi-Agent Multimodal Models for Cultural Image Captioning Longju Bai et.al. 2411.11758 link
2024-11-18 Distributed Asynchronous Time-Varying Quadratic Programming with Asynchronous Objective Sampling Gabriel Behrendt et.al. 2411.11732 null
2024-11-18 Moral Persuasion in Large Language Models: Evaluating Susceptibility and Ethical Alignment Allison Huang et.al. 2411.11731 link
2024-11-18 TrojanRobot: Backdoor Attacks Against Robotic Manipulation in the Physical World Xianlong Wang et.al. 2411.11683 null
2024-11-18 Artificial Scientific Discovery Antonio Norelli et.al. 2411.11672 null
2024-11-15 Fair Division via the Cake-Cutting Share Yannan Bai et.al. 2411.10434 null
2024-11-15 Evaluating Creativity and Deception in Large Language Models: A Simulation Framework for Multi-Agent Balderdash Parsa Hejabi et.al. 2411.10422 link
2024-11-15 The Dawn of GUI Agent: A Preliminary Case Study with Claude 3.5 Computer Use Siyuan Hu et.al. 2411.10323 link
2024-11-15 Static network structure cannot stabilize cooperation among Large Language Model agents Jin Han et.al. 2411.10294 null
2024-11-15 Towards Sample-Efficiency and Generalization of Transfer and Inverse Reinforcement Learning: A Comprehensive Literature Review Hossein Hassani et.al. 2411.10268 null
2024-11-15 Visual-Linguistic Agent: Towards Collaborative Contextual Object Reasoning Jingru Yang et.al. 2411.10252 null
2024-11-15 An Empirical Study on LLM-based Agents for Automated Bug Fixing Xiangxin Meng et.al. 2411.10213 null
2024-11-15 Agentic LLMs in the Supply Chain: Towards Autonomous Multi-Agent Consensus-Seeking Valeria Jannelli et.al. 2411.10184 null
2024-11-15 Let people fail! Exploring the influence of explainable virtual and robotic agents in learning-by-doing tasks Marco Matarese et.al. 2411.10176 null
2024-11-15 The Surprising Ineffectiveness of Pre-Trained Visual Representations for Model-Based Reinforcement Learning Moritz Schneider et.al. 2411.10175 null
2024-11-14 Nash equilibrium seeking for a class of quadratic-bilinear Wasserstein distributionally robust games Georgios Pantazis et.al. 2411.09636 null
2024-11-14 Navigating the Risks: A Survey of Security, Privacy, and Ethics Threats in LLM-Based Agents Yuyou Gan et.al. 2411.09523 null
2024-11-14 Randomized Truthful Auctions with Learning Agents Gagan Aggarwal et.al. 2411.09517 null
2024-11-14 Strategic Sacrifice: Self-Organized Robot Swarm Localization for Inspection Productivity Sneha Ramshanker et.al. 2411.09493 null
2024-11-14 Socio-Economic Consequences of Generative AI: A Review of Methodological Approaches Carlos J. Costa et.al. 2411.09313 null
2024-11-14 Embedding Space Allocation with Angle-Norm Joint Classifiers for Few-Shot Class-Incremental Learning Dunwei Tu et.al. 2411.09250 null
2024-11-14 Risk-aware MPPI for Stochastic Hybrid Systems Hardik Parwana et.al. 2411.09198 link
2024-11-14 Enhancing reinforcement learning for population setpoint tracking in co-cultures Sebastián Espinel-Ríos et.al. 2411.09177 null
2024-11-14 Artificial Theory of Mind and Self-Guided Social Organisation Michael S. Harré et.al. 2411.09169 null
2024-11-14 Theory of Mind Enhances Collective Intelligence Michael S. Harré et.al. 2411.09168 null
2024-11-13 The Impact of Social Value Orientation on Nash Equilibria of Two Player Quadratic Games Dan Calderone et.al. 2411.08809 null
2024-11-13 FinRobot: AI Agent for Equity Research and Valuation with Large Language Models Tianyu Zhou et.al. 2411.08804 link
2024-11-13 Evaluating World Models with LLM for Decision Making Chang Yang et.al. 2411.08794 null
2024-11-13 Towards Fair and Efficient Public Transportation: A Bus Stop Model Martin Bullinger et.al. 2411.08784 link
2024-11-13 Logic-based Knowledge Awareness for Autonomous Agents in Continuous Spaces Arabinda Ghosh et.al. 2411.08754 null
2024-11-13 Statistical Operating Characteristics of Current Early Phase Dose Finding Designs with Toxicity and Efficacy in Oncology Hao Sun et.al. 2411.08698 null
2024-11-13 Inferring Parameter Distributions in Heterogeneous Motile Particle Ensembles: A Likelihood Approach for Second Order Langevin Models Jan Albrecht et.al. 2411.08692 null
2024-11-13 Robot See, Robot Do: Imitation Reward for Noisy Financial Environments Sven Goluža et.al. 2411.08637 null
2024-11-13 On the Application of Model Predictive Control to a Weighted Coverage Path Planning Problem Kilian Schweppe et.al. 2411.08634 null
2024-11-13 NavAgent: Multi-scale Urban Street View Fusion For UAV Embodied Vision-and-Language Navigation Youzhi Liu et.al. 2411.08579 null
2024-11-12 LLMPhy: Complex Physical Reasoning Using Large Language Models and World Models Anoop Cherian et.al. 2411.08027 null
2024-11-12 Incentive Design with Spillovers Krishna Dasaratha et.al. 2411.08026 null
2024-11-12 From General to Specific: Utilizing General Hallucation to Automatically Measure the Role Relationship Fidelity for Specific Role-Play Agents Chuyi Kong et.al. 2411.07965 null
2024-11-12 Learning Memory Mechanisms for Decision Making through Demonstrations William Yue et.al. 2411.07954 link
2024-11-12 RedCode: Risky Code Execution and Generation Benchmark for Code Agents Chengquan Guo et.al. 2411.07781 link
2024-11-12 Efficiency of energy-consuming random walkers: Variability in energy helps Mohsen Ghasemi Nezhadhaghighi et.al. 2411.07771 null
2024-11-12 Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows Fangyu Lei et.al. 2411.07763 null
2024-11-12 Test Where Decisions Matter: Importance-driven Testing for Deep Reinforcement Learning Stefan Pranger et.al. 2411.07700 null
2024-11-12 World Models: The Safety Perspective Zifan Zeng et.al. 2411.07690 null
2024-11-12 Safe Exploitative Play with Untrusted Type Beliefs Tongxin Li et.al. 2411.07679 null
2024-11-11 Tooling or Not Tooling? The Impact of Tools on Language Agents for Chemistry Problem Solving Botao Yu et.al. 2411.07228 null
2024-11-11 Grounding Video Models to Actions through Goal Conditioned Exploration Yunhao Luo et.al. 2411.07223 null
2024-11-11 ‘Explaining RL Decisions with Trajectories’: A Reproducibility Study Karim Abdel Sadek et.al. 2411.07200 link
2024-11-11 Gradual Fine-Tuning with Graph Routing for Multi-Source Unsupervised Domain Adaptation Yao Ma et.al. 2411.07185 null
2024-11-11 RoundTable: Investigating Group Decision-Making Mechanism in Multi-Agent Collaboration Young-Min Cho et.al. 2411.07161 null
2024-11-11 Azurin-Based Peptide p28 Arrests the p53-HDM2 Interactions: A Novel Anti-Cancer Pathway Albin Joy et.al. 2411.07124 null
2024-11-11 Learning Multi-Agent Collaborative Manipulation for Long-Horizon Quadrupedal Pushing Chuye Hong et.al. 2411.07104 null
2024-11-11 Bounded Rationality Equilibrium Learning in Mean Field Games Yannick Eich et.al. 2411.07099 link
2024-11-11 A Multi-Agent Approach for REST API Testing with Semantic Graphs and LLM-Driven Inputs Myeongsoo Kim et.al. 2411.07098 null
2024-11-11 Differentially-Private Collaborative Online Personalized Mean Estimation Yauhen Yakimenka et.al. 2411.07094 null
2024-11-08 Topology-aware Reinforcement Feature Space Reconstruction for Graph Data Wangyang Ying et.al. 2411.05742 null
2024-11-08 A Retrospective on the Robot Air Hockey Challenge: Benchmarking Robust, Reliable, and Safe Learning Techniques for Real-world Robotics Puze Liu et.al. 2411.05718 null
2024-11-08 Settling the Complexity of Popularity in Additively Separable and Fractional Hedonic Games Martin Bullinger et.al. 2411.05713 null
2024-11-08 Data-Driven Distributed Common Operational Picture from Heterogeneous Platforms using Multi-Agent Reinforcement Learning Indranil Sur et.al. 2411.05683 null
2024-11-08 The influence of persona and conversational task on social interactions with a LLM-controlled embodied conversational agent Leon O. H. Kroczek et.al. 2411.05653 null
2024-11-08 LightVA: Lightweight Visual Analytics with LLM Agent-Based Task Planning and Execution Yuheng Zhao et.al. 2411.05651 null
2024-11-08 Expectation vs. Reality: Towards Verification of Psychological Games Marta Kwiatkowska et.al. 2411.05599 null
2024-11-08 Smart navigation through a rotating barrier: Deep reinforcement learning with application to size-based separation of active microagents Mohammad Hossein Masoudi et.al. 2411.05587 null
2024-11-08 Tangled Program Graphs as an alternative to DRL-based control algorithms for UAVs Hubert Szolc et.al. 2411.05586 link
2024-11-08 Parameterized Voter Relevance in Facility Location Games with Tree-Shaped Invitation Graphs Ryoto Ando et.al. 2411.05574 null
2024-11-07 Few-Shot Task Learning through Inverse Generative Modeling Aviv Netanyahu et.al. 2411.04987 null
2024-11-07 Noisy Zero-Shot Coordination: Breaking The Common Knowledge Assumption In Zero-Shot Coordination Games Usman Anwar et.al. 2411.04976 link
2024-11-07 StoryAgent: Customized Storytelling Video Generation via Multi-Agent Collaboration Panwen Hu et.al. 2411.04925 null
2024-11-07 OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models Siming Huang et.al. 2411.04905 null
2024-11-07 Achieving superconductivity in infinite-layer nickelate thin films by aluminum sputtering deposition Dongxin Zhang et.al. 2411.04896 null
2024-11-07 GUI Agents with Foundation Models: A Comprehensive Survey Shuai Wang et.al. 2411.04890 null
2024-11-07 Think Smart, Act SMARL! Analyzing Probabilistic Logic Driven Safety in Multi-Agent Reinforcement Learning Satchit Chatterji et.al. 2411.04867 link
2024-11-07 Robust Regulation of Labour Contracts Théo Durandard et.al. 2411.04841 null
2024-11-07 Plasticity Loss in Deep Reinforcement Learning: A Survey Timo Klein et.al. 2411.04832 null
2024-11-07 MPVO: Motion-Prior based Visual Odometry for PointGoal Navigation Sayan Paul et.al. 2411.04796 null
2024-11-06 Predicting and Publishing Accurate Imbalance Prices Using Monte Carlo Tree Search Fabio Pavirani et.al. 2411.04011 null
2024-11-06 Temporal Network Creation Games: The Impact of Non-Locality and Terminals Davide Bilò et.al. 2411.03973 null
2024-11-06 Almost Time-Optimal Loosely-Stabilizing Leader Election on Arbitrary Graphs Without Identifiers in Population Protocols Haruki Kanaya et.al. 2411.03902 null
2024-11-06 AdaSociety: An Adaptive Environment with Social Structures for Multi-Agent Decision-Making Yizhe Huang et.al. 2411.03865 link
2024-11-06 Beyond The Rainbow: High Performance Deep Reinforcement Learning On A Desktop PC Tyler Clark et.al. 2411.03820 link
2024-11-06 From Novice to Expert: LLM Agent Policy Optimization via Step-wise Reinforcement Learning Zhirui Deng et.al. 2411.03817 null
2024-11-06 MRJ-Agent: An Effective Jailbreak Agent for Multi-Round Dialogue Fengxiang Wang et.al. 2411.03814 null
2024-11-06 Hybrid Transfer Reinforcement Learning: Provable Sample Efficiency from Shifted-Dynamics Data Chengrui Qu et.al. 2411.03810 link
2024-11-06 Multi-Modal Intelligent Channel Modeling: A New Modeling Paradigm via Synesthesia of Machines Lu Bai et.al. 2411.03711 null
2024-11-06 Learn to Slice, Slice to Learn: Unveiling Online Optimization and Reinforcement Learning for Slicing AI Services Amr Abo-eleneen et.al. 2411.03686 null
2024-11-05 SMoA: Improving Multi-agent Large Language Models with Sparse Mixture-of-Agents Dawei Li et.al. 2411.03284 link
2024-11-05 Causal Responsibility Attribution for Human-AI Collaboration Yahang Qi et.al. 2411.03275 link
2024-11-05 Spontaneous Emergence of Agent Individuality through Social Interactions in LLM-Based Communities Ryosuke Takata et.al. 2411.03252 null
2024-11-05 Troll Farms Philipp Denter et.al. 2411.03241 null
2024-11-05 A resolved Lyman-Alpha profile with doubly peaked emission at z~7 C. Moya-Sierralta et.al. 2411.03222 null
2024-11-05 GIS Copilot: Towards an Autonomous GIS Agent for Spatial Analysis Temitope Akinboyewa et.al. 2411.03205 link
2024-11-05 Online Data Collection for Efficient Semiparametric Inference Shantanu Gupta et.al. 2411.03195 link
2024-11-05 Hierarchical Orchestra of Policies Thomas P Cannon et.al. 2411.03008 null
2024-11-05 Accelerating Task Generalisation with Multi-Level Hierarchical Options Thomas P Cannon et.al. 2411.02998 null
2024-11-05 Transformer-Based Fault-Tolerant Control for Fixed-Wing UAVs Using Knowledge Distillation and In-Context Adaptation Francisco Giral et.al. 2411.02975 null
2024-11-04 Attacking Vision-Language Computer Agents via Pop-ups Yanzhe Zhang et.al. 2411.02391 link
2024-11-04 Two-Sided Learning in Decentralized Matching Markets Vade Shah et.al. 2411.02377 null
2024-11-04 Social-RAG: Retrieving from Group Interactions to Socially Ground Proactive AI Generation to Group Preferences Ruotong Wang et.al. 2411.02353 null
2024-11-04 WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning Zehan Qi et.al. 2411.02337 link
2024-11-04 CRMArena: Understanding the Capacity of LLM Agents to Perform Professional CRM Tasks in Realistic Environments Kung-Hsiang Huang et.al. 2411.02305 link
2024-11-04 Kinetic exchange opinion dynamics for the battleground-states in the 2024 US presidential elections Soumyajyoti Biswas et.al. 2411.02240 null
2024-11-04 Positive Experience Reflection for Agents in Interactive Text Environments Philip Lippmann et.al. 2411.02223 null
2024-11-04 CryptoEL: A Novel Experiential Learning Tool for Enhancing K-12 Cryptography Education Pranathi Rayavaram et.al. 2411.02143 null
2024-11-04 Foundations and Recent Trends in Multimodal Mobile Agents: A Survey Biao Wu et.al. 2411.02006 link
2024-11-04 Taking AI Welfare Seriously Robert Long et.al. 2411.00986 null
2024-10-31 Teaching Embodied Reinforcement Learning Agents: Informativeness and Diversity of Language Use Jiajun Xi et.al. 2410.24218 link
2024-10-31 DexMimicGen: Automated Data Generation for Bimanual Dexterous Manipulation via Imitation Learning Zhenyu Jiang et.al. 2410.24185 null
2024-10-31 Language-Driven Policy Distillation for Cooperative Driving in Multi-Agent Reinforcement Learning Jiaqi Liu et.al. 2410.24152 null
2024-10-31 Q-learning for Quantile MDPs: A Decomposition, Performance, and Convergence Analysis Jia Lin Hau et.al. 2410.24128 link
2024-10-31 Progressive Safeguards for Safe and Model-Agnostic Reinforcement Learning Nabil Omi et.al. 2410.24096 null
2024-10-31 Navigating the Unknown: A Chat-Based Collaborative Interface for Personalized Exploratory Tasks Yingzhe Peng et.al. 2410.24032 null
2024-10-31 AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents Yifan Xu et.al. 2410.24024 link
2024-10-31 Optimal control problems driven by nonlinear degenerate Fokker-Planck equations Francesca Anceschi et.al. 2410.24000 null
2024-10-31 Persuading a Credible Agent Jiarui Gan et.al. 2410.23989 null
2024-10-31 Fair Division of Chores with Budget Constraints Edith Elkind et.al. 2410.23979 null
2024-10-30 Proportional Fairness in Non-Centroid Clustering Ioannis Caragiannis et.al. 2410.23273 null
2024-10-30 Evaluating Cultural and Social Awareness of LLM Web Agents Haoyi Qiu et.al. 2410.23252 null
2024-10-30 Carrot and Stick: Eliciting Comparison Data and Beyond Yiling Chen et.al. 2410.23243 null
2024-10-30 A little less conversation, a little more action, please: Investigating the physical common-sense of LLMs in a 3D embodied environment Matteo G. Mecattaf et.al. 2410.23242 link
2024-10-31 Aligning Audio-Visual Joint Representations with an Agentic Workflow Shentong Mo et.al. 2410.23230 null
2024-10-30 OS-ATLAS: A Foundation Action Model for Generalist GUI Agents Zhiyong Wu et.al. 2410.23218 link
2024-10-30 Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks Michael Matthews et.al. 2410.23208 link
2024-10-30 VisualPredicator: Learning Abstract World Models with Neuro-Symbolic Predicates for Robot Planning Yichao Liang et.al. 2410.23156 null
2024-10-30 Fair Division with Market Values Siddharth Barman et.al. 2410.23137 null
2024-10-30 First Place Solution to the ECCV 2024 ROAD++ Challenge @ ROAD++ Spatiotemporal Agent Detection 2024 Tengfei Zhang et.al. 2410.23077 null
2024-10-29 Environment as Policy: Learning to Race in Unseen Tracks Hongze Wang et.al. 2410.22308 null
2024-10-29 Flow-DPO: Improving LLM Mathematical Reasoning through Online Multi-Agent Learning Yihe Deng et.al. 2410.22304 null
2024-10-29 Fourier Head: Helping Large Language Models Learn Complex Probability Distributions Nate Gillman et.al. 2410.22269 null
2024-10-29 RingSim- An Agent-based Approach for Modelling Mesoscopic Magnetic Nanowire Networks Ian T Vidamour et.al. 2410.22204 null
2024-10-29 Democratizing Reward Design for Personal and Representative Value-Alignment Carter Blair et.al. 2410.22203 null
2024-10-29 ADAM: An Embodied Causal Agent in Open-World Environments Shu Yu et.al. 2410.22194 null
2024-10-29 EconoJax: A Fast & Scalable Economic Simulation in Jax Koen Ponse et.al. 2410.22165 link
2024-10-29 Improving Performance of Commercially Available AI Products in a Multi-Agent Configuration Cory Hymel et.al. 2410.22129 null
2024-10-29 Inverse Design Method with Enhanced Sampling for Complex Open Crystals: Application to Novel Zeolite Self-Assembly in a Coarse-Grained Model Chaohong Wang et.al. 2410.22111 null
2024-10-29 An LLM-based Simulation Framework for Embodied Conversational Agents in Psychological Counseling Lixiu Wu et.al. 2410.22041 link
2024-10-28 Capacity-Aware Planning and Scheduling in Budget-Constrained Monotonic MDPs: A Meta-RL Approach Manav Vora et.al. 2410.21249 null
2024-10-28 Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines Zhixin Zhang et.al. 2410.21220 link
2024-10-28 Magnetic Milli-spinner for Robotic Endovascular Surgery Shuai Wu et.al. 2410.21112 null
2024-10-28 Dual-Agent Deep Reinforcement Learning for Dynamic Pricing and Replenishment Yi Zheng et.al. 2410.21109 null
2024-10-28 LiGAR: LiDAR-Guided Hierarchical Transformer for Multi-Modal Group Activity Recognition Naga Venkata Sai Raviteja Chappa et.al. 2410.21108 null
2024-10-28 Topological Identification of Agent Status in Information Contagions: Application to Financial Markets Anubha Goel et.al. 2410.21104 link
2024-10-28 Automatic Generation of Benchmarks and Reliable LLM Judgment for Code Tasks Eitan Farchi et.al. 2410.21071 null
2024-10-28 CRAT: A Multi-Agent Framework for Causality-Enhanced Reflective and Retrieval-Augmented Translation with Large Language Models Meiqi Chen et.al. 2410.21067 null
2024-10-28 Getting By Goal Misgeneralization With a Little Help From a Mentor Tu Trinh et.al. 2410.21052 null
2024-10-28 FairStream: Fair Multimedia Streaming Benchmark for Reinforcement Learning Agents Jannis Weil et.al. 2410.21029 link
2024-10-25 FISHNET: Financial Intelligence from Sub-querying, Harmonizing, Neural-Conditioning, Expert Swarms, and Task Planning Nicole Cho et.al. 2410.19727 null
2024-10-25 Evolving Neural Networks Reveal Emergent Collective Behavior from Minimal Agent Interactions Guilherme S. Y. Giardini et.al. 2410.19718 null
2024-10-25 Adversarial Environment Design via Regret-Guided Diffusion Models Hojun Chung et.al. 2410.19715 null
2024-10-25 Robust Thompson Sampling Algorithms Against Reward Poisoning Attacks Yinglun Xu et.al. 2410.19705 null
2024-10-25 AGENT-CQ: Automatic Generation and Evaluation of Clarifying Questions for Conversational Search with LLMs Clemencia Siro et.al. 2410.19692 null
2024-10-25 The Sound of Silence in Social Networks Jesús Aranda et.al. 2410.19685 null
2024-10-25 Optimizing Hearthstone Agents using an Evolutionary Algorithm Pablo García-Sánchez et.al. 2410.19681 link
2024-10-25 MetaTrading: An Immersion-Aware Model Trading Framework for Vehicular Metaverse Services Hongjia Wu et.al. 2410.19665 null
2024-10-25 Planning-Aware Diffusion Networks for Enhanced Motion Forecasting in Autonomous Driving Liu Yunhao et.al. 2410.19639 null
2024-10-25 Knowledge Graph Enhanced Language Agents for Recommendation Taicheng Guo et.al. 2410.19627 null
2024-10-24 Learning to Look: Seeking Information for Decision Making via Policy Factorization Shivin Dass et.al. 2410.18964 null
2024-10-24 OSCAR: Operating System Control via State-Aware Reasoning and Re-Planning Xiaoqiang Wang et.al. 2410.18963 null
2024-10-24 Schema-Guided Culture-Aware Complex Event Simulation with Multi-Agent Role-Play Sha Li et.al. 2410.18935 null
2024-10-24 SkillMimicGen: Automated Demonstration Generation for Efficient Skill Learning and Deployment Caelan Garrett et.al. 2410.18907 null
2024-10-24 Improving Small-Scale Large Language Models Function Calling for Reasoning Tasks Graziano A. Manduzio et.al. 2410.18890 null
2024-10-24 Learning Collusion in Episodic, Inventory-Constrained Markets Paul Friedrich et.al. 2410.18871 link
2024-10-25 An LLM Agent for Automatic Geospatial Data Analysis Yuxing Chen et.al. 2410.18792 null
2024-10-24 Applying Neural Monte Carlo Tree Search to Unsignalized Multi-intersection Scheduling for Autonomous Vehicles Yucheng Shi et.al. 2410.18786 null
2024-10-24 Active Target Tracking Using Bearing-only Measurements With Gaussian Process Learning Yingbo Fu et.al. 2410.18669 null
2024-10-24 Approximate EFX and Exact tEFX Allocations for Indivisible Chores: Improved Algorithms Mahyar Afshinmehr et.al. 2410.18655 null
2024-10-23 Prioritized Generative Replay Renhao Wang et.al. 2410.18082 null
2024-10-23 The Double-Edged Sword of Behavioral Responses in Strategic Classification: Theory and User Studies Raman Ebrahimi et.al. 2410.18066 null
2024-10-23 SPIRE: Synergistic Planning, Imitation, and Reinforcement Learning for Long-Horizon Manipulation Zihan Zhou et.al. 2410.18065 null
2024-10-23 A Comparative Assessment of Technology Acceptance and Learning Outcomes in Computer-based versus VR-based Pedagogical Agents Aimilios Hadjiliasi et.al. 2410.18048 null
2024-10-23 GraphTeam: Facilitating Large Language Model-based Graph Analysis via Multi-Agent Collaboration Xin Li et.al. 2410.18032 link
2024-10-23 MiniFed : Integrating LLM-based Agentic-Workflow for Simulating FOMC Meeting Sungil Seok et.al. 2410.18012 null
2024-10-23 Dynamic models of gentrification Giovanni Mauro et.al. 2410.18004 null
2024-10-23 POMDP-Driven Cognitive Massive MIMO Radar: Joint Target Detection-Tracking In Unknown Disturbances Imad Bouhou et.al. 2410.17967 null
2024-10-23 On Regularity and Normalization in Sequential Screening Ian Ball et.al. 2410.17962 null
2024-10-23 Guide for Defense (G4D): Dynamic Guidance for Robust and Balanced Defense in Large Language Models He Cao et.al. 2410.17922 link
2024-10-22 SELA: Tree-Search Enhanced LLM Agents for Automated Machine Learning Yizhou Chi et.al. 2410.17238 link
2024-10-22 Large Language Models Empowered Personalized Web Agents Hongru Cai et.al. 2410.17236 null
2024-10-22 Responsibility in a Multi-Value Strategic Setting Timothy Parker et.al. 2410.17229 null
2024-10-22 Scalable spectral representations for network multiagent control Zhaolin Ren et.al. 2410.17221 null
2024-10-23 Non-myopic Generation of Language Model for Reasoning and Planning Chang Ma et.al. 2410.17195 link
2024-10-22 DyPNIPP: Predicting Environment Dynamics for RL-based Robust Informative Path Planning Srujan Deolasee et.al. 2410.17186 null
2024-10-22 Layered LA-MAPF: a decomposition of large agent MAPF instance to accelerate solving without compromising solvability Zhuo Yao et.al. 2410.17160 link
2024-10-22 Mechanistic interplay between information spreading and opinion polarization Kleber A. Oliveira et.al. 2410.17151 null
2024-10-22 Advancing lunar exploration through virtual reality simulations: a framework for future human missions Giacomo Franchini et.al. 2410.17132 null
2024-10-22 Exploration and Persuasion Aleksandrs Slivkins et.al. 2410.17086 null
2024-10-21 Agent-to-Sim: Learning Interactive Behavior Models from Casual Longitudinal Videos Gengshan Yang et.al. 2410.16259 null
2024-10-21 IBGP: Imperfect Byzantine Generals Problem for Zero-Shot Robustness in Communicative Multi-Agent Systems Yihuan Mao et.al. 2410.16237 null
2024-10-21 Sketch2Code: Evaluating Vision-Language Models for Interactive Web Design Prototyping Ryan Li et.al. 2410.16232 null
2024-10-21 Role of obstacle softness in the diffusive behavior of active Particles Ankit Gupta et.al. 2410.16223 null
2024-10-21 CoT-TL: Low-Resource Temporal Knowledge Representation of Planning Instructions Using Chain-of-Thought Reasoning Kumar Manas et.al. 2410.16207 link
2024-10-22 LASER: Script Execution by Autonomous Agents for On-demand Traffic Simulation Hao Gao et.al. 2410.16197 link
2024-10-21 Spiking Neural Networks as a Controller for Emergent Swarm Agents Kevin Zhu et.al. 2410.16175 null
2024-10-21 A Troublemaker with Contagious Jailbreak Makes Chaos in Honest Towns Tianyi Men et.al. 2410.16155 null
2024-10-21 AdChain: Decentralized Header Bidding Behkish Nassirzadeh et.al. 2410.16141 null
2024-10-21 Constrained Truthful Obnoxious Two-Facility Location with Optional Preferences Panagiotis Kanellopoulos et.al. 2410.16131 null
2024-10-18 Teaching Models to Balance Resisting and Accepting Persuasion Elias Stengel-Eskin et.al. 2410.14596 link
2024-10-18 Toolshed: Scale Tool-Equipped Agents with Advanced RAG-Tool Fusion and Tool Knowledge Bases Elias Lumer et.al. 2410.14594 null
2024-10-18 Temporal Fair Division of Indivisible Items Edith Elkind et.al. 2410.14593 null
2024-10-18 Neuro-Symbolic Traders: Assessing the Wisdom of AI Crowds in Markets Namid R. Stillman et.al. 2410.14587 null
2024-10-18 Neural Combinatorial Clustered Bandits for Recommendation Systems Baran Atalar et.al. 2410.14586 null
2024-10-18 Do LLMs estimate uncertainty well in instruction-following? Juyeon Heo et.al. 2410.14582 link
2024-10-18 When LLMs Go Online: The Emerging Threat of Web-Enabled LLMs Hanna Kim et.al. 2410.14569 null
2024-10-18 RAG-ConfusionQA: A Benchmark for Evaluating LLMs on Confusing Questions Zhiyuan Peng et.al. 2410.14567 link
2024-10-18 Performance bounds for multi-vehicle networks with local integrators Jonas Hansson et.al. 2410.14525 null
2024-10-18 Do LLMs “know” internally when they follow instructions? Juyeon Heo et.al. 2410.14516 link
2024-10-17 VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding Runsen Xu et.al. 2410.13860 link
2024-10-17 AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agents Ke Yang et.al. 2410.13825 null
2024-10-18 Harnessing Webpage UIs for Text-Rich Visual Understanding Junpeng Liu et.al. 2410.13824 null
2024-10-17 Rapid and Automated Alloy Design with Graph Neural Network-Powered LLM-Driven Multi-Agent Systems Alireza Ghafarollahi et.al. 2410.13768 null
2024-10-17 MobA: A Two-Level Agent System for Efficient Mobile Task Automation Zichen Zhu et.al. 2410.13757 link
2024-10-17 Interacting humans and robots can improve sensory prediction by adapting their viscoelasticity Xiaoxiao Cheng et.al. 2410.13755 null
2024-10-17 Real Eventual Exponential Positivity of Complex-valued Laplacians: Applications to Consensus in Multi-agent Systems Aditi Saxena et.al. 2410.13700 null
2024-10-17 ORCHID: A Chinese Debate Corpus for Target-Independent Stance Detection and Argumentative Dialogue Summarization Xiutian Zhao et.al. 2410.13667 link
2024-10-17 A Comparative Study on Reasoning Patterns of OpenAI’s o1 Model Siwei Wu et.al. 2410.13639 link
2024-10-17 Phenotype structuring in collective cell migration:a tutorial of mathematical models and methods Tommaso Lorenzi et.al. 2410.13629 null
2024-10-16 JudgeBench: A Benchmark for Evaluating LLM-based Judges Sijun Tan et.al. 2410.12784 link
2024-10-16 Prophet Upper Bounds for Online Matching and Auctions José Soto et.al. 2410.12756 null
2024-10-16 HEnRY: A Multi-Agent System Framework for Multi-Domain Contexts Emmanuele Lacavalla et.al. 2410.12720 link
2024-10-16 A comparative analysis of metamodels for lumped cardiovascular models, and pipeline for sensitivity analysis, parameter estimation, and uncertainty quantification John M. Hanna et.al. 2410.12654 null
2024-10-16 Hybrid Decision Making for Scalable Multi-Agent Navigation: Integrating Semantic Maps, Discrete Coordination, and Model Predictive Control Koen de Vos et.al. 2410.12651 null
2024-10-16 Zeroth-Order Feedback Optimization in Multi-Agent Systems: Tackling Coupled Constraints Yingpeng Duan et.al. 2410.12647 null
2024-10-16 Dynamic Learning Rate for Deep Reinforcement Learning: A Bandit Approach Henrique Donâncio et.al. 2410.12598 null
2024-10-16 Robust RL with LLM-Driven Data Synthesis and Policy Adaptation for Autonomous Driving Sihao Wu et.al. 2410.12568 null
2024-10-16 A Communication Consistent Approach to Signal Temporal Logic Task Decomposition in Multi-Agent Systems Gregorio Marchesini et.al. 2410.12563 null
2024-10-16 Nash equilibria in scalar discrete-time linear quadratic games Giulio Salizzoni et.al. 2410.12544 null
2024-10-15 Molecular Quantum Control Algorithm Design by Reinforcement Learning Anastasia Pipi et.al. 2410.11839 null
2024-10-15 G-Designer: Architecting Multi-agent Communication Topologies via Graph Neural Networks Guibin Zhang et.al. 2410.11782 null
2024-10-15 BlendRL: A Framework for Merging Symbolic and Neural Policy Learning Hikaru Shindo et.al. 2410.11689 null
2024-10-15 Optimal Mediation Mechanisms in Bilateral Trade Zhikang Fan et.al. 2410.11683 null
2024-10-15 Safety Filtering While Training: Improving the Performance and Sample Efficiency of Reinforcement Learning Agents Federico Pizarro Bejarano et.al. 2410.11671 link
2024-10-15 Markov-Nash equilibria in mean-field games under model uncertainty Johannes Langner et.al. 2410.11652 null
2024-10-15 Improve Value Estimation of Q Function and Reshape Reward with Monte Carlo Tree Search Jiamian Li et.al. 2410.11642 null
2024-10-15 Findings of the WMT 2024 Shared Task on Chat Translation Wafaa Mohammed et.al. 2410.11624 null
2024-10-15 Temporal Hyperproperties for Population Protocols Nicolas Waldburger et.al. 2410.11572 null
2024-10-15 Demo: Testing AI-driven MAC Learning in Autonomic Networks Leonard Paeleke et.al. 2410.11565 null
2024-10-14 AFlow: Automating Agentic Workflow Generation Jiayi Zhang et.al. 2410.10762 link
2024-10-14 Denial-of-Service Poisoning Attacks against Large Language Models Kuofeng Gao et.al. 2410.10760 link
2024-10-14 DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model Yuqi Wang et.al. 2410.10738 null
2024-10-14 Online Statistical Inference for Time-varying Sample-averaged Q-learning Saunak Kumar Panda et.al. 2410.10737 null
2024-10-14 Enhancing Robustness in Deep Reinforcement Learning: A Lyapunov Exponent Approach Rory Young et.al. 2410.10674 null
2024-10-14 Transforming Game Play: A Comparative Study of DCQN and DTQN Architectures in Reinforcement Learning William A. Stigall et.al. 2410.10660 null
2024-10-14 Intelligent prospector v2.0: exploration drill planning under epistemic model uncertainty John Mern et.al. 2410.10610 null
2024-10-14 STACKFEED: Structured Textual Actor-Critic Knowledge Base Editing with FeedBack Naman Gupta et.al. 2410.10584 null
2024-10-14 Consensus in Multiagent Systems with lack of connection Mohamed Bentaibi et.al. 2410.10486 null
2024-10-14 Compositional Shielding and Reinforcement Learning for Multi-Agent Systems Asger Horn Brorholt et.al. 2410.10460 null
2024-10-11 PEAR: A Robust and Flexible Automation Framework for Ptychography Enabled by Multiple Large Language Model Agents Xiangyu Yin et.al. 2410.09034 link
2024-10-11 AgentHarm: A Benchmark for Measuring Harmfulness of LLM Agents Maksym Andriushchenko et.al. 2410.09024 null
2024-10-11 From Interaction to Impact: Towards Safer AI Agents Through Understanding and Evaluating UI Operation Impacts Zhuohao Jerry Zhang et.al. 2410.09006 null
2024-10-11 Cyclic jetting enables microbubble-mediated drug delivery Marco Cattaneo et.al. 2410.08990 null
2024-10-11 Best-of-Both-Worlds Fairness of the Envy-Cycle-Elimination Algorithm Jugal Garg et.al. 2410.08986 null
2024-10-11 Optimal Allocation with Peer Information Axel Niemeyer et.al. 2410.08954 null
2024-10-11 Transferable Belief Model on Quantum Circuits Qianli Zhou et.al. 2410.08949 null
2024-10-11 The Dynamics of Social Conventions in LLM populations: Spontaneous Emergence, Collective Biases and Tipping Points Ariel Flint Ashery et.al. 2410.08948 null
2024-10-11 Hyperspectral fluorescence imaging using a high-speed silicon photomultiplier array Chi Z. Huang et.al. 2410.08936 null
2024-10-11 MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL Claas A Voelcker et.al. 2410.08896 null
2024-10-10 Agent S: An Open Agentic Framework that Uses Computers Like a Human Saaket Agashe et.al. 2410.08164 link
2024-10-10 DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory Yutong Wang et.al. 2410.08143 link
2024-10-10 SoundScape: A Human-AI Co-Creation System Making Your Memories Heard Chongjun Zhong et.al. 2410.08136 null
2024-10-10 Steering Masked Discrete Diffusion Models via Discrete Denoising Posterior Prediction Jarrid Rector-Brooks et.al. 2410.08134 null
2024-10-10 Mars: Situated Inductive Reasoning in an Open-World Environment Xiaojuan Tang et.al. 2410.08126 null
2024-10-10 Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System Weize Chen et.al. 2410.08115 null
2024-10-10 Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining Tianyi Bai et.al. 2410.08102 link
2024-10-10 Agent-based modeling for realistic reproduction of human mobility and contact behavior to evaluate test and isolation strategies in epidemic infectious disease spread David Kerkmann et.al. 2410.08050 link
2024-10-10 Strategic Classification With Externalities Yiling Chen et.al. 2410.08032 null
2024-10-10 Probabilistic Satisfaction of Temporal Logic Constraints in Reinforcement Learning via Adaptive Policy-Switching Xiaoshan Lin et.al. 2410.08022 null
2024-10-09 Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making Manling Li et.al. 2410.07166 link
2024-10-09 Spatiotemporal Modeling and Forecasting at Scale with Dynamic Generalized Linear Models Pranay Pherwani et.al. 2410.07161 null
2024-10-09 I Want to Break Free! Anti-Social Behavior and Persuasion Ability of LLMs in Multi-Agent Settings with Social Hierarchy Gian Maria Campedelli et.al. 2410.07109 link
2024-10-09 Identifying and Addressing Delusions for Target-Directed Decision-Making Mingde Zhao et.al. 2410.07096 link
2024-10-09 MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering Jun Shern Chan et.al. 2410.07095 link
2024-10-10 Towards Realistic UAV Vision-Language Navigation: Platform, Benchmark, and Methodology Xiangyu Wang et.al. 2410.07087 null
2024-10-09 MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses Zonglin Yang et.al. 2410.07076 link
2024-10-09 Retrieval-Augmented Decision Transformer: External Memory for In-context RL Thomas Schmied et.al. 2410.07071 link
2024-10-09 Mechanism Design for Exchange Markets Yusen Zheng et.al. 2410.07023 null
2024-10-09 Seeker: Enhancing Exception Handling in Code with LLM-based Multi-Agent Approach Xuanming Zhang et.al. 2410.06949 link
2024-10-07 Grounding Partially-Defined Events in Multimodal Data Kate Sanders et.al. 2410.05267 null
2024-10-07 GLEE: A Unified Framework and Benchmark for Language-based Economic Environments Eilam Shapira et.al. 2410.05254 link
2024-10-07 Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents Boyu Gou et.al. 2410.05243 link
2024-10-07 Scalable and Accurate Graph Reasoning with LLM-based Multi-Agents Yuwei Hu et.al. 2410.05130 null
2024-10-08 Last Iterate Convergence in Monotone Mean Field Games Noboru Isobe et.al. 2410.05127 null
2024-10-07 ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery Ziru Chen et.al. 2410.05080 null
2024-10-07 Extended Functional Representation Lemma: A Tool For Privacy, Semantic Representation, Caching, and Compression Design Amirreza Zamani et.al. 2410.05033 null
2024-10-07 Active Fine-Tuning of Generalist Policies Marco Bagatella et.al. 2410.05026 null
2024-10-07 Contest design with a finite type-space: A unifying approach Andrzej Baranski et.al. 2410.04970 null
2024-10-07 Training Interactive Agent in Large FPS Game Map with Rule-enhanced Reinforcement Learning Chen Zhang et.al. 2410.04936 null
2024-10-04 Open-World Reinforcement Learning over Long Short-Term Imagination Jiajian Li et.al. 2410.03618 link
2024-10-04 Never Mind The No-Ops: Faster and Less Volatile Simulation Modelling of Co-Evolutionary Species Interactions via Spatial Cyclic Games Dave Cliff et.al. 2410.03586 link
2024-10-04 Training on more Reachable Tasks for Generalisation in Reinforcement Learning Max Weltevrede et.al. 2410.03565 null
2024-10-04 Steering Large Language Models between Code Execution and Textual Reasoning Yongchao Chen et.al. 2410.03524 null
2024-10-04 Tournament versus Circulant: On Simulating 7-Species Evolutionary Spatial Cyclic Games with Ablated Predator-Prey Networks as Models of Biodiversity Dave Cliff et.al. 2410.03518 link
2024-10-04 MO-DDN: A Coarse-to-Fine Attribute-based Exploration Agent for Multi-object Demand-driven Navigation Hongcheng Wang et.al. 2410.03488 null
2024-10-04 VEDIT: Latent Prediction Architecture For Procedural Video Representation Learning Han Lin et.al. 2410.03478 null
2024-10-04 MLLM as Retriever: Interactively Learning Multimodal Retrieval for Embodied Agents Junpeng Yue et.al. 2410.03450 null
2024-10-04 Attainable Force Approximation and Full-Pose Tracking Control of an Over-Actuated Thrust-Vectoring Modular Team UAV Yen-Cheng Chu et.al. 2410.03445 null
2024-10-04 ToolGen: Unified Tool Retrieval and Calling via Generation Renxi Wang et.al. 2410.03439 link
2024-10-03 ReLIC: A Recipe for 64k Steps of In-Context Reinforcement Learning for Embodied AI Ahmad Elawady et.al. 2410.02751 link
2024-10-03 Grounding Large Language Models In Embodied Environment With Imperfect World Models Haolan Liu et.al. 2410.02742 null
2024-10-03 DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects Zhaowei Wang et.al. 2410.02730 link
2024-10-03 Domain-Specific Retrieval-Augmented Generation Using Vector Stores, Knowledge Graphs, and Tensor Factorization Ryan C. Barron et.al. 2410.02721 null
2024-10-03 Grounded Answers for Multi-agent Decision-making Problem through Generative World Model Zeyang Liu et.al. 2410.02664 null
2024-10-03 Undesirable Memorization in Large Language Models: A Survey Ali Satvaty et.al. 2410.02650 null
2024-10-04 Learning 3D Perception from Others’ Predictions Jinsu Yoo et.al. 2410.02646 null
2024-10-03 Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based Agents Hanrong Zhang et.al. 2410.02644 link
2024-10-03 Beyond Expected Returns: A Policy Gradient Algorithm for Cumulative Prospect Theoretic Reinforcement Learning Olivier Lepel et.al. 2410.02605 null
2024-10-03 Agents’ Room: Narrative Generation through Multi-step Collaboration Fantine Huot et.al. 2410.02603 link
2024-10-02 Windowed MAPF with Completeness Guarantees Rishi Veerapaneni et.al. 2410.01798 null
2024-10-02 Open Human-Robot Collaboration using Decentralized Inverse Reinforcement Learning Prasanth Sengadu Suresh et.al. 2410.01790 null
2024-10-02 Social coordination perpetuates stereotypic expectations and behaviors across generations in deep multi-agent reinforcement learning Rebekah A. Gelpí et.al. 2410.01763 null
2024-10-02 PreND: Enhancing Intrinsic Motivation in Reinforcement Learning through Pre-trained Network Distillation Mohammadamin Davoodabadi et.al. 2410.01745 null
2024-10-02 Mimicking Human Intuition: Cognitive Belief-Driven Q-Learning Xingrui Gu et.al. 2410.01739 null
2024-10-02 Performant, Memory Efficient and Scalable Multi-Agent Reinforcement Learning Omayma Mahjoub et.al. 2410.01706 null
2024-10-02 Stable Offline Value Function Learning with Bisimulation-based Representations Brahma S. Pavse et.al. 2410.01643 null
2024-10-02 Moral Alignment for LLM Agents Elizaveta Tennant et.al. 2410.01639 null
2024-10-02 Entropy-Based Uncertainty Modeling for Trajectory Prediction in Autonomous Driving Aron Distelzweig et.al. 2410.01628 null
2024-10-02 Automated Red Teaming with GOAT: the Generative Offensive Agent Tester Maya Pavlova et.al. 2410.01606 null
2024-09-30 LaMMA-P: Generalizable Multi-Agent Long-Horizon Task Allocation and Planning with LM-Driven PDDL Planner Xiaopan Zhang et.al. 2409.20560 null
2024-09-30 Propose, Assess, Search: Harnessing LLMs for Goal-Oriented Planning in Instructional Videos Md Mohaiminul Islam et.al. 2409.20557 null
2024-09-30 Direct Multipath-Based SLAM Mingchao Liang et.al. 2409.20552 null
2024-09-30 COLLAGE: Collaborative Human-Agent Interaction Generation using Hierarchical Latent Diffusion and Language Models Divyanshu Daiya et.al. 2409.20502 null
2024-09-30 Impartial Selection Under Combinatorial Constraints Javier Cembrano et.al. 2409.20477 null
2024-09-30 Facility Location Games with Competitors Cheng Peng et.al. 2409.20396 null
2024-09-30 Machine Learning-enabled Traffic Steering in O-RAN: A Case Study on Hierarchical Learning Approach Md Arafat Habib et.al. 2409.20391 null
2024-09-30 Efficient Driving Behavior Narration and Reasoning on Edge Device Using Large Language Models Yizhou Huang et.al. 2409.20364 null
2024-09-30 A mean field Jacobi process for modeling sustainable tourism Hidekazu Yoshioka et.al. 2409.20347 null
2024-09-30 MARLadona – Towards Cooperative Team Play Using Multi-Agent Reinforcement Learning Zichong Li et.al. 2409.20326 null
2024-09-27 Mean-Field Control Barrier Functions: A Framework for Real-Time Swarm Control Samy Wu Fung et.al. 2409.18945 null
2024-09-27 Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models Jiaming Li et.al. 2409.18943 link
2024-09-27 AIPatient: Simulating Patients with EHRs and LLM Powered Agentic Workflow Huizi Yu et.al. 2409.18924 null
2024-09-27 Best Arm Identification with Minimal Regret Junwen Yang et.al. 2409.18909 null
2024-09-27 Simulating Dynamic Tumor Contrast Enhancement in Breast MRI using Conditional Generative Adversarial Networks Richard Osuala et.al. 2409.18872 link
2024-09-27 Safe Decentralized Multi-Agent Control using Black-Box Predictors, Conformal Decision Policies, and Control Barrier Functions Sacha Huriot et.al. 2409.18862 null
2024-09-27 ARLBench: Flexible and Efficient Benchmarking for Hyperparameter Optimization in Reinforcement Learning Jannis Becktepe et.al. 2409.18827 link
2024-09-27 Facility Location Problem with Aleatory Agents Gennaro Auricchio et.al. 2409.18817 null
2024-09-27 Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-Source LLMs Yanyuan Qiao et.al. 2409.18794 null
2024-09-27 Forecasting Macroeconomic Dynamics using a Calibrated Data-Driven Agent-based Model Samuel Wiese et.al. 2409.18760 null
2024-09-26 StackGen: Generating Stable Structures from Silhouettes via Diffusion Luzhe Sun et.al. 2409.18098 null
2024-09-26 Infer Human’s Intentions Before Following Natural Language Instructions Yanming Wan et.al. 2409.18073 link
2024-09-27 Explaining Explaining Sergei Nirenburg et.al. 2409.18052 null
2024-09-26 Inverse Reinforcement Learning with Multiple Planning Horizons Jiayu Yao et.al. 2409.18051 null
2024-09-26 Revisit Anything: Visual Place Recognition via Image Segment Retrieval Kartik Garg et.al. 2409.18049 link
2024-09-26 Reasoning Multi-Agent Behavioral Topology for Interactive Autonomous Driving Haochen Liu et.al. 2409.18031 link
2024-09-26 Compositional Hardness of Code in Large Language Models – A Probabilistic Perspective Yotam Wolf et.al. 2409.18028 null
2024-09-26 Control Industrial Automation System with Large Language Models Yuchen Xia et.al. 2409.18009 link
2024-09-26 Distributed Invariant Unscented Kalman Filter based on Inverse Covariance Intersection with Intermittent Measurements Zhian Ruan et.al. 2409.17997 null
2024-09-26 Nonparametric Inference Framework for Time-dependent Epidemic Models Son Luu et.al. 2409.17968 null
2024-09-25 Turn Every Application into an Agent: Towards Efficient Human-Agent-Computer Interaction with API-First LLM-Based Agents Junting Lu et.al. 2409.17140 null
2024-09-25 Collision-free time-optimal path parameterization for multi-robot teams Katherine Mao et.al. 2409.17079 null
2024-09-25 AI-Driven Risk-Aware Scheduling for Active Debris Removal Missions Antoine Poupon et.al. 2409.17012 null
2024-09-25 PitRSDNet: Predicting Intra-operative Remaining Surgery Duration in Endoscopic Pituitary Surgery Anjana Wijekoon et.al. 2409.16998 null
2024-09-25 Tell Me What You Don’t Know: Enhancing Refusal Capabilities of Role-Playing Agents via Representation Space Analysis and Editing Wenhao Liu et.al. 2409.16913 null
2024-09-25 A Roadmap for Embodied and Social Grounding in LLMs Sara Incao et.al. 2409.16900 null
2024-09-25 Robotic Backchanneling in Online Conversation Facilitation: A Cross-Generational Study Sota Kobuki et.al. 2409.16899 null
2024-09-25 Automating Traffic Model Enhancement with AI Research Agent Xusen Guo et.al. 2409.16876 link
2024-09-25 Communication Backbone Reconfiguration with Connectivity Maintenance Leonardo Santos et.al. 2409.16851 null
2024-09-25 Modeling the Modqueue: Towards Understanding and Improving Report Resolution on Reddit Tanvi Bajpai et.al. 2409.16840 null
2024-09-24 Context-Based Meta Reinforcement Learning for Robust and Adaptable Peg-in-Hole Assembly Tasks Ahmed Shokry et.al. 2409.16208 null
2024-09-25 Extending Stable and Popular Matching Algorithms from Bipartite to Arbitrary Instances Gergely Csáji et.al. 2409.16173 null
2024-09-24 EnIGMA: Enhanced Interactive Generative Model Agent for CTF Challenges Talor Abramovich et.al. 2409.16165 link
2024-09-25 Efficient Motion Prediction: A Lightweight & Accurate Trajectory Prediction Model With Fast Training and Inference Speed Alexander Prutsch et.al. 2409.16154 link
2024-09-24 Analyzing Probabilistic Methods for Evaluating Agent Capabilities Axel Højmark et.al. 2409.16125 null
2024-09-24 MOSS: Enabling Code-Driven Evolution and Context Management for AI Agents Ming Zhu et.al. 2409.16120 link
2024-09-24 A decision-theoretic model for a principal-agent collaborative learning problem Getachew K Befekadu et.al. 2409.16068 null
2024-09-24 Bridging Environments and Language with Rendering Functions and Vision-Language Models Theo Cachet et.al. 2409.16024 null
2024-09-24 AIR-Embodied: An Efficient Active 3DGS-based Interaction and Reconstruction Framework with Embodied Large Language Model Zhenghao Qi et.al. 2409.16019 link
2024-09-24 Automated test generation to evaluate tool-augmented LLMs as conversational AI agents Samuel Arcadinho et.al. 2409.15934 null
2024-09-18 Residual Descent Differential Dynamic Game (RD3G) – A Fast Newton Solver for Constrained General Sum Games Zhiyuan Zhang et.al. 2409.12152 null
2024-09-18 MAgICoRe: Multi-Agent, Iterative, Coarse-to-Fine Refinement for Reasoning Justin Chih-Yao Chen et.al. 2409.12147 link
2024-09-19 The Impact of Element Ordering on LM Agent Performance Wayne Chi et.al. 2409.12089 link
2024-09-19 Using Large Language Models to Generate Clinical Trial Tables and Figures Yumeng Yang et.al. 2409.12046 null
2024-09-19 Representing Positional Information in Generative World Models for Object Manipulation Stefano Ferraro et.al. 2409.12005 null
2024-09-18 Putting Data at the Centre of Offline Multi-Agent Reinforcement Learning Claude Formanek et.al. 2409.12001 null
2024-09-18 On the Stability of Consensus Control under Rotational Ambiguities Zhonggang Li et.al. 2409.11979 null
2024-09-18 Anomalous behavior of Replicator dynamics for the Prisoner’s Dilemma on diluted lattices Fernanda R. Leivas et.al. 2409.11955 null
2024-09-18 Reinforcement Learning as an Improvement Heuristic for Real-World Production Scheduling Arthur Müller et.al. 2409.11933 null
2024-09-18 Secure Control Systems for Autonomous Quadrotors against Cyber-Attacks Samuel Belkadi et.al. 2409.11897 link
2024-09-17 Ising model with varying spin strength on a scale-free network: scaling functions and critical amplitude ratios M. Krasnytska et.al. 2409.11396 null
2024-09-17 Distributed Perception Aware Safe Leader Follower System via Control Barrier Methods Richie R. Suganda et.al. 2409.11394 null
2024-09-17 LLM-Agent-UMF: LLM-based Agent Unified Modeling Framework for Seamless Integration of Multi Active/Passive Core-Agents Amine B. Hassouna et.al. 2409.11393 null
2024-09-17 CORE-Bench: Fostering the Credibility of Published Research Through a Computational Reproducibility Agent Benchmark Zachary S. Siegel et.al. 2409.11363 link
2024-09-17 A Scalable Game Theoretic Approach for Coordination of Multiple Dynamic Systems Mostafa M. Shibl et.al. 2409.11358 null
2024-09-17 EIA: Environmental Injection Attack on Generalist Web Agents for Privacy Leakage Zeyi Liao et.al. 2409.11295 link
2024-09-17 P-RAG: Progressive Retrieval Augmented Generation For Planning on Embodied Everyday Task Weiye Xu et.al. 2409.11279 null
2024-09-17 Hackphyr: A Local Fine-Tuned LLM Agent for Network Security Environments Maria Rigaki et.al. 2409.11276 null
2024-09-18 The Art of Storytelling: Multi-Agent Generative AI for Dynamic Multimodal Narratives Samee Arif et.al. 2409.11261 link
2024-09-17 To What Extent do Open-loop and Feedback Nash Equilibria Diverge in General-Sum Linear Quadratic Dynamic Games? Chih-Yuan Chiu et.al. 2409.11257 null
2024-09-16 On interactive anisotropic walks in two dimensions generated from a three state opinion dynamics model Surajit Saha et.al. 2409.10413 null
2024-09-16 Reducing Leximin Fairness to Utilitarian Optimization Eden Hartman et.al. 2409.10395 null
2024-09-16 Decentralized and Asymmetric Multi-Agent Learning in Construction Sites Yakov Miron et.al. 2409.10375 null
2024-09-16 Instigating Cooperation among LLM Agents Using Adaptive Information Modulation Qiliang Chen et.al. 2409.10372 null
2024-09-16 2D or not 2D: How Does the Dimensionality of Gesture Representation Affect 3D Co-Speech Gesture Generation? Téo Guichoux et.al. 2409.10357 null
2024-09-16 Partial Ordering Bayesian Logistic Regression Model for Phase I Combination Trials and Computationally Efficient Approach to Operational Prior Specification Weishi Chen et.al. 2409.10352 link
2024-09-16 Cognitive Kernel: An Open-source Agent System towards Generalist Autopilots Hongming Zhang et.al. 2409.10277 link
2024-09-16 Synchronization-Based Cooperative Distributed Model Predictive Control Julius Beerwerth et.al. 2409.10215 null
2024-09-16 Maneuver Decision-Making with Trajectory Streams Prediction for Autonomous Vehicles Mais Jamal et.al. 2409.10165 null
2024-09-16 Multi-Agent Obstacle Avoidance using Velocity Obstacles and Control Barrier Functions Alejandro Sánchez Roncero et.al. 2409.10117 null