CV Arxiv Daily

Updated on 2026.04.03

Usage instructions: here

Agent

Publish Date	Title	Authors	PDF	Code
2025-07-23	DataWink: Reusing and Adapting SVG-based Visualization Examples with Large Multimodal Models	Liwenhan Xie et.al.	2507.17734	null
2025-07-23	BetterCheck: Towards Safeguarding VLMs for Automotive Perception Systems	Malsha Ashani Mahawatta Dona et.al.	2507.17722	null
2025-07-23	Symbiotic Agents: A Novel Paradigm for Trustworthy AGI-driven Networks	Ilias Chatzistefanidis et.al.	2507.17695	null
2025-07-23	Simulating multiple human perspectives in socio-ecological systems using large language models	Yongchao Zeng et.al.	2507.17680	null
2025-07-23	LTLZinc: a Benchmarking Framework for Continual Learning and Neuro-Symbolic Temporal Reasoning	Luca Salvatore Lorello et.al.	2507.17482	null
2025-07-23	ERMV: Editing 4D Robotic Multi-view images to enhance embodied agents	Chang Nie et.al.	2507.17462	null
2025-07-23	IndoorBEV: Joint Detection and Footprint Completion of Objects via Mask-based Prediction in Indoor Scenarios for Bird’s-Eye View Perception	Haichuan Li et.al.	2507.17445	null
2025-07-23	Fair Compromises in Participatory Budgeting: a Multi-Agent Deep Reinforcement Learning Approach	Hugh Adams et.al.	2507.17433	null
2025-07-23	CAPRI-CT: Causal Analysis and Predictive Reasoning for Image Quality Optimization in Computed Tomography	Sneha George Gnanakalavathy et.al.	2507.17420	null
2025-07-23	Residual Prophet Inequalities	Jose Correa et.al.	2507.17391	null
2025-07-22	ThinkAct: Vision-Language-Action Reasoning via Reinforced Visual Latent Planning	Chi-Pin Huang et.al.	2507.16815	null
2025-07-22	LingBench++: A Linguistically-Informed Benchmark and Reasoning Framework for Multi-Step and Cross-Cultural Inference with LLMs	Da-Chen Lian et.al.	2507.16809	null
2025-07-23	Agentar-Fin-R1: Enhancing Financial Intelligence through Domain Expertise, Training Efficiency, and Advanced Reasoning	Yanjun Zheng et.al.	2507.16802	null
2025-07-23	Test-Time-Matching: Decouple Personality, Memory, and Linguistic Style in LLM-based Role-Playing Language Agent	Xiaoyu Zhan et.al.	2507.16799	null
2025-07-22	Uncertainty-Aware Knowledge Transformers for Peer-to-Peer Energy Trading with Multi-Agent Reinforcement Learning	Mian Ibad Ali Shah et.al.	2507.16796	null
2025-07-22	Generalized non-reciprocal phase transitions in multipopulation systems	Cheyne Weis et.al.	2507.16763	null
2025-07-22	AI-enhanced conversational agents for personalized asthma support Factors for engagement, value and efficacy	Laura Moradbakhti et.al.	2507.16735	null
2025-07-23	Deliberative Searcher: Improving LLM Reliability via Reinforcement Learning with constraints	Zhenyun Yin et.al.	2507.16727	null
2025-07-22	RAVine: Reality-Aligned Evaluation for Agentic Search	Yilong Xu et.al.	2507.16725	null
2025-07-22	Screen2AX: Vision-Based Approach for Automatic macOS Accessibility Generation	Viktor Muryn et.al.	2507.16704	null
2025-07-21	LLM Economist: Large Population Models and Mechanism Design in Multi-Agent Generative Simulacra	Seth Karten et.al.	2507.15815	null
2025-07-21	Density control of multi-agent swarms via bio-inspired leader-follower plasticity	Gian Carlo Maffettone et.al.	2507.15781	null
2025-07-21	A Framework for Analyzing Abnormal Emergence in Service Ecosystems Through LLM-based Agent Intention Mining	Yifan Shen et.al.	2507.15770	null
2025-07-21	GasAgent: A Multi-Agent Framework for Automated Gas Optimization in Smart Contracts	Jingyi Zheng et.al.	2507.15761	null
2025-07-21	Towards physician-centered oversight of conversational diagnostic AI	Elahe Vedadi et.al.	2507.15743	null
2025-07-21	General Matching Games	Felipe Garrido-Lucero et.al.	2507.15737	null
2025-07-21	Competitive Algorithms for Cooperative Multi-Agent Ski-Rental Problems	Xuchuang Wang et.al.	2507.15727	null
2025-07-21	Agentic AI for autonomous anomaly management in complex systems	Reza Vatankhah Barenji et.al.	2507.15676	null
2025-07-21	BugScope: Learn to Find Bugs Like Human	Jinyao Guo et.al.	2507.15671	null
2025-07-21	Asynchronous Collective Tree Exploration: a Distributed Algorithm, and a new Lower Bound	Romain Cosson et.al.	2507.15658	null
2025-07-18	DPMT: Dual Process Multi-scale Theory of Mind Framework for Real-time Human-AI Collaboration	Xiyun Li et.al.	2507.14088	null
2025-07-18	Collaborative Rational Speech Act: Pragmatic Reasoning for Multi-Turn Dialog	Lautaro Estienne et.al.	2507.14063	null
2025-07-18	Well posedness and propagation of chaos for multi-agent models with strategies and diffusive effects	Alessandro Baldi et.al.	2507.14058	null
2025-07-18	Online MMS Allocation for Chores	Jiaxin Song et.al.	2507.14039	null
2025-07-18	Architecting Human-AI Cocreation for Technical Services – Interaction Modes and Contingency Factors	Jochen Wulf et.al.	2507.14034	null
2025-07-18	Byzantine-resilient federated online learning for Gaussian process regression	Xu Zhang et.al.	2507.14021	null
2025-07-18	DreamScene: 3D Gaussian-based End-to-end Text-to-3D Scene Generation	Haoran Li et.al.	2507.13985	null
2025-07-18	A Multi-Objective Optimization framework for Decentralized Learning with coordination constraints	Roberto Morales et.al.	2507.13983	null
2025-07-18	Bottom-up Domain-specific Superintelligence: A Reliable Knowledge Graph is What We Need	Bhishma Dedhia et.al.	2507.13966	null
2025-07-18	NeHMO: Neural Hamilton-Jacobi Reachability Learning for Decentralized Safe Multi-Agent Motion Planning	Qingyi Chen et.al.	2507.13940	null
2025-07-17	A Survey of Context Engineering for Large Language Models	Lingrui Mei et.al.	2507.13334	null
2025-07-17	N Bugs on a Circle	Josh Briley et.al.	2507.13333	null
2025-07-17	Multi-Agent Synergy-Driven Iterative Visual Narrative Synthesis	Wang Xi et.al.	2507.13285	null
2025-07-17	Analysis Theory of Data Economy: Dataization, Technological Progress and Dynamic General Equilibrium	Yongheng Hu et.al.	2507.13274	null
2025-07-17	RemVerse: Supporting Reminiscence Activities for Older Adults through AI-Assisted Virtual Reality	Ruohao Li et.al.	2507.13247	null
2025-07-17	GEMMAS: Graph-based Evaluation Metrics for Multi Agent Systems	Jisoo Lee et.al.	2507.13190	null
2025-07-17	Black Box Deployed – Functional Criteria for Artificial Moral Agents in the LLM Era	Matthew E. Brophy et.al.	2507.13175	null
2025-07-17	Aligning Humans and Robots via Reinforcement Learning from Implicit Human Feedback	Suzie Kim et.al.	2507.13171	null
2025-07-17	Prompt Injection 2.0: Hybrid AI Threats	Jeremy McHugh et.al.	2507.13169	null
2025-07-17	SE-VLN: A Self-Evolving Vision-Language Navigation Framework Based on Multimodal Large Language Models	Xiangyu Dong et.al.	2507.13152	null
2025-07-16	Advancing Retrieval-Augmented Generation for Structured Enterprise and Internal Data	Chandana Cheerla et.al.	2507.12425	null
2025-07-16	Modeling Feasible Locomotion of Nanobots for Cancer Detection and Treatment	Noble Harasha et.al.	2507.12400	null
2025-07-16	Beyond Single Models: Enhancing LLM Detection of Ambiguity in Requests through Debate	Ana Davila et.al.	2507.12370	null
2025-07-16	GitChameleon: Evaluating AI Code Generation Against Python Library Version Incompatibilities	Diganta Misra et.al.	2507.12367	null
2025-07-16	Social polarization promoted by sparse higher-order interactions	Hugo Pérez-Martínez et.al.	2507.12325	null
2025-07-17	Next-Gen Museum Guides: Autonomous Navigation and Visitor Interaction with an Agentic Robot	Luca Garello et.al.	2507.12273	null
2025-07-16	Infherno: End-to-end Agent-based FHIR Resource Synthesis from Free-form Clinical Notes	Johann Frei et.al.	2507.12261	null
2025-07-16	Toward a Behavioural Translation Style Space: Simulating the Temporal Dynamics of Affect, Behaviour, and Cognition in Human Translation Production	Michael Carl et.al.	2507.12208	null
2025-07-16	BenchRL-QAS: Benchmarking reinforcement learning algorithms for quantum architecture search	Azhar Ikhtiarudin et.al.	2507.12189	null
2025-07-16	Fast and Scalable Game-Theoretic Trajectory Planning with Intentional Uncertainties	Zhenmin Huang et.al.	2507.12174	null
2025-07-15	DrafterBench: Benchmarking Large Language Models for Tasks Automation in Civil Engineering	Yinsheng Li et.al.	2507.11527	null
2025-07-15	Opinion dynamics: Statistical physics and beyond	Michele Starnini et.al.	2507.11521	null
2025-07-15	AirLLM: Diffusion Policy-based Adaptive LoRA for Remote Fine-Tuning of LLM over the Air	Shiyi Yang et.al.	2507.11515	null
2025-07-15	On the Complexity of the Optimal Correlated Equilibria in Extensive-Form Games	Vincent Cheval et.al.	2507.11509	null
2025-07-15	LF: Online Multi-Robot Path Planning Meets Optimal Trajectory Control	Ajay Shankar et.al.	2507.11464	null
2025-07-15	EXAONE 4.0: Unified Large Language Models Integrating Non-reasoning and Reasoning Modes	LG AI Research et.al.	2507.11407	null
2025-07-15	From Production Logistics to Smart Manufacturing: The Vision for a New RoboCup Industrial League	Supun Dissanayaka et.al.	2507.11402	null
2025-07-15	Dr.Copilot: A Multi-Agent Prompt Optimized Assistant for Improving Patient-Doctor Communication in Romanian	Andrei Niculae et.al.	2507.11299	null
2025-07-15	Taming Uncertainty via Automation: Observing, Analyzing, and Optimizing Agentic AI Systems	Dany Moshkovich et.al.	2507.11277	null
2025-07-15	An Empirical Study of Multi-Agent RAG for Real-World University Admissions Counseling	Anh Nguyen-Duc et.al.	2507.11272	null
2025-07-14	EmbRACE-3K: Embodied Reasoning and Action in Complex Environments	Mingxian Lin et.al.	2507.10548	null
2025-07-14	Graph World Model	Tao Feng et.al.	2507.10539	null
2025-07-14	DeepResearch $^{\text{Eco}}$ : A Recursive Agentic Workflow for Complex Scientific Question Answering in Ecology	Jennifer D’Souza et.al.	2507.10522	null
2025-07-14	An Empirical Evaluation of AI-Powered Non-Player Characters’ Perceived Realism and Performance in Virtual Reality Environments	Mikko Korkiakoski et.al.	2507.10469	null
2025-07-14	Logic layer Prompt Control Injection (LPCI): A Novel Security Vulnerability Class in Agentic Systems	Hammad Atta et.al.	2507.10457	null
2025-07-14	Negative entropy and non-equilibrium Euclidean shell	Yang An et.al.	2507.10450	null
2025-07-14	Am I on the Right Track? What Can Predicted Query Performance Tell Us about the Search Behaviour of Agentic RAG	Fangzheng Tian et.al.	2507.10411	null
2025-07-14	Machine-Learning to Trust	Ran Spiegler et.al.	2507.10363	null
2025-07-14	Toolsuite for Implementing Multiagent Systems Based on Communication Protocols	Amit K. Chopra et.al.	2507.10324	null
2025-07-14	Prompt Informed Reinforcement Learning for Visual Coverage Path Planning	Venkat Margapuri et.al.	2507.10284	null
2025-07-11	NeuralOS: Towards Simulating Operating Systems via Neural Generative Models	Luke Rivard et.al.	2507.08800	null
2025-07-11	SPLASH! Sample-efficient Preference-based inverse reinforcement learning for Long-horizon Adversarial tasks from Suboptimal Hierarchical demonstrations	Peter Crowley et.al.	2507.08707	null
2025-07-11	elsciRL: Integrating Language Solutions into Reinforcement Learning Problem Settings	Philip Osborne et.al.	2507.08705	null
2025-07-11	Introspection of Thought Helps AI Agents	Haoran Sun et.al.	2507.08664	null
2025-07-11	Safe Deep Reinforcement Learning for Resource Allocation with Peak Age of Information Violation Guarantees	Berire Gunes Reyhan et.al.	2507.08653	null
2025-07-11	DatasetAgent: A Novel Multi-Agent System for Auto-Constructing Datasets from Real-World Images	Haoran Sun et.al.	2507.08648	null
2025-07-11	OnlineBEV: Recurrent Temporal Fusion in Bird’s Eye View Representations for Multi-Camera 3D Perception	Junho Koh et.al.	2507.08644	null
2025-07-11	Agentic Large Language Models for Conceptual Systems Engineering and Design	Soheyl Massoudi et.al.	2507.08619	null
2025-07-11	AgentsNet: Coordination and Collaborative Reasoning in Multi-Agent LLMs	Florian Grötschla et.al.	2507.08616	null
2025-07-11	Emergent Natural Language with Communication Games for Improving Image Captioning Capabilities without Additional Data	Parag Dutta et.al.	2507.08610	null
2025-07-10	PyVision: Agentic Vision with Dynamic Tooling	Shitian Zhao et.al.	2507.07998	null
2025-07-10	OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding	JingLi Lin et.al.	2507.07984	null
2025-07-10	Reinforcement Learning with Action Chunking	Qiyang Li et.al.	2507.07969	null
2025-07-10	MIRIX: Multi-Agent Memory System for LLM-Based Agents	Yu Wang et.al.	2507.07957	null
2025-07-10	Agentic Retrieval of Topics and Insights from Earnings Calls	Anant Gupta et.al.	2507.07906	null
2025-07-11	The Trust Fabric: Decentralized Interoperability and Economic Coordination for the Agentic Web	Sree Bhargavi Balija et.al.	2507.07901	null
2025-07-10	Automating MD simulations for Proteins using Large language Models: NAMD-Agent	Achuth Chandrasekhar et.al.	2507.07887	null
2025-07-10	DocCHA: Towards LLM-Augmented Interactive Online diagnosis System	Xinyi Liu et.al.	2507.07870	null
2025-07-10	“So, Tell Me About Your Policy…”: Distillation of interpretable policies from Deep Reinforcement Learning agents	Giovanni Dispoto et.al.	2507.07848	null
2025-07-10	Perceptual Distortions and Autonomous Representation Learning in a Minimal Robotic System	David Warutumo et.al.	2507.07845	null
2025-07-09	4KAgent: Agentic Any Image to 4K Super-Resolution	Yushen Zuo et.al.	2507.07105	null
2025-07-09	Graph-Based Complexity Metrics for Multi-Agent Curriculum Learning: A Validated Approach to Task Ordering in Cooperative Coordination Environments	Farhaan Ebadulla et.al.	2507.07074	null
2025-07-09	Robust signal decompositions on the circle	Aral Kose et.al.	2507.07007	null
2025-07-09	Federated Learning-based MARL for Strengthening Physical-Layer Security in B5G Networks	Deemah H. Tashman et.al.	2507.06997	null
2025-07-09	The User-Centric Geo-Experience: An LLM-Powered Framework for Enhanced Planning, Navigation, and Dynamic Adaptation	Jieren Deng et.al.	2507.06993	null
2025-07-09	Optimizing Cognitive Networks: Reinforcement Learning Meets Energy Harvesting Over Cascaded Channels	Deemah H. Tashman et.al.	2507.06981	null
2025-07-09	Exploring LLMs for Predicting Tutor Strategy and Student Outcomes in Dialogues	Fareya Ikram et.al.	2507.06910	null
2025-07-09	MIND: A Multi-agent Framework for Zero-shot Harmful Meme Detection	Ziyan Liu et.al.	2507.06908	null
2025-07-09	SemRaFiner: Panoptic Segmentation in Sparse and Noisy Radar Point Clouds	Matthias Zeller et.al.	2507.06906	null
2025-07-09	Designing Adaptive Algorithms Based on Reinforcement Learning for Dynamic Optimization of Sliding Window Size in Multi-Dimensional Data Streams	Abolfazl Zarghani et.al.	2507.06901	null
2025-07-08	Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving	Xiangru Tang et.al.	2507.06229	null
2025-07-08	Aligned Textual Scoring Rules	Yuxuan Lu et.al.	2507.06221	null
2025-07-08	Evaluation of Habitat Robotics using Large Language Models	William Li et.al.	2507.06157	null
2025-07-08	OpenAgentSafety: A Comprehensive Framework for Evaluating Real-World AI Agent Safety	Sanidhya Vijayvargiya et.al.	2507.06134	null
2025-07-08	A Directed Lazy Random Walk Model to Three-Way Dynamic Matching Problem	Souvik Roy et.al.	2507.06126	null
2025-07-08	On Lockean beliefs that are deductively closed and minimal change	Tommaso Flaminio et.al.	2507.06042	null
2025-07-08	Conditional Multi-Stage Failure Recovery for Embodied Agents	Youmna Farag et.al.	2507.06016	null
2025-07-08	From General Relation Patterns to Task-Specific Decision-Making in Continual Multi-Agent Coordination	Chang Yao et.al.	2507.06004	null
2025-07-08	Multi-Agent Debate Strategies to Enhance Requirements Engineering with Large Language Models	Marc Oriol et.al.	2507.05981	null
2025-07-08	CogniPlay: a work-in-progress Human-like model for General Game Playing	Aloïs Rautureau et.al.	2507.05868	null
2025-07-07	Spatio-Temporal LLM: Reasoning about Environments and Actions	Haozhen Zheng et.al.	2507.05258	null
2025-07-07	Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions	Yuanzhe Hu et.al.	2507.05257	null
2025-07-07	From Marginal to Joint Predictions: Evaluating Scene-Consistent Trajectory Prediction Approaches for Automated Driving	Fabian Konstantinidis et.al.	2507.05254	null
2025-07-07	Action Space Reduction Strategies for Reinforcement Learning in Autonomous Driving	Elahe Delavari et.al.	2507.05251	null
2025-07-07	Modeling Latent Partner Strategies for Adaptive Zero-Shot Human-Agent Collaboration	Benjamin Li et.al.	2507.05244	null
2025-07-08	SciMaster: Towards General-Purpose Scientific AI Agents, Part I. X-Master as Foundation: Can We Lead on Humanity’s Last Exam?	Jingyi Chai et.al.	2507.05241	null
2025-07-07	StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling	Meng Wei et.al.	2507.05240	null
2025-07-08	MedGemma Technical Report	Andrew Sellergren et.al.	2507.05201	null
2025-07-07	CREW-WILDFIRE: Benchmarking Agentic Multi-Agent Collaborations at Scale	Jonathan Hyun et.al.	2507.05178	null
2025-07-07	Vector Cost Bimatrix Games with Applications to Autonomous Racing	Benjamin R. Toaz et.al.	2507.05171	null
2025-07-03	Establishing Best Practices for Building Rigorous Agentic Benchmarks	Yuxuan Zhu et.al.	2507.02825	null
2025-07-03	Moral Responsibility or Obedience: What Do We Want from AI?	Joseph Boland et.al.	2507.02788	null
2025-07-06	KERAP: A Knowledge-Enhanced Reasoning Approach for Accurate Zero-shot Diagnosis Prediction Using Multi-agent LLMs	Yuzhang Xie et.al.	2507.02773	null
2025-07-03	Knowledge Protocol Engineering: A New Paradigm for AI in Domain-Specific Knowledge Work	Guangwei Zhang et.al.	2507.02760	null
2025-07-03	Defining and classifying models of groups: The social ontology of higher-order networks	Jonathan St-Onge et.al.	2507.02758	null
2025-07-03	Multi-agent Auditory Scene Analysis	Caleb Rascon et.al.	2507.02755	null
2025-07-03	Meta SecAlign: A Secure Foundation LLM Against Prompt Injection Attacks	Sizhe Chen et.al.	2507.02735	null
2025-07-03	Bourbaki: Self-Generated and Goal-Conditioned MDPs for Theorem Proving	Matthieu Zimmer et.al.	2507.02726	null
2025-07-03	A Forget-and-Grow Strategy for Deep Reinforcement Learning Scaling in Continuous Control	Zilin Kang et.al.	2507.02712	null
2025-07-03	Fluid Democracy in Federated Data Aggregation	Aditya Vema Reddy Kesari et.al.	2507.02710	null
2025-07-02	The Thin Line Between Comprehension and Persuasion in LLMs	Adrian de Wynter et.al.	2507.01936	null
2025-07-03	Decision-Oriented Text Evaluation	Yu-Shiang Huang et.al.	2507.01923	null
2025-07-02	An in-silico lung phantom to assess the performance of pulmonary artery segmentation using angiogram	Sunder Neelakantan et.al.	2507.01867	null
2025-07-02	Bridging UI Design and chatbot Interactions: Applying Form-Based Principles to Conversational Agents	Sanjay Krishna Anbalagan et.al.	2507.01862	null
2025-07-02	TD-MPC-Opt: Distilling Model-Based Multi-Task Reinforcement Learning Agents	Dmytro Kuzmenko et.al.	2507.01823	null
2025-07-02	AMD: Adaptive Momentum and Decoupled Contrastive Learning Framework for Robust Long-Tail Trajectory Prediction	Bin Rao et.al.	2507.01801	null
2025-07-02	ECCV 2024 W-CODA: 1st Workshop on Multimodal Perception and Comprehension of Corner Cases in Autonomous Driving	Kai Chen et.al.	2507.01735	null
2025-07-02	Agent Ideate: A Framework for Product Idea Generation from Patents Using Agentic AI	Gopichand Kanumolu et.al.	2507.01717	null
2025-07-02	Using Machine Learning to Compute Constrained Optimal Carbon Tax Rules	Felix Kübler et.al.	2507.01704	null
2025-07-02	AdamMeme: Adaptively Probe the Reasoning Capacity of Multimodal Large Language Models on Harmfulness	Zixin Chen et.al.	2507.01702	null
2025-07-01	SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning	Bo Liu et.al.	2506.24119	null
2025-06-30	Protocol insecurity with finitely many sessions and XOR	R Ramanujam et.al.	2506.24072	null
2025-06-30	Agent.xpu: Efficient Scheduling of Agentic LLM Workloads on Heterogeneous SoC	Xinming Wei et.al.	2506.24045	null
2025-06-30	Ella: Embodied Social Agents with Lifelong Memory	Hongxin Zhang et.al.	2506.24019	null
2025-06-30	Auto-TA: Towards Scalable Automated Thematic Analysis (TA) via Multi-Agent Large Language Models with Reinforcement Learning	Seungjun Yi et.al.	2506.23998	null
2025-06-30	Harnessing AI Agents to Advance Research on Refugee Child Mental Health	Aditya Shrivastava et.al.	2506.23992	null
2025-06-30	LLM Agents Are the Antidote to Walled Gardens	Samuele Marro et.al.	2506.23978	null
2025-06-30	Flexible Moral Hazard Problems with Adverse Selection	Siwen Liu et.al.	2506.23954	null
2025-06-30	Performance of LLMs on Stochastic Modeling Operations Research Problems: From Theory to Practice	Akshit Kumar et.al.	2506.23924	null
2025-06-30	A Survey on Autonomy-Induced Security Risks in Large Model-Based Agents	Hang Su et.al.	2506.23844	null
2025-06-27	The Automated LLM Speedrunning Benchmark: Reproducing NanoGPT Improvements	Bingchen Zhao et.al.	2506.22419	null
2025-06-27	Why Are Parsing Actions for Understanding Message Hierarchies Not Random?	Daichi Kato et.al.	2506.22366	null
2025-06-27	Reinforcement Learning with Physics-Informed Symbolic Program Priors for Zero-Shot Wireless Indoor Navigation	Tao Li et.al.	2506.22365	null
2025-06-27	Embodied AI Agents: Modeling the World	Pascale Fung et.al.	2506.22355	null
2025-06-27	Agent-based modeling and the sociology of money: some suggestions for refining monetary theory using social simulation	Eduardo Coltre Ferraciolli et.al.	2506.22318	null
2025-06-27	Artificial Intelligent Disobedience: Rethinking the Agency of Our Artificial Teammates	Reuth Mirsky et.al.	2506.22276	null
2025-06-27	Exploring Modularity of Agentic Systems for Drug Discovery	Laura van Weesep et.al.	2506.22189	null
2025-06-27	Autonomic Microservice Management via Agentic AI and MAPE-K Integration	Matteo Esposito et.al.	2506.22185	null
2025-06-27	A Different Approach to AI Safety: Proceedings from the Columbia Convening on Openness in Artificial Intelligence and AI Safety	Camille François et.al.	2506.22183	null
2025-06-27	ASVSim (AirSim for Surface Vehicles): A High-Fidelity Simulation Framework for Autonomous Surface Vehicle Research	Bavo Lesy et.al.	2506.22174	null
2025-06-26	Whole-Body Conditioned Egocentric Video Prediction	Yutong Bai et.al.	2506.21552	null
2025-06-26	PsyLite Technical Report	Fangjun Ding et.al.	2506.21536	null
2025-06-26	Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge	Boyu Gou et.al.	2506.21506	null
2025-06-26	From multi-allocations to allocations, with subadditive valuations	Uriel Feige et.al.	2506.21493	null
2025-06-26	Ad-Hoc Human-AI Coordination Challenge	Tin Dizdarević et.al.	2506.21490	null
2025-06-26	Reinforcement Learning for Optimal Control of Spin Magnetometers	Logan W. Cooke et.al.	2506.21475	null
2025-06-26	Agent-RewardBench: Towards a Unified Benchmark for Reward Modeling across Perception, Planning, and Safety in Real-World Multimodal Agents	Tianyi Men et.al.	2506.21252	null
2025-06-26	Dynamic Risk-Aware MPPI for Mobile Robots in Crowds via Efficient Monte Carlo Approximations	Elia Trevisan et.al.	2506.21205	null
2025-06-26	Artificial Delegates Resolve Fairness Issues in Perpetual Voting with Partial Turnout	Apurva Shah et.al.	2506.21186	null
2025-06-26	Performance improvement of spatial semantic segmentation with enriched audio features and agent-based error correction for DCASE 2025 Challenge Task 4	Jongyeon Park et.al.	2506.21174	null
2025-06-25	MMSearch-R1: Incentivizing LMMs to Search	Jinming Wu et.al.	2506.20670	null
2025-06-25	The Decrypto Benchmark for Multi-Agent Reasoning and Theory of Mind	Andrei Lupu et.al.	2506.20664	null
2025-06-25	Memento: Note-Taking for Your Future Self	Chao Wan et.al.	2506.20642	null
2025-06-25	Towards Community-Driven Agents for Machine Learning Engineering	Sijie Li et.al.	2506.20640	null
2025-06-25	Model Editing as a Double-Edged Sword: Steering Agent Ethical Behavior Toward Beneficence or Harm	Baixiang Huang et.al.	2506.20606	null
2025-06-25	Fine-Tuning and Prompt Engineering of LLMs, for the Creation of Multi-Agent AI for Addressing Sustainable Protein Production Challenges	Alexander D. Kalian et.al.	2506.20598	null
2025-06-25	An Explicit Solution for the Problem of Optimal Investment with Random Endowment	Michael Donisch et.al.	2506.20506	null
2025-06-25	Engineering Sentience	Konstantin Demin et.al.	2506.20504	null
2025-06-25	Opinion Dynamics with Highly Oscillating Opinions	Víctor A. Vargas-Pérez et.al.	2506.20472	null
2025-06-25	An Agentic System for Rare Disease Diagnosis with Traceable Reasoning	Weike Zhao et.al.	2506.20430	null
2025-06-24	JoyAgents-R1: Joint Evolution Dynamics for Versatile Multi-LLM Agents with Reinforcement Learning	Ai Han et.al.	2506.19846	null
2025-06-24	MAM: Modular Multi-Agent Framework for Multi-Modal Medical Diagnosis via Role-Specialized Collaboration	Yucheng Zhou et.al.	2506.19835	null
2025-06-24	Curating art exhibitions using machine learning	Eurico Covas et.al.	2506.19813	null
2025-06-24	LLM-Based Social Simulations Require a Boundary	Zengqing Wu et.al.	2506.19806	null
2025-06-24	Learning Task Belief Similarity with Latent Dynamics for Meta-Reinforcement Learning	Menglong Zhang et.al.	2506.19785	null
2025-06-24	SAGE: Strategy-Adaptive Generation Engine for Query Rewriting	Teng Wang et.al.	2506.19783	null
2025-06-24	A Survey of Multi-sensor Fusion Perception for Embodied AI: Background, Methods, Challenges and Prospects	Shulan Ruan et.al.	2506.19769	null
2025-06-24	From Reproduction to Replication: Evaluating Research Agents with Progressive Code Masking	Gyeongwon James Kim et.al.	2506.19724	null
2025-06-24	A Survey of LLM-Driven AI Agent Communication: Protocols, Security Risks, and Defense Countermeasures	Dezhang Kong et.al.	2506.19676	null
2025-06-24	How trust networks shape students’ opinions about the proficiency of artificially intelligent assistants	Yutong Bu et.al.	2506.19655	null
2025-06-23	Audit & Repair: An Agentic Framework for Consistent Story Visualization in Text-to-Image Diffusion Models	Kiymet Akdemir et.al.	2506.18900	null
2025-06-23	Steering Conceptual Bias via Transformer Latent-Subspace Activation	Vansh Sharma et.al.	2506.18887	null
2025-06-23	GRAND-SLAM: Local Optimization for Globally Consistent Large-Scale Multi-Agent Gaussian SLAM	Annika Thomas et.al.	2506.18885	null
2025-06-23	Broad Validity of the First-Order Approach in Moral Hazard	Eduardo Azevedo et.al.	2506.18873	null
2025-06-23	Offline Goal-Conditioned Reinforcement Learning with Projective Quasimetric Planning	Anthony Kobanda et.al.	2506.18847	null
2025-06-23	Understanding Software Engineering Agents: A Study of Thought-Action-Result Trajectories	Islem Bouzenia et.al.	2506.18824	null
2025-06-23	Multi-Agent Online Control with Adversarial Disturbances	Anas Barakat et.al.	2506.18814	null
2025-06-23	Fair Allocation with Money: What is Your Objective?	Noga Klein Elmalem et.al.	2506.18794	null
2025-06-23	TRIZ Agents: A Multi-Agent LLM Approach for TRIZ-Based Innovation	Kamil Szczepanik et.al.	2506.18783	null
2025-06-23	Temporal Neural Cellular Automata: Application to modeling of contrast enhancement in breast MRI	Daniel M. Lang et.al.	2506.18720	null
2025-06-20	VLN-R1: Vision-Language Navigation via Reinforcement Fine-Tuning	Zhangyang Qi et.al.	2506.17221	null
2025-06-20	Long-term Traffic Simulation with Interleaved Autoregressive Motion and Scenario Generation	Xiuyu Yang et.al.	2506.17213	link
2025-06-20	Dissecting the SWE-Bench Leaderboards: Profiling Submitters and Architectures of LLM- and Agent-Based Repair Systems	Matias Martinez et.al.	2506.17208	null
2025-06-20	Towards AI Search Paradigm	Yuchen Li et.al.	2506.17188	null
2025-06-20	Capturing Misalignment	Pierfrancesco Guarino et.al.	2506.17176	null
2025-06-20	A Note on Proper Relational Structures	Adam Bjorndahl et.al.	2506.17142	null
2025-06-20	When Can Model-Free Reinforcement Learning be Enough for Thinking?	Josiah P. Hanna et.al.	2506.17124	null
2025-06-20	A general multi-stratum model for a nanofunctionalized releasing capsule: a computational study	Elia Onofri et.al.	2506.17078	null
2025-06-20	Behavior Driven Development for 3D Games	Fernando Pastor Ricós et.al.	2506.17057	null
2025-06-20	Scalable and Reliable Multi-agent Reinforcement Learning for Traffic Assignment	Leizhen Wang et.al.	2506.17029	null
2025-06-20	Embodied Web Agents: Bridging Physical-Digital Realms for Integrated Agent Intelligence	Yining Hong et.al.	2506.15677	null
2025-06-18	Leaky Thoughts: Large Reasoning Models Are Not Private Thinkers	Tommaso Green et.al.	2506.15674	link
2025-06-18	SwarmAgentic: Towards Fully Automated Agentic System Generation via Swarm Intelligence	Yao Zhang et.al.	2506.15672	null
2025-06-18	PhishDebate: An LLM-Based Multi-Agent Framework for Phishing Website Detection	Wenhao Li et.al.	2506.15656	null
2025-06-18	FindingDory: A Benchmark to Evaluate Memory in Embodied Agents	Karmesh Yadav et.al.	2506.15635	null
2025-06-18	The Effect of State Representation on LLM Agent Behavior in Dynamic Routing Games	Lyle Goodyear et.al.	2506.15624	null
2025-06-18	Multi-Agent, Multi-Scale Systems with the Koopman Operator	Craig Bakker et.al.	2506.15589	null
2025-06-18	Learning to flock in open space by avoiding collisions and staying together	Martino Brambati et.al.	2506.15587	null
2025-06-18	Managing Complex Failure Analysis Workflows with LLM-based Reasoning and Acting Agents	Aline Dobrovsky et.al.	2506.15567	null
2025-06-18	Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning	Roger Creus Castanyer et.al.	2506.15544	link
2025-06-17	RobotSmith: Generative Robotic Tool Design for Acquisition of Complex Manipulation Skills	Chunru Lin et.al.	2506.14763	null
2025-06-17	Swarm-STL: A Framework for Motion Planning in Large-Scale, Multi-Swarm Systems	Shiyu Cheng et.al.	2506.14749	null
2025-06-17	AgentDistill: Training-Free Agent Distillation with Generalizable MCP Boxes	Jiahao Qiu et.al.	2506.14728	null
2025-06-17	Linear Planar 3-SAT and Its Applications in Planning	Victorien Desbois et.al.	2506.14713	null
2025-06-17	AGENTSAFE: Benchmarking the Safety of Embodied Agents on Hazardous Instructions	Aishan Liu et.al.	2506.14697	null
2025-06-17	Factor-Graph-Based Passive Acoustic Navigation for Decentralized Cooperative Localization Using Bearing Elevation Depth Difference	Kalliyan Velasco et.al.	2506.14690	null
2025-06-17	Unified Software Engineering agent as AI Software Engineer	Leonhard Applis et.al.	2506.14683	null
2025-06-17	StreetLens: Enabling Human-Centered AI Agents for Neighborhood Assessment from Street View Imagery	Jina Kim et.al.	2506.14670	null
2025-06-17	SENIOR: Efficient Query Selection and Preference-Guided Exploration in Preference-based Reinforcement Learning	Hexian Ni et.al.	2506.14648	null
2025-06-17	GenerationPrograms: Fine-grained Attribution with Executable Programs	David Wan et.al.	2506.14580	link
2025-06-16	MARCO: Hardware-Aware Neural Architecture Search for Edge Devices with Multi-Agent Reinforcement Learning and Conformal Prediction Filtering	Arya Fayyazi et.al.	2506.13755	null
2025-06-16	PB $^2$ : Preference Space Exploration via Population-Based Methods in Preference-Based Reinforcement Learning	Brahim Driss et.al.	2506.13741	null
2025-06-16	The Courage to Stop: Overcoming Sunk Cost Fallacy in Deep Reinforcement Learning	Jiashun Liu et.al.	2506.13672	null
2025-06-16	We Should Identify and Mitigate Third-Party Safety Risks in MCP-Powered Agent Systems	Junfeng Fang et.al.	2506.13666	link
2025-06-16	Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning	Shulin Tian et.al.	2506.13654	null
2025-06-16	xbench: Tracking Agents Productivity Scaling with Profession-Aligned Real-World Evaluations	Kaiyuan Chen et.al.	2506.13651	null
2025-06-16	Deceptive Path Planning: A Bayesian Game Approach	Violetta Rostobaya et.al.	2506.13650	null
2025-06-16	CAMS: A CityGPT-Powered Agentic Framework for Urban Human Mobility Simulation	Yuwei Du et.al.	2506.13599	null
2025-06-16	Agent Capability Negotiation and Binding Protocol (ACNBP)	Ken Huang et.al.	2506.13590	link
2025-06-16	Non-exchangeable mean-field theory for adaptive weights: propagation of chaos and graphon sampling lemma	Datong Zhou et.al.	2506.13587	null
2025-06-13	Affogato: Learning Open-Vocabulary Affordance Grounding with Automated Data Generation at Scale	Junha Lee et.al.	2506.12009	null
2025-06-13	Upgrade or Switch: Do We Need a New Registry Architecture for the Internet of AI Agents?	Ramesh Raskar et.al.	2506.12003	null
2025-06-13	Self-Regulating Cars: Automating Traffic Control in Free Flow Road Networks	Ankit Bhardwaj et.al.	2506.11973	null
2025-06-13	Visual Pre-Training on Unlabeled Images using Reinforcement Learning	Dibya Ghosh et.al.	2506.11967	null
2025-06-13	Automated Treatment Planning for Interstitial HDR Brachytherapy for Locally Advanced Cervical Cancer using Deep Reinforcement Learning	Mohammadamin Moradi et.al.	2506.11957	null
2025-06-13	Secure API-Driven Research Automation to Accelerate Scientific Discovery	Tyler J. Skluzacek et.al.	2506.11950	null
2025-06-13	Breaking Habits: On the Role of the Advantage Function in Learning Causal State Representations	Miguel Suau et.al.	2506.11912	null
2025-06-13	Palpation Alters Auditory Pain Expressions with Gender-Specific Variations in Robopatients	Chapa Sirithunge et.al.	2506.11906	null
2025-06-13	An Explainable AI Framework for Dynamic Resource Management in Vehicular Network Slicing	Haochen Sun et.al.	2506.11882	null
2025-06-13	Your Ride, Your Rules: Psychology and Cognition Enabled Automated Driving Systems	Zhipeng Bao et.al.	2506.11842	null
2025-06-12	AutoMind: Adaptive Knowledgeable Agent for Automated Data Science	Yixin Ou et.al.	2506.10974	link
2025-06-12	Eye, Robot: Learning to Look to Act with a BC-RL Perception-Action Loop	Justin Kerr et.al.	2506.10968	null
2025-06-12	SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks	Lianghong Guo et.al.	2506.10954	link
2025-06-12	Build the web for agents, not agents for the web	Xing Han Lù et.al.	2506.10953	null
2025-06-12	Monitoring Decomposition Attacks in LLMs with Lightweight Sequential Monitors	Chen Yueh-Han et.al.	2506.10949	link
2025-06-12	Execution Guided Line-by-Line Code Generation	Boaz Lavon et.al.	2506.10948	link
2025-06-12	Dynamic Epistemic Friction in Dialogue	Timothy Obiso et.al.	2506.10934	null
2025-06-12	Agentic Semantic Control for Autonomous Wireless Space Networks: Extending Space-O-RAN with MCP-Driven Distributed Intelligence	Eduardo Baena et.al.	2506.10925	null
2025-06-12	Prediction and control of geometry-induced nematic order in growing multicellular systems	Lukas Hupe et.al.	2506.10867	null
2025-06-12	CIIR@LiveRAG 2025: Optimizing Multi-Agent Retrieval Augmented Generation through Self-Training	Alireza Salemi et.al.	2506.10844	link
2025-06-11	Flipping Against All Odds: Reducing LLM Coin Flip Bias via Verbalized Rejection Sampling	Tim Z. Xiao et.al.	2506.09998	null
2025-06-11	SRLAgent: Enhancing Self-Regulated Learning Skills through Gamification and LLM Assistance	Wentao Ge et.al.	2506.09968	null
2025-06-11	The Sample Complexity of Online Strategic Decision Making with Information Asymmetry and Knowledge Transportability	Jiachen Hu et.al.	2506.09940	null
2025-06-11	On the Linear Programming Model for Dynamic Stochastic Matching and Its Application on Pricing	Junlin Chen et.al.	2506.09924	null
2025-06-11	PersonaLens: A Benchmark for Personalization Evaluation in Conversational AI Assistants	Zheng Zhao et.al.	2506.09902	link
2025-06-11	“What are my options?”: Explaining RL Agents with Diverse Near-Optimal Alternatives (Extended)	Noel Brindise et.al.	2506.09901	null
2025-06-11	OctoNav: Towards Generalist Embodied Navigation	Chen Gao et.al.	2506.09839	null
2025-06-11	Automatic Treatment Planning using Reinforcement Learning for High-dose-rate Prostate Brachytherapy	Tonghe Wang et.al.	2506.09805	null
2025-06-11	Delegations as Adaptive Representation Patterns: Rethinking Influence in Liquid Democracy	Davide Grossi et.al.	2506.09789	null
2025-06-11	Intelligent Design 4.0: Paradigm Evolution Toward the Agentic AI Era	Shuo Jiang et.al.	2506.09755	null
2025-06-10	ALE-Bench: A Benchmark for Long-Horizon Objective-Driven Algorithm Engineering	Yuki Imajuku et.al.	2506.09050	link
2025-06-10	VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning	Li Kang et.al.	2506.09049	null
2025-06-10	Agentic Neural Networks: Self-Evolving Multi-Agent Systems via Textual Backpropagation	Xiaowen Ma et.al.	2506.09046	null
2025-06-10	The Decoupled Risk Landscape in Performative Prediction	Javier Sanguino et.al.	2506.09044	null
2025-06-10	Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Scheduling System	Yuan Guo et.al.	2506.08972	null
2025-06-10	Towards Robust Deep Reinforcement Learning against Environmental State Perturbation	Chenxu Wang et.al.	2506.08961	null
2025-06-10	What Limits Virtual Agent Application? OmniBench: A Scalable Multi-Dimensional Benchmark for Essential Virtual Agent Capabilities	Wendong Bu et.al.	2506.08933	null
2025-06-10	Enhancing generalizability of model discovery across parameter space with multi-experiment equation learning (ME-EQL)	Maria-Veronica Ciocanel et.al.	2506.08916	link
2025-06-10	Intention-Conditioned Flow Occupancy Models	Chongyi Zheng et.al.	2506.08902	link
2025-06-10	Pairwise similarity method for majority domination problem	N. I. Shushko et.al.	2506.08886	null
2025-06-09	GUI-Reflection: Empowering Multimodal GUI Models with Self-Reflection Behavior	Penghao Wu et.al.	2506.08012	null
2025-06-09	Dreamland: Controllable World Creation with Simulator and Generative Models	Sicheng Mo et.al.	2506.08006	null
2025-06-09	Supporting Construction Worker Well-Being with a Multi-Agent Conversational AI System	Fan Yang et.al.	2506.07997	null
2025-06-09	$τ^2$ -Bench: Evaluating Conversational Agents in a Dual-Control Environment	Victor Barres et.al.	2506.07982	link
2025-06-09	Realistic Urban Traffic Generator using Decentralized Federated Learning for the SUMO simulator	Alberto Bazán-Guillén et.al.	2506.07980	null
2025-06-10	Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction	Junhong Shen et.al.	2506.07976	link
2025-06-09	HeuriGym: An Agentic Benchmark for LLM-Crafted Heuristics in Combinatorial Optimization	Hongzheng Chen et.al.	2506.07972	link
2025-06-09	Diffusion of Responsibility in Collective Decision Making	Pavel Naumov et.al.	2506.07935	null
2025-06-09	LUCIFER: Language Understanding and Context-Infused Framework for Exploration and Behavior Refinement	Dimitris Panagopoulos et.al.	2506.07915	null
2025-06-09	A distributed motion planning approach to cooperative underwater acoustic source tracking and pursuit	Andrea Tiranti et.al.	2506.07877	null
2025-06-06	PersonaAgent: When Large Language Model Agents Meet Personalization at Test Time	Weizhi Zhang et.al.	2506.06254	null
2025-06-06	Longer Lists Yield Better Matchings	Yuri Faenza et.al.	2506.06217	null
2025-06-06	Can Theoretical Physics Research Benefit from Language Agents?	Sirui Lu et.al.	2506.06214	null
2025-06-06	A Theoretical Study of (Hyper) Self-Attention through the Lens of Interactions: Representation, Training, Generalization	Muhammed Ustaomeroglu et.al.	2506.06179	null
2025-06-06	Does It Run and Is That Enough? Revisiting Text-to-Chart Generation with a Multi-Agent Approach	James Ford et.al.	2506.06175	null
2025-06-06	The Lock-in Hypothesis: Stagnation by Algorithm	Tianyi Alex Qiu et.al.	2506.06166	null
2025-06-06	(AI peers) are people learning from the same standpoint: Perception of AI characters in a Collaborative Science Investigation	Eunhye Grace Ko et.al.	2506.06165	null
2025-06-06	Personalized Large Language Models Can Increase the Belief Accuracy of Social Networks	Adiba Mahbub Proma et.al.	2506.06153	null
2025-06-06	CCLSTM: Coupled Convolutional Long-Short Term Memory Network for Occupancy Flow Forecasting	Peter Lengyel et.al.	2506.06128	null
2025-06-06	Reinforcement Learning Optimization for Large-Scale Learning: An Efficient and User-Friendly Scaling Library	Weixun Wang et.al.	2506.06122	null
2025-06-05	Time to Talk: LLM Agents for Asynchronous Group Communication in Mafia Games	Niv Eckhaus et.al.	2506.05309	link
2025-06-05	ProRefine: Inference-time Prompt Refinement with Textual Feedback	Deepak Pandita et.al.	2506.05305	null
2025-06-05	Control Tax: The Price of Keeping AI in Check	Mikhail Terekhov et.al.	2506.05296	null
2025-06-05	A Smooth Sea Never Made a Skilled $\texttt{SAILOR}$ : Robust Imitation via Learning to Search	Arnav Kumar Jain et.al.	2506.05294	link
2025-06-05	Tight analyses of first-order methods with error feedback	Daniel Berg Thomsen et.al.	2506.05271	link
2025-06-06	Teaming in the AI Era: AI-Augmented Frameworks for Forming, Simulating, and Optimizing Human Teams	Mohammed Almutairi et.al.	2506.05265	null
2025-06-05	Conservative classifiers do consistently well with improving agents: characterizing statistical and online learning	Dravyansh Sharma et.al.	2506.05252	null
2025-06-05	Towards Language-Augmented Multi-Agent Deep Reinforcement Learning	Maxime Toquebiau et.al.	2506.05236	null
2025-06-05	A Framework for Ethical Judgment of Smart City Applications	Weichen Shi et.al.	2506.05172	null
2025-06-05	An emergence-oriented approach to cyclic pursuit	Zhaozhan Yao et.al.	2506.05157	null
2025-06-04	OWMM-Agent: Open World Mobile Manipulation With Multi-modal Agentic Data Synthesis	Junting Chen et.al.	2506.04217	link
2025-06-04	Thinking Beyond Visibility: A Near-Optimal Policy Framework for Locally Interdependent Multi-Agent MDPs	Alex DeWeese et.al.	2506.04215	null
2025-06-04	TracLLM: A Generic Framework for Attributing Long Context LLMs	Yanting Wang et.al.	2506.04202	link
2025-06-04	MACS: Multi-Agent Reinforcement Learning for Optimization of Crystal Structures	Elena Zamaraeva et.al.	2506.04195	null
2025-06-04	SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models	Yuhao Wu et.al.	2506.04180	null
2025-06-04	A primal-dual price-optimization method for computing equilibrium prices in mean-field games models	Xu Wang et.al.	2506.04169	link
2025-06-04	Image Editing As Programs with Diffusion Models	Yujia Hu et.al.	2506.04158	null
2025-06-05	macOSWorld: A Multilingual Interactive Benchmark for GUI Agents	Pei Yang et.al.	2506.04135	link
2025-06-04	TRiSM for Agentic AI: A Review of Trust, Risk, and Security Management in LLM-based Agentic Multi-Agent Systems	Shaina Raza et.al.	2506.04133	null
2025-06-04	CLAIM: An Intent-Driven Multi-Agent Framework for Analyzing Manipulation in Courtroom Dialogues	Disha Sheshanarayana et.al.	2506.04131	null
2025-06-03	GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents	Qianhui Wu et.al.	2506.03143	null
2025-06-03	Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning	Yinjie Wang et.al.	2506.03136	link
2025-06-03	Designing Algorithmic Delegates: The Role of Indistinguishability in Human-AI Handoff	Sophie Greenwood et.al.	2506.03102	null
2025-06-03	EgoVLM: Policy Optimization for Egocentric Video Understanding	Ashwin Vinod et.al.	2506.03097	link
2025-06-03	DPO Learning with LLMs-Judge Signal for Computer Use Agents	Man Luo et.al.	2506.03095	null
2025-06-03	Provable Reinforcement Learning from Human Feedback with an Unknown Link Function	Qining Zhang et.al.	2506.03066	null
2025-06-03	MAEBE: Multi-Agent Emergent Behavior Framework	Sinem Erisken et.al.	2506.03053	null
2025-06-03	EDEN: Entorhinal Driven Egocentric Navigation Toward Robotic Deployment	Mikolaj Walczak et.al.	2506.03046	null
2025-06-03	Towards Analyzing and Understanding the Limitations of VAPO: A Theoretical Perspective	Jintian Shao et.al.	2506.03038	null
2025-06-03	TestAgent: An Adaptive and Intelligent Expert for Human Assessment	Junhao Yu et.al.	2506.03032	null
2025-05-30	Open CaptchaWorld: A Comprehensive Web-based Platform for Testing and Benchmarking Multimodal LLM Agents	Yaxin Luo et.al.	2505.24878	link
2025-05-30	Agent-X: Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic Tasks	Tajamul Ashraf et.al.	2505.24876	link
2025-05-30	VideoCAD: A Large-Scale Video Dataset for Learning UI Interactions and 3D Reasoning from CAD Software	Brandon Man et.al.	2505.24838	link
2025-05-30	Draw ALL Your Imagine: A Holistic Benchmark and Agent Framework for Complex Instruction-based Image Generation	Yucheng Zhou et.al.	2505.24787	link
2025-06-02	EXP-Bench: Can AI Conduct AI Research Experiments?	Patrick Tser Jern Kon et.al.	2505.24785	link
2025-05-30	Emergent Dynamics of Active Systems on Curved Environments	Euan D. Mackay et.al.	2505.24730	null
2025-05-30	CoRet: Improved Retriever for Code Editing	Fabio Fehr et.al.	2505.24715	null
2025-05-30	Causal-aware Large Language Models: Enhancing Decision-Making Through Learning, Adapting and Acting	Wei Chen et.al.	2505.24710	link
2025-05-30	Towards a unified user modeling language for engineering human centered AI systems	Aaron Conrardy et.al.	2505.24697	null
2025-05-30	Multiple LLM Agents Debate for Equitable Cultural Alignment	Dayeon Ki et.al.	2505.24671	link
2025-05-29	From Chat Logs to Collective Insights: Aggregative Question Answering	Wentao Zhang et.al.	2505.23765	null
2025-05-29	ZeroGUI: Automating Online GUI Learning at Zero Human Cost	Chenyu Yang et.al.	2505.23762	link
2025-05-29	ThinkGeo: Evaluating Tool-Augmented Agents for Remote Sensing Tasks	Akashah Shabbir et.al.	2505.23752	link
2025-05-29	ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering	Zexi Liu et.al.	2505.23723	link
2025-05-29	COBRA: Contextual Bandit Algorithm for Ensuring Truthful Strategic Agents	Arun Verma et.al.	2505.23720	null
2025-05-29	From Connectivity to Autonomy: The Dawn of Self-Evolving Communication Systems	Zeinab Nezami et.al.	2505.23710	null
2025-05-29	Data-to-Dashboard: Multi-Agent LLM Framework for Insightful Visualization in Enterprise Analytics	Ran Zhang et.al.	2505.23695	link
2025-05-29	ROTATE: Regret-driven Open-ended Training for Ad Hoc Teamwork	Caroline Wang et.al.	2505.23686	link
2025-05-29	GSO: Challenging Software Optimization Tasks for Evaluating SWE-Agents	Manish Shetty et.al.	2505.23671	link
2025-05-29	Initial Luminally Deposited FGF4 Critically Influences Blastocyst Patterning	Michael A. Ramirez-Sierra et.al.	2505.23650	null
2025-05-28	3DLLM-Mem: Long-Term Spatial-Temporal Memory for Embodied 3D Large Language Model	Wenbo Hu et.al.	2505.22657	null
2025-05-28	Position: Uncertainty Quantification Needs Reassessment for Large-language Model Agents	Michael Kirchhof et.al.	2505.22655	null
2025-05-28	WebDancer: Towards Autonomous Information Seeking Agency	Jialong Wu et.al.	2505.22648	link
2025-05-29	FastTD3: Simple, Fast, and Capable Reinforcement Learning for Humanoid Control	Younggyo Seo et.al.	2505.22642	null
2025-05-28	LabUtopia: High-Fidelity Simulation and Hierarchical Benchmark for Scientific Embodied Agents	Rui Li et.al.	2505.22634	null
2025-05-28	HDDLGym: A Tool for Studying Multi-Agent Hierarchical Problems Defined in HDDL with OpenAI Gym	Ngoc La et.al.	2505.22597	link
2025-05-28	GitGoodBench: A Novel Benchmark For Evaluating Agentic Performance On Git	Tobias Lindenbauer et.al.	2505.22583	link
2025-05-29	Agent-UniRAG: A Trainable Open-Source LLM Agent Framework for Unified Retrieval-Augmented Generation Systems	Hoang Pham et.al.	2505.22571	null
2025-05-28	Universal Visuo-Tactile Video Understanding for Embodied Interaction	Yifan Xie et.al.	2505.22566	null
2025-05-28	Training RL Agents for Multi-Objective Network Defense Tasks	Andres Molina-Markham et.al.	2505.22531	null
2025-05-27	Silence is Not Consensus: Disrupting Agreement Bias in Multi-Agent LLMs via Catfish Agent for Clinical Decision Making	Yihan Wang et.al.	2505.21503	null
2025-05-27	AdInject: Real-World Black-Box Attacks on Web Agents via Advertising Delivery	Haowei Wang et.al.	2505.21499	link
2025-05-27	Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers	Wei Pang et.al.	2505.21497	link
2025-05-27	UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents	Han Xiao et.al.	2505.21496	link
2025-05-27	Robust Hypothesis Generation: LLM-Automated Language Bias for Inductive Logic Programming	Yang Yang et.al.	2505.21486	null
2025-05-27	Scaling External Knowledge Input Beyond Context Windows of LLMs via Multi-Agent Collaboration	Zijun Liu et.al.	2505.21471	link
2025-05-27	Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO	Muzhi Zhu et.al.	2505.21457	null
2025-05-27	Learning Individual Behavior in Agent-Based Models with Graph Diffusion Networks	Francesco Cozzi et.al.	2505.21426	link
2025-05-27	GUARD:Dual-Agent based Backdoor Defense on Chain-of-Thought in Neural Code Generation	Naizhu Jin et.al.	2505.21425	null
2025-05-27	Autonomous Multi-Modal LLM Agents for Treatment Planning in Focused Ultrasound Ablation Surgery	Lina Zhao et.al.	2505.21418	null
2025-05-27	MineAnyBuild: Benchmarking Spatial Planning for Open-world AI Agents	Ziming Wei et.al.	2505.20148	link
2025-05-26	Agentic 3D Scene Generation with Spatially Contextualized VLMs	Xinhang Liu et.al.	2505.20129	null
2025-05-26	Iterative Self-Incentivization Empowers Large Language Models as Agentic Searchers	Zhengliang Shi et.al.	2505.20128	link
2025-05-26	Agentic AI Process Observability: Discovering Behavioral Variability	Fabiana Fournier et.al.	2505.20127	null
2025-05-26	Agents Require Metacognitive and Strategic Reasoning to Succeed in the Coming Labor Markets	Simpson Zhang et.al.	2505.20120	null
2025-05-27	TrojanStego: Your Language Model Can Secretly Be A Steganographic Privacy Leaking Agent	Dominik Meier et.al.	2505.20118	link
2025-05-26	MA-RAG: Multi-Agent Retrieval-Augmented Generation via Collaborative Chain-of-Thought Reasoning	Thang Nguyen et.al.	2505.20096	null
2025-05-26	SwarmThinkers: Learning Physically Consistent Atomic KMC Transitions at Scale	Qi Li et.al.	2505.20094	null
2025-05-26	REARANK: Reasoning Re-ranking Agent via Reinforcement Learning	Le Zhang et.al.	2505.20046	link
2025-05-26	Training LLM-Based Agents with Synthetic Self-Reflected Trajectories and Partial Masking	Yihan Chen et.al.	2505.20023	null
2025-05-23	Lost in the Haystack: Smaller Needles are More Difficult for LLMs to Find	Owen Bianchi et.al.	2505.18148	null
2025-05-23	Stochastic agent-based Monte Carlo simulations for reaction-diffusion models, population dynamics, and epidemic spreading	Mohamed Swailem et.al.	2505.18145	null
2025-05-23	Gaming Tool Preferences in Agentic LLMs	Kazem Faghih et.al.	2505.18135	link
2025-05-23	ProgRM: Build Better GUI Agents with Progress Rewards	Danyang Zhang et.al.	2505.18121	null
2025-05-23	Facility Location with Public Locations and Private Doubly-Peaked Costs	Richard Cole et.al.	2505.18114	null
2025-05-23	ManuSearch: Democratizing Deep Search in Large Language Models with a Transparent and Open Multi-Agent Framework	Lisheng Huang et.al.	2505.18105	link
2025-05-23	Planning without Search: Refining Frontier LLMs with Offline Goal-Conditioned RL	Joey Hong et.al.	2505.18098	null
2025-05-23	Deep Video Discovery: Agentic Search with Tool Use for Long-form Video Understanding	Xiaoyi Zhang et.al.	2505.18079	null
2025-05-23	Linear Mixture Distributionally Robust Markov Decision Processes	Zhishuai Liu et.al.	2505.18044	null
2025-05-23	Towards Analyzing and Understanding the Limitations of VAPO: A Theoretical Perspective	Jintian Shao et.al.	2505.17997	null
2025-05-22	SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding	Haoning Wu et.al.	2505.17012	link
2025-05-22	X-MAS: Towards Building Multi-Agent Systems with Heterogeneous LLMs	Rui Ye et.al.	2505.16997	link
2025-05-22	MASLab: A Unified and Comprehensive Codebase for LLM-based Multi-Agent Systems	Rui Ye et.al.	2505.16988	link
2025-05-22	T1: A Tool-Oriented Conversational Dataset for Multi-Turn Agentic Planning	Amartya Chakraborty et.al.	2505.16986	null
2025-05-22	Beyond Correlation: Towards Causal Large Language Model Agents in Biomedicine	Adib Bazgir et.al.	2505.16982	null
2025-05-22	Know the Ropes: A Heuristic Strategy for LLM-based Multi-Agent System Design	Zhenkun Li et.al.	2505.16979	null
2025-05-22	SWE-Dev: Evaluating and Training Autonomous Feature-Driven Software Development	Yaxin Du et.al.	2505.16975	link
2025-05-22	Modeling Inequality in Complex Networks of Strategic Agents using Iterative Game-Theoretic Transactions	Mayank Kejriwal et.al.	2505.16966	null
2025-05-22	Cracking Aegis: An Adversarial LLM-based Game for Raising Awareness of Vulnerabilities in Privacy Protection	Jiaying Fu et.al.	2505.16954	null
2025-05-22	A Comprehensive Evaluation of Contemporary ML-Based Solvers for Combinatorial Optimization	Shengyu Feng et.al.	2505.16952	null
2025-05-22	GUI-G1: Understanding R1-Zero-Like Training for Visual Grounding in GUI Agents	Yuqi Zhou et.al.	2505.15810	link
2025-05-21	The Agentic Economy	David M. Rothschild et.al.	2505.15799	null
2025-05-22	HCRMP: A LLM-Hinted Contextual Reinforcement Learning Framework for Autonomous Driving	Zhiwen Chen et.al.	2505.15793	null
2025-05-21	Solving General-Utility Markov Decision Processes in the Single-Trial Regime with Online Planning	Pedro P. Santos et.al.	2505.15782	null
2025-05-21	Alignment Under Pressure: The Case for Informed Adversaries When Evaluating LLM Defenses	Xiaoxue Yang et.al.	2505.15738	link
2025-05-21	DEBATE, TRAIN, EVOLVE: Self Evolution of Language Model Reasoning	Gaurav Srivastava et.al.	2505.15734	null
2025-05-21	Quantum Dots as Functional Nanosystems for Enhanced Biomedical Applications	Pronama Biswas et.al.	2505.15705	null
2025-05-21	HAMF: A Hybrid Attention-Mamba Framework for Joint Scene Context Understanding and Future Motion Representation Learning	Xiaodong Mei et.al.	2505.15703	null
2025-05-21	Average Reward Reinforcement Learning for Omega-Regular and Mean-Payoff Objectives	Milad Kazemi et.al.	2505.15693	null
2025-05-21	From Grounding to Manipulation: Case Studies of Foundation Model Integration in Embodied Robotic Systems	Xiuchao Sui et.al.	2505.15685	link
2025-05-20	NExT-Search: Rebuilding User Feedback Ecosystem for Generative AI Search	Sunhao Dai et.al.	2505.14680	null
2025-05-20	ContextAgent: Context-Aware Proactive LLM Agents with Open-World Sensory Perceptions	Bufang Yang et.al.	2505.14668	null
2025-05-20	AI Agents in the Electricity Market Game with Cryptocurrency Transactions: A Post-Terminator Analysis	Microsoft Copilot et.al.	2505.14612	null
2025-05-20	Agent Context Protocols Enhance Collective Inference	Devansh Bhardwaj et.al.	2505.14569	null
2025-05-20	Multi-agent Reinforcement Learning vs. Fixed-Time Control for Traffic Signal Optimization: A Simulation Study	Saahil Mahato et.al.	2505.14544	link
2025-05-20	A Logic of General Attention Using Edge-Conditioned Event Models (Extended Version)	Gaia Belardinelli et.al.	2505.14539	null
2025-05-20	Energy-Efficient Deep Reinforcement Learning with Spiking Transformers	Mohammad Irfan Uddin et.al.	2505.14533	null
2025-05-20	BACON: A fully explainable AI model with graded logic for decision making problems	Haishi Bai et.al.	2505.14510	null
2025-05-20	Design and Evaluation of a Microservices Cloud Framework for Online Travel Platforms	Biman Barua et.al.	2505.14508	null
2025-05-20	Security of Distributed Gradient Descent Against Byzantine Agents	Sribalaji C. Anand et.al.	2505.14473	null
2025-05-19	G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning	Liang Chen et.al.	2505.13426	link
2025-05-20	A Dataless Reinforcement Learning Approach to Rounding Hyperplane Optimization for Max-Cut	Gabriel Malikal et.al.	2505.13405	null
2025-05-19	Robin: A multi-agent system for automating scientific discovery	Ali Essam Ghareeb et.al.	2505.13400	null
2025-05-19	Rethinking Stateful Tool Use in Multi-Turn Dialogues: Benchmarks and Challenges	Hongru Wang et.al.	2505.13328	null
2025-05-19	Synthesis of Communication Policies for Multi-Agent Systems Robust to Communication Restrictions	Saleh Soudijani et.al.	2505.13311	null
2025-05-19	TimeSeriesGym: A Scalable Benchmark for (Time Series) Machine Learning Engineering Agents	Yifu Cai et.al.	2505.13291	link
2025-05-19	Hybrid Voting-Based Task Assignment in Modular Construction Scenarios	Daniel Weiner et.al.	2505.13278	null
2025-05-19	From Automation to Autonomy: A Survey on Large Language Models in Scientific Discovery	Tianshi Zheng et.al.	2505.13259	link
2025-05-19	Effective and Transparent RAG: Adaptive-Reward Reinforcement Learning for Decision Traceability	Jingyi Ren et.al.	2505.13258	link
2025-05-19	Composing Dextrous Grasping and In-hand Manipulation via Scoring with a Reinforcement Learning Critic	Lennart Röstel et.al.	2505.13253	null
2025-05-16	Automatic Reward Shaping from Confounded Offline Data	Mingxuan Li et.al.	2505.11478	null
2025-05-16	Signal attenuation enables scalable decentralized multi-agent reinforcement learning over networks	Wesley A Suttle et.al.	2505.11461	null
2025-05-16	Robust Equilibria in Shared Resource Allocation via Strengthening Border’s Theorem	David X. Lin et.al.	2505.11431	null
2025-05-16	Can AI automatically analyze public opinion? A LLM agents-based agentic pipeline for timely public opinion analysis	Jing Liu et.al.	2505.11401	null
2025-05-16	Dynam3D: Dynamic Layered 3D Tokens Empower VLM for Vision-and-Language Navigation	Zihan Wang et.al.	2505.11383	link
2025-05-16	GuideBench: Benchmarking Domain-Oriented Guideline Following for LLM Agents	Lingxiao Diao et.al.	2505.11368	null
2025-05-16	Long-Term Average Impulse Control with Mean Field Interactions	K. L. Helmes et.al.	2505.11345	null
2025-05-16	Explaining Strategic Decisions in Multi-Agent Reinforcement Learning for Aerial Combat Tactics	Ardian Selmonaj et.al.	2505.11311	null
2025-05-16	Diffusion Learning with Partial Agent Participation and Local Updates	Elsa Rizk et.al.	2505.11307	null
2025-05-16	Meta-World+: An Improved, Standardized, RL Benchmark	Reginald McLean et.al.	2505.11289	link
2025-05-15	Towards a Deeper Understanding of Reasoning Capabilities in Large Language Models	Annie Wong et.al.	2505.10543	link
2025-05-15	Knowledge capture, adaptation and composition (KCAC): A framework for cross-task curriculum learning in robotic manipulation	Xinrui Wang et.al.	2505.10522	null
2025-05-15	Fixing Incomplete Value Function Decomposition for Multi-Agent Reinforcement Learning	Andrea Baisero et.al.	2505.10484	null
2025-05-15	Fine-tuning Diffusion Policies with Backpropagation Through Diffusion Timesteps	Ningyuan Yang et.al.	2505.10482	null
2025-05-15	AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenge	Ranjan Sapkota et.al.	2505.10468	null
2025-05-15	Bridging Theory and Perception in Fair Division: A Study on Comparative and Fair Share Notions	Hadi Hosseini et.al.	2505.10433	null
2025-05-15	Aggregating Information and Preferences with Bounded-Size Deviations	Qishen Han et.al.	2505.10388	null
2025-05-15	Multi-Agent Path Finding For Large Agents Is Intractable	Artem Agafonov et.al.	2505.10387	null
2025-05-15	Plasticity as the Mirror of Empowerment	David Abel et.al.	2505.10361	null
2025-05-15	Efficient Adaptation of Reinforcement Learning Agents to Sudden Environmental Change	Jonathan Clifford Balloch et.al.	2505.10330	null
2025-05-14	Language Agents Mirror Human Causal Reasoning Biases. How Can We Help Them Think Like Scientists?	Anthony GX-Chen et.al.	2505.09614	null
2025-05-14	WorldView-Bench: A Benchmark for Evaluating Global Cultural Perspectives in Large Language Models	Abdullah Mushtaq et.al.	2505.09595	null
2025-05-14	Preserving Plasticity in Continual Learning with Adaptive Linearity Injection	Seyed Roozbeh Razavi Rohani et.al.	2505.09486	null
2025-05-14	Streaming Multi-agent Pathfinding	Mingkai Tang et.al.	2505.09472	link
2025-05-14	CXMArena: Unified Dataset to benchmark performance in realistic CXM Scenarios	Raghav Garg et.al.	2505.09436	link
2025-05-15	Decentralized Nonlinear Model Predictive Control-Based Flock Navigation with Real-Time Obstacle Avoidance in Unknown Obstructed Environments	Nuthasith Gerdpratoom et.al.	2505.09434	null
2025-05-14	Using Dopants as Agents to Probe Key Electronic Properties of Organic Semiconductors	Artem Fediai et.al.	2505.09431	null
2025-05-14	Linear Search with Probabilistic Detection and Variable Speeds	Jared Coleman et.al.	2505.09429	link
2025-05-15	SafePath: Conformal Prediction for Safe LLM-Based Autonomous Navigation	Achref Doula et.al.	2505.09427	null
2025-05-14	The Influence of Human-inspired Agentic Sophistication in LLM-driven Strategic Reasoners	Vince Trencsenyi et.al.	2505.09396	null
2025-05-14	Towards Autonomous UAV Visual Object Search in City Space: Benchmark and Agentic Methodology	Yatai Ji et.al.	2505.08765	null
2025-05-13	Enhancing Software Development with Context-Aware Conversational Agents: A User Study on Developer Interactions with Chatbots	Glaucia Melo et.al.	2505.08648	null
2025-05-13	TRAIL: Trace Reasoning and Agentic Issue Localization	Darshan Deshpande et.al.	2505.08638	null
2025-05-13	Credit Assignment and Efficient Exploration based on Influence Scope in Multi-agent Reinforcement Learning	Shuai Han et.al.	2505.08630	null
2025-05-13	OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning	Zhaochen Su et.al.	2505.08617	link
2025-05-13	MC-Swarm: Minimal-Communication Multi-Agent Trajectory Planning and Deadlock Resolution for Quadrotor Swarm	Yunwoo Lee et.al.	2505.08593	null
2025-05-14	Communication-Efficient Distributed Online Nonconvex Optimization with Time-Varying Constraints	Kunpeng Zhang et.al.	2505.08592	null
2025-05-13	The Truth Becomes Clearer Through Debate! Multi-Agent Systems with Large Language Models Unmask Fake News	Yuhan Liu et.al.	2505.08532	null
2025-05-13	Strategy-Augmented Planning for Large Language Models via Opponent Exploitation	Shuai Xu et.al.	2505.08459	link
2025-05-13	Zero-Shot Sim-to-Real Reinforcement Learning for Fruit Harvesting	Emlyn Williams et.al.	2505.08458	null
2025-05-12	Imagine, Verify, Execute: Memory-Guided Agentic Exploration with Vision-Language Models	Seungjae Lee et.al.	2505.07815	null
2025-05-12	A Theoretical Framework for Explaining Reinforcement Learning with Shapley Values	Daniel Beechey et.al.	2505.07797	link
2025-05-12	MLE-Dojo: Interactive Environments for Empowering LLM Agents in Machine Learning Engineering	Rushi Qiang et.al.	2505.07782	link
2025-05-12	Multi-Agent Path Finding via Finite-Horizon Hierarchical Factorization	Jiarui Li et.al.	2505.07779	null
2025-05-12	Agent RL Scaling Law: Agent RL with Spontaneous Code Execution for Mathematical Problem Solving	Xinji Mai et.al.	2505.07773	link
2025-05-12	Emotion-Gradient Metacognitive RSI (Part I): Theoretical Foundations and Single-Agent Architecture	Rintaro Ando et.al.	2505.07757	null
2025-05-13	VTutor for High-Impact Tutoring at Scale: Managing Engagement and Real-Time Multi-Screen Monitoring with P2P Connections	Eason Chen et.al.	2505.07736	null
2025-05-13	Codifying Character Logic in Role-Playing	Letian Peng et.al.	2505.07705	link
2025-05-12	Belief Injection for Epistemic Control in Linguistic State Space	Sebastian Dumbrava et.al.	2505.07693	null
2025-05-12	Chronocept: Instilling a Sense of Time in Machines	Krish Goel et.al.	2505.07637	link
2025-05-09	Robust Multi-Agent Decision-Making in Finite-Population Games	Shinkyu Park et.al.	2505.06200	null
2025-05-09	Neuro-Symbolic Concepts	Jiayuan Mao et.al.	2505.06191	null
2025-05-09	The Power of Matching for Online Fractional Hedonic Games	Martin Bullinger et.al.	2505.06163	null
2025-05-09	Realistic Adversarial Attacks for Robustness Evaluation of Trajectory Prediction Models via Future State Perturbation	Julian F. Schumann et.al.	2505.06134	link
2025-05-09	ELA-ZSON: Efficient Layout-Aware Zero-Shot Object Navigation Agent with Hierarchical Planning	Jiawei Hou et.al.	2505.06131	null
2025-05-09	Oncolytic mechanisms and immunotherapeutic potential of Newcastle disease virus in cancer therapy	Umar Ahmad et.al.	2505.06067	null
2025-05-09	Offline Multi-agent Reinforcement Learning via Score Decomposition	Dan Qiao et.al.	2505.05968	null
2025-05-09	Learning Power Control Protocol for In-Factory 6G Subnetworks	Uyoata E. Uyoata et.al.	2505.05967	null
2025-05-09	Cost-Effective, Low Latency Vector Search with Azure Cosmos DB	Nitish Upreti et.al.	2505.05885	link
2025-05-09	Evolutionary ecology of words	Reiji Suzuki et.al.	2505.05863	null
2025-05-08	RL-DAUNCE: Reinforcement Learning-Driven Data Assimilation with Uncertainty-Aware Constrained Ensembles	Pouria Behnoudfar et.al.	2505.05452	null
2025-05-08	clem:todd: A Framework for the Systematic Benchmarking of LLM-Based Task-Oriented Dialogue System Realisations	Chalamalasetti Kranti et.al.	2505.05445	null
2025-05-09	EcoAgent: An Efficient Edge-Cloud Collaborative Multi-Agent Framework for Mobile Automation	Biao Yi et.al.	2505.05440	null
2025-05-08	Empowering Scientific Workflows with Federated Agents	J. Gregory Pauloski et.al.	2505.05428	link
2025-05-08	Robustly optimal dynamics for active matter reservoir computing	Mario U. Gaimann et.al.	2505.05420	null
2025-05-08	Weighted Envy-Freeness Revisited: Indivisible Resource and House Allocations	Yuxi Liu et.al.	2505.05353	null
2025-05-08	Mapping User Trust in Vision Language Models: Research Landscape, Challenges, and Prospects	Agnese Chiatti et.al.	2505.05318	null
2025-05-08	HEXGEN-TEXT2SQL: Optimizing LLM Inference Request Scheduling for Agentic Text-to-SQL Workflow	You Peng et.al.	2505.05286	link
2025-05-09	Software Development Life Cycle Perspective: A Survey of Benchmarks for Code Large Language Models and Agents	Kaixin Wang et.al.	2505.05283	null
2025-05-08	Enhancing Cooperative Multi-Agent Reinforcement Learning with State Modelling and Adversarial Exploration	Andreas Kontogiannis et.al.	2505.05262	link
2025-05-07	Implicitly Aligning Humans and Autonomous Agents through Shared Task Abstractions	Stéphane Aroca-Ouellette et.al.	2505.04579	link
2025-05-07	Optimal Deterministic Rendezvous in Labeled Lines	Yann Bourreau et.al.	2505.04564	null
2025-05-07	Qualitative Analysis of $ω$ -Regular Objectives on Robust MDPs	Ali Asadi et.al.	2505.04539	null
2025-05-07	Beyond Theorem Proving: Formulation, Framework and Benchmark for Formal Problem-Solving	Qi Liu et.al.	2505.04528	null
2025-05-07	RLMiniStyler: Light-weight RL Style Agent for Arbitrary Sequential Neural Style Generation	Jing Hu et.al.	2505.04424	link
2025-05-07	Consensus-Aware AV Behavior: Trade-offs Between Safety, Interaction, and Performance in Mixed Urban Traffic	Mohammad Elayan et.al.	2505.04379	link
2025-05-07	Extending a Quantum Reinforcement Learning Exploration Policy with Flags to Connect Four	Filipe Santos et.al.	2505.04371	null
2025-05-07	Benchmarking LLMs’ Swarm intelligence	Kai Ruan et.al.	2505.04364	link
2025-05-07	Optimization Problem Solving Can Transition to Evolutionary Agentic Workflows	Wenhao Li et.al.	2505.04354	null
2025-05-07	Resist Platform-Controlled AI Agents and Champion User-Centric Agent Advocates	Sayash Kapoor et.al.	2505.04345	null
2025-05-06	Multi-Agent System for Comprehensive Soccer Understanding	Jiayuan Rao et.al.	2505.03735	null
2025-05-06	WebGen-Bench: Evaluating LLMs on Generating Interactive and Functional Websites from Scratch	Zimu Lu et.al.	2505.03733	link
2025-05-06	Critical habitat size of organisms diffusing with stochastic resetting	Luiz Menon et.al.	2505.03727	null
2025-05-06	Meta-Optimization and Program Search using Language Models for Task and Motion Planning	Denis Shcherba et.al.	2505.03725	null
2025-05-06	Accelerated Decentralized Constraint-Coupled Optimization: A Dual $^2$ Approach	Jingwang Li et.al.	2505.03719	null
2025-05-06	Demonstrating ViSafe: Vision-enabled Safety for High-speed Detect and Avoid	Parv Kapoor et.al.	2505.03694	null
2025-05-06	Location-Restricted Stable Matching	Garret Castro et.al.	2505.03680	null
2025-05-06	CaRaFFusion: Improving 2D Semantic Segmentation with Camera-Radar Point Cloud Fusion and Zero-Shot Image Inpainting	Huawei Sun et.al.	2505.03679	null
2025-05-06	Gap the (Theory of) Mind: Sharing Beliefs About Teammates’ Goals Boosts Collaboration Perception, Not Performance	Yotam Amitai et.al.	2505.03674	null
2025-05-06	RoboOS: A Hierarchical Embodied Framework for Cross-Embodiment and Multi-Agent Collaboration	Huajie Tan et.al.	2505.03673	link
2025-05-05	Scenethesis: A Language and Vision Agentic Framework for 3D Scene Generation	Lu Ling et.al.	2505.02836	null
2025-05-05	AutoLibra: Agent Metric Induction from Open-Ended Feedback	Hao Zhu et.al.	2505.02820	link
2025-05-05	Generating HomeAssistant Automations Using an LLM-based Chatbot	Mathyas Giudici et.al.	2505.02802	null
2025-05-05	Recolorable Graph Exploration by an Oblivious Agent with Fewer Colors	Shota Takahashi et.al.	2505.02789	null
2025-05-05	Brief Announcement: Minimizing Energy Solves Relative Majority with a Cubic Number of States in Population Protocols	Tom-Lukas Breitkopf et.al.	2505.02785	null
2025-05-05	Merging plasmoids and nanojet-like ejections in a coronal current sheet	Samrat Sen et.al.	2505.02733	null
2025-05-05	Graph Neural Network-Based Reinforcement Learning for Controlling Biological Networks: The GATTACA Framework	Andrzej Mizera et.al.	2505.02712	link
2025-05-05	Technical Report: Evaluating Goal Drift in Language Model Agents	Rauno Arike et.al.	2505.02709	null
2025-05-05	Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play	Yemin Shi et.al.	2505.02707	link
2025-05-05	Exploring LLM-Powered Role and Action-Switching Pedagogical Agents for History Education in Virtual Reality	Zihao Zhu et.al.	2505.02699	null
2025-05-02	Enhancing Diversity in Parallel Agents: A Maximum State Entropy Exploration Story	Vincenzo De Paola et.al.	2505.01336	null
2025-05-02	Integration of Multi-Mode Preference into Home Energy Management System Using Deep Reinforcement Learning	Mohammed Sumayli et.al.	2505.01332	null
2025-05-02	The Dance of the Sheared Eigenfunctions	J. Oliveira-Cony et.al.	2505.01303	null
2025-05-02	Pattern formation using an intrinsic optimal control approach	Tianhao Li et.al.	2505.01302	null
2025-05-02	Essential Workers at Risk: An Agent-Based Model (SAFE-ABM) with Bayesian Uncertainty Quantification	Elizabeth B. Amona et.al.	2505.01243	null
2025-05-02	Bilateral Cognitive Security Games in Networked Control Systems under Stealthy Injection Attacks	Anh Tung Nguyen et.al.	2505.01232	null
2025-05-02	Non-universal Impact of Cholesterol on Ionic Liquid-Membrane Interactions	J. Gupta et.al.	2505.01230	null
2025-05-02	A Space-Time Trade-off for Fast Self-Stabilizing Leader Election in Population Protocols	Henry Austin et.al.	2505.01210	null
2025-05-02	Explainable AI Based Diagnosis of Poisoning Attacks in Evolutionary Swarms	Mehrdad Asadi et.al.	2505.01181	null
2025-05-02	Simulating Tertiary Educational Decision Dynamics: An Agent-Based Model for the Netherlands	Jean-Paul Daemen et.al.	2505.01142	null
2025-05-01	Towards Autonomous Micromobility through Scalable Urban Simulation	Wayne Wu et.al.	2505.00690	null
2025-05-01	Visual Test-time Scaling for GUI Agent Grounding	Tiange Luo et.al.	2505.00684	link
2025-05-01	Rethinking Memory in AI: Taxonomy, Operations, Topics, and Future Directions	Yiming Du et.al.	2505.00675	link
2025-05-01	A Finite-State Controller Based Offline Solver for Deterministic POMDPs	Alex Schutz et.al.	2505.00596	link
2025-05-01	ParkDiffusion: Heterogeneous Multi-Agent Multi-Modal Trajectory Prediction for Automated Parking using Diffusion Models	Jiarong Wei et.al.	2505.00586	null
2025-05-01	A continuum thermodynamic model of the influence of non-ionic surfactant on mass transfer from gas bubbles	Dieter Bothe et.al.	2505.00581	null
2025-05-01	Directly Forecasting Belief for Reinforcement Learning with Delays	Qingyuan Wu et.al.	2505.00546	link
2025-05-01	Emergence of Roles in Robotic Teams with Model Sharing and Limited Communication	Ian O’Flynn et.al.	2505.00540	null
2025-05-01	Safety-Critical Traffic Simulation with Guided Latent Diffusion Model	Mingxing Peng et.al.	2505.00515	null
2025-05-01	Variational OOD State Correction for Offline Reinforcement Learning	Ke Jiang et.al.	2505.00503	null
2025-04-30	A Survey of Interactive Generative Video	Jiwen Yu et.al.	2504.21853	null
2025-04-30	TRUST: An LLM-Based Dialogue System for Trauma Understanding and Structured Assessments	Sichang Tu et.al.	2504.21851	null
2025-04-30	Characterizing AI Agents for Alignment and Governance	Atoosa Kasirzadeh et.al.	2504.21848	null
2025-04-30	SWE-smith: Scaling Data for Software Engineering Agents	John Yang et.al.	2504.21798	null
2025-04-30	WebThinker: Empowering Large Reasoning Models with Deep Research Capability	Xiaoxi Li et.al.	2504.21776	link
2025-04-30	Is Intermediate Fusion All You Need for UAV-based Collaborative Perception?	Jiuwu Hao et.al.	2504.21774	link
2025-04-30	LLM-based Interactive Imitation Learning for Robotic Manipulation	Jonas Werner et.al.	2504.21769	link
2025-04-30	Asymptotic Analysis of Weighted Fair Division	Pasin Manurangsi et.al.	2504.21728	null
2025-04-30	LLM-Empowered Embodied Agent for Memory-Augmented Task Planning in Household Robotics	Marc Glocker et.al.	2504.21716	link
2025-04-30	Economic Inequality between Groups in an a priori Stratified Society	Thiago Dias et.al.	2504.21703	null
2025-04-29	Toward Efficient Exploration by Large Language Model Agents	Dilip Arumugam et.al.	2504.20997	null
2025-04-29	TesserAct: Learning 4D Embodied World Models	Haoyu Zhen et.al.	2504.20995	null
2025-04-29	XPG-RL: Reinforcement Learning with Explainable Priority Guidance for Efficiency-Boosted Mechanical Search	Yiting Zhang et.al.	2504.20969	null
2025-04-29	AegisLLM: Scaling Agentic Systems for Self-Reflective Defense in LLM Security	Zikui Cai et.al.	2504.20965	link
2025-04-29	Opinion-Driven Decision-Making for Multi-Robot Navigation through Narrow Corridors	Norah K. Alghamdi et.al.	2504.20947	null
2025-04-29	Improvements of Dark Experience Replay and Reservoir Sampling towards Better Balance between Consolidation and Plasticity	Taisuke Kobayashi et.al.	2504.20932	null
2025-04-29	Exploiting inter-agent coupling information for efficient reinforcement learning of cooperative LQR	Shahbaz P Qadri Syed et.al.	2504.20927	null
2025-04-29	Modeling AI-Human Collaboration as a Multi-Agent Adaptation	Prothit Sen et.al.	2504.20903	link
2025-04-29	CBM-RAG: Demonstrating Enhanced Interpretability in Radiology Report Generation with Multi-Agent RAG and Concept Bottleneck Models	Hasan Md Tusfiqur Alam et.al.	2504.20898	link
2025-04-29	Does Feedback Help in Bandits with Arm Erasures?	Merve Karakas et.al.	2504.20894	null
2025-04-28	Towards Automated Scoping of AI for Social Good Projects	Jacob Emmerson et.al.	2504.20010	null
2025-04-28	Simplified and Secure MCP Gateways for Enterprise AI Integration	Ivo Brett et.al.	2504.19997	link
2025-04-28	TD-EVAL: Revisiting Task-Oriented Dialogue Evaluation by Combining Turn-Level Precision with Dialogue-Level Comparisons	Emre Can Acikgoz et.al.	2504.19982	null
2025-04-28	On one generalization of stable allocations in a two-sided market	Alexander V. Karzanov et.al.	2504.19978	null
2025-04-28	Securing Agentic AI: A Comprehensive Threat Model and Mitigation Framework for Generative AI Agents	Vineeth Sai Narajala et.al.	2504.19956	null
2025-04-28	Securing GenAI Multi-Agent Systems Against Tool Squatting: A Zero Trust Registry-Based Approach	Vineeth Sai Narajala et.al.	2504.19951	null
2025-04-28	Automated decision-making for dynamic task assignment at scale	Riccardo Lo Bianco et.al.	2504.19933	link
2025-04-28	Can AI Agents Design and Implement Drug Discovery Pipelines?	Khachik Smbatyan et.al.	2504.19912	null
2025-04-28	LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects	Guangyi Liu et.al.	2504.19838	link
2025-04-28	PhenoAssistant: A Conversational Multi-Agent AI System for Automated Plant Phenotyping	Feng Chen et.al.	2504.19818	link
2025-04-25	Instrumentation for Better Demonstrations: A Case Study	Remko Proesmans et.al.	2504.18481	null
2025-04-25	Improved Dwell-times for Switched Nonlinear Systems using Memory Regression Extension	Muzaffar Qureshi et.al.	2504.18457	null
2025-04-25	Generalization Guarantees for Multi-View Representation Learning and Application to Regularization via Gaussian Product Mixture Prior	Milad Sefidgaran et.al.	2504.18455	null
2025-04-25	On monotone completion of risk markets: Limit results for incomplete risk markets	Iman Khajepour et.al.	2504.18436	null
2025-04-25	LLMpatronous: Harnessing the Power of LLMs For Vulnerability Detection	Rajesh Yarra et.al.	2504.18423	null
2025-04-25	Auto-SLURP: A Benchmark Dataset for Evaluating Multi-Agent Frameworks in Smart Personal Assistant	Lei Shen et.al.	2504.18373	link
2025-04-25	Interpretable Affordance Detection on 3D Point Clouds with Probabilistic Prototypes	Maximilian Xiling Li et.al.	2504.18355	null
2025-04-25	Revisiting Data Auditing in Large Vision-Language Models	Hongyu Zhu et.al.	2504.18349	null
2025-04-25	Optimal Control of Sensor-Induced Illusions on Robotic Agents	Lorenzo Medici et.al.	2504.18339	null
2025-04-25	Towards Adaptive Software Agents for Debugging	Yacine Majdoub et.al.	2504.18316	null
2025-04-24	Robotic Task Ambiguity Resolution via Natural Language Interaction	Eugenio Chisari et.al.	2504.17748	null
2025-04-24	Applied Sheaf Theory For Multi-agent Artificial Intelligence (Reinforcement Learning) Systems: A Prospectus	Eric Schmid et.al.	2504.17700	null
2025-04-24	‘The Boring and the Tedious’: Invisible Labour in India’s Gig-Economy	Pratyay Suvarnapathaki et.al.	2504.17697	null
2025-04-24	Towards a HIPAA Compliant Agentic AI System in Healthcare	Subash Neupane et.al.	2504.17669	null
2025-04-24	A Constraint Opinion Model	Fabio Gadducci et.al.	2504.17605	null
2025-04-24	Mitigating xApp conflicts for efficient network slicing in 6G O-RAN: a graph convolutional-based attention network approach	Sihem Bakri et.al.	2504.17590	null
2025-04-24	A Multi-Agent, Laxity-Based Aggregation Strategy for Cost-Effective Electric Vehicle Charging and Local Transformer Overload Prevention	Kristoffer Christensen et.al.	2504.17575	null
2025-04-24	Cooperative Task Offloading through Asynchronous Deep Reinforcement Learning in Mobile Edge Computing for Future Networks	Yuelin Liu et.al.	2504.17526	null
2025-04-24	Communication-Efficient Personalized Distributed Learning with Data and Node Heterogeneity	Zhuojun Tian et.al.	2504.17520	null
2025-04-24	Plasticine: Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning	Mingqi Yuan et.al.	2504.17490	null
2025-04-23	OptimAI: Optimization from Natural Language Using LLM-Powered AI Agents	Raghav Thind et.al.	2504.16918	null
2025-04-23	Building A Secure Agentic AI Application Leveraging A2A Protocol	Idan Habler et.al.	2504.16902	null
2025-04-23	Do Large Language Models know who did what to whom?	Joseph M. Denning et.al.	2504.16884	null
2025-04-23	Hybrid Reinforcement Learning and Model Predictive Control for Adaptive Control of Hydrogen-Diesel Dual-Fuel Combustion	Julian Bedei et.al.	2504.16875	null
2025-04-23	Monte Carlo Planning with Large Language Model for Text-Based Game Agents	Zijing Shi et.al.	2504.16855	null
2025-04-23	Fair division of the replacement-units without an appraiser in urban renewal processes	Noga Klein Elmalem et.al.	2504.16852	null
2025-04-23	MLOps Monitoring at Scale for Digital Platforms	Yu Jeffrey Hu et.al.	2504.16789	null
2025-04-23	A Survey of AI Agent Protocols	Yingxuan Yang et.al.	2504.16736	null
2025-04-24	DYNUS: Uncertainty-aware Trajectory Planner in Dynamic Unknown Environments	Kota Kondo et.al.	2504.16734	null
2025-04-23	IRIS: Interactive Research Ideation System for Accelerating Scientific Discovery	Aniketh Garikaparthi et.al.	2504.16728	link
2025-04-22	MR. Video: “MapReduce” is the Principle for Long Video Understanding	Ziqi Pang et.al.	2504.16082	null
2025-04-22	LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities	Thomas Schmied et.al.	2504.16078	null
2025-04-22	Guiding VLM Agents with Process Rewards at Inference Time for GUI Navigation	Zhiyuan Hu et.al.	2504.16073	null
2025-04-22	ForesightNav: Learning Scene Imagination for Efficient Exploration	Hardik Shah et.al.	2504.16062	link
2025-04-22	Reinforcement Learning and Metaheuristics for Feynman Integral Reduction	Mao Zeng et.al.	2504.16045	null
2025-04-22	A Lagrangian Approach to Optimal Lotteries in Non-Convex Economies	Chengfeng Shen et.al.	2504.15997	null
2025-04-22	Neuroadaptive Haptics: Comparing Reinforcement Learning from Explicit Ratings and Neural Signals for Adaptive XR Systems	Lukas Gehrke et.al.	2504.15984	null
2025-04-22	Towards Test Generation from Task Description for Mobile Testing with Multi-modal Reasoning	Hieu Huynh et.al.	2504.15917	link
2025-04-22	Learning the Spoofability of Limit Order Books With Interpretable Probabilistic Neural Networks	Timothée Fabre et.al.	2504.15908	null
2025-04-22	A closer look at how large language models trust humans: patterns and biases	Valeria Lerman et.al.	2504.15801	null
2025-04-21	Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs	Chun-Hsiao Yeh et.al.	2504.15280	link
2025-04-21	Interpretable Locomotion Prediction in Construction Using a Memory-Driven LLM Agent With Chain-of-Thought Reasoning	Ehsan Ahmadi et.al.	2504.15263	null
2025-04-21	FlowReasoner: Reinforcing Query-Level Meta-Agents	Hongcheng Gao et.al.	2504.15257	link
2025-04-21	A Self-Improving Coding Agent	Maxime Robeyns et.al.	2504.15228	null
2025-04-21	An experimental study of the influence of anonymous information on social media users	Boleslaw K. Szymanski et.al.	2504.15215	null
2025-04-21	Fully Adaptive Stepsizes: Which System Benefit More – Centralized or Decentralized?	Diyako Ghaderyan et.al.	2504.15196	null
2025-04-21	Behavioral Universe Network (BUN): A Behavioral Information-Based Framework for Complex Systems	Wei Zhou et.al.	2504.15146	null
2025-04-21	Neural ATTF: A Scalable Solution to Lifelong Multi-Agent Path Planning	Kushal Shah et.al.	2504.15130	null
2025-04-21	Contemplative Wisdom for Superalignment	Ruben Laukkonen et.al.	2504.15125	null
2025-04-21	Fast-Slow Co-advancing Optimizer: Toward Harmonious Adversarial Training of GAN	Lin Wang et.al.	2504.15099	null
2025-04-18	Science Hierarchography: Hierarchical Organization of Science Literature	Muhan Gao et.al.	2504.13834	link
2025-04-18	LearnAct: Few-Shot Mobile GUI Agent with a Unified Demonstration Benchmark	Guangyi Liu et.al.	2504.13805	null
2025-04-18	ChatNekoHacker: Real-Time Fan Engagement with Conversational Agents	Takuya Sera et.al.	2504.13793	null
2025-04-21	BadApex: Backdoor Attack Based on Adaptive Optimization Mechanism of Black-box Large Language Models	Zhengxian Wu et.al.	2504.13775	null
2025-04-18	$O(p \log d)$ Subgraph Isomorphism using Stigmergic Swarming Agents	H. Van Dyke Parunak et.al.	2504.13722	null
2025-04-18	Stability of flocking in the reciprocal two-species Vicsek model: Effects of relative population, motility, and noise	Aditya Kumar Dutta et.al.	2504.13709	null
2025-04-18	OpenDeception: Benchmarking and Investigating AI Deceptive Behaviors via Open-ended Interaction Simulation	Yichen Wu et.al.	2504.13707	null
2025-04-18	Modelling Immunity in Agent-based Models	Gray Manicom et.al.	2504.13706	null
2025-04-18	EyecareGPT: Boosting Comprehensive Ophthalmology Understanding with Tailored Dataset, Benchmark and Model	Sijing Li et.al.	2504.13650	link
2025-04-18	Simulating Before Planning: Constructing Intrinsic User World Model for User-Tailored Dialogue Policy Planning	Tao He et.al.	2504.13643	null
2025-04-17	Sleep-time Compute: Beyond Inference Scaling at Test-time	Kevin Lin et.al.	2504.13171	link
2025-04-17	Exploring Expert Failures Improves LLM Agent Tuning	Li-Cheng Lan et.al.	2504.13145	null
2025-04-17	Object-Driven Narrative in AR: A Scenario-Metaphor Framework with VLM Integration	Yusi Sun et.al.	2504.13119	null
2025-04-17	Retrieval-Augmented Generation with Conflicting Evidence	Han Wang et.al.	2504.13079	link
2025-04-17	InstructRAG: Leveraging Retrieval-Augmented Generation on Instruction Graphs for LLM-Based Task Planning	Zheng Wang et.al.	2504.13032	null
2025-04-17	Why Ask One When You Can Ask $k$ ? Two-Stage Learning-to-Defer to a Set of Experts	Yannis Montreuil et.al.	2504.12988	null
2025-04-17	QLLM: Do We Really Need a Mixing Network for Credit Assignment in Multi-Agent Reinforcement Learning?	Zhouyang Jiang et.al.	2504.12961	null
2025-04-17	Are Retrials All You Need? Enhancing Large Language Model Reasoning Without Verbalized Feedback	Nearchos Potamitis et.al.	2504.12951	null
2025-04-17	RL-PINNs: Reinforcement Learning-Driven Adaptive Sampling for Efficient Training of PINNs	Zhenao Song et.al.	2504.12949	null
2025-04-18	Customizing Emotional Support: How Do Individuals Construct and Interact With LLM-Powered Chatbots	Xi Zheng et.al.	2504.12943	null
2025-04-16	Adapting a World Model for Trajectory Following in a 3D Game	Marko Tot et.al.	2504.12299	null
2025-04-16	Optimal flock formation induced by agent heterogeneity	Arthur N. Montanari et.al.	2504.12297	link
2025-04-16	Advancing Arabic Speech Recognition Through Large-Scale Weakly Supervised Learning	Mahmoud Salhab et.al.	2504.12254	null
2025-04-16	Data Assimilation for Robust UQ Within Agent-Based Simulation on HPC Systems	Adam Spannaus et.al.	2504.12228	null
2025-04-16	Communication Optimization for Decentralized Learning atop Bandwidth-limited Edge Networks	Tingyang Sun et.al.	2504.12210	null
2025-04-16	ARCeR: an Agentic RAG for the Automated Definition of Cyber Ranges	Matteo Lupinacci et.al.	2504.12143	null
2025-04-16	Multilingual Contextualization of Large Language Models for Document-Level Machine Translation	Miguel Moura Ramos et.al.	2504.12140	null
2025-04-16	The Social Learning Barrier	Florian Brandl et.al.	2504.12136	null
2025-04-16	EmoACT: a Framework to Embed Emotions into Artificial Agents Based on Affect Control Theory	Francesca Corrao et.al.	2504.12125	null
2025-04-16	Towards LLM Agents for Earth Observation	Chia Hsiang Kao et.al.	2504.12110	null
2025-04-15	TextArena	Leon Guertler et.al.	2504.11442	link
2025-04-15	Embodied World Models Emerge from Navigational Task in Open-Ended Environments	Li Jin et.al.	2504.11419	null
2025-04-15	Cancer-Myth: Evaluating AI Chatbot on Patient Questions with False Presuppositions	Wang Bill Zhu et.al.	2504.11373	link
2025-04-15	DataSentinel: A Game-Theoretic Detection of Prompt Injection Attacks	Yupei Liu et.al.	2504.11358	link
2025-04-15	Learning to Be A Doctor: Searching for Effective Medical Agent Architectures	Yangyang Zhuang et.al.	2504.11301	null
2025-04-15	Policy heterogeneity improves collective olfactory search in 3-D turbulence	Lorenzo Piro et.al.	2504.11291	null
2025-04-15	The Obvious Invisible Threat: LLM-Powered GUI Agents’ Vulnerability to Fine-Print Injections	Chaoran Chen et.al.	2504.11281	null
2025-04-15	Multi-Agent Reinforcement Learning for Greenhouse Gas Offset Credit Markets	Liam Welsh et.al.	2504.11258	null
2025-04-16	UI-E2I-Synth: Advancing GUI Grounding with Large-Scale Instruction Synthesis	Xinyi Liu et.al.	2504.11257	null
2025-04-15	A Rollout-Based Algorithm and Reward Function for Efficient Resource Allocation in Business Processes	Jeroen Middelhuis et.al.	2504.11250	null
2025-04-14	The Price of Competitive Information Disclosure	Siddhartha Banerjee et.al.	2504.10459	null
2025-04-15	GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents	Xiaobo Xia et.al.	2504.10458	null
2025-04-14	RealWebAssist: A Benchmark for Long-Horizon Web Assistance with Real-World Users	Suyu Ye et.al.	2504.10445	link
2025-04-14	Position Uncertainty in a Prisoner’s Dilemma Game : An Experiment	Chowdhury Mohammad Sakib Anwar et.al.	2504.10441	null
2025-04-14	Silent Self-Stabilizing Ranking: Time Optimal and Space Efficient	Petra Berenbrink et.al.	2504.10417	null
2025-04-14	Ctrl-Z: Controlling AI Agents via Resampling	Aryan Bhatt et.al.	2504.10374	null
2025-04-14	Proteinoid spikes: from protocognitive to universal approximating agents	Saksham Sharma et.al.	2504.10362	null
2025-04-14	Siamese Network with Dual Attention for EEG-Driven Social Learning: Bridging the Human-Robot Gap in Long-Tail Autonomous Driving	Xiaoshan Zhou et.al.	2504.10296	null
2025-04-14	Characterizing LLM-driven Social Network: The Chirper.ai Case	Yiming Zhu et.al.	2504.10286	null
2025-04-14	RealHarm: A Collection of Real-World Language Model Application Failures	Pierre Le Jeune et.al.	2504.10277	link
2025-04-11	DocAgent: A Multi-Agent System for Automated Code Documentation Generation	Dayu Yang et.al.	2504.08725	link
2025-04-11	SWE-PolyBench: A multi-language benchmark for repository level evaluation of coding agents	Muhammad Shihab Rashid et.al.	2504.08703	link
2025-04-11	SeaView: Software Engineering Agent Visual Interface for Enhanced Workflow	Timothy Bula et.al.	2504.08696	null
2025-04-11	TP-RAG: Benchmarking Retrieval-Augmented Large Language Model Agents for Spatiotemporal-Aware Travel Planning	Hang Ni et.al.	2504.08694	null
2025-04-11	Voice Interaction With Conversational AI Could Facilitate Thoughtful Reflection and Substantive Revision in Writing	Jiho Kim et.al.	2504.08687	null
2025-04-11	Do LLMs trust AI regulation? Emerging behaviour of game-theoretic LLM agents	Alessio Buscemi et.al.	2504.08640	null
2025-04-11	Optimal selection of the most informative nodes for a noisy DeGroot model with stubborn agents	Roberta Raineri et.al.	2504.08622	null
2025-04-11	MooseAgent: A LLM Based Multi-agent Framework for Automating Moose Simulation	Tao Zhang et.al.	2504.08621	link
2025-04-11	Ready, Bid, Go! On-Demand Delivery Using Fleets of Drones with Unknown, Heterogeneous Energy Storage Constraints	Mohamed S. Talamali et.al.	2504.08585	null
2025-04-11	FMLGS: Fast Multilevel Language Embedded Gaussians for Part-level Interactive Agents	Xin Tan et.al.	2504.08581	null
2025-04-10	Fast Adaptation with Behavioral Foundation Models	Harshit Sikchi et.al.	2504.07896	null
2025-04-10	Benchmarking Adversarial Robustness to Bias Elicitation in Large Language Models: Scalable Automated Assessment with LLM-as-a-Judge	Riccardo Cantini et.al.	2504.07887	link
2025-04-11	An LLM-Driven Multi-Agent Debate System for Mendelian Diseases	Xinyang Zhou et.al.	2504.07881	null
2025-04-10	Dual Engines of Thoughts: A Depth-Breadth Integration Framework for Open-Ended Analysis	Fei-Hsuan Yu et.al.	2504.07872	null
2025-04-10	In itinere infections covertly undermine localized epidemic control in metapopulations	Francesca Dilisante et.al.	2504.07849	null
2025-04-10	Anytime Single-Step MAPF Planning with Anytime PIBT	Nayesha Gandotra et.al.	2504.07841	null
2025-04-10	Deceptive Automated Interpretability: Language Models Coordinating to Fool Oversight Systems	Simon Lermen et.al.	2504.07831	null
2025-04-10	MOSAIC: Modeling Social AI for Content Dissemination and Regulation in Multi-Agent Simulations	Genglin Liu et.al.	2504.07830	link
2025-04-10	Active Matter Flocking via Predictive Alignment	Julian Giraldo-Barreto et.al.	2504.07778	null
2025-04-10	Synthesizing High-Quality Programming Tasks with LLM-based Expert and Student Agents	Manh Hung Nguyen et.al.	2504.07655	null
2025-04-09	SkillWeaver: Web Agents can Self-Improve by Discovering and Honing Skills	Boyuan Zheng et.al.	2504.07079	null
2025-04-09	A Unified Agentic Framework for Evaluating Conditional Image Generation	Jifang Wang et.al.	2504.07046	link
2025-04-09	Review of Case-Based Reasoning for LLM Agents: Theoretical Foundations, Architectural Components, and Cognitive Integration	Kostas Hatalis et.al.	2504.06943	null
2025-04-09	AI-Driven Consensus: Modeling Multi-Agent Networks with Long-Range Interactions through path-Laplacian Matrices	Yusef Ahsini et.al.	2504.06894	link
2025-04-09	More connection, less community: network formation and local public goods provision	Alastair Langtry et.al.	2504.06872	null
2025-04-09	Persona Dynamics: Unveiling the Impact of Personality Traits on Agents in Text-Based Games	Seungwon Lim et.al.	2504.06868	link
2025-04-09	IAAO: Interactive Affordance Learning for Articulated Objects in 3D Environments	Can Zhang et.al.	2504.06827	null
2025-04-09	Inducing Programmatic Skills for Agentic Tasks	Zora Zhiruo Wang et.al.	2504.06821	link
2025-04-09	FamilyTool: A Multi-hop Personalized Tool Use Benchmark	Yuxin Wang et.al.	2504.06766	link
2025-04-09	Adaptive Human-Robot Collaborative Missions using Hybrid Task Planning	Gricel Vázquez et.al.	2504.06746	null
2025-04-08	FEABench: Evaluating Language Models on Multiphysics Reasoning Ability	Nayantara Mudur et.al.	2504.06260	link
2025-04-08	The Work Capacity of Channels with Memory: Maximum Extractable Work in Percept-Action Loops	Lukas J. Fiderer et.al.	2504.06209	null
2025-04-08	TxGemma: Efficient and Agentic LLMs for Therapeutics	Eric Wang et.al.	2504.06196	null
2025-04-08	SkillFlow: Efficient Skill and Code Transfer Through Communication in Adapting AI Agents	Pagkratios Tagkopoulos et.al.	2504.06188	null
2025-04-08	Linear Regulator-Based Synchronization of Positive Multi-Agent Systems	Alba Gurpegui et.al.	2504.06169	null
2025-04-08	V-MAGE: A Game Evaluation Framework for Assessing Visual-Centric Capabilities in Multimodal Large Language Models	Xiangxi Zheng et.al.	2504.06148	link
2025-04-08	Deploying Chatbots in Customer Service: Adoption Hurdles and Simple Remedies	Evgeny Kagan et.al.	2504.06145	null
2025-04-08	A Multimedia Analytics Model for the Foundation Model Era	Marcel Worring et.al.	2504.06138	null
2025-04-08	Decentralizing AI Memory: SHIMI, a Semantic Hierarchical Memory Index for Scalable Agent Reasoning	Tooraj Helmi et.al.	2504.06135	null
2025-04-08	Accelerating Vehicle Routing via AI-Initialized Genetic Algorithms	Ido Greenberg et.al.	2504.06126	null
2025-04-07	CREA: A Collaborative Multi-Agent Framework for Creative Content Generation with Diffusion Models	Kavana Venkatesh et.al.	2504.05306	null
2025-04-07	How to evaluate control measures for LLM agents? A trajectory from today to superintelligence	Tomek Korbak et.al.	2504.05259	null
2025-04-07	Rationalizing dynamic choices	Henrique de Oliveira et.al.	2504.05251	null
2025-04-07	Reducing the Communication of Distributed Model Predictive Control: Autoencoders and Formation Control	Torben Schiz et.al.	2504.05223	null
2025-04-07	DoCIA: An Online Document-Level Context Incorporation Agent for Speech Translation	Xinglin Lyu et.al.	2504.05122	link
2025-04-07	AI for Climate Finance: Agentic Retrieval and Multi-Step Reasoning for Early Warning System Investments	Saeid Ario Vaghefi et.al.	2504.05104	null
2025-04-07	AI-Driven Tactical Communications and Networking for Defense: A Survey and Emerging Trends	Victor Monzon Baeza et.al.	2504.05071	null
2025-04-07	Debate Only When Necessary: Adaptive Multiagent Collaboration for Efficient LLM Reasoning	Sugyeong Eo et.al.	2504.05047	null
2025-04-08	Attention-Augmented Inverse Reinforcement Learning with Graph Convolutions for Multi-Agent Task Allocation	Huilin Yin et.al.	2504.05045	null
2025-04-07	Mixture-of-Personas Language Models for Population Simulation	Ngoc Bui et.al.	2504.05019	null
2025-04-04	Bonsai: Interpretable Tree-Adaptive Grounded Reasoning	Kate Sanders et.al.	2504.03640	null
2025-04-04	Epicast 2.0: A large-scale, demographically detailed, agent-based model for simulating respiratory pathogen spread in the United States	Prescott C. Alexander et.al.	2504.03604	null
2025-04-04	APIGen-MT: Agentic Pipeline for Multi-Turn Data Generation via Simulated Agent-Human Interplay	Akshara Prabhakar et.al.	2504.03601	null
2025-04-04	A Lower Bound on Conservative Elementary Object Systems Coverability	Francesco Di Cosmo et.al.	2504.03591	null
2025-04-04	SynWorld: Virtual Scenario Synthesis for Agentic Action Knowledge Refinement	Runnan Fang et.al.	2504.03561	link
2025-04-04	Agentic Knowledgeable Self-awareness	Shuofei Qiao et.al.	2504.03553	link
2025-04-04	The Limits of “Fairness” of the Variational Generalized Nash Equilibrium	Sophie Hall et.al.	2504.03540	null
2025-04-04	RANa: Retrieval-Augmented Navigation	Gianluca Monaci et.al.	2504.03524	null
2025-04-04	Target Prediction Under Deceptive Switching Strategies via Outlier-Robust Filtering of Partially Observed Incomplete Trajectories	Yiming Meng et.al.	2504.03502	null
2025-04-04	A stochastic volatility approximation for a tick-by-tick price model with mean-field interaction	Paolo Dai Pra et.al.	2504.03445	null
2025-04-03	Unified World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets	Chuning Zhu et.al.	2504.02792	null
2025-04-03	Sequential Binary Hypothesis Testing with Competing Agents under Information Asymmetry	Aneesh Raghavan et.al.	2504.02743	null
2025-04-03	Responsible Development of Offensive AI	Ryan Marinelli et.al.	2504.02701	link
2025-04-03	The Tension between Trust and Oversight in Long-term Relationships	Peter Achim et.al.	2504.02696	null
2025-04-03	Handover and SINR-Aware Path Optimization in 5G-UAV mmWave Communication using DRL	Achilles Kiwanuka Machumilane et.al.	2504.02688	null
2025-04-03	A Set-Theoretic Robust Control Approach for Linear Quadratic Games with Unknown Counterparts	Francesco Bianchin et.al.	2504.02679	null
2025-04-03	Affordable AI Assistants with Knowledge Graph of Thoughts	Maciej Besta et.al.	2504.02670	null
2025-04-03	SymDQN: Symbolic Knowledge and Reasoning in Neural Network-based Reinforcement Learning	Ivo Amador et.al.	2504.02654	null
2025-04-04	Controlled Social Learning: Altruism vs. Bias	Raghu Arghal et.al.	2504.02648	null
2025-04-03	Multi-Mission Tool Bench: Assessing the Robustness of LLM based Agents through Related and Dynamic Missions	PeiJie Yu et.al.	2504.02623	link
2025-04-02	Graphon games and an idealized limit of large network games	Motoki Otsuka et.al.	2504.01944	null
2025-04-02	Review, Refine, Repeat: Understanding Iterative Decoding of AI Agents with Dynamic Evaluation and Selection	Souradip Chakraborty et.al.	2504.01931	null
2025-04-02	Gen-C: Populating Virtual Worlds with Generative Crowds	Andreas Panayiotou et.al.	2504.01924	null
2025-04-02	Advancing AI-Scientist Understanding: Making LLM Think Like a Physicist with Interpretable Reasoning	Yinggan Xu et.al.	2504.01911	null
2025-04-02	Interpreting Emergent Planning in Model-Free Reinforcement Learning	Thomas Bush et.al.	2504.01871	null
2025-04-02	PaperBench: Evaluating AI’s Ability to Replicate AI Research	Giulio Starace et.al.	2504.01848	link
2025-04-02	A Randomized Zeroth-Order Hierarchical Framework for Heterogeneous Federated Learning	Yuyang Qiu et.al.	2504.01839	null
2025-04-02	Budget-Feasible Contracts	Michal Feldman et.al.	2504.01773	null
2025-04-03	Beyond Non-Expert Demonstrations: Outcome-Driven Action Constraint for Offline Reinforcement Learning	Ke Jiang et.al.	2504.01719	null
2025-04-02	Reasoning LLMs for User-Aware Multimodal Conversational Agents	Hamed Rahimi et.al.	2504.01700	null
2025-03-31	RIG: Synergizing Reasoning and Imagination in End-to-End Generalist Policy	Zhonghan Zhao et.al.	2503.24388	null
2025-03-31	Coordinating Distributed Energy Resources with Nodal Pricing in Distribution Networks: a Game-Theoretic Approach	Eli Brock et.al.	2503.24342	null
2025-03-31	Fair Dynamic Spectrum Access via Fully Decentralized Multi-Agent Reinforcement Learning	Yubo Zhang et.al.	2503.24296	null
2025-03-31	Value of Information-based Deceptive Path Planning Under Adversarial Interventions	Wesley A. Suttle et.al.	2503.24284	null
2025-03-31	MaintainCoder: Maintainable Code Generation Under Dynamic Requirements	Zhengren Wang et.al.	2503.24260	link
2025-03-31	PAARS: Persona Aligned Agentic Retail Shoppers	Saab Mansour et.al.	2503.24228	null
2025-03-31	Agent-Based Simulations of Online Political Discussions: A Case Study on Elections in Germany	Abdul Sittar et.al.	2503.24199	null
2025-03-31	Output Constraints as Attack Surface: Exploiting Structured Generation to Bypass LLM Safety Mechanisms	Shuoming Zhang et.al.	2503.24191	null
2025-03-31	Navi-plus: Managing Ambiguous GUI Navigation Tasks with Follow-up	Ziming Cheng et.al.	2503.24180	null
2025-03-31	Reinforcement Learning for Safe Autonomous Two Device Navigation of Cerebral Vessels in Mechanical Thrombectomy	Harry Robertshaw et.al.	2503.24140	null
2025-03-28	Self-Evolving Multi-Agent Simulations for Realistic Clinical Interactions	Mohammad Almansoori et.al.	2503.22678	null
2025-03-28	ActionStudio: A Lightweight Framework for Data and Training of Action Models	Jianguo Zhang et.al.	2503.22673	link
2025-03-28	On the Mistaken Assumption of Interchangeable Deep Reinforcement Learning Implementations	Rajdeep Singh Hundal et.al.	2503.22575	null
2025-03-28	SafeCast: Risk-Responsive Motion Forecasting for Autonomous Vehicles	Haicheng Liao et.al.	2503.22541	null
2025-03-28	Unlocking LLM Repair Capabilities in Low-Resource Programming Languages Through Cross-Language Translation and Multi-Agent Refinement	Wenqiang Luo et.al.	2503.22512	null
2025-03-28	Scenario Dreamer: Vectorized Latent Diffusion for Generating Driving Simulation Environments	Luke Rowe et.al.	2503.22496	null
2025-03-28	WorkTeam: Constructing Workflows from Natural Language with Multi-Agents	Hanchao Liu et.al.	2503.22473	null
2025-03-28	Evaluating LLM-based Agents for Multi-Turn Conversations: A Survey	Shengyue Guan et.al.	2503.22458	null
2025-03-28	Scaling Laws of Scientific Discovery with AI and Robot Scientists	Pengsong Zhang et.al.	2503.22444	null
2025-03-28	CoSIL: Software Issue Localization via LLM-Driven Code Repository Graph Searching	Zhonghao Jiang et.al.	2503.22424	link
2025-03-27	MemInsight: Autonomous Memory Augmentation for LLM Agents	Rana Salama et.al.	2503.21760	null
2025-03-27	GateLens: A Reasoning-Enhanced LLM Agent for Automotive Software Release Analytics	Arsham Gholamzadeh Khoee et.al.	2503.21735	null
2025-03-27	Collab: Controlled Decoding using Mixture of Agents for LLM Alignment	Souradip Chakraborty et.al.	2503.21720	null
2025-03-27	A tale of two goals: leveraging sequentiality in multi-goal scenarios	Olivier Serris et.al.	2503.21677	null
2025-03-27	Cognitive Science-Inspired Evaluation of Core Capabilities for Object Understanding in AI	Danaja Rutar et.al.	2503.21668	null
2025-03-27	UI-R1: Enhancing Action Prediction of GUI Agents by Reinforcement Learning	Zhengxi Lu et.al.	2503.21620	link
2025-03-27	A Measure Based Generalizable Approach to Understandability	Vikas Kushwaha et.al.	2503.21615	null
2025-03-27	A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond	Xiaoye Qu et.al.	2503.21614	link
2025-03-27	A Deep Reinforcement Learning-based Approach for Adaptive Handover Protocols	Johannes Voigt et.al.	2503.21601	null
2025-03-27	debug-gym: A Text-Based Environment for Interactive Debugging	Xingdi Yuan et.al.	2503.21557	null
2025-03-26	Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields	Shijie Zhou et.al.	2503.20776	null
2025-03-26	Welfare and Cost Aggregation for Multi-Agent Control: When to Choose Which Social Cost Function, and Why?	Ilia Shilov et.al.	2503.20772	null
2025-03-27	Beyond Believability: Accurate Human Behavior Simulation with Fine-Tuned LLMs	Yuxuan Lu et.al.	2503.20749	null
2025-03-26	Prospect for measuring work statistics in quantum coherent systems	Cheolhee Han et.al.	2503.20729	null
2025-03-26	Convergence Theory of Flexible ALADIN for Distributed Optimization	Xu Du et.al.	2503.20716	null
2025-03-26	Graph-Enhanced Model-Free Reinforcement Learning Agents for Efficient Power Grid Topological Control	Eloy Anguiano Batanero et.al.	2503.20688	null
2025-03-27	Flip Learning: Weakly Supervised Erase to Segment Nodules in Breast Ultrasound	Yuhao Huang et.al.	2503.20685	null
2025-03-26	TAMA: A Human-AI Collaborative Thematic Analysis Framework Using Multi-Agent LLMs for Clinical Interviews	Huimin Xu et.al.	2503.20666	null
2025-03-26	Agent-Based Analysis of the Impact of Near Real-Time Data and Smart Balancing on the Frequency Stability of Power Systems	Johannes Lips et.al.	2503.20665	null
2025-03-26	State-Aware Perturbation Optimization for Robust Deep Reinforcement Learning	Zongyuan Zhang et.al.	2503.20613	null
2025-03-25	Energetic advantages for quantum agents in online execution of complex strategies	Jayne Thompson et.al.	2503.19896	null
2025-03-25	A Multi-Agent Framework Integrating Large Language Models and Generative AI for Accelerated Metamaterial Design	Jie Tian et.al.	2503.19889	null
2025-03-25	Collaborative Satisfaction of Long-Term Spatial Constraints in Multi-Agent Systems: A Distributed Optimization Approach (extended version)	Farhad Mehdifar et.al.	2503.19879	null
2025-03-25	Towards Online Multi-Modal Social Interaction Understanding	Xinpeng Li et.al.	2503.19851	link
2025-03-25	FALCONEye: Finding Answers and Localizing Content in ONE-hour-long videos with multi-modal LLMs	Carlos Plou et.al.	2503.19850	null
2025-03-25	Thinking agents for zero-shot generalization to qualitatively novel tasks	Thomas Miconi et.al.	2503.19815	null
2025-03-25	Simulating Tracking Data to Advance Sports Analytics Research	David Radke et.al.	2503.19809	link
2025-03-25	Inducing Personality in LLM-Based Honeypot Agents: Measuring the Effect on Human-Like Agenda Generation	Lewis Newsham et.al.	2503.19752	null
2025-03-25	Writing as a testbed for open ended agents	Sian Gooding et.al.	2503.19711	null
2025-03-25	Optimal Path Planning and Cost Minimization for a Drone Delivery System Via Model Predictive Control	Muhammad Al-Zafar Khan et.al.	2503.19699	null
2025-03-24	AdaWorld: Learning Adaptable World Models with Latent Actions	Shenyuan Gao et.al.	2503.18938	link
2025-03-24	AgentDropout: Dynamic Agent Elimination for Token-Efficient and High-Performance LLM-Based Multi-Agent Collaboration	Zhexuan Wang et.al.	2503.18891	link
2025-03-24	Dynamics of Insect Paraintelligence: How a Mindless Colony of Ants Meaningfully Moves a Beetle	Eldar Knar et.al.	2503.18858	null
2025-03-24	Self-Organizing Graph Reasoning Evolves into a Critical State for Continuous Discovery Through Structural-Semantic Dynamics	Markus J. Buehler et.al.	2503.18852	null
2025-03-24	EconEvals: Benchmarks and Litmus Tests for LLM Agents in Unknown Environments	Sara Fish et.al.	2503.18825	null
2025-03-24	Faster Heat Transfer Clarifies the Unexpected Twist in the Simultaneous Freezing of Hot versus Cold Water	James D. Brownridge et.al.	2503.18820	null
2025-03-24	Learning Multi-Robot Coordination through Locality-Based Factorized Multi-Agent Actor-Critic Algorithm	Chak Lam Shek et.al.	2503.18816	null
2025-03-24	Defeating Prompt Injections by Design	Edoardo Debenedetti et.al.	2503.18813	null
2025-03-24	Simulation-Driven Balancing of Competitive Game Levels with Reinforcement Learning	Florian Rupp et.al.	2503.18748	link
2025-03-24	Unsupervised Acquisition of Discrete Grammatical Categories	David Ph. Shakouri et.al.	2503.18702	null
2025-03-21	HCAST: Human-Calibrated Autonomy Software Tasks	David Rein et.al.	2503.17354	link
2025-03-21	CVE-Bench: A Benchmark for AI Agents’ Ability to Exploit Real-World Web Application Vulnerabilities	Yuxuan Zhu et.al.	2503.17332	link
2025-03-21	LLM+MAP: Bimanual Robot Task Planning using Large Language Models and Planning Domain Definition Language	Kun Chu et.al.	2503.17309	link
2025-03-21	Exploring the Temporal Dynamics of Facial Mimicry in Emotion Processing Using Action Units	Meisam Jamshidi Seikavandi et.al.	2503.17306	null
2025-03-21	Coarsening in the Persistent Voter Model: analytical results	R. G. de Almeida et.al.	2503.17295	null
2025-03-21	Curriculum RL meets Monte Carlo Planning: Optimization of a Real World Container Management Problem	Abhijeet Pendyala et.al.	2503.17194	link
2025-03-21	Which2comm: An Efficient Collaborative Perception Framework for 3D Object Detection	Duanrui Yu et.al.	2503.17175	null
2025-03-21	Leveraging Language Models for Out-of-Distribution Recovery in Reinforcement Learning	Chan Kim et.al.	2503.17125	null
2025-03-21	Deterministic AI Agent Personality Expression through Standard Psychological Diagnostics	J. M. Diederik Kruijssen et.al.	2503.17085	null
2025-03-21	Replay4NCL: An Efficient Memory Replay-based Methodology for Neuromorphic Continual Learning in Embedded AI Systems	Mishal Fatima Minhas et.al.	2503.17061	null
2025-03-20	Survey on Evaluation of LLM-based Agents	Asaf Yehudai et.al.	2503.16416	null
2025-03-20	Computing Lindahl Equilibrium for Public Goods with and without Funding Caps	Christian Kroer et.al.	2503.16414	null
2025-03-20	RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints	Yiran Qin et.al.	2503.16408	null
2025-03-20	Do Visual Imaginations Improve Vision-and-Language Navigation Agents?	Akhil Perincherry et.al.	2503.16394	null
2025-03-20	JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse	Muyao Li et.al.	2503.16365	null
2025-03-20	Issue2Test: Generating Reproducing Test Cases from Issue Reports	Noor Nashid et.al.	2503.16320	null
2025-03-20	Characterizing the Convergence of Game Dynamics via Potentialness	Martin Bichler et.al.	2503.16285	link
2025-03-20	Binary-Report Peer Prediction for Real-Valued Signal Spaces	Rafael Frongillo et.al.	2503.16280	null
2025-03-20	AI Agents in Cryptoland: Practical Attacks and No Silver Bullet	Atharv Singh Patlan et.al.	2503.16248	null
2025-03-20	Dispersion is (Almost) Optimal under (A)synchrony	Ajay D. Kshemkalyani et.al.	2503.16216	null
2025-03-19	More Information is Not Always Better: Connections between Zero-Sum Local Nash Equilibria in Feedback and Open-Loop Information Patterns	Kushagra Gupta et.al.	2503.15486	null
2025-03-19	SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks	Yifei Zhou et.al.	2503.15478	link
2025-03-19	Energy-efficient Merging of Connected and Automated Vehicles using Control Barrier Functions	Shreshta Rajakumar Deshpande et.al.	2503.15379	null
2025-03-19	Lyapunov-Based Graph Neural Networks for Adaptive Control of Multi-Agent Systems	Brandon C. Fallin et.al.	2503.15360	null
2025-03-19	MAMM-Refine: A Recipe for Improving Faithfulness in Generation with Multi-Agent Collaboration	David Wan et.al.	2503.15272	null
2025-03-19	Exploring Large Language Models for Word Games:Who is the Spy?	Chentian Wei et.al.	2503.15235	link
2025-03-19	A Personalized Data-Driven Generative Model of Human Motion	Angelo Di Porzio et.al.	2503.15225	null
2025-03-19	When Pigs Get Sick: Multi-Agent AI for Swine Disease Detection	Tittaya Mairittha et.al.	2503.15204	null
2025-03-19	Learning Topology Actions for Power Grid Control: A Graph-Based Soft-Label Imitation Learning Approach	Mohamed Hassouna et.al.	2503.15190	null
2025-03-19	Role-Selection Game in Block Production under Proposer-Builder Separation	Yanzhen Li et.al.	2503.15184	null
2025-03-18	Gricean Norms as a Basis for Effective Collaboration	Fardin Saad et.al.	2503.14484	link
2025-03-18	Don’t lie to your friends: Learning what you know from collaborative self-play	Jacob Eisenstein et.al.	2503.14481	null
2025-03-18	EnvBench: A Benchmark for Automated Environment Setup	Aleksandra Eliseeva et.al.	2503.14443	link
2025-03-18	PLAY2PROMPT: Zero-shot Tool Instruction Optimization for LLM Agents via Tool Play	Wei Fang et.al.	2503.14432	null
2025-03-18	Decentralized RISE-based Control for Exponential Heterogeneous Multi-Agent Target Tracking of Second-Order Nonlinear Systems	Cristian F. Nino et.al.	2503.14418	null
2025-03-18	Large Language Models for Virtual Human Gesture Selection	Parisa Ghanad Torshizi et.al.	2503.14408	null
2025-03-18	Unified Analysis of Decentralized Gradient Descent: a Contraction Mapping Framework	Erik G. Larsson et.al.	2503.14353	null
2025-03-18	MANTRA: Enhancing Automated Method-Level Refactoring with Contextual RAG and Multi-Agent LLM Collaboration	Yisen Xu et.al.	2503.14340	null
2025-03-18	DARS: Dynamic Action Re-Sampling to Enhance Coding Agent Performance by Adaptive Tree Traversal	Vaibhav Aggarwal et.al.	2503.14269	link
2025-03-18	Conversational Agents as Catalysts for Critical Thinking: Challenging Social Influence in Group Decision-making	Soohwan Lee et.al.	2503.14263	null
2025-03-17	VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning	Ye Liu et.al.	2503.13444	link
2025-03-17	A Comprehensive Survey on Multi-Agent Cooperative Decision-Making: Scenarios, Approaches, Challenges and Perspectives	Weiqiang Jin et.al.	2503.13415	null
2025-03-17	Reward Adaptation Via Q-Manipulation	Kevin Vora et.al.	2503.13414	null
2025-03-17	Toward Generative 6G Simulation: An Experimental Multi-Agent LLM and ns-3 Integration	Farhad Rezazadeh et.al.	2503.13402	null
2025-03-17	MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research	James Burgess et.al.	2503.13399	link
2025-03-17	Mixtures of ensembles: System separation and identification via optimal transport	Filip Elvander et.al.	2503.13362	null
2025-03-17	Optimal intrinsic formation using exogenous systems	Yueyue Xu et.al.	2503.13359	null
2025-03-17	Agents Play Thousands of 3D Video Games	Zhongwen Xu et.al.	2503.13356	null
2025-03-17	Goal2Story: A Multi-Agent Fleet based on Privately Enabled sLLMs for Impacting Mapping on Requirements Elicitation	Xinkai Zou et.al.	2503.13279	null
2025-03-17	Knowledge-Aware Iterative Retrieval for Multi-Agent Systems	Seyoung Song et.al.	2503.13275	null
2025-03-14	Scaling the Automated Discovery of Quantum Circuits via Reinforcement Learning with Gadgets	Jan Olle et.al.	2503.11638	null
2025-03-14	Essentials of the kinetic theory of multi-agent systems	Nadia Loy et.al.	2503.11554	null
2025-03-14	Multi-robot coordination for connectivity recovery after unpredictable environment changes	Yaroslav Marchukov et.al.	2503.11520	null
2025-03-14	Prompt Injection Detection and Mitigation via AI Multi-Agent NLP Frameworks	Diego Gosmar et.al.	2503.11517	link
2025-03-14	Multi-agent coordination for on-demand data gathering with periodic information upload	Yaroslav Marchukov et.al.	2503.11504	null
2025-03-14	Unicorn: A Universal and Collaborative Reinforcement Learning Approach Towards Generalizable Network-Wide Traffic Signal Control	Yifeng Zhang et.al.	2503.11488	null
2025-03-14	Research Vision: Multi-Agent Path Planning for Cops And Robbers Via Reactive Synthesis	William Fishell et.al.	2503.11475	null
2025-03-14	Dynamic Obstacle Avoidance with Bounded Rationality Adversarial Reinforcement Learning	Jose-Luis Holgado-Alvarez et.al.	2503.11467	null
2025-03-14	Deep Learning Agents Trained For Avoidance Behave Like Hawks And Doves	Aryaman Reddi et.al.	2503.11452	link
2025-03-14	Cerebrum (AIOS SDK): A Platform for Agent Development, Deployment, Distribution, and Discovery	Balaji Rama et.al.	2503.11444	link
2025-03-13	UniGoal: Towards Universal Zero-shot Goal-oriented Navigation	Hang Yin et.al.	2503.10630	null
2025-03-13	Uncertainty in Action: Confidence Elicitation in Embodied Agents	Tianjiao Yu et.al.	2503.10628	null
2025-03-13	CoSTA $\ast$ : Cost-Sensitive Toolpath Agent for Multi-turn Image Editing	Advait Gupta et.al.	2503.10613	link
2025-03-13	GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding	Rui Hu et.al.	2503.10596	link
2025-03-13	The Lagrangian Method for Solving Constrained Markov Games	Soham Das et.al.	2503.10561	null
2025-03-13	A large multi-agent system with noise both in position and control	Giuseppe D’Onofrio et.al.	2503.10543	null
2025-03-13	Fair allocations with subadditive and XOS valuations	Uriel Feige et.al.	2503.10513	null
2025-03-13	SySLLM: Generating Synthesized Policy Summaries for Reinforcement Learning Agents Using Large Language Models	Sahar Admoni et.al.	2503.10509	null
2025-03-13	SortingEnv: An Extendable RL-Environment for an Industrial Sorting Process	Tom Maus et.al.	2503.10466	null
2025-03-13	Compliant Control of Quadruped Robots for Assistive Load Carrying	Nimesh Khandelwal et.al.	2503.10401	null
2025-03-12	Auspex: Building Threat Modeling Tradecraft into an Artificial Intelligence-based Copilot	Andrew Crossman et.al.	2503.09586	null
2025-03-12	Plan-and-Act: Improving Planning of Agents for Long-Horizon Tasks	Lutfi Eren Erdogan et.al.	2503.09572	null
2025-03-12	The turnpike control in stochastic multi-agent dynamics: a discrete-time approach with exponential integrators	Fabio Cassini et.al.	2503.09549	null
2025-03-13	Large Language Models for Multi-Facility Location Mechanism Design	Nguyen Thach et.al.	2503.09533	null
2025-03-12	PairVDN - Pair-wise Decomposed Value Functions	Zak Buzzard et.al.	2503.09521	link
2025-03-12	RESTRAIN: Reinforcement Learning-Based Secure Framework for Trigger-Action IoT Environment	Md Morshed Alam et.al.	2503.09513	null
2025-03-12	TRACE: Real-Time Multimodal Common Ground Tracking in Situated Collaborative Dialogues	Hannah VanderHoeven et.al.	2503.09511	null
2025-03-12	ReMA: Learning to Meta-think for LLMs with Multi-Agent Reinforcement Learning	Ziyu Wan et.al.	2503.09501	link
2025-03-12	SurgicalVLM-Agent: Towards an Interactive AI Co-Pilot for Pituitary Surgery	Jiayuan Huang et.al.	2503.09474	null
2025-03-12	Hybrid Rendering for Multimodal Autonomous Driving: Merging Neural and Physics-Based Simulation	Máté Tóth et.al.	2503.09464	null
2025-03-11	CoLMDriver: LLM-based Negotiation Benefits Cooperative Autonomous Driving	Changxing Liu et.al.	2503.08683	link
2025-03-11	AgentOrca: A Dual-System Framework to Evaluate Language Agents on Operational Routine and Constraint Adherence	Zekun Li et.al.	2503.08669	null
2025-03-11	EMMOE: A Comprehensive Benchmark for Embodied Mobile Manipulation in Open Environments	Dongping Li et.al.	2503.08604	link
2025-03-11	GTR: Guided Thought Reinforcement Prevents Thought Collapse in RL-based VLM Agent Training	Tong Wei et.al.	2503.08525	null
2025-03-11	ReviewAgents: Bridging the Gap Between Human and AI-Generated Paper Reviews	Xian Gao et.al.	2503.08506	null
2025-03-11	Existence of Optimal Contracts for Principal-Agent Problem with Drift Control and Quadratic Effort Cost	Xinfu Chen et.al.	2503.08503	null
2025-03-11	Hierarchical Multi Agent DRL for Soft Handovers Between Edge Clouds in Open RAN	F. Giarrè et.al.	2503.08493	null
2025-03-11	Hybrid Deep Reinforcement Learning for Radio Tracer Localisation in Robotic-assisted Radioguided Surgery	Hanyi Zhang et.al.	2503.08492	null
2025-03-11	Collaborative Dynamic 3D Scene Graphs for Open-Vocabulary Urban Scene Understanding	Tim Steinke et.al.	2503.08474	null
2025-03-12	An Autonomous RL Agent Methodology for Dynamic Web UI Testing in a BDD Framework	Ali Hassaan Mughal et.al.	2503.08464	null
2025-03-10	MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning	Xiangru Tang et.al.	2503.07459	link
2025-03-10	LLMs syntactically adapt their language use to their conversational partner	Florian Kandra et.al.	2503.07457	null
2025-03-10	Towards Safe Robot Foundation Models	Maximilian Tölle et.al.	2503.07404	null
2025-03-10	Q-MARL: A quantum-inspired algorithm using neural message passing for large-scale multi-agent reinforcement learning	Kha Vo et.al.	2503.07397	null
2025-03-10	AttentionSwarm: Reinforcement Learning with Attention Control Barier Function for Crazyflie Drones in Dynamic Environments	Grik Tadevosyan et.al.	2503.07376	null
2025-03-10	Artificial Utopia: Simulation and Intelligent Agents for a Democratised Future	Yannick Oswald et.al.	2503.07364	null
2025-03-10	Temporal Triplane Transformers as Occupancy World Models	Haoran Xu et.al.	2503.07338	null
2025-03-10	Dynamic Path Navigation for Motion Agents with LLM Reasoning	Yubo Zhao et.al.	2503.07323	null
2025-03-10	Experimental Exploration: Investigating Cooperative Interaction Behavior Between Humans and Large Language Model Agents	Guanxuan Jiang et.al.	2503.07320	null
2025-03-10	Automated Movie Generation via Multi-Agent CoT Planning	Weijia Wu et.al.	2503.07314	link
2025-03-07	On Almost Fair and Equitable Allocations of Indivisible Items for Non-monotone Valuations	Vittorio Bilò et.al.	2503.05695	null
2025-03-07	A Survey of Large Language Model Empowered Agents for Recommendation and Search: Towards Next-Generation Information Retrieval	Yu Zhang et.al.	2503.05659	link
2025-03-07	Symbolic Mixture-of-Experts: Adaptive Skill-based Routing for Heterogeneous Reasoning	Justin Chih-Yao Chen et.al.	2503.05641	null
2025-03-07	InDRiVE: Intrinsic Disagreement based Reinforcement for Vehicle Exploration through Curiosity Driven Generalized World Model	Feeza Khan Khanzada et.al.	2503.05573	null
2025-03-07	Tractable Representations for Convergent Approximation of Distributional HJB Equations	Julie Alhosh et.al.	2503.05563	null
2025-03-07	ALMAGAL I. The ALMA evolutionary study of high-mass protocluster formation in the Galaxy. Presentation of the survey and early results	S. Molinari et.al.	2503.05555	null
2025-03-07	Impoola: The Power of Average Pooling for Image-Based Deep Reinforcement Learning	Raphael Trumpp et.al.	2503.05546	null
2025-03-07	The Society of HiveMind: Multi-Agent Optimization of Foundation Model Swarms to Unlock the Potential of Collective Intelligence	Noah Mamie et.al.	2503.05473	null
2025-03-07	Game Theory in Formula 1: Multi-agent Physical and Strategical Interactions	Giona Fienia et.al.	2503.05421	null
2025-03-07	First-passage-time statistics of active Brownian particles: A perturbative approach	Yanis Baouche et.al.	2503.05401	null
2025-03-06	The Influence of Prior Discourse on Conversational Agent-Driven Decision-Making	Stephen Pilli et.al.	2503.04692	null
2025-03-06	Quantifying the Reasoning Abilities of LLMs on Real-world Clinical Cases	Pengcheng Qiu et.al.	2503.04691	null
2025-03-06	Multi-Agent Inverse Q-Learning from Demonstrations	Nathaniel Haynam et.al.	2503.04679	null
2025-03-06	Data-Driven Distributed Optimization via Aggregative Tracking and Deep-Learning	Riccardo Brumali et.al.	2503.04668	null
2025-03-06	Assessing the performance of compartmental and renewal models for learning $R_{t}$ using spatially heterogeneous epidemic simulations on real geographies	Matthew Ghosh et.al.	2503.04648	null
2025-03-06	SurveyForge: On the Outline Heuristics, Memory-Driven Generation, and Multi-dimensional Evaluation for Automated Survey Writing	Xiangchao Yan et.al.	2503.04629	link
2025-03-06	The Next Frontier of LLM Applications: Open Ecosystems and Hardware Synergy	Xinyi Hou et.al.	2503.04596	null
2025-03-06	Advancing Solutions for the Three-Body Problem Through Physics-Informed Neural Networks	Manuel Santos Pereira et.al.	2503.04585	null
2025-03-06	ToolFuzz – Automated Agent Tool Testing	Ivan Milev et.al.	2503.04479	null
2025-03-06	From Idea to CAD: A Language Model-Driven Multi-Agent System for Collaborative Design	Felix Ocker et.al.	2503.04417	null
2025-03-05	The MASK Benchmark: Disentangling Honesty From Accuracy in AI Systems	Richard Ren et.al.	2503.03750	null
2025-03-05	CHOP: Mobile Operating Assistant with Constrained High-frequency Optimized Subtask Planning	Yuqi Zhou et.al.	2503.03743	link
2025-03-05	A Practical Memory Injection Attack against LLM Agents	Shen Dong et.al.	2503.03704	null
2025-03-05	MAS-GPT: Training LLMs to Build LLM-based Multi-Agent Systems	Rui Ye et.al.	2503.03686	null
2025-03-05	Optimally Installing Strict Equilibria	Jeremy McMahan et.al.	2503.03676	null
2025-03-05	Attentive Reasoning Queries: A Systematic Method for Optimizing Instruction-Following in Large Language Models	Bar Karov et.al.	2503.03669	link
2025-03-05	A Generative Approach to High Fidelity 3D Reconstruction from Text Data	Venkat Kumar R et.al.	2503.03664	null
2025-03-05	Motion Planning and Control with Unknown Nonlinear Dynamics through Predicted Reachability	Zhiquan Zhang et.al.	2503.03633	null
2025-03-05	TeraSim: Uncovering Unknown Unsafe Events for Autonomous Vehicles through Generative Simulation	Haowei Sun et.al.	2503.03629	link
2025-03-05	Benchmarking LLMs and LLM-based Agents in Practical Vulnerability Detection for Code Repositories	Alperen Yildiz et.al.	2503.03586	null
2025-03-04	MuBlE: MuJoCo and Blender simulation Environment and Benchmark for Task Planning in Robot Manipulation	Michal Nazarczuk et.al.	2503.02834	link
2025-03-04	Meta-Learning to Explore via Memory Density Feedback	Kevin L. McKee et.al.	2503.02831	null
2025-03-04	Do Not Trust Licenses You See – Dataset Compliance Requires Massive-Scale AI-Powered Lifecycle Tracing	Jaekyeom Kim et.al.	2503.02784	null
2025-03-04	Quantitative Resilience Modeling for Autonomous Cyber Defense	Xavier Cadet et.al.	2503.02780	null
2025-03-04	From Metaphor to Mechanism: How LLMs Decode Traditional Chinese Medicine Symbolic Language for Modern Clinical Relevance	Jiacheng Tang et.al.	2503.02760	null
2025-03-04	Consumption-portfolio choice with preferences for liquid assets	Guohui Guan et.al.	2503.02697	null
2025-03-04	Federated Learning for Privacy-Preserving Feedforward Control in Multi-Agent Systems	Jakob Weber et.al.	2503.02693	link
2025-03-04	FinArena: A Human-Agent Collaboration Framework for Financial Market Analysis and Forecasting	Congluo Xu et.al.	2503.02692	null
2025-03-04	MPO: Boosting LLM Agents with Meta Plan Optimization	Weimin Xiong et.al.	2503.02682	link
2025-03-04	Unique existence of solution and Hyers-Ulam stability for a new fractional differential quasi-variational inequality with Mittag-Leffler kernel and its applications	Zeng-bao Wu et.al.	2503.02669	null
2025-02-28	Hybrid Team Tetris: A New Platform For Hybrid Multi-Agent, Multi-Human Teaming	Kaleb Mcdowell et.al.	2502.21300	null
2025-02-28	Persuasion Should be Double-Blind: A Multi-Domain Dialogue Dataset With Faithfulness Based on Causal Theory of Mind	Dingyi Zhang et.al.	2502.21297	null
2025-02-28	ReaLJam: Real-Time Human-AI Music Jamming with Reinforcement Learning-Tuned Transformers	Alexander Scarlatos et.al.	2502.21267	null
2025-02-28	Towards Developing Ethical Reasoners: Integrating Probabilistic Reasoning and Decision-Making for Complex AI Systems	Nijesh Upreti et.al.	2502.21250	null
2025-02-28	A Method of Selective Attention for Reservoir Based Agents	Kevin McKee et.al.	2502.21229	null
2025-02-28	ARIES: Autonomous Reasoning with LLMs on Interactive Thought Graph Environments	Pedro Gimenes et.al.	2502.21208	null
2025-03-03	Scalable Decision-Making in Stochastic Environments through Learned Temporal Abstraction	Baiting Luo et.al.	2502.21186	link
2025-02-28	Reducing Reward Dependence in RL Through Adaptive Confidence Discounting	Muhammed Yusuf Satici et.al.	2502.21181	null
2025-02-28	Autonomous Curriculum Design via Relative Entropy Based Task Modifications	Muhammed Yusuf Satici et.al.	2502.21166	null
2025-02-28	Cryptis: Cryptographic Reasoning in Separation Logic	Arthur Azevedo de Amorim et.al.	2502.21156	null
2025-02-27	Point Policy: Unifying Observations and Actions with Key Points for Robot Manipulation	Siddhant Haldar et.al.	2502.20391	link
2025-02-27	Why Are Web AI Agents More Vulnerable Than Standalone LLMs? A Security Analysis	Jeffrey Yang Fan Chiang et.al.	2502.20383	null
2025-02-27	Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers	Shalev Lifshitz et.al.	2502.20379	null
2025-02-27	Multi-Agent Path Planning in Complex Environments using Gaussian Belief Propagation with Global Path Finding	Jens Høigaard Jensen et.al.	2502.20369	link
2025-02-27	Bridging Legal Knowledge and AI: Retrieval-Augmented Generation with Vector Stores, Knowledge Graphs, and Hierarchical Non-negative Matrix Factorization	Ryan C. Barron et.al.	2502.20364	link
2025-02-27	Trajectory-to-Action Pipeline (TAP): Automated Scenario Description Extraction for Autonomous Vehicle Behavior Comparison	Aron Harder et.al.	2502.20353	null
2025-02-27	Improving the Efficiency of a Deep Reinforcement Learning-Based Power Management System for HPC Clusters Using Curriculum Learning	Thomas Budiarjo et.al.	2502.20348	null
2025-02-27	Safety Representations for Safer Policy Learning	Kaustubh Mani et.al.	2502.20341	null
2025-02-27	Deep Reinforcement Learning based Autonomous Decision-Making for Cooperative UAVs: A Search and Rescue Real World Application	Thomas Hickling et.al.	2502.20326	null
2025-02-27	M^3Builder: A Multi-Agent System for Automated Machine Learning in Medical Imaging	Jinghao Feng et.al.	2502.20301	null
2025-02-26	Can Language Models Falsify? Evaluating Algorithmic Reasoning with Counterexample Creation	Shiven Sinha et.al.	2502.19414	link
2025-02-26	TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding	Max Ku et.al.	2502.19400	null
2025-02-26	Hybrid Robot Learning for Automatic Robot Motion Planning in Manufacturing	Siddharth Singh et.al.	2502.19340	null
2025-02-26	Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems	Hao Peng et.al.	2502.19328	link
2025-02-26	CoopDETR: A Unified Cooperative Perception Framework for 3D Detection via Object Query	Zhe Wang et.al.	2502.19313	null
2025-02-26	WOFOSTGym: A Crop Simulator for Learning Annual and Perennial Crop Management Strategies	William Solow et.al.	2502.19308	link
2025-02-26	Agent-centric Information Access	Evangelos Kanoulas et.al.	2502.19298	null
2025-02-26	CritiQ: Mining Data Quality Criteria from Human Preferences	Honglin Guo et.al.	2502.19279	null
2025-02-26	EMT: A Visual Multi-Task Benchmark Dataset for Autonomous Driving in the Arab Gulf Region	Nadya Abdel Madjid et.al.	2502.19260	link
2025-02-26	ProxyTransformation: Preshaping Point Cloud Manifold With Proxy Attention For 3D Visual Grounding	Qihang Peng et.al.	2502.19247	null
2025-02-25	FRIDA to the Rescue! Analyzing Synthetic Data Effectiveness in Object-Based Common Sense Reasoning for Disaster Response	Mollie Shichman et.al.	2502.18452	null
2025-02-25	MAPoRL: Multi-Agent Post-Co-Training for Collaborative Large Language Models with Reinforcement Learning	Chanwoo Park et.al.	2502.18439	null
2025-02-25	ToMCAT: Theory-of-Mind for Cooperative Agents in Teams via Multiagent Diffusion Policies	Pedro Sequeira et.al.	2502.18438	null
2025-02-25	CRESSim-MPM: A Material Point Method Library for Surgical Soft Body Simulation with Cutting and Suturing	Yafei Ou et.al.	2502.18437	null
2025-02-25	AgentRM: Enhancing Agent Generalization with Reward Modeling	Yu Xia et.al.	2502.18407	null
2025-02-25	Responsible AI Agents	Deven R. Desai et.al.	2502.18359	null
2025-02-25	WebGames: Challenging General-Purpose Web-Browsing AI Agents	George Thomas et.al.	2502.18356	link
2025-02-25	RefuteBench 2.0 – Agentic Benchmark for Dynamic Evaluation of LLM Responses to Refutation Instruction	Jianhao Yan et.al.	2502.18308	null
2025-02-25	Smart and Efficient IoT-Based Irrigation System Design: Utilizing a Hybrid Agent-Based and System Dynamics Approach	Taha Ahmadi Pargo et.al.	2502.18298	null
2025-02-25	A Competitive Posted-Price Mechanism for Online Budget-Feasible Auctions	Andreas Charalampopoulos et.al.	2502.18265	null
2025-02-24	Event-Based Limit Order Book Simulation under a Neural Hawkes Process: Application in Market-Making	Luca Lalor et.al.	2502.17417	null
2025-02-24	Distributed Coordination for Heterogeneous Non-Terrestrial Networks	Jikang Deng et.al.	2502.17366	null
2025-02-24	Turning Conversations into Workflows: A Framework to Extract and Evaluate Dialog Workflows for Service AI Agents	Prafulla Kumar Choubey et.al.	2502.17321	null
2025-02-24	Survey on Strategic Mining in Blockchain: A Reinforcement Learning Approach	Jichen Li et.al.	2502.17307	null
2025-02-24	IGDA: Interactive Graph Discovery through Large Language Model Agents	Alex Havrilla et.al.	2502.17189	null
2025-02-24	Teleology-Driven Affective Computing: A Causal Framework for Sustained Well-Being	Bin Yin et.al.	2502.17172	null
2025-02-24	A Novel Multiple Access Scheme for Heterogeneous Wireless Communications using Symmetry-aware Continual Deep Reinforcement Learning	Hamidreza Mazandarani et.al.	2502.17167	null
2025-02-24	Semantic-Aware Dynamic and Distributed Power Allocation: a Multi-UAV Area Coverage Use Case	Hamidreza Mazandarani et.al.	2502.17120	null
2025-02-24	Mobile-Agent-V: Learning Mobile Device Operation Through Video-Guided Multi-Agent Collaboration	Junyang Wang et.al.	2502.17110	null
2025-02-24	Generative Models in Decision Making: A Survey	Yinchuan Li et.al.	2502.17100	null
2025-02-21	AutoToM: Automated Bayesian Inverse Planning and Model Discovery for Open-ended Theory of Mind	Zhining Zhang et.al.	2502.15676	link
2025-02-21	Multi-Agent Architecture in Distributed Environment Control Systems: vision, challenges, and opportunities	Natasha Astudillo et.al.	2502.15663	null
2025-02-21	Automating Curriculum Learning for Reinforcement Learning using a Skill-Based Bayesian Network	Vincent Hsiao et.al.	2502.15662	null
2025-02-21	Superintelligent Agents Pose Catastrophic Risks: Can Scientist AI Offer a Safer Path?	Yoshua Bengio et.al.	2502.15657	null
2025-02-21	A Simulation Pipeline to Facilitate Real-World Robotic Reinforcement Learning Applications	Jefferson Silveira et.al.	2502.15649	null
2025-02-21	WorldCraft: Photo-Realistic 3D World Creation and Customization via LLM Agents	Xinhang Liu et.al.	2502.15601	null
2025-02-21	SOTOPIA-Ω: Dynamic Strategy Injection Learning and Social Instrucion Following Evaluation for Social Agents	Wenyuan Zhang et.al.	2502.15538	link
2025-02-21	Contract DesignUnderApproximate Best Responses	Francesco Bacchiocchi et.al.	2502.15523	null
2025-02-21	SALSA-RL: Stability Analysis in the Latent Space of Actions for Reinforcement Learning	Xuyang Li et.al.	2502.15512	null
2025-02-21	Construction and Evaluation of LLM-based agents for Semi-Autonomous penetration testing	Masaya Kobayashi et.al.	2502.15506	null
2025-02-20	GATE: Graph-based Adaptive Tool Evolution Across Diverse Tasks	Jianwen Luo et.al.	2502.14848	link
2025-02-20	Red-Teaming LLM Multi-Agent Systems via Communication Attacks	Pengfei He et.al.	2502.14847	null
2025-02-20	Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation	Yue Yang et.al.	2502.14846	null
2025-02-20	Learning from Reward-Free Offline Data: A Case for Planning with Latent Dynamics Models	Vlad Sobal et.al.	2502.14819	null
2025-02-20	Optimizing Model Selection for Compound AI Systems	Lingjiao Chen et.al.	2502.14815	link
2025-02-20	Byzantine Game Theory: Sun Tzus Boxes	Andrei Constantinescu et.al.	2502.14812	null
2025-02-20	Planning, scheduling, and execution on the Moon: the CADRE technology demonstration mission	Gregg Rabideau et.al.	2502.14803	null
2025-02-20	A Multi-Agent Perspective on Modern Information Retrieval	Haya Nachimovsky et.al.	2502.14796	null
2025-02-20	Making Universal Policies Universal	Niklas Höpner et.al.	2502.14777	link
2025-02-20	Tree-of-Debate: Multi-Persona Debate Trees Elicit Critical Thinking for Scientific Comparative Analysis	Priyanka Kargupta et.al.	2502.14767	link
2025-02-19	Autellix: An Efficient Serving Engine for LLM Agents as General Programs	Michael Luo et.al.	2502.13965	null
2025-02-19	LIDDIA: Language-based Intelligent Drug Discovery Agent	Reza Averly et.al.	2502.13959	null
2025-02-19	RAG-Gym: Optimizing Reasoning and Search Agents with Process Supervision	Guangzhi Xiong et.al.	2502.13957	null
2025-02-19	Qwen2.5-VL Technical Report	Shuai Bai et.al.	2502.13923	null
2025-02-19	Exploring Personalized Health Support through Data-Driven, Theory-Guided LLMs: A Case Study in Sleep Health	Xingbo Wang et.al.	2502.13920	link
2025-02-19	DataSciBench: An LLM Agent Benchmark for Data Science	Dan Zhang et.al.	2502.13897	link
2025-02-19	NavigateDiff: Visual Predictors are Zero-Shot Navigation Assistants	Yiran Qin et.al.	2502.13894	null
2025-02-19	Enhancing Cross-Domain Recommendations with Memory-Optimized LLM-Based User Agents	Jiahao Liu et.al.	2502.13843	link
2025-02-19	ArtMentor: AI-Assisted Evaluation of Artworks to Explore Multimodal Large Language Models Capabilities	Chanjin Zheng et.al.	2502.13832	link
2025-02-19	Learning to explore when mistakes are not allowed	Charly Pecqueux-Guézénec et.al.	2502.13801	null
2025-02-18	AIDE: AI-Driven Exploration in the Space of Code	Zhengyao Jiang et.al.	2502.13138	link
2025-02-18	Sleepless Nights, Sugary Days: Creating Synthetic Users with Health Conditions for Realistic Coaching Agent Interactions	Taedong Yun et.al.	2502.13135	null
2025-02-18	Magma: A Foundation Model for Multimodal AI Agents	Jianwei Yang et.al.	2502.13130	link
2025-02-18	Facilitating Long Context Understanding via Supervised Chain-of-Thought Reasoning	Jingyang Lin et.al.	2502.13127	null
2025-02-18	Approximately Efficient Bilateral Trade with Samples	Yuan Deng et.al.	2502.13122	null
2025-02-18	Text2World: Benchmarking Large Language Models for Symbolic World Model Generation	Mengkang Hu et.al.	2502.13092	null
2025-02-18	Interactive Agents to Overcome Ambiguity in Software Engineering	Sanidhya Vijayvargiya et.al.	2502.13069	link
2025-02-18	Improved Fine-Tuning of Large Multimodal Models for Hateful Meme Detection	Jingbiao Mei et.al.	2502.13061	link
2025-02-18	AEIA-MN: Evaluating the Robustness of Multimodal LLM-Powered Mobile Agents Against Active Environmental Injection Attacks	Yurun Chen et.al.	2502.13053	null
2025-02-18	Agentic Deep Graph Reasoning Yields Self-Organizing Knowledge Networks	Markus J. Buehler et.al.	2502.13025	link
2025-02-17	HARBOR: Exploring Persona Dynamics in Multi-Agent Competition	Kenan Jiang et.al.	2502.12149	null
2025-02-17	Scaling Autonomous Agents via Automatic Reward Modeling And Planning	Zhenfang Chen et.al.	2502.12130	null
2025-02-17	A-MEM: Agentic Memory for LLM Agents	Wujiang Xu et.al.	2502.12110	link
2025-02-17	Relational Norms for Human-AI Cooperation	Brian D. Earp et.al.	2502.12102	null
2025-02-17	A Study on Leveraging Search and Self-Feedback for Agent Reasoning	Karthikeyan K et.al.	2502.12094	null
2025-02-17	Can LLMs Simulate Social Media Engagement? A Study on Action-Guided Response Generation	Zhongyi Qiu et.al.	2502.12073	null
2025-02-17	A survey about perceptions of mobility to inform an agent-based simulator of subjective modal choice	Carole Adam et.al.	2502.12058	null
2025-02-17	Multi-agent coordination via communication partitions	Wei-Chen Lee et.al.	2502.12042	null
2025-02-17	Machine Learning Should Maximize Welfare, Not (Only) Accuracy	Nir Rosenfeld et.al.	2502.11981	null
2025-02-17	FitLight: Federated Imitation Learning for Plug-and-Play Autonomous Traffic Signal Control	Yutong Ye et.al.	2502.11937	null
2025-02-14	Representation and Interpretation in Artificial and Natural Computing	Luis A. Pineda et.al.	2502.10383	null
2025-02-14	Agentic Verification for Ambiguous Query Disambiguation	Youngwon Lee et.al.	2502.10352	null
2025-02-14	Process Reward Models for LLM Agents: Practical Framework and Directions	Sanjiban Choudhury et.al.	2502.10325	link
2025-02-14	Reinforcement Learning in Strategy-Based and Atari Games: A Review of Google DeepMinds Innovations	Abdelrhman Shaheen et.al.	2502.10303	null
2025-02-14	Large Language Models and Synthetic Data for Monitoring Dataset Mentions in Research Papers	Aivin V. Solatorio et.al.	2502.10263	link
2025-02-14	Learning to Solve the Min-Max Mixed-Shelves Picker-Routing Problem via Hierarchical and Parallel Decoding	Laurin Luttmann et.al.	2502.10233	link
2025-02-14	A Multiagent Path Search Algorithm for Large-Scale Coalition Structure Generation	Redha Taguelmimt et.al.	2502.10226	null
2025-02-14	Do Large Language Models Reason Causally Like Us? Even Better?	Hanna M. Dettki et.al.	2502.10215	null
2025-02-14	Dynamic Reinforcement Learning for Actors	Katsunari Shibata et.al.	2502.10200	null
2025-02-14	Reinforcement Learning based Constrained Optimal Control: an Interpretable Reward Design	Jingjie Ni et.al.	2502.10187	null
2025-02-13	Do LLMs Recognize Your Preferences? Evaluating Personalized Preference Following in LLMs	Siyan Zhao et.al.	2502.09597	link
2025-02-13	KIMAs: A Configurable Knowledge Integrated Multi-Agent System	Zitao Li et.al.	2502.09596	null
2025-02-13	Rolling Ahead Diffusion for Traffic Scene Simulation	Yunpeng Liu et.al.	2502.09587	null
2025-02-13	Learning to Coordinate with Experts	Mohamad H. Danesh et.al.	2502.09583	link
2025-02-13	Polymind: Parallel Visual Diagramming with Large Language Models to Support Prewriting Through Microtasks	Qian Wan et.al.	2502.09577	null
2025-02-13	MDCrow: Automating Molecular Dynamics Workflows with Large Language Models	Quintina Campbell et.al.	2502.09565	link
2025-02-13	EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents	Rui Yang et.al.	2502.09560	null
2025-02-13	Mind the Gap! Choice Independence in Using Multilingual LLMs for Persuasive Co-Writing Tasks in Different Languages	Shreyan Biswas et.al.	2502.09532	null
2025-02-13	Exact Leader Estimation: A New Approach for Distributed Differentiation	Rodrigo Aldana-Lopez et.al.	2502.09529	null
2025-02-13	Forward-backward Contention Resolution Schemes for Fair Rationing	Will Ma et.al.	2502.09521	null
2025-02-12	Poly-Autoregressive Prediction for Modeling Interactions	Neerja Thakkar et.al.	2502.08646	null
2025-02-12	Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs	Mantas Mazeika et.al.	2502.08640	null
2025-02-12	SPeCtrum: A Grounded Framework for Multidimensional Identity Representation in LLM-Based Agent	Keyeun Lee et.al.	2502.08599	link
2025-02-12	Learning in Markets with Heterogeneous Agents: Dynamics and Survival of Bayesian vs. No-Regret Learners	David Easley et.al.	2502.08597	null
2025-02-12	Commercial LLM Agents Are Already Vulnerable to Simple Yet Dangerous Attacks	Ang Li et.al.	2502.08586	null
2025-02-12	Statistically validated projection of bipartite signed networks	Anna Gallo et.al.	2502.08567	null
2025-02-12	Human-Centric Foundation Models: Perception, Generation and Agentic Modeling	Shixiang Tang et.al.	2502.08556	link
2025-02-12	Extreme vulnerability to intruder attacks destabilizes network dynamics	Amirhossein Nazerian et.al.	2502.08552	null
2025-02-12	Faithful, Unfaithful or Ambiguous? Multi-Agent Debate with Initial Stance for Summary Evaluation	Mahnaz Koupaee et.al.	2502.08514	link
2025-02-12	Resilient Quantized Consensus in Multi-Hop Relay Networks	Liwei Yuan et.al.	2502.08455	null
2025-02-11	MAGELLAN: Metacognitive predictions of learning progress guide autotelic LLM agents in large goal spaces	Loris Gaven et.al.	2502.07709	link
2025-02-11	Human Decision-making is Susceptible to AI-driven Manipulation	Sahand Sabour et.al.	2502.07663	link
2025-02-11	Robust-Sorting and Applications to Ulam-Median	Ragesh Jaiswal et.al.	2502.07653	null
2025-02-11	Distributed Value Decomposition Networks with Networked Agents	Guilherme S. Varela et.al.	2502.07635	null
2025-02-11	Decision-Making Under Complete Uncertainty: You Will Regret Not Being Greedy	Kristijan Atanasov et.al.	2502.07593	null
2025-02-11	DMWM: Dual-Mind World Model with Long-Term Imagination	Lingyi Wang et.al.	2502.07591	null
2025-02-11	Pure $ε$ -equilibrium in random games	Bary S. R. Pradelski et.al.	2502.07585	null
2025-02-11	Genetic evolution of a multi-generational population in the context of interstellar space travels – Part II: Phenotypic effects of gene expression	Frédéric Marin et.al.	2502.07559	null
2025-02-11	Unsupervised Translation of Emergent Communication	Ido Levy et.al.	2502.07552	null
2025-02-11	A Near-optimal, Scalable and Corruption-tolerant Framework for Stochastic Bandits: From Single-Agent to Multi-Agent and Beyond	Zicheng Hu et.al.	2502.07514	null
2025-02-10	Visual Agentic AI for Spatial Reasoning with a Dynamic API	Damiano Marsili et.al.	2502.06787	null
2025-02-10	Towards Internet-Scale Training For Agents	Brandon Trabucco et.al.	2502.06776	null
2025-02-10	Distributed Constraint-Coupled Optimization: Harnessing ADMM-consensus for robustness	Mohamed Abdelmouamin Messilem et.al.	2502.06763	null
2025-02-10	Incentivizing Desirable Effort Profiles in Strategic Classification: The Role of Causality and Uncertainty	Valia Efthymiou et.al.	2502.06749	null
2025-02-10	Institutional Preferences in the Laboratory	Qiankun Zhong et.al.	2502.06748	null
2025-02-10	Wandering around: A bioinspired approach to visual attention through object motion sensitivity	Giulia D Angelo et.al.	2502.06747	link
2025-02-10	AgilePilot: DRL-Based Drone Agent for Real-Time Motion Planning in Dynamic Environments by Leveraging Object Detection	Roohan Ahmed Khan et.al.	2502.06725	null
2025-02-10	Transfer Your Perspective: Controllable 3D Generation from Any Viewpoint in a Driving Scene	Tai-Yu Pan et.al.	2502.06682	null
2025-02-10	Quantile Multi-Armed Bandits with 1-bit Feedback	Ivan Lau et.al.	2502.06678	null
2025-02-10	Unbiased Evaluation of Large Language Models from a Causal Perspective	Meilin Chen et.al.	2502.06655	null
2025-02-07	Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuray	Yunhang Shen et.al.	2502.05177	link
2025-02-07	MELON: Indirect Prompt Injection Defense via Masked Re-execution and Tool Comparison	Kaijie Zhu et.al.	2502.05174	link
2025-02-07	From Restless to Contextual: A Thresholding Bandit Approach to Improve Finite-horizon Performance	Jiamin Xu et.al.	2502.05145	link
2025-02-07	Maximin Share Guarantees for Few Agents with Subadditive Valuations	George Christodoulou et.al.	2502.05141	null
2025-02-07	Joint TITE-CRM for Dual Agent Dose Finding Studies	Helen Barnett et.al.	2502.05072	null
2025-02-07	Exploring the Generalizability of Geomagnetic Navigation: A Deep Reinforcement Learning approach with Policy Distillation	Wenqi Bai et.al.	2502.05069	null
2025-02-07	nvAgent: Automated Data Visualization from Natural Language via Collaborative Agent Workflow	Geliang Ouyang et.al.	2502.05036	link
2025-02-07	Near-Optimal Online Learning for Multi-Agent Submodular Coordination: Tight Approximation and Communication Efficiency	Qixin Zhang et.al.	2502.05028	null
2025-02-07	Seasonal Station-Keeping of Short Duration High Altitude Balloons using Deep Reinforcement Learning	Tristan K. Schuler et.al.	2502.05014	null
2025-02-07	The Rising Threat to Emerging AI-Powered Search Engines	Zeren Luo et.al.	2502.04951	null
2025-02-06	ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization	Yinjie Wang et.al.	2502.04306	link
2025-02-06	Mutual Multilinearity of Nonequilibrium Network Currents	Sara Dal Cengio et.al.	2502.04298	null
2025-02-06	DECAF: Learning to be Fair in Multi-agent Resource Allocation	Ashwin Kumar et.al.	2502.04281	null
2025-02-06	Free Energy Risk Metrics for Systemically Safe AI: Gatekeeping Multi-Agent Study	Michael Walters et.al.	2502.04249	null
2025-02-06	Multi-agent Architecture Search via Agentic Supernet	Guibin Zhang et.al.	2502.04180	link
2025-02-06	Dense Fixed-Wing Swarming using Receding-Horizon NMPC	Varun Madabushi et.al.	2502.04174	null
2025-02-06	Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement Learning	Wesley A. Suttle et.al.	2502.04141	null
2025-02-06	Beyond the Final Layer: Hierarchical Query Fusion Transformer with Agent-Interpolation Initialization for 3D Instance Segmentation	Jiahao Lu et.al.	2502.04139	null
2025-02-06	VTutor: An Open-Source SDK for Generative AI-Powered Animated Pedagogical Agents with Multi-Media Output	Eason Chen et.al.	2502.04103	null
2025-02-06	Strategic Learning with Local Explanations as Feedback	Kiet Q. H. Vo et.al.	2502.04058	null
2025-02-05	A Schema-Guided Reason-while-Retrieve framework for Reasoning on Scene Graphs with Large-Language-Models (LLMs)	Yiye Chen et.al.	2502.03450	null
2025-02-05	Prediction of the Most Fire-Sensitive Point in Building Structures with Differentiable Agents for Thermal Simulators	Yuan Xinjie et.al.	2502.03424	null
2025-02-05	Energy-Efficient Flying LoRa Gateways: A Multi-Agent Reinforcement Learning Approach	Abdullahi Isa Ahmed et.al.	2502.03377	null
2025-02-05	Learning from Active Human Involvement through Proxy Value Propagation	Zhenghao Peng et.al.	2502.03369	null
2025-02-05	PalimpChat: Declarative and Interactive AI analytics	Chunwei Liu et.al.	2502.03368	null
2025-02-05	Inverse Mixed Strategy Games with Generative Trajectory Models	Max Muchen Sun et.al.	2502.03356	null
2025-02-05	Implicit Communication in Human-Robot Collaborative Transport	Elvin Yang et.al.	2502.03346	link
2025-02-05	Actions Speak Louder Than Words: Rate-Reward Trade-off in Markov Decision Processes	Haotian Wu et.al.	2502.03335	null
2025-02-05	SymAgent: A Neural-Symbolic Self-Learning Agent Framework for Complex Reasoning over Knowledge Graphs	Ben Liu et.al.	2502.03283	null
2025-02-05	Modeling and Optimization of Insulin Injection for Type-1 Diabetes Mellitus Management	Rinrada Jadsadaphongphaibool et.al.	2502.03269	null
2025-02-04	QLASS: Boosting Language Agent Inference via Q-Guided Stepwise Search	Zongyu Lin et.al.	2502.02584	link
2025-02-04	Decision Theoretic Foundations for Conformal Prediction: Optimal Uncertainty Quantification for Risk-Averse Agents	Shayan Kiyani et.al.	2502.02561	null
2025-02-04	AAD-DCE: An Aggregated Multimodal Attention Mechanism for Early and Late Dynamic Contrast Enhanced Prostate MRI Synthesis	Divya Bharti et.al.	2502.02555	link
2025-02-04	Uncertainty Quantification for Collaborative Object Detection Under Adversarial Attacks	Huiqun Huang et.al.	2502.02537	null
2025-02-04	Adaptive Self-improvement LLM Agentic System for ML Library Development	Genghan Zhang et.al.	2502.02534	link
2025-02-04	Multi-Agent Design: Optimizing Agents with Better Prompts and Topologies	Han Zhou et.al.	2502.02533	null
2025-02-04	Why human-AI relationships need socioaffective alignment	Hannah Rose Kirk et.al.	2502.02528	null
2025-02-04	The Cost Perspective of Liquid Democracy: Feasibility and Control	Shiri Alouf-Heffetz et.al.	2502.02380	null
2025-02-04	Mirai: A Wearable Proactive AI “Inner-Voice” for Contextual Nudging	Cathy Mengying Fang et.al.	2502.02370	null
2025-02-04	MAGNNET: Multi-Agent Graph Neural Network-based Efficient Task Allocation for Autonomous Vehicles with Deep Reinforcement Learning	Lavanya Ratnabala et.al.	2502.02311	null
2025-01-31	Vintix: Action Model via In-Context Reinforcement Learning	Andrey Polubarov et.al.	2501.19400	link
2025-01-31	Do LLMs Strategically Reveal, Conceal, and Infer Information? A Theoretical and Empirical Analysis in The Chameleon Game	Mustafa O. Karabag et.al.	2501.19398	link
2025-01-31	Learning Contracts in Hierarchical Multi-Agent Systems	Antoine Scheid et.al.	2501.19388	null
2025-01-31	The Physics and Metaphysics of Social Powers: Bridging Cognitive Processing and Social Dynamics, a New Perspective on Power through Active Inference	Mahault Albarracin et.al.	2501.19368	null
2025-01-31	PixelWorld: Towards Perceiving Everything as Pixels	Zhiheng Lyu et.al.	2501.19339	null
2025-01-31	MINDSTORES: Memory-Informed Neural Decision Synthesis for Task-Oriented Reinforcement in Embodied Systems	Anirudh Chari et.al.	2501.19318	null
2025-01-31	Objective Metrics for Human-Subjects Evaluation in Explainable Reinforcement Learning	Balint Gyevnar et.al.	2501.19256	null
2025-02-03	SHARPIE: A Modular Framework for Reinforcement Learning and Human-AI Interaction Experiments	Hüseyin Aydın et.al.	2501.19245	link
2025-01-31	Multi-agent Multi-armed Bandit with Fully Heavy-tailed Dynamics	Xingyu Wang et.al.	2501.19239	null
2025-01-31	A parallelizable variant of HCA*	Sreenivasan Ganti et.al.	2501.19218	null
2025-01-30	Can we Retrieve Everything All at Once? ARM: An Alignment-Oriented LLM-based Retrieval Method	Peter Baile Chen et.al.	2501.18539	null
2025-01-30	Design and Validation of Learning Aware HMI For Learning-Enabled Increasingly Autonomous Systems	Parth Ganeriwala et.al.	2501.18506	null
2025-01-30	Graph Exploration with Edge Weight Estimates	Matthias Gehnen et.al.	2501.18496	null
2025-01-30	Conversation Games and a Strategic View of the Turing Test	Kaveh Aryan et.al.	2501.18455	null
2025-01-30	Stable Marriage: Loyalty vs. Competition	Amit Ronen et.al.	2501.18442	null
2025-01-30	Gravity-Bench-v1: A Benchmark on Gravitational Physics Discovery for Agents	Nolan Koblischke et.al.	2501.18411	null
2025-01-30	Leveraging LLM Agents for Automated Optimization Modeling for SASP Problems: A Graph-RAG based Approach	Tianpeng Pan et.al.	2501.18320	null
2025-01-30	Model-Free RL Agents Demonstrate System 1-Like Intentionality	Hal Ashton et.al.	2501.18299	null
2025-01-30	CueTip: An Interactive and Explainable Physics-aware Pool Assistant	Sean Memery et.al.	2501.18291	null
2025-01-30	Economic Rationality under Specialization: Evidence of Decision Bias in AI Agents	ShuiDe Wen et.al.	2501.18190	null
2025-01-29	From Sparse to Dense: Toddler-inspired Reward Transition in Goal-Oriented Reinforcement Learning	Junseok Park et.al.	2501.17842	null
2025-01-29	A note on the Cucker-Smale model with time delay and communication failures	Elisa Continelli et.al.	2501.17743	null
2025-01-29	RICoTA: Red-teaming of In-the-wild Conversation with Test Attempts	Eujeong Choi et.al.	2501.17715	link
2025-01-29	Inferring Implicit Goals Across Differing Task Models	Silvia Tulli et.al.	2501.17704	null
2025-01-29	CAMP in the Odyssey: Provably Robust Reinforcement Learning with Certified Radius Maximization	Derui Wang et.al.	2501.17667	link
2025-01-29	Multi-Agent Path Finding Using Conflict-Based Search and Structural-Semantic Topometric Maps	Scott Fredriksson et.al.	2501.17661	null
2025-01-29	Coalitional control: a bottom-up approach	Filiberto Fele et.al.	2501.17614	null
2025-01-29	Coalitional model predictive control of an irrigation canal	Filiberto Fele et.al.	2501.17561	null
2025-01-29	Is Conversational XAI All You Need? Human-AI Decision Making With a Conversational XAI Assistant	Gaole He et.al.	2501.17546	link
2025-01-29	Sequential Learning of the Pareto Front for Multi-objective Bandits	Elise Crépon et.al.	2501.17513	link
2025-01-28	Evidence on the Regularisation Properties of Maximum-Entropy Reinforcement Learning	Rémy Hosseinkhan Boucher et.al.	2501.17115	null
2025-01-28	CRSet: Non-Interactive Verifiable Credential Revocation with Metadata Privacy for Issuers and Everyone Else	Felix Hoops et.al.	2501.17089	null
2025-01-28	Learning Mean Field Control on Sparse Graphs	Christian Fabian et.al.	2501.17079	null
2025-01-28	Induced Modularity and Community Detection for Functionally Interpretable Reinforcement Learning	Anna Soligo et.al.	2501.17077	null
2025-01-28	Context is Key in Agent Security	Lillian Tsai et.al.	2501.17070	null
2025-01-28	Revisit Mixture Models for Multi-Agent Simulation: Experimental Study within a Unified Framework	Longzhong Lin et.al.	2501.17015	null
2025-01-28	Towards Open-Source and Modular Space Systems with ATMOS	Pedro Roque et.al.	2501.16973	link
2025-01-28	Heterogeneity-aware Personalized Federated Learning via Adaptive Dual-Agent Reinforcement Learning	Xi Chen et.al.	2501.16966	null
2025-01-28	ToolFactory: Automating Tool Generation by Leveraging LLM to Understand REST API Documentations	Xinyi Ni et.al.	2501.16945	null
2025-01-28	Beyond Human Intervention: Algorithmic Collusion through Multi-Agent Learning Strategies	Suzie Grondin et.al.	2501.16935	null
2025-01-27	LUCY: Linguistic Understanding and Control Yielding Early Stage of Her	Heting Gao et.al.	2501.16327	link
2025-01-27	Privacy-aware Nash Equilibrium Synthesis with Partially Ordered LTL $_f$ Objectives	Caleb Probine et.al.	2501.16307	null
2025-01-27	Multi-Agent Geospatial Copilots for Remote Sensing Workflows	Chaehong Lee et.al.	2501.16254	null
2025-01-27	Will Systems of LLM Agents Cooperate: An Investigation into a Social Dilemma	Richard Willis et.al.	2501.16173	link
2025-01-27	AI Agents for Computer Use: A Review of Instruction-based Computer Control, GUI Automation, and Operator Assistants	Pascal J. Sager et.al.	2501.16150	null
2025-01-27	Quantifying the Self-Interest Level of Markov Social Dilemmas	Richard Willis et.al.	2501.16138	null
2025-01-27	Multi-Agent Meta-Offline Reinforcement Learning for Timely UAV Path Planning and Data Collection	Eslam Eldeeb et.al.	2501.16098	null
2025-01-27	Galaxy Era: Agent-based Simulation of Execution Tickets	Pascal Stichler et.al.	2501.16090	link
2025-01-27	Value-oriented forecast reconciliation for renewables in electricity markets	Honglin Wen et.al.	2501.16086	null
2025-01-27	Generating Spatial Synthetic Populations Using Wasserstein Generative Adversarial Network: A Case Study with EU-SILC Data for Helsinki and Thessaloniki	Vanja Falck et.al.	2501.16080	null
2025-01-24	An Attentive Graph Agent for Topology-Adaptive Cyber Defence	Ilya Orson Sandoval et.al.	2501.14700	link
2025-01-24	The Division of Surplus and the Burden of Proof	Deniz Kattwinkel et.al.	2501.14686	null
2025-01-24	MedAgentBench: Dataset for Benchmarking LLMs as Agents in Medical Applications	Yixing Jiang et.al.	2501.14654	link
2025-01-24	Whisper D-SGD: Correlated Noise Across Agents for Differentially Private Decentralized Learning	Angelo Rodio et.al.	2501.14644	link
2025-01-24	Fair Division Beyond Monotone Valuations	Siddharth Barman et.al.	2501.14609	null
2025-01-24	Hybrid Quantum-Classical Multi-Agent Pathfinding	Thore Gerlach et.al.	2501.14568	null
2025-01-24	Reducing Action Space for Deep Reinforcement Learning via Causal Effect Estimation	Wenzhang Liu et.al.	2501.14543	link
2025-01-24	Breaking the Pre-Planning Barrier: Real-Time Adaptive Coordination of Mission and Charging UAVs Using Graph Reinforcement Learning	Yuhan Hu et.al.	2501.14488	null
2025-01-24	Avoiding Overfitting in Variable-Order Markov Models: a Cross-Validation Approach	Valeria Secchini et.al.	2501.14476	null
2025-01-24	The Pseudo-Dimension of Contracts	Paul Duetting et.al.	2501.14474	null
2025-01-23	GUI-Bee: Align GUI Action Grounding to Novel Environments via Autonomous Exploration	Yue Fan et.al.	2501.13896	null
2025-01-23	Utilizing Evolution Strategies to Train Transformers in Reinforcement Learning	Matyáš Lorenc et.al.	2501.13883	link
2025-01-23	Eye Gaze as a Signal for Conveying User Attention in Contextual AI Systems	Ethan Wilson et.al.	2501.13878	null
2025-01-23	EICopilot: Search and Explore Enterprise Information over Large-scale Knowledge Graphs with LLM-driven Agents	Yuhui Yun et.al.	2501.13746	null
2025-01-23	Scalable Safe Multi-Agent Reinforcement Learning for Multi-Agent System	Haikuo Du et.al.	2501.13727	link
2025-01-23	A Non-Parametric Approach to Heterogeneity Analysis	Avner Seror et.al.	2501.13721	null
2025-01-23	Revisiting Online Learning Approach to Inverse Linear Optimization: A Fenchel–Young Loss Perspective and Gap-Dependent Regret Analysis	Shinsaku Sakaue et.al.	2501.13648	null
2025-01-23	WFCRL: A Multi-Agent Reinforcement Learning Benchmark for Wind Farm Control	Claire Bizon Monroc et.al.	2501.13592	link
2025-01-23	Explainable AI-aided Feature Selection and Model Reduction for DRL-based V2X Resource Allocation	Nasir Khan et.al.	2501.13552	null
2025-01-23	Towards a Theory of AI Personhood	Francis Rhys Ward et.al.	2501.13533	null
2025-01-22	Boosting MCTS with Free Energy Minimization	Mawaba Pascal Dao et.al.	2501.13083	null
2025-01-22	Refining Input Guardrails: Enhancing LLM-as-a-Judge Efficiency Through Chain-of-Thought Fine-Tuning and Alignment	Melissa Kazemi Rad et.al.	2501.13080	null
2025-01-22	Evolution and The Knightian Blindspot of Machine Learning	Joel Lehman et.al.	2501.13075	null
2025-01-22	Optimizing Return Distributions with Distributional Dynamic Programming	Bernardo Ávila Pires et.al.	2501.13028	null
2025-01-22	The regret lower bound for communicating Markov Decision Processes	Victor Boone et.al.	2501.13013	null
2025-01-22	MONA: Myopic Optimization with Non-myopic Approval Can Mitigate Multi-step Reward Hacking	Sebastian Farquhar et.al.	2501.13011	null
2025-01-22	Constructive characterisations of the must-preorder for asynchrony	Giovanni Bernardi et.al.	2501.13002	link
2025-01-22	An Offline Multi-Agent Reinforcement Learning Framework for Radio Resource Management	Eslam Eldeeb et.al.	2501.12991	null
2025-01-22	Learning-based Distributed Model Predictive Control using Multi-Agent Bayesian Optimization	Hossein Nejatbakhsh Esfahani et.al.	2501.12989	null
2025-01-22	Quantification of Ultrafast Nonlinear Photothermal and Photoacoustic Effects in Molecular Thin Films via Time-Domain Brillouin Scattering	Valentin Cherruault et.al.	2501.12912	null
2025-01-21	Expertise elevates AI usage: experimental evidence comparing laypeople and professional artists	Thomas F. Eisenmann et.al.	2501.12374	link
2025-01-21	UI-TARS: Pioneering Automated GUI Interaction with Native Agents	Yujia Qin et.al.	2501.12326	link
2025-01-21	Transitions to synchronization in adaptive multilayer networks with higher-order interactions	Richita Ghosh et.al.	2501.12301	null
2025-01-21	mmCooper: A Multi-agent Multi-stage Communication-efficient and Collaboration-robust Cooperative Perception Framework	Bingyi Liu et.al.	2501.12263	null
2025-01-21	Multi-Agent Feedback Motion Planning using Probably Approximately Correct Nonlinear Model Predictive Control	Mark Gonzales et.al.	2501.12234	null
2025-01-21	Empower Healthcare through a Self-Sovereign Identity Infrastructure for Secure Electronic Health Data Access	Antonio López Martínez et.al.	2501.12229	null
2025-01-21	Convergence of time-delayed opinion dynamics with complex interaction types	Lingling Yao et.al.	2501.12219	null
2025-01-21	RL-RC-DoT: A Block-level RL agent for Task-Aware Video Compression	Uri Gadot et.al.	2501.12216	null
2025-01-21	Experience-replay Innovative Dynamics	Tuo Zhang et.al.	2501.12199	null
2025-01-21	Opinion dynamics in bounded confidence models with manipulative agents: Moving the Overton window	A. Bautista et.al.	2501.12198	null
2025-01-17	Agent4Edu: Generating Learner Response Data by Generative Agents for Intelligent Education Systems	Weibo Gao et.al.	2501.10332	link
2025-01-17	Towards Human-Guided, Data-Centric LLM Co-Pilots	Evgeny Saveliev et.al.	2501.10321	null
2025-01-17	Towards Preventing Overreliance on Task-Oriented Conversational AI Through Accountability Modeling	Suvodip Dey et.al.	2501.10316	link
2025-01-17	Enhancing AI Transparency: XRL-Based Resource Management and RAN Slicing for 6G ORAN Architecture	Suvidha Mhatre et.al.	2501.10292	null
2025-01-17	Evidence for the gravity-driven and magnetically-regularized gas flows feeding the massive protostellar cluster in Cep A	Panigrahy Sandhyarani et.al.	2501.10280	null
2025-01-17	Grey-Box Fuzzing in Constrained Ultra-Large Systems: Lessons for SE Community	Jiazhao Yu et.al.	2501.10269	null
2025-01-17	Deployment of an Aerial Multi-agent System for Automated Task Execution in Large-scale Underground Mining Environments	Niklas Dahlquist et.al.	2501.10262	null
2025-01-17	Logarithmic Regret for Nonlinear Control	James Wang et.al.	2501.10261	null
2025-01-17	Secure Semantic Communication With Homomorphic Encryption	Rui Meng et.al.	2501.10182	null
2025-01-17	PaSa: An LLM Agent for Comprehensive Academic Paper Search	Yichen He et.al.	2501.10120	link
2025-01-16	CyberMentor: AI Powered Learning Tool Platform to Address Diverse Student Needs in Cybersecurity Education	Tianyu Wang et.al.	2501.09709	link
2025-01-16	The Goofus & Gallant Story Corpus for Practical Value Alignment	Md Sultan Al Nahian et.al.	2501.09707	null
2025-01-16	Authenticated Delegation and Authorized AI Agents	Tobin South et.al.	2501.09674	null
2025-01-16	NS-Gym: Open-Source Simulation Environments and Benchmarks for Non-Stationary Markov Decision Processes	Nathaniel S. Keplinger et.al.	2501.09646	link
2025-01-16	Empowering Large Language Models in Wireless Communication: A Novel Dataset and Fine-Tuning Framework	Yushen Lin et.al.	2501.09631	null
2025-01-16	A Multi-agent System for Hybrid Optimization	Eric S. Fraga et.al.	2501.09563	null
2025-01-16	Solving the unsolvable: Translating case law in Hong Kong	King-kui Sin et.al.	2501.09444	null
2025-01-16	ADAGE: A generic two-layer framework for adaptive agent based modelling	Benjamin Patrick Evans et.al.	2501.09429	null
2025-01-16	AutoCBT: An Autonomous Multi-agent Framework for Cognitive Behavioral Therapy in Psychological Counseling	Ancheng Xu et.al.	2501.09426	null
2025-01-16	Agent-Based Simulation of a Perpetual Futures Market	Ramshreyas Rao et.al.	2501.09404	null
2025-01-15	Personality Modeling for Persuasion of Misinformation using AI Agent	Qianmin Lou et.al.	2501.08985	null
2025-01-15	Physical AI Agents: Integrating Cognitive Intelligence with Real-World Action	Fouad Bousetouane et.al.	2501.08944	null
2025-01-15	A Reinforcement Learning Approach to Quiet and Safe UAM Traffic Management	Surya Murthy et.al.	2501.08941	null
2025-01-15	Disentangling Exploration of Large Language Models by Optimal Exploitation	Tim Grams et.al.	2501.08925	null
2025-01-15	Leveraging Large Language Models as Knowledge-Driven Agents for Reliable Retrosynthesis Planning	Qinyu Ma et.al.	2501.08897	link
2025-01-15	Silent Abandonment in Text-Based Contact Centers: Identifying, Quantifying, and Mitigating its Operational Impacts	Antonio Castellanos et.al.	2501.08869	null
2025-01-15	The geometry of moral decision making	Roland M. Friedrich et.al.	2501.08865	null
2025-01-15	On the Dominance of Truth-Telling in Gradual Mechanisms	Wenqian Wang et.al.	2501.08802	null
2025-01-15	Networked Agents in the Dark: Team Value Learning under Partial Observability	Guilherme S. Varela et.al.	2501.08778	null
2025-01-15	Leveraging LLM Agents for Translating Network Configurations	Yunze Wei et.al.	2501.08760	null
2025-01-14	ADAM-1: AI and Bioinformatics for Alzheimer’s Detection and Microbiome-Clinical Data Integrations	Ziyuan Huang et.al.	2501.08324	null
2025-01-14	Using Gamified Experiments to Tame Complexity: the case of the Schelling Model of Segregation	Aleix Nicolás Olivé et.al.	2501.08280	null
2025-01-14	Addressing the sustainable AI trilemma: a case study on LLM agents and RAG	Hui Wu et.al.	2501.08262	link
2025-01-14	Engineering LLM Powered Multi-agent Framework for Autonomous CloudOps	Kannan Parthasarathy et.al.	2501.08243	null
2025-01-14	Dynamic Pricing in High-Speed Railways Using Multi-Agent Reinforcement Learning	Enrique Adrian Villarrubia-Martin et.al.	2501.08234	null
2025-01-14	ASTRID – An Automated and Scalable TRIaD for the Evaluation of RAG-based Clinical Question Answering Systems	Mohita Chowdhury et.al.	2501.08208	null
2025-01-14	An Elementary Microscopic Model of Sympatric Speciation	Franco Bagnoli et.al.	2501.08130	null
2025-01-14	Hybrid Action Based Reinforcement Learning for Multi-Objective Compatible Autonomous Driving	Guizhe Jin et.al.	2501.08096	null
2025-01-14	AgentPose: Progressive Distribution Alignment via Feature Agent for Human Pose Distillation	Feng Zhang et.al.	2501.08088	null
2025-01-14	CuAsmRL: Optimizing GPU SASS Schedules via Deep Reinforcement Learning	Guoliang He et.al.	2501.08071	link
2025-01-13	WebWalker: Benchmarking LLMs in Web Traversal	Jialong Wu et.al.	2501.07572	link
2025-01-13	SafeSwarm: Decentralized Safe RL for the Swarm of Drones Landing in Dense Crowds	Grik Tadevosyan et.al.	2501.07566	null
2025-01-13	SST-EM: Advanced Metrics for Evaluating Semantic, Spatial and Temporal Aspects in Video Editing	Varun Biyyala et.al.	2501.07554	link
2025-01-13	Evaluating Agent-based Program Repair at Google	Pat Rondon et.al.	2501.07531	null
2025-01-13	Improving DeFi Accessibility through Efficient Liquidity Provisioning with Deep Reinforcement Learning	Haonan Xu et.al.	2501.07508	null
2025-01-13	How low-cost AI universal approximators reshape market efficiency	Paolo Barucca et.al.	2501.07489	null
2025-01-13	SynthSoM: A synthetic intelligent multi-modal sensing-communication dataset for Synesthesia of Machines (SoM)	Xiang Cheng et.al.	2501.07459	link
2025-01-13	Understanding and Benchmarking Artificial Intelligence: OpenAI’s o3 Is Not AGI	Rolf Pfister et.al.	2501.07458	null
2025-01-13	Online inductive learning from answer sets for efficient reinforcement learning exploration	Celeste Veronese et.al.	2501.07445	null
2025-01-13	Attention when you need	Lokesh Boominathan et.al.	2501.07440	null
2025-01-10	PEACE: Empowering Geologic Map Holistic Understanding with MLLMs	Yangyu Huang et.al.	2501.06184	null
2025-01-10	A Mixed-Integer Conic Program for the Multi-Agent Moving-Target Traveling Salesman Problem	Allen George Philip et.al.	2501.06130	null
2025-01-10	Finite-Horizon Single-Pull Restless Bandits: An Efficient Index Policy For Scarce Resource Allocation	Guojun Xiong et.al.	2501.06103	null
2025-01-10	Learning Flexible Heterogeneous Coordination with Capability-Aware Shared Hypernetworks	Kevin Fu et.al.	2501.06058	link
2025-01-10	Investigating the Impact of Observation Space Design Choices On Training Reinforcement Learning Solutions for Spacecraft Problems	Nathaniel Hamilton et.al.	2501.06016	null
2025-01-10	Enhanced Acoustic Beamforming with Sub-Aperture Angular Multiply and Sum – in vivo and in Human Demonstration	Matthieu Toulemonde et.al.	2501.05837	null
2025-01-10	CognoSpeak: an automatic, remote assessment of early cognitive decline in real-world conversational speech	Madhurananda Pahar et.al.	2501.05755	null
2025-01-10	Semantic Mapping in Indoor Embodied AI – A Comprehensive Survey and Future Directions	Sonia Raychaudhuri et.al.	2501.05750	null
2025-01-10	How to Enable Effective Cooperation Between Humans and NLP Models: A Survey of Principles, Formalizations, and Beyond	Chen Huang et.al.	2501.05714	null
2025-01-10	Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains	Vighnesh Subramaniam et.al.	2501.05707	null
2025-01-09	Search-o1: Agentic Search-Enhanced Large Reasoning Models	Xiaoxi Li et.al.	2501.05366	link
2025-01-09	Control of Overpopulated Tails in Kinetic Epidemic Models	Mattia Zanella et.al.	2501.05365	null
2025-01-09	A Path Variant of the Explorer Director Game on Graphs	Abigail Raz et.al.	2501.05364	null
2025-01-09	On Corrigibility and Alignment in Multi Agent Games	Edmund Dable-Heath et.al.	2501.05360	null
2025-01-09	A learning agent-based approach to the characterization of open quantum systems	Lorenzo Fioroni et.al.	2501.05350	null
2025-01-09	The Bakers and Millers Game with Restricted Locations	Simon Krogmann et.al.	2501.05334	null
2025-01-09	Knowledge Transfer in Model-Based Reinforcement Learning Agents for Efficient Multi-Task Learning	Dmytro Kuzmenko et.al.	2501.05329	null
2025-01-09	Contrast-Free Myocardial Scar Segmentation in Cine MRI using Motion and Texture Fusion	Guang Yang et.al.	2501.05241	null
2025-01-09	CoDe: Communication Delay-Tolerant Multi-Agent Collaboration via Dual Alignment of Intent and Timeliness	Shoucheng Song et.al.	2501.05207	null
2025-01-09	Emergence of human-like polarization among large language model agents	Jinghua Piao et.al.	2501.05171	null
2025-01-08	RadGPT: Constructing 3D Image-Text Tumor Datasets	Pedro R. A. S. Bassi et.al.	2501.04678	link
2025-01-08	InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection	Yuhang Liu et.al.	2501.04575	link
2025-01-08	The importance of being discrete – An agent-based model for active nematics and more	Mathieu Dedenon et.al.	2501.04559	null
2025-01-08	Approximately EFX and PO Allocations for Bivalued Chores	Zehan Lin et.al.	2501.04550	null
2025-01-08	Cyber-Physical Steganography in Robotic Motion Control	Ching-Chun Chang et.al.	2501.04541	null
2025-01-08	Safe Reinforcement Learning with Minimal Supervision	Alexander Quessy et.al.	2501.04481	null
2025-01-08	Hybrid Artificial Intelligence Strategies for Drone Navigation	Rubén San-Segundo et.al.	2501.04472	null
2025-01-08	A Digital Shadow for Modeling, Studying and Preventing Urban Crime	Juan Palma-Borda et.al.	2501.04435	null
2025-01-08	User Simulation in the Era of Generative AI: User Modeling, Synthetic Data Generation, and System Evaluation	Krisztian Balog et.al.	2501.04410	null
2025-01-08	Agent Laboratory: Using LLM Agents as Research Assistants	Samuel Schmidgall et.al.	2501.04227	null
2025-01-07	Kinetic theory of decentralized learning for smart active matter	Gerhard Jung et.al.	2501.03948	null
2025-01-07	Implicit Coordination using Active Epistemic Inference	Lauren Bramblett et.al.	2501.03907	null
2025-01-07	Truthful mechanisms for linear bandit games with private contexts	Yiting Hu et.al.	2501.03865	null
2025-01-07	Rendezfood: A Design Case Study of a Conversational Location-based Approach in Restaurants	Philip Weber et.al.	2501.03862	null
2025-01-07	Run-and-tumble chemotaxis using reinforcement learning	Ramesh Pramanik et.al.	2501.03687	null
2025-01-07	The Textbook of Tomorrow: Rethinking Course Material Interfacing in the Era of GPT	Audrey Olson et.al.	2501.03618	null
2025-01-07	Distributed Observer for Descriptor Linear System: The Luenberger Observer Method	Shuai Liu et.al.	2501.03564	null
2025-01-07	Rethinking Adversarial Attacks in Reinforcement Learning from Policy Distribution Perspective	Tianyang Duan et.al.	2501.03562	null
2025-01-07	FgC2F-UDiff: Frequency-guided and Coarse-to-fine Unified Diffusion Model for Multi-modality Missing MRI Synthesis	Xiaojiao Xiao et.al.	2501.03526	link
2025-01-07	A Unified Attack Detection Strategy for Multi-Agent Systems over Transient and Steady Stages	Jinming Gao et.al.	2501.03496	null
2025-01-06	Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation	Yuhui Zhang et.al.	2501.03225	link
2025-01-06	Turn-based Multi-Agent Reinforcement Learning Model Checking	Dennis Gross et.al.	2501.03187	null
2025-01-06	Deep-Relative-Trust-Based Diffusion for Decentralized Deep Learning	Muyun Li et.al.	2501.03162	null
2025-01-06	Large language models for artificial general intelligence (AGI): A survey of foundational principles and approaches	Alhassan Mumuni et.al.	2501.03151	null
2025-01-06	Probably Correct Optimal Stable Matching for Two-Sided Markets Under Uncertainty	Andreas Athanasopoulos et.al.	2501.03018	link
2025-01-06	Approximating N-Player Nash Equilibrium through Gradient Descent	Dongge Wang et.al.	2501.03001	null
2025-01-06	CALM: Curiosity-Driven Auditing for Large Language Models	Xiang Zheng et.al.	2501.02997	link
2025-01-06	CAMP: Collaborative Attention Model with Profiles for Vehicle Routing Problems	Chuanbo Hua et.al.	2501.02977	link
2025-01-06	Revisiting Communication Efficiency in Multi-Agent Reinforcement Learning from the Dimensional Analysis Perspective	Chuxiong Sun et.al.	2501.02888	null
2025-01-06	A Novel Vision Transformer for Camera-LiDAR Fusion based Traffic Object Segmentation	Toomas Tahves et.al.	2501.02858	null
2025-01-03	QuArch: A Question-Answering Dataset for AI Agents in Computer Architecture	Shvetank Prakash et.al.	2501.01892	null
2025-01-03	Multi-Agent Conversational Online Learning for Adaptive LLM Response Identification	Xiangxiang Dai et.al.	2501.01849	link
2025-01-03	MoColl: Agent-Based Specific and General Model Collaboration for Image Captioning	Pu Yang et.al.	2501.01834	null
2025-01-03	SDPO: Segment-Level Direct Preference Optimization for Social Agents	Aobo Kong et.al.	2501.01821	link
2025-01-03	Distributed Framework Construction for Affine Formation Control	Huiming Li et.al.	2501.01817	null
2025-01-03	Laparoscopic Scene Analysis for Intraoperative Visualisation of Gamma Probe Signals in Minimally Invasive Cancer Surgery	Baoru Huang et.al.	2501.01752	null
2025-01-03	Proposing Hierarchical Goal-Conditioned Policy Planning in Multi-Goal Reinforcement Learning	Gavin B. Rens et.al.	2501.01727	null
2025-01-03	AgentRefine: Enhancing Agent Generalization through Refinement Tuning	Dayuan Fu et.al.	2501.01702	null
2025-01-03	The (Exact) Price of Cardinality for Indivisible Goods: A Parametric Perspective	Alexander Lam et.al.	2501.01660	null
2025-01-03	PSYCHE: A Multi-faceted Patient Simulation Framework for Evaluation of Psychiatric Assessment Conversational Agents	Jingoo Lee et.al.	2501.01594	null
2025-01-02	Optimal Strategy Revision in Population Games: A Mean Field Game Theory Perspective	Julian Barreiro-Gomez et.al.	2501.01389	null
2025-01-02	PIMAEX: Multi-Agent Exploration through Peer Incentivization	Michael Kölle et.al.	2501.01266	null
2025-01-02	Face-Human-Bench: A Comprehensive Benchmark of Face and Human Understanding for Multi-modal Assistants	Lixiong Qin et.al.	2501.01243	null
2025-01-02	From Interaction to Attitude: Exploring the Impact of Human-AI Cooperation on Mental Illness Stigma	Tianqi Song et.al.	2501.01220	null
2025-01-02	D-HAT: a Diatom-inspired structure for a Helmet concept Against Trauma	Ludovico Musenich et.al.	2501.01211	null
2025-01-02	Harnessing Multi-Agent LLMs for Complex Engineering Problem-Solving: A Framework for Senior Design Projects	Abdullah Mushtaq et.al.	2501.01205	null
2025-01-02	3D-LLaVA: Towards Generalist 3D LMMs with Omni Superpoint Transformer	Jiajun Deng et.al.	2501.01163	null
2025-01-02	A3: Android Agent Arena for Mobile GUI Agents	Yuxiang Chai et.al.	2501.01149	null
2025-01-02	Embodied AI-Enhanced Vehicular Networks: An Integrated Large Language Models and Reinforcement Learning Method	Ruichen Zhang et.al.	2501.01141	null
2025-01-02	Communicating Unexpectedness for Out-of-Distribution Multi-Agent Reinforcement Learning	Min Whoo Lee et.al.	2501.01140	null
2024-12-30	Distributed Mixture-of-Agents for Edge Inference with Large Language Models	Purbesh Mitra et.al.	2412.21200	link
2024-12-30	Aviary: training language agents on challenging scientific tasks	Siddharth Narayanan et.al.	2412.21154	link
2024-12-30	Training Software Engineering Agents and Verifiers with SWE-Gym	Jiayi Pan et.al.	2412.21139	link
2024-12-30	Positional information trade-offs in boundary-driven reaction-diffusion systems	Jonas Berx et.al.	2412.21113	null
2024-12-30	Exploring and Controlling Diversity in LLM-Agent Conversation	KuanChao Chu et.al.	2412.21102	null
2024-12-30	Advances in Multi-agent Reinforcement Learning: Persistent Autonomy and Robot Learning Lab Report 2024	Reza Azadeh et.al.	2412.21088	null
2024-12-30	Privacy-Aware Multi-Device Cooperative Edge Inference with Distributed Resource Bidding	Wenhao Zhuang et.al.	2412.21069	null
2024-12-30	Plancraft: an evaluation dataset for planning with LLM agents	Gautier Dagan et.al.	2412.21033	link
2024-12-30	UnrealZoo: Enriching Photo-realistic Virtual Worlds for Embodied AI	Fangwei Zhong et.al.	2412.20977	null
2024-12-31	SecBench: A Comprehensive Multi-Dimensional Benchmarking Dataset for LLMs in Cybersecurity	Pengfei Jing et.al.	2412.20787	null
2024-12-27	Bottom-up robust modeling for the foraging behavior of Physarum polycephalum	Damiano Reginato et.al.	2412.19790	null
2024-12-27	Fortran2CPP: Automating Fortran-to-C++ Migration using LLMs via Multi-Turn Dialogue and Dual-Agent Integration	Le Chen et.al.	2412.19770	link
2024-12-27	Can Large Language Models Adapt to Other Agents In-Context?	Matthew Riemer et.al.	2412.19726	null
2024-12-27	OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis	Qiushi Sun et.al.	2412.19723	null
2024-12-27	The Value of Recall in Extensive-Form Games	Ratip Emin Berker et.al.	2412.19659	null
2024-12-27	Xmodel-2 Technical Report	Wang Qun et.al.	2412.19638	link
2024-12-27	Bidding Games on Markov Decision Processes with Quantitative Reachability Objectives	Guy Avni et.al.	2412.19609	null
2024-12-27	Hindsight Planner: A Closed-Loop Few-Shot Planner for Embodied Instruction Following	Yuxiao Yang et.al.	2412.19562	null
2024-12-27	Quantiles under ambiguity and risk sharing	Peng Liu et.al.	2412.19546	null
2024-12-27	TARGA: Targeted Synthetic Data Generation for Practical Reasoning over Structured Data	Xiang Huang et.al.	2412.19544	link
2024-12-24	Decentralized Intelligence in GameFi: Embodied AI Agents and the Convergence of DeFi and Virtual Ecosystems	Fernando Jia et.al.	2412.18601	link
2024-12-24	Automated Code Review In Practice	Umut Cihan et.al.	2412.18531	null
2024-12-24	Large Language Model guided Deep Reinforcement Learning for Decision Making in Autonomous Driving	Hao Pang et.al.	2412.18511	null
2024-12-24	Calibrating the Subjective	Mark Whitmeyer et.al.	2412.18486	null
2024-12-24	Multi-Agent Norm Perception and Induction in Distributed Healthcare	Chao Li et.al.	2412.18454	null
2024-12-24	3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene Understanding	Tatiana Zemskova et.al.	2412.18450	link
2024-12-24	GeAR: Graph-enhanced Agent for Retrieval-augmented Generation	Zhili Shen et.al.	2412.18431	null
2024-12-24	Explainable Multi-Modal Data Exploration in Natural Language via LLM Agent	Farhad Nooralahzadeh et.al.	2412.18428	link
2024-12-24	GUI Testing Arena: A Unified Benchmark for Advancing Autonomous GUI Testing Agent	Kangjia Zhao et.al.	2412.18426	null
2024-12-24	Muse: A Multimodal Conversational Recommendation Dataset with Scenario-Grounded User Profiles	Zihan Wang et.al.	2412.18416	null
2024-12-23	Observation Interference in Partially Observable Assistance Games	Scott Emmons et.al.	2412.17797	null
2024-12-23	ResearchTown: Simulator of Human Research Community	Haofei Yu et.al.	2412.17767	link
2024-12-23	Sensitivity Curve Maximization: Attacking Robust Aggregators in Distributed Learning	Christian A. Schroth et.al.	2412.17740	null
2024-12-23	Robin Hood Reachability Bidding Games	Shaull Almagor et.al.	2412.17718	null
2024-12-23	SMAC-Hard: Enabling Mixed Opponent Strategy Script and Self-play on SMAC	Yue Deng et.al.	2412.17707	link
2024-12-23	Large Language Model Safety: A Holistic Survey	Dan Shi et.al.	2412.17686	link
2024-12-23	Shape and Performance of Fastest Paths over Networks with Interacting Selfish Agents	Marco Cogoni et.al.	2412.17665	null
2024-12-23	CoSurfGS:Collaborative 3D Surface Gaussian Splatting with Distributed Learning for Large Scene Reconstruction	Yuanyuan Gao et.al.	2412.17612	null
2024-12-23	Fluid-Derived Lattices for Unbiased Modeling of Bacterial Colony Growth	Bryan Verhoef et.al.	2412.17604	null
2024-12-23	PC Agent: While You Sleep, AI Works – A Cognitive Journey into Digital World	Yanheng He et.al.	2412.17589	link
2024-12-20	Offline Reinforcement Learning for LLM Multi-Step Reasoning	Huaijie Wang et.al.	2412.16145	link
2024-12-20	Data-Driven Mechanism Design: Jointly Eliciting Preferences and Information	Dirk Bergemann et.al.	2412.16132	null
2024-12-20	Towards Interpretable Radiology Report Generation via Concept Bottlenecks using a Multi-Agentic RAG	Hasan Md Tusfiqur Alam et.al.	2412.16086	link
2024-12-20	Active Flow Control for Bluff Body under High Reynolds Number Turbulent Flow Conditions Using Deep Reinforcement Learning	Jingbo Chen et.al.	2412.15975	null
2024-12-20	The multilayer garbage disposal game	Hsin-Lun Li et.al.	2412.15942	null
2024-12-20	Speedup Techniques for Switchable Temporal Plan Graph Optimization	He Jiang et.al.	2412.15908	null
2024-12-20	Exploring the Effects of AI Nonverbal Emotional Cues on Human Decision Certainty in Moral Dilemmas	Chenyi Zhang et.al.	2412.15834	null
2024-12-20	WebLLM: A High-Performance In-Browser LLM Inference Engine	Charlie F. Ruan et.al.	2412.15803	link
2024-12-20	FTISS Adaptive Bearing-Only Formation Tracking Control with Unknown Disturbance Rejection	Hong Liang Cheah et.al.	2412.15757	null
2024-12-20	Online Optimization Algorithms in Repeated Price Competition: Equilibrium Learning and Algorithmic Collusion	Martin Bichler et.al.	2412.15707	null
2024-12-19	AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous Driving	Shuo Xing et.al.	2412.15206	link
2024-12-19	Human-Humanoid Robots Cross-Embodiment Behavior-Skill Transfer Using Decomposed Adversarial Learning from Demonstration	Junjia Liu et.al.	2412.15166	link
2024-12-19	Operationalising Rawlsian Ethics for Fairness in Norm-Learning Agents	Jessica Woodgate et.al.	2412.15163	null
2024-12-19	Equal Merit Does Not Imply Equality: Discrimination at Equilibrium in a Hiring Market with Symmetric Agents	Serafina Kamp et.al.	2412.15162	null
2024-12-19	Probabilistic Strategy Logic with Degrees of Observability	Chunyan Mu et.al.	2412.15135	null
2024-12-19	From Nonequilibrium to Equilibrium: Insights from a Two-Population Occupation Model	Jerome Garnier-Brun et.al.	2412.14996	null
2024-12-19	Dream to Manipulate: Compositional World Models Empowering Robot Imitation Learning with Imagination	Leonardo Barcellona et.al.	2412.14957	null
2024-12-19	Long Time Behavior and Stabilization for Displacement Monotone Mean Field Games	Marco Cirant et.al.	2412.14903	null
2024-12-19	Hierarchical Subspaces of Policies for Continual Offline Reinforcement Learning	Anthony Kobanda et.al.	2412.14865	null
2024-12-19	Entropy Regularized Task Representation Learning for Offline Meta-Reinforcement Learning	Mohammadreza nakhaei et.al.	2412.14834	link
2024-12-18	TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks	Frank F. Xu et.al.	2412.14161	link
2024-12-18	Future Research Avenues for Artificial Intelligence in Digital Gaming: An Exploratory Report	Markus Dablander et.al.	2412.14085	null
2024-12-18	A Computationally Grounded Framework for Cognitive Attitudes (extended version)	Tiago de Lima et.al.	2412.14073	null
2024-12-18	Spatio-Temporal SIR Model of Pandemic Spread During Warfare with Optimal Dual-use Healthcare System Administration using Deep Reinforcement Learning	Adi Shuchami et.al.	2412.14039	link
2024-12-18	Decentralized Convergence to Equilibrium Prices in Trading Networks	Edwin Lock et.al.	2412.13972	null
2024-12-18	Threshold UCT: Cost-Constrained Monte Carlo Tree Search with Pareto Curves	Martin Kurečka et.al.	2412.13962	null
2024-12-18	Harvesting energy from turbulent winds with Reinforcement Learning	Lorenzo Basile et.al.	2412.13961	null
2024-12-18	Towards privacy-preserving cooperative control via encrypted distributed optimization	Philipp Binfet et.al.	2412.13953	null
2024-12-18	Strategyproof Matching of Roommates and Rooms	Hadi Hosseini et.al.	2412.13887	null
2024-12-18	Who Saves us From Risk? Altruists Promote Cooperation in a Public Investment Game	Shen Zhang et.al.	2412.13816	null
2024-12-17	Proposer-Agent-Evaluator(PAE): Autonomous Skill Discovery For Foundation Model Internet Agents	Yifei Zhou et.al.	2412.13194	null
2024-12-17	GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding	Haoyi Jiang et.al.	2412.13193	link
2024-12-17	SafeAgentBench: A Benchmark for Safe Task Planning of Embodied LLM Agents	Sheng Yin et.al.	2412.13178	link
2024-12-17	Practicable Black-box Evasion Attacks on Link Prediction in Dynamic Graphs – A Graph Sequential Embedding Method	Jiate Li et.al.	2412.13134	link
2024-12-17	Contract-based Design and Verification of Multi-Agent Systems with Quantitative Temporal Requirements	Rafael Dewes et.al.	2412.13114	null
2024-12-17	Active Reinforcement Learning Strategies for Offline Policy Improvement	Ambedkar Dukkipati et.al.	2412.13106	null
2024-12-17	AI PERSONA: Towards Life-long Personalization of LLMs	Tiannan Wang et.al.	2412.13103	null
2024-12-17	Reservoir Computing for Fast, Simplified Reinforcement Learning on Memory Tasks	Kevin McKee et.al.	2412.13093	null
2024-12-17	Distributed Normal Map-based Stochastic Proximal Gradient Methods over Networks	Kun Huang et.al.	2412.13054	null
2024-12-18	NAVCON: A Cognitively Inspired and Linguistically Grounded Corpus for Vision and Language Navigation	Karan Wanchoo et.al.	2412.13026	null
2024-12-16	Revelations: A Decidable Class of POMDPs with Omega-Regular Objectives	Marius Belly et.al.	2412.12063	link
2024-12-16	Virtual Agent-Based Communication Skills Training to Facilitate Health Persuasion Among Peers	Farnaz Nouraei et.al.	2412.12061	null
2024-12-16	Learning to Navigate in Mazes with Novel Layouts using Abstract Top-down Maps	Linfeng Zhao et.al.	2412.12024	null
2024-12-16	Agentic AI-Driven Technical Troubleshooting for Enterprise Systems: A Novel Weighted Retrieval-Augmented Generation Paradigm	Rajat Khanda et.al.	2412.12006	null
2024-12-16	CP-Guard: Malicious Agent Detection and Defense in Collaborative Bird’s Eye View Perception	Senkang Hu et.al.	2412.12000	null
2024-12-16	AlphaZero Neural Scaling and Zipf’s Law: a Tale of Board Games and Power Laws	Oren Neumann et.al.	2412.11979	link
2024-12-16	Learning Human-Aware Robot Policies for Adaptive Assistance	Jason Qin et.al.	2412.11913	null
2024-12-16	Reentrant phase behavior in binary topological flocks with nonreciprocal alignment	Tian Tang et.al.	2412.11871	null
2024-12-16	The Black Ninjas and the Sniper: On Robustness of Population Protocols	Benno Lossin et.al.	2412.11783	null
2024-12-16	Prediction of social dilemmas in networked populations via graph neural networks	Huaiyu Tan et.al.	2412.11775	null
2024-12-13	Iris: Breaking GUI Complexity with Adaptive Focus and Self-Refining	Zhiqi Ge et.al.	2412.10342	null
2024-12-13	Reciprocity in Interbank Markets	Lutz Honvehlmann et.al.	2412.10329	null
2024-12-13	*MeshA: Efficient Path Planing With Motion Primitives**	Marat Agranovskiy et.al.	2412.10320	null
2024-12-13	BrushEdit: All-In-One Image Inpainting and Editing	Yaowei Li et.al.	2412.10316	null
2024-12-13	Cultural Evolution of Cooperation among LLM Agents	Aron Vallinder et.al.	2412.10270	null
2024-12-13	ROUTE: Robust Multitask Tuning and Collaboration for Text-to-SQL	Yang Qin et.al.	2412.10138	link
2024-12-13	You Name It, I Run It: An LLM Agent to Execute Tests of Arbitrary Projects	Islem Bouzenia et.al.	2412.10133	link
2024-12-13	Reward Machine Inference for Robotic Manipulation	Mattijs Baert et.al.	2412.10096	null
2024-12-13	Heterogeneous Multi-Robot Graph Coverage with Proximity and Movement Constraints	Dolev Mutzari et.al.	2412.10083	null
2024-12-13	Large Action Models: From Inception to Implementation	Lu Wang et.al.	2412.10047	link
2024-12-12	GenEx: Generating an Explorable World	Taiming Lu et.al.	2412.09624	null
2024-12-12	AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials	Yiheng Xu et.al.	2412.09605	null
2024-12-12	DiverseAgentEntropy: Quantifying Black-Box LLM Uncertainty through Diverse Perspectives and Multi-Agent Interaction	Yu Feng et.al.	2412.09572	null
2024-12-12	Can Modern LLMs Act as Agent Cores in Radiology~Environments?	Qiaoyu Zheng et.al.	2412.09529	link
2024-12-12	Agent-based Video Trimming	Lingfeng Yang et.al.	2412.09513	null
2024-12-12	Solving Multiagent Path Finding on Highly Centralized Networks	Foivos Fioravantes et.al.	2412.09433	null
2024-12-12	From Intention To Implementation: Automating Biomedical Research via LLMs	Yi Luo et.al.	2412.09429	null
2024-12-12	Reinforcement Learning Within the Classical Robotics Stack: A Case Study in Robot Soccer	Adam Labiosa et.al.	2412.09417	null
2024-12-12	Uncommon Belief in Rationality	Qi Shi et.al.	2412.09407	null
2024-12-12	Falcon-UI: Understanding GUI Before Following User Instructions	Huawen Shen et.al.	2412.09362	null
2024-12-11	GPD-1: Generative Pre-training for Driving	Zixun Xie et.al.	2412.08643	link
2024-12-11	Generative Semantic Communication: Architectures, Technologies, and Applications	Jinke Ren et.al.	2412.08642	null
2024-12-11	RoomTour3D: Geometry-Aware Video-Instruction Tuning for Embodied Navigation	Mingfei Han et.al.	2412.08591	null
2024-12-11	Automated Soap Opera Testing Directed by LLMs and Scenario Knowledge: Feasibility, Challenges, and Road Ahead	Yanqi Su et.al.	2412.08581	null
2024-12-11	GenPlan: Generative sequence models as adaptive planners	Akash Karthikeyan et.al.	2412.08565	link
2024-12-11	An End-to-End Collaborative Learning Approach for Connected Autonomous Vehicles in Occluded Scenarios	Leandro Parada et.al.	2412.08562	null
2024-12-11	Exact Algorithms for Multiagent Path Finding with Communication Constraints on Tree-Like Structures	Foivos Fioravantes et.al.	2412.08556	null
2024-12-11	Grimm: A Plug-and-Play Perturbation Rectifier for Graph Neural Networks Defending against Poisoning Attacks	Ao Liu et.al.	2412.08555	null
2024-12-11	MaestroMotif: Skill Design from Artificial Intelligence Feedback	Martin Klissarov et.al.	2412.08542	null
2024-12-11	Spatial segregation across travelling fronts in individual-based and continuum models for the growth of heterogeneous cell populations	José A. Carrillo et.al.	2412.08535	null
2024-12-10	Balancing Mobility Behaviors to avoid Global epidemics from Local Outbreaks	Pablo Valgañón et.al.	2412.07656	null
2024-12-10	Searching for Structure: Investigating Emergent Communication with Large Language Models	Tom Kouwenhoven et.al.	2412.07646	null
2024-12-10	Offline Multi-Agent Reinforcement Learning via In-Sample Sequential Policy Optimization	Zongkai Liu et.al.	2412.07639	link
2024-12-10	Swarm Behavior Cloning	Jonas Nüßlein et.al.	2412.07617	null
2024-12-10	Modeling Speculative Trading Patterns in Token Markets: An Agent-Based Analysis with TokenLab	Mengjue Wang et.al.	2412.07512	null
2024-12-10	ConfigX: Modular Configuration for Evolutionary Algorithms via Multitask Reinforcement Learning	Hongshu Guo et.al.	2412.07507	null
2024-12-10	SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World	Jiaqi Zhang et.al.	2412.07472	link
2024-12-10	Event-Triggered Memory Control for Interval Type-2 Fuzzy Heterogeneous Multi-Agent Systems	Sen Kong et.al.	2412.07471	null
2024-12-10	Dynamic Ensemble Reasoning for LLM Experts	Jinwu Hu et.al.	2412.07448	null
2024-12-10	ITPNet: Towards Instantaneous Trajectory Prediction for Autonomous Driving	Rongqing Li et.al.	2412.07369	null
2024-12-09	Proactive Agents for Multi-Turn Text-to-Image Generation Under Uncertainty	Meera Hahn et.al.	2412.06771	link
2024-12-09	AutoDCWorkflow: LLM-based Data Cleaning Workflow Auto-Generation and Benchmark	Lan Li et.al.	2412.06724	link
2024-12-09	Asynchronous Agents with Perfect Recall: Model Reductions, Knowledge-Based Construction, and Model Checking for Coalitional Strategies	Dilian Gurov et.al.	2412.06706	null
2024-12-09	Toward LLM-Agent-Based Modeling of Transportation Systems: A Conceptual Framework	Tianming Liu et.al.	2412.06681	null
2024-12-09	Self-Interested Agents in Collaborative Learning: An Incentivized Adaptive Data-Centric Framework	Nithia Vijayan et.al.	2412.06597	null
2024-12-09	Argentine ants regulate traffic flow with stopped individuals	Ulrich Dobramysl et.al.	2412.06587	null
2024-12-09	Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation	Egor Cherepanov et.al.	2412.06531	null
2024-12-09	EFX Allocations on Some Multi-graph Classes	Umang Bhaskar et.al.	2412.06513	null
2024-12-09	The Fusion of Large Language Models and Formal Methods for Trustworthy AI Agents: A Roadmap	Yedi Zhang et.al.	2412.06512	null
2024-12-09	Reasoning about Strategic Abilities in Stochastic Multi-agent Systems	Yedi Zhang et.al.	2412.06509	null
2024-12-06	TeamCraft: A Benchmark for Multi-Modal Multi-Agent Systems in Minecraft	Qian Long et.al.	2412.05255	link
2024-12-06	AI’s assigned gender affects human-AI cooperation	Sepideh Bazazi et.al.	2412.05214	null
2024-12-06	SurgBox: Agent-Driven Operating Room Sandbox with Surgery Copilot	Jinlin Wu et.al.	2412.05187	link
2024-12-06	Sense and Sensitivity: Evaluating the simulation of social dynamics via Large Language Models	Da Ju et.al.	2412.05093	null
2024-12-06	Synchronization and desynchronization in ensembles of mobile agents	E. M. Varvarin et.al.	2412.05040	null
2024-12-06	Frontier Models are Capable of In-context Scheming	Alexander Meinke et.al.	2412.04984	null
2024-12-06	Putting the Iterative Training of Decision Trees to the Test on a Real-World Robotic Task	Raphael C. Engelhardt et.al.	2412.04974	null
2024-12-06	Who Speaks Next? Multi-party AI Discussion Leveraging the Systematics of Turn-taking in Murder Mystery Games	Ryota Nonomura et.al.	2412.04937	link
2024-12-06	Probing the contents of semantic representations from text, behavior, and brain data using the psychNorms metabase	Zak Hussain et.al.	2412.04936	link
2024-12-06	PERCY: A Multimodal Dataset and Conversational System for Personalized and Emotionally Aware Human-Robot Interaction	Mohammed Althubyani et.al.	2412.04908	null
2024-12-05	Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction	Yiheng Xu et.al.	2412.04454	null
2024-12-05	GenMAC: Compositional Text-to-Video Generation with Multi-Agent Collaboration	Kaiyi Huang et.al.	2412.04440	null
2024-12-05	Sub-diffraction Imaging of Carrier Dynamics in Halide Perovskite Semiconductors: Effects of Passivation, Morphology, and Ion Motion	Madeleine D. Breshears et.al.	2412.04423	null
2024-12-05	Targeting the Core: A Simple and Effective Method to Attack RAG-based Agents via Direct LLM Manipulation	Xuying Li et.al.	2412.04415	null
2024-12-05	EmbodiedOcc: Embodied 3D Occupancy Prediction for Vision-based Online Scene Understanding	Yuqi Wu et.al.	2412.04380	link
2024-12-05	Intersection-Aware Assessment of EMS Accessibility in NYC: A Data-Driven Approach	Haoran Su et.al.	2412.04369	null
2024-12-05	Finer Behavioral Foundation Models via Auto-Regressive Features and Advantage Weighting	Edoardo Cetin et.al.	2412.04368	null
2024-12-05	Machine Theory of Mind for Autonomous Cyber-Defence	Luke Swaby et.al.	2412.04367	null
2024-12-05	Reinforcement Learning for Freeway Lane-Change Regulation via Connected Vehicles	Ke Sun et.al.	2412.04341	null
2024-12-05	Action Mapping for Reinforcement Learning in Continuous Environments with Constraints	Mirco Theile et.al.	2412.04327	null
2024-12-04	Navigation World Models	Amir Bar et.al.	2412.03572	null
2024-12-04	From Individual to Society: A Survey on Social Simulation Driven by Large Language Model-based Agents	Xinyi Mou et.al.	2412.03563	link
2024-12-04	Categorize and randomize: a model of sequential stochastic choice	Ester Sudano et.al.	2412.03554	null
2024-12-04	SPICE: Smart Projection Interface for Cooking Enhancement	Vera Prohaska et.al.	2412.03551	link
2024-12-04	Risk-aware Classification via Uncertainty Quantification	Murat Sensoy et.al.	2412.03391	null
2024-12-04	WiS Platform: Enhancing Evaluation of LLM-Based Multi-Agent Systems Through Game-Based Analysis	Chengwei Hu et.al.	2412.03359	null
2024-12-04	AI-Driven Day-to-Day Route Choice	Leizhen Wang et.al.	2412.03338	link
2024-12-04	Mean-field Concentration of Opinion Dynamics in Random Graphs	Javiera Gutiérrez-Ramírez et.al.	2412.03207	null
2024-12-04	AffordDP: Generalizable Diffusion Policy with Transferable Affordance	Shijie Wu et.al.	2412.03142	null
2024-12-04	ChatTS: Aligning Time Series with LLMs via Synthetic Data for Enhanced Understanding and Reasoning	Zhe Xie et.al.	2412.03104	link
2024-12-03	Leveraging Tactile Sensing to Render both Haptic Feedback and Virtual Reality 3D Object Reconstruction in Robotic Telemanipulation	Gabriele Giudici et.al.	2412.02644	null
2024-12-03	Mobile Cell-Free Massive MIMO with Multi-Agent Reinforcement Learning: A Scalable Framework	Ziheng Liu et.al.	2412.02581	null
2024-12-03	Generating Critical Scenarios for Testing Automated Driving Systems	Trung-Hieu Nguyen et.al.	2412.02574	link
2024-12-03	TAB-Fields: A Maximum Entropy Framework for Mission-Aware Adversarial Planning	Gokul Puthumanaillam et.al.	2412.02570	link
2024-12-03	Defending Against Diverse Attacks in Federated Learning Through Consensus-Based Bi-Level Optimization	Nicolás García Trillos et.al.	2412.02535	link
2024-12-03	General Resetting Theory for Group Avoidance	Juhee Lee et.al.	2412.02524	null
2024-12-03	Resonance: Learning to Predict Social-Aware Pedestrian Trajectories as Co-Vibrations	Conghao Wong et.al.	2412.02447	null
2024-12-03	A Multi-Agent Framework for Extensible Structured Text Generation in PLCs	Donghao Yang et.al.	2412.02410	null
2024-12-03	Who Walks With You Matters: Perceiving Social Interactions with Groups for Pedestrian Trajectory Prediction	Ziqian Zou et.al.	2412.02395	null
2024-12-03	Bio-inspired visual relative localization for large swarms of UAVs	Martin Křížek et.al.	2412.02393	null
2024-11-29	EF1 Allocations for Identical Trilean and Separable Single-Peaked Valuations	Umang Bhaskar et.al.	2411.19881	null
2024-11-29	Neuroplasticity and Psychedelics: a comprehensive examination of classic and non-classic compounds in pre and clinical models	Claudio Agnorelli et.al.	2411.19840	null
2024-11-29	Advanced System Integration: Analyzing OpenAPI Chunking for Retrieval-Augmented Generation	Robin D. Pesl et.al.	2411.19804	null
2024-11-29	CAREL: Instruction-guided reinforcement learning with cross-modal auxiliary objectives	Armin Saghafian et.al.	2411.19787	link
2024-11-29	The 2024 Motile Active Matter Roadmap	Gerhard Gompper et.al.	2411.19783	null
2024-11-29	HVAC-DPT: A Decision Pretrained Transformer for HVAC Control	Anaïs Berkes et.al.	2411.19746	null
2024-11-29	Relative Representations of Latent Spaces enable Efficient Semantic Channel Equalization	Tomás Hüttebräucker et.al.	2411.19719	null
2024-11-29	RMIO: A Model-Based MARL Framework for Scenarios with Observation Loss in Some Agents	Shi Zifeng et.al.	2411.19639	null
2024-11-29	Build An Influential Bot In Social Media Simulations With Large Language Models	Bailu Jin et.al.	2411.19635	null
2024-11-29	Solving Rubik’s Cube Without Tricky Sampling	Yicheng Lin et.al.	2411.19583	null
2024-11-27	Proactive Gradient Conflict Mitigation in Multi-Task Learning: A Sparse Training Perspective	Zhi Zhang et.al.	2411.18615	null
2024-11-27	Robust Offline Reinforcement Learning with Linearly Structured $f$ -Divergence Regularization	Cheng Tang et.al.	2411.18612	null
2024-11-27	AdaVLN: Towards Visual Language Navigation in Continuous Indoor Environments with Moving Humans	Dillon Loh et.al.	2411.18539	link
2024-11-27	Biswas-Chatterjee-Sen kinetic exchange opinion model for two connected groups	Krzysztof Suchecki et.al.	2411.18527	null
2024-11-27	NeuroAI for AI Safety	Patrick Mineault et.al.	2411.18526	null
2024-11-27	Collective decision making by embodied neural agents	Nicolas Coucke et.al.	2411.18498	link
2024-11-27	Is my Meeting Summary Good? Estimating Quality with a Multi-LLM Evaluator	Frederic Kirstein et.al.	2411.18444	null
2024-11-28	A Multi-Agent Dual Dialogue System to Support Mental Health Care Providers	Onno P. Kampman et.al.	2411.18429	null
2024-11-27	Application of Soft Actor-Critic Algorithms in Optimizing Wastewater Treatment with Time Delays Integration	Esmaeel Mohammadi et.al.	2411.18305	null
2024-11-27	InterHub: A Naturalistic Trajectory Dataset with Dense Interaction for Autonomous Driving	Xiyan Jiang et.al.	2411.18302	link
2024-11-26	SketchAgent: Language-Driven Sequential Sketch Generation	Yael Vinker et.al.	2411.17673	null
2024-11-26	MALMM: Multi-Agent Large Language Models for Zero-Shot Robotics Manipulation	Harsh Singh et.al.	2411.17636	null
2024-11-26	Making History Readable	Bipasha Banerjee et.al.	2411.17600	null
2024-11-26	Agentic AI for Improving Precision in Identifying Contributions to Sustainable Development Goals	William A. Ingram et.al.	2411.17598	null
2024-11-26	Decision making in stochastic extensive form II: Stochastic extensive forms and games	E. Emanuel Rapsch et.al.	2411.17587	null
2024-11-26	Multi-Objective Reinforcement Learning for Automated Resilient Cyber Defence	Ross O’Driscoll et.al.	2411.17585	null
2024-11-26	Ensuring Safety in Target Pursuit Control: A CBF-Safe Reinforcement Learning Approach	Yaosheng Deng et.al.	2411.17552	null
2024-11-26	ShowUI: One Vision-Language-Action Model for GUI Visual Agent	Kevin Qinghong Lin et.al.	2411.17465	link
2024-11-26	Object-centric proto-symbolic behavioural reasoning from pixels	Ruben van Bergen et.al.	2411.17438	link
2024-11-26	Joint Combinatorial Node Selection and Resource Allocations in the Lightning Network using Attention-based Reinforcement Learning	Mahdi Salahshour et.al.	2411.17353	null
2024-11-25	Winning opinion: Following Your Friends’ Advice or That of Their Friends?	Francisco J. Muñoz et.al.	2411.16671	null
2024-11-25	Barriers on the EDGE: A scalable CBF architecture over EDGE for safe aerial-ground multi-agent coordination	Viswa Narayanan Sankaranarayanan et.al.	2411.16608	null
2024-11-25	Naive Algorithmic Collusion: When Do Bandit Learners Cooperate and When Do They Compete?	Connor Douglas et.al.	2411.16574	null
2024-11-25	Continual Deep Reinforcement Learning with Task-Agnostic Policy Distillation	Muhammad Burhan Hafez et.al.	2411.16532	link
2024-11-25	Reinforcement Learning for Bidding Strategy Optimization in Day-Ahead Energy Market	Luca Di Persio et.al.	2411.16519	null
2024-11-25	Online Guidance Graph Optimization for Lifelong Multi-Agent Path Finding	Hongzhi Zang et.al.	2411.16506	link
2024-11-25	Distributed Online Optimization with Stochastic Agent Availability	Juliette Achddou et.al.	2411.16477	null
2024-11-25	Generating social networks with static and dynamic utility-maximization approaches	Aldric Labarthe et.al.	2411.16464	link
2024-11-25	Characterized Diffusion Networks for Enhanced Autonomous Driving Trajectory Prediction	Haoming Li et.al.	2411.16457	null
2024-11-25	TopV-Nav: Unlocking the Top-View Spatial Reasoning Potential of MLLM for Zero-shot Object Navigation	Linqing Zhong et.al.	2411.16425	null
2024-11-22	RE-Bench: Evaluating frontier AI R&D capabilities of language model agents against human experts	Hjalmar Wijk et.al.	2411.15114	link
2024-11-22	XGrammar: Flexible and Efficient Structured Generation Engine for Large Language Models	Yixin Dong et.al.	2411.15100	null
2024-11-22	On Multi-Agent Inverse Reinforcement Learning	Till Freihaut et.al.	2411.15046	null
2024-11-22	Safe Multi-Agent Reinforcement Learning with Convergence to Generalized Nash Equilibrium	Zeyang Li et.al.	2411.15036	null
2024-11-22	On the Linear Speedup of Personalized Federated Reinforcement Learning with Shared Representations	Guojun Xiong et.al.	2411.15014	null
2024-11-22	ScribeAgent: Towards Specialized Web Agents Using Production-Scale Workflow Data	Junhong Shen et.al.	2411.15004	link
2024-11-22	Free Energy Projective Simulation (FEPS): Active inference with interpretability	Joséphine Pazem et.al.	2411.14991	null
2024-11-22	BIP3D: Bridging 2D Images and 3D Perception for Embodied Intelligence	Xuewu Lin et.al.	2411.14869	link
2024-11-22	Universal and Context-Independent Triggers for Precise Control of LLM Outputs	Jiashuo Liang et.al.	2411.14738	null
2024-11-22	Enhancing Clinical Trial Patient Matching through Knowledge Augmentation with Multi-Agents	Hanwen Shi et.al.	2411.14637	null
2024-11-21	Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models	Yuhao Dong et.al.	2411.14432	link
2024-11-21	Multi-Agent Environments for Vehicle Routing Problems	Ricardo Gama et.al.	2411.14411	link
2024-11-21	Resolving Multiple-Dynamic Model Uncertainty in Hypothesis-Driven Belief-MDPs	Ofer Dagan et.al.	2411.14404	null
2024-11-21	SplatR : Experience Goal Visual Rearrangement with 3D Gaussian Splatting and Dense Feature Matching	Arjun P S et.al.	2411.14322	link
2024-11-21	Q-CSM: Q-Learning-based Cognitive Service Management in Heterogeneous IoT Networks	Kubra Duran et.al.	2411.14281	null
2024-11-21	Explainable Multi-Agent Reinforcement Learning for Extended Reality Codec Adaptation	Pedro Enrique Iturria-Rivera et.al.	2411.14264	null
2024-11-21	Physics-Informed LLM-Agent for Automated Modulation Design in Power Electronics Systems	Junhua Liu et.al.	2411.14214	null
2024-11-21	SPARKLE: A Unified Single-Loop Primal-Dual Framework for Decentralized Bilevel Optimization	Shuchen Zhu et.al.	2411.14166	null
2024-11-21	Multi-terminal Strong Coordination subject to Secrecy Constraints	Viswanathan Ramachandran et.al.	2411.14123	null
2024-11-21	Umbrella Reinforcement Learning – computationally efficient tool for hard non-linear problems	Egor E. Nuzhin et.al.	2411.14117	link
2024-11-20	BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games	Davide Paglieri et.al.	2411.13543	null
2024-11-20	Metacognition for Unknown Situations and Environments (MUSE)	Rodolfo Valiente et.al.	2411.13537	null
2024-11-20	AdaptAgent: Adapting Multimodal Web Agents with Few-Shot Learning from Human Demonstrations	Gaurav Verma et.al.	2411.13451	null
2024-11-20	Robust Monocular Visual Odometry using Curriculum Learning	Assaf Lahiany et.al.	2411.13438	null
2024-11-20	A Survey On Enhancing Reinforcement Learning in Complex Environments: Insights from Human and LLM Feedback	Alireza Rashidi Laleh et.al.	2411.13410	null
2024-11-20	Simulating Liquidity: Agent-Based Modeling of Illiquid Markets for Fractional Ownership	Lars Fluri et.al.	2411.13381	null
2024-11-20	WHALES: A Multi-agent Scheduling Dataset for Enhanced Cooperation in Autonomous Driving	Siwei Chen et.al.	2411.13340	link
2024-11-20	Revealed Information	Laura Doval et.al.	2411.13293	null
2024-11-20	Transforming the Hybrid Cloud for Emerging AI Workloads	Deming Chen et.al.	2411.13239	null
2024-11-20	Extremum and Nash Equilibrium Seeking with Delays and PDEs: Designs & Applications	Tiago Roux Oliveira et.al.	2411.13234	null
2024-11-19	Reinforcement Learning, Collusion, and the Folk Theorem	Galit Askenazi-Golan et.al.	2411.12725	null
2024-11-19	UBSoft: A Simulation Platform for Robotic Skill Learning in Unbounded Soft Environments	Chunru Lin et.al.	2411.12711	null
2024-11-19	Weighted Envy Freeness With Limited Subsidies	Noga Klein Elmalem et.al.	2411.12696	null
2024-11-19	Quasi-stability notions in two-sided matching models	Nadia Guiñazú et.al.	2411.12533	null
2024-11-19	Coevolution of relationship-driven cooperation under recommendation protocol on multiplex networks	Hongyu Yue et.al.	2411.12436	null
2024-11-19	Instrumentation of Software Systems with OpenTelemetry for Software Visualization	Malte Hansen et.al.	2411.12380	null
2024-11-19	C $^{2}$ INet: Realizing Incremental Trajectory Prediction with Prior-Aware Continual Causal Intervention	Xiaohe Li et.al.	2411.12313	null
2024-11-19	SNN-Based Online Learning of Concepts and Action Laws in an Open World	Christel Grimaud et.al.	2411.12308	null
2024-11-19	Emergence of Implicit World Models from Mortal Agents	Kazuya Horibe et.al.	2411.12304	null
2024-11-19	Could Humans Outshine AI in Visual Data Analysis?	Ratanond Koonchanok et.al.	2411.12299	null
2024-11-18	Generative World Explorer	Taiming Lu et.al.	2411.11844	null
2024-11-18	Reinterpreting Delay and Procrastination	Conrad Kosowsky et.al.	2411.11828	null
2024-11-18	Competing Bandits in Decentralized Large Contextual Matching Markets	Satush Parikh et.al.	2411.11794	null
2024-11-18	LLM-IE: A Python Package for Generative Information Extraction with Large Language Models	Enshuo Hsu et.al.	2411.11779	null
2024-11-18	Mapping out the Space of Human Feedback for Reinforcement Learning: A Conceptual Framework	Yannick Metz et.al.	2411.11761	null
2024-11-18	The Power of Many: Multi-Agent Multimodal Models for Cultural Image Captioning	Longju Bai et.al.	2411.11758	link
2024-11-18	Distributed Asynchronous Time-Varying Quadratic Programming with Asynchronous Objective Sampling	Gabriel Behrendt et.al.	2411.11732	null
2024-11-18	Moral Persuasion in Large Language Models: Evaluating Susceptibility and Ethical Alignment	Allison Huang et.al.	2411.11731	link
2024-11-18	TrojanRobot: Backdoor Attacks Against Robotic Manipulation in the Physical World	Xianlong Wang et.al.	2411.11683	null
2024-11-18	Artificial Scientific Discovery	Antonio Norelli et.al.	2411.11672	null
2024-11-15	Fair Division via the Cake-Cutting Share	Yannan Bai et.al.	2411.10434	null
2024-11-15	Evaluating Creativity and Deception in Large Language Models: A Simulation Framework for Multi-Agent Balderdash	Parsa Hejabi et.al.	2411.10422	link
2024-11-15	The Dawn of GUI Agent: A Preliminary Case Study with Claude 3.5 Computer Use	Siyuan Hu et.al.	2411.10323	link
2024-11-15	Static network structure cannot stabilize cooperation among Large Language Model agents	Jin Han et.al.	2411.10294	null
2024-11-15	Towards Sample-Efficiency and Generalization of Transfer and Inverse Reinforcement Learning: A Comprehensive Literature Review	Hossein Hassani et.al.	2411.10268	null
2024-11-15	Visual-Linguistic Agent: Towards Collaborative Contextual Object Reasoning	Jingru Yang et.al.	2411.10252	null
2024-11-15	An Empirical Study on LLM-based Agents for Automated Bug Fixing	Xiangxin Meng et.al.	2411.10213	null
2024-11-15	Agentic LLMs in the Supply Chain: Towards Autonomous Multi-Agent Consensus-Seeking	Valeria Jannelli et.al.	2411.10184	null
2024-11-15	Let people fail! Exploring the influence of explainable virtual and robotic agents in learning-by-doing tasks	Marco Matarese et.al.	2411.10176	null
2024-11-15	The Surprising Ineffectiveness of Pre-Trained Visual Representations for Model-Based Reinforcement Learning	Moritz Schneider et.al.	2411.10175	null
2024-11-14	Nash equilibrium seeking for a class of quadratic-bilinear Wasserstein distributionally robust games	Georgios Pantazis et.al.	2411.09636	null
2024-11-14	Navigating the Risks: A Survey of Security, Privacy, and Ethics Threats in LLM-Based Agents	Yuyou Gan et.al.	2411.09523	null
2024-11-14	Randomized Truthful Auctions with Learning Agents	Gagan Aggarwal et.al.	2411.09517	null
2024-11-14	Strategic Sacrifice: Self-Organized Robot Swarm Localization for Inspection Productivity	Sneha Ramshanker et.al.	2411.09493	null
2024-11-14	Socio-Economic Consequences of Generative AI: A Review of Methodological Approaches	Carlos J. Costa et.al.	2411.09313	null
2024-11-14	Embedding Space Allocation with Angle-Norm Joint Classifiers for Few-Shot Class-Incremental Learning	Dunwei Tu et.al.	2411.09250	null
2024-11-14	Risk-aware MPPI for Stochastic Hybrid Systems	Hardik Parwana et.al.	2411.09198	link
2024-11-14	Enhancing reinforcement learning for population setpoint tracking in co-cultures	Sebastián Espinel-Ríos et.al.	2411.09177	null
2024-11-14	Artificial Theory of Mind and Self-Guided Social Organisation	Michael S. Harré et.al.	2411.09169	null
2024-11-14	Theory of Mind Enhances Collective Intelligence	Michael S. Harré et.al.	2411.09168	null
2024-11-13	The Impact of Social Value Orientation on Nash Equilibria of Two Player Quadratic Games	Dan Calderone et.al.	2411.08809	null
2024-11-13	FinRobot: AI Agent for Equity Research and Valuation with Large Language Models	Tianyu Zhou et.al.	2411.08804	link
2024-11-13	Evaluating World Models with LLM for Decision Making	Chang Yang et.al.	2411.08794	null
2024-11-13	Towards Fair and Efficient Public Transportation: A Bus Stop Model	Martin Bullinger et.al.	2411.08784	link
2024-11-13	Logic-based Knowledge Awareness for Autonomous Agents in Continuous Spaces	Arabinda Ghosh et.al.	2411.08754	null
2024-11-13	Statistical Operating Characteristics of Current Early Phase Dose Finding Designs with Toxicity and Efficacy in Oncology	Hao Sun et.al.	2411.08698	null
2024-11-13	Inferring Parameter Distributions in Heterogeneous Motile Particle Ensembles: A Likelihood Approach for Second Order Langevin Models	Jan Albrecht et.al.	2411.08692	null
2024-11-13	Robot See, Robot Do: Imitation Reward for Noisy Financial Environments	Sven Goluža et.al.	2411.08637	null
2024-11-13	On the Application of Model Predictive Control to a Weighted Coverage Path Planning Problem	Kilian Schweppe et.al.	2411.08634	null
2024-11-13	NavAgent: Multi-scale Urban Street View Fusion For UAV Embodied Vision-and-Language Navigation	Youzhi Liu et.al.	2411.08579	null
2024-11-12	LLMPhy: Complex Physical Reasoning Using Large Language Models and World Models	Anoop Cherian et.al.	2411.08027	null
2024-11-12	Incentive Design with Spillovers	Krishna Dasaratha et.al.	2411.08026	null
2024-11-12	From General to Specific: Utilizing General Hallucation to Automatically Measure the Role Relationship Fidelity for Specific Role-Play Agents	Chuyi Kong et.al.	2411.07965	null
2024-11-12	Learning Memory Mechanisms for Decision Making through Demonstrations	William Yue et.al.	2411.07954	link
2024-11-12	RedCode: Risky Code Execution and Generation Benchmark for Code Agents	Chengquan Guo et.al.	2411.07781	link
2024-11-12	Efficiency of energy-consuming random walkers: Variability in energy helps	Mohsen Ghasemi Nezhadhaghighi et.al.	2411.07771	null
2024-11-12	Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows	Fangyu Lei et.al.	2411.07763	null
2024-11-12	Test Where Decisions Matter: Importance-driven Testing for Deep Reinforcement Learning	Stefan Pranger et.al.	2411.07700	null
2024-11-12	World Models: The Safety Perspective	Zifan Zeng et.al.	2411.07690	null
2024-11-12	Safe Exploitative Play with Untrusted Type Beliefs	Tongxin Li et.al.	2411.07679	null
2024-11-11	Tooling or Not Tooling? The Impact of Tools on Language Agents for Chemistry Problem Solving	Botao Yu et.al.	2411.07228	null
2024-11-11	Grounding Video Models to Actions through Goal Conditioned Exploration	Yunhao Luo et.al.	2411.07223	null
2024-11-11	‘Explaining RL Decisions with Trajectories’: A Reproducibility Study	Karim Abdel Sadek et.al.	2411.07200	link
2024-11-11	Gradual Fine-Tuning with Graph Routing for Multi-Source Unsupervised Domain Adaptation	Yao Ma et.al.	2411.07185	null
2024-11-11	RoundTable: Investigating Group Decision-Making Mechanism in Multi-Agent Collaboration	Young-Min Cho et.al.	2411.07161	null
2024-11-11	Azurin-Based Peptide p28 Arrests the p53-HDM2 Interactions: A Novel Anti-Cancer Pathway	Albin Joy et.al.	2411.07124	null
2024-11-11	Learning Multi-Agent Collaborative Manipulation for Long-Horizon Quadrupedal Pushing	Chuye Hong et.al.	2411.07104	null
2024-11-11	Bounded Rationality Equilibrium Learning in Mean Field Games	Yannick Eich et.al.	2411.07099	link
2024-11-11	A Multi-Agent Approach for REST API Testing with Semantic Graphs and LLM-Driven Inputs	Myeongsoo Kim et.al.	2411.07098	null
2024-11-11	Differentially-Private Collaborative Online Personalized Mean Estimation	Yauhen Yakimenka et.al.	2411.07094	null
2024-11-08	Topology-aware Reinforcement Feature Space Reconstruction for Graph Data	Wangyang Ying et.al.	2411.05742	null
2024-11-08	A Retrospective on the Robot Air Hockey Challenge: Benchmarking Robust, Reliable, and Safe Learning Techniques for Real-world Robotics	Puze Liu et.al.	2411.05718	null
2024-11-08	Settling the Complexity of Popularity in Additively Separable and Fractional Hedonic Games	Martin Bullinger et.al.	2411.05713	null
2024-11-08	Data-Driven Distributed Common Operational Picture from Heterogeneous Platforms using Multi-Agent Reinforcement Learning	Indranil Sur et.al.	2411.05683	null
2024-11-08	The influence of persona and conversational task on social interactions with a LLM-controlled embodied conversational agent	Leon O. H. Kroczek et.al.	2411.05653	null
2024-11-08	LightVA: Lightweight Visual Analytics with LLM Agent-Based Task Planning and Execution	Yuheng Zhao et.al.	2411.05651	null
2024-11-08	Expectation vs. Reality: Towards Verification of Psychological Games	Marta Kwiatkowska et.al.	2411.05599	null
2024-11-08	Smart navigation through a rotating barrier: Deep reinforcement learning with application to size-based separation of active microagents	Mohammad Hossein Masoudi et.al.	2411.05587	null
2024-11-08	Tangled Program Graphs as an alternative to DRL-based control algorithms for UAVs	Hubert Szolc et.al.	2411.05586	link
2024-11-08	Parameterized Voter Relevance in Facility Location Games with Tree-Shaped Invitation Graphs	Ryoto Ando et.al.	2411.05574	null
2024-11-07	Few-Shot Task Learning through Inverse Generative Modeling	Aviv Netanyahu et.al.	2411.04987	null
2024-11-07	Noisy Zero-Shot Coordination: Breaking The Common Knowledge Assumption In Zero-Shot Coordination Games	Usman Anwar et.al.	2411.04976	link
2024-11-07	StoryAgent: Customized Storytelling Video Generation via Multi-Agent Collaboration	Panwen Hu et.al.	2411.04925	null
2024-11-07	OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models	Siming Huang et.al.	2411.04905	null
2024-11-07	Achieving superconductivity in infinite-layer nickelate thin films by aluminum sputtering deposition	Dongxin Zhang et.al.	2411.04896	null
2024-11-07	GUI Agents with Foundation Models: A Comprehensive Survey	Shuai Wang et.al.	2411.04890	null
2024-11-07	Think Smart, Act SMARL! Analyzing Probabilistic Logic Driven Safety in Multi-Agent Reinforcement Learning	Satchit Chatterji et.al.	2411.04867	link
2024-11-07	Robust Regulation of Labour Contracts	Théo Durandard et.al.	2411.04841	null
2024-11-07	Plasticity Loss in Deep Reinforcement Learning: A Survey	Timo Klein et.al.	2411.04832	null
2024-11-07	MPVO: Motion-Prior based Visual Odometry for PointGoal Navigation	Sayan Paul et.al.	2411.04796	null
2024-11-06	Predicting and Publishing Accurate Imbalance Prices Using Monte Carlo Tree Search	Fabio Pavirani et.al.	2411.04011	null
2024-11-06	Temporal Network Creation Games: The Impact of Non-Locality and Terminals	Davide Bilò et.al.	2411.03973	null
2024-11-06	Almost Time-Optimal Loosely-Stabilizing Leader Election on Arbitrary Graphs Without Identifiers in Population Protocols	Haruki Kanaya et.al.	2411.03902	null
2024-11-06	AdaSociety: An Adaptive Environment with Social Structures for Multi-Agent Decision-Making	Yizhe Huang et.al.	2411.03865	link
2024-11-06	Beyond The Rainbow: High Performance Deep Reinforcement Learning On A Desktop PC	Tyler Clark et.al.	2411.03820	link
2024-11-06	From Novice to Expert: LLM Agent Policy Optimization via Step-wise Reinforcement Learning	Zhirui Deng et.al.	2411.03817	null
2024-11-06	MRJ-Agent: An Effective Jailbreak Agent for Multi-Round Dialogue	Fengxiang Wang et.al.	2411.03814	null
2024-11-06	Hybrid Transfer Reinforcement Learning: Provable Sample Efficiency from Shifted-Dynamics Data	Chengrui Qu et.al.	2411.03810	link
2024-11-06	Multi-Modal Intelligent Channel Modeling: A New Modeling Paradigm via Synesthesia of Machines	Lu Bai et.al.	2411.03711	null
2024-11-06	Learn to Slice, Slice to Learn: Unveiling Online Optimization and Reinforcement Learning for Slicing AI Services	Amr Abo-eleneen et.al.	2411.03686	null
2024-11-05	SMoA: Improving Multi-agent Large Language Models with Sparse Mixture-of-Agents	Dawei Li et.al.	2411.03284	link
2024-11-05	Causal Responsibility Attribution for Human-AI Collaboration	Yahang Qi et.al.	2411.03275	link
2024-11-05	Spontaneous Emergence of Agent Individuality through Social Interactions in LLM-Based Communities	Ryosuke Takata et.al.	2411.03252	null
2024-11-05	Troll Farms	Philipp Denter et.al.	2411.03241	null
2024-11-05	A resolved Lyman-Alpha profile with doubly peaked emission at z~7	C. Moya-Sierralta et.al.	2411.03222	null
2024-11-05	GIS Copilot: Towards an Autonomous GIS Agent for Spatial Analysis	Temitope Akinboyewa et.al.	2411.03205	link
2024-11-05	Online Data Collection for Efficient Semiparametric Inference	Shantanu Gupta et.al.	2411.03195	link
2024-11-05	Hierarchical Orchestra of Policies	Thomas P Cannon et.al.	2411.03008	null
2024-11-05	Accelerating Task Generalisation with Multi-Level Hierarchical Options	Thomas P Cannon et.al.	2411.02998	null
2024-11-05	Transformer-Based Fault-Tolerant Control for Fixed-Wing UAVs Using Knowledge Distillation and In-Context Adaptation	Francisco Giral et.al.	2411.02975	null
2024-11-04	Attacking Vision-Language Computer Agents via Pop-ups	Yanzhe Zhang et.al.	2411.02391	link
2024-11-04	Two-Sided Learning in Decentralized Matching Markets	Vade Shah et.al.	2411.02377	null
2024-11-04	Social-RAG: Retrieving from Group Interactions to Socially Ground Proactive AI Generation to Group Preferences	Ruotong Wang et.al.	2411.02353	null
2024-11-04	WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning	Zehan Qi et.al.	2411.02337	link
2024-11-04	CRMArena: Understanding the Capacity of LLM Agents to Perform Professional CRM Tasks in Realistic Environments	Kung-Hsiang Huang et.al.	2411.02305	link
2024-11-04	Kinetic exchange opinion dynamics for the battleground-states in the 2024 US presidential elections	Soumyajyoti Biswas et.al.	2411.02240	null
2024-11-04	Positive Experience Reflection for Agents in Interactive Text Environments	Philip Lippmann et.al.	2411.02223	null
2024-11-04	CryptoEL: A Novel Experiential Learning Tool for Enhancing K-12 Cryptography Education	Pranathi Rayavaram et.al.	2411.02143	null
2024-11-04	Foundations and Recent Trends in Multimodal Mobile Agents: A Survey	Biao Wu et.al.	2411.02006	link
2024-11-04	Taking AI Welfare Seriously	Robert Long et.al.	2411.00986	null
2024-10-31	Teaching Embodied Reinforcement Learning Agents: Informativeness and Diversity of Language Use	Jiajun Xi et.al.	2410.24218	link
2024-10-31	DexMimicGen: Automated Data Generation for Bimanual Dexterous Manipulation via Imitation Learning	Zhenyu Jiang et.al.	2410.24185	null
2024-10-31	Language-Driven Policy Distillation for Cooperative Driving in Multi-Agent Reinforcement Learning	Jiaqi Liu et.al.	2410.24152	null
2024-10-31	Q-learning for Quantile MDPs: A Decomposition, Performance, and Convergence Analysis	Jia Lin Hau et.al.	2410.24128	link
2024-10-31	Progressive Safeguards for Safe and Model-Agnostic Reinforcement Learning	Nabil Omi et.al.	2410.24096	null
2024-10-31	Navigating the Unknown: A Chat-Based Collaborative Interface for Personalized Exploratory Tasks	Yingzhe Peng et.al.	2410.24032	null
2024-10-31	AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents	Yifan Xu et.al.	2410.24024	link
2024-10-31	Optimal control problems driven by nonlinear degenerate Fokker-Planck equations	Francesca Anceschi et.al.	2410.24000	null
2024-10-31	Persuading a Credible Agent	Jiarui Gan et.al.	2410.23989	null
2024-10-31	Fair Division of Chores with Budget Constraints	Edith Elkind et.al.	2410.23979	null
2024-10-30	Proportional Fairness in Non-Centroid Clustering	Ioannis Caragiannis et.al.	2410.23273	null
2024-10-30	Evaluating Cultural and Social Awareness of LLM Web Agents	Haoyi Qiu et.al.	2410.23252	null
2024-10-30	Carrot and Stick: Eliciting Comparison Data and Beyond	Yiling Chen et.al.	2410.23243	null
2024-10-30	A little less conversation, a little more action, please: Investigating the physical common-sense of LLMs in a 3D embodied environment	Matteo G. Mecattaf et.al.	2410.23242	link
2024-10-31	Aligning Audio-Visual Joint Representations with an Agentic Workflow	Shentong Mo et.al.	2410.23230	null
2024-10-30	OS-ATLAS: A Foundation Action Model for Generalist GUI Agents	Zhiyong Wu et.al.	2410.23218	link
2024-10-30	Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks	Michael Matthews et.al.	2410.23208	link
2024-10-30	VisualPredicator: Learning Abstract World Models with Neuro-Symbolic Predicates for Robot Planning	Yichao Liang et.al.	2410.23156	null
2024-10-30	Fair Division with Market Values	Siddharth Barman et.al.	2410.23137	null
2024-10-30	First Place Solution to the ECCV 2024 ROAD++ Challenge @ ROAD++ Spatiotemporal Agent Detection 2024	Tengfei Zhang et.al.	2410.23077	null
2024-10-29	Environment as Policy: Learning to Race in Unseen Tracks	Hongze Wang et.al.	2410.22308	null
2024-10-29	Flow-DPO: Improving LLM Mathematical Reasoning through Online Multi-Agent Learning	Yihe Deng et.al.	2410.22304	null
2024-10-29	Fourier Head: Helping Large Language Models Learn Complex Probability Distributions	Nate Gillman et.al.	2410.22269	null
2024-10-29	RingSim- An Agent-based Approach for Modelling Mesoscopic Magnetic Nanowire Networks	Ian T Vidamour et.al.	2410.22204	null
2024-10-29	Democratizing Reward Design for Personal and Representative Value-Alignment	Carter Blair et.al.	2410.22203	null
2024-10-29	ADAM: An Embodied Causal Agent in Open-World Environments	Shu Yu et.al.	2410.22194	null
2024-10-29	EconoJax: A Fast & Scalable Economic Simulation in Jax	Koen Ponse et.al.	2410.22165	link
2024-10-29	Improving Performance of Commercially Available AI Products in a Multi-Agent Configuration	Cory Hymel et.al.	2410.22129	null
2024-10-29	Inverse Design Method with Enhanced Sampling for Complex Open Crystals: Application to Novel Zeolite Self-Assembly in a Coarse-Grained Model	Chaohong Wang et.al.	2410.22111	null
2024-10-29	An LLM-based Simulation Framework for Embodied Conversational Agents in Psychological Counseling	Lixiu Wu et.al.	2410.22041	link
2024-10-28	Capacity-Aware Planning and Scheduling in Budget-Constrained Monotonic MDPs: A Meta-RL Approach	Manav Vora et.al.	2410.21249	null
2024-10-28	Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines	Zhixin Zhang et.al.	2410.21220	link
2024-10-28	Magnetic Milli-spinner for Robotic Endovascular Surgery	Shuai Wu et.al.	2410.21112	null
2024-10-28	Dual-Agent Deep Reinforcement Learning for Dynamic Pricing and Replenishment	Yi Zheng et.al.	2410.21109	null
2024-10-28	LiGAR: LiDAR-Guided Hierarchical Transformer for Multi-Modal Group Activity Recognition	Naga Venkata Sai Raviteja Chappa et.al.	2410.21108	null
2024-10-28	Topological Identification of Agent Status in Information Contagions: Application to Financial Markets	Anubha Goel et.al.	2410.21104	link
2024-10-28	Automatic Generation of Benchmarks and Reliable LLM Judgment for Code Tasks	Eitan Farchi et.al.	2410.21071	null
2024-10-28	CRAT: A Multi-Agent Framework for Causality-Enhanced Reflective and Retrieval-Augmented Translation with Large Language Models	Meiqi Chen et.al.	2410.21067	null
2024-10-28	Getting By Goal Misgeneralization With a Little Help From a Mentor	Tu Trinh et.al.	2410.21052	null
2024-10-28	FairStream: Fair Multimedia Streaming Benchmark for Reinforcement Learning Agents	Jannis Weil et.al.	2410.21029	link
2024-10-25	FISHNET: Financial Intelligence from Sub-querying, Harmonizing, Neural-Conditioning, Expert Swarms, and Task Planning	Nicole Cho et.al.	2410.19727	null
2024-10-25	Evolving Neural Networks Reveal Emergent Collective Behavior from Minimal Agent Interactions	Guilherme S. Y. Giardini et.al.	2410.19718	null
2024-10-25	Adversarial Environment Design via Regret-Guided Diffusion Models	Hojun Chung et.al.	2410.19715	null
2024-10-25	Robust Thompson Sampling Algorithms Against Reward Poisoning Attacks	Yinglun Xu et.al.	2410.19705	null
2024-10-25	AGENT-CQ: Automatic Generation and Evaluation of Clarifying Questions for Conversational Search with LLMs	Clemencia Siro et.al.	2410.19692	null
2024-10-25	The Sound of Silence in Social Networks	Jesús Aranda et.al.	2410.19685	null
2024-10-25	Optimizing Hearthstone Agents using an Evolutionary Algorithm	Pablo García-Sánchez et.al.	2410.19681	link
2024-10-25	MetaTrading: An Immersion-Aware Model Trading Framework for Vehicular Metaverse Services	Hongjia Wu et.al.	2410.19665	null
2024-10-25	Planning-Aware Diffusion Networks for Enhanced Motion Forecasting in Autonomous Driving	Liu Yunhao et.al.	2410.19639	null
2024-10-25	Knowledge Graph Enhanced Language Agents for Recommendation	Taicheng Guo et.al.	2410.19627	null
2024-10-24	Learning to Look: Seeking Information for Decision Making via Policy Factorization	Shivin Dass et.al.	2410.18964	null
2024-10-24	OSCAR: Operating System Control via State-Aware Reasoning and Re-Planning	Xiaoqiang Wang et.al.	2410.18963	null
2024-10-24	Schema-Guided Culture-Aware Complex Event Simulation with Multi-Agent Role-Play	Sha Li et.al.	2410.18935	null
2024-10-24	SkillMimicGen: Automated Demonstration Generation for Efficient Skill Learning and Deployment	Caelan Garrett et.al.	2410.18907	null
2024-10-24	Improving Small-Scale Large Language Models Function Calling for Reasoning Tasks	Graziano A. Manduzio et.al.	2410.18890	null
2024-10-24	Learning Collusion in Episodic, Inventory-Constrained Markets	Paul Friedrich et.al.	2410.18871	link
2024-10-25	An LLM Agent for Automatic Geospatial Data Analysis	Yuxing Chen et.al.	2410.18792	null
2024-10-24	Applying Neural Monte Carlo Tree Search to Unsignalized Multi-intersection Scheduling for Autonomous Vehicles	Yucheng Shi et.al.	2410.18786	null
2024-10-24	Active Target Tracking Using Bearing-only Measurements With Gaussian Process Learning	Yingbo Fu et.al.	2410.18669	null
2024-10-24	Approximate EFX and Exact tEFX Allocations for Indivisible Chores: Improved Algorithms	Mahyar Afshinmehr et.al.	2410.18655	null
2024-10-23	Prioritized Generative Replay	Renhao Wang et.al.	2410.18082	null
2024-10-23	The Double-Edged Sword of Behavioral Responses in Strategic Classification: Theory and User Studies	Raman Ebrahimi et.al.	2410.18066	null
2024-10-23	SPIRE: Synergistic Planning, Imitation, and Reinforcement Learning for Long-Horizon Manipulation	Zihan Zhou et.al.	2410.18065	null
2024-10-23	A Comparative Assessment of Technology Acceptance and Learning Outcomes in Computer-based versus VR-based Pedagogical Agents	Aimilios Hadjiliasi et.al.	2410.18048	null
2024-10-23	GraphTeam: Facilitating Large Language Model-based Graph Analysis via Multi-Agent Collaboration	Xin Li et.al.	2410.18032	link
2024-10-23	MiniFed : Integrating LLM-based Agentic-Workflow for Simulating FOMC Meeting	Sungil Seok et.al.	2410.18012	null
2024-10-23	Dynamic models of gentrification	Giovanni Mauro et.al.	2410.18004	null
2024-10-23	POMDP-Driven Cognitive Massive MIMO Radar: Joint Target Detection-Tracking In Unknown Disturbances	Imad Bouhou et.al.	2410.17967	null
2024-10-23	On Regularity and Normalization in Sequential Screening	Ian Ball et.al.	2410.17962	null
2024-10-23	Guide for Defense (G4D): Dynamic Guidance for Robust and Balanced Defense in Large Language Models	He Cao et.al.	2410.17922	link
2024-10-22	SELA: Tree-Search Enhanced LLM Agents for Automated Machine Learning	Yizhou Chi et.al.	2410.17238	link
2024-10-22	Large Language Models Empowered Personalized Web Agents	Hongru Cai et.al.	2410.17236	null
2024-10-22	Responsibility in a Multi-Value Strategic Setting	Timothy Parker et.al.	2410.17229	null
2024-10-22	Scalable spectral representations for network multiagent control	Zhaolin Ren et.al.	2410.17221	null
2024-10-23	Non-myopic Generation of Language Model for Reasoning and Planning	Chang Ma et.al.	2410.17195	link
2024-10-22	DyPNIPP: Predicting Environment Dynamics for RL-based Robust Informative Path Planning	Srujan Deolasee et.al.	2410.17186	null
2024-10-22	Layered LA-MAPF: a decomposition of large agent MAPF instance to accelerate solving without compromising solvability	Zhuo Yao et.al.	2410.17160	link
2024-10-22	Mechanistic interplay between information spreading and opinion polarization	Kleber A. Oliveira et.al.	2410.17151	null
2024-10-22	Advancing lunar exploration through virtual reality simulations: a framework for future human missions	Giacomo Franchini et.al.	2410.17132	null
2024-10-22	Exploration and Persuasion	Aleksandrs Slivkins et.al.	2410.17086	null
2024-10-21	Agent-to-Sim: Learning Interactive Behavior Models from Casual Longitudinal Videos	Gengshan Yang et.al.	2410.16259	null
2024-10-21	IBGP: Imperfect Byzantine Generals Problem for Zero-Shot Robustness in Communicative Multi-Agent Systems	Yihuan Mao et.al.	2410.16237	null
2024-10-21	Sketch2Code: Evaluating Vision-Language Models for Interactive Web Design Prototyping	Ryan Li et.al.	2410.16232	null
2024-10-21	Role of obstacle softness in the diffusive behavior of active Particles	Ankit Gupta et.al.	2410.16223	null
2024-10-21	CoT-TL: Low-Resource Temporal Knowledge Representation of Planning Instructions Using Chain-of-Thought Reasoning	Kumar Manas et.al.	2410.16207	link
2024-10-22	LASER: Script Execution by Autonomous Agents for On-demand Traffic Simulation	Hao Gao et.al.	2410.16197	link
2024-10-21	Spiking Neural Networks as a Controller for Emergent Swarm Agents	Kevin Zhu et.al.	2410.16175	null
2024-10-21	A Troublemaker with Contagious Jailbreak Makes Chaos in Honest Towns	Tianyi Men et.al.	2410.16155	null
2024-10-21	AdChain: Decentralized Header Bidding	Behkish Nassirzadeh et.al.	2410.16141	null
2024-10-21	Constrained Truthful Obnoxious Two-Facility Location with Optional Preferences	Panagiotis Kanellopoulos et.al.	2410.16131	null
2024-10-18	Teaching Models to Balance Resisting and Accepting Persuasion	Elias Stengel-Eskin et.al.	2410.14596	link
2024-10-18	Toolshed: Scale Tool-Equipped Agents with Advanced RAG-Tool Fusion and Tool Knowledge Bases	Elias Lumer et.al.	2410.14594	null
2024-10-18	Temporal Fair Division of Indivisible Items	Edith Elkind et.al.	2410.14593	null
2024-10-18	Neuro-Symbolic Traders: Assessing the Wisdom of AI Crowds in Markets	Namid R. Stillman et.al.	2410.14587	null
2024-10-18	Neural Combinatorial Clustered Bandits for Recommendation Systems	Baran Atalar et.al.	2410.14586	null
2024-10-18	Do LLMs estimate uncertainty well in instruction-following?	Juyeon Heo et.al.	2410.14582	link
2024-10-18	When LLMs Go Online: The Emerging Threat of Web-Enabled LLMs	Hanna Kim et.al.	2410.14569	null
2024-10-18	RAG-ConfusionQA: A Benchmark for Evaluating LLMs on Confusing Questions	Zhiyuan Peng et.al.	2410.14567	link
2024-10-18	Performance bounds for multi-vehicle networks with local integrators	Jonas Hansson et.al.	2410.14525	null
2024-10-18	Do LLMs “know” internally when they follow instructions?	Juyeon Heo et.al.	2410.14516	link
2024-10-17	VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding	Runsen Xu et.al.	2410.13860	link
2024-10-17	AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agents	Ke Yang et.al.	2410.13825	null
2024-10-18	Harnessing Webpage UIs for Text-Rich Visual Understanding	Junpeng Liu et.al.	2410.13824	null
2024-10-17	Rapid and Automated Alloy Design with Graph Neural Network-Powered LLM-Driven Multi-Agent Systems	Alireza Ghafarollahi et.al.	2410.13768	null
2024-10-17	MobA: A Two-Level Agent System for Efficient Mobile Task Automation	Zichen Zhu et.al.	2410.13757	link
2024-10-17	Interacting humans and robots can improve sensory prediction by adapting their viscoelasticity	Xiaoxiao Cheng et.al.	2410.13755	null
2024-10-17	Real Eventual Exponential Positivity of Complex-valued Laplacians: Applications to Consensus in Multi-agent Systems	Aditi Saxena et.al.	2410.13700	null
2024-10-17	ORCHID: A Chinese Debate Corpus for Target-Independent Stance Detection and Argumentative Dialogue Summarization	Xiutian Zhao et.al.	2410.13667	link
2024-10-17	A Comparative Study on Reasoning Patterns of OpenAI’s o1 Model	Siwei Wu et.al.	2410.13639	link
2024-10-17	Phenotype structuring in collective cell migration:a tutorial of mathematical models and methods	Tommaso Lorenzi et.al.	2410.13629	null
2024-10-16	JudgeBench: A Benchmark for Evaluating LLM-based Judges	Sijun Tan et.al.	2410.12784	link
2024-10-16	Prophet Upper Bounds for Online Matching and Auctions	José Soto et.al.	2410.12756	null
2024-10-16	HEnRY: A Multi-Agent System Framework for Multi-Domain Contexts	Emmanuele Lacavalla et.al.	2410.12720	link
2024-10-16	A comparative analysis of metamodels for lumped cardiovascular models, and pipeline for sensitivity analysis, parameter estimation, and uncertainty quantification	John M. Hanna et.al.	2410.12654	null
2024-10-16	Hybrid Decision Making for Scalable Multi-Agent Navigation: Integrating Semantic Maps, Discrete Coordination, and Model Predictive Control	Koen de Vos et.al.	2410.12651	null
2024-10-16	Zeroth-Order Feedback Optimization in Multi-Agent Systems: Tackling Coupled Constraints	Yingpeng Duan et.al.	2410.12647	null
2024-10-16	Dynamic Learning Rate for Deep Reinforcement Learning: A Bandit Approach	Henrique Donâncio et.al.	2410.12598	null
2024-10-16	Robust RL with LLM-Driven Data Synthesis and Policy Adaptation for Autonomous Driving	Sihao Wu et.al.	2410.12568	null
2024-10-16	A Communication Consistent Approach to Signal Temporal Logic Task Decomposition in Multi-Agent Systems	Gregorio Marchesini et.al.	2410.12563	null
2024-10-16	Nash equilibria in scalar discrete-time linear quadratic games	Giulio Salizzoni et.al.	2410.12544	null
2024-10-15	Molecular Quantum Control Algorithm Design by Reinforcement Learning	Anastasia Pipi et.al.	2410.11839	null
2024-10-15	G-Designer: Architecting Multi-agent Communication Topologies via Graph Neural Networks	Guibin Zhang et.al.	2410.11782	null
2024-10-15	BlendRL: A Framework for Merging Symbolic and Neural Policy Learning	Hikaru Shindo et.al.	2410.11689	null
2024-10-15	Optimal Mediation Mechanisms in Bilateral Trade	Zhikang Fan et.al.	2410.11683	null
2024-10-15	Safety Filtering While Training: Improving the Performance and Sample Efficiency of Reinforcement Learning Agents	Federico Pizarro Bejarano et.al.	2410.11671	link
2024-10-15	Markov-Nash equilibria in mean-field games under model uncertainty	Johannes Langner et.al.	2410.11652	null
2024-10-15	Improve Value Estimation of Q Function and Reshape Reward with Monte Carlo Tree Search	Jiamian Li et.al.	2410.11642	null
2024-10-15	Findings of the WMT 2024 Shared Task on Chat Translation	Wafaa Mohammed et.al.	2410.11624	null
2024-10-15	Temporal Hyperproperties for Population Protocols	Nicolas Waldburger et.al.	2410.11572	null
2024-10-15	Demo: Testing AI-driven MAC Learning in Autonomic Networks	Leonard Paeleke et.al.	2410.11565	null
2024-10-14	AFlow: Automating Agentic Workflow Generation	Jiayi Zhang et.al.	2410.10762	link
2024-10-14	Denial-of-Service Poisoning Attacks against Large Language Models	Kuofeng Gao et.al.	2410.10760	link
2024-10-14	DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model	Yuqi Wang et.al.	2410.10738	null
2024-10-14	Online Statistical Inference for Time-varying Sample-averaged Q-learning	Saunak Kumar Panda et.al.	2410.10737	null
2024-10-14	Enhancing Robustness in Deep Reinforcement Learning: A Lyapunov Exponent Approach	Rory Young et.al.	2410.10674	null
2024-10-14	Transforming Game Play: A Comparative Study of DCQN and DTQN Architectures in Reinforcement Learning	William A. Stigall et.al.	2410.10660	null
2024-10-14	Intelligent prospector v2.0: exploration drill planning under epistemic model uncertainty	John Mern et.al.	2410.10610	null
2024-10-14	STACKFEED: Structured Textual Actor-Critic Knowledge Base Editing with FeedBack	Naman Gupta et.al.	2410.10584	null
2024-10-14	Consensus in Multiagent Systems with lack of connection	Mohamed Bentaibi et.al.	2410.10486	null
2024-10-14	Compositional Shielding and Reinforcement Learning for Multi-Agent Systems	Asger Horn Brorholt et.al.	2410.10460	null
2024-10-11	PEAR: A Robust and Flexible Automation Framework for Ptychography Enabled by Multiple Large Language Model Agents	Xiangyu Yin et.al.	2410.09034	link
2024-10-11	AgentHarm: A Benchmark for Measuring Harmfulness of LLM Agents	Maksym Andriushchenko et.al.	2410.09024	null
2024-10-11	From Interaction to Impact: Towards Safer AI Agents Through Understanding and Evaluating UI Operation Impacts	Zhuohao Jerry Zhang et.al.	2410.09006	null
2024-10-11	Cyclic jetting enables microbubble-mediated drug delivery	Marco Cattaneo et.al.	2410.08990	null
2024-10-11	Best-of-Both-Worlds Fairness of the Envy-Cycle-Elimination Algorithm	Jugal Garg et.al.	2410.08986	null
2024-10-11	Optimal Allocation with Peer Information	Axel Niemeyer et.al.	2410.08954	null
2024-10-11	Transferable Belief Model on Quantum Circuits	Qianli Zhou et.al.	2410.08949	null
2024-10-11	The Dynamics of Social Conventions in LLM populations: Spontaneous Emergence, Collective Biases and Tipping Points	Ariel Flint Ashery et.al.	2410.08948	null
2024-10-11	Hyperspectral fluorescence imaging using a high-speed silicon photomultiplier array	Chi Z. Huang et.al.	2410.08936	null
2024-10-11	MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL	Claas A Voelcker et.al.	2410.08896	null
2024-10-10	Agent S: An Open Agentic Framework that Uses Computers Like a Human	Saaket Agashe et.al.	2410.08164	link
2024-10-10	DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory	Yutong Wang et.al.	2410.08143	link
2024-10-10	SoundScape: A Human-AI Co-Creation System Making Your Memories Heard	Chongjun Zhong et.al.	2410.08136	null
2024-10-10	Steering Masked Discrete Diffusion Models via Discrete Denoising Posterior Prediction	Jarrid Rector-Brooks et.al.	2410.08134	null
2024-10-10	Mars: Situated Inductive Reasoning in an Open-World Environment	Xiaojuan Tang et.al.	2410.08126	null
2024-10-10	Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System	Weize Chen et.al.	2410.08115	null
2024-10-10	Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining	Tianyi Bai et.al.	2410.08102	link
2024-10-10	Agent-based modeling for realistic reproduction of human mobility and contact behavior to evaluate test and isolation strategies in epidemic infectious disease spread	David Kerkmann et.al.	2410.08050	link
2024-10-10	Strategic Classification With Externalities	Yiling Chen et.al.	2410.08032	null
2024-10-10	Probabilistic Satisfaction of Temporal Logic Constraints in Reinforcement Learning via Adaptive Policy-Switching	Xiaoshan Lin et.al.	2410.08022	null
2024-10-09	Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making	Manling Li et.al.	2410.07166	link
2024-10-09	Spatiotemporal Modeling and Forecasting at Scale with Dynamic Generalized Linear Models	Pranay Pherwani et.al.	2410.07161	null
2024-10-09	I Want to Break Free! Anti-Social Behavior and Persuasion Ability of LLMs in Multi-Agent Settings with Social Hierarchy	Gian Maria Campedelli et.al.	2410.07109	link
2024-10-09	Identifying and Addressing Delusions for Target-Directed Decision-Making	Mingde Zhao et.al.	2410.07096	link
2024-10-09	MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering	Jun Shern Chan et.al.	2410.07095	link
2024-10-10	Towards Realistic UAV Vision-Language Navigation: Platform, Benchmark, and Methodology	Xiangyu Wang et.al.	2410.07087	null
2024-10-09	MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses	Zonglin Yang et.al.	2410.07076	link
2024-10-09	Retrieval-Augmented Decision Transformer: External Memory for In-context RL	Thomas Schmied et.al.	2410.07071	link
2024-10-09	Mechanism Design for Exchange Markets	Yusen Zheng et.al.	2410.07023	null
2024-10-09	Seeker: Enhancing Exception Handling in Code with LLM-based Multi-Agent Approach	Xuanming Zhang et.al.	2410.06949	link
2024-10-07	Grounding Partially-Defined Events in Multimodal Data	Kate Sanders et.al.	2410.05267	null
2024-10-07	GLEE: A Unified Framework and Benchmark for Language-based Economic Environments	Eilam Shapira et.al.	2410.05254	link
2024-10-07	Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents	Boyu Gou et.al.	2410.05243	link
2024-10-07	Scalable and Accurate Graph Reasoning with LLM-based Multi-Agents	Yuwei Hu et.al.	2410.05130	null
2024-10-08	Last Iterate Convergence in Monotone Mean Field Games	Noboru Isobe et.al.	2410.05127	null
2024-10-07	ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery	Ziru Chen et.al.	2410.05080	null
2024-10-07	Extended Functional Representation Lemma: A Tool For Privacy, Semantic Representation, Caching, and Compression Design	Amirreza Zamani et.al.	2410.05033	null
2024-10-07	Active Fine-Tuning of Generalist Policies	Marco Bagatella et.al.	2410.05026	null
2024-10-07	Contest design with a finite type-space: A unifying approach	Andrzej Baranski et.al.	2410.04970	null
2024-10-07	Training Interactive Agent in Large FPS Game Map with Rule-enhanced Reinforcement Learning	Chen Zhang et.al.	2410.04936	null
2024-10-04	Open-World Reinforcement Learning over Long Short-Term Imagination	Jiajian Li et.al.	2410.03618	link
2024-10-04	Never Mind The No-Ops: Faster and Less Volatile Simulation Modelling of Co-Evolutionary Species Interactions via Spatial Cyclic Games	Dave Cliff et.al.	2410.03586	link
2024-10-04	Training on more Reachable Tasks for Generalisation in Reinforcement Learning	Max Weltevrede et.al.	2410.03565	null
2024-10-04	Steering Large Language Models between Code Execution and Textual Reasoning	Yongchao Chen et.al.	2410.03524	null
2024-10-04	Tournament versus Circulant: On Simulating 7-Species Evolutionary Spatial Cyclic Games with Ablated Predator-Prey Networks as Models of Biodiversity	Dave Cliff et.al.	2410.03518	link
2024-10-04	MO-DDN: A Coarse-to-Fine Attribute-based Exploration Agent for Multi-object Demand-driven Navigation	Hongcheng Wang et.al.	2410.03488	null
2024-10-04	VEDIT: Latent Prediction Architecture For Procedural Video Representation Learning	Han Lin et.al.	2410.03478	null
2024-10-04	MLLM as Retriever: Interactively Learning Multimodal Retrieval for Embodied Agents	Junpeng Yue et.al.	2410.03450	null
2024-10-04	Attainable Force Approximation and Full-Pose Tracking Control of an Over-Actuated Thrust-Vectoring Modular Team UAV	Yen-Cheng Chu et.al.	2410.03445	null
2024-10-04	ToolGen: Unified Tool Retrieval and Calling via Generation	Renxi Wang et.al.	2410.03439	link
2024-10-03	ReLIC: A Recipe for 64k Steps of In-Context Reinforcement Learning for Embodied AI	Ahmad Elawady et.al.	2410.02751	link
2024-10-03	Grounding Large Language Models In Embodied Environment With Imperfect World Models	Haolan Liu et.al.	2410.02742	null
2024-10-03	DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects	Zhaowei Wang et.al.	2410.02730	link
2024-10-03	Domain-Specific Retrieval-Augmented Generation Using Vector Stores, Knowledge Graphs, and Tensor Factorization	Ryan C. Barron et.al.	2410.02721	null
2024-10-03	Grounded Answers for Multi-agent Decision-making Problem through Generative World Model	Zeyang Liu et.al.	2410.02664	null
2024-10-03	Undesirable Memorization in Large Language Models: A Survey	Ali Satvaty et.al.	2410.02650	null
2024-10-04	Learning 3D Perception from Others’ Predictions	Jinsu Yoo et.al.	2410.02646	null
2024-10-03	Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based Agents	Hanrong Zhang et.al.	2410.02644	link
2024-10-03	Beyond Expected Returns: A Policy Gradient Algorithm for Cumulative Prospect Theoretic Reinforcement Learning	Olivier Lepel et.al.	2410.02605	null
2024-10-03	Agents’ Room: Narrative Generation through Multi-step Collaboration	Fantine Huot et.al.	2410.02603	link
2024-10-02	Windowed MAPF with Completeness Guarantees	Rishi Veerapaneni et.al.	2410.01798	null
2024-10-02	Open Human-Robot Collaboration using Decentralized Inverse Reinforcement Learning	Prasanth Sengadu Suresh et.al.	2410.01790	null
2024-10-02	Social coordination perpetuates stereotypic expectations and behaviors across generations in deep multi-agent reinforcement learning	Rebekah A. Gelpí et.al.	2410.01763	null
2024-10-02	PreND: Enhancing Intrinsic Motivation in Reinforcement Learning through Pre-trained Network Distillation	Mohammadamin Davoodabadi et.al.	2410.01745	null
2024-10-02	Mimicking Human Intuition: Cognitive Belief-Driven Q-Learning	Xingrui Gu et.al.	2410.01739	null
2024-10-02	Performant, Memory Efficient and Scalable Multi-Agent Reinforcement Learning	Omayma Mahjoub et.al.	2410.01706	null
2024-10-02	Stable Offline Value Function Learning with Bisimulation-based Representations	Brahma S. Pavse et.al.	2410.01643	null
2024-10-02	Moral Alignment for LLM Agents	Elizaveta Tennant et.al.	2410.01639	null
2024-10-02	Entropy-Based Uncertainty Modeling for Trajectory Prediction in Autonomous Driving	Aron Distelzweig et.al.	2410.01628	null
2024-10-02	Automated Red Teaming with GOAT: the Generative Offensive Agent Tester	Maya Pavlova et.al.	2410.01606	null
2024-09-30	LaMMA-P: Generalizable Multi-Agent Long-Horizon Task Allocation and Planning with LM-Driven PDDL Planner	Xiaopan Zhang et.al.	2409.20560	null
2024-09-30	Propose, Assess, Search: Harnessing LLMs for Goal-Oriented Planning in Instructional Videos	Md Mohaiminul Islam et.al.	2409.20557	null
2024-09-30	Direct Multipath-Based SLAM	Mingchao Liang et.al.	2409.20552	null
2024-09-30	COLLAGE: Collaborative Human-Agent Interaction Generation using Hierarchical Latent Diffusion and Language Models	Divyanshu Daiya et.al.	2409.20502	null
2024-09-30	Impartial Selection Under Combinatorial Constraints	Javier Cembrano et.al.	2409.20477	null
2024-09-30	Facility Location Games with Competitors	Cheng Peng et.al.	2409.20396	null
2024-09-30	Machine Learning-enabled Traffic Steering in O-RAN: A Case Study on Hierarchical Learning Approach	Md Arafat Habib et.al.	2409.20391	null
2024-09-30	Efficient Driving Behavior Narration and Reasoning on Edge Device Using Large Language Models	Yizhou Huang et.al.	2409.20364	null
2024-09-30	A mean field Jacobi process for modeling sustainable tourism	Hidekazu Yoshioka et.al.	2409.20347	null
2024-09-30	MARLadona – Towards Cooperative Team Play Using Multi-Agent Reinforcement Learning	Zichong Li et.al.	2409.20326	null
2024-09-27	Mean-Field Control Barrier Functions: A Framework for Real-Time Swarm Control	Samy Wu Fung et.al.	2409.18945	null
2024-09-27	Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models	Jiaming Li et.al.	2409.18943	link
2024-09-27	AIPatient: Simulating Patients with EHRs and LLM Powered Agentic Workflow	Huizi Yu et.al.	2409.18924	null
2024-09-27	Best Arm Identification with Minimal Regret	Junwen Yang et.al.	2409.18909	null
2024-09-27	Simulating Dynamic Tumor Contrast Enhancement in Breast MRI using Conditional Generative Adversarial Networks	Richard Osuala et.al.	2409.18872	link
2024-09-27	Safe Decentralized Multi-Agent Control using Black-Box Predictors, Conformal Decision Policies, and Control Barrier Functions	Sacha Huriot et.al.	2409.18862	null
2024-09-27	ARLBench: Flexible and Efficient Benchmarking for Hyperparameter Optimization in Reinforcement Learning	Jannis Becktepe et.al.	2409.18827	link
2024-09-27	Facility Location Problem with Aleatory Agents	Gennaro Auricchio et.al.	2409.18817	null
2024-09-27	Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-Source LLMs	Yanyuan Qiao et.al.	2409.18794	null
2024-09-27	Forecasting Macroeconomic Dynamics using a Calibrated Data-Driven Agent-based Model	Samuel Wiese et.al.	2409.18760	null
2024-09-26	StackGen: Generating Stable Structures from Silhouettes via Diffusion	Luzhe Sun et.al.	2409.18098	null
2024-09-26	Infer Human’s Intentions Before Following Natural Language Instructions	Yanming Wan et.al.	2409.18073	link
2024-09-27	Explaining Explaining	Sergei Nirenburg et.al.	2409.18052	null
2024-09-26	Inverse Reinforcement Learning with Multiple Planning Horizons	Jiayu Yao et.al.	2409.18051	null
2024-09-26	Revisit Anything: Visual Place Recognition via Image Segment Retrieval	Kartik Garg et.al.	2409.18049	link
2024-09-26	Reasoning Multi-Agent Behavioral Topology for Interactive Autonomous Driving	Haochen Liu et.al.	2409.18031	link
2024-09-26	Compositional Hardness of Code in Large Language Models – A Probabilistic Perspective	Yotam Wolf et.al.	2409.18028	null
2024-09-26	Control Industrial Automation System with Large Language Models	Yuchen Xia et.al.	2409.18009	link
2024-09-26	Distributed Invariant Unscented Kalman Filter based on Inverse Covariance Intersection with Intermittent Measurements	Zhian Ruan et.al.	2409.17997	null
2024-09-26	Nonparametric Inference Framework for Time-dependent Epidemic Models	Son Luu et.al.	2409.17968	null
2024-09-25	Turn Every Application into an Agent: Towards Efficient Human-Agent-Computer Interaction with API-First LLM-Based Agents	Junting Lu et.al.	2409.17140	null
2024-09-25	Collision-free time-optimal path parameterization for multi-robot teams	Katherine Mao et.al.	2409.17079	null
2024-09-25	AI-Driven Risk-Aware Scheduling for Active Debris Removal Missions	Antoine Poupon et.al.	2409.17012	null
2024-09-25	PitRSDNet: Predicting Intra-operative Remaining Surgery Duration in Endoscopic Pituitary Surgery	Anjana Wijekoon et.al.	2409.16998	null
2024-09-25	Tell Me What You Don’t Know: Enhancing Refusal Capabilities of Role-Playing Agents via Representation Space Analysis and Editing	Wenhao Liu et.al.	2409.16913	null
2024-09-25	A Roadmap for Embodied and Social Grounding in LLMs	Sara Incao et.al.	2409.16900	null
2024-09-25	Robotic Backchanneling in Online Conversation Facilitation: A Cross-Generational Study	Sota Kobuki et.al.	2409.16899	null
2024-09-25	Automating Traffic Model Enhancement with AI Research Agent	Xusen Guo et.al.	2409.16876	link
2024-09-25	Communication Backbone Reconfiguration with Connectivity Maintenance	Leonardo Santos et.al.	2409.16851	null
2024-09-25	Modeling the Modqueue: Towards Understanding and Improving Report Resolution on Reddit	Tanvi Bajpai et.al.	2409.16840	null
2024-09-24	Context-Based Meta Reinforcement Learning for Robust and Adaptable Peg-in-Hole Assembly Tasks	Ahmed Shokry et.al.	2409.16208	null
2024-09-25	Extending Stable and Popular Matching Algorithms from Bipartite to Arbitrary Instances	Gergely Csáji et.al.	2409.16173	null
2024-09-24	EnIGMA: Enhanced Interactive Generative Model Agent for CTF Challenges	Talor Abramovich et.al.	2409.16165	link
2024-09-25	Efficient Motion Prediction: A Lightweight & Accurate Trajectory Prediction Model With Fast Training and Inference Speed	Alexander Prutsch et.al.	2409.16154	link
2024-09-24	Analyzing Probabilistic Methods for Evaluating Agent Capabilities	Axel Højmark et.al.	2409.16125	null
2024-09-24	MOSS: Enabling Code-Driven Evolution and Context Management for AI Agents	Ming Zhu et.al.	2409.16120	link
2024-09-24	A decision-theoretic model for a principal-agent collaborative learning problem	Getachew K Befekadu et.al.	2409.16068	null
2024-09-24	Bridging Environments and Language with Rendering Functions and Vision-Language Models	Theo Cachet et.al.	2409.16024	null
2024-09-24	AIR-Embodied: An Efficient Active 3DGS-based Interaction and Reconstruction Framework with Embodied Large Language Model	Zhenghao Qi et.al.	2409.16019	link
2024-09-24	Automated test generation to evaluate tool-augmented LLMs as conversational AI agents	Samuel Arcadinho et.al.	2409.15934	null
2024-09-18	Residual Descent Differential Dynamic Game (RD3G) – A Fast Newton Solver for Constrained General Sum Games	Zhiyuan Zhang et.al.	2409.12152	null
2024-09-18	MAgICoRe: Multi-Agent, Iterative, Coarse-to-Fine Refinement for Reasoning	Justin Chih-Yao Chen et.al.	2409.12147	link
2024-09-19	The Impact of Element Ordering on LM Agent Performance	Wayne Chi et.al.	2409.12089	link
2024-09-19	Using Large Language Models to Generate Clinical Trial Tables and Figures	Yumeng Yang et.al.	2409.12046	null
2024-09-19	Representing Positional Information in Generative World Models for Object Manipulation	Stefano Ferraro et.al.	2409.12005	null
2024-09-18	Putting Data at the Centre of Offline Multi-Agent Reinforcement Learning	Claude Formanek et.al.	2409.12001	null
2024-09-18	On the Stability of Consensus Control under Rotational Ambiguities	Zhonggang Li et.al.	2409.11979	null
2024-09-18	Anomalous behavior of Replicator dynamics for the Prisoner’s Dilemma on diluted lattices	Fernanda R. Leivas et.al.	2409.11955	null
2024-09-18	Reinforcement Learning as an Improvement Heuristic for Real-World Production Scheduling	Arthur Müller et.al.	2409.11933	null
2024-09-18	Secure Control Systems for Autonomous Quadrotors against Cyber-Attacks	Samuel Belkadi et.al.	2409.11897	link
2024-09-17	Ising model with varying spin strength on a scale-free network: scaling functions and critical amplitude ratios	M. Krasnytska et.al.	2409.11396	null
2024-09-17	Distributed Perception Aware Safe Leader Follower System via Control Barrier Methods	Richie R. Suganda et.al.	2409.11394	null
2024-09-17	LLM-Agent-UMF: LLM-based Agent Unified Modeling Framework for Seamless Integration of Multi Active/Passive Core-Agents	Amine B. Hassouna et.al.	2409.11393	null
2024-09-17	CORE-Bench: Fostering the Credibility of Published Research Through a Computational Reproducibility Agent Benchmark	Zachary S. Siegel et.al.	2409.11363	link
2024-09-17	A Scalable Game Theoretic Approach for Coordination of Multiple Dynamic Systems	Mostafa M. Shibl et.al.	2409.11358	null
2024-09-17	EIA: Environmental Injection Attack on Generalist Web Agents for Privacy Leakage	Zeyi Liao et.al.	2409.11295	link
2024-09-17	P-RAG: Progressive Retrieval Augmented Generation For Planning on Embodied Everyday Task	Weiye Xu et.al.	2409.11279	null
2024-09-17	Hackphyr: A Local Fine-Tuned LLM Agent for Network Security Environments	Maria Rigaki et.al.	2409.11276	null
2024-09-18	The Art of Storytelling: Multi-Agent Generative AI for Dynamic Multimodal Narratives	Samee Arif et.al.	2409.11261	link
2024-09-17	To What Extent do Open-loop and Feedback Nash Equilibria Diverge in General-Sum Linear Quadratic Dynamic Games?	Chih-Yuan Chiu et.al.	2409.11257	null
2024-09-16	On interactive anisotropic walks in two dimensions generated from a three state opinion dynamics model	Surajit Saha et.al.	2409.10413	null
2024-09-16	Reducing Leximin Fairness to Utilitarian Optimization	Eden Hartman et.al.	2409.10395	null
2024-09-16	Decentralized and Asymmetric Multi-Agent Learning in Construction Sites	Yakov Miron et.al.	2409.10375	null
2024-09-16	Instigating Cooperation among LLM Agents Using Adaptive Information Modulation	Qiliang Chen et.al.	2409.10372	null
2024-09-16	2D or not 2D: How Does the Dimensionality of Gesture Representation Affect 3D Co-Speech Gesture Generation?	Téo Guichoux et.al.	2409.10357	null
2024-09-16	Partial Ordering Bayesian Logistic Regression Model for Phase I Combination Trials and Computationally Efficient Approach to Operational Prior Specification	Weishi Chen et.al.	2409.10352	link
2024-09-16	Cognitive Kernel: An Open-source Agent System towards Generalist Autopilots	Hongming Zhang et.al.	2409.10277	link
2024-09-16	Synchronization-Based Cooperative Distributed Model Predictive Control	Julius Beerwerth et.al.	2409.10215	null
2024-09-16	Maneuver Decision-Making with Trajectory Streams Prediction for Autonomous Vehicles	Mais Jamal et.al.	2409.10165	null
2024-09-16	Multi-Agent Obstacle Avoidance using Velocity Obstacles and Control Barrier Functions	Alejandro Sánchez Roncero et.al.	2409.10117	null