Updated on 2025.06.28
Usage instructions: here
Agent
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-26 | Whole-Body Conditioned Egocentric Video Prediction | Yutong Bai et.al. | 2506.21552 | null |
2025-06-26 | PsyLite Technical Report | Fangjun Ding et.al. | 2506.21536 | null |
2025-06-26 | Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge | Boyu Gou et.al. | 2506.21506 | null |
2025-06-26 | From multi-allocations to allocations, with subadditive valuations | Uriel Feige et.al. | 2506.21493 | null |
2025-06-26 | Ad-Hoc Human-AI Coordination Challenge | Tin Dizdarević et.al. | 2506.21490 | null |
2025-06-26 | Reinforcement Learning for Optimal Control of Spin Magnetometers | Logan W. Cooke et.al. | 2506.21475 | null |
2025-06-26 | Agent-RewardBench: Towards a Unified Benchmark for Reward Modeling across Perception, Planning, and Safety in Real-World Multimodal Agents | Tianyi Men et.al. | 2506.21252 | null |
2025-06-26 | Dynamic Risk-Aware MPPI for Mobile Robots in Crowds via Efficient Monte Carlo Approximations | Elia Trevisan et.al. | 2506.21205 | null |
2025-06-26 | Artificial Delegates Resolve Fairness Issues in Perpetual Voting with Partial Turnout | Apurva Shah et.al. | 2506.21186 | null |
2025-06-26 | Performance improvement of spatial semantic segmentation with enriched audio features and agent-based error correction for DCASE 2025 Challenge Task 4 | Jongyeon Park et.al. | 2506.21174 | null |
2025-06-25 | MMSearch-R1: Incentivizing LMMs to Search | Jinming Wu et.al. | 2506.20670 | null |
2025-06-25 | The Decrypto Benchmark for Multi-Agent Reasoning and Theory of Mind | Andrei Lupu et.al. | 2506.20664 | null |
2025-06-25 | Memento: Note-Taking for Your Future Self | Chao Wan et.al. | 2506.20642 | null |
2025-06-25 | Towards Community-Driven Agents for Machine Learning Engineering | Sijie Li et.al. | 2506.20640 | null |
2025-06-25 | Model Editing as a Double-Edged Sword: Steering Agent Ethical Behavior Toward Beneficence or Harm | Baixiang Huang et.al. | 2506.20606 | null |
2025-06-25 | Fine-Tuning and Prompt Engineering of LLMs, for the Creation of Multi-Agent AI for Addressing Sustainable Protein Production Challenges | Alexander D. Kalian et.al. | 2506.20598 | null |
2025-06-25 | An Explicit Solution for the Problem of Optimal Investment with Random Endowment | Michael Donisch et.al. | 2506.20506 | null |
2025-06-25 | Engineering Sentience | Konstantin Demin et.al. | 2506.20504 | null |
2025-06-25 | Opinion Dynamics with Highly Oscillating Opinions | Víctor A. Vargas-Pérez et.al. | 2506.20472 | null |
2025-06-25 | An Agentic System for Rare Disease Diagnosis with Traceable Reasoning | Weike Zhao et.al. | 2506.20430 | null |
2025-06-24 | JoyAgents-R1: Joint Evolution Dynamics for Versatile Multi-LLM Agents with Reinforcement Learning | Ai Han et.al. | 2506.19846 | null |
2025-06-24 | MAM: Modular Multi-Agent Framework for Multi-Modal Medical Diagnosis via Role-Specialized Collaboration | Yucheng Zhou et.al. | 2506.19835 | null |
2025-06-24 | Curating art exhibitions using machine learning | Eurico Covas et.al. | 2506.19813 | null |
2025-06-24 | LLM-Based Social Simulations Require a Boundary | Zengqing Wu et.al. | 2506.19806 | null |
2025-06-24 | Learning Task Belief Similarity with Latent Dynamics for Meta-Reinforcement Learning | Menglong Zhang et.al. | 2506.19785 | null |
2025-06-24 | SAGE: Strategy-Adaptive Generation Engine for Query Rewriting | Teng Wang et.al. | 2506.19783 | null |
2025-06-24 | A Survey of Multi-sensor Fusion Perception for Embodied AI: Background, Methods, Challenges and Prospects | Shulan Ruan et.al. | 2506.19769 | null |
2025-06-24 | From Reproduction to Replication: Evaluating Research Agents with Progressive Code Masking | Gyeongwon James Kim et.al. | 2506.19724 | null |
2025-06-24 | A Survey of LLM-Driven AI Agent Communication: Protocols, Security Risks, and Defense Countermeasures | Dezhang Kong et.al. | 2506.19676 | null |
2025-06-24 | How trust networks shape students’ opinions about the proficiency of artificially intelligent assistants | Yutong Bu et.al. | 2506.19655 | null |
2025-06-23 | Audit & Repair: An Agentic Framework for Consistent Story Visualization in Text-to-Image Diffusion Models | Kiymet Akdemir et.al. | 2506.18900 | null |
2025-06-23 | Steering Conceptual Bias via Transformer Latent-Subspace Activation | Vansh Sharma et.al. | 2506.18887 | null |
2025-06-23 | GRAND-SLAM: Local Optimization for Globally Consistent Large-Scale Multi-Agent Gaussian SLAM | Annika Thomas et.al. | 2506.18885 | null |
2025-06-23 | Broad Validity of the First-Order Approach in Moral Hazard | Eduardo Azevedo et.al. | 2506.18873 | null |
2025-06-23 | Offline Goal-Conditioned Reinforcement Learning with Projective Quasimetric Planning | Anthony Kobanda et.al. | 2506.18847 | null |
2025-06-23 | Understanding Software Engineering Agents: A Study of Thought-Action-Result Trajectories | Islem Bouzenia et.al. | 2506.18824 | null |
2025-06-23 | Multi-Agent Online Control with Adversarial Disturbances | Anas Barakat et.al. | 2506.18814 | null |
2025-06-23 | Fair Allocation with Money: What is Your Objective? | Noga Klein Elmalem et.al. | 2506.18794 | null |
2025-06-23 | TRIZ Agents: A Multi-Agent LLM Approach for TRIZ-Based Innovation | Kamil Szczepanik et.al. | 2506.18783 | null |
2025-06-23 | Temporal Neural Cellular Automata: Application to modeling of contrast enhancement in breast MRI | Daniel M. Lang et.al. | 2506.18720 | null |
2025-06-20 | VLN-R1: Vision-Language Navigation via Reinforcement Fine-Tuning | Zhangyang Qi et.al. | 2506.17221 | null |
2025-06-20 | Long-term Traffic Simulation with Interleaved Autoregressive Motion and Scenario Generation | Xiuyu Yang et.al. | 2506.17213 | null |
2025-06-20 | Dissecting the SWE-Bench Leaderboards: Profiling Submitters and Architectures of LLM- and Agent-Based Repair Systems | Matias Martinez et.al. | 2506.17208 | null |
2025-06-20 | Towards AI Search Paradigm | Yuchen Li et.al. | 2506.17188 | null |
2025-06-20 | Capturing Misalignment | Pierfrancesco Guarino et.al. | 2506.17176 | null |
2025-06-20 | A Note on Proper Relational Structures | Adam Bjorndahl et.al. | 2506.17142 | null |
2025-06-20 | When Can Model-Free Reinforcement Learning be Enough for Thinking? | Josiah P. Hanna et.al. | 2506.17124 | null |
2025-06-20 | A general multi-stratum model for a nanofunctionalized releasing capsule: a computational study | Elia Onofri et.al. | 2506.17078 | null |
2025-06-20 | Behavior Driven Development for 3D Games | Fernando Pastor Ricós et.al. | 2506.17057 | null |
2025-06-20 | Scalable and Reliable Multi-agent Reinforcement Learning for Traffic Assignment | Leizhen Wang et.al. | 2506.17029 | null |
2025-06-20 | Embodied Web Agents: Bridging Physical-Digital Realms for Integrated Agent Intelligence | Yining Hong et.al. | 2506.15677 | null |
2025-06-18 | Leaky Thoughts: Large Reasoning Models Are Not Private Thinkers | Tommaso Green et.al. | 2506.15674 | link |
2025-06-18 | SwarmAgentic: Towards Fully Automated Agentic System Generation via Swarm Intelligence | Yao Zhang et.al. | 2506.15672 | null |
2025-06-18 | PhishDebate: An LLM-Based Multi-Agent Framework for Phishing Website Detection | Wenhao Li et.al. | 2506.15656 | null |
2025-06-18 | FindingDory: A Benchmark to Evaluate Memory in Embodied Agents | Karmesh Yadav et.al. | 2506.15635 | null |
2025-06-18 | The Effect of State Representation on LLM Agent Behavior in Dynamic Routing Games | Lyle Goodyear et.al. | 2506.15624 | null |
2025-06-18 | Multi-Agent, Multi-Scale Systems with the Koopman Operator | Craig Bakker et.al. | 2506.15589 | null |
2025-06-18 | Learning to flock in open space by avoiding collisions and staying together | Martino Brambati et.al. | 2506.15587 | null |
2025-06-18 | Managing Complex Failure Analysis Workflows with LLM-based Reasoning and Acting Agents | Aline Dobrovsky et.al. | 2506.15567 | null |
2025-06-18 | Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning | Roger Creus Castanyer et.al. | 2506.15544 | link |
2025-06-17 | RobotSmith: Generative Robotic Tool Design for Acquisition of Complex Manipulation Skills | Chunru Lin et.al. | 2506.14763 | null |
2025-06-17 | Swarm-STL: A Framework for Motion Planning in Large-Scale, Multi-Swarm Systems | Shiyu Cheng et.al. | 2506.14749 | null |
2025-06-17 | AgentDistill: Training-Free Agent Distillation with Generalizable MCP Boxes | Jiahao Qiu et.al. | 2506.14728 | null |
2025-06-17 | Linear Planar 3-SAT and Its Applications in Planning | Victorien Desbois et.al. | 2506.14713 | null |
2025-06-17 | AGENTSAFE: Benchmarking the Safety of Embodied Agents on Hazardous Instructions | Aishan Liu et.al. | 2506.14697 | null |
2025-06-17 | Factor-Graph-Based Passive Acoustic Navigation for Decentralized Cooperative Localization Using Bearing Elevation Depth Difference | Kalliyan Velasco et.al. | 2506.14690 | null |
2025-06-17 | Unified Software Engineering agent as AI Software Engineer | Leonhard Applis et.al. | 2506.14683 | null |
2025-06-17 | StreetLens: Enabling Human-Centered AI Agents for Neighborhood Assessment from Street View Imagery | Jina Kim et.al. | 2506.14670 | null |
2025-06-17 | SENIOR: Efficient Query Selection and Preference-Guided Exploration in Preference-based Reinforcement Learning | Hexian Ni et.al. | 2506.14648 | null |
2025-06-17 | GenerationPrograms: Fine-grained Attribution with Executable Programs | David Wan et.al. | 2506.14580 | null |
2025-06-16 | MARCO: Hardware-Aware Neural Architecture Search for Edge Devices with Multi-Agent Reinforcement Learning and Conformal Prediction Filtering | Arya Fayyazi et.al. | 2506.13755 | null |
2025-06-16 | PB $^2$ : Preference Space Exploration via Population-Based Methods in Preference-Based Reinforcement Learning | Brahim Driss et.al. | 2506.13741 | null |
2025-06-16 | The Courage to Stop: Overcoming Sunk Cost Fallacy in Deep Reinforcement Learning | Jiashun Liu et.al. | 2506.13672 | null |
2025-06-16 | We Should Identify and Mitigate Third-Party Safety Risks in MCP-Powered Agent Systems | Junfeng Fang et.al. | 2506.13666 | link |
2025-06-16 | Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning | Shulin Tian et.al. | 2506.13654 | null |
2025-06-16 | xbench: Tracking Agents Productivity Scaling with Profession-Aligned Real-World Evaluations | Kaiyuan Chen et.al. | 2506.13651 | null |
2025-06-16 | Deceptive Path Planning: A Bayesian Game Approach | Violetta Rostobaya et.al. | 2506.13650 | null |
2025-06-16 | CAMS: A CityGPT-Powered Agentic Framework for Urban Human Mobility Simulation | Yuwei Du et.al. | 2506.13599 | null |
2025-06-16 | Agent Capability Negotiation and Binding Protocol (ACNBP) | Ken Huang et.al. | 2506.13590 | link |
2025-06-16 | Non-exchangeable mean-field theory for adaptive weights: propagation of chaos and graphon sampling lemma | Datong Zhou et.al. | 2506.13587 | null |
2025-06-13 | Affogato: Learning Open-Vocabulary Affordance Grounding with Automated Data Generation at Scale | Junha Lee et.al. | 2506.12009 | null |
2025-06-13 | Upgrade or Switch: Do We Need a New Registry Architecture for the Internet of AI Agents? | Ramesh Raskar et.al. | 2506.12003 | null |
2025-06-13 | Self-Regulating Cars: Automating Traffic Control in Free Flow Road Networks | Ankit Bhardwaj et.al. | 2506.11973 | null |
2025-06-13 | Visual Pre-Training on Unlabeled Images using Reinforcement Learning | Dibya Ghosh et.al. | 2506.11967 | null |
2025-06-13 | Automated Treatment Planning for Interstitial HDR Brachytherapy for Locally Advanced Cervical Cancer using Deep Reinforcement Learning | Mohammadamin Moradi et.al. | 2506.11957 | null |
2025-06-13 | Secure API-Driven Research Automation to Accelerate Scientific Discovery | Tyler J. Skluzacek et.al. | 2506.11950 | null |
2025-06-13 | Breaking Habits: On the Role of the Advantage Function in Learning Causal State Representations | Miguel Suau et.al. | 2506.11912 | null |
2025-06-13 | Palpation Alters Auditory Pain Expressions with Gender-Specific Variations in Robopatients | Chapa Sirithunge et.al. | 2506.11906 | null |
2025-06-13 | An Explainable AI Framework for Dynamic Resource Management in Vehicular Network Slicing | Haochen Sun et.al. | 2506.11882 | null |
2025-06-13 | Your Ride, Your Rules: Psychology and Cognition Enabled Automated Driving Systems | Zhipeng Bao et.al. | 2506.11842 | null |
2025-06-12 | AutoMind: Adaptive Knowledgeable Agent for Automated Data Science | Yixin Ou et.al. | 2506.10974 | link |
2025-06-12 | Eye, Robot: Learning to Look to Act with a BC-RL Perception-Action Loop | Justin Kerr et.al. | 2506.10968 | null |
2025-06-12 | SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks | Lianghong Guo et.al. | 2506.10954 | link |
2025-06-12 | Build the web for agents, not agents for the web | Xing Han Lù et.al. | 2506.10953 | null |
2025-06-12 | Monitoring Decomposition Attacks in LLMs with Lightweight Sequential Monitors | Chen Yueh-Han et.al. | 2506.10949 | link |
2025-06-12 | Execution Guided Line-by-Line Code Generation | Boaz Lavon et.al. | 2506.10948 | link |
2025-06-12 | Dynamic Epistemic Friction in Dialogue | Timothy Obiso et.al. | 2506.10934 | null |
2025-06-12 | Agentic Semantic Control for Autonomous Wireless Space Networks: Extending Space-O-RAN with MCP-Driven Distributed Intelligence | Eduardo Baena et.al. | 2506.10925 | null |
2025-06-12 | Prediction and control of geometry-induced nematic order in growing multicellular systems | Lukas Hupe et.al. | 2506.10867 | null |
2025-06-12 | CIIR@LiveRAG 2025: Optimizing Multi-Agent Retrieval Augmented Generation through Self-Training | Alireza Salemi et.al. | 2506.10844 | link |
2025-06-11 | Flipping Against All Odds: Reducing LLM Coin Flip Bias via Verbalized Rejection Sampling | Tim Z. Xiao et.al. | 2506.09998 | null |
2025-06-11 | SRLAgent: Enhancing Self-Regulated Learning Skills through Gamification and LLM Assistance | Wentao Ge et.al. | 2506.09968 | null |
2025-06-11 | The Sample Complexity of Online Strategic Decision Making with Information Asymmetry and Knowledge Transportability | Jiachen Hu et.al. | 2506.09940 | null |
2025-06-11 | On the Linear Programming Model for Dynamic Stochastic Matching and Its Application on Pricing | Junlin Chen et.al. | 2506.09924 | null |
2025-06-11 | PersonaLens: A Benchmark for Personalization Evaluation in Conversational AI Assistants | Zheng Zhao et.al. | 2506.09902 | link |
2025-06-11 | “What are my options?”: Explaining RL Agents with Diverse Near-Optimal Alternatives (Extended) | Noel Brindise et.al. | 2506.09901 | null |
2025-06-11 | OctoNav: Towards Generalist Embodied Navigation | Chen Gao et.al. | 2506.09839 | null |
2025-06-11 | Automatic Treatment Planning using Reinforcement Learning for High-dose-rate Prostate Brachytherapy | Tonghe Wang et.al. | 2506.09805 | null |
2025-06-11 | Delegations as Adaptive Representation Patterns: Rethinking Influence in Liquid Democracy | Davide Grossi et.al. | 2506.09789 | null |
2025-06-11 | Intelligent Design 4.0: Paradigm Evolution Toward the Agentic AI Era | Shuo Jiang et.al. | 2506.09755 | null |
2025-06-10 | ALE-Bench: A Benchmark for Long-Horizon Objective-Driven Algorithm Engineering | Yuki Imajuku et.al. | 2506.09050 | link |
2025-06-10 | VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning | Li Kang et.al. | 2506.09049 | null |
2025-06-10 | Agentic Neural Networks: Self-Evolving Multi-Agent Systems via Textual Backpropagation | Xiaowen Ma et.al. | 2506.09046 | null |
2025-06-10 | The Decoupled Risk Landscape in Performative Prediction | Javier Sanguino et.al. | 2506.09044 | null |
2025-06-10 | Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Scheduling System | Yuan Guo et.al. | 2506.08972 | null |
2025-06-10 | Towards Robust Deep Reinforcement Learning against Environmental State Perturbation | Chenxu Wang et.al. | 2506.08961 | null |
2025-06-10 | What Limits Virtual Agent Application? OmniBench: A Scalable Multi-Dimensional Benchmark for Essential Virtual Agent Capabilities | Wendong Bu et.al. | 2506.08933 | null |
2025-06-10 | Enhancing generalizability of model discovery across parameter space with multi-experiment equation learning (ME-EQL) | Maria-Veronica Ciocanel et.al. | 2506.08916 | link |
2025-06-10 | Intention-Conditioned Flow Occupancy Models | Chongyi Zheng et.al. | 2506.08902 | link |
2025-06-10 | Pairwise similarity method for majority domination problem | N. I. Shushko et.al. | 2506.08886 | null |
2025-06-09 | GUI-Reflection: Empowering Multimodal GUI Models with Self-Reflection Behavior | Penghao Wu et.al. | 2506.08012 | null |
2025-06-09 | Dreamland: Controllable World Creation with Simulator and Generative Models | Sicheng Mo et.al. | 2506.08006 | null |
2025-06-09 | Supporting Construction Worker Well-Being with a Multi-Agent Conversational AI System | Fan Yang et.al. | 2506.07997 | null |
2025-06-09 | $τ^2$ -Bench: Evaluating Conversational Agents in a Dual-Control Environment | Victor Barres et.al. | 2506.07982 | link |
2025-06-09 | Realistic Urban Traffic Generator using Decentralized Federated Learning for the SUMO simulator | Alberto Bazán-Guillén et.al. | 2506.07980 | null |
2025-06-10 | Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction | Junhong Shen et.al. | 2506.07976 | link |
2025-06-09 | HeuriGym: An Agentic Benchmark for LLM-Crafted Heuristics in Combinatorial Optimization | Hongzheng Chen et.al. | 2506.07972 | link |
2025-06-09 | Diffusion of Responsibility in Collective Decision Making | Pavel Naumov et.al. | 2506.07935 | null |
2025-06-09 | LUCIFER: Language Understanding and Context-Infused Framework for Exploration and Behavior Refinement | Dimitris Panagopoulos et.al. | 2506.07915 | null |
2025-06-09 | A distributed motion planning approach to cooperative underwater acoustic source tracking and pursuit | Andrea Tiranti et.al. | 2506.07877 | null |
2025-06-06 | PersonaAgent: When Large Language Model Agents Meet Personalization at Test Time | Weizhi Zhang et.al. | 2506.06254 | null |
2025-06-06 | Longer Lists Yield Better Matchings | Yuri Faenza et.al. | 2506.06217 | null |
2025-06-06 | Can Theoretical Physics Research Benefit from Language Agents? | Sirui Lu et.al. | 2506.06214 | null |
2025-06-06 | A Theoretical Study of (Hyper) Self-Attention through the Lens of Interactions: Representation, Training, Generalization | Muhammed Ustaomeroglu et.al. | 2506.06179 | null |
2025-06-06 | Does It Run and Is That Enough? Revisiting Text-to-Chart Generation with a Multi-Agent Approach | James Ford et.al. | 2506.06175 | null |
2025-06-06 | The Lock-in Hypothesis: Stagnation by Algorithm | Tianyi Alex Qiu et.al. | 2506.06166 | null |
2025-06-06 | (AI peers) are people learning from the same standpoint: Perception of AI characters in a Collaborative Science Investigation | Eunhye Grace Ko et.al. | 2506.06165 | null |
2025-06-06 | Personalized Large Language Models Can Increase the Belief Accuracy of Social Networks | Adiba Mahbub Proma et.al. | 2506.06153 | null |
2025-06-06 | CCLSTM: Coupled Convolutional Long-Short Term Memory Network for Occupancy Flow Forecasting | Peter Lengyel et.al. | 2506.06128 | null |
2025-06-06 | Reinforcement Learning Optimization for Large-Scale Learning: An Efficient and User-Friendly Scaling Library | Weixun Wang et.al. | 2506.06122 | null |
2025-06-05 | Time to Talk: LLM Agents for Asynchronous Group Communication in Mafia Games | Niv Eckhaus et.al. | 2506.05309 | link |
2025-06-05 | ProRefine: Inference-time Prompt Refinement with Textual Feedback | Deepak Pandita et.al. | 2506.05305 | null |
2025-06-05 | Control Tax: The Price of Keeping AI in Check | Mikhail Terekhov et.al. | 2506.05296 | null |
2025-06-05 | A Smooth Sea Never Made a Skilled $\texttt{SAILOR}$ : Robust Imitation via Learning to Search | Arnav Kumar Jain et.al. | 2506.05294 | link |
2025-06-05 | Tight analyses of first-order methods with error feedback | Daniel Berg Thomsen et.al. | 2506.05271 | link |
2025-06-06 | Teaming in the AI Era: AI-Augmented Frameworks for Forming, Simulating, and Optimizing Human Teams | Mohammed Almutairi et.al. | 2506.05265 | null |
2025-06-05 | Conservative classifiers do consistently well with improving agents: characterizing statistical and online learning | Dravyansh Sharma et.al. | 2506.05252 | null |
2025-06-05 | Towards Language-Augmented Multi-Agent Deep Reinforcement Learning | Maxime Toquebiau et.al. | 2506.05236 | null |
2025-06-05 | A Framework for Ethical Judgment of Smart City Applications | Weichen Shi et.al. | 2506.05172 | null |
2025-06-05 | An emergence-oriented approach to cyclic pursuit | Zhaozhan Yao et.al. | 2506.05157 | null |
2025-06-04 | OWMM-Agent: Open World Mobile Manipulation With Multi-modal Agentic Data Synthesis | Junting Chen et.al. | 2506.04217 | link |
2025-06-04 | Thinking Beyond Visibility: A Near-Optimal Policy Framework for Locally Interdependent Multi-Agent MDPs | Alex DeWeese et.al. | 2506.04215 | null |
2025-06-04 | TracLLM: A Generic Framework for Attributing Long Context LLMs | Yanting Wang et.al. | 2506.04202 | link |
2025-06-04 | MACS: Multi-Agent Reinforcement Learning for Optimization of Crystal Structures | Elena Zamaraeva et.al. | 2506.04195 | null |
2025-06-04 | SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models | Yuhao Wu et.al. | 2506.04180 | null |
2025-06-04 | A primal-dual price-optimization method for computing equilibrium prices in mean-field games models | Xu Wang et.al. | 2506.04169 | link |
2025-06-04 | Image Editing As Programs with Diffusion Models | Yujia Hu et.al. | 2506.04158 | null |
2025-06-05 | macOSWorld: A Multilingual Interactive Benchmark for GUI Agents | Pei Yang et.al. | 2506.04135 | link |
2025-06-04 | TRiSM for Agentic AI: A Review of Trust, Risk, and Security Management in LLM-based Agentic Multi-Agent Systems | Shaina Raza et.al. | 2506.04133 | null |
2025-06-04 | CLAIM: An Intent-Driven Multi-Agent Framework for Analyzing Manipulation in Courtroom Dialogues | Disha Sheshanarayana et.al. | 2506.04131 | null |
2025-06-03 | GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents | Qianhui Wu et.al. | 2506.03143 | null |
2025-06-03 | Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning | Yinjie Wang et.al. | 2506.03136 | link |
2025-06-03 | Designing Algorithmic Delegates: The Role of Indistinguishability in Human-AI Handoff | Sophie Greenwood et.al. | 2506.03102 | null |
2025-06-03 | EgoVLM: Policy Optimization for Egocentric Video Understanding | Ashwin Vinod et.al. | 2506.03097 | link |
2025-06-03 | DPO Learning with LLMs-Judge Signal for Computer Use Agents | Man Luo et.al. | 2506.03095 | null |
2025-06-03 | Provable Reinforcement Learning from Human Feedback with an Unknown Link Function | Qining Zhang et.al. | 2506.03066 | null |
2025-06-03 | MAEBE: Multi-Agent Emergent Behavior Framework | Sinem Erisken et.al. | 2506.03053 | null |
2025-06-03 | EDEN: Entorhinal Driven Egocentric Navigation Toward Robotic Deployment | Mikolaj Walczak et.al. | 2506.03046 | null |
2025-06-03 | Towards Analyzing and Understanding the Limitations of VAPO: A Theoretical Perspective | Jintian Shao et.al. | 2506.03038 | null |
2025-06-03 | TestAgent: An Adaptive and Intelligent Expert for Human Assessment | Junhao Yu et.al. | 2506.03032 | null |
2025-05-30 | Open CaptchaWorld: A Comprehensive Web-based Platform for Testing and Benchmarking Multimodal LLM Agents | Yaxin Luo et.al. | 2505.24878 | null |
2025-05-30 | Agent-X: Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic Tasks | Tajamul Ashraf et.al. | 2505.24876 | link |
2025-05-30 | VideoCAD: A Large-Scale Video Dataset for Learning UI Interactions and 3D Reasoning from CAD Software | Brandon Man et.al. | 2505.24838 | link |
2025-05-30 | Draw ALL Your Imagine: A Holistic Benchmark and Agent Framework for Complex Instruction-based Image Generation | Yucheng Zhou et.al. | 2505.24787 | link |
2025-06-02 | EXP-Bench: Can AI Conduct AI Research Experiments? | Patrick Tser Jern Kon et.al. | 2505.24785 | link |
2025-05-30 | Emergent Dynamics of Active Systems on Curved Environments | Euan D. Mackay et.al. | 2505.24730 | null |
2025-05-30 | CoRet: Improved Retriever for Code Editing | Fabio Fehr et.al. | 2505.24715 | null |
2025-05-30 | Causal-aware Large Language Models: Enhancing Decision-Making Through Learning, Adapting and Acting | Wei Chen et.al. | 2505.24710 | link |
2025-05-30 | Towards a unified user modeling language for engineering human centered AI systems | Aaron Conrardy et.al. | 2505.24697 | null |
2025-05-30 | Multiple LLM Agents Debate for Equitable Cultural Alignment | Dayeon Ki et.al. | 2505.24671 | link |
2025-05-29 | From Chat Logs to Collective Insights: Aggregative Question Answering | Wentao Zhang et.al. | 2505.23765 | null |
2025-05-29 | ZeroGUI: Automating Online GUI Learning at Zero Human Cost | Chenyu Yang et.al. | 2505.23762 | link |
2025-05-29 | ThinkGeo: Evaluating Tool-Augmented Agents for Remote Sensing Tasks | Akashah Shabbir et.al. | 2505.23752 | link |
2025-05-29 | ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering | Zexi Liu et.al. | 2505.23723 | link |
2025-05-29 | COBRA: Contextual Bandit Algorithm for Ensuring Truthful Strategic Agents | Arun Verma et.al. | 2505.23720 | null |
2025-05-29 | From Connectivity to Autonomy: The Dawn of Self-Evolving Communication Systems | Zeinab Nezami et.al. | 2505.23710 | null |
2025-05-29 | Data-to-Dashboard: Multi-Agent LLM Framework for Insightful Visualization in Enterprise Analytics | Ran Zhang et.al. | 2505.23695 | link |
2025-05-29 | ROTATE: Regret-driven Open-ended Training for Ad Hoc Teamwork | Caroline Wang et.al. | 2505.23686 | link |
2025-05-29 | GSO: Challenging Software Optimization Tasks for Evaluating SWE-Agents | Manish Shetty et.al. | 2505.23671 | link |
2025-05-29 | Initial Luminally Deposited FGF4 Critically Influences Blastocyst Patterning | Michael A. Ramirez-Sierra et.al. | 2505.23650 | null |
2025-05-28 | 3DLLM-Mem: Long-Term Spatial-Temporal Memory for Embodied 3D Large Language Model | Wenbo Hu et.al. | 2505.22657 | null |
2025-05-28 | Position: Uncertainty Quantification Needs Reassessment for Large-language Model Agents | Michael Kirchhof et.al. | 2505.22655 | null |
2025-05-28 | WebDancer: Towards Autonomous Information Seeking Agency | Jialong Wu et.al. | 2505.22648 | link |
2025-05-29 | FastTD3: Simple, Fast, and Capable Reinforcement Learning for Humanoid Control | Younggyo Seo et.al. | 2505.22642 | null |
2025-05-28 | LabUtopia: High-Fidelity Simulation and Hierarchical Benchmark for Scientific Embodied Agents | Rui Li et.al. | 2505.22634 | null |
2025-05-28 | HDDLGym: A Tool for Studying Multi-Agent Hierarchical Problems Defined in HDDL with OpenAI Gym | Ngoc La et.al. | 2505.22597 | link |
2025-05-28 | GitGoodBench: A Novel Benchmark For Evaluating Agentic Performance On Git | Tobias Lindenbauer et.al. | 2505.22583 | link |
2025-05-29 | Agent-UniRAG: A Trainable Open-Source LLM Agent Framework for Unified Retrieval-Augmented Generation Systems | Hoang Pham et.al. | 2505.22571 | null |
2025-05-28 | Universal Visuo-Tactile Video Understanding for Embodied Interaction | Yifan Xie et.al. | 2505.22566 | null |
2025-05-28 | Training RL Agents for Multi-Objective Network Defense Tasks | Andres Molina-Markham et.al. | 2505.22531 | null |
2025-05-27 | Silence is Not Consensus: Disrupting Agreement Bias in Multi-Agent LLMs via Catfish Agent for Clinical Decision Making | Yihan Wang et.al. | 2505.21503 | null |
2025-05-27 | AdInject: Real-World Black-Box Attacks on Web Agents via Advertising Delivery | Haowei Wang et.al. | 2505.21499 | link |
2025-05-27 | Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers | Wei Pang et.al. | 2505.21497 | link |
2025-05-27 | UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents | Han Xiao et.al. | 2505.21496 | link |
2025-05-27 | Robust Hypothesis Generation: LLM-Automated Language Bias for Inductive Logic Programming | Yang Yang et.al. | 2505.21486 | null |
2025-05-27 | Scaling External Knowledge Input Beyond Context Windows of LLMs via Multi-Agent Collaboration | Zijun Liu et.al. | 2505.21471 | link |
2025-05-27 | Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO | Muzhi Zhu et.al. | 2505.21457 | null |
2025-05-27 | Learning Individual Behavior in Agent-Based Models with Graph Diffusion Networks | Francesco Cozzi et.al. | 2505.21426 | link |
2025-05-27 | GUARD:Dual-Agent based Backdoor Defense on Chain-of-Thought in Neural Code Generation | Naizhu Jin et.al. | 2505.21425 | null |
2025-05-27 | Autonomous Multi-Modal LLM Agents for Treatment Planning in Focused Ultrasound Ablation Surgery | Lina Zhao et.al. | 2505.21418 | null |
2025-05-27 | MineAnyBuild: Benchmarking Spatial Planning for Open-world AI Agents | Ziming Wei et.al. | 2505.20148 | link |
2025-05-26 | Agentic 3D Scene Generation with Spatially Contextualized VLMs | Xinhang Liu et.al. | 2505.20129 | null |
2025-05-26 | Iterative Self-Incentivization Empowers Large Language Models as Agentic Searchers | Zhengliang Shi et.al. | 2505.20128 | link |
2025-05-26 | Agentic AI Process Observability: Discovering Behavioral Variability | Fabiana Fournier et.al. | 2505.20127 | null |
2025-05-26 | Agents Require Metacognitive and Strategic Reasoning to Succeed in the Coming Labor Markets | Simpson Zhang et.al. | 2505.20120 | null |
2025-05-27 | TrojanStego: Your Language Model Can Secretly Be A Steganographic Privacy Leaking Agent | Dominik Meier et.al. | 2505.20118 | link |
2025-05-26 | MA-RAG: Multi-Agent Retrieval-Augmented Generation via Collaborative Chain-of-Thought Reasoning | Thang Nguyen et.al. | 2505.20096 | null |
2025-05-26 | SwarmThinkers: Learning Physically Consistent Atomic KMC Transitions at Scale | Qi Li et.al. | 2505.20094 | null |
2025-05-26 | REARANK: Reasoning Re-ranking Agent via Reinforcement Learning | Le Zhang et.al. | 2505.20046 | link |
2025-05-26 | Training LLM-Based Agents with Synthetic Self-Reflected Trajectories and Partial Masking | Yihan Chen et.al. | 2505.20023 | null |
2025-05-23 | Lost in the Haystack: Smaller Needles are More Difficult for LLMs to Find | Owen Bianchi et.al. | 2505.18148 | null |
2025-05-23 | Stochastic agent-based Monte Carlo simulations for reaction-diffusion models, population dynamics, and epidemic spreading | Mohamed Swailem et.al. | 2505.18145 | null |
2025-05-23 | Gaming Tool Preferences in Agentic LLMs | Kazem Faghih et.al. | 2505.18135 | link |
2025-05-23 | ProgRM: Build Better GUI Agents with Progress Rewards | Danyang Zhang et.al. | 2505.18121 | null |
2025-05-23 | Facility Location with Public Locations and Private Doubly-Peaked Costs | Richard Cole et.al. | 2505.18114 | null |
2025-05-23 | ManuSearch: Democratizing Deep Search in Large Language Models with a Transparent and Open Multi-Agent Framework | Lisheng Huang et.al. | 2505.18105 | link |
2025-05-23 | Planning without Search: Refining Frontier LLMs with Offline Goal-Conditioned RL | Joey Hong et.al. | 2505.18098 | null |
2025-05-23 | Deep Video Discovery: Agentic Search with Tool Use for Long-form Video Understanding | Xiaoyi Zhang et.al. | 2505.18079 | null |
2025-05-23 | Linear Mixture Distributionally Robust Markov Decision Processes | Zhishuai Liu et.al. | 2505.18044 | null |
2025-05-23 | Towards Analyzing and Understanding the Limitations of VAPO: A Theoretical Perspective | Jintian Shao et.al. | 2505.17997 | null |
2025-05-22 | SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding | Haoning Wu et.al. | 2505.17012 | link |
2025-05-22 | X-MAS: Towards Building Multi-Agent Systems with Heterogeneous LLMs | Rui Ye et.al. | 2505.16997 | link |
2025-05-22 | MASLab: A Unified and Comprehensive Codebase for LLM-based Multi-Agent Systems | Rui Ye et.al. | 2505.16988 | link |
2025-05-22 | T1: A Tool-Oriented Conversational Dataset for Multi-Turn Agentic Planning | Amartya Chakraborty et.al. | 2505.16986 | null |
2025-05-22 | Beyond Correlation: Towards Causal Large Language Model Agents in Biomedicine | Adib Bazgir et.al. | 2505.16982 | null |
2025-05-22 | Know the Ropes: A Heuristic Strategy for LLM-based Multi-Agent System Design | Zhenkun Li et.al. | 2505.16979 | null |
2025-05-22 | SWE-Dev: Evaluating and Training Autonomous Feature-Driven Software Development | Yaxin Du et.al. | 2505.16975 | link |
2025-05-22 | Modeling Inequality in Complex Networks of Strategic Agents using Iterative Game-Theoretic Transactions | Mayank Kejriwal et.al. | 2505.16966 | null |
2025-05-22 | Cracking Aegis: An Adversarial LLM-based Game for Raising Awareness of Vulnerabilities in Privacy Protection | Jiaying Fu et.al. | 2505.16954 | null |
2025-05-22 | A Comprehensive Evaluation of Contemporary ML-Based Solvers for Combinatorial Optimization | Shengyu Feng et.al. | 2505.16952 | null |
2025-05-22 | GUI-G1: Understanding R1-Zero-Like Training for Visual Grounding in GUI Agents | Yuqi Zhou et.al. | 2505.15810 | link |
2025-05-21 | The Agentic Economy | David M. Rothschild et.al. | 2505.15799 | null |
2025-05-22 | HCRMP: A LLM-Hinted Contextual Reinforcement Learning Framework for Autonomous Driving | Zhiwen Chen et.al. | 2505.15793 | null |
2025-05-21 | Solving General-Utility Markov Decision Processes in the Single-Trial Regime with Online Planning | Pedro P. Santos et.al. | 2505.15782 | null |
2025-05-21 | Alignment Under Pressure: The Case for Informed Adversaries When Evaluating LLM Defenses | Xiaoxue Yang et.al. | 2505.15738 | link |
2025-05-21 | DEBATE, TRAIN, EVOLVE: Self Evolution of Language Model Reasoning | Gaurav Srivastava et.al. | 2505.15734 | null |
2025-05-21 | Quantum Dots as Functional Nanosystems for Enhanced Biomedical Applications | Pronama Biswas et.al. | 2505.15705 | null |
2025-05-21 | HAMF: A Hybrid Attention-Mamba Framework for Joint Scene Context Understanding and Future Motion Representation Learning | Xiaodong Mei et.al. | 2505.15703 | null |
2025-05-21 | Average Reward Reinforcement Learning for Omega-Regular and Mean-Payoff Objectives | Milad Kazemi et.al. | 2505.15693 | null |
2025-05-21 | From Grounding to Manipulation: Case Studies of Foundation Model Integration in Embodied Robotic Systems | Xiuchao Sui et.al. | 2505.15685 | link |
2025-05-20 | NExT-Search: Rebuilding User Feedback Ecosystem for Generative AI Search | Sunhao Dai et.al. | 2505.14680 | null |
2025-05-20 | ContextAgent: Context-Aware Proactive LLM Agents with Open-World Sensory Perceptions | Bufang Yang et.al. | 2505.14668 | null |
2025-05-20 | AI Agents in the Electricity Market Game with Cryptocurrency Transactions: A Post-Terminator Analysis | Microsoft Copilot et.al. | 2505.14612 | null |
2025-05-20 | Agent Context Protocols Enhance Collective Inference | Devansh Bhardwaj et.al. | 2505.14569 | null |
2025-05-20 | Multi-agent Reinforcement Learning vs. Fixed-Time Control for Traffic Signal Optimization: A Simulation Study | Saahil Mahato et.al. | 2505.14544 | link |
2025-05-20 | A Logic of General Attention Using Edge-Conditioned Event Models (Extended Version) | Gaia Belardinelli et.al. | 2505.14539 | null |
2025-05-20 | Energy-Efficient Deep Reinforcement Learning with Spiking Transformers | Mohammad Irfan Uddin et.al. | 2505.14533 | null |
2025-05-20 | BACON: A fully explainable AI model with graded logic for decision making problems | Haishi Bai et.al. | 2505.14510 | null |
2025-05-20 | Design and Evaluation of a Microservices Cloud Framework for Online Travel Platforms | Biman Barua et.al. | 2505.14508 | null |
2025-05-20 | Security of Distributed Gradient Descent Against Byzantine Agents | Sribalaji C. Anand et.al. | 2505.14473 | null |
2025-05-19 | G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning | Liang Chen et.al. | 2505.13426 | link |
2025-05-20 | A Dataless Reinforcement Learning Approach to Rounding Hyperplane Optimization for Max-Cut | Gabriel Malikal et.al. | 2505.13405 | null |
2025-05-19 | Robin: A multi-agent system for automating scientific discovery | Ali Essam Ghareeb et.al. | 2505.13400 | null |
2025-05-19 | Rethinking Stateful Tool Use in Multi-Turn Dialogues: Benchmarks and Challenges | Hongru Wang et.al. | 2505.13328 | null |
2025-05-19 | Synthesis of Communication Policies for Multi-Agent Systems Robust to Communication Restrictions | Saleh Soudijani et.al. | 2505.13311 | null |
2025-05-19 | TimeSeriesGym: A Scalable Benchmark for (Time Series) Machine Learning Engineering Agents | Yifu Cai et.al. | 2505.13291 | link |
2025-05-19 | Hybrid Voting-Based Task Assignment in Modular Construction Scenarios | Daniel Weiner et.al. | 2505.13278 | null |
2025-05-19 | From Automation to Autonomy: A Survey on Large Language Models in Scientific Discovery | Tianshi Zheng et.al. | 2505.13259 | link |
2025-05-19 | Effective and Transparent RAG: Adaptive-Reward Reinforcement Learning for Decision Traceability | Jingyi Ren et.al. | 2505.13258 | link |
2025-05-19 | Composing Dextrous Grasping and In-hand Manipulation via Scoring with a Reinforcement Learning Critic | Lennart Röstel et.al. | 2505.13253 | null |
2025-05-16 | Automatic Reward Shaping from Confounded Offline Data | Mingxuan Li et.al. | 2505.11478 | null |
2025-05-16 | Signal attenuation enables scalable decentralized multi-agent reinforcement learning over networks | Wesley A Suttle et.al. | 2505.11461 | null |
2025-05-16 | Robust Equilibria in Shared Resource Allocation via Strengthening Border’s Theorem | David X. Lin et.al. | 2505.11431 | null |
2025-05-16 | Can AI automatically analyze public opinion? A LLM agents-based agentic pipeline for timely public opinion analysis | Jing Liu et.al. | 2505.11401 | null |
2025-05-16 | Dynam3D: Dynamic Layered 3D Tokens Empower VLM for Vision-and-Language Navigation | Zihan Wang et.al. | 2505.11383 | link |
2025-05-16 | GuideBench: Benchmarking Domain-Oriented Guideline Following for LLM Agents | Lingxiao Diao et.al. | 2505.11368 | null |
2025-05-16 | Long-Term Average Impulse Control with Mean Field Interactions | K. L. Helmes et.al. | 2505.11345 | null |
2025-05-16 | Explaining Strategic Decisions in Multi-Agent Reinforcement Learning for Aerial Combat Tactics | Ardian Selmonaj et.al. | 2505.11311 | null |
2025-05-16 | Diffusion Learning with Partial Agent Participation and Local Updates | Elsa Rizk et.al. | 2505.11307 | null |
2025-05-16 | Meta-World+: An Improved, Standardized, RL Benchmark | Reginald McLean et.al. | 2505.11289 | link |
2025-05-15 | Towards a Deeper Understanding of Reasoning Capabilities in Large Language Models | Annie Wong et.al. | 2505.10543 | link |
2025-05-15 | Knowledge capture, adaptation and composition (KCAC): A framework for cross-task curriculum learning in robotic manipulation | Xinrui Wang et.al. | 2505.10522 | null |
2025-05-15 | Fixing Incomplete Value Function Decomposition for Multi-Agent Reinforcement Learning | Andrea Baisero et.al. | 2505.10484 | null |
2025-05-15 | Fine-tuning Diffusion Policies with Backpropagation Through Diffusion Timesteps | Ningyuan Yang et.al. | 2505.10482 | null |
2025-05-15 | AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenge | Ranjan Sapkota et.al. | 2505.10468 | null |
2025-05-15 | Bridging Theory and Perception in Fair Division: A Study on Comparative and Fair Share Notions | Hadi Hosseini et.al. | 2505.10433 | null |
2025-05-15 | Aggregating Information and Preferences with Bounded-Size Deviations | Qishen Han et.al. | 2505.10388 | null |
2025-05-15 | Multi-Agent Path Finding For Large Agents Is Intractable | Artem Agafonov et.al. | 2505.10387 | null |
2025-05-15 | Plasticity as the Mirror of Empowerment | David Abel et.al. | 2505.10361 | null |
2025-05-15 | Efficient Adaptation of Reinforcement Learning Agents to Sudden Environmental Change | Jonathan Clifford Balloch et.al. | 2505.10330 | null |
2025-05-14 | Language Agents Mirror Human Causal Reasoning Biases. How Can We Help Them Think Like Scientists? | Anthony GX-Chen et.al. | 2505.09614 | null |
2025-05-14 | WorldView-Bench: A Benchmark for Evaluating Global Cultural Perspectives in Large Language Models | Abdullah Mushtaq et.al. | 2505.09595 | null |
2025-05-14 | Preserving Plasticity in Continual Learning with Adaptive Linearity Injection | Seyed Roozbeh Razavi Rohani et.al. | 2505.09486 | null |
2025-05-14 | Streaming Multi-agent Pathfinding | Mingkai Tang et.al. | 2505.09472 | link |
2025-05-14 | CXMArena: Unified Dataset to benchmark performance in realistic CXM Scenarios | Raghav Garg et.al. | 2505.09436 | link |
2025-05-15 | Decentralized Nonlinear Model Predictive Control-Based Flock Navigation with Real-Time Obstacle Avoidance in Unknown Obstructed Environments | Nuthasith Gerdpratoom et.al. | 2505.09434 | null |
2025-05-14 | Using Dopants as Agents to Probe Key Electronic Properties of Organic Semiconductors | Artem Fediai et.al. | 2505.09431 | null |
2025-05-14 | Linear Search with Probabilistic Detection and Variable Speeds | Jared Coleman et.al. | 2505.09429 | link |
2025-05-15 | SafePath: Conformal Prediction for Safe LLM-Based Autonomous Navigation | Achref Doula et.al. | 2505.09427 | null |
2025-05-14 | The Influence of Human-inspired Agentic Sophistication in LLM-driven Strategic Reasoners | Vince Trencsenyi et.al. | 2505.09396 | null |
2025-05-14 | Towards Autonomous UAV Visual Object Search in City Space: Benchmark and Agentic Methodology | Yatai Ji et.al. | 2505.08765 | null |
2025-05-13 | Enhancing Software Development with Context-Aware Conversational Agents: A User Study on Developer Interactions with Chatbots | Glaucia Melo et.al. | 2505.08648 | null |
2025-05-13 | TRAIL: Trace Reasoning and Agentic Issue Localization | Darshan Deshpande et.al. | 2505.08638 | null |
2025-05-13 | Credit Assignment and Efficient Exploration based on Influence Scope in Multi-agent Reinforcement Learning | Shuai Han et.al. | 2505.08630 | null |
2025-05-13 | OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning | Zhaochen Su et.al. | 2505.08617 | link |
2025-05-13 | MC-Swarm: Minimal-Communication Multi-Agent Trajectory Planning and Deadlock Resolution for Quadrotor Swarm | Yunwoo Lee et.al. | 2505.08593 | null |
2025-05-14 | Communication-Efficient Distributed Online Nonconvex Optimization with Time-Varying Constraints | Kunpeng Zhang et.al. | 2505.08592 | null |
2025-05-13 | The Truth Becomes Clearer Through Debate! Multi-Agent Systems with Large Language Models Unmask Fake News | Yuhan Liu et.al. | 2505.08532 | null |
2025-05-13 | Strategy-Augmented Planning for Large Language Models via Opponent Exploitation | Shuai Xu et.al. | 2505.08459 | link |
2025-05-13 | Zero-Shot Sim-to-Real Reinforcement Learning for Fruit Harvesting | Emlyn Williams et.al. | 2505.08458 | null |
2025-05-12 | Imagine, Verify, Execute: Memory-Guided Agentic Exploration with Vision-Language Models | Seungjae Lee et.al. | 2505.07815 | null |
2025-05-12 | A Theoretical Framework for Explaining Reinforcement Learning with Shapley Values | Daniel Beechey et.al. | 2505.07797 | link |
2025-05-12 | MLE-Dojo: Interactive Environments for Empowering LLM Agents in Machine Learning Engineering | Rushi Qiang et.al. | 2505.07782 | link |
2025-05-12 | Multi-Agent Path Finding via Finite-Horizon Hierarchical Factorization | Jiarui Li et.al. | 2505.07779 | null |
2025-05-12 | Agent RL Scaling Law: Agent RL with Spontaneous Code Execution for Mathematical Problem Solving | Xinji Mai et.al. | 2505.07773 | link |
2025-05-12 | Emotion-Gradient Metacognitive RSI (Part I): Theoretical Foundations and Single-Agent Architecture | Rintaro Ando et.al. | 2505.07757 | null |
2025-05-13 | VTutor for High-Impact Tutoring at Scale: Managing Engagement and Real-Time Multi-Screen Monitoring with P2P Connections | Eason Chen et.al. | 2505.07736 | null |
2025-05-13 | Codifying Character Logic in Role-Playing | Letian Peng et.al. | 2505.07705 | link |
2025-05-12 | Belief Injection for Epistemic Control in Linguistic State Space | Sebastian Dumbrava et.al. | 2505.07693 | null |
2025-05-12 | Chronocept: Instilling a Sense of Time in Machines | Krish Goel et.al. | 2505.07637 | link |
2025-05-09 | Robust Multi-Agent Decision-Making in Finite-Population Games | Shinkyu Park et.al. | 2505.06200 | null |
2025-05-09 | Neuro-Symbolic Concepts | Jiayuan Mao et.al. | 2505.06191 | null |
2025-05-09 | The Power of Matching for Online Fractional Hedonic Games | Martin Bullinger et.al. | 2505.06163 | null |
2025-05-09 | Realistic Adversarial Attacks for Robustness Evaluation of Trajectory Prediction Models via Future State Perturbation | Julian F. Schumann et.al. | 2505.06134 | link |
2025-05-09 | ELA-ZSON: Efficient Layout-Aware Zero-Shot Object Navigation Agent with Hierarchical Planning | Jiawei Hou et.al. | 2505.06131 | null |
2025-05-09 | Oncolytic mechanisms and immunotherapeutic potential of Newcastle disease virus in cancer therapy | Umar Ahmad et.al. | 2505.06067 | null |
2025-05-09 | Offline Multi-agent Reinforcement Learning via Score Decomposition | Dan Qiao et.al. | 2505.05968 | null |
2025-05-09 | Learning Power Control Protocol for In-Factory 6G Subnetworks | Uyoata E. Uyoata et.al. | 2505.05967 | null |
2025-05-09 | Cost-Effective, Low Latency Vector Search with Azure Cosmos DB | Nitish Upreti et.al. | 2505.05885 | link |
2025-05-09 | Evolutionary ecology of words | Reiji Suzuki et.al. | 2505.05863 | null |
2025-05-08 | RL-DAUNCE: Reinforcement Learning-Driven Data Assimilation with Uncertainty-Aware Constrained Ensembles | Pouria Behnoudfar et.al. | 2505.05452 | null |
2025-05-08 | clem:todd: A Framework for the Systematic Benchmarking of LLM-Based Task-Oriented Dialogue System Realisations | Chalamalasetti Kranti et.al. | 2505.05445 | null |
2025-05-09 | EcoAgent: An Efficient Edge-Cloud Collaborative Multi-Agent Framework for Mobile Automation | Biao Yi et.al. | 2505.05440 | null |
2025-05-08 | Empowering Scientific Workflows with Federated Agents | J. Gregory Pauloski et.al. | 2505.05428 | link |
2025-05-08 | Robustly optimal dynamics for active matter reservoir computing | Mario U. Gaimann et.al. | 2505.05420 | null |
2025-05-08 | Weighted Envy-Freeness Revisited: Indivisible Resource and House Allocations | Yuxi Liu et.al. | 2505.05353 | null |
2025-05-08 | Mapping User Trust in Vision Language Models: Research Landscape, Challenges, and Prospects | Agnese Chiatti et.al. | 2505.05318 | null |
2025-05-08 | HEXGEN-TEXT2SQL: Optimizing LLM Inference Request Scheduling for Agentic Text-to-SQL Workflow | You Peng et.al. | 2505.05286 | link |
2025-05-09 | Software Development Life Cycle Perspective: A Survey of Benchmarks for Code Large Language Models and Agents | Kaixin Wang et.al. | 2505.05283 | null |
2025-05-08 | Enhancing Cooperative Multi-Agent Reinforcement Learning with State Modelling and Adversarial Exploration | Andreas Kontogiannis et.al. | 2505.05262 | link |
2025-05-07 | Implicitly Aligning Humans and Autonomous Agents through Shared Task Abstractions | Stéphane Aroca-Ouellette et.al. | 2505.04579 | link |
2025-05-07 | Optimal Deterministic Rendezvous in Labeled Lines | Yann Bourreau et.al. | 2505.04564 | null |
2025-05-07 | Qualitative Analysis of $ω$ -Regular Objectives on Robust MDPs | Ali Asadi et.al. | 2505.04539 | null |
2025-05-07 | Beyond Theorem Proving: Formulation, Framework and Benchmark for Formal Problem-Solving | Qi Liu et.al. | 2505.04528 | null |
2025-05-07 | RLMiniStyler: Light-weight RL Style Agent for Arbitrary Sequential Neural Style Generation | Jing Hu et.al. | 2505.04424 | link |
2025-05-07 | Consensus-Aware AV Behavior: Trade-offs Between Safety, Interaction, and Performance in Mixed Urban Traffic | Mohammad Elayan et.al. | 2505.04379 | link |
2025-05-07 | Extending a Quantum Reinforcement Learning Exploration Policy with Flags to Connect Four | Filipe Santos et.al. | 2505.04371 | null |
2025-05-07 | Benchmarking LLMs’ Swarm intelligence | Kai Ruan et.al. | 2505.04364 | link |
2025-05-07 | Optimization Problem Solving Can Transition to Evolutionary Agentic Workflows | Wenhao Li et.al. | 2505.04354 | null |
2025-05-07 | Resist Platform-Controlled AI Agents and Champion User-Centric Agent Advocates | Sayash Kapoor et.al. | 2505.04345 | null |
2025-05-06 | Multi-Agent System for Comprehensive Soccer Understanding | Jiayuan Rao et.al. | 2505.03735 | null |
2025-05-06 | WebGen-Bench: Evaluating LLMs on Generating Interactive and Functional Websites from Scratch | Zimu Lu et.al. | 2505.03733 | link |
2025-05-06 | Critical habitat size of organisms diffusing with stochastic resetting | Luiz Menon et.al. | 2505.03727 | null |
2025-05-06 | Meta-Optimization and Program Search using Language Models for Task and Motion Planning | Denis Shcherba et.al. | 2505.03725 | null |
2025-05-06 | Accelerated Decentralized Constraint-Coupled Optimization: A Dual $^2$ Approach | Jingwang Li et.al. | 2505.03719 | null |
2025-05-06 | Demonstrating ViSafe: Vision-enabled Safety for High-speed Detect and Avoid | Parv Kapoor et.al. | 2505.03694 | null |
2025-05-06 | Location-Restricted Stable Matching | Garret Castro et.al. | 2505.03680 | null |
2025-05-06 | CaRaFFusion: Improving 2D Semantic Segmentation with Camera-Radar Point Cloud Fusion and Zero-Shot Image Inpainting | Huawei Sun et.al. | 2505.03679 | null |
2025-05-06 | Gap the (Theory of) Mind: Sharing Beliefs About Teammates’ Goals Boosts Collaboration Perception, Not Performance | Yotam Amitai et.al. | 2505.03674 | null |
2025-05-06 | RoboOS: A Hierarchical Embodied Framework for Cross-Embodiment and Multi-Agent Collaboration | Huajie Tan et.al. | 2505.03673 | link |
2025-05-05 | Scenethesis: A Language and Vision Agentic Framework for 3D Scene Generation | Lu Ling et.al. | 2505.02836 | null |
2025-05-05 | AutoLibra: Agent Metric Induction from Open-Ended Feedback | Hao Zhu et.al. | 2505.02820 | link |
2025-05-05 | Generating HomeAssistant Automations Using an LLM-based Chatbot | Mathyas Giudici et.al. | 2505.02802 | null |
2025-05-05 | Recolorable Graph Exploration by an Oblivious Agent with Fewer Colors | Shota Takahashi et.al. | 2505.02789 | null |
2025-05-05 | Brief Announcement: Minimizing Energy Solves Relative Majority with a Cubic Number of States in Population Protocols | Tom-Lukas Breitkopf et.al. | 2505.02785 | null |
2025-05-05 | Merging plasmoids and nanojet-like ejections in a coronal current sheet | Samrat Sen et.al. | 2505.02733 | null |
2025-05-05 | Graph Neural Network-Based Reinforcement Learning for Controlling Biological Networks: The GATTACA Framework | Andrzej Mizera et.al. | 2505.02712 | link |
2025-05-05 | Technical Report: Evaluating Goal Drift in Language Model Agents | Rauno Arike et.al. | 2505.02709 | null |
2025-05-05 | Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play | Yemin Shi et.al. | 2505.02707 | link |
2025-05-05 | Exploring LLM-Powered Role and Action-Switching Pedagogical Agents for History Education in Virtual Reality | Zihao Zhu et.al. | 2505.02699 | null |
2025-05-02 | Enhancing Diversity in Parallel Agents: A Maximum State Entropy Exploration Story | Vincenzo De Paola et.al. | 2505.01336 | null |
2025-05-02 | Integration of Multi-Mode Preference into Home Energy Management System Using Deep Reinforcement Learning | Mohammed Sumayli et.al. | 2505.01332 | null |
2025-05-02 | The Dance of the Sheared Eigenfunctions | J. Oliveira-Cony et.al. | 2505.01303 | null |
2025-05-02 | Pattern formation using an intrinsic optimal control approach | Tianhao Li et.al. | 2505.01302 | null |
2025-05-02 | Essential Workers at Risk: An Agent-Based Model (SAFE-ABM) with Bayesian Uncertainty Quantification | Elizabeth B. Amona et.al. | 2505.01243 | null |
2025-05-02 | Bilateral Cognitive Security Games in Networked Control Systems under Stealthy Injection Attacks | Anh Tung Nguyen et.al. | 2505.01232 | null |
2025-05-02 | Non-universal Impact of Cholesterol on Ionic Liquid-Membrane Interactions | J. Gupta et.al. | 2505.01230 | null |
2025-05-02 | A Space-Time Trade-off for Fast Self-Stabilizing Leader Election in Population Protocols | Henry Austin et.al. | 2505.01210 | null |
2025-05-02 | Explainable AI Based Diagnosis of Poisoning Attacks in Evolutionary Swarms | Mehrdad Asadi et.al. | 2505.01181 | null |
2025-05-02 | Simulating Tertiary Educational Decision Dynamics: An Agent-Based Model for the Netherlands | Jean-Paul Daemen et.al. | 2505.01142 | null |
2025-05-01 | Towards Autonomous Micromobility through Scalable Urban Simulation | Wayne Wu et.al. | 2505.00690 | null |
2025-05-01 | Visual Test-time Scaling for GUI Agent Grounding | Tiange Luo et.al. | 2505.00684 | link |
2025-05-01 | Rethinking Memory in AI: Taxonomy, Operations, Topics, and Future Directions | Yiming Du et.al. | 2505.00675 | link |
2025-05-01 | A Finite-State Controller Based Offline Solver for Deterministic POMDPs | Alex Schutz et.al. | 2505.00596 | link |
2025-05-01 | ParkDiffusion: Heterogeneous Multi-Agent Multi-Modal Trajectory Prediction for Automated Parking using Diffusion Models | Jiarong Wei et.al. | 2505.00586 | null |
2025-05-01 | A continuum thermodynamic model of the influence of non-ionic surfactant on mass transfer from gas bubbles | Dieter Bothe et.al. | 2505.00581 | null |
2025-05-01 | Directly Forecasting Belief for Reinforcement Learning with Delays | Qingyuan Wu et.al. | 2505.00546 | link |
2025-05-01 | Emergence of Roles in Robotic Teams with Model Sharing and Limited Communication | Ian O’Flynn et.al. | 2505.00540 | null |
2025-05-01 | Safety-Critical Traffic Simulation with Guided Latent Diffusion Model | Mingxing Peng et.al. | 2505.00515 | null |
2025-05-01 | Variational OOD State Correction for Offline Reinforcement Learning | Ke Jiang et.al. | 2505.00503 | null |
2025-04-30 | A Survey of Interactive Generative Video | Jiwen Yu et.al. | 2504.21853 | null |
2025-04-30 | TRUST: An LLM-Based Dialogue System for Trauma Understanding and Structured Assessments | Sichang Tu et.al. | 2504.21851 | null |
2025-04-30 | Characterizing AI Agents for Alignment and Governance | Atoosa Kasirzadeh et.al. | 2504.21848 | null |
2025-04-30 | SWE-smith: Scaling Data for Software Engineering Agents | John Yang et.al. | 2504.21798 | null |
2025-04-30 | WebThinker: Empowering Large Reasoning Models with Deep Research Capability | Xiaoxi Li et.al. | 2504.21776 | link |
2025-04-30 | Is Intermediate Fusion All You Need for UAV-based Collaborative Perception? | Jiuwu Hao et.al. | 2504.21774 | link |
2025-04-30 | LLM-based Interactive Imitation Learning for Robotic Manipulation | Jonas Werner et.al. | 2504.21769 | link |
2025-04-30 | Asymptotic Analysis of Weighted Fair Division | Pasin Manurangsi et.al. | 2504.21728 | null |
2025-04-30 | LLM-Empowered Embodied Agent for Memory-Augmented Task Planning in Household Robotics | Marc Glocker et.al. | 2504.21716 | link |
2025-04-30 | Economic Inequality between Groups in an a priori Stratified Society | Thiago Dias et.al. | 2504.21703 | null |
2025-04-29 | Toward Efficient Exploration by Large Language Model Agents | Dilip Arumugam et.al. | 2504.20997 | null |
2025-04-29 | TesserAct: Learning 4D Embodied World Models | Haoyu Zhen et.al. | 2504.20995 | null |
2025-04-29 | XPG-RL: Reinforcement Learning with Explainable Priority Guidance for Efficiency-Boosted Mechanical Search | Yiting Zhang et.al. | 2504.20969 | null |
2025-04-29 | AegisLLM: Scaling Agentic Systems for Self-Reflective Defense in LLM Security | Zikui Cai et.al. | 2504.20965 | link |
2025-04-29 | Opinion-Driven Decision-Making for Multi-Robot Navigation through Narrow Corridors | Norah K. Alghamdi et.al. | 2504.20947 | null |
2025-04-29 | Improvements of Dark Experience Replay and Reservoir Sampling towards Better Balance between Consolidation and Plasticity | Taisuke Kobayashi et.al. | 2504.20932 | null |
2025-04-29 | Exploiting inter-agent coupling information for efficient reinforcement learning of cooperative LQR | Shahbaz P Qadri Syed et.al. | 2504.20927 | null |
2025-04-29 | Modeling AI-Human Collaboration as a Multi-Agent Adaptation | Prothit Sen et.al. | 2504.20903 | link |
2025-04-29 | CBM-RAG: Demonstrating Enhanced Interpretability in Radiology Report Generation with Multi-Agent RAG and Concept Bottleneck Models | Hasan Md Tusfiqur Alam et.al. | 2504.20898 | link |
2025-04-29 | Does Feedback Help in Bandits with Arm Erasures? | Merve Karakas et.al. | 2504.20894 | null |
2025-04-28 | Towards Automated Scoping of AI for Social Good Projects | Jacob Emmerson et.al. | 2504.20010 | null |
2025-04-28 | Simplified and Secure MCP Gateways for Enterprise AI Integration | Ivo Brett et.al. | 2504.19997 | link |
2025-04-28 | TD-EVAL: Revisiting Task-Oriented Dialogue Evaluation by Combining Turn-Level Precision with Dialogue-Level Comparisons | Emre Can Acikgoz et.al. | 2504.19982 | null |
2025-04-28 | On one generalization of stable allocations in a two-sided market | Alexander V. Karzanov et.al. | 2504.19978 | null |
2025-04-28 | Securing Agentic AI: A Comprehensive Threat Model and Mitigation Framework for Generative AI Agents | Vineeth Sai Narajala et.al. | 2504.19956 | null |
2025-04-28 | Securing GenAI Multi-Agent Systems Against Tool Squatting: A Zero Trust Registry-Based Approach | Vineeth Sai Narajala et.al. | 2504.19951 | null |
2025-04-28 | Automated decision-making for dynamic task assignment at scale | Riccardo Lo Bianco et.al. | 2504.19933 | link |
2025-04-28 | Can AI Agents Design and Implement Drug Discovery Pipelines? | Khachik Smbatyan et.al. | 2504.19912 | null |
2025-04-28 | LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects | Guangyi Liu et.al. | 2504.19838 | link |
2025-04-28 | PhenoAssistant: A Conversational Multi-Agent AI System for Automated Plant Phenotyping | Feng Chen et.al. | 2504.19818 | link |
2025-04-25 | Instrumentation for Better Demonstrations: A Case Study | Remko Proesmans et.al. | 2504.18481 | null |
2025-04-25 | Improved Dwell-times for Switched Nonlinear Systems using Memory Regression Extension | Muzaffar Qureshi et.al. | 2504.18457 | null |
2025-04-25 | Generalization Guarantees for Multi-View Representation Learning and Application to Regularization via Gaussian Product Mixture Prior | Milad Sefidgaran et.al. | 2504.18455 | null |
2025-04-25 | On monotone completion of risk markets: Limit results for incomplete risk markets | Iman Khajepour et.al. | 2504.18436 | null |
2025-04-25 | LLMpatronous: Harnessing the Power of LLMs For Vulnerability Detection | Rajesh Yarra et.al. | 2504.18423 | null |
2025-04-25 | Auto-SLURP: A Benchmark Dataset for Evaluating Multi-Agent Frameworks in Smart Personal Assistant | Lei Shen et.al. | 2504.18373 | link |
2025-04-25 | Interpretable Affordance Detection on 3D Point Clouds with Probabilistic Prototypes | Maximilian Xiling Li et.al. | 2504.18355 | null |
2025-04-25 | Revisiting Data Auditing in Large Vision-Language Models | Hongyu Zhu et.al. | 2504.18349 | null |
2025-04-25 | Optimal Control of Sensor-Induced Illusions on Robotic Agents | Lorenzo Medici et.al. | 2504.18339 | null |
2025-04-25 | Towards Adaptive Software Agents for Debugging | Yacine Majdoub et.al. | 2504.18316 | null |
2025-04-24 | Robotic Task Ambiguity Resolution via Natural Language Interaction | Eugenio Chisari et.al. | 2504.17748 | null |
2025-04-24 | Applied Sheaf Theory For Multi-agent Artificial Intelligence (Reinforcement Learning) Systems: A Prospectus | Eric Schmid et.al. | 2504.17700 | null |
2025-04-24 | ‘The Boring and the Tedious’: Invisible Labour in India’s Gig-Economy | Pratyay Suvarnapathaki et.al. | 2504.17697 | null |
2025-04-24 | Towards a HIPAA Compliant Agentic AI System in Healthcare | Subash Neupane et.al. | 2504.17669 | null |
2025-04-24 | A Constraint Opinion Model | Fabio Gadducci et.al. | 2504.17605 | null |
2025-04-24 | Mitigating xApp conflicts for efficient network slicing in 6G O-RAN: a graph convolutional-based attention network approach | Sihem Bakri et.al. | 2504.17590 | null |
2025-04-24 | A Multi-Agent, Laxity-Based Aggregation Strategy for Cost-Effective Electric Vehicle Charging and Local Transformer Overload Prevention | Kristoffer Christensen et.al. | 2504.17575 | null |
2025-04-24 | Cooperative Task Offloading through Asynchronous Deep Reinforcement Learning in Mobile Edge Computing for Future Networks | Yuelin Liu et.al. | 2504.17526 | null |
2025-04-24 | Communication-Efficient Personalized Distributed Learning with Data and Node Heterogeneity | Zhuojun Tian et.al. | 2504.17520 | null |
2025-04-24 | Plasticine: Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning | Mingqi Yuan et.al. | 2504.17490 | null |
2025-04-23 | OptimAI: Optimization from Natural Language Using LLM-Powered AI Agents | Raghav Thind et.al. | 2504.16918 | null |
2025-04-23 | Building A Secure Agentic AI Application Leveraging A2A Protocol | Idan Habler et.al. | 2504.16902 | null |
2025-04-23 | Do Large Language Models know who did what to whom? | Joseph M. Denning et.al. | 2504.16884 | null |
2025-04-23 | Hybrid Reinforcement Learning and Model Predictive Control for Adaptive Control of Hydrogen-Diesel Dual-Fuel Combustion | Julian Bedei et.al. | 2504.16875 | null |
2025-04-23 | Monte Carlo Planning with Large Language Model for Text-Based Game Agents | Zijing Shi et.al. | 2504.16855 | null |
2025-04-23 | Fair division of the replacement-units without an appraiser in urban renewal processes | Noga Klein Elmalem et.al. | 2504.16852 | null |
2025-04-23 | MLOps Monitoring at Scale for Digital Platforms | Yu Jeffrey Hu et.al. | 2504.16789 | null |
2025-04-23 | A Survey of AI Agent Protocols | Yingxuan Yang et.al. | 2504.16736 | null |
2025-04-24 | DYNUS: Uncertainty-aware Trajectory Planner in Dynamic Unknown Environments | Kota Kondo et.al. | 2504.16734 | null |
2025-04-23 | IRIS: Interactive Research Ideation System for Accelerating Scientific Discovery | Aniketh Garikaparthi et.al. | 2504.16728 | link |
2025-04-22 | MR. Video: “MapReduce” is the Principle for Long Video Understanding | Ziqi Pang et.al. | 2504.16082 | null |
2025-04-22 | LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities | Thomas Schmied et.al. | 2504.16078 | null |
2025-04-22 | Guiding VLM Agents with Process Rewards at Inference Time for GUI Navigation | Zhiyuan Hu et.al. | 2504.16073 | null |
2025-04-22 | ForesightNav: Learning Scene Imagination for Efficient Exploration | Hardik Shah et.al. | 2504.16062 | link |
2025-04-22 | Reinforcement Learning and Metaheuristics for Feynman Integral Reduction | Mao Zeng et.al. | 2504.16045 | null |
2025-04-22 | A Lagrangian Approach to Optimal Lotteries in Non-Convex Economies | Chengfeng Shen et.al. | 2504.15997 | null |
2025-04-22 | Neuroadaptive Haptics: Comparing Reinforcement Learning from Explicit Ratings and Neural Signals for Adaptive XR Systems | Lukas Gehrke et.al. | 2504.15984 | null |
2025-04-22 | Towards Test Generation from Task Description for Mobile Testing with Multi-modal Reasoning | Hieu Huynh et.al. | 2504.15917 | link |
2025-04-22 | Learning the Spoofability of Limit Order Books With Interpretable Probabilistic Neural Networks | Timothée Fabre et.al. | 2504.15908 | null |
2025-04-22 | A closer look at how large language models trust humans: patterns and biases | Valeria Lerman et.al. | 2504.15801 | null |
2025-04-21 | Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs | Chun-Hsiao Yeh et.al. | 2504.15280 | link |
2025-04-21 | Interpretable Locomotion Prediction in Construction Using a Memory-Driven LLM Agent With Chain-of-Thought Reasoning | Ehsan Ahmadi et.al. | 2504.15263 | null |
2025-04-21 | FlowReasoner: Reinforcing Query-Level Meta-Agents | Hongcheng Gao et.al. | 2504.15257 | link |
2025-04-21 | A Self-Improving Coding Agent | Maxime Robeyns et.al. | 2504.15228 | null |
2025-04-21 | An experimental study of the influence of anonymous information on social media users | Boleslaw K. Szymanski et.al. | 2504.15215 | null |
2025-04-21 | Fully Adaptive Stepsizes: Which System Benefit More – Centralized or Decentralized? | Diyako Ghaderyan et.al. | 2504.15196 | null |
2025-04-21 | Behavioral Universe Network (BUN): A Behavioral Information-Based Framework for Complex Systems | Wei Zhou et.al. | 2504.15146 | null |
2025-04-21 | Neural ATTF: A Scalable Solution to Lifelong Multi-Agent Path Planning | Kushal Shah et.al. | 2504.15130 | null |
2025-04-21 | Contemplative Wisdom for Superalignment | Ruben Laukkonen et.al. | 2504.15125 | null |
2025-04-21 | Fast-Slow Co-advancing Optimizer: Toward Harmonious Adversarial Training of GAN | Lin Wang et.al. | 2504.15099 | null |
2025-04-18 | Science Hierarchography: Hierarchical Organization of Science Literature | Muhan Gao et.al. | 2504.13834 | link |
2025-04-18 | LearnAct: Few-Shot Mobile GUI Agent with a Unified Demonstration Benchmark | Guangyi Liu et.al. | 2504.13805 | null |
2025-04-18 | ChatNekoHacker: Real-Time Fan Engagement with Conversational Agents | Takuya Sera et.al. | 2504.13793 | null |
2025-04-21 | BadApex: Backdoor Attack Based on Adaptive Optimization Mechanism of Black-box Large Language Models | Zhengxian Wu et.al. | 2504.13775 | null |
2025-04-18 | $O(p \log d)$ Subgraph Isomorphism using Stigmergic Swarming Agents | H. Van Dyke Parunak et.al. | 2504.13722 | null |
2025-04-18 | Stability of flocking in the reciprocal two-species Vicsek model: Effects of relative population, motility, and noise | Aditya Kumar Dutta et.al. | 2504.13709 | null |
2025-04-18 | OpenDeception: Benchmarking and Investigating AI Deceptive Behaviors via Open-ended Interaction Simulation | Yichen Wu et.al. | 2504.13707 | null |
2025-04-18 | Modelling Immunity in Agent-based Models | Gray Manicom et.al. | 2504.13706 | null |
2025-04-18 | EyecareGPT: Boosting Comprehensive Ophthalmology Understanding with Tailored Dataset, Benchmark and Model | Sijing Li et.al. | 2504.13650 | link |
2025-04-18 | Simulating Before Planning: Constructing Intrinsic User World Model for User-Tailored Dialogue Policy Planning | Tao He et.al. | 2504.13643 | null |
2025-04-17 | Sleep-time Compute: Beyond Inference Scaling at Test-time | Kevin Lin et.al. | 2504.13171 | link |
2025-04-17 | Exploring Expert Failures Improves LLM Agent Tuning | Li-Cheng Lan et.al. | 2504.13145 | null |
2025-04-17 | Object-Driven Narrative in AR: A Scenario-Metaphor Framework with VLM Integration | Yusi Sun et.al. | 2504.13119 | null |
2025-04-17 | Retrieval-Augmented Generation with Conflicting Evidence | Han Wang et.al. | 2504.13079 | link |
2025-04-17 | InstructRAG: Leveraging Retrieval-Augmented Generation on Instruction Graphs for LLM-Based Task Planning | Zheng Wang et.al. | 2504.13032 | null |
2025-04-17 | Why Ask One When You Can Ask $k$ ? Two-Stage Learning-to-Defer to a Set of Experts | Yannis Montreuil et.al. | 2504.12988 | null |
2025-04-17 | QLLM: Do We Really Need a Mixing Network for Credit Assignment in Multi-Agent Reinforcement Learning? | Zhouyang Jiang et.al. | 2504.12961 | null |
2025-04-17 | Are Retrials All You Need? Enhancing Large Language Model Reasoning Without Verbalized Feedback | Nearchos Potamitis et.al. | 2504.12951 | null |
2025-04-17 | RL-PINNs: Reinforcement Learning-Driven Adaptive Sampling for Efficient Training of PINNs | Zhenao Song et.al. | 2504.12949 | null |
2025-04-18 | Customizing Emotional Support: How Do Individuals Construct and Interact With LLM-Powered Chatbots | Xi Zheng et.al. | 2504.12943 | null |
2025-04-16 | Adapting a World Model for Trajectory Following in a 3D Game | Marko Tot et.al. | 2504.12299 | null |
2025-04-16 | Optimal flock formation induced by agent heterogeneity | Arthur N. Montanari et.al. | 2504.12297 | link |
2025-04-16 | Advancing Arabic Speech Recognition Through Large-Scale Weakly Supervised Learning | Mahmoud Salhab et.al. | 2504.12254 | null |
2025-04-16 | Data Assimilation for Robust UQ Within Agent-Based Simulation on HPC Systems | Adam Spannaus et.al. | 2504.12228 | null |
2025-04-16 | Communication Optimization for Decentralized Learning atop Bandwidth-limited Edge Networks | Tingyang Sun et.al. | 2504.12210 | null |
2025-04-16 | ARCeR: an Agentic RAG for the Automated Definition of Cyber Ranges | Matteo Lupinacci et.al. | 2504.12143 | null |
2025-04-16 | Multilingual Contextualization of Large Language Models for Document-Level Machine Translation | Miguel Moura Ramos et.al. | 2504.12140 | null |
2025-04-16 | The Social Learning Barrier | Florian Brandl et.al. | 2504.12136 | null |
2025-04-16 | EmoACT: a Framework to Embed Emotions into Artificial Agents Based on Affect Control Theory | Francesca Corrao et.al. | 2504.12125 | null |
2025-04-16 | Towards LLM Agents for Earth Observation | Chia Hsiang Kao et.al. | 2504.12110 | null |
2025-04-15 | TextArena | Leon Guertler et.al. | 2504.11442 | link |
2025-04-15 | Embodied World Models Emerge from Navigational Task in Open-Ended Environments | Li Jin et.al. | 2504.11419 | null |
2025-04-15 | Cancer-Myth: Evaluating AI Chatbot on Patient Questions with False Presuppositions | Wang Bill Zhu et.al. | 2504.11373 | link |
2025-04-15 | DataSentinel: A Game-Theoretic Detection of Prompt Injection Attacks | Yupei Liu et.al. | 2504.11358 | link |
2025-04-15 | Learning to Be A Doctor: Searching for Effective Medical Agent Architectures | Yangyang Zhuang et.al. | 2504.11301 | null |
2025-04-15 | Policy heterogeneity improves collective olfactory search in 3-D turbulence | Lorenzo Piro et.al. | 2504.11291 | null |
2025-04-15 | The Obvious Invisible Threat: LLM-Powered GUI Agents’ Vulnerability to Fine-Print Injections | Chaoran Chen et.al. | 2504.11281 | null |
2025-04-15 | Multi-Agent Reinforcement Learning for Greenhouse Gas Offset Credit Markets | Liam Welsh et.al. | 2504.11258 | null |
2025-04-16 | UI-E2I-Synth: Advancing GUI Grounding with Large-Scale Instruction Synthesis | Xinyi Liu et.al. | 2504.11257 | null |
2025-04-15 | A Rollout-Based Algorithm and Reward Function for Efficient Resource Allocation in Business Processes | Jeroen Middelhuis et.al. | 2504.11250 | null |
2025-04-14 | The Price of Competitive Information Disclosure | Siddhartha Banerjee et.al. | 2504.10459 | null |
2025-04-15 | GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents | Xiaobo Xia et.al. | 2504.10458 | null |
2025-04-14 | RealWebAssist: A Benchmark for Long-Horizon Web Assistance with Real-World Users | Suyu Ye et.al. | 2504.10445 | link |
2025-04-14 | Position Uncertainty in a Prisoner’s Dilemma Game : An Experiment | Chowdhury Mohammad Sakib Anwar et.al. | 2504.10441 | null |
2025-04-14 | Silent Self-Stabilizing Ranking: Time Optimal and Space Efficient | Petra Berenbrink et.al. | 2504.10417 | null |
2025-04-14 | Ctrl-Z: Controlling AI Agents via Resampling | Aryan Bhatt et.al. | 2504.10374 | null |
2025-04-14 | Proteinoid spikes: from protocognitive to universal approximating agents | Saksham Sharma et.al. | 2504.10362 | null |
2025-04-14 | Siamese Network with Dual Attention for EEG-Driven Social Learning: Bridging the Human-Robot Gap in Long-Tail Autonomous Driving | Xiaoshan Zhou et.al. | 2504.10296 | null |
2025-04-14 | Characterizing LLM-driven Social Network: The Chirper.ai Case | Yiming Zhu et.al. | 2504.10286 | null |
2025-04-14 | RealHarm: A Collection of Real-World Language Model Application Failures | Pierre Le Jeune et.al. | 2504.10277 | link |
2025-04-11 | DocAgent: A Multi-Agent System for Automated Code Documentation Generation | Dayu Yang et.al. | 2504.08725 | link |
2025-04-11 | SWE-PolyBench: A multi-language benchmark for repository level evaluation of coding agents | Muhammad Shihab Rashid et.al. | 2504.08703 | link |
2025-04-11 | SeaView: Software Engineering Agent Visual Interface for Enhanced Workflow | Timothy Bula et.al. | 2504.08696 | null |
2025-04-11 | TP-RAG: Benchmarking Retrieval-Augmented Large Language Model Agents for Spatiotemporal-Aware Travel Planning | Hang Ni et.al. | 2504.08694 | null |
2025-04-11 | Voice Interaction With Conversational AI Could Facilitate Thoughtful Reflection and Substantive Revision in Writing | Jiho Kim et.al. | 2504.08687 | null |
2025-04-11 | Do LLMs trust AI regulation? Emerging behaviour of game-theoretic LLM agents | Alessio Buscemi et.al. | 2504.08640 | null |
2025-04-11 | Optimal selection of the most informative nodes for a noisy DeGroot model with stubborn agents | Roberta Raineri et.al. | 2504.08622 | null |
2025-04-11 | MooseAgent: A LLM Based Multi-agent Framework for Automating Moose Simulation | Tao Zhang et.al. | 2504.08621 | link |
2025-04-11 | Ready, Bid, Go! On-Demand Delivery Using Fleets of Drones with Unknown, Heterogeneous Energy Storage Constraints | Mohamed S. Talamali et.al. | 2504.08585 | null |
2025-04-11 | FMLGS: Fast Multilevel Language Embedded Gaussians for Part-level Interactive Agents | Xin Tan et.al. | 2504.08581 | null |
2025-04-10 | Fast Adaptation with Behavioral Foundation Models | Harshit Sikchi et.al. | 2504.07896 | null |
2025-04-10 | Benchmarking Adversarial Robustness to Bias Elicitation in Large Language Models: Scalable Automated Assessment with LLM-as-a-Judge | Riccardo Cantini et.al. | 2504.07887 | link |
2025-04-11 | An LLM-Driven Multi-Agent Debate System for Mendelian Diseases | Xinyang Zhou et.al. | 2504.07881 | null |
2025-04-10 | Dual Engines of Thoughts: A Depth-Breadth Integration Framework for Open-Ended Analysis | Fei-Hsuan Yu et.al. | 2504.07872 | null |
2025-04-10 | In itinere infections covertly undermine localized epidemic control in metapopulations | Francesca Dilisante et.al. | 2504.07849 | null |
2025-04-10 | Anytime Single-Step MAPF Planning with Anytime PIBT | Nayesha Gandotra et.al. | 2504.07841 | null |
2025-04-10 | Deceptive Automated Interpretability: Language Models Coordinating to Fool Oversight Systems | Simon Lermen et.al. | 2504.07831 | null |
2025-04-10 | MOSAIC: Modeling Social AI for Content Dissemination and Regulation in Multi-Agent Simulations | Genglin Liu et.al. | 2504.07830 | link |
2025-04-10 | Active Matter Flocking via Predictive Alignment | Julian Giraldo-Barreto et.al. | 2504.07778 | null |
2025-04-10 | Synthesizing High-Quality Programming Tasks with LLM-based Expert and Student Agents | Manh Hung Nguyen et.al. | 2504.07655 | null |
2025-04-09 | SkillWeaver: Web Agents can Self-Improve by Discovering and Honing Skills | Boyuan Zheng et.al. | 2504.07079 | null |
2025-04-09 | A Unified Agentic Framework for Evaluating Conditional Image Generation | Jifang Wang et.al. | 2504.07046 | link |
2025-04-09 | Review of Case-Based Reasoning for LLM Agents: Theoretical Foundations, Architectural Components, and Cognitive Integration | Kostas Hatalis et.al. | 2504.06943 | null |
2025-04-09 | AI-Driven Consensus: Modeling Multi-Agent Networks with Long-Range Interactions through path-Laplacian Matrices | Yusef Ahsini et.al. | 2504.06894 | link |
2025-04-09 | More connection, less community: network formation and local public goods provision | Alastair Langtry et.al. | 2504.06872 | null |
2025-04-09 | Persona Dynamics: Unveiling the Impact of Personality Traits on Agents in Text-Based Games | Seungwon Lim et.al. | 2504.06868 | link |
2025-04-09 | IAAO: Interactive Affordance Learning for Articulated Objects in 3D Environments | Can Zhang et.al. | 2504.06827 | null |
2025-04-09 | Inducing Programmatic Skills for Agentic Tasks | Zora Zhiruo Wang et.al. | 2504.06821 | link |
2025-04-09 | FamilyTool: A Multi-hop Personalized Tool Use Benchmark | Yuxin Wang et.al. | 2504.06766 | link |
2025-04-09 | Adaptive Human-Robot Collaborative Missions using Hybrid Task Planning | Gricel Vázquez et.al. | 2504.06746 | null |
2025-04-08 | FEABench: Evaluating Language Models on Multiphysics Reasoning Ability | Nayantara Mudur et.al. | 2504.06260 | link |
2025-04-08 | The Work Capacity of Channels with Memory: Maximum Extractable Work in Percept-Action Loops | Lukas J. Fiderer et.al. | 2504.06209 | null |
2025-04-08 | TxGemma: Efficient and Agentic LLMs for Therapeutics | Eric Wang et.al. | 2504.06196 | null |
2025-04-08 | SkillFlow: Efficient Skill and Code Transfer Through Communication in Adapting AI Agents | Pagkratios Tagkopoulos et.al. | 2504.06188 | null |
2025-04-08 | Linear Regulator-Based Synchronization of Positive Multi-Agent Systems | Alba Gurpegui et.al. | 2504.06169 | null |
2025-04-08 | V-MAGE: A Game Evaluation Framework for Assessing Visual-Centric Capabilities in Multimodal Large Language Models | Xiangxi Zheng et.al. | 2504.06148 | link |
2025-04-08 | Deploying Chatbots in Customer Service: Adoption Hurdles and Simple Remedies | Evgeny Kagan et.al. | 2504.06145 | null |
2025-04-08 | A Multimedia Analytics Model for the Foundation Model Era | Marcel Worring et.al. | 2504.06138 | null |
2025-04-08 | Decentralizing AI Memory: SHIMI, a Semantic Hierarchical Memory Index for Scalable Agent Reasoning | Tooraj Helmi et.al. | 2504.06135 | null |
2025-04-08 | Accelerating Vehicle Routing via AI-Initialized Genetic Algorithms | Ido Greenberg et.al. | 2504.06126 | null |
2025-04-07 | CREA: A Collaborative Multi-Agent Framework for Creative Content Generation with Diffusion Models | Kavana Venkatesh et.al. | 2504.05306 | null |
2025-04-07 | How to evaluate control measures for LLM agents? A trajectory from today to superintelligence | Tomek Korbak et.al. | 2504.05259 | null |
2025-04-07 | Rationalizing dynamic choices | Henrique de Oliveira et.al. | 2504.05251 | null |
2025-04-07 | Reducing the Communication of Distributed Model Predictive Control: Autoencoders and Formation Control | Torben Schiz et.al. | 2504.05223 | null |
2025-04-07 | DoCIA: An Online Document-Level Context Incorporation Agent for Speech Translation | Xinglin Lyu et.al. | 2504.05122 | link |
2025-04-07 | AI for Climate Finance: Agentic Retrieval and Multi-Step Reasoning for Early Warning System Investments | Saeid Ario Vaghefi et.al. | 2504.05104 | null |
2025-04-07 | AI-Driven Tactical Communications and Networking for Defense: A Survey and Emerging Trends | Victor Monzon Baeza et.al. | 2504.05071 | null |
2025-04-07 | Debate Only When Necessary: Adaptive Multiagent Collaboration for Efficient LLM Reasoning | Sugyeong Eo et.al. | 2504.05047 | null |
2025-04-08 | Attention-Augmented Inverse Reinforcement Learning with Graph Convolutions for Multi-Agent Task Allocation | Huilin Yin et.al. | 2504.05045 | null |
2025-04-07 | Mixture-of-Personas Language Models for Population Simulation | Ngoc Bui et.al. | 2504.05019 | null |
2025-04-04 | Bonsai: Interpretable Tree-Adaptive Grounded Reasoning | Kate Sanders et.al. | 2504.03640 | null |
2025-04-04 | Epicast 2.0: A large-scale, demographically detailed, agent-based model for simulating respiratory pathogen spread in the United States | Prescott C. Alexander et.al. | 2504.03604 | null |
2025-04-04 | APIGen-MT: Agentic Pipeline for Multi-Turn Data Generation via Simulated Agent-Human Interplay | Akshara Prabhakar et.al. | 2504.03601 | null |
2025-04-04 | A Lower Bound on Conservative Elementary Object Systems Coverability | Francesco Di Cosmo et.al. | 2504.03591 | null |
2025-04-04 | SynWorld: Virtual Scenario Synthesis for Agentic Action Knowledge Refinement | Runnan Fang et.al. | 2504.03561 | link |
2025-04-04 | Agentic Knowledgeable Self-awareness | Shuofei Qiao et.al. | 2504.03553 | link |
2025-04-04 | The Limits of “Fairness” of the Variational Generalized Nash Equilibrium | Sophie Hall et.al. | 2504.03540 | null |
2025-04-04 | RANa: Retrieval-Augmented Navigation | Gianluca Monaci et.al. | 2504.03524 | null |
2025-04-04 | Target Prediction Under Deceptive Switching Strategies via Outlier-Robust Filtering of Partially Observed Incomplete Trajectories | Yiming Meng et.al. | 2504.03502 | null |
2025-04-04 | A stochastic volatility approximation for a tick-by-tick price model with mean-field interaction | Paolo Dai Pra et.al. | 2504.03445 | null |
2025-04-03 | Unified World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets | Chuning Zhu et.al. | 2504.02792 | null |
2025-04-03 | Sequential Binary Hypothesis Testing with Competing Agents under Information Asymmetry | Aneesh Raghavan et.al. | 2504.02743 | null |
2025-04-03 | Responsible Development of Offensive AI | Ryan Marinelli et.al. | 2504.02701 | link |
2025-04-03 | The Tension between Trust and Oversight in Long-term Relationships | Peter Achim et.al. | 2504.02696 | null |
2025-04-03 | Handover and SINR-Aware Path Optimization in 5G-UAV mmWave Communication using DRL | Achilles Kiwanuka Machumilane et.al. | 2504.02688 | null |
2025-04-03 | A Set-Theoretic Robust Control Approach for Linear Quadratic Games with Unknown Counterparts | Francesco Bianchin et.al. | 2504.02679 | null |
2025-04-03 | Affordable AI Assistants with Knowledge Graph of Thoughts | Maciej Besta et.al. | 2504.02670 | null |
2025-04-03 | SymDQN: Symbolic Knowledge and Reasoning in Neural Network-based Reinforcement Learning | Ivo Amador et.al. | 2504.02654 | null |
2025-04-04 | Controlled Social Learning: Altruism vs. Bias | Raghu Arghal et.al. | 2504.02648 | null |
2025-04-03 | Multi-Mission Tool Bench: Assessing the Robustness of LLM based Agents through Related and Dynamic Missions | PeiJie Yu et.al. | 2504.02623 | link |
2025-04-02 | Graphon games and an idealized limit of large network games | Motoki Otsuka et.al. | 2504.01944 | null |
2025-04-02 | Review, Refine, Repeat: Understanding Iterative Decoding of AI Agents with Dynamic Evaluation and Selection | Souradip Chakraborty et.al. | 2504.01931 | null |
2025-04-02 | Gen-C: Populating Virtual Worlds with Generative Crowds | Andreas Panayiotou et.al. | 2504.01924 | null |
2025-04-02 | Advancing AI-Scientist Understanding: Making LLM Think Like a Physicist with Interpretable Reasoning | Yinggan Xu et.al. | 2504.01911 | null |
2025-04-02 | Interpreting Emergent Planning in Model-Free Reinforcement Learning | Thomas Bush et.al. | 2504.01871 | null |
2025-04-02 | PaperBench: Evaluating AI’s Ability to Replicate AI Research | Giulio Starace et.al. | 2504.01848 | link |
2025-04-02 | A Randomized Zeroth-Order Hierarchical Framework for Heterogeneous Federated Learning | Yuyang Qiu et.al. | 2504.01839 | null |
2025-04-02 | Budget-Feasible Contracts | Michal Feldman et.al. | 2504.01773 | null |
2025-04-03 | Beyond Non-Expert Demonstrations: Outcome-Driven Action Constraint for Offline Reinforcement Learning | Ke Jiang et.al. | 2504.01719 | null |
2025-04-02 | Reasoning LLMs for User-Aware Multimodal Conversational Agents | Hamed Rahimi et.al. | 2504.01700 | null |
2025-03-31 | RIG: Synergizing Reasoning and Imagination in End-to-End Generalist Policy | Zhonghan Zhao et.al. | 2503.24388 | null |
2025-03-31 | Coordinating Distributed Energy Resources with Nodal Pricing in Distribution Networks: a Game-Theoretic Approach | Eli Brock et.al. | 2503.24342 | null |
2025-03-31 | Fair Dynamic Spectrum Access via Fully Decentralized Multi-Agent Reinforcement Learning | Yubo Zhang et.al. | 2503.24296 | null |
2025-03-31 | Value of Information-based Deceptive Path Planning Under Adversarial Interventions | Wesley A. Suttle et.al. | 2503.24284 | null |
2025-03-31 | MaintainCoder: Maintainable Code Generation Under Dynamic Requirements | Zhengren Wang et.al. | 2503.24260 | link |
2025-03-31 | PAARS: Persona Aligned Agentic Retail Shoppers | Saab Mansour et.al. | 2503.24228 | null |
2025-03-31 | Agent-Based Simulations of Online Political Discussions: A Case Study on Elections in Germany | Abdul Sittar et.al. | 2503.24199 | null |
2025-03-31 | Output Constraints as Attack Surface: Exploiting Structured Generation to Bypass LLM Safety Mechanisms | Shuoming Zhang et.al. | 2503.24191 | null |
2025-03-31 | Navi-plus: Managing Ambiguous GUI Navigation Tasks with Follow-up | Ziming Cheng et.al. | 2503.24180 | null |
2025-03-31 | Reinforcement Learning for Safe Autonomous Two Device Navigation of Cerebral Vessels in Mechanical Thrombectomy | Harry Robertshaw et.al. | 2503.24140 | null |
2025-03-28 | Self-Evolving Multi-Agent Simulations for Realistic Clinical Interactions | Mohammad Almansoori et.al. | 2503.22678 | null |
2025-03-28 | ActionStudio: A Lightweight Framework for Data and Training of Action Models | Jianguo Zhang et.al. | 2503.22673 | link |
2025-03-28 | On the Mistaken Assumption of Interchangeable Deep Reinforcement Learning Implementations | Rajdeep Singh Hundal et.al. | 2503.22575 | null |
2025-03-28 | SafeCast: Risk-Responsive Motion Forecasting for Autonomous Vehicles | Haicheng Liao et.al. | 2503.22541 | null |
2025-03-28 | Unlocking LLM Repair Capabilities in Low-Resource Programming Languages Through Cross-Language Translation and Multi-Agent Refinement | Wenqiang Luo et.al. | 2503.22512 | null |
2025-03-28 | Scenario Dreamer: Vectorized Latent Diffusion for Generating Driving Simulation Environments | Luke Rowe et.al. | 2503.22496 | null |
2025-03-28 | WorkTeam: Constructing Workflows from Natural Language with Multi-Agents | Hanchao Liu et.al. | 2503.22473 | null |
2025-03-28 | Evaluating LLM-based Agents for Multi-Turn Conversations: A Survey | Shengyue Guan et.al. | 2503.22458 | null |
2025-03-28 | Scaling Laws of Scientific Discovery with AI and Robot Scientists | Pengsong Zhang et.al. | 2503.22444 | null |
2025-03-28 | CoSIL: Software Issue Localization via LLM-Driven Code Repository Graph Searching | Zhonghao Jiang et.al. | 2503.22424 | link |
2025-03-27 | MemInsight: Autonomous Memory Augmentation for LLM Agents | Rana Salama et.al. | 2503.21760 | null |
2025-03-27 | GateLens: A Reasoning-Enhanced LLM Agent for Automotive Software Release Analytics | Arsham Gholamzadeh Khoee et.al. | 2503.21735 | null |
2025-03-27 | Collab: Controlled Decoding using Mixture of Agents for LLM Alignment | Souradip Chakraborty et.al. | 2503.21720 | null |
2025-03-27 | A tale of two goals: leveraging sequentiality in multi-goal scenarios | Olivier Serris et.al. | 2503.21677 | null |
2025-03-27 | Cognitive Science-Inspired Evaluation of Core Capabilities for Object Understanding in AI | Danaja Rutar et.al. | 2503.21668 | null |
2025-03-27 | UI-R1: Enhancing Action Prediction of GUI Agents by Reinforcement Learning | Zhengxi Lu et.al. | 2503.21620 | link |
2025-03-27 | A Measure Based Generalizable Approach to Understandability | Vikas Kushwaha et.al. | 2503.21615 | null |
2025-03-27 | A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond | Xiaoye Qu et.al. | 2503.21614 | link |
2025-03-27 | A Deep Reinforcement Learning-based Approach for Adaptive Handover Protocols | Johannes Voigt et.al. | 2503.21601 | null |
2025-03-27 | debug-gym: A Text-Based Environment for Interactive Debugging | Xingdi Yuan et.al. | 2503.21557 | null |
2025-03-26 | Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields | Shijie Zhou et.al. | 2503.20776 | null |
2025-03-26 | Welfare and Cost Aggregation for Multi-Agent Control: When to Choose Which Social Cost Function, and Why? | Ilia Shilov et.al. | 2503.20772 | null |
2025-03-27 | Beyond Believability: Accurate Human Behavior Simulation with Fine-Tuned LLMs | Yuxuan Lu et.al. | 2503.20749 | null |
2025-03-26 | Prospect for measuring work statistics in quantum coherent systems | Cheolhee Han et.al. | 2503.20729 | null |
2025-03-26 | Convergence Theory of Flexible ALADIN for Distributed Optimization | Xu Du et.al. | 2503.20716 | null |
2025-03-26 | Graph-Enhanced Model-Free Reinforcement Learning Agents for Efficient Power Grid Topological Control | Eloy Anguiano Batanero et.al. | 2503.20688 | null |
2025-03-27 | Flip Learning: Weakly Supervised Erase to Segment Nodules in Breast Ultrasound | Yuhao Huang et.al. | 2503.20685 | null |
2025-03-26 | TAMA: A Human-AI Collaborative Thematic Analysis Framework Using Multi-Agent LLMs for Clinical Interviews | Huimin Xu et.al. | 2503.20666 | null |
2025-03-26 | Agent-Based Analysis of the Impact of Near Real-Time Data and Smart Balancing on the Frequency Stability of Power Systems | Johannes Lips et.al. | 2503.20665 | null |
2025-03-26 | State-Aware Perturbation Optimization for Robust Deep Reinforcement Learning | Zongyuan Zhang et.al. | 2503.20613 | null |
2025-03-25 | Energetic advantages for quantum agents in online execution of complex strategies | Jayne Thompson et.al. | 2503.19896 | null |
2025-03-25 | A Multi-Agent Framework Integrating Large Language Models and Generative AI for Accelerated Metamaterial Design | Jie Tian et.al. | 2503.19889 | null |
2025-03-25 | Collaborative Satisfaction of Long-Term Spatial Constraints in Multi-Agent Systems: A Distributed Optimization Approach (extended version) | Farhad Mehdifar et.al. | 2503.19879 | null |
2025-03-25 | Towards Online Multi-Modal Social Interaction Understanding | Xinpeng Li et.al. | 2503.19851 | link |
2025-03-25 | FALCONEye: Finding Answers and Localizing Content in ONE-hour-long videos with multi-modal LLMs | Carlos Plou et.al. | 2503.19850 | null |
2025-03-25 | Thinking agents for zero-shot generalization to qualitatively novel tasks | Thomas Miconi et.al. | 2503.19815 | null |
2025-03-25 | Simulating Tracking Data to Advance Sports Analytics Research | David Radke et.al. | 2503.19809 | link |
2025-03-25 | Inducing Personality in LLM-Based Honeypot Agents: Measuring the Effect on Human-Like Agenda Generation | Lewis Newsham et.al. | 2503.19752 | null |
2025-03-25 | Writing as a testbed for open ended agents | Sian Gooding et.al. | 2503.19711 | null |
2025-03-25 | Optimal Path Planning and Cost Minimization for a Drone Delivery System Via Model Predictive Control | Muhammad Al-Zafar Khan et.al. | 2503.19699 | null |
2025-03-24 | AdaWorld: Learning Adaptable World Models with Latent Actions | Shenyuan Gao et.al. | 2503.18938 | link |
2025-03-24 | AgentDropout: Dynamic Agent Elimination for Token-Efficient and High-Performance LLM-Based Multi-Agent Collaboration | Zhexuan Wang et.al. | 2503.18891 | link |
2025-03-24 | Dynamics of Insect Paraintelligence: How a Mindless Colony of Ants Meaningfully Moves a Beetle | Eldar Knar et.al. | 2503.18858 | null |
2025-03-24 | Self-Organizing Graph Reasoning Evolves into a Critical State for Continuous Discovery Through Structural-Semantic Dynamics | Markus J. Buehler et.al. | 2503.18852 | null |
2025-03-24 | EconEvals: Benchmarks and Litmus Tests for LLM Agents in Unknown Environments | Sara Fish et.al. | 2503.18825 | null |
2025-03-24 | Faster Heat Transfer Clarifies the Unexpected Twist in the Simultaneous Freezing of Hot versus Cold Water | James D. Brownridge et.al. | 2503.18820 | null |
2025-03-24 | Learning Multi-Robot Coordination through Locality-Based Factorized Multi-Agent Actor-Critic Algorithm | Chak Lam Shek et.al. | 2503.18816 | null |
2025-03-24 | Defeating Prompt Injections by Design | Edoardo Debenedetti et.al. | 2503.18813 | null |
2025-03-24 | Simulation-Driven Balancing of Competitive Game Levels with Reinforcement Learning | Florian Rupp et.al. | 2503.18748 | link |
2025-03-24 | Unsupervised Acquisition of Discrete Grammatical Categories | David Ph. Shakouri et.al. | 2503.18702 | null |
2025-03-21 | HCAST: Human-Calibrated Autonomy Software Tasks | David Rein et.al. | 2503.17354 | link |
2025-03-21 | CVE-Bench: A Benchmark for AI Agents’ Ability to Exploit Real-World Web Application Vulnerabilities | Yuxuan Zhu et.al. | 2503.17332 | link |
2025-03-21 | LLM+MAP: Bimanual Robot Task Planning using Large Language Models and Planning Domain Definition Language | Kun Chu et.al. | 2503.17309 | link |
2025-03-21 | Exploring the Temporal Dynamics of Facial Mimicry in Emotion Processing Using Action Units | Meisam Jamshidi Seikavandi et.al. | 2503.17306 | null |
2025-03-21 | Coarsening in the Persistent Voter Model: analytical results | R. G. de Almeida et.al. | 2503.17295 | null |
2025-03-21 | Curriculum RL meets Monte Carlo Planning: Optimization of a Real World Container Management Problem | Abhijeet Pendyala et.al. | 2503.17194 | link |
2025-03-21 | Which2comm: An Efficient Collaborative Perception Framework for 3D Object Detection | Duanrui Yu et.al. | 2503.17175 | null |
2025-03-21 | Leveraging Language Models for Out-of-Distribution Recovery in Reinforcement Learning | Chan Kim et.al. | 2503.17125 | null |
2025-03-21 | Deterministic AI Agent Personality Expression through Standard Psychological Diagnostics | J. M. Diederik Kruijssen et.al. | 2503.17085 | null |
2025-03-21 | Replay4NCL: An Efficient Memory Replay-based Methodology for Neuromorphic Continual Learning in Embedded AI Systems | Mishal Fatima Minhas et.al. | 2503.17061 | null |
2025-03-20 | Survey on Evaluation of LLM-based Agents | Asaf Yehudai et.al. | 2503.16416 | null |
2025-03-20 | Computing Lindahl Equilibrium for Public Goods with and without Funding Caps | Christian Kroer et.al. | 2503.16414 | null |
2025-03-20 | RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints | Yiran Qin et.al. | 2503.16408 | null |
2025-03-20 | Do Visual Imaginations Improve Vision-and-Language Navigation Agents? | Akhil Perincherry et.al. | 2503.16394 | null |
2025-03-20 | JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse | Muyao Li et.al. | 2503.16365 | null |
2025-03-20 | Issue2Test: Generating Reproducing Test Cases from Issue Reports | Noor Nashid et.al. | 2503.16320 | null |
2025-03-20 | Characterizing the Convergence of Game Dynamics via Potentialness | Martin Bichler et.al. | 2503.16285 | link |
2025-03-20 | Binary-Report Peer Prediction for Real-Valued Signal Spaces | Rafael Frongillo et.al. | 2503.16280 | null |
2025-03-20 | AI Agents in Cryptoland: Practical Attacks and No Silver Bullet | Atharv Singh Patlan et.al. | 2503.16248 | null |
2025-03-20 | Dispersion is (Almost) Optimal under (A)synchrony | Ajay D. Kshemkalyani et.al. | 2503.16216 | null |
2025-03-19 | More Information is Not Always Better: Connections between Zero-Sum Local Nash Equilibria in Feedback and Open-Loop Information Patterns | Kushagra Gupta et.al. | 2503.15486 | null |
2025-03-19 | SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks | Yifei Zhou et.al. | 2503.15478 | link |
2025-03-19 | Energy-efficient Merging of Connected and Automated Vehicles using Control Barrier Functions | Shreshta Rajakumar Deshpande et.al. | 2503.15379 | null |
2025-03-19 | Lyapunov-Based Graph Neural Networks for Adaptive Control of Multi-Agent Systems | Brandon C. Fallin et.al. | 2503.15360 | null |
2025-03-19 | MAMM-Refine: A Recipe for Improving Faithfulness in Generation with Multi-Agent Collaboration | David Wan et.al. | 2503.15272 | null |
2025-03-19 | Exploring Large Language Models for Word Games:Who is the Spy? | Chentian Wei et.al. | 2503.15235 | link |
2025-03-19 | A Personalized Data-Driven Generative Model of Human Motion | Angelo Di Porzio et.al. | 2503.15225 | null |
2025-03-19 | When Pigs Get Sick: Multi-Agent AI for Swine Disease Detection | Tittaya Mairittha et.al. | 2503.15204 | null |
2025-03-19 | Learning Topology Actions for Power Grid Control: A Graph-Based Soft-Label Imitation Learning Approach | Mohamed Hassouna et.al. | 2503.15190 | null |
2025-03-19 | Role-Selection Game in Block Production under Proposer-Builder Separation | Yanzhen Li et.al. | 2503.15184 | null |
2025-03-18 | Gricean Norms as a Basis for Effective Collaboration | Fardin Saad et.al. | 2503.14484 | link |
2025-03-18 | Don’t lie to your friends: Learning what you know from collaborative self-play | Jacob Eisenstein et.al. | 2503.14481 | null |
2025-03-18 | EnvBench: A Benchmark for Automated Environment Setup | Aleksandra Eliseeva et.al. | 2503.14443 | link |
2025-03-18 | PLAY2PROMPT: Zero-shot Tool Instruction Optimization for LLM Agents via Tool Play | Wei Fang et.al. | 2503.14432 | null |
2025-03-18 | Decentralized RISE-based Control for Exponential Heterogeneous Multi-Agent Target Tracking of Second-Order Nonlinear Systems | Cristian F. Nino et.al. | 2503.14418 | null |
2025-03-18 | Large Language Models for Virtual Human Gesture Selection | Parisa Ghanad Torshizi et.al. | 2503.14408 | null |
2025-03-18 | Unified Analysis of Decentralized Gradient Descent: a Contraction Mapping Framework | Erik G. Larsson et.al. | 2503.14353 | null |
2025-03-18 | MANTRA: Enhancing Automated Method-Level Refactoring with Contextual RAG and Multi-Agent LLM Collaboration | Yisen Xu et.al. | 2503.14340 | null |
2025-03-18 | DARS: Dynamic Action Re-Sampling to Enhance Coding Agent Performance by Adaptive Tree Traversal | Vaibhav Aggarwal et.al. | 2503.14269 | link |
2025-03-18 | Conversational Agents as Catalysts for Critical Thinking: Challenging Social Influence in Group Decision-making | Soohwan Lee et.al. | 2503.14263 | null |
2025-03-17 | VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning | Ye Liu et.al. | 2503.13444 | link |
2025-03-17 | A Comprehensive Survey on Multi-Agent Cooperative Decision-Making: Scenarios, Approaches, Challenges and Perspectives | Weiqiang Jin et.al. | 2503.13415 | null |
2025-03-17 | Reward Adaptation Via Q-Manipulation | Kevin Vora et.al. | 2503.13414 | null |
2025-03-17 | Toward Generative 6G Simulation: An Experimental Multi-Agent LLM and ns-3 Integration | Farhad Rezazadeh et.al. | 2503.13402 | null |
2025-03-17 | MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research | James Burgess et.al. | 2503.13399 | link |
2025-03-17 | Mixtures of ensembles: System separation and identification via optimal transport | Filip Elvander et.al. | 2503.13362 | null |
2025-03-17 | Optimal intrinsic formation using exogenous systems | Yueyue Xu et.al. | 2503.13359 | null |
2025-03-17 | Agents Play Thousands of 3D Video Games | Zhongwen Xu et.al. | 2503.13356 | null |
2025-03-17 | Goal2Story: A Multi-Agent Fleet based on Privately Enabled sLLMs for Impacting Mapping on Requirements Elicitation | Xinkai Zou et.al. | 2503.13279 | null |
2025-03-17 | Knowledge-Aware Iterative Retrieval for Multi-Agent Systems | Seyoung Song et.al. | 2503.13275 | null |
2025-03-14 | Scaling the Automated Discovery of Quantum Circuits via Reinforcement Learning with Gadgets | Jan Olle et.al. | 2503.11638 | null |
2025-03-14 | Essentials of the kinetic theory of multi-agent systems | Nadia Loy et.al. | 2503.11554 | null |
2025-03-14 | Multi-robot coordination for connectivity recovery after unpredictable environment changes | Yaroslav Marchukov et.al. | 2503.11520 | null |
2025-03-14 | Prompt Injection Detection and Mitigation via AI Multi-Agent NLP Frameworks | Diego Gosmar et.al. | 2503.11517 | link |
2025-03-14 | Multi-agent coordination for on-demand data gathering with periodic information upload | Yaroslav Marchukov et.al. | 2503.11504 | null |
2025-03-14 | Unicorn: A Universal and Collaborative Reinforcement Learning Approach Towards Generalizable Network-Wide Traffic Signal Control | Yifeng Zhang et.al. | 2503.11488 | null |
2025-03-14 | Research Vision: Multi-Agent Path Planning for Cops And Robbers Via Reactive Synthesis | William Fishell et.al. | 2503.11475 | null |
2025-03-14 | Dynamic Obstacle Avoidance with Bounded Rationality Adversarial Reinforcement Learning | Jose-Luis Holgado-Alvarez et.al. | 2503.11467 | null |
2025-03-14 | Deep Learning Agents Trained For Avoidance Behave Like Hawks And Doves | Aryaman Reddi et.al. | 2503.11452 | link |
2025-03-14 | Cerebrum (AIOS SDK): A Platform for Agent Development, Deployment, Distribution, and Discovery | Balaji Rama et.al. | 2503.11444 | link |
2025-03-13 | UniGoal: Towards Universal Zero-shot Goal-oriented Navigation | Hang Yin et.al. | 2503.10630 | null |
2025-03-13 | Uncertainty in Action: Confidence Elicitation in Embodied Agents | Tianjiao Yu et.al. | 2503.10628 | null |
2025-03-13 | CoSTA $\ast$ : Cost-Sensitive Toolpath Agent for Multi-turn Image Editing | Advait Gupta et.al. | 2503.10613 | link |
2025-03-13 | GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding | Rui Hu et.al. | 2503.10596 | link |
2025-03-13 | The Lagrangian Method for Solving Constrained Markov Games | Soham Das et.al. | 2503.10561 | null |
2025-03-13 | A large multi-agent system with noise both in position and control | Giuseppe D’Onofrio et.al. | 2503.10543 | null |
2025-03-13 | Fair allocations with subadditive and XOS valuations | Uriel Feige et.al. | 2503.10513 | null |
2025-03-13 | SySLLM: Generating Synthesized Policy Summaries for Reinforcement Learning Agents Using Large Language Models | Sahar Admoni et.al. | 2503.10509 | null |
2025-03-13 | SortingEnv: An Extendable RL-Environment for an Industrial Sorting Process | Tom Maus et.al. | 2503.10466 | null |
2025-03-13 | Compliant Control of Quadruped Robots for Assistive Load Carrying | Nimesh Khandelwal et.al. | 2503.10401 | null |
2025-03-12 | Auspex: Building Threat Modeling Tradecraft into an Artificial Intelligence-based Copilot | Andrew Crossman et.al. | 2503.09586 | null |
2025-03-12 | Plan-and-Act: Improving Planning of Agents for Long-Horizon Tasks | Lutfi Eren Erdogan et.al. | 2503.09572 | null |
2025-03-12 | The turnpike control in stochastic multi-agent dynamics: a discrete-time approach with exponential integrators | Fabio Cassini et.al. | 2503.09549 | null |
2025-03-13 | Large Language Models for Multi-Facility Location Mechanism Design | Nguyen Thach et.al. | 2503.09533 | null |
2025-03-12 | PairVDN - Pair-wise Decomposed Value Functions | Zak Buzzard et.al. | 2503.09521 | link |
2025-03-12 | RESTRAIN: Reinforcement Learning-Based Secure Framework for Trigger-Action IoT Environment | Md Morshed Alam et.al. | 2503.09513 | null |
2025-03-12 | TRACE: Real-Time Multimodal Common Ground Tracking in Situated Collaborative Dialogues | Hannah VanderHoeven et.al. | 2503.09511 | null |
2025-03-12 | ReMA: Learning to Meta-think for LLMs with Multi-Agent Reinforcement Learning | Ziyu Wan et.al. | 2503.09501 | link |
2025-03-12 | SurgicalVLM-Agent: Towards an Interactive AI Co-Pilot for Pituitary Surgery | Jiayuan Huang et.al. | 2503.09474 | null |
2025-03-12 | Hybrid Rendering for Multimodal Autonomous Driving: Merging Neural and Physics-Based Simulation | Máté Tóth et.al. | 2503.09464 | null |
2025-03-11 | CoLMDriver: LLM-based Negotiation Benefits Cooperative Autonomous Driving | Changxing Liu et.al. | 2503.08683 | link |
2025-03-11 | AgentOrca: A Dual-System Framework to Evaluate Language Agents on Operational Routine and Constraint Adherence | Zekun Li et.al. | 2503.08669 | null |
2025-03-11 | EMMOE: A Comprehensive Benchmark for Embodied Mobile Manipulation in Open Environments | Dongping Li et.al. | 2503.08604 | link |
2025-03-11 | GTR: Guided Thought Reinforcement Prevents Thought Collapse in RL-based VLM Agent Training | Tong Wei et.al. | 2503.08525 | null |
2025-03-11 | ReviewAgents: Bridging the Gap Between Human and AI-Generated Paper Reviews | Xian Gao et.al. | 2503.08506 | null |
2025-03-11 | Existence of Optimal Contracts for Principal-Agent Problem with Drift Control and Quadratic Effort Cost | Xinfu Chen et.al. | 2503.08503 | null |
2025-03-11 | Hierarchical Multi Agent DRL for Soft Handovers Between Edge Clouds in Open RAN | F. Giarrè et.al. | 2503.08493 | null |
2025-03-11 | Hybrid Deep Reinforcement Learning for Radio Tracer Localisation in Robotic-assisted Radioguided Surgery | Hanyi Zhang et.al. | 2503.08492 | null |
2025-03-11 | Collaborative Dynamic 3D Scene Graphs for Open-Vocabulary Urban Scene Understanding | Tim Steinke et.al. | 2503.08474 | null |
2025-03-12 | An Autonomous RL Agent Methodology for Dynamic Web UI Testing in a BDD Framework | Ali Hassaan Mughal et.al. | 2503.08464 | null |
2025-03-10 | MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning | Xiangru Tang et.al. | 2503.07459 | link |
2025-03-10 | LLMs syntactically adapt their language use to their conversational partner | Florian Kandra et.al. | 2503.07457 | null |
2025-03-10 | Towards Safe Robot Foundation Models | Maximilian Tölle et.al. | 2503.07404 | null |
2025-03-10 | Q-MARL: A quantum-inspired algorithm using neural message passing for large-scale multi-agent reinforcement learning | Kha Vo et.al. | 2503.07397 | null |
2025-03-10 | AttentionSwarm: Reinforcement Learning with Attention Control Barier Function for Crazyflie Drones in Dynamic Environments | Grik Tadevosyan et.al. | 2503.07376 | null |
2025-03-10 | Artificial Utopia: Simulation and Intelligent Agents for a Democratised Future | Yannick Oswald et.al. | 2503.07364 | null |
2025-03-10 | Temporal Triplane Transformers as Occupancy World Models | Haoran Xu et.al. | 2503.07338 | null |
2025-03-10 | Dynamic Path Navigation for Motion Agents with LLM Reasoning | Yubo Zhao et.al. | 2503.07323 | null |
2025-03-10 | Experimental Exploration: Investigating Cooperative Interaction Behavior Between Humans and Large Language Model Agents | Guanxuan Jiang et.al. | 2503.07320 | null |
2025-03-10 | Automated Movie Generation via Multi-Agent CoT Planning | Weijia Wu et.al. | 2503.07314 | link |
2025-03-07 | On Almost Fair and Equitable Allocations of Indivisible Items for Non-monotone Valuations | Vittorio Bilò et.al. | 2503.05695 | null |
2025-03-07 | A Survey of Large Language Model Empowered Agents for Recommendation and Search: Towards Next-Generation Information Retrieval | Yu Zhang et.al. | 2503.05659 | link |
2025-03-07 | Symbolic Mixture-of-Experts: Adaptive Skill-based Routing for Heterogeneous Reasoning | Justin Chih-Yao Chen et.al. | 2503.05641 | null |
2025-03-07 | InDRiVE: Intrinsic Disagreement based Reinforcement for Vehicle Exploration through Curiosity Driven Generalized World Model | Feeza Khan Khanzada et.al. | 2503.05573 | null |
2025-03-07 | Tractable Representations for Convergent Approximation of Distributional HJB Equations | Julie Alhosh et.al. | 2503.05563 | null |
2025-03-07 | ALMAGAL I. The ALMA evolutionary study of high-mass protocluster formation in the Galaxy. Presentation of the survey and early results | S. Molinari et.al. | 2503.05555 | null |
2025-03-07 | Impoola: The Power of Average Pooling for Image-Based Deep Reinforcement Learning | Raphael Trumpp et.al. | 2503.05546 | null |
2025-03-07 | The Society of HiveMind: Multi-Agent Optimization of Foundation Model Swarms to Unlock the Potential of Collective Intelligence | Noah Mamie et.al. | 2503.05473 | null |
2025-03-07 | Game Theory in Formula 1: Multi-agent Physical and Strategical Interactions | Giona Fienia et.al. | 2503.05421 | null |
2025-03-07 | First-passage-time statistics of active Brownian particles: A perturbative approach | Yanis Baouche et.al. | 2503.05401 | null |
2025-03-06 | The Influence of Prior Discourse on Conversational Agent-Driven Decision-Making | Stephen Pilli et.al. | 2503.04692 | null |
2025-03-06 | Quantifying the Reasoning Abilities of LLMs on Real-world Clinical Cases | Pengcheng Qiu et.al. | 2503.04691 | null |
2025-03-06 | Multi-Agent Inverse Q-Learning from Demonstrations | Nathaniel Haynam et.al. | 2503.04679 | null |
2025-03-06 | Data-Driven Distributed Optimization via Aggregative Tracking and Deep-Learning | Riccardo Brumali et.al. | 2503.04668 | null |
2025-03-06 | Assessing the performance of compartmental and renewal models for learning $R_{t}$ using spatially heterogeneous epidemic simulations on real geographies | Matthew Ghosh et.al. | 2503.04648 | null |
2025-03-06 | SurveyForge: On the Outline Heuristics, Memory-Driven Generation, and Multi-dimensional Evaluation for Automated Survey Writing | Xiangchao Yan et.al. | 2503.04629 | link |
2025-03-06 | The Next Frontier of LLM Applications: Open Ecosystems and Hardware Synergy | Xinyi Hou et.al. | 2503.04596 | null |
2025-03-06 | Advancing Solutions for the Three-Body Problem Through Physics-Informed Neural Networks | Manuel Santos Pereira et.al. | 2503.04585 | null |
2025-03-06 | ToolFuzz – Automated Agent Tool Testing | Ivan Milev et.al. | 2503.04479 | null |
2025-03-06 | From Idea to CAD: A Language Model-Driven Multi-Agent System for Collaborative Design | Felix Ocker et.al. | 2503.04417 | null |
2025-03-05 | The MASK Benchmark: Disentangling Honesty From Accuracy in AI Systems | Richard Ren et.al. | 2503.03750 | null |
2025-03-05 | CHOP: Mobile Operating Assistant with Constrained High-frequency Optimized Subtask Planning | Yuqi Zhou et.al. | 2503.03743 | link |
2025-03-05 | A Practical Memory Injection Attack against LLM Agents | Shen Dong et.al. | 2503.03704 | null |
2025-03-05 | MAS-GPT: Training LLMs to Build LLM-based Multi-Agent Systems | Rui Ye et.al. | 2503.03686 | null |
2025-03-05 | Optimally Installing Strict Equilibria | Jeremy McMahan et.al. | 2503.03676 | null |
2025-03-05 | Attentive Reasoning Queries: A Systematic Method for Optimizing Instruction-Following in Large Language Models | Bar Karov et.al. | 2503.03669 | link |
2025-03-05 | A Generative Approach to High Fidelity 3D Reconstruction from Text Data | Venkat Kumar R et.al. | 2503.03664 | null |
2025-03-05 | Motion Planning and Control with Unknown Nonlinear Dynamics through Predicted Reachability | Zhiquan Zhang et.al. | 2503.03633 | null |
2025-03-05 | TeraSim: Uncovering Unknown Unsafe Events for Autonomous Vehicles through Generative Simulation | Haowei Sun et.al. | 2503.03629 | link |
2025-03-05 | Benchmarking LLMs and LLM-based Agents in Practical Vulnerability Detection for Code Repositories | Alperen Yildiz et.al. | 2503.03586 | null |
2025-03-04 | MuBlE: MuJoCo and Blender simulation Environment and Benchmark for Task Planning in Robot Manipulation | Michal Nazarczuk et.al. | 2503.02834 | link |
2025-03-04 | Meta-Learning to Explore via Memory Density Feedback | Kevin L. McKee et.al. | 2503.02831 | null |
2025-03-04 | Do Not Trust Licenses You See – Dataset Compliance Requires Massive-Scale AI-Powered Lifecycle Tracing | Jaekyeom Kim et.al. | 2503.02784 | null |
2025-03-04 | Quantitative Resilience Modeling for Autonomous Cyber Defense | Xavier Cadet et.al. | 2503.02780 | null |
2025-03-04 | From Metaphor to Mechanism: How LLMs Decode Traditional Chinese Medicine Symbolic Language for Modern Clinical Relevance | Jiacheng Tang et.al. | 2503.02760 | null |
2025-03-04 | Consumption-portfolio choice with preferences for liquid assets | Guohui Guan et.al. | 2503.02697 | null |
2025-03-04 | Federated Learning for Privacy-Preserving Feedforward Control in Multi-Agent Systems | Jakob Weber et.al. | 2503.02693 | link |
2025-03-04 | FinArena: A Human-Agent Collaboration Framework for Financial Market Analysis and Forecasting | Congluo Xu et.al. | 2503.02692 | null |
2025-03-04 | MPO: Boosting LLM Agents with Meta Plan Optimization | Weimin Xiong et.al. | 2503.02682 | link |
2025-03-04 | Unique existence of solution and Hyers-Ulam stability for a new fractional differential quasi-variational inequality with Mittag-Leffler kernel and its applications | Zeng-bao Wu et.al. | 2503.02669 | null |
2025-02-28 | Hybrid Team Tetris: A New Platform For Hybrid Multi-Agent, Multi-Human Teaming | Kaleb Mcdowell et.al. | 2502.21300 | null |
2025-02-28 | Persuasion Should be Double-Blind: A Multi-Domain Dialogue Dataset With Faithfulness Based on Causal Theory of Mind | Dingyi Zhang et.al. | 2502.21297 | null |
2025-02-28 | ReaLJam: Real-Time Human-AI Music Jamming with Reinforcement Learning-Tuned Transformers | Alexander Scarlatos et.al. | 2502.21267 | null |
2025-02-28 | Towards Developing Ethical Reasoners: Integrating Probabilistic Reasoning and Decision-Making for Complex AI Systems | Nijesh Upreti et.al. | 2502.21250 | null |
2025-02-28 | A Method of Selective Attention for Reservoir Based Agents | Kevin McKee et.al. | 2502.21229 | null |
2025-02-28 | ARIES: Autonomous Reasoning with LLMs on Interactive Thought Graph Environments | Pedro Gimenes et.al. | 2502.21208 | null |
2025-03-03 | Scalable Decision-Making in Stochastic Environments through Learned Temporal Abstraction | Baiting Luo et.al. | 2502.21186 | link |
2025-02-28 | Reducing Reward Dependence in RL Through Adaptive Confidence Discounting | Muhammed Yusuf Satici et.al. | 2502.21181 | null |
2025-02-28 | Autonomous Curriculum Design via Relative Entropy Based Task Modifications | Muhammed Yusuf Satici et.al. | 2502.21166 | null |
2025-02-28 | Cryptis: Cryptographic Reasoning in Separation Logic | Arthur Azevedo de Amorim et.al. | 2502.21156 | null |
2025-02-27 | Point Policy: Unifying Observations and Actions with Key Points for Robot Manipulation | Siddhant Haldar et.al. | 2502.20391 | link |
2025-02-27 | Why Are Web AI Agents More Vulnerable Than Standalone LLMs? A Security Analysis | Jeffrey Yang Fan Chiang et.al. | 2502.20383 | null |
2025-02-27 | Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers | Shalev Lifshitz et.al. | 2502.20379 | null |
2025-02-27 | Multi-Agent Path Planning in Complex Environments using Gaussian Belief Propagation with Global Path Finding | Jens Høigaard Jensen et.al. | 2502.20369 | link |
2025-02-27 | Bridging Legal Knowledge and AI: Retrieval-Augmented Generation with Vector Stores, Knowledge Graphs, and Hierarchical Non-negative Matrix Factorization | Ryan C. Barron et.al. | 2502.20364 | link |
2025-02-27 | Trajectory-to-Action Pipeline (TAP): Automated Scenario Description Extraction for Autonomous Vehicle Behavior Comparison | Aron Harder et.al. | 2502.20353 | null |
2025-02-27 | Improving the Efficiency of a Deep Reinforcement Learning-Based Power Management System for HPC Clusters Using Curriculum Learning | Thomas Budiarjo et.al. | 2502.20348 | null |
2025-02-27 | Safety Representations for Safer Policy Learning | Kaustubh Mani et.al. | 2502.20341 | null |
2025-02-27 | Deep Reinforcement Learning based Autonomous Decision-Making for Cooperative UAVs: A Search and Rescue Real World Application | Thomas Hickling et.al. | 2502.20326 | null |
2025-02-27 | M^3Builder: A Multi-Agent System for Automated Machine Learning in Medical Imaging | Jinghao Feng et.al. | 2502.20301 | null |
2025-02-26 | Can Language Models Falsify? Evaluating Algorithmic Reasoning with Counterexample Creation | Shiven Sinha et.al. | 2502.19414 | link |
2025-02-26 | TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding | Max Ku et.al. | 2502.19400 | null |
2025-02-26 | Hybrid Robot Learning for Automatic Robot Motion Planning in Manufacturing | Siddharth Singh et.al. | 2502.19340 | null |
2025-02-26 | Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems | Hao Peng et.al. | 2502.19328 | link |
2025-02-26 | CoopDETR: A Unified Cooperative Perception Framework for 3D Detection via Object Query | Zhe Wang et.al. | 2502.19313 | null |
2025-02-26 | WOFOSTGym: A Crop Simulator for Learning Annual and Perennial Crop Management Strategies | William Solow et.al. | 2502.19308 | link |
2025-02-26 | Agent-centric Information Access | Evangelos Kanoulas et.al. | 2502.19298 | null |
2025-02-26 | CritiQ: Mining Data Quality Criteria from Human Preferences | Honglin Guo et.al. | 2502.19279 | null |
2025-02-26 | EMT: A Visual Multi-Task Benchmark Dataset for Autonomous Driving in the Arab Gulf Region | Nadya Abdel Madjid et.al. | 2502.19260 | link |
2025-02-26 | ProxyTransformation: Preshaping Point Cloud Manifold With Proxy Attention For 3D Visual Grounding | Qihang Peng et.al. | 2502.19247 | null |
2025-02-25 | FRIDA to the Rescue! Analyzing Synthetic Data Effectiveness in Object-Based Common Sense Reasoning for Disaster Response | Mollie Shichman et.al. | 2502.18452 | null |
2025-02-25 | MAPoRL: Multi-Agent Post-Co-Training for Collaborative Large Language Models with Reinforcement Learning | Chanwoo Park et.al. | 2502.18439 | null |
2025-02-25 | ToMCAT: Theory-of-Mind for Cooperative Agents in Teams via Multiagent Diffusion Policies | Pedro Sequeira et.al. | 2502.18438 | null |
2025-02-25 | CRESSim-MPM: A Material Point Method Library for Surgical Soft Body Simulation with Cutting and Suturing | Yafei Ou et.al. | 2502.18437 | null |
2025-02-25 | AgentRM: Enhancing Agent Generalization with Reward Modeling | Yu Xia et.al. | 2502.18407 | null |
2025-02-25 | Responsible AI Agents | Deven R. Desai et.al. | 2502.18359 | null |
2025-02-25 | WebGames: Challenging General-Purpose Web-Browsing AI Agents | George Thomas et.al. | 2502.18356 | link |
2025-02-25 | RefuteBench 2.0 – Agentic Benchmark for Dynamic Evaluation of LLM Responses to Refutation Instruction | Jianhao Yan et.al. | 2502.18308 | null |
2025-02-25 | Smart and Efficient IoT-Based Irrigation System Design: Utilizing a Hybrid Agent-Based and System Dynamics Approach | Taha Ahmadi Pargo et.al. | 2502.18298 | null |
2025-02-25 | A Competitive Posted-Price Mechanism for Online Budget-Feasible Auctions | Andreas Charalampopoulos et.al. | 2502.18265 | null |
2025-02-24 | Event-Based Limit Order Book Simulation under a Neural Hawkes Process: Application in Market-Making | Luca Lalor et.al. | 2502.17417 | null |
2025-02-24 | Distributed Coordination for Heterogeneous Non-Terrestrial Networks | Jikang Deng et.al. | 2502.17366 | null |
2025-02-24 | Turning Conversations into Workflows: A Framework to Extract and Evaluate Dialog Workflows for Service AI Agents | Prafulla Kumar Choubey et.al. | 2502.17321 | null |
2025-02-24 | Survey on Strategic Mining in Blockchain: A Reinforcement Learning Approach | Jichen Li et.al. | 2502.17307 | null |
2025-02-24 | IGDA: Interactive Graph Discovery through Large Language Model Agents | Alex Havrilla et.al. | 2502.17189 | null |
2025-02-24 | Teleology-Driven Affective Computing: A Causal Framework for Sustained Well-Being | Bin Yin et.al. | 2502.17172 | null |
2025-02-24 | A Novel Multiple Access Scheme for Heterogeneous Wireless Communications using Symmetry-aware Continual Deep Reinforcement Learning | Hamidreza Mazandarani et.al. | 2502.17167 | null |
2025-02-24 | Semantic-Aware Dynamic and Distributed Power Allocation: a Multi-UAV Area Coverage Use Case | Hamidreza Mazandarani et.al. | 2502.17120 | null |
2025-02-24 | Mobile-Agent-V: Learning Mobile Device Operation Through Video-Guided Multi-Agent Collaboration | Junyang Wang et.al. | 2502.17110 | null |
2025-02-24 | Generative Models in Decision Making: A Survey | Yinchuan Li et.al. | 2502.17100 | null |
2025-02-21 | AutoToM: Automated Bayesian Inverse Planning and Model Discovery for Open-ended Theory of Mind | Zhining Zhang et.al. | 2502.15676 | link |
2025-02-21 | Multi-Agent Architecture in Distributed Environment Control Systems: vision, challenges, and opportunities | Natasha Astudillo et.al. | 2502.15663 | null |
2025-02-21 | Automating Curriculum Learning for Reinforcement Learning using a Skill-Based Bayesian Network | Vincent Hsiao et.al. | 2502.15662 | null |
2025-02-21 | Superintelligent Agents Pose Catastrophic Risks: Can Scientist AI Offer a Safer Path? | Yoshua Bengio et.al. | 2502.15657 | null |
2025-02-21 | A Simulation Pipeline to Facilitate Real-World Robotic Reinforcement Learning Applications | Jefferson Silveira et.al. | 2502.15649 | null |
2025-02-21 | WorldCraft: Photo-Realistic 3D World Creation and Customization via LLM Agents | Xinhang Liu et.al. | 2502.15601 | null |
2025-02-21 | SOTOPIA-Ω: Dynamic Strategy Injection Learning and Social Instrucion Following Evaluation for Social Agents | Wenyuan Zhang et.al. | 2502.15538 | link |
2025-02-21 | Contract DesignUnderApproximate Best Responses | Francesco Bacchiocchi et.al. | 2502.15523 | null |
2025-02-21 | SALSA-RL: Stability Analysis in the Latent Space of Actions for Reinforcement Learning | Xuyang Li et.al. | 2502.15512 | null |
2025-02-21 | Construction and Evaluation of LLM-based agents for Semi-Autonomous penetration testing | Masaya Kobayashi et.al. | 2502.15506 | null |
2025-02-20 | GATE: Graph-based Adaptive Tool Evolution Across Diverse Tasks | Jianwen Luo et.al. | 2502.14848 | link |
2025-02-20 | Red-Teaming LLM Multi-Agent Systems via Communication Attacks | Pengfei He et.al. | 2502.14847 | null |
2025-02-20 | Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation | Yue Yang et.al. | 2502.14846 | null |
2025-02-20 | Learning from Reward-Free Offline Data: A Case for Planning with Latent Dynamics Models | Vlad Sobal et.al. | 2502.14819 | null |
2025-02-20 | Optimizing Model Selection for Compound AI Systems | Lingjiao Chen et.al. | 2502.14815 | link |
2025-02-20 | Byzantine Game Theory: Sun Tzus Boxes | Andrei Constantinescu et.al. | 2502.14812 | null |
2025-02-20 | Planning, scheduling, and execution on the Moon: the CADRE technology demonstration mission | Gregg Rabideau et.al. | 2502.14803 | null |
2025-02-20 | A Multi-Agent Perspective on Modern Information Retrieval | Haya Nachimovsky et.al. | 2502.14796 | null |
2025-02-20 | Making Universal Policies Universal | Niklas Höpner et.al. | 2502.14777 | link |
2025-02-20 | Tree-of-Debate: Multi-Persona Debate Trees Elicit Critical Thinking for Scientific Comparative Analysis | Priyanka Kargupta et.al. | 2502.14767 | link |
2025-02-19 | Autellix: An Efficient Serving Engine for LLM Agents as General Programs | Michael Luo et.al. | 2502.13965 | null |
2025-02-19 | LIDDIA: Language-based Intelligent Drug Discovery Agent | Reza Averly et.al. | 2502.13959 | null |
2025-02-19 | RAG-Gym: Optimizing Reasoning and Search Agents with Process Supervision | Guangzhi Xiong et.al. | 2502.13957 | null |
2025-02-19 | Qwen2.5-VL Technical Report | Shuai Bai et.al. | 2502.13923 | null |
2025-02-19 | Exploring Personalized Health Support through Data-Driven, Theory-Guided LLMs: A Case Study in Sleep Health | Xingbo Wang et.al. | 2502.13920 | link |
2025-02-19 | DataSciBench: An LLM Agent Benchmark for Data Science | Dan Zhang et.al. | 2502.13897 | link |
2025-02-19 | NavigateDiff: Visual Predictors are Zero-Shot Navigation Assistants | Yiran Qin et.al. | 2502.13894 | null |
2025-02-19 | Enhancing Cross-Domain Recommendations with Memory-Optimized LLM-Based User Agents | Jiahao Liu et.al. | 2502.13843 | link |
2025-02-19 | ArtMentor: AI-Assisted Evaluation of Artworks to Explore Multimodal Large Language Models Capabilities | Chanjin Zheng et.al. | 2502.13832 | link |
2025-02-19 | Learning to explore when mistakes are not allowed | Charly Pecqueux-Guézénec et.al. | 2502.13801 | null |
2025-02-18 | AIDE: AI-Driven Exploration in the Space of Code | Zhengyao Jiang et.al. | 2502.13138 | link |
2025-02-18 | Sleepless Nights, Sugary Days: Creating Synthetic Users with Health Conditions for Realistic Coaching Agent Interactions | Taedong Yun et.al. | 2502.13135 | null |
2025-02-18 | Magma: A Foundation Model for Multimodal AI Agents | Jianwei Yang et.al. | 2502.13130 | link |
2025-02-18 | Facilitating Long Context Understanding via Supervised Chain-of-Thought Reasoning | Jingyang Lin et.al. | 2502.13127 | null |
2025-02-18 | Approximately Efficient Bilateral Trade with Samples | Yuan Deng et.al. | 2502.13122 | null |
2025-02-18 | Text2World: Benchmarking Large Language Models for Symbolic World Model Generation | Mengkang Hu et.al. | 2502.13092 | null |
2025-02-18 | Interactive Agents to Overcome Ambiguity in Software Engineering | Sanidhya Vijayvargiya et.al. | 2502.13069 | link |
2025-02-18 | Improved Fine-Tuning of Large Multimodal Models for Hateful Meme Detection | Jingbiao Mei et.al. | 2502.13061 | link |
2025-02-18 | AEIA-MN: Evaluating the Robustness of Multimodal LLM-Powered Mobile Agents Against Active Environmental Injection Attacks | Yurun Chen et.al. | 2502.13053 | null |
2025-02-18 | Agentic Deep Graph Reasoning Yields Self-Organizing Knowledge Networks | Markus J. Buehler et.al. | 2502.13025 | link |
2025-02-17 | HARBOR: Exploring Persona Dynamics in Multi-Agent Competition | Kenan Jiang et.al. | 2502.12149 | null |
2025-02-17 | Scaling Autonomous Agents via Automatic Reward Modeling And Planning | Zhenfang Chen et.al. | 2502.12130 | null |
2025-02-17 | A-MEM: Agentic Memory for LLM Agents | Wujiang Xu et.al. | 2502.12110 | link |
2025-02-17 | Relational Norms for Human-AI Cooperation | Brian D. Earp et.al. | 2502.12102 | null |
2025-02-17 | A Study on Leveraging Search and Self-Feedback for Agent Reasoning | Karthikeyan K et.al. | 2502.12094 | null |
2025-02-17 | Can LLMs Simulate Social Media Engagement? A Study on Action-Guided Response Generation | Zhongyi Qiu et.al. | 2502.12073 | null |
2025-02-17 | A survey about perceptions of mobility to inform an agent-based simulator of subjective modal choice | Carole Adam et.al. | 2502.12058 | null |
2025-02-17 | Multi-agent coordination via communication partitions | Wei-Chen Lee et.al. | 2502.12042 | null |
2025-02-17 | Machine Learning Should Maximize Welfare, Not (Only) Accuracy | Nir Rosenfeld et.al. | 2502.11981 | null |
2025-02-17 | FitLight: Federated Imitation Learning for Plug-and-Play Autonomous Traffic Signal Control | Yutong Ye et.al. | 2502.11937 | null |
2025-02-14 | Representation and Interpretation in Artificial and Natural Computing | Luis A. Pineda et.al. | 2502.10383 | null |
2025-02-14 | Agentic Verification for Ambiguous Query Disambiguation | Youngwon Lee et.al. | 2502.10352 | null |
2025-02-14 | Process Reward Models for LLM Agents: Practical Framework and Directions | Sanjiban Choudhury et.al. | 2502.10325 | link |
2025-02-14 | Reinforcement Learning in Strategy-Based and Atari Games: A Review of Google DeepMinds Innovations | Abdelrhman Shaheen et.al. | 2502.10303 | null |
2025-02-14 | Large Language Models and Synthetic Data for Monitoring Dataset Mentions in Research Papers | Aivin V. Solatorio et.al. | 2502.10263 | link |
2025-02-14 | Learning to Solve the Min-Max Mixed-Shelves Picker-Routing Problem via Hierarchical and Parallel Decoding | Laurin Luttmann et.al. | 2502.10233 | link |
2025-02-14 | A Multiagent Path Search Algorithm for Large-Scale Coalition Structure Generation | Redha Taguelmimt et.al. | 2502.10226 | null |
2025-02-14 | Do Large Language Models Reason Causally Like Us? Even Better? | Hanna M. Dettki et.al. | 2502.10215 | null |
2025-02-14 | Dynamic Reinforcement Learning for Actors | Katsunari Shibata et.al. | 2502.10200 | null |
2025-02-14 | Reinforcement Learning based Constrained Optimal Control: an Interpretable Reward Design | Jingjie Ni et.al. | 2502.10187 | null |
2025-02-13 | Do LLMs Recognize Your Preferences? Evaluating Personalized Preference Following in LLMs | Siyan Zhao et.al. | 2502.09597 | link |
2025-02-13 | KIMAs: A Configurable Knowledge Integrated Multi-Agent System | Zitao Li et.al. | 2502.09596 | null |
2025-02-13 | Rolling Ahead Diffusion for Traffic Scene Simulation | Yunpeng Liu et.al. | 2502.09587 | null |
2025-02-13 | Learning to Coordinate with Experts | Mohamad H. Danesh et.al. | 2502.09583 | link |
2025-02-13 | Polymind: Parallel Visual Diagramming with Large Language Models to Support Prewriting Through Microtasks | Qian Wan et.al. | 2502.09577 | null |
2025-02-13 | MDCrow: Automating Molecular Dynamics Workflows with Large Language Models | Quintina Campbell et.al. | 2502.09565 | link |
2025-02-13 | EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents | Rui Yang et.al. | 2502.09560 | null |
2025-02-13 | Mind the Gap! Choice Independence in Using Multilingual LLMs for Persuasive Co-Writing Tasks in Different Languages | Shreyan Biswas et.al. | 2502.09532 | null |
2025-02-13 | Exact Leader Estimation: A New Approach for Distributed Differentiation | Rodrigo Aldana-Lopez et.al. | 2502.09529 | null |
2025-02-13 | Forward-backward Contention Resolution Schemes for Fair Rationing | Will Ma et.al. | 2502.09521 | null |
2025-02-12 | Poly-Autoregressive Prediction for Modeling Interactions | Neerja Thakkar et.al. | 2502.08646 | null |
2025-02-12 | Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs | Mantas Mazeika et.al. | 2502.08640 | null |
2025-02-12 | SPeCtrum: A Grounded Framework for Multidimensional Identity Representation in LLM-Based Agent | Keyeun Lee et.al. | 2502.08599 | link |
2025-02-12 | Learning in Markets with Heterogeneous Agents: Dynamics and Survival of Bayesian vs. No-Regret Learners | David Easley et.al. | 2502.08597 | null |
2025-02-12 | Commercial LLM Agents Are Already Vulnerable to Simple Yet Dangerous Attacks | Ang Li et.al. | 2502.08586 | null |
2025-02-12 | Statistically validated projection of bipartite signed networks | Anna Gallo et.al. | 2502.08567 | null |
2025-02-12 | Human-Centric Foundation Models: Perception, Generation and Agentic Modeling | Shixiang Tang et.al. | 2502.08556 | link |
2025-02-12 | Extreme vulnerability to intruder attacks destabilizes network dynamics | Amirhossein Nazerian et.al. | 2502.08552 | null |
2025-02-12 | Faithful, Unfaithful or Ambiguous? Multi-Agent Debate with Initial Stance for Summary Evaluation | Mahnaz Koupaee et.al. | 2502.08514 | link |
2025-02-12 | Resilient Quantized Consensus in Multi-Hop Relay Networks | Liwei Yuan et.al. | 2502.08455 | null |
2025-02-11 | MAGELLAN: Metacognitive predictions of learning progress guide autotelic LLM agents in large goal spaces | Loris Gaven et.al. | 2502.07709 | link |
2025-02-11 | Human Decision-making is Susceptible to AI-driven Manipulation | Sahand Sabour et.al. | 2502.07663 | link |
2025-02-11 | Robust-Sorting and Applications to Ulam-Median | Ragesh Jaiswal et.al. | 2502.07653 | null |
2025-02-11 | Distributed Value Decomposition Networks with Networked Agents | Guilherme S. Varela et.al. | 2502.07635 | null |
2025-02-11 | Decision-Making Under Complete Uncertainty: You Will Regret Not Being Greedy | Kristijan Atanasov et.al. | 2502.07593 | null |
2025-02-11 | DMWM: Dual-Mind World Model with Long-Term Imagination | Lingyi Wang et.al. | 2502.07591 | null |
2025-02-11 | Pure $ε$ -equilibrium in random games | Bary S. R. Pradelski et.al. | 2502.07585 | null |
2025-02-11 | Genetic evolution of a multi-generational population in the context of interstellar space travels – Part II: Phenotypic effects of gene expression | Frédéric Marin et.al. | 2502.07559 | null |
2025-02-11 | Unsupervised Translation of Emergent Communication | Ido Levy et.al. | 2502.07552 | null |
2025-02-11 | A Near-optimal, Scalable and Corruption-tolerant Framework for Stochastic Bandits: From Single-Agent to Multi-Agent and Beyond | Zicheng Hu et.al. | 2502.07514 | null |
2025-02-10 | Visual Agentic AI for Spatial Reasoning with a Dynamic API | Damiano Marsili et.al. | 2502.06787 | null |
2025-02-10 | Towards Internet-Scale Training For Agents | Brandon Trabucco et.al. | 2502.06776 | null |
2025-02-10 | Distributed Constraint-Coupled Optimization: Harnessing ADMM-consensus for robustness | Mohamed Abdelmouamin Messilem et.al. | 2502.06763 | null |
2025-02-10 | Incentivizing Desirable Effort Profiles in Strategic Classification: The Role of Causality and Uncertainty | Valia Efthymiou et.al. | 2502.06749 | null |
2025-02-10 | Institutional Preferences in the Laboratory | Qiankun Zhong et.al. | 2502.06748 | null |
2025-02-10 | Wandering around: A bioinspired approach to visual attention through object motion sensitivity | Giulia D Angelo et.al. | 2502.06747 | link |
2025-02-10 | AgilePilot: DRL-Based Drone Agent for Real-Time Motion Planning in Dynamic Environments by Leveraging Object Detection | Roohan Ahmed Khan et.al. | 2502.06725 | null |
2025-02-10 | Transfer Your Perspective: Controllable 3D Generation from Any Viewpoint in a Driving Scene | Tai-Yu Pan et.al. | 2502.06682 | null |
2025-02-10 | Quantile Multi-Armed Bandits with 1-bit Feedback | Ivan Lau et.al. | 2502.06678 | null |
2025-02-10 | Unbiased Evaluation of Large Language Models from a Causal Perspective | Meilin Chen et.al. | 2502.06655 | null |
2025-02-07 | Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuray | Yunhang Shen et.al. | 2502.05177 | link |
2025-02-07 | MELON: Indirect Prompt Injection Defense via Masked Re-execution and Tool Comparison | Kaijie Zhu et.al. | 2502.05174 | link |
2025-02-07 | From Restless to Contextual: A Thresholding Bandit Approach to Improve Finite-horizon Performance | Jiamin Xu et.al. | 2502.05145 | link |
2025-02-07 | Maximin Share Guarantees for Few Agents with Subadditive Valuations | George Christodoulou et.al. | 2502.05141 | null |
2025-02-07 | Joint TITE-CRM for Dual Agent Dose Finding Studies | Helen Barnett et.al. | 2502.05072 | null |
2025-02-07 | Exploring the Generalizability of Geomagnetic Navigation: A Deep Reinforcement Learning approach with Policy Distillation | Wenqi Bai et.al. | 2502.05069 | null |
2025-02-07 | nvAgent: Automated Data Visualization from Natural Language via Collaborative Agent Workflow | Geliang Ouyang et.al. | 2502.05036 | link |
2025-02-07 | Near-Optimal Online Learning for Multi-Agent Submodular Coordination: Tight Approximation and Communication Efficiency | Qixin Zhang et.al. | 2502.05028 | null |
2025-02-07 | Seasonal Station-Keeping of Short Duration High Altitude Balloons using Deep Reinforcement Learning | Tristan K. Schuler et.al. | 2502.05014 | null |
2025-02-07 | The Rising Threat to Emerging AI-Powered Search Engines | Zeren Luo et.al. | 2502.04951 | null |
2025-02-06 | ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization | Yinjie Wang et.al. | 2502.04306 | link |
2025-02-06 | Mutual Multilinearity of Nonequilibrium Network Currents | Sara Dal Cengio et.al. | 2502.04298 | null |
2025-02-06 | DECAF: Learning to be Fair in Multi-agent Resource Allocation | Ashwin Kumar et.al. | 2502.04281 | null |
2025-02-06 | Free Energy Risk Metrics for Systemically Safe AI: Gatekeeping Multi-Agent Study | Michael Walters et.al. | 2502.04249 | null |
2025-02-06 | Multi-agent Architecture Search via Agentic Supernet | Guibin Zhang et.al. | 2502.04180 | link |
2025-02-06 | Dense Fixed-Wing Swarming using Receding-Horizon NMPC | Varun Madabushi et.al. | 2502.04174 | null |
2025-02-06 | Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement Learning | Wesley A. Suttle et.al. | 2502.04141 | null |
2025-02-06 | Beyond the Final Layer: Hierarchical Query Fusion Transformer with Agent-Interpolation Initialization for 3D Instance Segmentation | Jiahao Lu et.al. | 2502.04139 | null |
2025-02-06 | VTutor: An Open-Source SDK for Generative AI-Powered Animated Pedagogical Agents with Multi-Media Output | Eason Chen et.al. | 2502.04103 | null |
2025-02-06 | Strategic Learning with Local Explanations as Feedback | Kiet Q. H. Vo et.al. | 2502.04058 | null |
2025-02-05 | A Schema-Guided Reason-while-Retrieve framework for Reasoning on Scene Graphs with Large-Language-Models (LLMs) | Yiye Chen et.al. | 2502.03450 | null |
2025-02-05 | Prediction of the Most Fire-Sensitive Point in Building Structures with Differentiable Agents for Thermal Simulators | Yuan Xinjie et.al. | 2502.03424 | null |
2025-02-05 | Energy-Efficient Flying LoRa Gateways: A Multi-Agent Reinforcement Learning Approach | Abdullahi Isa Ahmed et.al. | 2502.03377 | null |
2025-02-05 | Learning from Active Human Involvement through Proxy Value Propagation | Zhenghao Peng et.al. | 2502.03369 | null |
2025-02-05 | PalimpChat: Declarative and Interactive AI analytics | Chunwei Liu et.al. | 2502.03368 | null |
2025-02-05 | Inverse Mixed Strategy Games with Generative Trajectory Models | Max Muchen Sun et.al. | 2502.03356 | null |
2025-02-05 | Implicit Communication in Human-Robot Collaborative Transport | Elvin Yang et.al. | 2502.03346 | link |
2025-02-05 | Actions Speak Louder Than Words: Rate-Reward Trade-off in Markov Decision Processes | Haotian Wu et.al. | 2502.03335 | null |
2025-02-05 | SymAgent: A Neural-Symbolic Self-Learning Agent Framework for Complex Reasoning over Knowledge Graphs | Ben Liu et.al. | 2502.03283 | null |
2025-02-05 | Modeling and Optimization of Insulin Injection for Type-1 Diabetes Mellitus Management | Rinrada Jadsadaphongphaibool et.al. | 2502.03269 | null |
2025-02-04 | QLASS: Boosting Language Agent Inference via Q-Guided Stepwise Search | Zongyu Lin et.al. | 2502.02584 | link |
2025-02-04 | Decision Theoretic Foundations for Conformal Prediction: Optimal Uncertainty Quantification for Risk-Averse Agents | Shayan Kiyani et.al. | 2502.02561 | null |
2025-02-04 | AAD-DCE: An Aggregated Multimodal Attention Mechanism for Early and Late Dynamic Contrast Enhanced Prostate MRI Synthesis | Divya Bharti et.al. | 2502.02555 | link |
2025-02-04 | Uncertainty Quantification for Collaborative Object Detection Under Adversarial Attacks | Huiqun Huang et.al. | 2502.02537 | null |
2025-02-04 | Adaptive Self-improvement LLM Agentic System for ML Library Development | Genghan Zhang et.al. | 2502.02534 | link |
2025-02-04 | Multi-Agent Design: Optimizing Agents with Better Prompts and Topologies | Han Zhou et.al. | 2502.02533 | null |
2025-02-04 | Why human-AI relationships need socioaffective alignment | Hannah Rose Kirk et.al. | 2502.02528 | null |
2025-02-04 | The Cost Perspective of Liquid Democracy: Feasibility and Control | Shiri Alouf-Heffetz et.al. | 2502.02380 | null |
2025-02-04 | Mirai: A Wearable Proactive AI “Inner-Voice” for Contextual Nudging | Cathy Mengying Fang et.al. | 2502.02370 | null |
2025-02-04 | MAGNNET: Multi-Agent Graph Neural Network-based Efficient Task Allocation for Autonomous Vehicles with Deep Reinforcement Learning | Lavanya Ratnabala et.al. | 2502.02311 | null |
2025-01-31 | Vintix: Action Model via In-Context Reinforcement Learning | Andrey Polubarov et.al. | 2501.19400 | link |
2025-01-31 | Do LLMs Strategically Reveal, Conceal, and Infer Information? A Theoretical and Empirical Analysis in The Chameleon Game | Mustafa O. Karabag et.al. | 2501.19398 | link |
2025-01-31 | Learning Contracts in Hierarchical Multi-Agent Systems | Antoine Scheid et.al. | 2501.19388 | null |
2025-01-31 | The Physics and Metaphysics of Social Powers: Bridging Cognitive Processing and Social Dynamics, a New Perspective on Power through Active Inference | Mahault Albarracin et.al. | 2501.19368 | null |
2025-01-31 | PixelWorld: Towards Perceiving Everything as Pixels | Zhiheng Lyu et.al. | 2501.19339 | null |
2025-01-31 | MINDSTORES: Memory-Informed Neural Decision Synthesis for Task-Oriented Reinforcement in Embodied Systems | Anirudh Chari et.al. | 2501.19318 | null |
2025-01-31 | Objective Metrics for Human-Subjects Evaluation in Explainable Reinforcement Learning | Balint Gyevnar et.al. | 2501.19256 | null |
2025-02-03 | SHARPIE: A Modular Framework for Reinforcement Learning and Human-AI Interaction Experiments | Hüseyin Aydın et.al. | 2501.19245 | link |
2025-01-31 | Multi-agent Multi-armed Bandit with Fully Heavy-tailed Dynamics | Xingyu Wang et.al. | 2501.19239 | null |
2025-01-31 | A parallelizable variant of HCA* | Sreenivasan Ganti et.al. | 2501.19218 | null |
2025-01-30 | Can we Retrieve Everything All at Once? ARM: An Alignment-Oriented LLM-based Retrieval Method | Peter Baile Chen et.al. | 2501.18539 | null |
2025-01-30 | Design and Validation of Learning Aware HMI For Learning-Enabled Increasingly Autonomous Systems | Parth Ganeriwala et.al. | 2501.18506 | null |
2025-01-30 | Graph Exploration with Edge Weight Estimates | Matthias Gehnen et.al. | 2501.18496 | null |
2025-01-30 | Conversation Games and a Strategic View of the Turing Test | Kaveh Aryan et.al. | 2501.18455 | null |
2025-01-30 | Stable Marriage: Loyalty vs. Competition | Amit Ronen et.al. | 2501.18442 | null |
2025-01-30 | Gravity-Bench-v1: A Benchmark on Gravitational Physics Discovery for Agents | Nolan Koblischke et.al. | 2501.18411 | null |
2025-01-30 | Leveraging LLM Agents for Automated Optimization Modeling for SASP Problems: A Graph-RAG based Approach | Tianpeng Pan et.al. | 2501.18320 | null |
2025-01-30 | Model-Free RL Agents Demonstrate System 1-Like Intentionality | Hal Ashton et.al. | 2501.18299 | null |
2025-01-30 | CueTip: An Interactive and Explainable Physics-aware Pool Assistant | Sean Memery et.al. | 2501.18291 | null |
2025-01-30 | Economic Rationality under Specialization: Evidence of Decision Bias in AI Agents | ShuiDe Wen et.al. | 2501.18190 | null |
2025-01-29 | From Sparse to Dense: Toddler-inspired Reward Transition in Goal-Oriented Reinforcement Learning | Junseok Park et.al. | 2501.17842 | null |
2025-01-29 | A note on the Cucker-Smale model with time delay and communication failures | Elisa Continelli et.al. | 2501.17743 | null |
2025-01-29 | RICoTA: Red-teaming of In-the-wild Conversation with Test Attempts | Eujeong Choi et.al. | 2501.17715 | link |
2025-01-29 | Inferring Implicit Goals Across Differing Task Models | Silvia Tulli et.al. | 2501.17704 | null |
2025-01-29 | CAMP in the Odyssey: Provably Robust Reinforcement Learning with Certified Radius Maximization | Derui Wang et.al. | 2501.17667 | link |
2025-01-29 | Multi-Agent Path Finding Using Conflict-Based Search and Structural-Semantic Topometric Maps | Scott Fredriksson et.al. | 2501.17661 | null |
2025-01-29 | Coalitional control: a bottom-up approach | Filiberto Fele et.al. | 2501.17614 | null |
2025-01-29 | Coalitional model predictive control of an irrigation canal | Filiberto Fele et.al. | 2501.17561 | null |
2025-01-29 | Is Conversational XAI All You Need? Human-AI Decision Making With a Conversational XAI Assistant | Gaole He et.al. | 2501.17546 | link |
2025-01-29 | Sequential Learning of the Pareto Front for Multi-objective Bandits | Elise Crépon et.al. | 2501.17513 | link |
2025-01-28 | Evidence on the Regularisation Properties of Maximum-Entropy Reinforcement Learning | Rémy Hosseinkhan Boucher et.al. | 2501.17115 | null |
2025-01-28 | CRSet: Non-Interactive Verifiable Credential Revocation with Metadata Privacy for Issuers and Everyone Else | Felix Hoops et.al. | 2501.17089 | null |
2025-01-28 | Learning Mean Field Control on Sparse Graphs | Christian Fabian et.al. | 2501.17079 | null |
2025-01-28 | Induced Modularity and Community Detection for Functionally Interpretable Reinforcement Learning | Anna Soligo et.al. | 2501.17077 | null |
2025-01-28 | Context is Key in Agent Security | Lillian Tsai et.al. | 2501.17070 | null |
2025-01-28 | Revisit Mixture Models for Multi-Agent Simulation: Experimental Study within a Unified Framework | Longzhong Lin et.al. | 2501.17015 | null |
2025-01-28 | Towards Open-Source and Modular Space Systems with ATMOS | Pedro Roque et.al. | 2501.16973 | link |
2025-01-28 | Heterogeneity-aware Personalized Federated Learning via Adaptive Dual-Agent Reinforcement Learning | Xi Chen et.al. | 2501.16966 | null |
2025-01-28 | ToolFactory: Automating Tool Generation by Leveraging LLM to Understand REST API Documentations | Xinyi Ni et.al. | 2501.16945 | null |
2025-01-28 | Beyond Human Intervention: Algorithmic Collusion through Multi-Agent Learning Strategies | Suzie Grondin et.al. | 2501.16935 | null |
2025-01-27 | LUCY: Linguistic Understanding and Control Yielding Early Stage of Her | Heting Gao et.al. | 2501.16327 | link |
2025-01-27 | Privacy-aware Nash Equilibrium Synthesis with Partially Ordered LTL $_f$ Objectives | Caleb Probine et.al. | 2501.16307 | null |
2025-01-27 | Multi-Agent Geospatial Copilots for Remote Sensing Workflows | Chaehong Lee et.al. | 2501.16254 | null |
2025-01-27 | Will Systems of LLM Agents Cooperate: An Investigation into a Social Dilemma | Richard Willis et.al. | 2501.16173 | link |
2025-01-27 | AI Agents for Computer Use: A Review of Instruction-based Computer Control, GUI Automation, and Operator Assistants | Pascal J. Sager et.al. | 2501.16150 | null |
2025-01-27 | Quantifying the Self-Interest Level of Markov Social Dilemmas | Richard Willis et.al. | 2501.16138 | null |
2025-01-27 | Multi-Agent Meta-Offline Reinforcement Learning for Timely UAV Path Planning and Data Collection | Eslam Eldeeb et.al. | 2501.16098 | null |
2025-01-27 | Galaxy Era: Agent-based Simulation of Execution Tickets | Pascal Stichler et.al. | 2501.16090 | link |
2025-01-27 | Value-oriented forecast reconciliation for renewables in electricity markets | Honglin Wen et.al. | 2501.16086 | null |
2025-01-27 | Generating Spatial Synthetic Populations Using Wasserstein Generative Adversarial Network: A Case Study with EU-SILC Data for Helsinki and Thessaloniki | Vanja Falck et.al. | 2501.16080 | null |
2025-01-24 | An Attentive Graph Agent for Topology-Adaptive Cyber Defence | Ilya Orson Sandoval et.al. | 2501.14700 | link |
2025-01-24 | The Division of Surplus and the Burden of Proof | Deniz Kattwinkel et.al. | 2501.14686 | null |
2025-01-24 | MedAgentBench: Dataset for Benchmarking LLMs as Agents in Medical Applications | Yixing Jiang et.al. | 2501.14654 | link |
2025-01-24 | Whisper D-SGD: Correlated Noise Across Agents for Differentially Private Decentralized Learning | Angelo Rodio et.al. | 2501.14644 | link |
2025-01-24 | Fair Division Beyond Monotone Valuations | Siddharth Barman et.al. | 2501.14609 | null |
2025-01-24 | Hybrid Quantum-Classical Multi-Agent Pathfinding | Thore Gerlach et.al. | 2501.14568 | null |
2025-01-24 | Reducing Action Space for Deep Reinforcement Learning via Causal Effect Estimation | Wenzhang Liu et.al. | 2501.14543 | link |
2025-01-24 | Breaking the Pre-Planning Barrier: Real-Time Adaptive Coordination of Mission and Charging UAVs Using Graph Reinforcement Learning | Yuhan Hu et.al. | 2501.14488 | null |
2025-01-24 | Avoiding Overfitting in Variable-Order Markov Models: a Cross-Validation Approach | Valeria Secchini et.al. | 2501.14476 | null |
2025-01-24 | The Pseudo-Dimension of Contracts | Paul Duetting et.al. | 2501.14474 | null |
2025-01-23 | GUI-Bee: Align GUI Action Grounding to Novel Environments via Autonomous Exploration | Yue Fan et.al. | 2501.13896 | null |
2025-01-23 | Utilizing Evolution Strategies to Train Transformers in Reinforcement Learning | Matyáš Lorenc et.al. | 2501.13883 | link |
2025-01-23 | Eye Gaze as a Signal for Conveying User Attention in Contextual AI Systems | Ethan Wilson et.al. | 2501.13878 | null |
2025-01-23 | EICopilot: Search and Explore Enterprise Information over Large-scale Knowledge Graphs with LLM-driven Agents | Yuhui Yun et.al. | 2501.13746 | null |
2025-01-23 | Scalable Safe Multi-Agent Reinforcement Learning for Multi-Agent System | Haikuo Du et.al. | 2501.13727 | link |
2025-01-23 | A Non-Parametric Approach to Heterogeneity Analysis | Avner Seror et.al. | 2501.13721 | null |
2025-01-23 | Revisiting Online Learning Approach to Inverse Linear Optimization: A Fenchel–Young Loss Perspective and Gap-Dependent Regret Analysis | Shinsaku Sakaue et.al. | 2501.13648 | null |
2025-01-23 | WFCRL: A Multi-Agent Reinforcement Learning Benchmark for Wind Farm Control | Claire Bizon Monroc et.al. | 2501.13592 | link |
2025-01-23 | Explainable AI-aided Feature Selection and Model Reduction for DRL-based V2X Resource Allocation | Nasir Khan et.al. | 2501.13552 | null |
2025-01-23 | Towards a Theory of AI Personhood | Francis Rhys Ward et.al. | 2501.13533 | null |
2025-01-22 | Boosting MCTS with Free Energy Minimization | Mawaba Pascal Dao et.al. | 2501.13083 | null |
2025-01-22 | Refining Input Guardrails: Enhancing LLM-as-a-Judge Efficiency Through Chain-of-Thought Fine-Tuning and Alignment | Melissa Kazemi Rad et.al. | 2501.13080 | null |
2025-01-22 | Evolution and The Knightian Blindspot of Machine Learning | Joel Lehman et.al. | 2501.13075 | null |
2025-01-22 | Optimizing Return Distributions with Distributional Dynamic Programming | Bernardo Ávila Pires et.al. | 2501.13028 | null |
2025-01-22 | The regret lower bound for communicating Markov Decision Processes | Victor Boone et.al. | 2501.13013 | null |
2025-01-22 | MONA: Myopic Optimization with Non-myopic Approval Can Mitigate Multi-step Reward Hacking | Sebastian Farquhar et.al. | 2501.13011 | null |
2025-01-22 | Constructive characterisations of the must-preorder for asynchrony | Giovanni Bernardi et.al. | 2501.13002 | link |
2025-01-22 | An Offline Multi-Agent Reinforcement Learning Framework for Radio Resource Management | Eslam Eldeeb et.al. | 2501.12991 | null |
2025-01-22 | Learning-based Distributed Model Predictive Control using Multi-Agent Bayesian Optimization | Hossein Nejatbakhsh Esfahani et.al. | 2501.12989 | null |
2025-01-22 | Quantification of Ultrafast Nonlinear Photothermal and Photoacoustic Effects in Molecular Thin Films via Time-Domain Brillouin Scattering | Valentin Cherruault et.al. | 2501.12912 | null |
2025-01-21 | Expertise elevates AI usage: experimental evidence comparing laypeople and professional artists | Thomas F. Eisenmann et.al. | 2501.12374 | link |
2025-01-21 | UI-TARS: Pioneering Automated GUI Interaction with Native Agents | Yujia Qin et.al. | 2501.12326 | link |
2025-01-21 | Transitions to synchronization in adaptive multilayer networks with higher-order interactions | Richita Ghosh et.al. | 2501.12301 | null |
2025-01-21 | mmCooper: A Multi-agent Multi-stage Communication-efficient and Collaboration-robust Cooperative Perception Framework | Bingyi Liu et.al. | 2501.12263 | null |
2025-01-21 | Multi-Agent Feedback Motion Planning using Probably Approximately Correct Nonlinear Model Predictive Control | Mark Gonzales et.al. | 2501.12234 | null |
2025-01-21 | Empower Healthcare through a Self-Sovereign Identity Infrastructure for Secure Electronic Health Data Access | Antonio López Martínez et.al. | 2501.12229 | null |
2025-01-21 | Convergence of time-delayed opinion dynamics with complex interaction types | Lingling Yao et.al. | 2501.12219 | null |
2025-01-21 | RL-RC-DoT: A Block-level RL agent for Task-Aware Video Compression | Uri Gadot et.al. | 2501.12216 | null |
2025-01-21 | Experience-replay Innovative Dynamics | Tuo Zhang et.al. | 2501.12199 | null |
2025-01-21 | Opinion dynamics in bounded confidence models with manipulative agents: Moving the Overton window | A. Bautista et.al. | 2501.12198 | null |
2025-01-17 | Agent4Edu: Generating Learner Response Data by Generative Agents for Intelligent Education Systems | Weibo Gao et.al. | 2501.10332 | link |
2025-01-17 | Towards Human-Guided, Data-Centric LLM Co-Pilots | Evgeny Saveliev et.al. | 2501.10321 | null |
2025-01-17 | Towards Preventing Overreliance on Task-Oriented Conversational AI Through Accountability Modeling | Suvodip Dey et.al. | 2501.10316 | link |
2025-01-17 | Enhancing AI Transparency: XRL-Based Resource Management and RAN Slicing for 6G ORAN Architecture | Suvidha Mhatre et.al. | 2501.10292 | null |
2025-01-17 | Evidence for the gravity-driven and magnetically-regularized gas flows feeding the massive protostellar cluster in Cep A | Panigrahy Sandhyarani et.al. | 2501.10280 | null |
2025-01-17 | Grey-Box Fuzzing in Constrained Ultra-Large Systems: Lessons for SE Community | Jiazhao Yu et.al. | 2501.10269 | null |
2025-01-17 | Deployment of an Aerial Multi-agent System for Automated Task Execution in Large-scale Underground Mining Environments | Niklas Dahlquist et.al. | 2501.10262 | null |
2025-01-17 | Logarithmic Regret for Nonlinear Control | James Wang et.al. | 2501.10261 | null |
2025-01-17 | Secure Semantic Communication With Homomorphic Encryption | Rui Meng et.al. | 2501.10182 | null |
2025-01-17 | PaSa: An LLM Agent for Comprehensive Academic Paper Search | Yichen He et.al. | 2501.10120 | link |
2025-01-16 | CyberMentor: AI Powered Learning Tool Platform to Address Diverse Student Needs in Cybersecurity Education | Tianyu Wang et.al. | 2501.09709 | link |
2025-01-16 | The Goofus & Gallant Story Corpus for Practical Value Alignment | Md Sultan Al Nahian et.al. | 2501.09707 | null |
2025-01-16 | Authenticated Delegation and Authorized AI Agents | Tobin South et.al. | 2501.09674 | null |
2025-01-16 | NS-Gym: Open-Source Simulation Environments and Benchmarks for Non-Stationary Markov Decision Processes | Nathaniel S. Keplinger et.al. | 2501.09646 | link |
2025-01-16 | Empowering Large Language Models in Wireless Communication: A Novel Dataset and Fine-Tuning Framework | Yushen Lin et.al. | 2501.09631 | null |
2025-01-16 | A Multi-agent System for Hybrid Optimization | Eric S. Fraga et.al. | 2501.09563 | null |
2025-01-16 | Solving the unsolvable: Translating case law in Hong Kong | King-kui Sin et.al. | 2501.09444 | null |
2025-01-16 | ADAGE: A generic two-layer framework for adaptive agent based modelling | Benjamin Patrick Evans et.al. | 2501.09429 | null |
2025-01-16 | AutoCBT: An Autonomous Multi-agent Framework for Cognitive Behavioral Therapy in Psychological Counseling | Ancheng Xu et.al. | 2501.09426 | null |
2025-01-16 | Agent-Based Simulation of a Perpetual Futures Market | Ramshreyas Rao et.al. | 2501.09404 | null |
2025-01-15 | Personality Modeling for Persuasion of Misinformation using AI Agent | Qianmin Lou et.al. | 2501.08985 | null |
2025-01-15 | Physical AI Agents: Integrating Cognitive Intelligence with Real-World Action | Fouad Bousetouane et.al. | 2501.08944 | null |
2025-01-15 | A Reinforcement Learning Approach to Quiet and Safe UAM Traffic Management | Surya Murthy et.al. | 2501.08941 | null |
2025-01-15 | Disentangling Exploration of Large Language Models by Optimal Exploitation | Tim Grams et.al. | 2501.08925 | null |
2025-01-15 | Leveraging Large Language Models as Knowledge-Driven Agents for Reliable Retrosynthesis Planning | Qinyu Ma et.al. | 2501.08897 | link |
2025-01-15 | Silent Abandonment in Text-Based Contact Centers: Identifying, Quantifying, and Mitigating its Operational Impacts | Antonio Castellanos et.al. | 2501.08869 | null |
2025-01-15 | The geometry of moral decision making | Roland M. Friedrich et.al. | 2501.08865 | null |
2025-01-15 | On the Dominance of Truth-Telling in Gradual Mechanisms | Wenqian Wang et.al. | 2501.08802 | null |
2025-01-15 | Networked Agents in the Dark: Team Value Learning under Partial Observability | Guilherme S. Varela et.al. | 2501.08778 | null |
2025-01-15 | Leveraging LLM Agents for Translating Network Configurations | Yunze Wei et.al. | 2501.08760 | null |
2025-01-14 | ADAM-1: AI and Bioinformatics for Alzheimer’s Detection and Microbiome-Clinical Data Integrations | Ziyuan Huang et.al. | 2501.08324 | null |
2025-01-14 | Using Gamified Experiments to Tame Complexity: the case of the Schelling Model of Segregation | Aleix Nicolás Olivé et.al. | 2501.08280 | null |
2025-01-14 | Addressing the sustainable AI trilemma: a case study on LLM agents and RAG | Hui Wu et.al. | 2501.08262 | link |
2025-01-14 | Engineering LLM Powered Multi-agent Framework for Autonomous CloudOps | Kannan Parthasarathy et.al. | 2501.08243 | null |
2025-01-14 | Dynamic Pricing in High-Speed Railways Using Multi-Agent Reinforcement Learning | Enrique Adrian Villarrubia-Martin et.al. | 2501.08234 | null |
2025-01-14 | ASTRID – An Automated and Scalable TRIaD for the Evaluation of RAG-based Clinical Question Answering Systems | Mohita Chowdhury et.al. | 2501.08208 | null |
2025-01-14 | An Elementary Microscopic Model of Sympatric Speciation | Franco Bagnoli et.al. | 2501.08130 | null |
2025-01-14 | Hybrid Action Based Reinforcement Learning for Multi-Objective Compatible Autonomous Driving | Guizhe Jin et.al. | 2501.08096 | null |
2025-01-14 | AgentPose: Progressive Distribution Alignment via Feature Agent for Human Pose Distillation | Feng Zhang et.al. | 2501.08088 | null |
2025-01-14 | CuAsmRL: Optimizing GPU SASS Schedules via Deep Reinforcement Learning | Guoliang He et.al. | 2501.08071 | link |
2025-01-13 | WebWalker: Benchmarking LLMs in Web Traversal | Jialong Wu et.al. | 2501.07572 | link |
2025-01-13 | SafeSwarm: Decentralized Safe RL for the Swarm of Drones Landing in Dense Crowds | Grik Tadevosyan et.al. | 2501.07566 | null |
2025-01-13 | SST-EM: Advanced Metrics for Evaluating Semantic, Spatial and Temporal Aspects in Video Editing | Varun Biyyala et.al. | 2501.07554 | link |
2025-01-13 | Evaluating Agent-based Program Repair at Google | Pat Rondon et.al. | 2501.07531 | null |
2025-01-13 | Improving DeFi Accessibility through Efficient Liquidity Provisioning with Deep Reinforcement Learning | Haonan Xu et.al. | 2501.07508 | null |
2025-01-13 | How low-cost AI universal approximators reshape market efficiency | Paolo Barucca et.al. | 2501.07489 | null |
2025-01-13 | SynthSoM: A synthetic intelligent multi-modal sensing-communication dataset for Synesthesia of Machines (SoM) | Xiang Cheng et.al. | 2501.07459 | link |
2025-01-13 | Understanding and Benchmarking Artificial Intelligence: OpenAI’s o3 Is Not AGI | Rolf Pfister et.al. | 2501.07458 | null |
2025-01-13 | Online inductive learning from answer sets for efficient reinforcement learning exploration | Celeste Veronese et.al. | 2501.07445 | null |
2025-01-13 | Attention when you need | Lokesh Boominathan et.al. | 2501.07440 | null |
2025-01-10 | PEACE: Empowering Geologic Map Holistic Understanding with MLLMs | Yangyu Huang et.al. | 2501.06184 | null |
2025-01-10 | A Mixed-Integer Conic Program for the Multi-Agent Moving-Target Traveling Salesman Problem | Allen George Philip et.al. | 2501.06130 | null |
2025-01-10 | Finite-Horizon Single-Pull Restless Bandits: An Efficient Index Policy For Scarce Resource Allocation | Guojun Xiong et.al. | 2501.06103 | null |
2025-01-10 | Learning Flexible Heterogeneous Coordination with Capability-Aware Shared Hypernetworks | Kevin Fu et.al. | 2501.06058 | link |
2025-01-10 | Investigating the Impact of Observation Space Design Choices On Training Reinforcement Learning Solutions for Spacecraft Problems | Nathaniel Hamilton et.al. | 2501.06016 | null |
2025-01-10 | Enhanced Acoustic Beamforming with Sub-Aperture Angular Multiply and Sum – in vivo and in Human Demonstration | Matthieu Toulemonde et.al. | 2501.05837 | null |
2025-01-10 | CognoSpeak: an automatic, remote assessment of early cognitive decline in real-world conversational speech | Madhurananda Pahar et.al. | 2501.05755 | null |
2025-01-10 | Semantic Mapping in Indoor Embodied AI – A Comprehensive Survey and Future Directions | Sonia Raychaudhuri et.al. | 2501.05750 | null |
2025-01-10 | How to Enable Effective Cooperation Between Humans and NLP Models: A Survey of Principles, Formalizations, and Beyond | Chen Huang et.al. | 2501.05714 | null |
2025-01-10 | Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains | Vighnesh Subramaniam et.al. | 2501.05707 | null |
2025-01-09 | Search-o1: Agentic Search-Enhanced Large Reasoning Models | Xiaoxi Li et.al. | 2501.05366 | link |
2025-01-09 | Control of Overpopulated Tails in Kinetic Epidemic Models | Mattia Zanella et.al. | 2501.05365 | null |
2025-01-09 | A Path Variant of the Explorer Director Game on Graphs | Abigail Raz et.al. | 2501.05364 | null |
2025-01-09 | On Corrigibility and Alignment in Multi Agent Games | Edmund Dable-Heath et.al. | 2501.05360 | null |
2025-01-09 | A learning agent-based approach to the characterization of open quantum systems | Lorenzo Fioroni et.al. | 2501.05350 | null |
2025-01-09 | The Bakers and Millers Game with Restricted Locations | Simon Krogmann et.al. | 2501.05334 | null |
2025-01-09 | Knowledge Transfer in Model-Based Reinforcement Learning Agents for Efficient Multi-Task Learning | Dmytro Kuzmenko et.al. | 2501.05329 | null |
2025-01-09 | Contrast-Free Myocardial Scar Segmentation in Cine MRI using Motion and Texture Fusion | Guang Yang et.al. | 2501.05241 | null |
2025-01-09 | CoDe: Communication Delay-Tolerant Multi-Agent Collaboration via Dual Alignment of Intent and Timeliness | Shoucheng Song et.al. | 2501.05207 | null |
2025-01-09 | Emergence of human-like polarization among large language model agents | Jinghua Piao et.al. | 2501.05171 | null |
2025-01-08 | RadGPT: Constructing 3D Image-Text Tumor Datasets | Pedro R. A. S. Bassi et.al. | 2501.04678 | link |
2025-01-08 | InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection | Yuhang Liu et.al. | 2501.04575 | link |
2025-01-08 | The importance of being discrete – An agent-based model for active nematics and more | Mathieu Dedenon et.al. | 2501.04559 | null |
2025-01-08 | Approximately EFX and PO Allocations for Bivalued Chores | Zehan Lin et.al. | 2501.04550 | null |
2025-01-08 | Cyber-Physical Steganography in Robotic Motion Control | Ching-Chun Chang et.al. | 2501.04541 | null |
2025-01-08 | Safe Reinforcement Learning with Minimal Supervision | Alexander Quessy et.al. | 2501.04481 | null |
2025-01-08 | Hybrid Artificial Intelligence Strategies for Drone Navigation | Rubén San-Segundo et.al. | 2501.04472 | null |
2025-01-08 | A Digital Shadow for Modeling, Studying and Preventing Urban Crime | Juan Palma-Borda et.al. | 2501.04435 | null |
2025-01-08 | User Simulation in the Era of Generative AI: User Modeling, Synthetic Data Generation, and System Evaluation | Krisztian Balog et.al. | 2501.04410 | null |
2025-01-08 | Agent Laboratory: Using LLM Agents as Research Assistants | Samuel Schmidgall et.al. | 2501.04227 | null |
2025-01-07 | Kinetic theory of decentralized learning for smart active matter | Gerhard Jung et.al. | 2501.03948 | null |
2025-01-07 | Implicit Coordination using Active Epistemic Inference | Lauren Bramblett et.al. | 2501.03907 | null |
2025-01-07 | Truthful mechanisms for linear bandit games with private contexts | Yiting Hu et.al. | 2501.03865 | null |
2025-01-07 | Rendezfood: A Design Case Study of a Conversational Location-based Approach in Restaurants | Philip Weber et.al. | 2501.03862 | null |
2025-01-07 | Run-and-tumble chemotaxis using reinforcement learning | Ramesh Pramanik et.al. | 2501.03687 | null |
2025-01-07 | The Textbook of Tomorrow: Rethinking Course Material Interfacing in the Era of GPT | Audrey Olson et.al. | 2501.03618 | null |
2025-01-07 | Distributed Observer for Descriptor Linear System: The Luenberger Observer Method | Shuai Liu et.al. | 2501.03564 | null |
2025-01-07 | Rethinking Adversarial Attacks in Reinforcement Learning from Policy Distribution Perspective | Tianyang Duan et.al. | 2501.03562 | null |
2025-01-07 | FgC2F-UDiff: Frequency-guided and Coarse-to-fine Unified Diffusion Model for Multi-modality Missing MRI Synthesis | Xiaojiao Xiao et.al. | 2501.03526 | link |
2025-01-07 | A Unified Attack Detection Strategy for Multi-Agent Systems over Transient and Steady Stages | Jinming Gao et.al. | 2501.03496 | null |
2025-01-06 | Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation | Yuhui Zhang et.al. | 2501.03225 | link |
2025-01-06 | Turn-based Multi-Agent Reinforcement Learning Model Checking | Dennis Gross et.al. | 2501.03187 | null |
2025-01-06 | Deep-Relative-Trust-Based Diffusion for Decentralized Deep Learning | Muyun Li et.al. | 2501.03162 | null |
2025-01-06 | Large language models for artificial general intelligence (AGI): A survey of foundational principles and approaches | Alhassan Mumuni et.al. | 2501.03151 | null |
2025-01-06 | Probably Correct Optimal Stable Matching for Two-Sided Markets Under Uncertainty | Andreas Athanasopoulos et.al. | 2501.03018 | link |
2025-01-06 | Approximating N-Player Nash Equilibrium through Gradient Descent | Dongge Wang et.al. | 2501.03001 | null |
2025-01-06 | CALM: Curiosity-Driven Auditing for Large Language Models | Xiang Zheng et.al. | 2501.02997 | link |
2025-01-06 | CAMP: Collaborative Attention Model with Profiles for Vehicle Routing Problems | Chuanbo Hua et.al. | 2501.02977 | link |
2025-01-06 | Revisiting Communication Efficiency in Multi-Agent Reinforcement Learning from the Dimensional Analysis Perspective | Chuxiong Sun et.al. | 2501.02888 | null |
2025-01-06 | A Novel Vision Transformer for Camera-LiDAR Fusion based Traffic Object Segmentation | Toomas Tahves et.al. | 2501.02858 | null |
2025-01-03 | QuArch: A Question-Answering Dataset for AI Agents in Computer Architecture | Shvetank Prakash et.al. | 2501.01892 | null |
2025-01-03 | Multi-Agent Conversational Online Learning for Adaptive LLM Response Identification | Xiangxiang Dai et.al. | 2501.01849 | link |
2025-01-03 | MoColl: Agent-Based Specific and General Model Collaboration for Image Captioning | Pu Yang et.al. | 2501.01834 | null |
2025-01-03 | SDPO: Segment-Level Direct Preference Optimization for Social Agents | Aobo Kong et.al. | 2501.01821 | link |
2025-01-03 | Distributed Framework Construction for Affine Formation Control | Huiming Li et.al. | 2501.01817 | null |
2025-01-03 | Laparoscopic Scene Analysis for Intraoperative Visualisation of Gamma Probe Signals in Minimally Invasive Cancer Surgery | Baoru Huang et.al. | 2501.01752 | null |
2025-01-03 | Proposing Hierarchical Goal-Conditioned Policy Planning in Multi-Goal Reinforcement Learning | Gavin B. Rens et.al. | 2501.01727 | null |
2025-01-03 | AgentRefine: Enhancing Agent Generalization through Refinement Tuning | Dayuan Fu et.al. | 2501.01702 | null |
2025-01-03 | The (Exact) Price of Cardinality for Indivisible Goods: A Parametric Perspective | Alexander Lam et.al. | 2501.01660 | null |
2025-01-03 | PSYCHE: A Multi-faceted Patient Simulation Framework for Evaluation of Psychiatric Assessment Conversational Agents | Jingoo Lee et.al. | 2501.01594 | null |
2025-01-02 | Optimal Strategy Revision in Population Games: A Mean Field Game Theory Perspective | Julian Barreiro-Gomez et.al. | 2501.01389 | null |
2025-01-02 | PIMAEX: Multi-Agent Exploration through Peer Incentivization | Michael Kölle et.al. | 2501.01266 | null |
2025-01-02 | Face-Human-Bench: A Comprehensive Benchmark of Face and Human Understanding for Multi-modal Assistants | Lixiong Qin et.al. | 2501.01243 | null |
2025-01-02 | From Interaction to Attitude: Exploring the Impact of Human-AI Cooperation on Mental Illness Stigma | Tianqi Song et.al. | 2501.01220 | null |
2025-01-02 | D-HAT: a Diatom-inspired structure for a Helmet concept Against Trauma | Ludovico Musenich et.al. | 2501.01211 | null |
2025-01-02 | Harnessing Multi-Agent LLMs for Complex Engineering Problem-Solving: A Framework for Senior Design Projects | Abdullah Mushtaq et.al. | 2501.01205 | null |
2025-01-02 | 3D-LLaVA: Towards Generalist 3D LMMs with Omni Superpoint Transformer | Jiajun Deng et.al. | 2501.01163 | null |
2025-01-02 | A3: Android Agent Arena for Mobile GUI Agents | Yuxiang Chai et.al. | 2501.01149 | null |
2025-01-02 | Embodied AI-Enhanced Vehicular Networks: An Integrated Large Language Models and Reinforcement Learning Method | Ruichen Zhang et.al. | 2501.01141 | null |
2025-01-02 | Communicating Unexpectedness for Out-of-Distribution Multi-Agent Reinforcement Learning | Min Whoo Lee et.al. | 2501.01140 | null |
2024-12-30 | Distributed Mixture-of-Agents for Edge Inference with Large Language Models | Purbesh Mitra et.al. | 2412.21200 | link |
2024-12-30 | Aviary: training language agents on challenging scientific tasks | Siddharth Narayanan et.al. | 2412.21154 | link |
2024-12-30 | Training Software Engineering Agents and Verifiers with SWE-Gym | Jiayi Pan et.al. | 2412.21139 | link |
2024-12-30 | Positional information trade-offs in boundary-driven reaction-diffusion systems | Jonas Berx et.al. | 2412.21113 | null |
2024-12-30 | Exploring and Controlling Diversity in LLM-Agent Conversation | KuanChao Chu et.al. | 2412.21102 | null |
2024-12-30 | Advances in Multi-agent Reinforcement Learning: Persistent Autonomy and Robot Learning Lab Report 2024 | Reza Azadeh et.al. | 2412.21088 | null |
2024-12-30 | Privacy-Aware Multi-Device Cooperative Edge Inference with Distributed Resource Bidding | Wenhao Zhuang et.al. | 2412.21069 | null |
2024-12-30 | Plancraft: an evaluation dataset for planning with LLM agents | Gautier Dagan et.al. | 2412.21033 | link |
2024-12-30 | UnrealZoo: Enriching Photo-realistic Virtual Worlds for Embodied AI | Fangwei Zhong et.al. | 2412.20977 | null |
2024-12-31 | SecBench: A Comprehensive Multi-Dimensional Benchmarking Dataset for LLMs in Cybersecurity | Pengfei Jing et.al. | 2412.20787 | null |
2024-12-27 | Bottom-up robust modeling for the foraging behavior of Physarum polycephalum | Damiano Reginato et.al. | 2412.19790 | null |
2024-12-27 | Fortran2CPP: Automating Fortran-to-C++ Migration using LLMs via Multi-Turn Dialogue and Dual-Agent Integration | Le Chen et.al. | 2412.19770 | link |
2024-12-27 | Can Large Language Models Adapt to Other Agents In-Context? | Matthew Riemer et.al. | 2412.19726 | null |
2024-12-27 | OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis | Qiushi Sun et.al. | 2412.19723 | null |
2024-12-27 | The Value of Recall in Extensive-Form Games | Ratip Emin Berker et.al. | 2412.19659 | null |
2024-12-27 | Xmodel-2 Technical Report | Wang Qun et.al. | 2412.19638 | link |
2024-12-27 | Bidding Games on Markov Decision Processes with Quantitative Reachability Objectives | Guy Avni et.al. | 2412.19609 | null |
2024-12-27 | Hindsight Planner: A Closed-Loop Few-Shot Planner for Embodied Instruction Following | Yuxiao Yang et.al. | 2412.19562 | null |
2024-12-27 | Quantiles under ambiguity and risk sharing | Peng Liu et.al. | 2412.19546 | null |
2024-12-27 | TARGA: Targeted Synthetic Data Generation for Practical Reasoning over Structured Data | Xiang Huang et.al. | 2412.19544 | link |
2024-12-24 | Decentralized Intelligence in GameFi: Embodied AI Agents and the Convergence of DeFi and Virtual Ecosystems | Fernando Jia et.al. | 2412.18601 | link |
2024-12-24 | Automated Code Review In Practice | Umut Cihan et.al. | 2412.18531 | null |
2024-12-24 | Large Language Model guided Deep Reinforcement Learning for Decision Making in Autonomous Driving | Hao Pang et.al. | 2412.18511 | null |
2024-12-24 | Calibrating the Subjective | Mark Whitmeyer et.al. | 2412.18486 | null |
2024-12-24 | Multi-Agent Norm Perception and Induction in Distributed Healthcare | Chao Li et.al. | 2412.18454 | null |
2024-12-24 | 3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene Understanding | Tatiana Zemskova et.al. | 2412.18450 | link |
2024-12-24 | GeAR: Graph-enhanced Agent for Retrieval-augmented Generation | Zhili Shen et.al. | 2412.18431 | null |
2024-12-24 | Explainable Multi-Modal Data Exploration in Natural Language via LLM Agent | Farhad Nooralahzadeh et.al. | 2412.18428 | link |
2024-12-24 | GUI Testing Arena: A Unified Benchmark for Advancing Autonomous GUI Testing Agent | Kangjia Zhao et.al. | 2412.18426 | null |
2024-12-24 | Muse: A Multimodal Conversational Recommendation Dataset with Scenario-Grounded User Profiles | Zihan Wang et.al. | 2412.18416 | null |
2024-12-23 | Observation Interference in Partially Observable Assistance Games | Scott Emmons et.al. | 2412.17797 | null |
2024-12-23 | ResearchTown: Simulator of Human Research Community | Haofei Yu et.al. | 2412.17767 | link |
2024-12-23 | Sensitivity Curve Maximization: Attacking Robust Aggregators in Distributed Learning | Christian A. Schroth et.al. | 2412.17740 | null |
2024-12-23 | Robin Hood Reachability Bidding Games | Shaull Almagor et.al. | 2412.17718 | null |
2024-12-23 | SMAC-Hard: Enabling Mixed Opponent Strategy Script and Self-play on SMAC | Yue Deng et.al. | 2412.17707 | link |
2024-12-23 | Large Language Model Safety: A Holistic Survey | Dan Shi et.al. | 2412.17686 | link |
2024-12-23 | Shape and Performance of Fastest Paths over Networks with Interacting Selfish Agents | Marco Cogoni et.al. | 2412.17665 | null |
2024-12-23 | CoSurfGS:Collaborative 3D Surface Gaussian Splatting with Distributed Learning for Large Scene Reconstruction | Yuanyuan Gao et.al. | 2412.17612 | null |
2024-12-23 | Fluid-Derived Lattices for Unbiased Modeling of Bacterial Colony Growth | Bryan Verhoef et.al. | 2412.17604 | null |
2024-12-23 | PC Agent: While You Sleep, AI Works – A Cognitive Journey into Digital World | Yanheng He et.al. | 2412.17589 | link |
2024-12-20 | Offline Reinforcement Learning for LLM Multi-Step Reasoning | Huaijie Wang et.al. | 2412.16145 | link |
2024-12-20 | Data-Driven Mechanism Design: Jointly Eliciting Preferences and Information | Dirk Bergemann et.al. | 2412.16132 | null |
2024-12-20 | Towards Interpretable Radiology Report Generation via Concept Bottlenecks using a Multi-Agentic RAG | Hasan Md Tusfiqur Alam et.al. | 2412.16086 | link |
2024-12-20 | Active Flow Control for Bluff Body under High Reynolds Number Turbulent Flow Conditions Using Deep Reinforcement Learning | Jingbo Chen et.al. | 2412.15975 | null |
2024-12-20 | The multilayer garbage disposal game | Hsin-Lun Li et.al. | 2412.15942 | null |
2024-12-20 | Speedup Techniques for Switchable Temporal Plan Graph Optimization | He Jiang et.al. | 2412.15908 | null |
2024-12-20 | Exploring the Effects of AI Nonverbal Emotional Cues on Human Decision Certainty in Moral Dilemmas | Chenyi Zhang et.al. | 2412.15834 | null |
2024-12-20 | WebLLM: A High-Performance In-Browser LLM Inference Engine | Charlie F. Ruan et.al. | 2412.15803 | link |
2024-12-20 | FTISS Adaptive Bearing-Only Formation Tracking Control with Unknown Disturbance Rejection | Hong Liang Cheah et.al. | 2412.15757 | null |
2024-12-20 | Online Optimization Algorithms in Repeated Price Competition: Equilibrium Learning and Algorithmic Collusion | Martin Bichler et.al. | 2412.15707 | null |
2024-12-19 | AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous Driving | Shuo Xing et.al. | 2412.15206 | link |
2024-12-19 | Human-Humanoid Robots Cross-Embodiment Behavior-Skill Transfer Using Decomposed Adversarial Learning from Demonstration | Junjia Liu et.al. | 2412.15166 | link |
2024-12-19 | Operationalising Rawlsian Ethics for Fairness in Norm-Learning Agents | Jessica Woodgate et.al. | 2412.15163 | null |
2024-12-19 | Equal Merit Does Not Imply Equality: Discrimination at Equilibrium in a Hiring Market with Symmetric Agents | Serafina Kamp et.al. | 2412.15162 | null |
2024-12-19 | Probabilistic Strategy Logic with Degrees of Observability | Chunyan Mu et.al. | 2412.15135 | null |
2024-12-19 | From Nonequilibrium to Equilibrium: Insights from a Two-Population Occupation Model | Jerome Garnier-Brun et.al. | 2412.14996 | null |
2024-12-19 | Dream to Manipulate: Compositional World Models Empowering Robot Imitation Learning with Imagination | Leonardo Barcellona et.al. | 2412.14957 | null |
2024-12-19 | Long Time Behavior and Stabilization for Displacement Monotone Mean Field Games | Marco Cirant et.al. | 2412.14903 | null |
2024-12-19 | Hierarchical Subspaces of Policies for Continual Offline Reinforcement Learning | Anthony Kobanda et.al. | 2412.14865 | null |
2024-12-19 | Entropy Regularized Task Representation Learning for Offline Meta-Reinforcement Learning | Mohammadreza nakhaei et.al. | 2412.14834 | link |
2024-12-18 | TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks | Frank F. Xu et.al. | 2412.14161 | link |
2024-12-18 | Future Research Avenues for Artificial Intelligence in Digital Gaming: An Exploratory Report | Markus Dablander et.al. | 2412.14085 | null |
2024-12-18 | A Computationally Grounded Framework for Cognitive Attitudes (extended version) | Tiago de Lima et.al. | 2412.14073 | null |
2024-12-18 | Spatio-Temporal SIR Model of Pandemic Spread During Warfare with Optimal Dual-use Healthcare System Administration using Deep Reinforcement Learning | Adi Shuchami et.al. | 2412.14039 | link |
2024-12-18 | Decentralized Convergence to Equilibrium Prices in Trading Networks | Edwin Lock et.al. | 2412.13972 | null |
2024-12-18 | Threshold UCT: Cost-Constrained Monte Carlo Tree Search with Pareto Curves | Martin Kurečka et.al. | 2412.13962 | null |
2024-12-18 | Harvesting energy from turbulent winds with Reinforcement Learning | Lorenzo Basile et.al. | 2412.13961 | null |
2024-12-18 | Towards privacy-preserving cooperative control via encrypted distributed optimization | Philipp Binfet et.al. | 2412.13953 | null |
2024-12-18 | Strategyproof Matching of Roommates and Rooms | Hadi Hosseini et.al. | 2412.13887 | null |
2024-12-18 | Who Saves us From Risk? Altruists Promote Cooperation in a Public Investment Game | Shen Zhang et.al. | 2412.13816 | null |
2024-12-17 | Proposer-Agent-Evaluator(PAE): Autonomous Skill Discovery For Foundation Model Internet Agents | Yifei Zhou et.al. | 2412.13194 | null |
2024-12-17 | GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding | Haoyi Jiang et.al. | 2412.13193 | link |
2024-12-17 | SafeAgentBench: A Benchmark for Safe Task Planning of Embodied LLM Agents | Sheng Yin et.al. | 2412.13178 | link |
2024-12-17 | Practicable Black-box Evasion Attacks on Link Prediction in Dynamic Graphs – A Graph Sequential Embedding Method | Jiate Li et.al. | 2412.13134 | link |
2024-12-17 | Contract-based Design and Verification of Multi-Agent Systems with Quantitative Temporal Requirements | Rafael Dewes et.al. | 2412.13114 | null |
2024-12-17 | Active Reinforcement Learning Strategies for Offline Policy Improvement | Ambedkar Dukkipati et.al. | 2412.13106 | null |
2024-12-17 | AI PERSONA: Towards Life-long Personalization of LLMs | Tiannan Wang et.al. | 2412.13103 | null |
2024-12-17 | Reservoir Computing for Fast, Simplified Reinforcement Learning on Memory Tasks | Kevin McKee et.al. | 2412.13093 | null |
2024-12-17 | Distributed Normal Map-based Stochastic Proximal Gradient Methods over Networks | Kun Huang et.al. | 2412.13054 | null |
2024-12-18 | NAVCON: A Cognitively Inspired and Linguistically Grounded Corpus for Vision and Language Navigation | Karan Wanchoo et.al. | 2412.13026 | null |
2024-12-16 | Revelations: A Decidable Class of POMDPs with Omega-Regular Objectives | Marius Belly et.al. | 2412.12063 | link |
2024-12-16 | Virtual Agent-Based Communication Skills Training to Facilitate Health Persuasion Among Peers | Farnaz Nouraei et.al. | 2412.12061 | null |
2024-12-16 | Learning to Navigate in Mazes with Novel Layouts using Abstract Top-down Maps | Linfeng Zhao et.al. | 2412.12024 | null |
2024-12-16 | Agentic AI-Driven Technical Troubleshooting for Enterprise Systems: A Novel Weighted Retrieval-Augmented Generation Paradigm | Rajat Khanda et.al. | 2412.12006 | null |
2024-12-16 | CP-Guard: Malicious Agent Detection and Defense in Collaborative Bird’s Eye View Perception | Senkang Hu et.al. | 2412.12000 | null |
2024-12-16 | AlphaZero Neural Scaling and Zipf’s Law: a Tale of Board Games and Power Laws | Oren Neumann et.al. | 2412.11979 | link |
2024-12-16 | Learning Human-Aware Robot Policies for Adaptive Assistance | Jason Qin et.al. | 2412.11913 | null |
2024-12-16 | Reentrant phase behavior in binary topological flocks with nonreciprocal alignment | Tian Tang et.al. | 2412.11871 | null |
2024-12-16 | The Black Ninjas and the Sniper: On Robustness of Population Protocols | Benno Lossin et.al. | 2412.11783 | null |
2024-12-16 | Prediction of social dilemmas in networked populations via graph neural networks | Huaiyu Tan et.al. | 2412.11775 | null |
2024-12-13 | Iris: Breaking GUI Complexity with Adaptive Focus and Self-Refining | Zhiqi Ge et.al. | 2412.10342 | null |
2024-12-13 | Reciprocity in Interbank Markets | Lutz Honvehlmann et.al. | 2412.10329 | null |
2024-12-13 | MeshA*: Efficient Path Planing With Motion Primitives | Marat Agranovskiy et.al. | 2412.10320 | null |
2024-12-13 | BrushEdit: All-In-One Image Inpainting and Editing | Yaowei Li et.al. | 2412.10316 | null |
2024-12-13 | Cultural Evolution of Cooperation among LLM Agents | Aron Vallinder et.al. | 2412.10270 | null |
2024-12-13 | ROUTE: Robust Multitask Tuning and Collaboration for Text-to-SQL | Yang Qin et.al. | 2412.10138 | link |
2024-12-13 | You Name It, I Run It: An LLM Agent to Execute Tests of Arbitrary Projects | Islem Bouzenia et.al. | 2412.10133 | link |
2024-12-13 | Reward Machine Inference for Robotic Manipulation | Mattijs Baert et.al. | 2412.10096 | null |
2024-12-13 | Heterogeneous Multi-Robot Graph Coverage with Proximity and Movement Constraints | Dolev Mutzari et.al. | 2412.10083 | null |
2024-12-13 | Large Action Models: From Inception to Implementation | Lu Wang et.al. | 2412.10047 | link |
2024-12-12 | GenEx: Generating an Explorable World | Taiming Lu et.al. | 2412.09624 | null |
2024-12-12 | AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials | Yiheng Xu et.al. | 2412.09605 | null |
2024-12-12 | DiverseAgentEntropy: Quantifying Black-Box LLM Uncertainty through Diverse Perspectives and Multi-Agent Interaction | Yu Feng et.al. | 2412.09572 | null |
2024-12-12 | Can Modern LLMs Act as Agent Cores in Radiology~Environments? | Qiaoyu Zheng et.al. | 2412.09529 | link |
2024-12-12 | Agent-based Video Trimming | Lingfeng Yang et.al. | 2412.09513 | null |
2024-12-12 | Solving Multiagent Path Finding on Highly Centralized Networks | Foivos Fioravantes et.al. | 2412.09433 | null |
2024-12-12 | From Intention To Implementation: Automating Biomedical Research via LLMs | Yi Luo et.al. | 2412.09429 | null |
2024-12-12 | Reinforcement Learning Within the Classical Robotics Stack: A Case Study in Robot Soccer | Adam Labiosa et.al. | 2412.09417 | null |
2024-12-12 | Uncommon Belief in Rationality | Qi Shi et.al. | 2412.09407 | null |
2024-12-12 | Falcon-UI: Understanding GUI Before Following User Instructions | Huawen Shen et.al. | 2412.09362 | null |
2024-12-11 | GPD-1: Generative Pre-training for Driving | Zixun Xie et.al. | 2412.08643 | link |
2024-12-11 | Generative Semantic Communication: Architectures, Technologies, and Applications | Jinke Ren et.al. | 2412.08642 | null |
2024-12-11 | RoomTour3D: Geometry-Aware Video-Instruction Tuning for Embodied Navigation | Mingfei Han et.al. | 2412.08591 | null |
2024-12-11 | Automated Soap Opera Testing Directed by LLMs and Scenario Knowledge: Feasibility, Challenges, and Road Ahead | Yanqi Su et.al. | 2412.08581 | null |
2024-12-11 | GenPlan: Generative sequence models as adaptive planners | Akash Karthikeyan et.al. | 2412.08565 | link |
2024-12-11 | An End-to-End Collaborative Learning Approach for Connected Autonomous Vehicles in Occluded Scenarios | Leandro Parada et.al. | 2412.08562 | null |
2024-12-11 | Exact Algorithms for Multiagent Path Finding with Communication Constraints on Tree-Like Structures | Foivos Fioravantes et.al. | 2412.08556 | null |
2024-12-11 | Grimm: A Plug-and-Play Perturbation Rectifier for Graph Neural Networks Defending against Poisoning Attacks | Ao Liu et.al. | 2412.08555 | null |
2024-12-11 | MaestroMotif: Skill Design from Artificial Intelligence Feedback | Martin Klissarov et.al. | 2412.08542 | null |
2024-12-11 | Spatial segregation across travelling fronts in individual-based and continuum models for the growth of heterogeneous cell populations | José A. Carrillo et.al. | 2412.08535 | null |
2024-12-10 | Balancing Mobility Behaviors to avoid Global epidemics from Local Outbreaks | Pablo Valgañón et.al. | 2412.07656 | null |
2024-12-10 | Searching for Structure: Investigating Emergent Communication with Large Language Models | Tom Kouwenhoven et.al. | 2412.07646 | null |
2024-12-10 | Offline Multi-Agent Reinforcement Learning via In-Sample Sequential Policy Optimization | Zongkai Liu et.al. | 2412.07639 | link |
2024-12-10 | Swarm Behavior Cloning | Jonas Nüßlein et.al. | 2412.07617 | null |
2024-12-10 | Modeling Speculative Trading Patterns in Token Markets: An Agent-Based Analysis with TokenLab | Mengjue Wang et.al. | 2412.07512 | null |
2024-12-10 | ConfigX: Modular Configuration for Evolutionary Algorithms via Multitask Reinforcement Learning | Hongshu Guo et.al. | 2412.07507 | null |
2024-12-10 | SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World | Jiaqi Zhang et.al. | 2412.07472 | link |
2024-12-10 | Event-Triggered Memory Control for Interval Type-2 Fuzzy Heterogeneous Multi-Agent Systems | Sen Kong et.al. | 2412.07471 | null |
2024-12-10 | Dynamic Ensemble Reasoning for LLM Experts | Jinwu Hu et.al. | 2412.07448 | null |
2024-12-10 | ITPNet: Towards Instantaneous Trajectory Prediction for Autonomous Driving | Rongqing Li et.al. | 2412.07369 | null |
2024-12-09 | Proactive Agents for Multi-Turn Text-to-Image Generation Under Uncertainty | Meera Hahn et.al. | 2412.06771 | link |
2024-12-09 | AutoDCWorkflow: LLM-based Data Cleaning Workflow Auto-Generation and Benchmark | Lan Li et.al. | 2412.06724 | link |
2024-12-09 | Asynchronous Agents with Perfect Recall: Model Reductions, Knowledge-Based Construction, and Model Checking for Coalitional Strategies | Dilian Gurov et.al. | 2412.06706 | null |
2024-12-09 | Toward LLM-Agent-Based Modeling of Transportation Systems: A Conceptual Framework | Tianming Liu et.al. | 2412.06681 | null |
2024-12-09 | Self-Interested Agents in Collaborative Learning: An Incentivized Adaptive Data-Centric Framework | Nithia Vijayan et.al. | 2412.06597 | null |
2024-12-09 | Argentine ants regulate traffic flow with stopped individuals | Ulrich Dobramysl et.al. | 2412.06587 | null |
2024-12-09 | Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation | Egor Cherepanov et.al. | 2412.06531 | null |
2024-12-09 | EFX Allocations on Some Multi-graph Classes | Umang Bhaskar et.al. | 2412.06513 | null |
2024-12-09 | The Fusion of Large Language Models and Formal Methods for Trustworthy AI Agents: A Roadmap | Yedi Zhang et.al. | 2412.06512 | null |
2024-12-09 | Reasoning about Strategic Abilities in Stochastic Multi-agent Systems | Yedi Zhang et.al. | 2412.06509 | null |
2024-12-06 | TeamCraft: A Benchmark for Multi-Modal Multi-Agent Systems in Minecraft | Qian Long et.al. | 2412.05255 | link |
2024-12-06 | AI’s assigned gender affects human-AI cooperation | Sepideh Bazazi et.al. | 2412.05214 | null |
2024-12-06 | SurgBox: Agent-Driven Operating Room Sandbox with Surgery Copilot | Jinlin Wu et.al. | 2412.05187 | link |
2024-12-06 | Sense and Sensitivity: Evaluating the simulation of social dynamics via Large Language Models | Da Ju et.al. | 2412.05093 | null |
2024-12-06 | Synchronization and desynchronization in ensembles of mobile agents | E. M. Varvarin et.al. | 2412.05040 | null |
2024-12-06 | Frontier Models are Capable of In-context Scheming | Alexander Meinke et.al. | 2412.04984 | null |
2024-12-06 | Putting the Iterative Training of Decision Trees to the Test on a Real-World Robotic Task | Raphael C. Engelhardt et.al. | 2412.04974 | null |
2024-12-06 | Who Speaks Next? Multi-party AI Discussion Leveraging the Systematics of Turn-taking in Murder Mystery Games | Ryota Nonomura et.al. | 2412.04937 | link |
2024-12-06 | Probing the contents of semantic representations from text, behavior, and brain data using the psychNorms metabase | Zak Hussain et.al. | 2412.04936 | link |
2024-12-06 | PERCY: A Multimodal Dataset and Conversational System for Personalized and Emotionally Aware Human-Robot Interaction | Mohammed Althubyani et.al. | 2412.04908 | null |
2024-12-05 | Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction | Yiheng Xu et.al. | 2412.04454 | null |
2024-12-05 | GenMAC: Compositional Text-to-Video Generation with Multi-Agent Collaboration | Kaiyi Huang et.al. | 2412.04440 | null |
2024-12-05 | Sub-diffraction Imaging of Carrier Dynamics in Halide Perovskite Semiconductors: Effects of Passivation, Morphology, and Ion Motion | Madeleine D. Breshears et.al. | 2412.04423 | null |
2024-12-05 | Targeting the Core: A Simple and Effective Method to Attack RAG-based Agents via Direct LLM Manipulation | Xuying Li et.al. | 2412.04415 | null |
2024-12-05 | EmbodiedOcc: Embodied 3D Occupancy Prediction for Vision-based Online Scene Understanding | Yuqi Wu et.al. | 2412.04380 | link |
2024-12-05 | Intersection-Aware Assessment of EMS Accessibility in NYC: A Data-Driven Approach | Haoran Su et.al. | 2412.04369 | null |
2024-12-05 | Finer Behavioral Foundation Models via Auto-Regressive Features and Advantage Weighting | Edoardo Cetin et.al. | 2412.04368 | null |
2024-12-05 | Machine Theory of Mind for Autonomous Cyber-Defence | Luke Swaby et.al. | 2412.04367 | null |
2024-12-05 | Reinforcement Learning for Freeway Lane-Change Regulation via Connected Vehicles | Ke Sun et.al. | 2412.04341 | null |
2024-12-05 | Action Mapping for Reinforcement Learning in Continuous Environments with Constraints | Mirco Theile et.al. | 2412.04327 | null |
2024-12-04 | Navigation World Models | Amir Bar et.al. | 2412.03572 | null |
2024-12-04 | From Individual to Society: A Survey on Social Simulation Driven by Large Language Model-based Agents | Xinyi Mou et.al. | 2412.03563 | link |
2024-12-04 | Categorize and randomize: a model of sequential stochastic choice | Ester Sudano et.al. | 2412.03554 | null |
2024-12-04 | SPICE: Smart Projection Interface for Cooking Enhancement | Vera Prohaska et.al. | 2412.03551 | link |
2024-12-04 | Risk-aware Classification via Uncertainty Quantification | Murat Sensoy et.al. | 2412.03391 | null |
2024-12-04 | WiS Platform: Enhancing Evaluation of LLM-Based Multi-Agent Systems Through Game-Based Analysis | Chengwei Hu et.al. | 2412.03359 | null |
2024-12-04 | AI-Driven Day-to-Day Route Choice | Leizhen Wang et.al. | 2412.03338 | link |
2024-12-04 | Mean-field Concentration of Opinion Dynamics in Random Graphs | Javiera Gutiérrez-Ramírez et.al. | 2412.03207 | null |
2024-12-04 | AffordDP: Generalizable Diffusion Policy with Transferable Affordance | Shijie Wu et.al. | 2412.03142 | null |
2024-12-04 | ChatTS: Aligning Time Series with LLMs via Synthetic Data for Enhanced Understanding and Reasoning | Zhe Xie et.al. | 2412.03104 | link |
2024-12-03 | Leveraging Tactile Sensing to Render both Haptic Feedback and Virtual Reality 3D Object Reconstruction in Robotic Telemanipulation | Gabriele Giudici et.al. | 2412.02644 | null |
2024-12-03 | Mobile Cell-Free Massive MIMO with Multi-Agent Reinforcement Learning: A Scalable Framework | Ziheng Liu et.al. | 2412.02581 | null |
2024-12-03 | Generating Critical Scenarios for Testing Automated Driving Systems | Trung-Hieu Nguyen et.al. | 2412.02574 | link |
2024-12-03 | TAB-Fields: A Maximum Entropy Framework for Mission-Aware Adversarial Planning | Gokul Puthumanaillam et.al. | 2412.02570 | link |
2024-12-03 | Defending Against Diverse Attacks in Federated Learning Through Consensus-Based Bi-Level Optimization | Nicolás García Trillos et.al. | 2412.02535 | link |
2024-12-03 | General Resetting Theory for Group Avoidance | Juhee Lee et.al. | 2412.02524 | null |
2024-12-03 | Resonance: Learning to Predict Social-Aware Pedestrian Trajectories as Co-Vibrations | Conghao Wong et.al. | 2412.02447 | null |
2024-12-03 | A Multi-Agent Framework for Extensible Structured Text Generation in PLCs | Donghao Yang et.al. | 2412.02410 | null |
2024-12-03 | Who Walks With You Matters: Perceiving Social Interactions with Groups for Pedestrian Trajectory Prediction | Ziqian Zou et.al. | 2412.02395 | null |
2024-12-03 | Bio-inspired visual relative localization for large swarms of UAVs | Martin Křížek et.al. | 2412.02393 | null |
2024-11-29 | EF1 Allocations for Identical Trilean and Separable Single-Peaked Valuations | Umang Bhaskar et.al. | 2411.19881 | null |
2024-11-29 | Neuroplasticity and Psychedelics: a comprehensive examination of classic and non-classic compounds in pre and clinical models | Claudio Agnorelli et.al. | 2411.19840 | null |
2024-11-29 | Advanced System Integration: Analyzing OpenAPI Chunking for Retrieval-Augmented Generation | Robin D. Pesl et.al. | 2411.19804 | null |
2024-11-29 | CAREL: Instruction-guided reinforcement learning with cross-modal auxiliary objectives | Armin Saghafian et.al. | 2411.19787 | link |
2024-11-29 | The 2024 Motile Active Matter Roadmap | Gerhard Gompper et.al. | 2411.19783 | null |
2024-11-29 | HVAC-DPT: A Decision Pretrained Transformer for HVAC Control | Anaïs Berkes et.al. | 2411.19746 | null |
2024-11-29 | Relative Representations of Latent Spaces enable Efficient Semantic Channel Equalization | Tomás Hüttebräucker et.al. | 2411.19719 | null |
2024-11-29 | RMIO: A Model-Based MARL Framework for Scenarios with Observation Loss in Some Agents | Shi Zifeng et.al. | 2411.19639 | null |
2024-11-29 | Build An Influential Bot In Social Media Simulations With Large Language Models | Bailu Jin et.al. | 2411.19635 | null |
2024-11-29 | Solving Rubik’s Cube Without Tricky Sampling | Yicheng Lin et.al. | 2411.19583 | null |
2024-11-27 | Proactive Gradient Conflict Mitigation in Multi-Task Learning: A Sparse Training Perspective | Zhi Zhang et.al. | 2411.18615 | null |
2024-11-27 | Robust Offline Reinforcement Learning with Linearly Structured $f$ -Divergence Regularization | Cheng Tang et.al. | 2411.18612 | null |
2024-11-27 | AdaVLN: Towards Visual Language Navigation in Continuous Indoor Environments with Moving Humans | Dillon Loh et.al. | 2411.18539 | link |
2024-11-27 | Biswas-Chatterjee-Sen kinetic exchange opinion model for two connected groups | Krzysztof Suchecki et.al. | 2411.18527 | null |
2024-11-27 | NeuroAI for AI Safety | Patrick Mineault et.al. | 2411.18526 | null |
2024-11-27 | Collective decision making by embodied neural agents | Nicolas Coucke et.al. | 2411.18498 | link |
2024-11-27 | Is my Meeting Summary Good? Estimating Quality with a Multi-LLM Evaluator | Frederic Kirstein et.al. | 2411.18444 | null |
2024-11-28 | A Multi-Agent Dual Dialogue System to Support Mental Health Care Providers | Onno P. Kampman et.al. | 2411.18429 | null |
2024-11-27 | Application of Soft Actor-Critic Algorithms in Optimizing Wastewater Treatment with Time Delays Integration | Esmaeel Mohammadi et.al. | 2411.18305 | null |
2024-11-27 | InterHub: A Naturalistic Trajectory Dataset with Dense Interaction for Autonomous Driving | Xiyan Jiang et.al. | 2411.18302 | link |
2024-11-26 | SketchAgent: Language-Driven Sequential Sketch Generation | Yael Vinker et.al. | 2411.17673 | null |
2024-11-26 | MALMM: Multi-Agent Large Language Models for Zero-Shot Robotics Manipulation | Harsh Singh et.al. | 2411.17636 | null |
2024-11-26 | Making History Readable | Bipasha Banerjee et.al. | 2411.17600 | null |
2024-11-26 | Agentic AI for Improving Precision in Identifying Contributions to Sustainable Development Goals | William A. Ingram et.al. | 2411.17598 | null |
2024-11-26 | Decision making in stochastic extensive form II: Stochastic extensive forms and games | E. Emanuel Rapsch et.al. | 2411.17587 | null |
2024-11-26 | Multi-Objective Reinforcement Learning for Automated Resilient Cyber Defence | Ross O’Driscoll et.al. | 2411.17585 | null |
2024-11-26 | Ensuring Safety in Target Pursuit Control: A CBF-Safe Reinforcement Learning Approach | Yaosheng Deng et.al. | 2411.17552 | null |
2024-11-26 | ShowUI: One Vision-Language-Action Model for GUI Visual Agent | Kevin Qinghong Lin et.al. | 2411.17465 | link |
2024-11-26 | Object-centric proto-symbolic behavioural reasoning from pixels | Ruben van Bergen et.al. | 2411.17438 | link |
2024-11-26 | Joint Combinatorial Node Selection and Resource Allocations in the Lightning Network using Attention-based Reinforcement Learning | Mahdi Salahshour et.al. | 2411.17353 | null |
2024-11-25 | Winning opinion: Following Your Friends’ Advice or That of Their Friends? | Francisco J. Muñoz et.al. | 2411.16671 | null |
2024-11-25 | Barriers on the EDGE: A scalable CBF architecture over EDGE for safe aerial-ground multi-agent coordination | Viswa Narayanan Sankaranarayanan et.al. | 2411.16608 | null |
2024-11-25 | Naive Algorithmic Collusion: When Do Bandit Learners Cooperate and When Do They Compete? | Connor Douglas et.al. | 2411.16574 | null |
2024-11-25 | Continual Deep Reinforcement Learning with Task-Agnostic Policy Distillation | Muhammad Burhan Hafez et.al. | 2411.16532 | link |
2024-11-25 | Reinforcement Learning for Bidding Strategy Optimization in Day-Ahead Energy Market | Luca Di Persio et.al. | 2411.16519 | null |
2024-11-25 | Online Guidance Graph Optimization for Lifelong Multi-Agent Path Finding | Hongzhi Zang et.al. | 2411.16506 | link |
2024-11-25 | Distributed Online Optimization with Stochastic Agent Availability | Juliette Achddou et.al. | 2411.16477 | null |
2024-11-25 | Generating social networks with static and dynamic utility-maximization approaches | Aldric Labarthe et.al. | 2411.16464 | link |
2024-11-25 | Characterized Diffusion Networks for Enhanced Autonomous Driving Trajectory Prediction | Haoming Li et.al. | 2411.16457 | null |
2024-11-25 | TopV-Nav: Unlocking the Top-View Spatial Reasoning Potential of MLLM for Zero-shot Object Navigation | Linqing Zhong et.al. | 2411.16425 | null |
2024-11-22 | RE-Bench: Evaluating frontier AI R&D capabilities of language model agents against human experts | Hjalmar Wijk et.al. | 2411.15114 | link |
2024-11-22 | XGrammar: Flexible and Efficient Structured Generation Engine for Large Language Models | Yixin Dong et.al. | 2411.15100 | null |
2024-11-22 | On Multi-Agent Inverse Reinforcement Learning | Till Freihaut et.al. | 2411.15046 | null |
2024-11-22 | Safe Multi-Agent Reinforcement Learning with Convergence to Generalized Nash Equilibrium | Zeyang Li et.al. | 2411.15036 | null |
2024-11-22 | On the Linear Speedup of Personalized Federated Reinforcement Learning with Shared Representations | Guojun Xiong et.al. | 2411.15014 | null |
2024-11-22 | ScribeAgent: Towards Specialized Web Agents Using Production-Scale Workflow Data | Junhong Shen et.al. | 2411.15004 | link |
2024-11-22 | Free Energy Projective Simulation (FEPS): Active inference with interpretability | Joséphine Pazem et.al. | 2411.14991 | null |
2024-11-22 | BIP3D: Bridging 2D Images and 3D Perception for Embodied Intelligence | Xuewu Lin et.al. | 2411.14869 | link |
2024-11-22 | Universal and Context-Independent Triggers for Precise Control of LLM Outputs | Jiashuo Liang et.al. | 2411.14738 | null |
2024-11-22 | Enhancing Clinical Trial Patient Matching through Knowledge Augmentation with Multi-Agents | Hanwen Shi et.al. | 2411.14637 | null |
2024-11-21 | Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models | Yuhao Dong et.al. | 2411.14432 | link |
2024-11-21 | Multi-Agent Environments for Vehicle Routing Problems | Ricardo Gama et.al. | 2411.14411 | link |
2024-11-21 | Resolving Multiple-Dynamic Model Uncertainty in Hypothesis-Driven Belief-MDPs | Ofer Dagan et.al. | 2411.14404 | null |
2024-11-21 | SplatR : Experience Goal Visual Rearrangement with 3D Gaussian Splatting and Dense Feature Matching | Arjun P S et.al. | 2411.14322 | link |
2024-11-21 | Q-CSM: Q-Learning-based Cognitive Service Management in Heterogeneous IoT Networks | Kubra Duran et.al. | 2411.14281 | null |
2024-11-21 | Explainable Multi-Agent Reinforcement Learning for Extended Reality Codec Adaptation | Pedro Enrique Iturria-Rivera et.al. | 2411.14264 | null |
2024-11-21 | Physics-Informed LLM-Agent for Automated Modulation Design in Power Electronics Systems | Junhua Liu et.al. | 2411.14214 | null |
2024-11-21 | SPARKLE: A Unified Single-Loop Primal-Dual Framework for Decentralized Bilevel Optimization | Shuchen Zhu et.al. | 2411.14166 | null |
2024-11-21 | Multi-terminal Strong Coordination subject to Secrecy Constraints | Viswanathan Ramachandran et.al. | 2411.14123 | null |
2024-11-21 | Umbrella Reinforcement Learning – computationally efficient tool for hard non-linear problems | Egor E. Nuzhin et.al. | 2411.14117 | link |
2024-11-20 | BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games | Davide Paglieri et.al. | 2411.13543 | null |
2024-11-20 | Metacognition for Unknown Situations and Environments (MUSE) | Rodolfo Valiente et.al. | 2411.13537 | null |
2024-11-20 | AdaptAgent: Adapting Multimodal Web Agents with Few-Shot Learning from Human Demonstrations | Gaurav Verma et.al. | 2411.13451 | null |
2024-11-20 | Robust Monocular Visual Odometry using Curriculum Learning | Assaf Lahiany et.al. | 2411.13438 | null |
2024-11-20 | A Survey On Enhancing Reinforcement Learning in Complex Environments: Insights from Human and LLM Feedback | Alireza Rashidi Laleh et.al. | 2411.13410 | null |
2024-11-20 | Simulating Liquidity: Agent-Based Modeling of Illiquid Markets for Fractional Ownership | Lars Fluri et.al. | 2411.13381 | null |
2024-11-20 | WHALES: A Multi-agent Scheduling Dataset for Enhanced Cooperation in Autonomous Driving | Siwei Chen et.al. | 2411.13340 | link |
2024-11-20 | Revealed Information | Laura Doval et.al. | 2411.13293 | null |
2024-11-20 | Transforming the Hybrid Cloud for Emerging AI Workloads | Deming Chen et.al. | 2411.13239 | null |
2024-11-20 | Extremum and Nash Equilibrium Seeking with Delays and PDEs: Designs & Applications | Tiago Roux Oliveira et.al. | 2411.13234 | null |
2024-11-19 | Reinforcement Learning, Collusion, and the Folk Theorem | Galit Askenazi-Golan et.al. | 2411.12725 | null |
2024-11-19 | UBSoft: A Simulation Platform for Robotic Skill Learning in Unbounded Soft Environments | Chunru Lin et.al. | 2411.12711 | null |
2024-11-19 | Weighted Envy Freeness With Limited Subsidies | Noga Klein Elmalem et.al. | 2411.12696 | null |
2024-11-19 | Quasi-stability notions in two-sided matching models | Nadia Guiñazú et.al. | 2411.12533 | null |
2024-11-19 | Coevolution of relationship-driven cooperation under recommendation protocol on multiplex networks | Hongyu Yue et.al. | 2411.12436 | null |
2024-11-19 | Instrumentation of Software Systems with OpenTelemetry for Software Visualization | Malte Hansen et.al. | 2411.12380 | null |
2024-11-19 | C $^{2}$ INet: Realizing Incremental Trajectory Prediction with Prior-Aware Continual Causal Intervention | Xiaohe Li et.al. | 2411.12313 | null |
2024-11-19 | SNN-Based Online Learning of Concepts and Action Laws in an Open World | Christel Grimaud et.al. | 2411.12308 | null |
2024-11-19 | Emergence of Implicit World Models from Mortal Agents | Kazuya Horibe et.al. | 2411.12304 | null |
2024-11-19 | Could Humans Outshine AI in Visual Data Analysis? | Ratanond Koonchanok et.al. | 2411.12299 | null |
2024-11-18 | Generative World Explorer | Taiming Lu et.al. | 2411.11844 | null |
2024-11-18 | Reinterpreting Delay and Procrastination | Conrad Kosowsky et.al. | 2411.11828 | null |
2024-11-18 | Competing Bandits in Decentralized Large Contextual Matching Markets | Satush Parikh et.al. | 2411.11794 | null |
2024-11-18 | LLM-IE: A Python Package for Generative Information Extraction with Large Language Models | Enshuo Hsu et.al. | 2411.11779 | null |
2024-11-18 | Mapping out the Space of Human Feedback for Reinforcement Learning: A Conceptual Framework | Yannick Metz et.al. | 2411.11761 | null |
2024-11-18 | The Power of Many: Multi-Agent Multimodal Models for Cultural Image Captioning | Longju Bai et.al. | 2411.11758 | link |
2024-11-18 | Distributed Asynchronous Time-Varying Quadratic Programming with Asynchronous Objective Sampling | Gabriel Behrendt et.al. | 2411.11732 | null |
2024-11-18 | Moral Persuasion in Large Language Models: Evaluating Susceptibility and Ethical Alignment | Allison Huang et.al. | 2411.11731 | link |
2024-11-18 | TrojanRobot: Backdoor Attacks Against Robotic Manipulation in the Physical World | Xianlong Wang et.al. | 2411.11683 | null |
2024-11-18 | Artificial Scientific Discovery | Antonio Norelli et.al. | 2411.11672 | null |
2024-11-15 | Fair Division via the Cake-Cutting Share | Yannan Bai et.al. | 2411.10434 | null |
2024-11-15 | Evaluating Creativity and Deception in Large Language Models: A Simulation Framework for Multi-Agent Balderdash | Parsa Hejabi et.al. | 2411.10422 | link |
2024-11-15 | The Dawn of GUI Agent: A Preliminary Case Study with Claude 3.5 Computer Use | Siyuan Hu et.al. | 2411.10323 | link |
2024-11-15 | Static network structure cannot stabilize cooperation among Large Language Model agents | Jin Han et.al. | 2411.10294 | null |
2024-11-15 | Towards Sample-Efficiency and Generalization of Transfer and Inverse Reinforcement Learning: A Comprehensive Literature Review | Hossein Hassani et.al. | 2411.10268 | null |
2024-11-15 | Visual-Linguistic Agent: Towards Collaborative Contextual Object Reasoning | Jingru Yang et.al. | 2411.10252 | null |
2024-11-15 | An Empirical Study on LLM-based Agents for Automated Bug Fixing | Xiangxin Meng et.al. | 2411.10213 | null |
2024-11-15 | Agentic LLMs in the Supply Chain: Towards Autonomous Multi-Agent Consensus-Seeking | Valeria Jannelli et.al. | 2411.10184 | null |
2024-11-15 | Let people fail! Exploring the influence of explainable virtual and robotic agents in learning-by-doing tasks | Marco Matarese et.al. | 2411.10176 | null |
2024-11-15 | The Surprising Ineffectiveness of Pre-Trained Visual Representations for Model-Based Reinforcement Learning | Moritz Schneider et.al. | 2411.10175 | null |
2024-11-14 | Nash equilibrium seeking for a class of quadratic-bilinear Wasserstein distributionally robust games | Georgios Pantazis et.al. | 2411.09636 | null |
2024-11-14 | Navigating the Risks: A Survey of Security, Privacy, and Ethics Threats in LLM-Based Agents | Yuyou Gan et.al. | 2411.09523 | null |
2024-11-14 | Randomized Truthful Auctions with Learning Agents | Gagan Aggarwal et.al. | 2411.09517 | null |
2024-11-14 | Strategic Sacrifice: Self-Organized Robot Swarm Localization for Inspection Productivity | Sneha Ramshanker et.al. | 2411.09493 | null |
2024-11-14 | Socio-Economic Consequences of Generative AI: A Review of Methodological Approaches | Carlos J. Costa et.al. | 2411.09313 | null |
2024-11-14 | Embedding Space Allocation with Angle-Norm Joint Classifiers for Few-Shot Class-Incremental Learning | Dunwei Tu et.al. | 2411.09250 | null |
2024-11-14 | Risk-aware MPPI for Stochastic Hybrid Systems | Hardik Parwana et.al. | 2411.09198 | link |
2024-11-14 | Enhancing reinforcement learning for population setpoint tracking in co-cultures | Sebastián Espinel-Ríos et.al. | 2411.09177 | null |
2024-11-14 | Artificial Theory of Mind and Self-Guided Social Organisation | Michael S. Harré et.al. | 2411.09169 | null |
2024-11-14 | Theory of Mind Enhances Collective Intelligence | Michael S. Harré et.al. | 2411.09168 | null |
2024-11-13 | The Impact of Social Value Orientation on Nash Equilibria of Two Player Quadratic Games | Dan Calderone et.al. | 2411.08809 | null |
2024-11-13 | FinRobot: AI Agent for Equity Research and Valuation with Large Language Models | Tianyu Zhou et.al. | 2411.08804 | link |
2024-11-13 | Evaluating World Models with LLM for Decision Making | Chang Yang et.al. | 2411.08794 | null |
2024-11-13 | Towards Fair and Efficient Public Transportation: A Bus Stop Model | Martin Bullinger et.al. | 2411.08784 | link |
2024-11-13 | Logic-based Knowledge Awareness for Autonomous Agents in Continuous Spaces | Arabinda Ghosh et.al. | 2411.08754 | null |
2024-11-13 | Statistical Operating Characteristics of Current Early Phase Dose Finding Designs with Toxicity and Efficacy in Oncology | Hao Sun et.al. | 2411.08698 | null |
2024-11-13 | Inferring Parameter Distributions in Heterogeneous Motile Particle Ensembles: A Likelihood Approach for Second Order Langevin Models | Jan Albrecht et.al. | 2411.08692 | null |
2024-11-13 | Robot See, Robot Do: Imitation Reward for Noisy Financial Environments | Sven Goluža et.al. | 2411.08637 | null |
2024-11-13 | On the Application of Model Predictive Control to a Weighted Coverage Path Planning Problem | Kilian Schweppe et.al. | 2411.08634 | null |
2024-11-13 | NavAgent: Multi-scale Urban Street View Fusion For UAV Embodied Vision-and-Language Navigation | Youzhi Liu et.al. | 2411.08579 | null |
2024-11-12 | LLMPhy: Complex Physical Reasoning Using Large Language Models and World Models | Anoop Cherian et.al. | 2411.08027 | null |
2024-11-12 | Incentive Design with Spillovers | Krishna Dasaratha et.al. | 2411.08026 | null |
2024-11-12 | From General to Specific: Utilizing General Hallucation to Automatically Measure the Role Relationship Fidelity for Specific Role-Play Agents | Chuyi Kong et.al. | 2411.07965 | null |
2024-11-12 | Learning Memory Mechanisms for Decision Making through Demonstrations | William Yue et.al. | 2411.07954 | link |
2024-11-12 | RedCode: Risky Code Execution and Generation Benchmark for Code Agents | Chengquan Guo et.al. | 2411.07781 | link |
2024-11-12 | Efficiency of energy-consuming random walkers: Variability in energy helps | Mohsen Ghasemi Nezhadhaghighi et.al. | 2411.07771 | null |
2024-11-12 | Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows | Fangyu Lei et.al. | 2411.07763 | null |
2024-11-12 | Test Where Decisions Matter: Importance-driven Testing for Deep Reinforcement Learning | Stefan Pranger et.al. | 2411.07700 | null |
2024-11-12 | World Models: The Safety Perspective | Zifan Zeng et.al. | 2411.07690 | null |
2024-11-12 | Safe Exploitative Play with Untrusted Type Beliefs | Tongxin Li et.al. | 2411.07679 | null |
2024-11-11 | Tooling or Not Tooling? The Impact of Tools on Language Agents for Chemistry Problem Solving | Botao Yu et.al. | 2411.07228 | null |
2024-11-11 | Grounding Video Models to Actions through Goal Conditioned Exploration | Yunhao Luo et.al. | 2411.07223 | null |
2024-11-11 | ‘Explaining RL Decisions with Trajectories’: A Reproducibility Study | Karim Abdel Sadek et.al. | 2411.07200 | link |
2024-11-11 | Gradual Fine-Tuning with Graph Routing for Multi-Source Unsupervised Domain Adaptation | Yao Ma et.al. | 2411.07185 | null |
2024-11-11 | RoundTable: Investigating Group Decision-Making Mechanism in Multi-Agent Collaboration | Young-Min Cho et.al. | 2411.07161 | null |
2024-11-11 | Azurin-Based Peptide p28 Arrests the p53-HDM2 Interactions: A Novel Anti-Cancer Pathway | Albin Joy et.al. | 2411.07124 | null |
2024-11-11 | Learning Multi-Agent Collaborative Manipulation for Long-Horizon Quadrupedal Pushing | Chuye Hong et.al. | 2411.07104 | null |
2024-11-11 | Bounded Rationality Equilibrium Learning in Mean Field Games | Yannick Eich et.al. | 2411.07099 | link |
2024-11-11 | A Multi-Agent Approach for REST API Testing with Semantic Graphs and LLM-Driven Inputs | Myeongsoo Kim et.al. | 2411.07098 | null |
2024-11-11 | Differentially-Private Collaborative Online Personalized Mean Estimation | Yauhen Yakimenka et.al. | 2411.07094 | null |
2024-11-08 | Topology-aware Reinforcement Feature Space Reconstruction for Graph Data | Wangyang Ying et.al. | 2411.05742 | null |
2024-11-08 | A Retrospective on the Robot Air Hockey Challenge: Benchmarking Robust, Reliable, and Safe Learning Techniques for Real-world Robotics | Puze Liu et.al. | 2411.05718 | null |
2024-11-08 | Settling the Complexity of Popularity in Additively Separable and Fractional Hedonic Games | Martin Bullinger et.al. | 2411.05713 | null |
2024-11-08 | Data-Driven Distributed Common Operational Picture from Heterogeneous Platforms using Multi-Agent Reinforcement Learning | Indranil Sur et.al. | 2411.05683 | null |
2024-11-08 | The influence of persona and conversational task on social interactions with a LLM-controlled embodied conversational agent | Leon O. H. Kroczek et.al. | 2411.05653 | null |
2024-11-08 | LightVA: Lightweight Visual Analytics with LLM Agent-Based Task Planning and Execution | Yuheng Zhao et.al. | 2411.05651 | null |
2024-11-08 | Expectation vs. Reality: Towards Verification of Psychological Games | Marta Kwiatkowska et.al. | 2411.05599 | null |
2024-11-08 | Smart navigation through a rotating barrier: Deep reinforcement learning with application to size-based separation of active microagents | Mohammad Hossein Masoudi et.al. | 2411.05587 | null |
2024-11-08 | Tangled Program Graphs as an alternative to DRL-based control algorithms for UAVs | Hubert Szolc et.al. | 2411.05586 | link |
2024-11-08 | Parameterized Voter Relevance in Facility Location Games with Tree-Shaped Invitation Graphs | Ryoto Ando et.al. | 2411.05574 | null |
2024-11-07 | Few-Shot Task Learning through Inverse Generative Modeling | Aviv Netanyahu et.al. | 2411.04987 | null |
2024-11-07 | Noisy Zero-Shot Coordination: Breaking The Common Knowledge Assumption In Zero-Shot Coordination Games | Usman Anwar et.al. | 2411.04976 | link |
2024-11-07 | StoryAgent: Customized Storytelling Video Generation via Multi-Agent Collaboration | Panwen Hu et.al. | 2411.04925 | null |
2024-11-07 | OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models | Siming Huang et.al. | 2411.04905 | null |
2024-11-07 | Achieving superconductivity in infinite-layer nickelate thin films by aluminum sputtering deposition | Dongxin Zhang et.al. | 2411.04896 | null |
2024-11-07 | GUI Agents with Foundation Models: A Comprehensive Survey | Shuai Wang et.al. | 2411.04890 | null |
2024-11-07 | Think Smart, Act SMARL! Analyzing Probabilistic Logic Driven Safety in Multi-Agent Reinforcement Learning | Satchit Chatterji et.al. | 2411.04867 | link |
2024-11-07 | Robust Regulation of Labour Contracts | Théo Durandard et.al. | 2411.04841 | null |
2024-11-07 | Plasticity Loss in Deep Reinforcement Learning: A Survey | Timo Klein et.al. | 2411.04832 | null |
2024-11-07 | MPVO: Motion-Prior based Visual Odometry for PointGoal Navigation | Sayan Paul et.al. | 2411.04796 | null |
2024-11-06 | Predicting and Publishing Accurate Imbalance Prices Using Monte Carlo Tree Search | Fabio Pavirani et.al. | 2411.04011 | null |
2024-11-06 | Temporal Network Creation Games: The Impact of Non-Locality and Terminals | Davide Bilò et.al. | 2411.03973 | null |
2024-11-06 | Almost Time-Optimal Loosely-Stabilizing Leader Election on Arbitrary Graphs Without Identifiers in Population Protocols | Haruki Kanaya et.al. | 2411.03902 | null |
2024-11-06 | AdaSociety: An Adaptive Environment with Social Structures for Multi-Agent Decision-Making | Yizhe Huang et.al. | 2411.03865 | link |
2024-11-06 | Beyond The Rainbow: High Performance Deep Reinforcement Learning On A Desktop PC | Tyler Clark et.al. | 2411.03820 | link |
2024-11-06 | From Novice to Expert: LLM Agent Policy Optimization via Step-wise Reinforcement Learning | Zhirui Deng et.al. | 2411.03817 | null |
2024-11-06 | MRJ-Agent: An Effective Jailbreak Agent for Multi-Round Dialogue | Fengxiang Wang et.al. | 2411.03814 | null |
2024-11-06 | Hybrid Transfer Reinforcement Learning: Provable Sample Efficiency from Shifted-Dynamics Data | Chengrui Qu et.al. | 2411.03810 | link |
2024-11-06 | Multi-Modal Intelligent Channel Modeling: A New Modeling Paradigm via Synesthesia of Machines | Lu Bai et.al. | 2411.03711 | null |
2024-11-06 | Learn to Slice, Slice to Learn: Unveiling Online Optimization and Reinforcement Learning for Slicing AI Services | Amr Abo-eleneen et.al. | 2411.03686 | null |
2024-11-05 | SMoA: Improving Multi-agent Large Language Models with Sparse Mixture-of-Agents | Dawei Li et.al. | 2411.03284 | link |
2024-11-05 | Causal Responsibility Attribution for Human-AI Collaboration | Yahang Qi et.al. | 2411.03275 | link |
2024-11-05 | Spontaneous Emergence of Agent Individuality through Social Interactions in LLM-Based Communities | Ryosuke Takata et.al. | 2411.03252 | null |
2024-11-05 | Troll Farms | Philipp Denter et.al. | 2411.03241 | null |
2024-11-05 | A resolved Lyman-Alpha profile with doubly peaked emission at z~7 | C. Moya-Sierralta et.al. | 2411.03222 | null |
2024-11-05 | GIS Copilot: Towards an Autonomous GIS Agent for Spatial Analysis | Temitope Akinboyewa et.al. | 2411.03205 | link |
2024-11-05 | Online Data Collection for Efficient Semiparametric Inference | Shantanu Gupta et.al. | 2411.03195 | link |
2024-11-05 | Hierarchical Orchestra of Policies | Thomas P Cannon et.al. | 2411.03008 | null |
2024-11-05 | Accelerating Task Generalisation with Multi-Level Hierarchical Options | Thomas P Cannon et.al. | 2411.02998 | null |
2024-11-05 | Transformer-Based Fault-Tolerant Control for Fixed-Wing UAVs Using Knowledge Distillation and In-Context Adaptation | Francisco Giral et.al. | 2411.02975 | null |
2024-11-04 | Attacking Vision-Language Computer Agents via Pop-ups | Yanzhe Zhang et.al. | 2411.02391 | link |
2024-11-04 | Two-Sided Learning in Decentralized Matching Markets | Vade Shah et.al. | 2411.02377 | null |
2024-11-04 | Social-RAG: Retrieving from Group Interactions to Socially Ground Proactive AI Generation to Group Preferences | Ruotong Wang et.al. | 2411.02353 | null |
2024-11-04 | WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning | Zehan Qi et.al. | 2411.02337 | link |
2024-11-04 | CRMArena: Understanding the Capacity of LLM Agents to Perform Professional CRM Tasks in Realistic Environments | Kung-Hsiang Huang et.al. | 2411.02305 | link |
2024-11-04 | Kinetic exchange opinion dynamics for the battleground-states in the 2024 US presidential elections | Soumyajyoti Biswas et.al. | 2411.02240 | null |
2024-11-04 | Positive Experience Reflection for Agents in Interactive Text Environments | Philip Lippmann et.al. | 2411.02223 | null |
2024-11-04 | CryptoEL: A Novel Experiential Learning Tool for Enhancing K-12 Cryptography Education | Pranathi Rayavaram et.al. | 2411.02143 | null |
2024-11-04 | Foundations and Recent Trends in Multimodal Mobile Agents: A Survey | Biao Wu et.al. | 2411.02006 | link |
2024-11-04 | Taking AI Welfare Seriously | Robert Long et.al. | 2411.00986 | null |
2024-10-31 | Teaching Embodied Reinforcement Learning Agents: Informativeness and Diversity of Language Use | Jiajun Xi et.al. | 2410.24218 | link |
2024-10-31 | DexMimicGen: Automated Data Generation for Bimanual Dexterous Manipulation via Imitation Learning | Zhenyu Jiang et.al. | 2410.24185 | null |
2024-10-31 | Language-Driven Policy Distillation for Cooperative Driving in Multi-Agent Reinforcement Learning | Jiaqi Liu et.al. | 2410.24152 | null |
2024-10-31 | Q-learning for Quantile MDPs: A Decomposition, Performance, and Convergence Analysis | Jia Lin Hau et.al. | 2410.24128 | link |
2024-10-31 | Progressive Safeguards for Safe and Model-Agnostic Reinforcement Learning | Nabil Omi et.al. | 2410.24096 | null |
2024-10-31 | Navigating the Unknown: A Chat-Based Collaborative Interface for Personalized Exploratory Tasks | Yingzhe Peng et.al. | 2410.24032 | null |
2024-10-31 | AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents | Yifan Xu et.al. | 2410.24024 | link |
2024-10-31 | Optimal control problems driven by nonlinear degenerate Fokker-Planck equations | Francesca Anceschi et.al. | 2410.24000 | null |
2024-10-31 | Persuading a Credible Agent | Jiarui Gan et.al. | 2410.23989 | null |
2024-10-31 | Fair Division of Chores with Budget Constraints | Edith Elkind et.al. | 2410.23979 | null |
2024-10-30 | Proportional Fairness in Non-Centroid Clustering | Ioannis Caragiannis et.al. | 2410.23273 | null |
2024-10-30 | Evaluating Cultural and Social Awareness of LLM Web Agents | Haoyi Qiu et.al. | 2410.23252 | null |
2024-10-30 | Carrot and Stick: Eliciting Comparison Data and Beyond | Yiling Chen et.al. | 2410.23243 | null |
2024-10-30 | A little less conversation, a little more action, please: Investigating the physical common-sense of LLMs in a 3D embodied environment | Matteo G. Mecattaf et.al. | 2410.23242 | link |
2024-10-31 | Aligning Audio-Visual Joint Representations with an Agentic Workflow | Shentong Mo et.al. | 2410.23230 | null |
2024-10-30 | OS-ATLAS: A Foundation Action Model for Generalist GUI Agents | Zhiyong Wu et.al. | 2410.23218 | link |
2024-10-30 | Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks | Michael Matthews et.al. | 2410.23208 | link |
2024-10-30 | VisualPredicator: Learning Abstract World Models with Neuro-Symbolic Predicates for Robot Planning | Yichao Liang et.al. | 2410.23156 | null |
2024-10-30 | Fair Division with Market Values | Siddharth Barman et.al. | 2410.23137 | null |
2024-10-30 | First Place Solution to the ECCV 2024 ROAD++ Challenge @ ROAD++ Spatiotemporal Agent Detection 2024 | Tengfei Zhang et.al. | 2410.23077 | null |
2024-10-29 | Environment as Policy: Learning to Race in Unseen Tracks | Hongze Wang et.al. | 2410.22308 | null |
2024-10-29 | Flow-DPO: Improving LLM Mathematical Reasoning through Online Multi-Agent Learning | Yihe Deng et.al. | 2410.22304 | null |
2024-10-29 | Fourier Head: Helping Large Language Models Learn Complex Probability Distributions | Nate Gillman et.al. | 2410.22269 | null |
2024-10-29 | RingSim- An Agent-based Approach for Modelling Mesoscopic Magnetic Nanowire Networks | Ian T Vidamour et.al. | 2410.22204 | null |
2024-10-29 | Democratizing Reward Design for Personal and Representative Value-Alignment | Carter Blair et.al. | 2410.22203 | null |
2024-10-29 | ADAM: An Embodied Causal Agent in Open-World Environments | Shu Yu et.al. | 2410.22194 | null |
2024-10-29 | EconoJax: A Fast & Scalable Economic Simulation in Jax | Koen Ponse et.al. | 2410.22165 | link |
2024-10-29 | Improving Performance of Commercially Available AI Products in a Multi-Agent Configuration | Cory Hymel et.al. | 2410.22129 | null |
2024-10-29 | Inverse Design Method with Enhanced Sampling for Complex Open Crystals: Application to Novel Zeolite Self-Assembly in a Coarse-Grained Model | Chaohong Wang et.al. | 2410.22111 | null |
2024-10-29 | An LLM-based Simulation Framework for Embodied Conversational Agents in Psychological Counseling | Lixiu Wu et.al. | 2410.22041 | link |
2024-10-28 | Capacity-Aware Planning and Scheduling in Budget-Constrained Monotonic MDPs: A Meta-RL Approach | Manav Vora et.al. | 2410.21249 | null |
2024-10-28 | Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines | Zhixin Zhang et.al. | 2410.21220 | link |
2024-10-28 | Magnetic Milli-spinner for Robotic Endovascular Surgery | Shuai Wu et.al. | 2410.21112 | null |
2024-10-28 | Dual-Agent Deep Reinforcement Learning for Dynamic Pricing and Replenishment | Yi Zheng et.al. | 2410.21109 | null |
2024-10-28 | LiGAR: LiDAR-Guided Hierarchical Transformer for Multi-Modal Group Activity Recognition | Naga Venkata Sai Raviteja Chappa et.al. | 2410.21108 | null |
2024-10-28 | Topological Identification of Agent Status in Information Contagions: Application to Financial Markets | Anubha Goel et.al. | 2410.21104 | link |
2024-10-28 | Automatic Generation of Benchmarks and Reliable LLM Judgment for Code Tasks | Eitan Farchi et.al. | 2410.21071 | null |
2024-10-28 | CRAT: A Multi-Agent Framework for Causality-Enhanced Reflective and Retrieval-Augmented Translation with Large Language Models | Meiqi Chen et.al. | 2410.21067 | null |
2024-10-28 | Getting By Goal Misgeneralization With a Little Help From a Mentor | Tu Trinh et.al. | 2410.21052 | null |
2024-10-28 | FairStream: Fair Multimedia Streaming Benchmark for Reinforcement Learning Agents | Jannis Weil et.al. | 2410.21029 | link |
2024-10-25 | FISHNET: Financial Intelligence from Sub-querying, Harmonizing, Neural-Conditioning, Expert Swarms, and Task Planning | Nicole Cho et.al. | 2410.19727 | null |
2024-10-25 | Evolving Neural Networks Reveal Emergent Collective Behavior from Minimal Agent Interactions | Guilherme S. Y. Giardini et.al. | 2410.19718 | null |
2024-10-25 | Adversarial Environment Design via Regret-Guided Diffusion Models | Hojun Chung et.al. | 2410.19715 | null |
2024-10-25 | Robust Thompson Sampling Algorithms Against Reward Poisoning Attacks | Yinglun Xu et.al. | 2410.19705 | null |
2024-10-25 | AGENT-CQ: Automatic Generation and Evaluation of Clarifying Questions for Conversational Search with LLMs | Clemencia Siro et.al. | 2410.19692 | null |
2024-10-25 | The Sound of Silence in Social Networks | Jesús Aranda et.al. | 2410.19685 | null |
2024-10-25 | Optimizing Hearthstone Agents using an Evolutionary Algorithm | Pablo García-Sánchez et.al. | 2410.19681 | link |
2024-10-25 | MetaTrading: An Immersion-Aware Model Trading Framework for Vehicular Metaverse Services | Hongjia Wu et.al. | 2410.19665 | null |
2024-10-25 | Planning-Aware Diffusion Networks for Enhanced Motion Forecasting in Autonomous Driving | Liu Yunhao et.al. | 2410.19639 | null |
2024-10-25 | Knowledge Graph Enhanced Language Agents for Recommendation | Taicheng Guo et.al. | 2410.19627 | null |
2024-10-24 | Learning to Look: Seeking Information for Decision Making via Policy Factorization | Shivin Dass et.al. | 2410.18964 | null |
2024-10-24 | OSCAR: Operating System Control via State-Aware Reasoning and Re-Planning | Xiaoqiang Wang et.al. | 2410.18963 | null |
2024-10-24 | Schema-Guided Culture-Aware Complex Event Simulation with Multi-Agent Role-Play | Sha Li et.al. | 2410.18935 | null |
2024-10-24 | SkillMimicGen: Automated Demonstration Generation for Efficient Skill Learning and Deployment | Caelan Garrett et.al. | 2410.18907 | null |
2024-10-24 | Improving Small-Scale Large Language Models Function Calling for Reasoning Tasks | Graziano A. Manduzio et.al. | 2410.18890 | null |
2024-10-24 | Learning Collusion in Episodic, Inventory-Constrained Markets | Paul Friedrich et.al. | 2410.18871 | link |
2024-10-25 | An LLM Agent for Automatic Geospatial Data Analysis | Yuxing Chen et.al. | 2410.18792 | null |
2024-10-24 | Applying Neural Monte Carlo Tree Search to Unsignalized Multi-intersection Scheduling for Autonomous Vehicles | Yucheng Shi et.al. | 2410.18786 | null |
2024-10-24 | Active Target Tracking Using Bearing-only Measurements With Gaussian Process Learning | Yingbo Fu et.al. | 2410.18669 | null |
2024-10-24 | Approximate EFX and Exact tEFX Allocations for Indivisible Chores: Improved Algorithms | Mahyar Afshinmehr et.al. | 2410.18655 | null |
2024-10-23 | Prioritized Generative Replay | Renhao Wang et.al. | 2410.18082 | null |
2024-10-23 | The Double-Edged Sword of Behavioral Responses in Strategic Classification: Theory and User Studies | Raman Ebrahimi et.al. | 2410.18066 | null |
2024-10-23 | SPIRE: Synergistic Planning, Imitation, and Reinforcement Learning for Long-Horizon Manipulation | Zihan Zhou et.al. | 2410.18065 | null |
2024-10-23 | A Comparative Assessment of Technology Acceptance and Learning Outcomes in Computer-based versus VR-based Pedagogical Agents | Aimilios Hadjiliasi et.al. | 2410.18048 | null |
2024-10-23 | GraphTeam: Facilitating Large Language Model-based Graph Analysis via Multi-Agent Collaboration | Xin Li et.al. | 2410.18032 | link |
2024-10-23 | MiniFed : Integrating LLM-based Agentic-Workflow for Simulating FOMC Meeting | Sungil Seok et.al. | 2410.18012 | null |
2024-10-23 | Dynamic models of gentrification | Giovanni Mauro et.al. | 2410.18004 | null |
2024-10-23 | POMDP-Driven Cognitive Massive MIMO Radar: Joint Target Detection-Tracking In Unknown Disturbances | Imad Bouhou et.al. | 2410.17967 | null |
2024-10-23 | On Regularity and Normalization in Sequential Screening | Ian Ball et.al. | 2410.17962 | null |
2024-10-23 | Guide for Defense (G4D): Dynamic Guidance for Robust and Balanced Defense in Large Language Models | He Cao et.al. | 2410.17922 | link |
2024-10-22 | SELA: Tree-Search Enhanced LLM Agents for Automated Machine Learning | Yizhou Chi et.al. | 2410.17238 | link |
2024-10-22 | Large Language Models Empowered Personalized Web Agents | Hongru Cai et.al. | 2410.17236 | null |
2024-10-22 | Responsibility in a Multi-Value Strategic Setting | Timothy Parker et.al. | 2410.17229 | null |
2024-10-22 | Scalable spectral representations for network multiagent control | Zhaolin Ren et.al. | 2410.17221 | null |
2024-10-23 | Non-myopic Generation of Language Model for Reasoning and Planning | Chang Ma et.al. | 2410.17195 | link |
2024-10-22 | DyPNIPP: Predicting Environment Dynamics for RL-based Robust Informative Path Planning | Srujan Deolasee et.al. | 2410.17186 | null |
2024-10-22 | Layered LA-MAPF: a decomposition of large agent MAPF instance to accelerate solving without compromising solvability | Zhuo Yao et.al. | 2410.17160 | link |
2024-10-22 | Mechanistic interplay between information spreading and opinion polarization | Kleber A. Oliveira et.al. | 2410.17151 | null |
2024-10-22 | Advancing lunar exploration through virtual reality simulations: a framework for future human missions | Giacomo Franchini et.al. | 2410.17132 | null |
2024-10-22 | Exploration and Persuasion | Aleksandrs Slivkins et.al. | 2410.17086 | null |
2024-10-21 | Agent-to-Sim: Learning Interactive Behavior Models from Casual Longitudinal Videos | Gengshan Yang et.al. | 2410.16259 | null |
2024-10-21 | IBGP: Imperfect Byzantine Generals Problem for Zero-Shot Robustness in Communicative Multi-Agent Systems | Yihuan Mao et.al. | 2410.16237 | null |
2024-10-21 | Sketch2Code: Evaluating Vision-Language Models for Interactive Web Design Prototyping | Ryan Li et.al. | 2410.16232 | null |
2024-10-21 | Role of obstacle softness in the diffusive behavior of active Particles | Ankit Gupta et.al. | 2410.16223 | null |
2024-10-21 | CoT-TL: Low-Resource Temporal Knowledge Representation of Planning Instructions Using Chain-of-Thought Reasoning | Kumar Manas et.al. | 2410.16207 | link |
2024-10-22 | LASER: Script Execution by Autonomous Agents for On-demand Traffic Simulation | Hao Gao et.al. | 2410.16197 | link |
2024-10-21 | Spiking Neural Networks as a Controller for Emergent Swarm Agents | Kevin Zhu et.al. | 2410.16175 | null |
2024-10-21 | A Troublemaker with Contagious Jailbreak Makes Chaos in Honest Towns | Tianyi Men et.al. | 2410.16155 | null |
2024-10-21 | AdChain: Decentralized Header Bidding | Behkish Nassirzadeh et.al. | 2410.16141 | null |
2024-10-21 | Constrained Truthful Obnoxious Two-Facility Location with Optional Preferences | Panagiotis Kanellopoulos et.al. | 2410.16131 | null |
2024-10-18 | Teaching Models to Balance Resisting and Accepting Persuasion | Elias Stengel-Eskin et.al. | 2410.14596 | link |
2024-10-18 | Toolshed: Scale Tool-Equipped Agents with Advanced RAG-Tool Fusion and Tool Knowledge Bases | Elias Lumer et.al. | 2410.14594 | null |
2024-10-18 | Temporal Fair Division of Indivisible Items | Edith Elkind et.al. | 2410.14593 | null |
2024-10-18 | Neuro-Symbolic Traders: Assessing the Wisdom of AI Crowds in Markets | Namid R. Stillman et.al. | 2410.14587 | null |
2024-10-18 | Neural Combinatorial Clustered Bandits for Recommendation Systems | Baran Atalar et.al. | 2410.14586 | null |
2024-10-18 | Do LLMs estimate uncertainty well in instruction-following? | Juyeon Heo et.al. | 2410.14582 | link |
2024-10-18 | When LLMs Go Online: The Emerging Threat of Web-Enabled LLMs | Hanna Kim et.al. | 2410.14569 | null |
2024-10-18 | RAG-ConfusionQA: A Benchmark for Evaluating LLMs on Confusing Questions | Zhiyuan Peng et.al. | 2410.14567 | link |
2024-10-18 | Performance bounds for multi-vehicle networks with local integrators | Jonas Hansson et.al. | 2410.14525 | null |
2024-10-18 | Do LLMs “know” internally when they follow instructions? | Juyeon Heo et.al. | 2410.14516 | link |
2024-10-17 | VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding | Runsen Xu et.al. | 2410.13860 | link |
2024-10-17 | AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agents | Ke Yang et.al. | 2410.13825 | null |
2024-10-18 | Harnessing Webpage UIs for Text-Rich Visual Understanding | Junpeng Liu et.al. | 2410.13824 | null |
2024-10-17 | Rapid and Automated Alloy Design with Graph Neural Network-Powered LLM-Driven Multi-Agent Systems | Alireza Ghafarollahi et.al. | 2410.13768 | null |
2024-10-17 | MobA: A Two-Level Agent System for Efficient Mobile Task Automation | Zichen Zhu et.al. | 2410.13757 | link |
2024-10-17 | Interacting humans and robots can improve sensory prediction by adapting their viscoelasticity | Xiaoxiao Cheng et.al. | 2410.13755 | null |
2024-10-17 | Real Eventual Exponential Positivity of Complex-valued Laplacians: Applications to Consensus in Multi-agent Systems | Aditi Saxena et.al. | 2410.13700 | null |
2024-10-17 | ORCHID: A Chinese Debate Corpus for Target-Independent Stance Detection and Argumentative Dialogue Summarization | Xiutian Zhao et.al. | 2410.13667 | link |
2024-10-17 | A Comparative Study on Reasoning Patterns of OpenAI’s o1 Model | Siwei Wu et.al. | 2410.13639 | link |
2024-10-17 | Phenotype structuring in collective cell migration:a tutorial of mathematical models and methods | Tommaso Lorenzi et.al. | 2410.13629 | null |
2024-10-16 | JudgeBench: A Benchmark for Evaluating LLM-based Judges | Sijun Tan et.al. | 2410.12784 | link |
2024-10-16 | Prophet Upper Bounds for Online Matching and Auctions | José Soto et.al. | 2410.12756 | null |
2024-10-16 | HEnRY: A Multi-Agent System Framework for Multi-Domain Contexts | Emmanuele Lacavalla et.al. | 2410.12720 | link |
2024-10-16 | A comparative analysis of metamodels for lumped cardiovascular models, and pipeline for sensitivity analysis, parameter estimation, and uncertainty quantification | John M. Hanna et.al. | 2410.12654 | null |
2024-10-16 | Hybrid Decision Making for Scalable Multi-Agent Navigation: Integrating Semantic Maps, Discrete Coordination, and Model Predictive Control | Koen de Vos et.al. | 2410.12651 | null |
2024-10-16 | Zeroth-Order Feedback Optimization in Multi-Agent Systems: Tackling Coupled Constraints | Yingpeng Duan et.al. | 2410.12647 | null |
2024-10-16 | Dynamic Learning Rate for Deep Reinforcement Learning: A Bandit Approach | Henrique Donâncio et.al. | 2410.12598 | null |
2024-10-16 | Robust RL with LLM-Driven Data Synthesis and Policy Adaptation for Autonomous Driving | Sihao Wu et.al. | 2410.12568 | null |
2024-10-16 | A Communication Consistent Approach to Signal Temporal Logic Task Decomposition in Multi-Agent Systems | Gregorio Marchesini et.al. | 2410.12563 | null |
2024-10-16 | Nash equilibria in scalar discrete-time linear quadratic games | Giulio Salizzoni et.al. | 2410.12544 | null |
2024-10-15 | Molecular Quantum Control Algorithm Design by Reinforcement Learning | Anastasia Pipi et.al. | 2410.11839 | null |
2024-10-15 | G-Designer: Architecting Multi-agent Communication Topologies via Graph Neural Networks | Guibin Zhang et.al. | 2410.11782 | null |
2024-10-15 | BlendRL: A Framework for Merging Symbolic and Neural Policy Learning | Hikaru Shindo et.al. | 2410.11689 | null |
2024-10-15 | Optimal Mediation Mechanisms in Bilateral Trade | Zhikang Fan et.al. | 2410.11683 | null |
2024-10-15 | Safety Filtering While Training: Improving the Performance and Sample Efficiency of Reinforcement Learning Agents | Federico Pizarro Bejarano et.al. | 2410.11671 | link |
2024-10-15 | Markov-Nash equilibria in mean-field games under model uncertainty | Johannes Langner et.al. | 2410.11652 | null |
2024-10-15 | Improve Value Estimation of Q Function and Reshape Reward with Monte Carlo Tree Search | Jiamian Li et.al. | 2410.11642 | null |
2024-10-15 | Findings of the WMT 2024 Shared Task on Chat Translation | Wafaa Mohammed et.al. | 2410.11624 | null |
2024-10-15 | Temporal Hyperproperties for Population Protocols | Nicolas Waldburger et.al. | 2410.11572 | null |
2024-10-15 | Demo: Testing AI-driven MAC Learning in Autonomic Networks | Leonard Paeleke et.al. | 2410.11565 | null |
2024-10-14 | AFlow: Automating Agentic Workflow Generation | Jiayi Zhang et.al. | 2410.10762 | link |
2024-10-14 | Denial-of-Service Poisoning Attacks against Large Language Models | Kuofeng Gao et.al. | 2410.10760 | link |
2024-10-14 | DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model | Yuqi Wang et.al. | 2410.10738 | null |
2024-10-14 | Online Statistical Inference for Time-varying Sample-averaged Q-learning | Saunak Kumar Panda et.al. | 2410.10737 | null |
2024-10-14 | Enhancing Robustness in Deep Reinforcement Learning: A Lyapunov Exponent Approach | Rory Young et.al. | 2410.10674 | null |
2024-10-14 | Transforming Game Play: A Comparative Study of DCQN and DTQN Architectures in Reinforcement Learning | William A. Stigall et.al. | 2410.10660 | null |
2024-10-14 | Intelligent prospector v2.0: exploration drill planning under epistemic model uncertainty | John Mern et.al. | 2410.10610 | null |
2024-10-14 | STACKFEED: Structured Textual Actor-Critic Knowledge Base Editing with FeedBack | Naman Gupta et.al. | 2410.10584 | null |
2024-10-14 | Consensus in Multiagent Systems with lack of connection | Mohamed Bentaibi et.al. | 2410.10486 | null |
2024-10-14 | Compositional Shielding and Reinforcement Learning for Multi-Agent Systems | Asger Horn Brorholt et.al. | 2410.10460 | null |
2024-10-11 | PEAR: A Robust and Flexible Automation Framework for Ptychography Enabled by Multiple Large Language Model Agents | Xiangyu Yin et.al. | 2410.09034 | link |
2024-10-11 | AgentHarm: A Benchmark for Measuring Harmfulness of LLM Agents | Maksym Andriushchenko et.al. | 2410.09024 | null |
2024-10-11 | From Interaction to Impact: Towards Safer AI Agents Through Understanding and Evaluating UI Operation Impacts | Zhuohao Jerry Zhang et.al. | 2410.09006 | null |
2024-10-11 | Cyclic jetting enables microbubble-mediated drug delivery | Marco Cattaneo et.al. | 2410.08990 | null |
2024-10-11 | Best-of-Both-Worlds Fairness of the Envy-Cycle-Elimination Algorithm | Jugal Garg et.al. | 2410.08986 | null |
2024-10-11 | Optimal Allocation with Peer Information | Axel Niemeyer et.al. | 2410.08954 | null |
2024-10-11 | Transferable Belief Model on Quantum Circuits | Qianli Zhou et.al. | 2410.08949 | null |
2024-10-11 | The Dynamics of Social Conventions in LLM populations: Spontaneous Emergence, Collective Biases and Tipping Points | Ariel Flint Ashery et.al. | 2410.08948 | null |
2024-10-11 | Hyperspectral fluorescence imaging using a high-speed silicon photomultiplier array | Chi Z. Huang et.al. | 2410.08936 | null |
2024-10-11 | MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL | Claas A Voelcker et.al. | 2410.08896 | null |
2024-10-10 | Agent S: An Open Agentic Framework that Uses Computers Like a Human | Saaket Agashe et.al. | 2410.08164 | link |
2024-10-10 | DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory | Yutong Wang et.al. | 2410.08143 | link |
2024-10-10 | SoundScape: A Human-AI Co-Creation System Making Your Memories Heard | Chongjun Zhong et.al. | 2410.08136 | null |
2024-10-10 | Steering Masked Discrete Diffusion Models via Discrete Denoising Posterior Prediction | Jarrid Rector-Brooks et.al. | 2410.08134 | null |
2024-10-10 | Mars: Situated Inductive Reasoning in an Open-World Environment | Xiaojuan Tang et.al. | 2410.08126 | null |
2024-10-10 | Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System | Weize Chen et.al. | 2410.08115 | null |
2024-10-10 | Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining | Tianyi Bai et.al. | 2410.08102 | link |
2024-10-10 | Agent-based modeling for realistic reproduction of human mobility and contact behavior to evaluate test and isolation strategies in epidemic infectious disease spread | David Kerkmann et.al. | 2410.08050 | link |
2024-10-10 | Strategic Classification With Externalities | Yiling Chen et.al. | 2410.08032 | null |
2024-10-10 | Probabilistic Satisfaction of Temporal Logic Constraints in Reinforcement Learning via Adaptive Policy-Switching | Xiaoshan Lin et.al. | 2410.08022 | null |
2024-10-09 | Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making | Manling Li et.al. | 2410.07166 | link |
2024-10-09 | Spatiotemporal Modeling and Forecasting at Scale with Dynamic Generalized Linear Models | Pranay Pherwani et.al. | 2410.07161 | null |
2024-10-09 | I Want to Break Free! Anti-Social Behavior and Persuasion Ability of LLMs in Multi-Agent Settings with Social Hierarchy | Gian Maria Campedelli et.al. | 2410.07109 | link |
2024-10-09 | Identifying and Addressing Delusions for Target-Directed Decision-Making | Mingde Zhao et.al. | 2410.07096 | link |
2024-10-09 | MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering | Jun Shern Chan et.al. | 2410.07095 | link |
2024-10-10 | Towards Realistic UAV Vision-Language Navigation: Platform, Benchmark, and Methodology | Xiangyu Wang et.al. | 2410.07087 | null |
2024-10-09 | MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses | Zonglin Yang et.al. | 2410.07076 | link |
2024-10-09 | Retrieval-Augmented Decision Transformer: External Memory for In-context RL | Thomas Schmied et.al. | 2410.07071 | link |
2024-10-09 | Mechanism Design for Exchange Markets | Yusen Zheng et.al. | 2410.07023 | null |
2024-10-09 | Seeker: Enhancing Exception Handling in Code with LLM-based Multi-Agent Approach | Xuanming Zhang et.al. | 2410.06949 | link |
2024-10-07 | Grounding Partially-Defined Events in Multimodal Data | Kate Sanders et.al. | 2410.05267 | null |
2024-10-07 | GLEE: A Unified Framework and Benchmark for Language-based Economic Environments | Eilam Shapira et.al. | 2410.05254 | link |
2024-10-07 | Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents | Boyu Gou et.al. | 2410.05243 | link |
2024-10-07 | Scalable and Accurate Graph Reasoning with LLM-based Multi-Agents | Yuwei Hu et.al. | 2410.05130 | null |
2024-10-08 | Last Iterate Convergence in Monotone Mean Field Games | Noboru Isobe et.al. | 2410.05127 | null |
2024-10-07 | ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery | Ziru Chen et.al. | 2410.05080 | null |
2024-10-07 | Extended Functional Representation Lemma: A Tool For Privacy, Semantic Representation, Caching, and Compression Design | Amirreza Zamani et.al. | 2410.05033 | null |
2024-10-07 | Active Fine-Tuning of Generalist Policies | Marco Bagatella et.al. | 2410.05026 | null |
2024-10-07 | Contest design with a finite type-space: A unifying approach | Andrzej Baranski et.al. | 2410.04970 | null |
2024-10-07 | Training Interactive Agent in Large FPS Game Map with Rule-enhanced Reinforcement Learning | Chen Zhang et.al. | 2410.04936 | null |
2024-10-04 | Open-World Reinforcement Learning over Long Short-Term Imagination | Jiajian Li et.al. | 2410.03618 | link |
2024-10-04 | Never Mind The No-Ops: Faster and Less Volatile Simulation Modelling of Co-Evolutionary Species Interactions via Spatial Cyclic Games | Dave Cliff et.al. | 2410.03586 | link |
2024-10-04 | Training on more Reachable Tasks for Generalisation in Reinforcement Learning | Max Weltevrede et.al. | 2410.03565 | null |
2024-10-04 | Steering Large Language Models between Code Execution and Textual Reasoning | Yongchao Chen et.al. | 2410.03524 | null |
2024-10-04 | Tournament versus Circulant: On Simulating 7-Species Evolutionary Spatial Cyclic Games with Ablated Predator-Prey Networks as Models of Biodiversity | Dave Cliff et.al. | 2410.03518 | link |
2024-10-04 | MO-DDN: A Coarse-to-Fine Attribute-based Exploration Agent for Multi-object Demand-driven Navigation | Hongcheng Wang et.al. | 2410.03488 | null |
2024-10-04 | VEDIT: Latent Prediction Architecture For Procedural Video Representation Learning | Han Lin et.al. | 2410.03478 | null |
2024-10-04 | MLLM as Retriever: Interactively Learning Multimodal Retrieval for Embodied Agents | Junpeng Yue et.al. | 2410.03450 | null |
2024-10-04 | Attainable Force Approximation and Full-Pose Tracking Control of an Over-Actuated Thrust-Vectoring Modular Team UAV | Yen-Cheng Chu et.al. | 2410.03445 | null |
2024-10-04 | ToolGen: Unified Tool Retrieval and Calling via Generation | Renxi Wang et.al. | 2410.03439 | link |
2024-10-03 | ReLIC: A Recipe for 64k Steps of In-Context Reinforcement Learning for Embodied AI | Ahmad Elawady et.al. | 2410.02751 | link |
2024-10-03 | Grounding Large Language Models In Embodied Environment With Imperfect World Models | Haolan Liu et.al. | 2410.02742 | null |
2024-10-03 | DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects | Zhaowei Wang et.al. | 2410.02730 | link |
2024-10-03 | Domain-Specific Retrieval-Augmented Generation Using Vector Stores, Knowledge Graphs, and Tensor Factorization | Ryan C. Barron et.al. | 2410.02721 | null |
2024-10-03 | Grounded Answers for Multi-agent Decision-making Problem through Generative World Model | Zeyang Liu et.al. | 2410.02664 | null |
2024-10-03 | Undesirable Memorization in Large Language Models: A Survey | Ali Satvaty et.al. | 2410.02650 | null |
2024-10-04 | Learning 3D Perception from Others’ Predictions | Jinsu Yoo et.al. | 2410.02646 | null |
2024-10-03 | Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based Agents | Hanrong Zhang et.al. | 2410.02644 | link |
2024-10-03 | Beyond Expected Returns: A Policy Gradient Algorithm for Cumulative Prospect Theoretic Reinforcement Learning | Olivier Lepel et.al. | 2410.02605 | null |
2024-10-03 | Agents’ Room: Narrative Generation through Multi-step Collaboration | Fantine Huot et.al. | 2410.02603 | link |
2024-10-02 | Windowed MAPF with Completeness Guarantees | Rishi Veerapaneni et.al. | 2410.01798 | null |
2024-10-02 | Open Human-Robot Collaboration using Decentralized Inverse Reinforcement Learning | Prasanth Sengadu Suresh et.al. | 2410.01790 | null |
2024-10-02 | Social coordination perpetuates stereotypic expectations and behaviors across generations in deep multi-agent reinforcement learning | Rebekah A. Gelpí et.al. | 2410.01763 | null |
2024-10-02 | PreND: Enhancing Intrinsic Motivation in Reinforcement Learning through Pre-trained Network Distillation | Mohammadamin Davoodabadi et.al. | 2410.01745 | null |
2024-10-02 | Mimicking Human Intuition: Cognitive Belief-Driven Q-Learning | Xingrui Gu et.al. | 2410.01739 | null |
2024-10-02 | Performant, Memory Efficient and Scalable Multi-Agent Reinforcement Learning | Omayma Mahjoub et.al. | 2410.01706 | null |
2024-10-02 | Stable Offline Value Function Learning with Bisimulation-based Representations | Brahma S. Pavse et.al. | 2410.01643 | null |
2024-10-02 | Moral Alignment for LLM Agents | Elizaveta Tennant et.al. | 2410.01639 | null |
2024-10-02 | Entropy-Based Uncertainty Modeling for Trajectory Prediction in Autonomous Driving | Aron Distelzweig et.al. | 2410.01628 | null |
2024-10-02 | Automated Red Teaming with GOAT: the Generative Offensive Agent Tester | Maya Pavlova et.al. | 2410.01606 | null |
2024-09-30 | LaMMA-P: Generalizable Multi-Agent Long-Horizon Task Allocation and Planning with LM-Driven PDDL Planner | Xiaopan Zhang et.al. | 2409.20560 | null |
2024-09-30 | Propose, Assess, Search: Harnessing LLMs for Goal-Oriented Planning in Instructional Videos | Md Mohaiminul Islam et.al. | 2409.20557 | null |
2024-09-30 | Direct Multipath-Based SLAM | Mingchao Liang et.al. | 2409.20552 | null |
2024-09-30 | COLLAGE: Collaborative Human-Agent Interaction Generation using Hierarchical Latent Diffusion and Language Models | Divyanshu Daiya et.al. | 2409.20502 | null |
2024-09-30 | Impartial Selection Under Combinatorial Constraints | Javier Cembrano et.al. | 2409.20477 | null |
2024-09-30 | Facility Location Games with Competitors | Cheng Peng et.al. | 2409.20396 | null |
2024-09-30 | Machine Learning-enabled Traffic Steering in O-RAN: A Case Study on Hierarchical Learning Approach | Md Arafat Habib et.al. | 2409.20391 | null |
2024-09-30 | Efficient Driving Behavior Narration and Reasoning on Edge Device Using Large Language Models | Yizhou Huang et.al. | 2409.20364 | null |
2024-09-30 | A mean field Jacobi process for modeling sustainable tourism | Hidekazu Yoshioka et.al. | 2409.20347 | null |
2024-09-30 | MARLadona – Towards Cooperative Team Play Using Multi-Agent Reinforcement Learning | Zichong Li et.al. | 2409.20326 | null |
2024-09-27 | Mean-Field Control Barrier Functions: A Framework for Real-Time Swarm Control | Samy Wu Fung et.al. | 2409.18945 | null |
2024-09-27 | Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models | Jiaming Li et.al. | 2409.18943 | link |
2024-09-27 | AIPatient: Simulating Patients with EHRs and LLM Powered Agentic Workflow | Huizi Yu et.al. | 2409.18924 | null |
2024-09-27 | Best Arm Identification with Minimal Regret | Junwen Yang et.al. | 2409.18909 | null |
2024-09-27 | Simulating Dynamic Tumor Contrast Enhancement in Breast MRI using Conditional Generative Adversarial Networks | Richard Osuala et.al. | 2409.18872 | link |
2024-09-27 | Safe Decentralized Multi-Agent Control using Black-Box Predictors, Conformal Decision Policies, and Control Barrier Functions | Sacha Huriot et.al. | 2409.18862 | null |
2024-09-27 | ARLBench: Flexible and Efficient Benchmarking for Hyperparameter Optimization in Reinforcement Learning | Jannis Becktepe et.al. | 2409.18827 | link |
2024-09-27 | Facility Location Problem with Aleatory Agents | Gennaro Auricchio et.al. | 2409.18817 | null |
2024-09-27 | Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-Source LLMs | Yanyuan Qiao et.al. | 2409.18794 | null |
2024-09-27 | Forecasting Macroeconomic Dynamics using a Calibrated Data-Driven Agent-based Model | Samuel Wiese et.al. | 2409.18760 | null |
2024-09-26 | StackGen: Generating Stable Structures from Silhouettes via Diffusion | Luzhe Sun et.al. | 2409.18098 | null |
2024-09-26 | Infer Human’s Intentions Before Following Natural Language Instructions | Yanming Wan et.al. | 2409.18073 | link |
2024-09-27 | Explaining Explaining | Sergei Nirenburg et.al. | 2409.18052 | null |
2024-09-26 | Inverse Reinforcement Learning with Multiple Planning Horizons | Jiayu Yao et.al. | 2409.18051 | null |
2024-09-26 | Revisit Anything: Visual Place Recognition via Image Segment Retrieval | Kartik Garg et.al. | 2409.18049 | link |
2024-09-26 | Reasoning Multi-Agent Behavioral Topology for Interactive Autonomous Driving | Haochen Liu et.al. | 2409.18031 | link |
2024-09-26 | Compositional Hardness of Code in Large Language Models – A Probabilistic Perspective | Yotam Wolf et.al. | 2409.18028 | null |
2024-09-26 | Control Industrial Automation System with Large Language Models | Yuchen Xia et.al. | 2409.18009 | link |
2024-09-26 | Distributed Invariant Unscented Kalman Filter based on Inverse Covariance Intersection with Intermittent Measurements | Zhian Ruan et.al. | 2409.17997 | null |
2024-09-26 | Nonparametric Inference Framework for Time-dependent Epidemic Models | Son Luu et.al. | 2409.17968 | null |
2024-09-25 | Turn Every Application into an Agent: Towards Efficient Human-Agent-Computer Interaction with API-First LLM-Based Agents | Junting Lu et.al. | 2409.17140 | null |
2024-09-25 | Collision-free time-optimal path parameterization for multi-robot teams | Katherine Mao et.al. | 2409.17079 | null |
2024-09-25 | AI-Driven Risk-Aware Scheduling for Active Debris Removal Missions | Antoine Poupon et.al. | 2409.17012 | null |
2024-09-25 | PitRSDNet: Predicting Intra-operative Remaining Surgery Duration in Endoscopic Pituitary Surgery | Anjana Wijekoon et.al. | 2409.16998 | null |
2024-09-25 | Tell Me What You Don’t Know: Enhancing Refusal Capabilities of Role-Playing Agents via Representation Space Analysis and Editing | Wenhao Liu et.al. | 2409.16913 | null |
2024-09-25 | A Roadmap for Embodied and Social Grounding in LLMs | Sara Incao et.al. | 2409.16900 | null |
2024-09-25 | Robotic Backchanneling in Online Conversation Facilitation: A Cross-Generational Study | Sota Kobuki et.al. | 2409.16899 | null |
2024-09-25 | Automating Traffic Model Enhancement with AI Research Agent | Xusen Guo et.al. | 2409.16876 | link |
2024-09-25 | Communication Backbone Reconfiguration with Connectivity Maintenance | Leonardo Santos et.al. | 2409.16851 | null |
2024-09-25 | Modeling the Modqueue: Towards Understanding and Improving Report Resolution on Reddit | Tanvi Bajpai et.al. | 2409.16840 | null |
2024-09-24 | Context-Based Meta Reinforcement Learning for Robust and Adaptable Peg-in-Hole Assembly Tasks | Ahmed Shokry et.al. | 2409.16208 | null |
2024-09-25 | Extending Stable and Popular Matching Algorithms from Bipartite to Arbitrary Instances | Gergely Csáji et.al. | 2409.16173 | null |
2024-09-24 | EnIGMA: Enhanced Interactive Generative Model Agent for CTF Challenges | Talor Abramovich et.al. | 2409.16165 | link |
2024-09-25 | Efficient Motion Prediction: A Lightweight & Accurate Trajectory Prediction Model With Fast Training and Inference Speed | Alexander Prutsch et.al. | 2409.16154 | link |
2024-09-24 | Analyzing Probabilistic Methods for Evaluating Agent Capabilities | Axel Højmark et.al. | 2409.16125 | null |
2024-09-24 | MOSS: Enabling Code-Driven Evolution and Context Management for AI Agents | Ming Zhu et.al. | 2409.16120 | link |
2024-09-24 | A decision-theoretic model for a principal-agent collaborative learning problem | Getachew K Befekadu et.al. | 2409.16068 | null |
2024-09-24 | Bridging Environments and Language with Rendering Functions and Vision-Language Models | Theo Cachet et.al. | 2409.16024 | null |
2024-09-24 | AIR-Embodied: An Efficient Active 3DGS-based Interaction and Reconstruction Framework with Embodied Large Language Model | Zhenghao Qi et.al. | 2409.16019 | link |
2024-09-24 | Automated test generation to evaluate tool-augmented LLMs as conversational AI agents | Samuel Arcadinho et.al. | 2409.15934 | null |
2024-09-18 | Residual Descent Differential Dynamic Game (RD3G) – A Fast Newton Solver for Constrained General Sum Games | Zhiyuan Zhang et.al. | 2409.12152 | null |
2024-09-18 | MAgICoRe: Multi-Agent, Iterative, Coarse-to-Fine Refinement for Reasoning | Justin Chih-Yao Chen et.al. | 2409.12147 | link |
2024-09-19 | The Impact of Element Ordering on LM Agent Performance | Wayne Chi et.al. | 2409.12089 | link |
2024-09-19 | Using Large Language Models to Generate Clinical Trial Tables and Figures | Yumeng Yang et.al. | 2409.12046 | null |
2024-09-19 | Representing Positional Information in Generative World Models for Object Manipulation | Stefano Ferraro et.al. | 2409.12005 | null |
2024-09-18 | Putting Data at the Centre of Offline Multi-Agent Reinforcement Learning | Claude Formanek et.al. | 2409.12001 | null |
2024-09-18 | On the Stability of Consensus Control under Rotational Ambiguities | Zhonggang Li et.al. | 2409.11979 | null |
2024-09-18 | Anomalous behavior of Replicator dynamics for the Prisoner’s Dilemma on diluted lattices | Fernanda R. Leivas et.al. | 2409.11955 | null |
2024-09-18 | Reinforcement Learning as an Improvement Heuristic for Real-World Production Scheduling | Arthur Müller et.al. | 2409.11933 | null |
2024-09-18 | Secure Control Systems for Autonomous Quadrotors against Cyber-Attacks | Samuel Belkadi et.al. | 2409.11897 | link |
2024-09-17 | Ising model with varying spin strength on a scale-free network: scaling functions and critical amplitude ratios | M. Krasnytska et.al. | 2409.11396 | null |
2024-09-17 | Distributed Perception Aware Safe Leader Follower System via Control Barrier Methods | Richie R. Suganda et.al. | 2409.11394 | null |
2024-09-17 | LLM-Agent-UMF: LLM-based Agent Unified Modeling Framework for Seamless Integration of Multi Active/Passive Core-Agents | Amine B. Hassouna et.al. | 2409.11393 | null |
2024-09-17 | CORE-Bench: Fostering the Credibility of Published Research Through a Computational Reproducibility Agent Benchmark | Zachary S. Siegel et.al. | 2409.11363 | link |
2024-09-17 | A Scalable Game Theoretic Approach for Coordination of Multiple Dynamic Systems | Mostafa M. Shibl et.al. | 2409.11358 | null |
2024-09-17 | EIA: Environmental Injection Attack on Generalist Web Agents for Privacy Leakage | Zeyi Liao et.al. | 2409.11295 | link |
2024-09-17 | P-RAG: Progressive Retrieval Augmented Generation For Planning on Embodied Everyday Task | Weiye Xu et.al. | 2409.11279 | null |
2024-09-17 | Hackphyr: A Local Fine-Tuned LLM Agent for Network Security Environments | Maria Rigaki et.al. | 2409.11276 | null |
2024-09-18 | The Art of Storytelling: Multi-Agent Generative AI for Dynamic Multimodal Narratives | Samee Arif et.al. | 2409.11261 | link |
2024-09-17 | To What Extent do Open-loop and Feedback Nash Equilibria Diverge in General-Sum Linear Quadratic Dynamic Games? | Chih-Yuan Chiu et.al. | 2409.11257 | null |
2024-09-16 | On interactive anisotropic walks in two dimensions generated from a three state opinion dynamics model | Surajit Saha et.al. | 2409.10413 | null |
2024-09-16 | Reducing Leximin Fairness to Utilitarian Optimization | Eden Hartman et.al. | 2409.10395 | null |
2024-09-16 | Decentralized and Asymmetric Multi-Agent Learning in Construction Sites | Yakov Miron et.al. | 2409.10375 | null |
2024-09-16 | Instigating Cooperation among LLM Agents Using Adaptive Information Modulation | Qiliang Chen et.al. | 2409.10372 | null |
2024-09-16 | 2D or not 2D: How Does the Dimensionality of Gesture Representation Affect 3D Co-Speech Gesture Generation? | Téo Guichoux et.al. | 2409.10357 | null |
2024-09-16 | Partial Ordering Bayesian Logistic Regression Model for Phase I Combination Trials and Computationally Efficient Approach to Operational Prior Specification | Weishi Chen et.al. | 2409.10352 | link |
2024-09-16 | Cognitive Kernel: An Open-source Agent System towards Generalist Autopilots | Hongming Zhang et.al. | 2409.10277 | link |
2024-09-16 | Synchronization-Based Cooperative Distributed Model Predictive Control | Julius Beerwerth et.al. | 2409.10215 | null |
2024-09-16 | Maneuver Decision-Making with Trajectory Streams Prediction for Autonomous Vehicles | Mais Jamal et.al. | 2409.10165 | null |
2024-09-16 | Multi-Agent Obstacle Avoidance using Velocity Obstacles and Control Barrier Functions | Alejandro Sánchez Roncero et.al. | 2409.10117 | null |