Updated biweekly.
AI agents can think, act, and complete tasks by themselves.
But can they really replace us at our jobs?
🔥: Recommended papers
📖: Survey papers
⚖️: Benchmark papers
- Agent Capabilities
- AI Agents Architecture
- AI Agents Applications
- GenAI Agents Presentations
- 📖 "Agentic Reasoning for Large Language Models" [paper]
- 📖 "Toward Efficient Agents: Memory, Tool learning, and Planning" [paper]
- "JENIUS AGENT: Towards Experience-Driven Accuracy Optimization in Real-World Scenarios" [paper]
- "EvoRoute: Experience-Driven Self-Routing LLM Agent Systems" [paper]
- "MEMRL: Self-Evolving Agents via Runtime Reinforcement Learning on Episodic Memory" [paper]
- "PACEvolve: Enabling Long-Horizon Progress-Aware Consistent Evolution" [paper]
- "Beyond Static Tools: Test-Time Tool Evolution for Scientific Reasoning" [paper]
- "WISE-Flow: Workflow-Induced Structured Experience for Self-Evolving Conversational Service Agents" [paper]
- "To Retrieve or To Think? An Agentic Approach for Context Evolution" [paper]
- "Controlled Self-Evolution for Algorithmic Code Optimization" [paper]
- "Learn Like Humans: Use Meta-cognitive Reflection for Efficient Self-Improvement" [paper]
- 📖 "From Storage to Experience: A Survey on the Evolution of LLM Agent Memory Mechanisms" [paper]
- "Inference-Time Scaling of Verification: Self-Evolving Deep Research Agents via Test-Time Rubric-Guided Verification" [paper]
- "Agentic Memory: Learning Unified Long-Term and Short-Term Memory Management for Large Language Model Agents" [paper]
- "SimpleMem: Efficient Lifelong Memory for LLM Agents" [paper]
- "MEMRL: Self-Evolving Agents via Runtime Reinforcement Learning on Episodic Memory" [paper]
- "Memory Matters More: Event-Centric Memory as a Logic Map for Agent Searching and Reasoning" [paper]
- "Controllable Memory Usage: Balancing Anchoring and Innovation in Long-Term Human-Agent Interaction" [paper]
- "Inside Out: Evolving User-Centric Core Memory Trees for Long-Term Personalized Dialogue Systems" [paper]
- "MineNPC-Task: Task Suite for Memory-Aware Minecraft Agents" [paper]
- "PACEvolve: Enabling Long-Horizon Progress-Aware Consistent Evolution" [paper]
- "The AI Hippocampus: How Far are We From Human Memory?" [paper]
- "MemoBrain: Executive Memory as an Agentic Brain for Reasoning" [paper]
- "AtomMem: Learnable Dynamic Agentic Memory with Atomic Memory Operation" [paper]
- "Fine-Mem: Fine-Grained Feedback Alignment for Long-Horizon Memory Management" [paper]
- "Structured Episodic Event Memory" [paper]
- "Active Context Compression: Autonomous Memory Management in LLM Agents" [paper]
- "Progressive Ideation using an Agentic AI Framework for Human-AI Co-Creation" [paper]
- "OpenNovelty: An LLM-powered Agentic System for Verifiable Scholarly Novelty Assessment" [paper]
- "Sci-Reasoning: A Dataset Decoding AI Innovation Patterns" [paper]
- "SuS: Strategy-aware Surprise for Intrinsic Exploration" [paper]
- "Proof of Time: A Benchmark for Evaluating Scientific Idea Judgments" [paper]
- "LLM Review: Enhancing Creative Writing via Blind Peer Review Feedback" [paper]
- "Agentic AI and Machine Learning for Accelerated Materials Discovery and Applications" [paper]
- "Who Owns Creativity and Who Does the Work? Trade-offs in LLM-Supported Research Ideation" [paper]
- "Improved Bug Localization with AI Agents Leveraging Hypothesis and Dynamic Cognition" [paper]
- "Rethinking the AI Scientist: Interactive Multi-Agent Workflows for Scientific Discovery" [paper]
- "Learning to Discover at Test Time" [paper]
- "Strategic Self-Improvement for Competitive Agents in AI Labour Markets" [paper]
- "Guided Self-Evolving LLMs with Minimal Human Supervision" [paper]
- "Evolving Excellence: Automated Optimization of LLM-based Agents" [paper]
- "Remember Me, Refine Me: A Dynamic Procedural Memory Framework for Experience-Driven Agent Evolution" [paper]
- "Beyond Training: Enabling Self-Evolution of Agents with MOBIMEM" [paper]
- "SCOPE: Prompt Evolution for Enhancing Agent Effectiveness" [paper]
- "Reinforcement Learning for Self-Improving Agent with Skill Library" [paper]
- "MemEvolve: Meta-Evolution of Agent Memory Systems" [paper]
- 📖 "Memory in the Age of AI Agents: A Survey Forms, Functions and Dynamics" [paper]
- 📖 "Adaptation of Agentic AI" [paper]
- 📖 "Deep Research: A Systematic Survey" [paper]
- 🔥 "Measuring Agents in Production" [paper]
- 🔥 "Towards a Science of Scaling Agent Systems" [paper]
- ⚖️ "Evaluating Large Language Models in Scientific Discovery" [paper]
- 🔥 "How Far Are We from Genuinely Useful Deep Research Agents?" [paper]
- "Can Agentic AI Match the Performance of Human Data Scientists?" [paper]
04/25 ~ 12/25 [link]
