Topics

AI Agents

LLM-driven systems that plan, act, use tools, and carry skills across tasks.

An autonomous agent workflow on a dark screen

AI agents wrap a language model in a loop of planning, tool use, memory, and action, turning a one-shot responder into a system that can pursue a goal over many steps. The research that matters is less about any single model and more about how agents reason, call tools, recover from errors, and carry reusable skills between tasks.

This topic tracks the shift from clever prompting to durable infrastructure: ReAct interleaved reasoning with actions, Toolformer taught models to call APIs, and skill-packaging systems like COLLEAGUE.SKILL turn expertise into portable, correctable artifacts. The hard open questions are reliability, evaluation, safety bounds, and how to author and maintain skills at scale.

Foundational papers

Recent papers