Small agents with big memory
Tiny footprint, infinite memory
The Problem
Smolagents from Hugging Face are beautifully minimal. Small, fast, and surprisingly capable. They prove you don't need a massive framework to build effective agents.
But small doesn't mean you want amnesia.
Your smolagent handles tasks efficiently, then forgets everything. The next session, it's back to "Hello, I'm an AI assistant." All that efficiency, none of the memory.
Minimal framework doesn't have to mean minimal context.
How Stompy Helps
Stompy gives your smolagents persistent memory without adding bloat.
Your lightweight agents gain: - Cross-session context without heavy dependencies - Knowledge that persists across runs - Semantic search for finding relevant past interactions - All without sacrificing the "smol" philosophy
Stay light. Remember everything.
Integration Walkthrough
Add Stompy to your smolagent
Smolagents support custom tools. Add Stompy memory tools directly.
from smolagents import CodeAgent, ToolCallingAgentfrom smolagents.mcp import MCPClientimport os# Connect to Stompy via SSEstompy = MCPClient(url="https://mcp.stompy.ai/sse",headers={"Authorization": f"Bearer {os.environ['STOMPY_TOKEN']}"})agent = ToolCallingAgent(tools=[*stompy.get_tools()],model="Qwen/Qwen2.5-72B-Instruct")# Agent now has persistent memory
What You Get
- Stays lightweight—memory is external, not embedded
- Automatic session handovers between runs
- Semantic search finds relevant context
- Delta evaluation keeps storage minimal
- Perfect for edge deployment with cloud memory
Ready to give Smolagents a memory?
Join the waitlist and be the first to know when Stompy is ready. Your Smolagents projects will never forget again.