A modular, multi-provider AI agent with persistent memory, vector search, MCP support, and a powerful tool system, all in a sleek terminal interface.
A simulated Wisemonkey session — the welcome screen you see on first launch, followed by example prompts showcasing natural-language control over tools, memory, providers, and MCP servers.
Connect to OpenAI, Anthropic, Ollama, LM Studio, or any OpenAI-compatible API. Switch providers on the fly.
Session-scoped chat history, user profiles, and notes that survive restarts. Your agent remembers everything.
Embed PDFs and documents into a ChromaDB vector store. Retrieve relevant context on demand.
Connect to Model Context Protocol servers for filesystem access, GitHub, databases, and more.
Register tools at runtime. MCP server tools are auto-discovered and injected into the agent's capabilities.
Real-time token streaming with configurable reasoning effort. Watch your agent think step by step.
On first run, Wisemonkey creates ~/.config/wisemonkey/config.yaml with sensible defaults. Works with any OpenAI-compatible endpoint out of the box.
LM Studio, Ollama, OpenWebUI, or your own server. Just set base_url.
Set reasoning_effort to none, low, medium, or high. Toggle visibility with reasoning_visible.
Separate embedding model config. Use any OpenAI-compatible embedding endpoint for vector search.
Enable vi_mode for vim-style editing. Press v in normal mode to open your external editor.