AgentScope RAG + Memory Architecture — Building Knowledge-Based Agents
Build knowledge-based agents with KnowledgeBase, vector stores (Qdrant/Milvus), and ReMe long-term memory.

AgentScope RAG + Memory Architecture — Building Knowledge-Based Agents
An agent that can reason and use tools is powerful. An agent that can also search your documents and remember past interactions is transformative.
In this post, we'll add RAG (Retrieval-Augmented Generation) and long-term memory to AgentScope agents — turning them into knowledge workers that improve over time.
Series: Part 1: Getting Started | Part 2: Multi-Agent | Part 3: MCP Integration | Part 4 (this post) | Part 5: Realtime Voice | Part 6: Production
1. RAG Overview
Related Posts

Inside Google COSMO — The New Architecture of On-Device AI Agents
Deep-dive into COSMO, Google's next-gen AI assistant accidentally leaked before I/O 2026. Full breakdown of the 3-mode architecture: Gemini Nano + PI server + Hybrid routing.

Self-Evolving AI Agents — The New Paradigm of 2026
GenericAgent, Evolver, Open Agents — comparing 3 self-evolving agent frameworks that learn, adapt, and grow without human coding.

LLM Inference Optimization Part 4 — Production Serving
Production deployment with vLLM and TGI. Continuous Batching, Speculative Decoding, memory budget design, and throughput benchmarks.