Data Pool
The Data Pool is the central interface for managing indexed data in SmythOS. It lets you create Data Spaces, upload files or URLs, and optionally connect external vector storage using Custom Storage. Agents connected to this data can use retrieval-augmented generation (RAG) to answer queries with relevant context from your documents.
How the Data Pool supports RAG
When an agent is connected to a Data Space, it searches the indexed content for passages related to the query and uses the most relevant snippets as context when generating its answer. Because answers draw on your documents rather than on the model's training data alone, agents are more accurate and better grounded on knowledge-based tasks.
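The retrieval step can be illustrated with a minimal Python sketch. This is not how SmythOS implements indexing or search: a real vector store compares embeddings, while this toy version compares simple word counts as a stand-in. The function names (`retrieve`, `build_prompt`) and the sample snippets are hypothetical and exist only for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine_similarity(a, b):
    """Cosine similarity between two sparse term-count vectors (Counters)."""
    dot = sum(count * b[term] for term, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(query, snippets, top_k=2):
    """Return up to top_k snippets ranked by similarity to the query.

    Word counts stand in for embeddings here (an assumption made for
    illustration). Snippets with no overlap at all are dropped.
    """
    query_vec = Counter(tokenize(query))
    scored = [(cosine_similarity(query_vec, Counter(tokenize(s))), s) for s in snippets]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [snippet for score, snippet in scored[:top_k] if score > 0]


def build_prompt(query, context_snippets):
    """Combine the retrieved snippets and the question into one prompt string."""
    context = "\n".join(f"- {s}" for s in context_snippets)
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"


# Hypothetical indexed content, standing in for documents in a Data Space.
SNIPPETS = [
    "Refunds are processed within 14 days of a return request.",
    "Our office is closed on public holidays.",
    "Shipping takes 3 to 5 business days.",
]

if __name__ == "__main__":
    question = "How long do refunds take?"
    print(build_prompt(question, retrieve(question, SNIPPETS)))
```

Only the snippet that shares a word with the question is kept, so the prompt sent to the model contains the refund policy and none of the unrelated content.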
Key features of the Data Pool
Feature | Description | Learn More |
---|---|---|
Data Spaces | Containers where you upload and manage files or URLs for indexing | Create Data Spaces |
Custom Storage | Connect Pinecone or another vector store to store embeddings externally | Set Up Custom Storage |
Indexing Status | See whether each data source has been successfully indexed and is available for agent retrieval | Included in the Data Spaces guide |
Data Source Preview | Review uploaded files or linked content from the interface | Managed inside each Data Space |
Using the Data Pool
To access the Data Pool:
- Open the SmythOS Studio sidebar
- Click Data Pool
- View your list of existing Data Spaces
- Check the indexing status for each row
- Click any row to open the full Data Space view

The Data Pool shows your Data Spaces, their indexing status, and any linked storage at a glance.
Storage options
The Data Pool supports both internal and external vector storage for embeddings.
Storage Type | Hosted By | Setup Required | Best For | Docs |
---|---|---|---|---|
Internal Storage | SmythOS | None | Quick setups, default RAG usage | Data Spaces |
External Storage (Pinecone) | You (Pinecone) | API key and index info | Full control, enterprise use, large-scale vector workloads | Custom Storage |