Data Pool

The Data Pool is the central interface for managing indexed data in SmythOS. It allows you to create Data Spaces, upload files or URLs, and optionally connect external vector storage using Custom Storage. This enables agents to use retrieval-augmented generation (RAG) and answer queries using relevant context from your documents.

What the Data Pool enables

Agents in SmythOS can use your files, pages, and structured content for precise responses. The Data Pool powers this by indexing data into searchable embeddings.

How the Data Pool supports RAG

RAG stands for retrieval-augmented generation. When an agent is connected to a Data Space, it can search through indexed content and use the most relevant snippets to generate informed answers. This makes your agents more accurate, grounded, and useful for knowledge-based tasks.

RAG gives your agent real knowledge

Without data, your agent only knows what you prompt it with. With RAG, it retrieves meaningful information from your own sources before generating output.

Key features of the Data Pool

Feature	Description	Learn More
Data Spaces	Containers where you upload and manage files or URLs for indexing	Create Data Spaces
Custom Storage	Connect Pinecone or another vector store to store embeddings externally	Set Up Custom Storage
Indexing Status	See whether each data source has been successfully indexed and is available for agent retrieval	Included in the Data Spaces guide
Data source preview	Review uploaded files or linked content from the interface	Managed inside each Data Space

Use one Data Pool, many agents

You can reuse the same Data Spaces across multiple agents. This makes your content modular and efficient to maintain.

Using the Data Pool

To access the Data Pool:

Open the SmythOS Studio sidebar
Click Data Pool
View your list of existing Data Spaces
Check the indexing status for each row
Click any row to open the full Data Space view

The Data Pool shows your content containers, index status, and linked storage at a glance.

Storage options

The Data Pool supports both internal and external vector storage for embeddings.

Storage Type	Hosted By	Setup Required	Best For	Docs
Internal Storage	SmythOS	None	Quick setups, default RAG usage	Data Spaces
External Storage (Pinecone)	You (Pinecone)	API key and index info	Full control, enterprise use, scalable vectors	Custom Storage

Pinecone index requirement

Make sure your Pinecone index uses a vector dimension of 1536 or indexing will not work.

What's Next

Was this page helpful?

How the Data Pool supports RAG​

Key features of the Data Pool​

Using the Data Pool​

Storage options​

What's Next​

How the Data Pool supports RAG

Key features of the Data Pool

Using the Data Pool

Storage options

What's Next