Configuration
When creating an AutoRAG instance, you can customize how your RAG pipeline ingests, processes, and responds to data using a set of configuration options. Some settings can be updated after the instance is created, while others are fixed at creation time.
The table below lists all available configuration options:
| Configuration | Editable after creation | Description | 
|---|---|---|
| Data source | no | The source where your knowledge base is stored | 
| Chunk size | yes | Number of tokens per chunk | 
| Chunk overlap | yes | Number of overlapping tokens between chunks | 
| Embedding model | no | Model used to generate vector embeddings | 
| Query rewrite | yes | Enable or disable query rewriting before retrieval | 
| Query rewrite model | yes | Model used for query rewriting | 
| Query rewrite system prompt | yes | Custom system prompt to guide query rewriting behavior | 
| Match threshold | yes | Minimum similarity score required for a vector match | 
| Maximum number of results | yes | Maximum number of vector matches returned ( top_k) | 
| Generation model | yes | Model used to generate the final response | 
| Generation system prompt | yes | Custom system prompt to guide response generation | 
| Similarity caching | yes | Enable or disable caching of responses for similar (not just exact) prompts | 
| Similarity caching threshold | yes | Controls how similar a new prompt must be to a previous one to reuse its cached response | 
| AI Gateway | yes | AI Gateway for monitoring and controlling model usage | 
| AutoRAG name | no | Name of your AutoRAG instance | 
| Service API token | yes | API token granted to AutoRAG to give it permission to configure resources on your account. | 
Was this helpful?
- Resources
- API
- New to Cloudflare?
- Products
- Sponsorships
- Open Source
- Support
- Help Center
- System Status
- Compliance
- GDPR
- Company
- cloudflare.com
- Our team
- Careers
- © 2025 Cloudflare, Inc.
- Privacy Policy
- Terms of Use
- Report Security Issues
- Trademark