Skip to content

RAG nodes

Retrieval-Augmented Generation: embedders, vector stores, document loaders, retrievers.

5 nodes.


document_chunker — Document Chunker

Splits documents into smaller chunks for embeddings.

Config fields

FieldTypeRequiredValues
chunkSizenumberno
chunkOverlapnumberno
separatorstringno
strategystringnoseparator | character | recursive | token

Example config

json
{
  "chunkSize": 500,
  "chunkOverlap": 50,
  "separator": "\\n\\n",
  "strategy": "separator"
}

embeddings_azure_openai — Embeddings Azure OpenAI

Generates embedding vectors using Azure OpenAI embedding deployments.

Config fields

FieldTypeRequiredValues
endpointstringyes
deploymentstringyes
apiVersionstringno
secretRefobjectyes
inputKeystringno
outputKeystringno

Example config

json
{
  "endpoint": "https://my-azure-openai.openai.azure.com",
  "deployment": "text-embedding-3-large",
  "apiVersion": "2024-10-21",
  "inputKey": "user_prompt",
  "outputKey": "embedding"
}

extract_citations — Extract Citations

Phase 9.4 — scans an LLM answer for [N] markers, resolves each to the upstream retrieved document, and emits structured citations the chat UI renders as clickable footnotes. Wire after llm_call when the upstream chain produced documents.

Config fields

FieldTypeRequiredValues
answerPathstringno
documentsPathstringno

Example config

json
{
  "answerPath": "answer",
  "documentsPath": "documents"
}

rag_retrieve — RAG Retrieve

Retrieves context chunks from provided documents or vector store.

Config fields

FieldTypeRequiredValues
queryTemplatestringno
topKnumberno
documentsarray<string>no
embedderIdstringno
vectorStoreIdstringnoin-memory-vector-store | knowledge-base | pinecone-vector-store | pgvector-store | azure-ai-search-vector-store | qdrant-vector-store | chroma-vector-store | weaviate-vector-store | redis-vector-store
knowledgeBaseIdstringno
searchModestringnovector | bm25 | hybrid
bm25Weightnumberno
vectorWeightnumberno
rrfKnumberno
candidatesPerRankernumberno
vectorStoreConfigobjectno
embeddingSecretRefobjectno

Example config

json
{
  "queryTemplate": "{{user_prompt}}",
  "topK": 3,
  "embedderId": "token-embedder",
  "vectorStoreId": "in-memory-vector-store"
}

rerank — Rerank

Reorders candidate documents by relevance via a cross-encoder reranker (Cohere / Jina / Voyage). Plug after rag_retrieve for the precision boost.

Config fields

FieldTypeRequiredValues
queryTemplatestringno
topNnumberno
providerIdstringnocohere | jina | voyage
modelstringno
secretRefobjectno

Example config

json
{
  "queryTemplate": "{{user_prompt}}",
  "topN": 3,
  "providerId": "cohere",
  "model": "rerank-english-v3.0"
}