RAG Frameworks, Gateway Layer, NuClide Stack

What it is

Retrieval-Augmented Generation (RAG) is how an LLM gets access to documents it wasn’t trained on: your company wiki, last week’s invoices, a PDF of your medical history. A RAG pipeline chains a document loader, an embedder, a vector store, a retriever, and the LLM call. Frameworks that package this chain into one runtime include Dify (the most polished, Chinese-origin), Flowise (a visual builder on top of LangChain), Haystack (Deepset’s enterprise stack), Quivr, and Verba. The pipeline is what turns a model into a product.
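The chain described above can be sketched in a few lines of Python. This is a toy illustration only, not the API of Dify, Flowise, Haystack, or any other framework: the bag-of-words `embed` function stands in for a real embedding model, `VectorStore` is an in-memory list rather than a real vector database, and the `llm` argument is a hypothetical callable that a real deployment would replace with a model call. All names and sample documents are invented for the example.

```python
import math
from collections import Counter
from dataclasses import dataclass
from typing import Callable


@dataclass
class Document:
    title: str
    text: str


def load_documents() -> list[Document]:
    # Document loader: stand-in for a wiki export, PDF parser, etc.
    return [
        Document("Invoice 2024-07", "Invoice total due for cloud hosting services in July"),
        Document("Onboarding wiki", "New employees request laptop and badge from the IT desk"),
        Document("Travel policy", "Flights must be booked through the travel portal"),
    ]


def embed(text: str) -> Counter:
    # Embedder: toy bag-of-words vector (assumption; real systems use a model).
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    # Cosine similarity between two sparse word-count vectors.
    dot = sum(a[t] * b[t] for t in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


class VectorStore:
    # Vector store: keeps (embedding, document) pairs in memory.
    def __init__(self) -> None:
        self._rows: list[tuple[Counter, Document]] = []

    def add(self, doc: Document) -> None:
        self._rows.append((embed(doc.text), doc))

    def search(self, query: str, k: int = 1) -> list[Document]:
        # Retriever: rank stored documents by similarity to the query.
        q = embed(query)
        ranked = sorted(self._rows, key=lambda row: cosine(q, row[0]), reverse=True)
        return [doc for _, doc in ranked[:k]]


def build_prompt(question: str, context: list[Document]) -> str:
    # Retrieved text is pasted into the prompt ahead of the question.
    sources = "\n".join(f"[{d.title}] {d.text}" for d in context)
    return f"Context:\n{sources}\n\nQuestion: {question}"


def answer(question: str, store: VectorStore, llm: Callable[[str], str]) -> str:
    # LLM call: `llm` is a hypothetical callable supplied by the caller.
    return llm(build_prompt(question, store.search(question, k=1)))
```

Note that nothing in this chain checks who is asking: whatever the store has indexed, the retriever will return.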

What goes wrong

Most RAG deployments are research artefacts that grew into prototypes that grew into production. Dify ships with admin@admin.com / password as the seed account; a fresh Flowise install exposes the canvas and every workflow’s embedded API keys; Haystack’s REST API is unauthenticated by default and its /query endpoint will dutifully retrieve and return any document the embedder has indexed. The corpus exposed this way ranges from public PDFs all the way to attorney-client communications, internal sales decks, and patient records.

How we test

We probe each framework’s signature endpoints: Dify’s /console/api/setup for the seed-account state, Flowise’s /api/v1/chatflows for the workflow catalogue, Haystack’s /search for the indexed corpus reach. When the retriever is reachable, we issue a single low-volume query (e.g. “summary”) to confirm the corpus contains real content, capture the document titles and sources from the response, and stop. Title metadata is enough to attribute the operator and characterise the data class without reading the documents themselves.
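The "capture titles and sources, then stop" step can be illustrated with a small reducer that keeps only attribution metadata and discards document bodies. This is a sketch under assumptions: the response shape used here (a `documents` list whose items carry `content` and a `meta` object with `name` and `source` keys) is hypothetical and chosen for illustration, and `summarize_hits` is an invented helper name, not part of any framework.

```python
from typing import Any


def summarize_hits(response: dict[str, Any]) -> list[dict[str, str]]:
    """Keep only title and source for each hit; never copy the content field."""
    hits = []
    for doc in response.get("documents", []):
        meta = doc.get("meta") or {}  # assumed metadata container
        hits.append({
            "title": str(meta.get("name", "")),
            "source": str(meta.get("source", "")),
        })
    return hits


# Usage with an invented sample response:
sample = {
    "documents": [
        {"content": "(body omitted)", "meta": {"name": "Q3 sales deck", "source": "sharepoint"}},
        {"content": "(body omitted)", "meta": None},
    ]
}
titles = summarize_hits(sample)
```

Dropping the content field at the point of capture means the document text is never stored, which matches the stated goal of characterising the data class without reading the documents.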

RAG Frameworks

Dify Population Survey — 939 Config-Disclosure, 9 Open Auth Findings

RAG Framework Servers Population Survey — Cat-07 (2026-05-31)

RAG Stragglers: LightRAG, RAGFlow, DocsGPT, Ragapp Population Survey

RAG Framework Servers: Population-Scale Survey (2026-05-15)

MinIO + Dify on Public Cloud: Auth Posture Survey

Embedding Services: Cross-Cloud Survey (2026-05)

RAG Framework Servers: Cross-Cloud Survey (2026-05)

23.239.19.219: Exposed LlamaIndex Chat with Broken Backend, Multi-Tenant SNI Co-Tenancy

University of Dhaka: Coding Cluster, 3 Cloud Proxies, Embedding Pipeline

China Telecom Tianjin: 46-Node Multi-Tenant Ollama Cluster

Agricultural University of Athens: 142GB Qwen3-235B MoE, Dual-Embedding RAG

Institut Teknologi Bandung (ITB): 22 Models, Custom Indonesian Education AI

Government AI Infrastructure Exposures

Indonesia Government Cluster: 5-Node Survey, 2 Account Takeovers

DINAS KOMINFO PROV. JAWA TENGAH: Account Takeover, RAG Pipeline

"No. 18 Institute of Jingdong HQ": 26-Node Cluster, China Unicom

Kyungpook National University: 3-Node Cluster, Multimodal AI

California Institute of Technology (Caltech): GPT-OSS 120B, RAG Pipeline

Chinese Primary School: Cloud Proxy Subscriptions + Credential Leak

University of Newcastle, Australia: DeepSeek Cloud Proxy + RAG Pipeline

Brno University of Technology: Abliterated Gemma + Bulgarian GPT + RAG Pipeline

Technical University of Crete + NTUA: Unauthenticated Ollama, MiniMax Cloud, 235.7B Model

University of Crete Medical Center: Dual-Embedding RAG Pipeline

Fu Jen Catholic University: Medical Public Health GPU Server, 75GB + 60GB Local Models

Rochester Institute of Technology: 4-Node Cluster, DGX with 18 Cloud Subscriptions, Student Machine with Abliterated Models

SUNY Buffalo: Unauthenticated Ollama + Cloud Proxy Quota Hijack Confirmed

Es Frojasg1 Dev Haystack 2026 05 17

Klinikken.ai: Unauthenticated Vector Database API (Auth Bypass via Embedding Proxy)

Au Newcastle Followup

Us Ny Suny Buffalo State

Au Newcastle

Cz Brno Vutbr

Gr U Crete Medical

Us Ny Rit

LLM Gateways

Rerankers