Most recent
navigate open esc close Corpus index built 2026-06-07 23:58 UTC

§ THE STACK / DATA LAYER

Search Engines

Elasticsearch, Solr, Meilisearch, Typesense, Vespa. full-text + vector search

Vector stores, registries, memory, datasets: what the model knows and remembers.

What it is

Search engines power both the classic full-text retrieval tier (Elasticsearch, Apache Solr, Vespa) and the modern vector-similarity tier that LLM apps lean on for retrieval-augmented generation. The line between them has blurred since 2022: every mainstream engine (Elastic, OpenSearch, Solr 9, Vespa, Meilisearch, Typesense) now ships dense-vector indices alongside their inverted-index core. Many production RAG pipelines store their LangChain or LlamaIndex document chunks here rather than in a dedicated vector DB.

What goes wrong

The official Docker images ship with auth off by default. The operator must opt into security: set xpack.security.enabled=true for Elasticsearch, configure Solr’s security.json to enable the basic-auth plugin, or set the Meilisearch master key via environment variable. Across population-scale surveys, ~54% of reachable Elasticsearch instances skip the step entirely. Solr’s older Docker tags (solr:7.x) compound the problem with multiple unauthenticated remote-code-execution CVEs: CVE-2019-17558 (Velocity Template SSTI), CVE-2019-0193 (DataImportHandler), CVE-2019-12409 (JMX-RMI). The data layer itself discloses operator app schema via index and core names long before any document is read.

How we test

We probe each engine’s identity endpoint (/ for Elasticsearch’s version JSON, /solr/admin/info/system for Solr, /health for Meili and Typesense, /state/v1 for Vespa), confirm version, and then call the documented listing endpoint (/_cat/indices, /solr/admin/cores, /indexes, /collections). Index and core names are the finding: operators name things like rag-document-chunks, spring-ai-document-index, entity_vectors, kb_documents_v1. Disclosure of the operator’s app architecture happens before any document fetch. We never run free-text queries against the index; the names alone justify the severity claim.

Receipts

Research

Every survey, case study, and disclosure we've published that touches this layer of the stack. Counts on the cells above tally these directly.

Cross-cloud surveys

9
Survey May 17, 2026

Meow / Indexrm Elasticsearch extortion. Three actors. (2026-05-17)

We sampled 150 of the 3,604 fully-wiped Elasticsearch hosts from this morning's re-probe. We read the readme index on each one. Three different actors are running the campaign in parallel.

Read →
Survey May 17, 2026

Meow / Indexrm campaign: per-actor census across 4,776 ES hosts

We re-ran the full 4,776-host Elasticsearch population through aimap v1.9.10. The new release reads one document from the attacker-planted marker index and parses it for actor identifiers. The morning…

Read →
Survey May 16, 2026

Elasticsearch AI-Stack Population Survey (2026-05-16)

Population survey of Elasticsearch clusters with focus on AI-stack adjacency. RAG vector stores, langchain/llama-index indices, embedding caches, prompt history. Elasticsearch has been a major exposur…

Read →
Survey May 16, 2026

Vector-DB Stragglers Population Survey (2026-05-16)

Closes the four platform-class stragglers left after the 2026-05 Qdrant / ChromaDB / Milvus / Weaviate sweep: Apache Solr, Meilisearch, Typesense, Vespa, plus pgvector body-marker recheck. Each candid…

Read →
Survey May 9, 2026

BI/Dashboard Platforms: Auth Posture Survey

Four BI and analytics dashboard platforms surveyed via Shodan + asyncio probe: Metabase (1,789 IPs), Grafana (2,000 IPs), Apache Superset (1,176 IPs), Redash (1,079 IPs). Total 6,044 IPs → 4,449 confi…

Read →
Survey May 9, 2026

Neo4j, Elasticsearch, Supabase, Redis Stack: AI Infrastructure Exposure Survey

Four additional infrastructure layers surveyed as part of the 2026-05-09 vector DB series. Combined Shodan pull → asyncio probe across 2,064 IPs (971 Neo4j + 636 Elasticsearch v8 + 314 Supabase + 143…

Read →
Survey May 9, 2026

New Vector Storage Survey: QuestDB / Meilisearch / PocketBase / NATS JetStream

Seven previously-unsurveyed AI-adjacent storage and messaging platforms probed via Shodan. 293 QuestDB consoles open with unauthenticated SQL execution, 488 Meilisearch instances health-confirmed (100…

Read →
Survey May 9, 2026

SurrealDB, Typesense, and LanceDB: Exposure Survey

Three additional vector-capable databases surveyed as part of the 2026-05-09 vector DB series. Combined Shodan pull → asyncio probe across 995 IPs (431 SurrealDB + 354 Typesense + 210 LanceDB).

Read →
Survey May 3, 2026

Elasticsearch / OpenSearch on Public Cloud: Auth Posture Survey

Sweep of 1.83M IPs across 28 cloud-provider /16 ranges (DigitalOcean, Hetzner, Vultr) on port 9200 → 313 masscan hits → 42 confirmed unauthenticated Elasticsearch/OpenSearch instances (38 ES, 4 OpenSe…

Read →

Coordinated disclosures

15
May 17, 2026

Cn Gaohe Itgaohe 2026 05 17

Nicholas Michael Kloster / NuClide Research nicholas@nuclide-research.com

Read →
May 17, 2026

Cn Gxota Guangxi Travel Dev 2026 05 17

Nicholas Michael Kloster / NuClide Research nicholas@nuclide-research.com

Read →
May 17, 2026

Cn Hooper Erp 2026 05 17

Nicholas Michael Kloster / NuClide Research nicholas@nuclide-research.com

Read →
May 17, 2026

Cn Timedb 2026 05 17

Nicholas Michael Kloster / NuClide Research nicholas@nuclide-research.com

Read →
May 17, 2026

Cn Torchv Mengjia Zlmediakit 2026 05 17

Nicholas Michael Kloster / NuClide Research nicholas@nuclide-research.com

Read →
May 17, 2026

Cn Woyaodiancan Restaurant Ai 2026 05 17

Nicholas Michael Kloster / NuClide Research nicholas@nuclide-research.com

Read →
May 17, 2026

Cn Xiaoice Demo Virtualhuman 2026 05 17

Nicholas Michael Kloster / NuClide Research nicholas@nuclide-research.com

Read →
May 17, 2026

De Aitalkx Dms Rag 2026 05 17

Nicholas Michael Kloster / NuClide Research nicholas@nuclide-research.com

Read →
May 17, 2026

De Travelm Articles 2026 05 17

Nicholas Michael Kloster / NuClide Research nicholas@nuclide-research.com

Read →
May 17, 2026

Eg Equant Tech Waffarha Lms 2026 05 17

Nicholas Michael Kloster / NuClide Research nicholas@nuclide-research.com

Read →
May 17, 2026

It Isideweb Deskpro 2026 05 17

Nicholas Michael Kloster / NuClide Research nicholas@nuclide-research.com

Read →
CRITICAL drafted May 17, 2026

Np Mohp Hmis Ocl 2026 05 17

Government of Nepal. Ministry of Health and Population (HMIS / Open Concept Lab)

Read →
May 17, 2026

Ru Westcall Aicloud Backend 2026 05 17

Nicholas Michael Kloster / NuClide Research nicholas@nuclide-research.com

Read →
May 17, 2026

Sa Tahakum Llm 2026 05 17

Nicholas Michael Kloster / NuClide Research nicholas@nuclide-research.com

Read →
CRITICAL drafted May 16, 2026

Solr 7.6.0 unauth fleet: Aggregate cloud-provider disclosure

Apache Solr 7.6.0 self-hosted population

Read →