Object Storage, Data Layer, NuClide Stack

What it is

Models and datasets are big (gigabytes to terabytes per artefact), and the de facto storage substrate for them is S3-compatible object storage. MinIO is the self-hosted, on-prem option (also bundled with most RAG distributions, such as Dify); AWS S3, Google Cloud Storage, and Cloudflare R2 are the cloud variants; Garage and SeaweedFS are the smaller open-source alternatives. Model registries, fine-tuning jobs, and RAG document loaders all read from or write to one of these.

What goes wrong

MinIO ships with the default credentials minioadmin / minioadmin and a public console on port 9001. Most operators change the password but leave the console reachable; many also leave the S3 API on port 9000 with a public bucket policy that exposes the bucket inventory to anonymous callers. The buckets are typically named after the project (model-weights, training-data-2026, customer-uploads), and the keys inside them describe the artefact lifecycle. S3 buckets exhibit the same pattern at a different scale: misconfigured bucket policies, public ACLs left over from old aws s3 sync --acl public-read mistakes, and the now-classic enumeration weakness where the bucket name is simply the company name plus "production".
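To make the "public bucket policy" failure concrete, here is a minimal sketch of how an operator might audit a policy document for anonymous listing. It assumes the standard S3 policy JSON shape (Effect, Principal, Action, Resource); the function names and the example policy are hypothetical, and the check deliberately ignores Condition and NotAction elements, so treat a hit as a prompt for review, not as proof of exposure.

```python
import json
from fnmatch import fnmatchcase

# Hypothetical policy: the first statement is the risky one.
EXAMPLE_POLICY = """
{
  "Version": "2012-10-17",
  "Statement": [
    {"Effect": "Allow", "Principal": {"AWS": ["*"]},
     "Action": ["s3:GetBucketLocation", "s3:ListBucket"],
     "Resource": ["arn:aws:s3:::model-weights"]},
    {"Effect": "Allow", "Principal": {"AWS": ["arn:aws:iam::123456789012:root"]},
     "Action": "s3:*", "Resource": "arn:aws:s3:::model-weights/*"}
  ]
}
"""


def _as_list(value):
    """Policy fields may be a single string or a list; normalise to a list."""
    if value is None:
        return []
    return [value] if isinstance(value, str) else list(value)


def _is_anonymous(principal) -> bool:
    """True when the principal is the wildcard, i.e. any unauthenticated caller."""
    if principal == "*":
        return True
    if isinstance(principal, dict):
        return "*" in _as_list(principal.get("AWS"))
    return False


def public_list_statements(policy_json: str) -> list:
    """Return the Allow statements that let anonymous callers list a bucket."""
    statements = json.loads(policy_json).get("Statement", [])
    if isinstance(statements, dict):  # a lone statement may not be wrapped in a list
        statements = [statements]
    hits = []
    for stmt in statements:
        if stmt.get("Effect") != "Allow":
            continue
        # Action names are case-insensitive and may contain wildcards (s3:List*).
        grants_list = any(
            fnmatchcase("s3:listbucket", action.lower())
            for action in _as_list(stmt.get("Action"))
        )
        if grants_list and _is_anonymous(stmt.get("Principal")):
            hits.append(stmt)
    return hits
```

Calling public_list_statements(EXAMPLE_POLICY) returns only the first statement: the second also grants listing through s3:*, but to a named account instead of the wildcard principal.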

How we test

We list buckets through the MinIO admin API where it is reachable without authentication, and we check S3 buckets via pattern-based name enumeration (no brute force, only the candidates that fall out of the operator's known naming conventions). We confirm exposure with a single HEAD request against a bucket-listing URL; we do not download objects. Bucket names plus their key-prefix structure are the disclosure evidence.
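The approach above can be sketched roughly as follows. This is an illustration under stated assumptions, not our actual tooling: the suffix list, the function names, and the path-style URL layout are hypothetical, and the status-code reading follows the usual S3 HEAD-bucket convention (200, 403, 404), which individual S3-compatible servers may not honour exactly. Only run something like this against systems you are authorised to test.

```python
import urllib.error
import urllib.request
from urllib.parse import quote

# Assumed suffixes, taken from the naming patterns described in this section.
DEFAULT_SUFFIXES = ("production", "prod", "model-weights",
                    "training-data", "customer-uploads")


def candidate_bucket_names(org: str, suffixes=DEFAULT_SUFFIXES) -> list:
    """Derive a short candidate list from an organisation name (no brute force)."""
    base = org.strip().lower()
    names = []
    for suffix in suffixes:
        for name in (f"{base}-{suffix}", f"{base}{suffix}"):
            if name not in names:
                names.append(name)
    return names


def bucket_url(endpoint: str, bucket: str) -> str:
    """Build a path-style bucket URL, e.g. http://host:9000/bucket."""
    return endpoint.rstrip("/") + "/" + quote(bucket)


def head_status(url: str, timeout: float = 5.0) -> int:
    """Send one HEAD request and return the HTTP status; no body is fetched."""
    request = urllib.request.Request(url, method="HEAD")
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        return err.code  # 403 and 404 arrive as exceptions in urllib


def classify(status: int) -> str:
    """Interpret a HEAD-bucket status under the usual S3 convention."""
    if status == 200:
        return "exists, anonymous access allowed"
    if status == 403:
        return "exists, access denied"
    if status == 404:
        return "no such bucket"
    return "inconclusive"


# Example usage (not executed here; the endpoint is a placeholder):
#   for name in candidate_bucket_names("acme"):
#       print(name, classify(head_status(bucket_url("http://203.0.113.10:9000", name))))
```

Using HEAD keeps the probe to a single request per candidate and transfers no object data, which matches the no-download constraint described above.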

Receipts

Research

Every survey, case study, and disclosure we've published that touches this layer of the stack. Counts on the cells above tally these directly.

Field cases (2)

Case May 26, 2026

CPAC Strapi CMS — Production API Surface Enumeration

Second node in the CPAC chain. The primary finding is in cpacredis-redisinsight-chain-b-178.128.84.65-2026-05-26.md. The Redis credential prefix cpacredis pivoted to cpac.co.th, which resolved to a St…

Case May 22, 2026

117.50.80.181 — TCI Kindergarten ASR / Speech-Assessment Platform

117.50.80.181:8001 runs the "TCI ASR Service" v3.0.0, a Chinese kindergarten classroom speech-assessment platform. The processing tier has no authentication. An unauthenticated internet caller can sub…

Coordinated disclosures (1)

May 9, 2026

GraphRAG Process Safety API: Full Multi-Stack Auth-Off Exposure (Scaleway FR)

A French operator running an industrial process safety knowledge management RAG application has deployed five separate AI/ML services on a single Scaleway dedicated server with no…
