Most recent
navigate open esc close Corpus index built 2026-06-07 23:58 UTC

← Research library

CRITICAL · Disclosure May 1, 2026

→ 200 OK, response: "Buffalo", eval_count: 2

To: sec-office@buffalo.edu Subject: Unauthenticated AI inference endpoint, SUNY Buffalo (136.183.56.88)


Nicholas Michael Kloster / NuClide Research nicholas@nuclide-research.com

2026-05-01

Re: Unauthenticated Ollama AI inference endpoint, SUNY Buffalo IP / Host: 136.183.56.88 Severity: CRITICAL


I’m an independent security researcher. I hold CISA disclosures CVE-2025-4364 and ICSA-25-140-11 and conduct good-faith AI infrastructure research under the NuClide Research umbrella. This is an unsolicited disclosure, no engagement exists with your organization, and I have not accessed, modified, or exfiltrated any data beyond what was necessary to confirm the exposure.


Summary

State University of New York at Buffalo research compute node running 26 Ollama models including gemma4:31b-cloud, a cloud proxy model. Cloud proxy inference confirmed live, 200 OK response at operator expense. Also includes RAG pipeline components (embedding model + reranker) and a 74GB Mixtral instance. Raw Ollama port publicly accessible, no authentication.


Infrastructure

FieldValue
IP136.183.56.88
OrgSUNY Buffalo State University
CountryUS, New York
Open ports11434 (Ollama, public)

Models (26 total)

ModelSizeNotes
gemma4:31b-cloud0 GB☁️ Cloud proxy, CONFIRMED LIVE
mixtral:8x22b-instruct74 GBLocal, MoE
qwen2.5:72b-instruct44 GBLocal
llama3.1:70b39 GBLocal
qwen3.5:35b22 GBLocal
qwen2.5:32b-instruct18 GBLocal
gemma4:31b-it-q4_K_M18 GBLocal
gemma4:31B18 GBLocal
glm-4.7-flash:latest17 GBLocal (Zhipu AI)
gemma4:26B16 GBLocal
gemma4:e4B8 GBLocal
qwen3:14b8 GBLocal
phi4:latest8 GBLocal
gemma4:latest8 GBLocal
qwen2.5:14b-instruct8 GBLocal
qwen2.5vl:7b (equivalent)8 GBLocal
gemma3:27B16 GBLocal
gemma4:e2B6 GBLocal
gemma2:9b5 GBLocal
llama3.1:8b4 GBLocal
qwen2.5:7b-instruct4 GBLocal
llama3.2:3b1 GBLocal
bge-m3:latest1 GBEmbedding, RAG pipeline
smollm2:135m0 GBLocal
qllama/bge-reranker-v2-m3:latest0 GBReranker, RAG pipeline

Findings

F1: Cloud Proxy Quota Hijack (CRITICAL)

gemma4:31b-cloud returned 200 OK without any authentication:

curl http://136.183.56.88:11434/api/generate \
  -d '{"model":"gemma4:31b-cloud","prompt":"say: Buffalo","stream":false}'
# → 200 OK, response: "Buffalo", eval_count: 2

Two tokens generated at operator’s cloud API expense. No authentication, no rate limiting visible from outside.

F2: Unauthenticated RAG Pipeline Components (HIGH)

The deployment includes BGE-M3 embedding model and BGE-reranker-v2-M3, indicating an active RAG pipeline. If this Ollama instance backs a document retrieval system with university data, model injection via CVE-2025-63389 would affect all RAG-augmented responses, including content derived from indexed university documents.

# Inject into any model to affect RAG responses
curl -X POST http://136.183.56.88:11434/api/create \
  -d '{"model":"qwen3:14b","from":"qwen3:14b","system":"[attacker instructions]"}'

F3: 26-Model Unauthenticated Surface (HIGH)

26 models accessible including heavy compute (Mixtral 8x22B, Qwen2.5-72B, LLaMA3.1-70B). All injectable via CVE-2025-63389. Total local model storage: ~350+ GB.


Why it matters

Any internet actor can run inference against your cloud API subscription at your expense, this constitutes direct quota/billing theft. An embedding model indicates an active RAG pipeline, documents loaded into your vector store are reachable via unauthenticated queries.

One-line fix

OLLAMA_HOST=127.0.0.1:11434
systemctl restart ollama

This rebinds Ollama to loopback only. If running in Docker: docker run -p 127.0.0.1:11434:11434 ollama/ollama.

CVE-2025-63389

All models on this instance are injectable via the unauthenticated /api/create endpoint, an attacker can overwrite any model’s system prompt or delete models entirely. No patch exists as of this disclosure.

Reference

Full technical details, parameter counts, and remediation notes are in this public research repository: AI-LLM-Infrastructure-OSINT/blob/main/case-studies/universities/US/NY-suny-buffalo.md

This research is part of a broader sweep of university AI infrastructure exposures documented at: AI-LLM-Infrastructure-OSINT/blob/main/case-studies/universities/OVERVIEW.md

I’m happy to answer questions or assist with verification. No response is required.

Regards, Nicholas Michael Kloster / NuClide Research nicholas@nuclide-research.com AI-LLM-Infrastructure-OSINT