→ 200 OK, response: "Buffalo", eval_count: 2
To: sec-office@buffalo.edu Subject: Unauthenticated AI inference endpoint, SUNY Buffalo (136.183.56.88)
Nicholas Michael Kloster / NuClide Research nicholas@nuclide-research.com
2026-05-01
Re: Unauthenticated Ollama AI inference endpoint, SUNY Buffalo IP / Host: 136.183.56.88 Severity: CRITICAL
I’m an independent security researcher. I hold CISA disclosures CVE-2025-4364 and ICSA-25-140-11 and conduct good-faith AI infrastructure research under the NuClide Research umbrella. This is an unsolicited disclosure, no engagement exists with your organization, and I have not accessed, modified, or exfiltrated any data beyond what was necessary to confirm the exposure.
Summary
State University of New York at Buffalo research compute node running 26 Ollama models including gemma4:31b-cloud, a cloud proxy model. Cloud proxy inference confirmed live, 200 OK response at operator expense. Also includes RAG pipeline components (embedding model + reranker) and a 74GB Mixtral instance. Raw Ollama port publicly accessible, no authentication.
Infrastructure
| Field | Value |
|---|---|
| IP | 136.183.56.88 |
| Org | SUNY Buffalo State University |
| Country | US, New York |
| Open ports | 11434 (Ollama, public) |
Models (26 total)
| Model | Size | Notes |
|---|---|---|
| gemma4:31b-cloud | 0 GB | ☁️ Cloud proxy, CONFIRMED LIVE |
| mixtral:8x22b-instruct | 74 GB | Local, MoE |
| qwen2.5:72b-instruct | 44 GB | Local |
| llama3.1:70b | 39 GB | Local |
| qwen3.5:35b | 22 GB | Local |
| qwen2.5:32b-instruct | 18 GB | Local |
| gemma4:31b-it-q4_K_M | 18 GB | Local |
| gemma4:31B | 18 GB | Local |
| glm-4.7-flash:latest | 17 GB | Local (Zhipu AI) |
| gemma4:26B | 16 GB | Local |
| gemma4:e4B | 8 GB | Local |
| qwen3:14b | 8 GB | Local |
| phi4:latest | 8 GB | Local |
| gemma4:latest | 8 GB | Local |
| qwen2.5:14b-instruct | 8 GB | Local |
| qwen2.5vl:7b (equivalent) | 8 GB | Local |
| gemma3:27B | 16 GB | Local |
| gemma4:e2B | 6 GB | Local |
| gemma2:9b | 5 GB | Local |
| llama3.1:8b | 4 GB | Local |
| qwen2.5:7b-instruct | 4 GB | Local |
| llama3.2:3b | 1 GB | Local |
| bge-m3:latest | 1 GB | Embedding, RAG pipeline |
| smollm2:135m | 0 GB | Local |
| qllama/bge-reranker-v2-m3:latest | 0 GB | Reranker, RAG pipeline |
Findings
F1: Cloud Proxy Quota Hijack (CRITICAL)
gemma4:31b-cloud returned 200 OK without any authentication:
curl http://136.183.56.88:11434/api/generate \
-d '{"model":"gemma4:31b-cloud","prompt":"say: Buffalo","stream":false}'
# → 200 OK, response: "Buffalo", eval_count: 2
Two tokens generated at operator’s cloud API expense. No authentication, no rate limiting visible from outside.
F2: Unauthenticated RAG Pipeline Components (HIGH)
The deployment includes BGE-M3 embedding model and BGE-reranker-v2-M3, indicating an active RAG pipeline. If this Ollama instance backs a document retrieval system with university data, model injection via CVE-2025-63389 would affect all RAG-augmented responses, including content derived from indexed university documents.
# Inject into any model to affect RAG responses
curl -X POST http://136.183.56.88:11434/api/create \
-d '{"model":"qwen3:14b","from":"qwen3:14b","system":"[attacker instructions]"}'
F3: 26-Model Unauthenticated Surface (HIGH)
26 models accessible including heavy compute (Mixtral 8x22B, Qwen2.5-72B, LLaMA3.1-70B). All injectable via CVE-2025-63389. Total local model storage: ~350+ GB.
Why it matters
Any internet actor can run inference against your cloud API subscription at your expense, this constitutes direct quota/billing theft. An embedding model indicates an active RAG pipeline, documents loaded into your vector store are reachable via unauthenticated queries.
One-line fix
OLLAMA_HOST=127.0.0.1:11434
systemctl restart ollama
This rebinds Ollama to loopback only. If running in Docker: docker run -p 127.0.0.1:11434:11434 ollama/ollama.
CVE-2025-63389
All models on this instance are injectable via the unauthenticated /api/create endpoint, an attacker can overwrite any model’s system prompt or delete models entirely. No patch exists as of this disclosure.
Reference
Full technical details, parameter counts, and remediation notes are in this public research repository: AI-LLM-Infrastructure-OSINT/blob/main/case-studies/universities/US/NY-suny-buffalo.md
This research is part of a broader sweep of university AI infrastructure exposures documented at: AI-LLM-Infrastructure-OSINT/blob/main/case-studies/universities/OVERVIEW.md
I’m happy to answer questions or assist with verification. No response is required.
Regards, Nicholas Michael Kloster / NuClide Research nicholas@nuclide-research.com AI-LLM-Infrastructure-OSINT