Most recent
navigate open esc close Corpus index built 2026-06-07 23:58 UTC

← Research library

CRITICAL · Disclosure May 1, 2026

Us Ny Rit

To: security@rit.edu Subject: Unauthenticated AI inference endpoint, Rochester Institute of Technology (129.21.25.95 (+3 nodes))


Nicholas Michael Kloster / NuClide Research nicholas@nuclide-research.com

2026-05-01

Re: Unauthenticated Ollama AI inference endpoint, Rochester Institute of Technology IP / Host: 129.21.25.95 (+3 nodes) Severity: CRITICAL


I’m an independent security researcher. I hold CISA disclosures CVE-2025-4364 and ICSA-25-140-11 and conduct good-faith AI infrastructure research under the NuClide Research umbrella. This is an unsolicited disclosure, no engagement exists with your organization, and I have not accessed, modified, or exfiltrated any data beyond what was necessary to confirm the exposure.


Summary

Rochester Institute of Technology (RIT) has four externally-accessible Ollama nodes on campus, including an NVIDIA DGX research server with 18 cloud proxy subscriptions (same subscription portfolio as POSTECH/Shiv Nadar/Hanoi) and a student machine with two abliterated QwQ-32B models alongside an embedding model for a RAG pipeline. All nodes injectable via CVE-2025-63389.


Infrastructure

HostnameIPModelsCloudNotes
disco-dgx-spark.wireless.rit.edu129.21.25.952518NVIDIA DGX GPU server
ragdepc.student.rit.edu129.21.149.9740Student machine, abliterated models + RAG
8N610008D0.ad.rit.edu129.21.220.95101 (deepseek)AD-joined workstation
cl5.wireless.rit.edu129.21.146.150190Wireless client, llama2-uncensored

Node: disco-dgx-spark (DGX GPU Server: 18 cloud subscriptions)

Same 18-model cloud portfolio as POSTECH, Shiv Nadar, and Hanoi University:
DeepSeek (v4-pro, v4-flash, v3.2), MiniMax (m2, m2.1, m2.5, m2.7), Kimi (k2.5, k2.6, k2-thinking), GLM (4.6, 4.7, 5, 5.1), Qwen (3.5, coder-next), Gemini (flash-preview), Nemotron.

Local models: qwen3.6:35b (22GB), qwen3.5:27b (16GB), qwen3-coder-next:latest (48GB), llama3.2:3b, smollm2:135m.

All 25 models injectable via CVE-2025-63389. 18 cloud subscriptions accessible through unauthenticated port.


Node: ragdepc.student.rit.edu (STUDENT Machine: Abliterated + RAG)

ModelSizeNotes
qwq:latest14 GBQwQ-32B reasoning model
qwq-uncensored:latest14 GBAbliterated QwQ, safety removed
huihui_ai/qwq-abliterated:32b-Q3_K_M14 GBAbliterated QwQ, safety removed
nomic-embed-text:latest0 GBEmbedding, RAG pipeline
  • Hostname prefix ragdepc suggests “RAG deep PC”, student building a RAG system with abliterated reasoning models
  • Two variants of abliterated QwQ-32B publicly accessible on student hardware
  • Model injection via CVE-2025-63389 affects any RAG pipeline using these models

Node: cl5.wireless.rit.edu (Wireless Client: uncensored model)

19 models including:

  • llama2-uncensored:7b, uncensored Llama 2
  • gpt-oss:latest (12GB), glm-4.7-flash (17GB), lfm2.5-thinking, ShreyanGondaliya/s5:latest
  • Multiple qwen3.5 variants, deepseek-coder

Findings

F1: DGX GPU Server: 18 Cloud Subscriptions Exposed (CRITICAL)

Research-grade NVIDIA DGX server running Ollama with 18 cloud proxy subscriptions, publicly accessible without authentication.

F2: Student Machine: Dual Abliterated Reasoning Models + RAG (HIGH)

Student’s machine running two variants of QwQ-32B with safety fine-tuning removed, alongside an embedding model confirming a RAG pipeline. Both abliterated models and the RAG pipeline are publicly accessible without credentials.

F3: Third Node: llama2-uncensored on Wireless Client (MEDIUM)

Wireless-connected laptop running llama2-uncensored:7b exposed on port 11434.


Pattern Note

The DGX server’s 18-model cloud proxy portfolio (DeepSeek + MiniMax + Kimi + GLM + Qwen + Gemini + Nemotron) appears identically at POSTECH (Korea), Shiv Nadar (India), Hanoi University (Vietnam), and now RIT. This suggests a shared Ollama Connect subscription, demonstration account, or institutional license distributed across multiple universities.


Why it matters

The credential leak (username + SSH public key) exposes your service account to enumeration and credential-stuffing against other services. An embedding model indicates an active RAG pipeline, documents loaded into your vector store are reachable via unauthenticated queries.

One-line fix

OLLAMA_HOST=127.0.0.1:11434
systemctl restart ollama

This rebinds Ollama to loopback only. If running in Docker: docker run -p 127.0.0.1:11434:11434 ollama/ollama.

CVE-2025-63389

All models on this instance are injectable via the unauthenticated /api/create endpoint, an attacker can overwrite any model’s system prompt or delete models entirely. No patch exists as of this disclosure.

Reference

Full technical details, parameter counts, and remediation notes are in this public research repository: AI-LLM-Infrastructure-OSINT/blob/main/case-studies/universities/US/NY-rit.md

This research is part of a broader sweep of university AI infrastructure exposures documented at: AI-LLM-Infrastructure-OSINT/blob/main/case-studies/universities/OVERVIEW.md

I’m happy to answer questions or assist with verification. No response is required.

Regards, Nicholas Michael Kloster / NuClide Research nicholas@nuclide-research.com AI-LLM-Infrastructure-OSINT