Most recent
navigate open esc close Corpus index built 2026-06-07 23:58 UTC

← Research library

HIGH · Disclosure May 1, 2026

Kr Inha

To: security@inha.ac.kr Subject: Unauthenticated AI inference endpoint, INHA University (165.246.39.51)


Nicholas Michael Kloster / NuClide Research nicholas@nuclide-research.com

2026-05-01

Re: Unauthenticated Ollama AI inference endpoint, INHA University IP / Host: 165.246.39.51 Severity: HIGH


I’m an independent security researcher. I hold CISA disclosures CVE-2025-4364 and ICSA-25-140-11 and conduct good-faith AI infrastructure research under the NuClide Research umbrella. This is an unsolicited disclosure, no engagement exists with your organization, and I have not accessed, modified, or exfiltrated any data beyond what was necessary to confirm the exposure.


Summary

INHA University (인하대학교) in Incheon has an Ollama instance with 7 models totalling ~133GB including a local gpt-oss:20b (20.9B params) and two NVIDIA Nemotron-Cascade 30B models.


Infrastructure

FieldValue
IP165.246.39.51
OrganizationINHA UNIVERSITY
CountrySouth Korea
Open ports11434 (Ollama, public)

Model Inventory

ModelSizeNotes
gpt-oss:20b12.1GBLocal inference, 20.9B params, gpt-oss family
hf.co/unsloth/gpt-oss-20b-GGUF:Q8_012.1GBSame weights, direct HF GGUF pull
nemotron-cascade-2:30b24.3GBNVIDIA Nemotron Cascade 2 30B
gemma4:26b-a4b-it-q8_028.1GBGemma 4 Q8
nemotron-3-nano:30b24.3GBNVIDIA Nemotron-3 Nano 30B
qwen3.5:27b22.5GB,
deepseek-r1:14b9.0GB,

Total local storage: ~132GB


Findings

F1: Local gpt-oss:20b and Dual Nemotron Stack (HIGH)

gpt-oss:20b is running locally (12.1GB, 20.9B params). The model family gpt-oss is the OpenAI open-source weights release. Both the standard Ollama-tagged version and the direct HuggingFace GGUF pull are present, suggesting the operator downloaded via hf.co/unsloth/gpt-oss-20b-GGUF:Q8_0 first, then aliased it.

The dual nemotron-cascade-2:30b and nemotron-3-nano:30b stack (both 24.3GB) suggests NVIDIA model evaluation or research use.

F2: CVE-2025-63389 Injectable (HIGH)

All models injectable via unauthenticated /api/create. The Nemotron and gpt-oss models have no system prompts, post-injection inference is unobstructed.


Why it matters

An embedding model indicates an active RAG pipeline, documents loaded into your vector store are reachable via unauthenticated queries.

One-line fix

OLLAMA_HOST=127.0.0.1:11434
systemctl restart ollama

This rebinds Ollama to loopback only. If running in Docker: docker run -p 127.0.0.1:11434:11434 ollama/ollama.

CVE-2025-63389

All models on this instance are injectable via the unauthenticated /api/create endpoint, an attacker can overwrite any model’s system prompt or delete models entirely. No patch exists as of this disclosure.

Reference

Full technical details, parameter counts, and remediation notes are in this public research repository: AI-LLM-Infrastructure-OSINT/blob/main/case-studies/universities/KR/inha.md

This research is part of a broader sweep of university AI infrastructure exposures documented at: AI-LLM-Infrastructure-OSINT/blob/main/case-studies/universities/OVERVIEW.md

I’m happy to answer questions or assist with verification. No response is required.

Regards, Nicholas Michael Kloster / NuClide Research nicholas@nuclide-research.com AI-LLM-Infrastructure-OSINT