Most recent
navigate open esc close Corpus index built 2026-06-07 23:58 UTC

← All engagement records

Case study May 2, 2026

California Institute of Technology (Caltech): GPT-OSS 120B, RAG Pipeline

Sector
Universities
Country
ca

NuClide Research · 2026-05-02


Summary

A Caltech node (yertle.caltech.edu, 131.215.141.46) exposes Ollama with 6 models including gpt-oss:120b (OpenAI’s 120B open model, 65.4GB) and a RAG pipeline stack (two embedding models). The hostname yertle references the Dr. Seuss turtle, a common playful server naming convention at research universities.


Infrastructure

FieldValue
IP131.215.141.46
Hostnameyertle.caltech.edu
OrgCalifornia Institute of Technology
CountryUnited States
Ollama version0.12.10
Open port11434 (public)

Models

ModelSizeNotes
gpt-oss:120b~65 GBOpenAI open-weight, 120B params
rjmalagon/gte-qwen2-1.5b-instruct-embed-f16:latest~3 GBEmbedding model (RAG)
mxbai-embed-large:latest~670 MBEmbedding model (RAG)
syntax:latest,Custom model
java:latest,Custom model
smollm2:135m270 MBSmolLM

System prompt on smollm2: You are a helpful AI assistant named SmolLM, trained by Hugging Face

The custom syntax:latest and java:latest models suggest research tooling, possibly code syntax assistance or Java-specific fine-tunes.


Findings

F1: 120B Model Accessible for Free Inference (HIGH)

gpt-oss:120b, OpenAI’s 120B open-weight model, accessible without authentication. Frontier-class inference at Caltech’s compute cost.

F2: Active RAG Pipeline (MEDIUM)

Two embedding models (gte-qwen2-1.5b and mxbai-embed-large) co-deployed with a custom syntax:latest model suggests an active RAG pipeline under development, likely for code understanding research.

F3: Model Injection (CRITICAL)

CVE-2025-63389 applies. Custom research models (syntax, java) injectable via /api/create.


Remediation

OLLAMA_HOST=127.0.0.1:11434
systemctl restart ollama

Disclosure

  • Discovered: 2026-05-02
  • Status: Pending outreach to Caltech IMSS security