Most recent
navigate open esc close Corpus index built 2026-06-07 23:58 UTC

← All research

Survey Jun 6, 2026

LobeChat Population Survey — 10/12 Fully Open (83.3%, small population)

NuClide Research · 2026-06-06


Executive Summary

LobeChat (github.com/lobehub/lobe-chat) is an open-source ChatGPT-alternative chat interface from Lobehub, a China-origin OSS community. Direct functional parallel to LibreChat. 641 Shodan-indexed; 636 downloaded; only 12 of 636 (1.9%) responded to live HTTP probing. Of the 12 reachable: 10 are in fully-open mode (enabledAccessCode: false AND enabledOAuthSSO: false), 2 are ACCESS_CODE_GATED, 1 of the open instances also has OAuth SSO available alongside no-access-code.

DCWF KSAT coverage

Auto-derived from DCWF AI work-role rule files (ksat-tag).

  • 672 (AI Test & Evaluation Specialist): K7003, K7004, S7070, S7075, T5858, T5919
  • 733 (AI Risk & Ethics Specialist): K7040, K7051, S7067, T5854, T5868
  • overlap (Common AI KSATs (all 5 roles)): K1158, K22, K6311, K6935, K7003, K7048

The 1.9% reachability rate is the lowest observed in any 2026-06-06 survey. Most indexed LobeChat hosts are either stale (Shodan cache aging out), behind reverse proxies that intercept the SSR HTML, or deployed on extremely transient infrastructure. The small reachable subset is correspondingly less statistically robust than the same-day LibreChat / Langfuse / RAGFlow surveys.

Among the reachable subset, the 83.3% AUTH_OFF rate matches the auth-permissive-default cohort norm. The LibreChat counter-example (v0.8.x correcting to 10.3%) does not appear to extend to LobeChat — though the small N means this is best read as suggestive rather than confirmed.


Methodology

StageActionTool
Stage 0Shodan harvest http.title:"LobeChat"shodan CLI (636 records)
Stage 0cHTTP liveness via root GETherald
Stage 1bAuth state via SSR HTML body markersherald lobechat platform config
Stage 3vSource verification: enabledAccessCode and enabledOAuthSSO are LobeChat client config fields embedded in Next.js SSR streams (self.__next_f.push)manual review of LobeChat source

Probe semantics calibration: LobeChat’s server-side rendered HTML embeds the client config as JSON-escaped strings inside Next.js streams. The exact markers used:

  • \"enabledAccessCode\":false → no access-code prompt: anyone can use the chat
  • \"enabledAccessCode\":true → access-code prompt on first visit: gated
  • \"enabledOAuthSSO\":true → OAuth SSO available

Auth modes (from LobeChat source):

  • Both false → fully open, no auth at all
  • enabledAccessCode: true → simple shared-password gate
  • enabledOAuthSSO: true → full identity provider flow (Auth0, Casdoor, Logto, etc.)

Population Results

MetricCountRate
Shodan-indexed641
Downloaded636
Reachable121.9% of indexed
AUTH_OFF (fully open, no access code)1083.3% of reachable
ACCESS_CODE_GATED216.7% of reachable
OAUTH_SSO_ENABLED1(one of the AUTH_OFF hosts also has OAuth)

AUTH_OFF Hosts (full list — small N)

IP:PortCountryProvider
129.146.5.44:8036United StatesOracle Cloud
111.231.14.199:32101ChinaTencent Cloud
101.37.145.198:30002ChinaAlibaba Cloud
139.129.111.70:3020ChinaAlibaba Cloud
140.245.51.103:8036SingaporeOracle Cloud
156.229.163.187:8036United StatesAkile LTD
152.70.238.77:8036KoreaOracle Cloud
192.3.131.11:80United StatesRackNerd LLC
47.239.236.83:3000Hong KongAlibaba Cloud
8.154.30.249:18006ChinaAlibaba Cloud

Notable pattern: 4 of 10 AUTH_OFF hosts are on Oracle Cloud port 8036 across 3 regions (USA, Singapore, Korea). This consistent port + provider combination suggests a Lobehub deployment template — possibly the one-click deploy guide that defaults to no access code for ease-of-demo.


Geographic split

CountryAUTH_OFF
China + Hong Kong + Singapore + Korea6 (60%)
United States3 (30%)
Singapore1 (10%)

The reachable + AUTH_OFF population is 60% APAC-concentrated, consistent with LobeChat’s China-origin OSS community. This is the inverse of LibreChat’s geographic distribution (US-dominant, 34% US).


Comparison with LibreChat (same-day, same-category)

DimensionLibreChatLobeChat
Shodan indexed3,153641
Reachable1,565 (78.3%)12 (1.9%)
Open rate (of reachable)26.3% REG_OPEN83.3% AUTH_OFF
Within-platform correctionv0.8.x = 10.3% (corrected)Not detected at this sample size
Geographic dominanceUS (34%)China/APAC (60%)
Origin communityUS-led OSS (danny-avila)China-led OSS (Lobehub)

The two platforms are direct functional substitutes for self-hosted ChatGPT-alternative deployment. They diverge sharply on auth-default trajectory:

  • LibreChat: explicitly tightened in tagged releases (v0.8.x = 10.3% — Open-WebUI-equivalent)
  • LobeChat: among reachable population, the cohort-default auth-permissive rate (83.3%) persists

Caveat for cohort comparison: the LobeChat reachable N=12 is too small to conclude the absence of correction. The hypothesis space remains: (a) LobeChat ships auth-permissive default still, (b) operators of LobeChat skew toward demo / public-facing deployment intent, (c) reverse proxies in Chinese cloud regions intercept the SSR HTML and Shodan cached the proxy not the LobeChat backend.

The cleanest test of the cohort hypothesis would require either:

  • A larger LobeChat reachable population (different dorks; Censys cross-reference)
  • A direct source-code review of LobeChat’s current default ACCESS_CODE env var behavior

LLM10 surface

LobeChat’s AUTH_OFF mode means anyone visiting the URL invokes completions against whatever LLM providers the operator has configured. The provider list is embedded in the SSR HTML as languageModel: {provider: {enabled: bool}} — enumerable per host with no auth.

LobeChat supports broad provider coverage including:

  • OpenAI, Anthropic, Azure, Google, AWS Bedrock (Western frontier)
  • DeepSeek, Moonshot, Qwen, MiniMax, Zhipu/GLM, Baichuan, Spark/Wenxin/Tongyi (Chinese frontier)
  • Ollama, custom OpenAI-compatible (self-hosted)

A single fully-open LobeChat instance with SERVER-side configured API keys for any of the above is a direct LLM10 Denial-of-Wallet target. Among the 10 AUTH_OFF hosts surveyed, provider enumeration was not exhaustively performed at this sample (provider-name regex requires JSON-escape-aware match; deferred to next round).


Toolchain Provenance

Step 0:    shodan download 'http.title:"LobeChat"' (636 records)
Step 0c:   IP extraction → ip-port.txt (636 unique)
Step 1b:   herald -platform lobechat < ip-port.txt
           - probe id fully_open: / body_contains '\"enabledAccessCode\":false'
           - probe id access_code_set: / body_contains '\"enabledAccessCode\":true'
           - probe id oauth_sso: / body_contains '\"enabledOAuthSSO\":true'
Step 3v:   Probe semantics verified against LobeChat source (Next.js SSR
           __next_f stream embedding `enabledAccessCode` and `enabledOAuthSSO`
           in client config)
Step 12b:  This document

Tool note (herald v0.1.3): the LobeChat config exposed a body_contains matching issue around JSON-escaped quote sequences. Initial probe used literal "enabledAccessCode":false and matched zero hosts. The actual SSR HTML contains the JSON-escaped form \"enabledAccessCode\":false (the page is a JSON-stringified Next.js stream rendered inside HTML script tags). Updated to escaped form; sweep then succeeded. This is a generalizable lesson: Next.js / Remix / SSR-streamed framework HTML embeds config data as JSON-stringified strings with escaped quotes. body_contains probes against SPAs need to account for the escape level.


Research-Program Contribution

LobeChat fills the Chinese-origin OSS chat-UI cohort slot in the 2026-06-06 survey corpus. Same category as LibreChat (US-origin OSS); same functional class; different origin community.

The 83.3% AUTH_OFF rate (small N caveat) is directionally consistent with the uncorrected-cohort norm of Insight #76. Combined with LibreChat’s clear within-platform correction in v0.8.x, the dataset begins to support a sub-hypothesis: Western OSS chat-UI maintainers have begun correcting; Chinese OSS chat-UI maintainers may not yet have.

This is testable against further Chinese-origin OSS AI/LLM platforms surveyed under similar conditions. Bisheng (DataElem), FastGPT (Sealos), and Open Interpreter all candidates for the next-round comparison.

The herald body_contains escape-level limitation surfaced during this survey is documented as a known issue and warrants a v0.2 enhancement: optional regex or fuzzy-match mode for SPA-embedded JSON.