Most recent
navigate open esc close Corpus index built 2026-06-07 23:58 UTC

← All research

Survey Jun 6, 2026

Bisheng Population Survey — Negative Result (Auth-Required Default)

NuClide Research · 2026-06-06


Executive Summary

Bisheng (github.com/dataelement/bisheng) is an open-source LLM application development platform from DataElem (Beijing), focused on enterprise-oriented document AI, RAG, agent orchestration, and workflow building. Direct functional parallel to RAGFlow (also Shanghai-based) and Flowise.

DCWF KSAT coverage

Auto-derived from DCWF AI work-role rule files (ksat-tag).

  • 672 (AI Test & Evaluation Specialist): K7003, K7004, S7068, S7070, S7075, T5858, T5904, T5919
  • 733 (AI Risk & Ethics Specialist): K7040, T5854, T5868
  • overlap (Common AI KSATs (all 5 roles)): K1158, K1159, K22, K6311, K6935, K7003

Negative result. The DataElem Bisheng platform ships auth-required by default across all data endpoints. The 30 Shodan-indexed hits on the "BISHENG" dork largely matched unrelated services (Chinese banks named after the historical figure Bi Sheng — inventor of movable type printing; Synology NAS user-named “BISheng”; Yongyou/UFIDA ERP). The actual DataElem Bisheng population is small (~4 confirmed reachable). All confirmed instances return HTTP 401 ("Missing cookie access_token_cookie") on the canonical endpoints.

This is the counter-example needed for the cohort-comparison sub-hypothesis that emerged from the LobeChat survey same-day. The CN-origin OSS cohort is not uniformly auth-permissive: Bisheng (DataElem, Beijing) ships auth-required where LobeChat (Lobehub, Hangzhou) ships fully-open. The sub-hypothesis must be refined: the cohort default is platform-maintainer-specific, not jurisdiction-wide.


Methodology

StageActionDetail
Stage 0Shodan harvest "BISHENG"30 results — high false-positive rate
Stage 1bPer-host probe /api/v1/user/info, /api/v1/config, /login, /api/v1/versionManual Python probe (no herald config yet)
Stage 3vConfirmed Bisheng identity via /api/v1/version returning DataElem-style JSON wrapper {"status_code":200,"status_message":"SUCCESS","data":{"version":"X.Y.Z"}}Per-host
Stage FPFiltered non-Bisheng matches (banks, ERP, NAS)Manual

Population Results

MetricCount
Shodan "BISHENG" matches30
Web-UI candidate ports (80/443/3001/5000/5001/8089)25
Confirmed DataElem Bisheng4
Auth-required (all data endpoints return 401)4 / 4 (100%)
Auth-permissive0 / 4 (0%)
Other matches (banks, NAS, ERP — FPs)~26

Confirmed DataElem Bisheng instances

IP:PortVersionCountryProviderAuth state
14.22.86.97:3001v2.0.0CNAliyun401 — auth required (data endpoints), version disclosed
120.25.222.22:3001v2.4.0-beta1CNAliyun401 — auth required, version disclosed
120.26.245.160:3001v2.4.0-beta1-fixCNAliyun401 — auth required, version disclosed
122.247.1.59:3001(500 on all endpoints)CNTencent Cloudservice degraded / unhealthy

Sample false-positive matches removed

IP:PortTitleWhy flagged
112.5.139.115:8089协同管理软件.A8N V9.0SP1Yongyou A8N (Chinese ERP) — unrelated
114.32.121.143:5001BISheng - Synology NASUser-named Synology NAS — unrelated
117.149.2.27:443浙江南浔农村商业银行股份有限公司Zhejiang Nanxun Rural Commercial Bank — unrelated
183.134.216.216:443(same bank)Same — unrelated

Cross-platform FP class: “BISHENG” / “Bi Sheng” / “畢昇” refers to Bi Sheng, the 11th-century Chinese inventor of movable-type printing. Multiple Chinese commercial entities (especially in banking, ERP, and traditional cultural sectors) use the name. Future Bisheng surveys should disambiguate via /api/v1/version JSON response signature, not title alone.


Information Disclosure Surface

While auth posture is sound, /api/v1/version is publicly accessible on all confirmed Bisheng instances:

GET /api/v1/version
{"status_code":200,"status_message":"SUCCESS","data":{"version":"2.0.0"}}

Version disclosure on a Chinese enterprise AI platform with no public vulnerability database tracking would only become security-relevant once a Bisheng-specific CVE class is established. Current state: low-severity informational. Not flagged as a finding-worth-disclosing.


Why This Negative Result Matters

The LobeChat survey (same day) found 10/12 reachable (83.3%) fully-open. The emerging sub-hypothesis was “CN-origin OSS chat-UI / AI infrastructure trends auth-permissive.” Bisheng refutes the jurisdiction-wide form of that hypothesis:

PlatformOriginDefault auth postureOpen rate (reachable)
LobeChatCN — Lobehub (Hangzhou)Open (no access code)83.3% AUTH_OFF
RAGFlowCN — InfiniFlow (Shanghai)Registration-open87.2% REGISTER_OPEN
BishengCN — DataElem (Beijing)Auth-required0% open
DifyCN — Dify.AI (Shanghai)Auth-required0.9% open

The CN cohort splits 2-2: LobeChat + RAGFlow ship auth-permissive defaults; Bisheng + Dify ship auth-required. The maintainer’s deployment culture (DX-first one-click demo vs enterprise-customer-first secure-by-default) is the determining variable, not the jurisdiction.

Refined sub-hypothesis: the cohort default is platform-maintainer-specific, reflecting whether the upstream maintainer optimizes for:

  • “Clone, docker compose up, immediately demo” — auth-permissive default (Langfuse, RAGFlow, Phoenix, Flowise, LobeChat)
  • “Self-host for our enterprise customers” — auth-required default (Bisheng, Dify, AnythingLLM)

This refinement preserves the second clause of Insight #76 (the rate is movable) and discards the over-strong jurisdictional claim.


Toolchain Provenance

Step 0:    shodan download '"BISHENG"' (30 records)
Step 1b:   manual urllib probe — herald lobechat config doesn't apply
Step 3v:   /api/v1/version JSON signature as Bisheng-identity confirmation
Step FP:   manual filter of bank/NAS/ERP false-positives via title
Step 12b:  This document

Decision: no herald config for Bisheng at this time. The population is too small (4 confirmed) to justify the per-platform config maintenance, and the auth-required default means there is no auth-permissive finding to detect. If a Bisheng CVE class emerges that surfaces an unauth disclosure endpoint, a herald config can be added then.


Research-Program Contribution

Negative result properly counter-examples the LobeChat-suggested CN-jurisdiction sub-hypothesis. The cohort hypothesis #76 is refined from “platform-cohort dependent” to “platform-maintainer dependent” — the deployment culture of the upstream maintainer team is the determining variable, not the geographic/jurisdictional origin of the platform.

The 4-platform CN-origin matrix is now:

  • Auth-permissive: LobeChat (Lobehub Hangzhou), RAGFlow (InfiniFlow Shanghai) — 2
  • Auth-required: Bisheng (DataElem Beijing), Dify (Dify.AI Shanghai) — 2

An interesting demographic gradient appears: Beijing-based DataElem (enterprise-customer-first, China-government-adjacent enterprise AI vendor) and Shanghai-based Dify.AI (commercial SaaS) both auth-required. Hangzhou-based Lobehub (community OSS, DX-first) and Shanghai-based InfiniFlow (academic-origin RAG research) both auth-permissive. Enterprise-customer-focused vs community/research-focused maintainer culture is the splitter — not geography.