Baidu

frontier_builder · Beijing, China · founded 2000 · BIDU

Employees: 5000+ · Stage: Public

Composite score (default-balanced): 59.2 · elevated

Chinese technology conglomerate and search engine operator developing the ERNIE model family. Publicly traded on NASDAQ and HKEX, with AI integrated across its search, cloud, and enterprise products.

Analyst summary

Baidu is China's largest search engine and the operator of ERNIE Bot, one of the first officially approved Chinese generative AI services. As a PRC-jurisdiction company subject to extensive state approval requirements, it is a non-starter for most Western enterprise use cases regardless of technical capability.

Not a viable option for Western enterprise adoption; PRC jurisdiction and content alignment make it a domestic-China tool.

Rating: avoid

Compliance posture

SOC 2 Type II	No / not disclosed
ISO 27001	Yes
ISO 42001 (AI management system)	No / not disclosed
FedRAMP authorized	No / not disclosed
GDPR compliant	No / not disclosed
CCPA compliant	No / not disclosed
HIPAA compliant	No / not disclosed
NIST AI RMF aligned	No / not disclosed
CSA STAR certified	No / not disclosed

Data handling

Trains on user data	Yes
Outputs feed model improvement	Yes
Data retention period	Not disclosed
Can delete user data on request	No / not disclosed
Default data residency	CN
Encryption at rest	Yes (details not disclosed)
Encryption in transit	Yes
DPA available	No / not disclosed
Public subprocessor list	No / not disclosed
HIPAA BAA available	No / not disclosed

IP profile

User owns outputs	unclear
Vendor claims output rights	No / not disclosed
Input IP protection	weak
Indemnification offered	No / not disclosed
Copyright shield program	No / not disclosed
Commercial use permitted	Yes
Training data provenance	Not disclosed
Known IP lawsuits	No / not disclosed

Jurisdiction

Incorporation country	China
Incorporation jurisdiction risk	high
Subject to US jurisdiction	No / not disclosed
Subject to EU jurisdiction	No / not disclosed
Subject to China jurisdiction	Yes
Subject to Russia jurisdiction	No / not disclosed
Government data access risk	critical
Five Eyes aligned	No / not disclosed
Adequate privacy jurisdiction	No / not disclosed

Governance

Publishes model cards	No / not disclosed
Publishes transparency reports	No / not disclosed
Has AI ethics board	No / not disclosed
Safety testing disclosed	No / not disclosed
Red-teaming program	No / not disclosed
Government contracts	Yes
Terms of service	https://yiyan.baidu.com/terms
Privacy policy	https://yiyan.baidu.com/privacy

Incidents on record

No incidents on file.

OWASP LLM Top 10 cross-walk

TrustAtlas dimensions that materially address each OWASP risk. Use to translate this vendor's compliance posture and data-handling stance into the application-security vocabulary your security team already uses.

LLM01	Prompt Injection	User-supplied prompts manipulate model behaviour to bypass intended controls.	Security, Transparency, Dependency chain
LLM02	Sensitive Information Disclosure	Models leak PII, PHI, secrets, or proprietary data through outputs.	Data handling, IP exposure, Jurisdiction
LLM03	Supply Chain	Risk propagates from upstream models, datasets, plug-ins, and vendors.	Dependency chain, Business stability, Security
LLM04	Data and Model Poisoning	Adversarial training data or fine-tuning input degrades model integrity.	Data handling, Transparency, Security
LLM05	Improper Output Handling	Downstream systems blindly trust model output, enabling injection downstream.	IP exposure, Transparency
LLM06	Excessive Agency	Agents granted overbroad tool, identity, or permission scopes cause harm.	Dependency chain, Transparency, Jurisdiction
LLM07	System Prompt Leakage	System prompts containing secrets or logic are extracted via crafted input.	Data handling, Transparency
LLM08	Vector and Embedding Weaknesses	Vector stores and RAG pipelines leak or contaminate retrieved context.	Data handling, Security
LLM09	Misinformation	Hallucinated, biased, or fabricated outputs treated as authoritative.	Transparency, Regulatory compliance, Business stability
LLM10	Unbounded Consumption	Cost, denial-of-service, and resource-exhaustion attacks against LLM endpoints.	Security, Business stability

Full framework reference: https://trustatlas.pages.dev/framework/owasp-llm-top-10
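
The cross-walk above can also be read in reverse: given one or more TrustAtlas dimensions where a vendor looks weak, which OWASP risks deserve a closer look? The following is a minimal sketch of that lookup. The dimension tags are copied from the table; the names `OWASP_CROSSWALK` and `risks_touching` are hypothetical and are not part of any TrustAtlas tooling.

```python
# Dimension tags per OWASP LLM risk, copied from the cross-walk table above.
# OWASP_CROSSWALK and risks_touching are illustrative names, not a TrustAtlas API.
OWASP_CROSSWALK = {
    "LLM01": ["Security", "Transparency", "Dependency chain"],
    "LLM02": ["Data handling", "IP exposure", "Jurisdiction"],
    "LLM03": ["Dependency chain", "Business stability", "Security"],
    "LLM04": ["Data handling", "Transparency", "Security"],
    "LLM05": ["IP exposure", "Transparency"],
    "LLM06": ["Dependency chain", "Transparency", "Jurisdiction"],
    "LLM07": ["Data handling", "Transparency"],
    "LLM08": ["Data handling", "Security"],
    "LLM09": ["Transparency", "Regulatory compliance", "Business stability"],
    "LLM10": ["Security", "Business stability"],
}


def risks_touching(dimensions):
    """Return the OWASP risk IDs whose tags include any of the given dimensions."""
    wanted = set(dimensions)
    return [risk_id for risk_id, tags in OWASP_CROSSWALK.items() if wanted & set(tags)]


# Jurisdiction is rated high risk for this vendor, so list the risks it bears on.
print(risks_touching(["Jurisdiction"]))  # ['LLM02', 'LLM06']
```

A lookup like this only narrows the list of risks to discuss with a security team; it does not say how severe any of them is for a given deployment.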

NIST AI RMF cross-walk

How each NIST AI RMF function is supported by the dimensions TrustAtlas scores.

GOVERN	Govern	Establish AI governance structure: policies, roles, accountability.	Regulatory compliance, Jurisdiction, Transparency, Business stability
MAP	Map	Establish AI context: intended purpose, use cases, capabilities, and risks.	Transparency, Dependency chain, Data handling, IP exposure
MEASURE	Measure	Quantitative + qualitative risk assessment: testing, benchmarks, monitoring.	Security, Data handling, Transparency
MANAGE	Manage	Treat identified risks: mitigation, controls, incident response, lifecycle.	Regulatory compliance, Security, Dependency chain, Business stability

Full framework reference: https://trustatlas.pages.dev/framework/nist-ai-rmf

Cited sources

Field	Source	Verified
data_handling.data_residency_options	https://cloud.baidu.com/doc/Agreements/s/Mjwvy4eos	2026-04-19 by admin
data_handling.trains_on_user_data	https://yiyan.baidu.com/agreement	2026-04-19 by admin
governance.government_ties	https://www.reuters.com/technology/chinas-baidu-launches-ernie-bot-public-2023-08-31/	2026-04-19 by admin
jurisdiction_profiles.incorporation_country	https://ir.baidu.com/sec-filings	2026-04-19 by admin
jurisdiction_profiles.sanctions_risk	https://www.sec.gov/cgi-bin/browse-edgar?action=getcompany&CIK=0001329099	2026-04-19 by admin
jurisdiction_profiles.subject_to_china_jurisdiction	https://www.cac.gov.cn/2023-07/13/c_1690898327029107.htm	2026-04-19 by admin
security_compliance.gdpr_compliant	https://yiyan.baidu.com/privacy	2026-04-19 by admin
security_compliance.iso_27001	https://cloud.baidu.com/doc/Reference/s/Kk86h0h8b	2026-04-19 by admin

Questions to ask before signing

Vendor-agnostic baseline. Send these to the vendor and require written answers before contract.

01. Provide your most recent SOC 2 Type II report (with bridge letter if applicable).
02. Describe your training-data provenance and customer opt-out mechanics in writing.
03. List all sub-processors and confirm notification policy for material additions.
04. Confirm BAA availability and signed-BAA process if we process PHI.
05. Describe rate-limiting, quota, and circuit-breaker controls protecting our usage.
06. Provide your model card or equivalent disclosure documenting intended use, limitations, and known failure modes.
07. Describe your prompt-injection defences and red-team posture against OWASP LLM Top 10 risks.
08. Confirm data residency options and which sub-regions our data may touch.
09. Provide incident-response SLAs, security-event notification timelines, and the most recent pen-test report summary.
10. Confirm output ownership terms and any indemnification or copyright-shield programs available.
11. Describe acquisition-risk safeguards and what happens to our data on a change of control.
12. List foundation-model dependencies and how upstream-model risk is mitigated.

Methodology + caveats

Composite scores use the default-balanced weight profile (25% data handling, 20% IP exposure, 15% jurisdiction, 15% security, 10% regulatory compliance, 8% transparency, 5% business stability, 2% dependency chain). All facts are sourced from the vendor's own public disclosures, public regulatory filings, or reputable secondary reporting — see the cited sources table above. This pack is decision-support material, not legal advice or audit evidence.
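
The weighting described above amounts to a weighted average of per-dimension scores. The sketch below shows that arithmetic under stated assumptions: the weights are the default-balanced percentages quoted above, but the per-dimension scores are hypothetical placeholders (this pack does not publish Baidu's per-dimension values), a 0-100 scale is assumed, and `composite_score` is an illustrative name. It is not expected to reproduce the 59.2 figure.

```python
# Default-balanced weights in percent, as quoted in the methodology text above.
DEFAULT_BALANCED = {
    "data handling": 25,
    "IP exposure": 20,
    "jurisdiction": 15,
    "security": 15,
    "regulatory compliance": 10,
    "transparency": 8,
    "business stability": 5,
    "dependency chain": 2,
}


def composite_score(dimension_scores, weights=DEFAULT_BALANCED):
    """Weighted average of per-dimension scores (assumed to share one 0-100 scale)."""
    if sum(weights.values()) != 100:
        raise ValueError("weights must sum to 100 percent")
    missing = set(weights) - set(dimension_scores)
    if missing:
        raise KeyError(f"missing dimension scores: {sorted(missing)}")
    return sum(weights[name] * dimension_scores[name] for name in weights) / 100


# Hypothetical inputs: every dimension at 50 except jurisdiction at 90.
example_scores = {name: 50 for name in DEFAULT_BALANCED}
example_scores["jurisdiction"] = 90
print(composite_score(example_scores))  # 56.0
```

Because jurisdiction carries a 15% weight, moving that one dimension by 40 points shifts the composite by 6 points in this example, which illustrates how much a single dimension can move the headline number under this profile.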