TrustAtlas Procurement Pack
← Full vendor profile   

Baidu

frontier_builder · China , Beijing · founded 2000 · BIDU

Employees: 5000+ · Stage: Public

59.2
elevated
default-balanced
composite score

Chinese technology conglomerate and search engine operator developing the ERNIE model family. Publicly traded on NASDAQ and HKEX, integrated AI across search, cloud, and enterprise products.

Analyst summary
Baidu is China's largest search engine and the operator of ERNIE Bot, one of the first officially approved Chinese generative AI services. As a PRC-jurisdiction company with deep state approval obligations, it is a non-starter for most Western enterprise use cases regardless of technical capability.
Not a viable option for Western enterprise adoption; PRC jurisdiction and content alignment make it a domestic-China tool.
Rating: avoid

Compliance posture

SOC 2 Type IINo / not disclosed
ISO 27001Yes
ISO 42001 (AI management system)No / not disclosed
FedRAMP authorizedNo / not disclosed
GDPR compliantNo / not disclosed
CCPA compliantNo / not disclosed
HIPAA compliantNo / not disclosed
NIST AI RMF alignedNo / not disclosed
CSA STAR certifiedNo / not disclosed

Data handling

Trains on user datayes
Outputs feed model improvementyes
Data retention periodnot_disclosed
Can delete user data on requestNo / not disclosed
Default data residencyCN
Encryption at restYes (not_disclosed)
Encryption in transitYes
DPA availableNo / not disclosed
Public subprocessor listNo / not disclosed
HIPAA BAA availableNo / not disclosed

IP profile

User owns outputsunclear
Vendor claims output rightsNo / not disclosed
Input IP protectionweak
Indemnification offeredNo / not disclosed
Copyright shield programNo / not disclosed
Commercial use permittedYes
Training data provenancenot_disclosed
Known IP lawsuitsNo / not disclosed

Jurisdiction

Incorporation countryChina
Incorporation jurisdiction riskhigh
Subject to US jurisdictionNo / not disclosed
Subject to EU jurisdictionNo / not disclosed
Subject to China jurisdictionYes
Subject to Russia jurisdictionNo / not disclosed
Government data access riskcritical
Five Eyes alignedNo / not disclosed
Adequate privacy jurisdictionNo / not disclosed

Governance

Publishes model cardsNo / not disclosed
Publishes transparency reportsNo / not disclosed
Has AI ethics boardNo / not disclosed
Safety testing disclosedNo / not disclosed
Red-teaming programNo / not disclosed
Government contractsYes
Terms of servicehttps://yiyan.baidu.com/terms
Privacy policyhttps://yiyan.baidu.com/privacy

Incidents on record

No incidents on file.

OWASP LLM Top 10 cross-walk

TrustAtlas dimensions that materially address each OWASP risk. Use to translate this vendor's compliance posture and data-handling stance into the application-security vocabulary your security team already uses.

LLM01
Prompt Injection
User-supplied prompts manipulate model behaviour to bypass intended controls.
SecurityTransparencyDependency chain
LLM02
Sensitive Information Disclosure
Models leak PII, PHI, secrets, or proprietary data through outputs.
Data handlingIP exposureJurisdiction
LLM03
Supply Chain
Risk propagates from upstream models, datasets, plug-ins, and vendors.
Dependency chainBusiness stabilitySecurity
LLM04
Data and Model Poisoning
Adversarial training data or fine-tuning input degrades model integrity.
Data handlingTransparencySecurity
LLM05
Improper Output Handling
Downstream systems blindly trust model output, enabling injection downstream.
IP exposureTransparency
LLM06
Excessive Agency
Agents granted overbroad tool, identity, or permission scopes cause harm.
Dependency chainTransparencyJurisdiction
LLM07
System Prompt Leakage
System prompts containing secrets or logic are extracted via crafted input.
Data handlingTransparency
LLM08
Vector and Embedding Weaknesses
Vector stores and RAG pipelines leak or contaminate retrieved context.
Data handlingSecurity
LLM09
Misinformation
Hallucinated, biased, or fabricated outputs treated as authoritative.
TransparencyRegulatory complianceBusiness stability
LLM10
Unbounded Consumption
Cost, denial-of-service, and resource-exhaustion attacks against LLM endpoints.
SecurityBusiness stability

Full framework reference: https://trustatlas.pages.dev/framework/owasp-llm-top-10

NIST AI RMF cross-walk

How each NIST AI RMF function is supported by the dimensions TrustAtlas scores.

GOVERN
Govern
Establish AI governance structure: policies, roles, accountability.
Regulatory complianceJurisdictionTransparencyBusiness stability
MAP
Map
Establish AI context: intended purpose, use cases, capabilities, and risks.
TransparencyDependency chainData handlingIP exposure
MEASURE
Measure
Quantitative + qualitative risk assessment: testing, benchmarks, monitoring.
SecurityData handlingTransparency
MANAGE
Manage
Treat identified risks: mitigation, controls, incident response, lifecycle.
Regulatory complianceSecurityDependency chainBusiness stability

Full framework reference: https://trustatlas.pages.dev/framework/nist-ai-rmf

Cited sources

FieldSource
data_handling.data_residency_options https://cloud.baidu.com/doc/Agreements/s/Mjwvy4eos
Verified 2026-04-19 by admin
data_handling.trains_on_user_data https://yiyan.baidu.com/agreement
Verified 2026-04-19 by admin
governance.government_ties https://www.reuters.com/technology/chinas-baidu-launches-ernie-bot-public-2023-08-31/
Verified 2026-04-19 by admin
jurisdiction_profiles.incorporation_country https://ir.baidu.com/sec-filings
Verified 2026-04-19 by admin
jurisdiction_profiles.sanctions_risk https://www.sec.gov/cgi-bin/browse-edgar?action=getcompany&CIK=0001329099
Verified 2026-04-19 by admin
jurisdiction_profiles.subject_to_china_jurisdiction https://www.cac.gov.cn/2023-07/13/c_1690898327029107.htm
Verified 2026-04-19 by admin
security_compliance.gdpr_compliant https://yiyan.baidu.com/privacy
Verified 2026-04-19 by admin
security_compliance.iso_27001 https://cloud.baidu.com/doc/Reference/s/Kk86h0h8b
Verified 2026-04-19 by admin

Questions to ask before signing

Vendor-agnostic baseline. Send these to the vendor and require written answers before contract.

  1. 01. Provide your most recent SOC 2 Type II report (with bridge letter if applicable).
  2. 02. Describe your training-data provenance and customer opt-out mechanics in writing.
  3. 03. List all sub-processors and confirm notification policy for material additions.
  4. 04. Confirm BAA availability and signed-BAA process if we process PHI.
  5. 05. Describe rate-limiting, quota, and circuit-breaker controls protecting our usage.
  6. 06. Provide your model card or equivalent disclosure documenting intended use, limitations, and known failure modes.
  7. 07. Describe your prompt-injection defences and red-team posture against OWASP LLM Top 10 risks.
  8. 08. Confirm data residency options and which sub-regions our data may touch.
  9. 09. Provide incident-response SLAs, security-event notification timelines, and the most recent pen-test report summary.
  10. 10. Confirm output ownership terms and any indemnification or copyright-shield programs available.
  11. 11. Describe acquisition-risk safeguards and what happens to our data on a change of control.
  12. 12. List foundation-model dependencies and how upstream-model risk is mitigated.

Methodology + caveats

Composite scores use the default-balanced weight profile (25% data handling, 20% IP exposure, 15% jurisdiction, 15% security, 10% regulatory compliance, 8% transparency, 5% business stability, 2% dependency chain). All facts are sourced from the vendor's own public disclosures, public regulatory filings, or reputable secondary reporting — see the cited sources table above. This pack is decision-support material, not legal advice or audit evidence.