built throughORANGEBOX·see what it ships·$1 →
A matte-black machined globe with a single bio-cyan equatorial line — where data lives matters.

AtomEons / Learn / trust / data-residency

Data residency by AI provider

Where your prompts and outputs actually live, by vendor. As of June 2026, best-effort.

Data residency is the boring half of AI procurement. It is also the half that gets a deal killed in the last review meeting. A model that costs ten cents per million tokens does not matter if your DPO cannot answer where the prompt ran, where the response was written to disk, who else may have logged it, and for how long. This page is an honest, vendor-by-vendor reference. Not a sales pitch. Not a horoscope. We list what each provider publishes on its own trust portal, where the gaps are, and what we could not verify without a contract in hand. A few words on what residency actually means, because vendors are sloppy with it. Residency usually has three layers: where inference runs (the GPU that generates the response), where data is stored at rest (logs, abuse-monitoring buffers, training corpora), and where endpoint side-services run (tokenizers, image transcoders, code interpreters, retrieval indexes). Most "EU-resident" claims you see refer to layer two. Layer one — the actual inference — often still routes through US infrastructure even when the dashboard says EU. Read the fine print. The Anthropic docs we cite below state this explicitly: as of June 2026 the workspace geo for first-party API is only `"us"`, and the new `inference_geo` parameter on Opus 4.6 and Sonnet 4.6 supports only `"us"` and `"global"` — there is no first-party `"eu"` value. For EU-only Claude, you go through Bedrock or Vertex inference profiles. We also separate the residency question from the data-retention question. Residency asks where. Retention asks for how long. The two interact: zero-data-retention (ZDR) terms can change residency from "rest in country X" to "never written to disk anywhere." Most major US providers will sign ZDR on enterprise tiers. Most do not enable it by default. Always check the order form, not the marketing page. Throughout, dates and certification scopes are stated as of June 2026 based on each provider's own published trust portal. Some details change month to month. Treat this as a starting point and verify against the cited source before signing.

What the three layers of residency actually mean

Before reading the per-provider tables, internalize these distinctions. Every vendor slides between them.

  • Inference geo — the data center where the GPU actually generated your response. This is the layer most exposed to compelled access by the host country's government, because the cleartext token stream is in memory there.
  • Storage geo — where logs, abuse-monitoring buffers, and any persisted artifacts live at rest. Usually 30 days by default on commercial APIs, often reducible to 0 days under a ZDR addendum.
  • Endpoint-services geo — where supporting infrastructure runs: tokenizers, content-safety classifiers, image pipelines, retrieval indexes, code interpreters. Easy to forget. A 'EU-resident' deployment that calls a US classifier is, in practice, sending your prompt to the US.
  • Training reuse — separate from residency, but always asked in the same breath. Default on enterprise API tiers from Anthropic, OpenAI, Google, AWS, Mistral, and Cohere: customer prompts are NOT used to train base models. Verify on the order form.
  • BAA scope — a Business Associate Agreement is a contract under HIPAA. The fact that a vendor 'has' a BAA does not mean every product is covered. Bedrock got added to AWS's HIPAA-eligible list on 10 February 2026; some Bedrock features remain out of scope. Read the schedule.

Provider index — at a glance

Quick scan of where each provider stood as of June 2026, drawn from their published trust portals and docs. Treat 'yes' as 'available under enterprise contract' unless noted otherwise.

ProviderAnthropic (direct API)
Inference regionsUS (workspace geo `us` only); `inference_geo` global or US on Opus/Sonnet 4.6+
BAAYes (Claude for Enterprise & API)
DPAYes
SOC 2 Type IIYes
ISO 27001Yes (27001:2022)
FedRAMPNo (not listed June 2026)
ZDR optionYes (ZDR addendum)
ProviderAnthropic via AWS Bedrock
Inference regionsUS, EU (Frankfurt/Paris/Ireland), APAC profiles
BAAYes (via AWS BAA, Bedrock HIPAA-eligible since Feb 10 2026)
DPAYes (AWS DPA)
SOC 2 Type IIYes (AWS)
ISO 27001Yes (AWS)
FedRAMPYes, Bedrock in GovCloud US-West
ZDR optionStateless by default
ProviderAnthropic via Google Vertex
Inference regionsUS, EU (10 regions), APAC
BAAYes (via Google BAA)
DPAYes (Google DPA)
SOC 2 Type IIYes (Google)
ISO 27001Yes (Google)
FedRAMPYes, FedRAMP High since July 2025
ZDR optionYes (per Google DPA addendum)
ProviderOpenAI (direct API & ChatGPT Enterprise)
Inference regionsUS, EU, UK, CA, JP, KR, SG, IN, AU, UAE — storage at rest only; inference defaults to US
BAAYes (enterprise tiers)
DPAYes
SOC 2 Type IIYes
ISO 27001Yes
FedRAMPModerate (some products)
ZDR optionYes (qualifying orgs)
ProviderOpenAI via Azure
Inference regions28+ Azure regions; Global/Data Zone/Regional deployment modes
BAAYes (Microsoft BAA)
DPAYes (MS DPA)
SOC 2 Type IIYes (MS)
ISO 27001Yes (MS)
FedRAMPYes (MS High)
ZDR optionYes (modified abuse monitoring + ZDR amendment)
ProviderGoogle Vertex AI / Gemini
Inference regionsMulti-region; europe-west1/4 etc.; regional residency GA April 2025
BAAYes
DPAYes
SOC 2 Type IIYes (SOC 1/2/3)
ISO 27001Yes (27001/27017/27018/27701/42001)
FedRAMPYes, FedRAMP High (July 2025)
ZDR optionYes (contractual addendum)
ProviderAWS Bedrock
Inference regionsus-east-1, us-west-2, eu-west-1, eu-central-1, ap-southeast-1, GovCloud US-West, others
BAAYes (Bedrock added Feb 10 2026)
DPAYes
SOC 2 Type IIYes (since Aug 15 2023)
ISO 27001Yes
FedRAMPYes (High in GovCloud)
ZDR optionStateless API; no prompts stored by default
ProviderMistral (la Plateforme)
Inference regionsEU default (Paris-region), explicit US endpoint optional
BAANot published as of June 2026
DPAYes (EU-native DPA)
SOC 2 Type IIYes (Type II)
ISO 27001Yes (27001 & 27701)
FedRAMPNo
ZDR optionAvailable on enterprise tier
ProviderCohere
Inference regionsUS-Central (GCP) as primary; multi-jurisdictional DPA; limited true multi-region
BAAYes (HIPAA, standard BAA)
DPAYes (with SCCs)
SOC 2 Type IIYes (Type II, annual)
ISO 27001Trust portal lists; verify scope
FedRAMPNo
ZDR optionConfigurable ephemeral mode
ProviderTogether AI
Inference regionsUS primarily; EU (Sweden) live since Sept 2025; APAC dedicated
BAAYes (on enterprise contracts)
DPAYes
SOC 2 Type IIYes (Type II)
ISO 27001Verify on trust portal
FedRAMPNo
ZDR optionZDR on enterprise
ProviderFireworks AI
Inference regionsUS, EU (Frankfurt/Iceland), APAC (Tokyo); airgapped EKS option
BAAYes
DPAYes
SOC 2 Type IIYes (Type II)
ISO 27001Verify on trust portal
FedRAMPNo
ZDR optionZDR by default on enterprise
ProviderReplicate
Inference regionsUS-focused; community-tier developer platform
BAANot published as enterprise-grade BAA
DPAYes (DPA)
SOC 2 Type IIVerify on trust portal
ISO 27001Verify on trust portal
FedRAMPNo
ZDR optionNot the default product posture
ProviderGroq
Inference regionsUS, Canada, EU (Helsinki, Finland), Saudi Arabia (Dammam, sovereign), more 2026
BAAVerify on trust.groq.com
DPAYes
SOC 2 Type IIYes
ISO 27001Verify on trust portal
FedRAMPNo
ZDR optionVerify on trust portal
ProviderCerebras
Inference regionsUS (CA, TX, MN — Condor Galaxy), UAE/MENA via Core42, Buffalo NY via TeraWulf
BAAVerify directly
DPAYes (enterprise)
SOC 2 Type IIVerify directly
ISO 27001Verify directly
FedRAMPNo
ZDR optionVerify directly

Anthropic — what is actually shipping

Anthropic's compliance certifications, as published on its privacy center on the cited URL, are SOC 2 Type I and Type II, ISO 27001:2022, and ISO/IEC 42001:2023. HIPAA-ready configuration with a BAA is available on Claude for Enterprise and on the API. The Anthropic trust portal at trust.anthropic.com is the source of truth for the SOC 3 summary; the SOC 2 detailed report sits behind NDA. FedRAMP authorization is not listed on Anthropic's privacy center as of June 2026 — if you need FedRAMP for Claude, route through AWS Bedrock in GovCloud US-West or through Vertex AI, both of which carry FedRAMP High. On residency itself, Anthropic's docs page is clear and clinical. The `inference_geo` API parameter (Opus 4.6, Sonnet 4.6, and later) takes only `"us"` or `"global"`. The workspace geo controlling storage at rest is currently only `"us"`. US-only inference is priced at 1.1x the standard rate. There is no first-party EU workspace today. EU-resident Claude is reached via AWS Bedrock EU inference profiles (Frankfurt, Paris, Ireland) or via Vertex's ten EU regions and the eu multi-region endpoint. Standard API log retention was reduced from 30 days to 7 days as of 14 September 2025. ZDR addenda are available for enterprise customers and zero out the 7-day window.

OpenAI — global storage, US inference

OpenAI publishes one of the broadest data-residency footprints in the market: as of mid-2026, storage-at-rest in Europe, the United Kingdom, the United States, Canada, Japan, South Korea, Singapore, India, Australia, and the United Arab Emirates. The Asia program (Japan, India, Singapore, South Korea) was announced 8 May 2025. This is meaningful — and it has a sharp caveat that buyers regularly miss. OpenAI's own announcement states: residency applies to data at rest, not to inference, whose default location continues to be the United States. In other words, your conversations may live on disk in Tokyo, but the GPU that generated the response was in Texas or Iowa. For full per-region inference, route through Azure OpenAI. Azure OpenAI gives you Global, Data Zone (EU or US), and Regional deployment modes across 28+ Azure regions, governed by the Microsoft Online Services DPA. ZDR on Azure requires approval for modified abuse monitoring and a Zero Data Retention amendment; without that, Azure holds prompt/response data in a 30-day abuse-monitoring buffer. Microsoft carries SOC 2, ISO 27001, FedRAMP High, HIPAA BAA, and the EU Cloud Code of Conduct. SOC 2 alone does not guarantee geographic boundaries — it evaluates control design and effectiveness — so the SOC 2 report is not a residency report. Read both.

Google Vertex AI and AWS Bedrock — the partner-operated lanes

If you want Claude or a major open model with the strictest residency posture available today, you almost certainly want it through one of the two big clouds. Both inherit hyperscaler compliance.

Google Vertex AI / Gemini

Cite: docs.cloud.google.com/vertex-ai/generative-ai/docs/learn/data-residency

FedRAMP High since July 2025. SOC 1/2/3, ISO 9001, ISO/IEC 27001/27017/27018/27701/42001. Regional data residency GA since April 2025 for Gemini API, Vertex AI agents, and secured Gemini Apps. EU residency through europe-west1, europe-west4, and others; the `eu` multi-region endpoint guarantees EU-only processing under Google's DPA. Zero-retention terms available via DPA addendum for eligible customers.

AWS Bedrock

Cite: aws.amazon.com/bedrock/security-compliance

Stateless API by default — prompts and responses are not stored on AWS's side. SOC 1/2/3 since 15 August 2023. HIPAA-eligible since 10 February 2026 — a meaningful upgrade for healthcare workloads. FedRAMP High in GovCloud US-West. ISO and CSA STAR Level 2 in scope. EU residency through eu-west-1 (Ireland), eu-central-1 (Frankfurt), and Paris.

Microsoft Azure OpenAI

Cite: azure.microsoft.com/en-us/blog (Data Zones announcement)

Closest thing to a true global per-region inference plane for GPT-class models. 28+ regions, Global/Data Zone/Regional deployment selectors, ZDR amendment available. Inherits Azure's SOC 2, ISO 27001, FedRAMP High, HIPAA BAA.

Mistral — the EU-native default

Mistral is the only major provider on this list where EU residency is the default rather than an opt-in. Mistral AI is headquartered in Paris and operates under French corporate law; la Plateforme stores data in the EU unless the customer explicitly uses the US endpoint. The Paris-region data center came online mid-2026 with reportedly more than 13,800 NVIDIA GPUs; a second site near Borlänge, Sweden with EcoDataCenter is under construction for 2027. Mistral's help center confirms it complies with SOC 2 Type II and ISO 27001/27701 frameworks. As of June 2026, Mistral does not publish a FedRAMP authorization and does not publish a standard BAA for US healthcare workloads on its public help center — verify directly with sales if HIPAA scope matters. The reason Mistral keeps showing up in EU public-sector AI procurement is the combination of EU jurisdiction, EU-default residency, and signed framework agreements with the governments of France and Germany running through 2030.

Cohere, Together, Fireworks, Replicate — the inference-platform layer

These four sit one rung below the hyperscalers in posture. All four are usable in production. Compliance scope varies sharply.

  • Cohere — primary infrastructure on GCP US-Central. Annual SOC 2 Type II. HIPAA compliant with a standard BAA available. Multi-jurisdictional DPA incorporating the EU Standard Contractual Clauses (approved 4 June 2021). The honest caveat: Cohere does not currently offer true multi-region data residency outside the US — a meaningful limit if you need EU-only inference and not just EU-resident logging.
  • Together AI — US-primary with EU residency live in Sweden since September 2025. Enterprise SOC 2 Type II, HIPAA on enterprise contracts, dedicated data-residency controls and ZDR on the enterprise tier. Voice platform layer adds its own enterprise compliance bundle.
  • Fireworks AI — multi-region global fleet across US, Frankfurt, Iceland, Tokyo. Fully HIPAA-eligible, SOC 2 Type II, GDPR-compliant. Zero data retention by default on the enterprise product. Airgapped EKS deployment is on offer for the most regulated customers.
  • Replicate — developer-and-community surface. Strong model catalog, easy experimentation. As of June 2026 not the right choice if you need formal enterprise BAA and FedRAMP — those are simply not the posture they sell. Use Replicate for prototyping and route the production workload to a hyperscaler or to Together/Fireworks.

Groq and Cerebras — the dedicated-silicon lane

Groq and Cerebras both run their own inference data centers on their own purpose-built silicon (Groq's LPU and Cerebras's WSE wafer-scale chip). Both have notable Middle East infrastructure footprints, which matters for compliance teams thinking about cross-border exposure. Groq, as of late 2025, operates data centers across the United States, Canada, Europe (a Helsinki, Finland site launched late 2025), and the Middle East. Groq secured a USD 1.5 billion investment from Saudi Arabia and is expanding a Dammam, Saudi Arabia data center as part of HUMAIN's sovereign infrastructure — all activity in that region is contractually aligned to Saudi data-sovereignty rules. Public 2026 plans: more than a dozen additional sites. The trust portal at trust.groq.com is the canonical source for current certifications; verify SOC 2 and ISO scope there directly before relying on any specific certification claim. Cerebras delivers inference largely through partnerships. The Condor Galaxy network with Core42 (formerly G42 Holding US LLC) provides about 20 exaFLOPs of AI compute across California, Texas, and Minnesota. Core42 also commissioned Maximus-01 at TeraWulf's Lake Mariner campus in Buffalo, New York in November 2025 (powered by ~9,000 AMD Instinct MI300X GPUs — not strictly Cerebras silicon, but the same enterprise partner). The G42/UAE relationship is a structural fact of Cerebras's business; if your compliance posture excludes UAE-connected supply chains, that is the conversation to have on day one of vendor diligence. Cerebras's specific SOC 2 and BAA posture for the inference cloud (as opposed to the hardware business) should be verified directly — the public S-1 / DRS filings cited below address corporate structure and customer concentration more than residency.

Where buyers reliably get burned

Three failure modes we keep seeing in 2026 procurement reviews. Each is avoidable on day one of vendor diligence. First, mistaking storage residency for inference residency. OpenAI's Asia residency announcement says it plainly: 'targeted for data that is stored or is at rest and not data that is being used for inference by a model, whose default location continues to be the US.' If your DPO wrote a policy that says EU data must be processed in the EU, you are not compliant on OpenAI direct API even with Asia residency turned on. Use Azure OpenAI's Regional or Data Zone deployments for that. Second, assuming SOC 2 proves geography. SOC 2 proves control design and operating effectiveness over the trust services criteria. It does not prove your data stayed in any particular country. A SOC 2 report is necessary but not sufficient for residency compliance. Pair it with a regional inference commitment in the order form. Third, assuming a vendor BAA covers every product the vendor sells. Bedrock got added to AWS's HIPAA-eligible services list on 10 February 2026. The fact that AWS has had a BAA for a decade did not retroactively cover Bedrock before that date. Read the HIPAA-eligible-services schedule, not the marketing page.

How AtomEons handles this internally

We treat residency the same way we treat any other gate in our pipeline: it is checked deterministically before code ships, not asserted in a marketing line. Every provider we integrate with is logged in our internal trust register with five fields — published inference regions, storage default retention, ZDR availability, BAA scope, and the URL of the canonical trust portal page we read it from. When a provider updates their trust portal, the register entry gets a date stamp and a diff. We do not use a provider in a regulated workload without that record, and we do not claim a residency posture publicly that we cannot back with a citation. This page is the public-facing version of that register. If you spot something we got wrong, write to the address on the contact page. We will fix it and credit the catch.

Sources

  1. [01]

    Anthropic Privacy Center lists SOC 2 Type I & II, ISO 27001:2022, ISO/IEC 42001:2023, and HIPAA-ready configuration with BAA available.

    https://privacy.claude.com/en/articles/10015870-what-certifications-has-anthropic-obtained
  2. [02]

    Anthropic's data-residency docs confirm `inference_geo` accepts only `us` or `global`; workspace geo is currently US-only; Opus 4.6 and Sonnet 4.6 support the parameter at 1.1x pricing.

    https://platform.claude.com/docs/en/manage-claude/data-residency
  3. [03]

    Anthropic Trust Portal is the source for SOC 3 summary and certification verification.

    https://trust.anthropic.com/
  4. [04]

    OpenAI announced data residency in Japan, India, Singapore, and South Korea on 8 May 2025; default inference location remains the US.

    https://openai.com/index/introducing-data-residency-in-asia/
  5. [05]

    OpenAI data residency is available in Europe, UK, US, Canada, Japan, South Korea, Singapore, India, Australia, and the UAE for storage at rest.

    https://openai.com/index/expanding-data-residency-access-to-business-customers-worldwide/
  6. [06]

    OpenAI publishes its enterprise privacy posture including DPA, SOC 2, and ZDR availability for qualifying organizations.

    https://openai.com/business-data/
  7. [07]

    Azure OpenAI introduced Data Zones for EU and US in addition to Global and Regional deployment modes.

    https://azure.microsoft.com/en-us/blog/enterprise-trust-in-azure-openai-service-strengthened-with-data-zones/
  8. [08]

    Vertex AI provides regional data residency via region-pinned endpoints including europe-west1, europe-west4, and the eu multi-region endpoint.

    https://docs.cloud.google.com/vertex-ai/generative-ai/docs/learn/data-residency
  9. [09]

    Vertex AI Search and Generative AI on Vertex AI achieved FedRAMP High authorization in July 2025.

    https://cloud.google.com/blog/topics/public-sector/vertex-ai-search-and-generative-ai-with-gemini-achieve-fedramp-high
  10. [10]

    Gemini Enterprise documents SOC 1/2/3, ISO 27001/27017/27018/27701/42001 compliance scope.

    https://docs.cloud.google.com/gemini/enterprise/docs/compliance-security-controls
  11. [11]

    Amazon Bedrock is in scope for ISO, SOC, CSA STAR Level 2, GDPR, HIPAA eligibility, and FedRAMP High in GovCloud US-West.

    https://aws.amazon.com/bedrock/security-compliance/
  12. [12]

    AWS Alps blog confirms Bedrock's regional residency and Switzerland-specific data-protection guidance.

    https://aws.amazon.com/blogs/alps/security_bedrock/
  13. [13]

    Amazon Bedrock is available in AWS GovCloud (US) with FedRAMP High authorization.

    https://docs.aws.amazon.com/govcloud-us/latest/UserGuide/govcloud-bedrock.html
  14. [14]

    Mistral AI states it complies with SOC 2 Type II and ISO 27001/27701 frameworks.

    https://help.mistral.ai/en/articles/347638-do-you-have-soc-2-or-iso-27001-certification
  15. [15]

    Mistral AI defaults to EU data hosting unless customers explicitly use the US API endpoint.

    https://help.mistral.ai/en/articles/347629-where-do-you-store-my-data-or-my-organization-s-data
  16. [16]

    Cohere Trust Center publishes its annual SOC 2 Type II posture and HIPAA-ready BAA availability.

    https://trustcenter.cohere.com/
  17. [17]

    Cohere documents its multi-jurisdictional DPA incorporating EU Standard Contractual Clauses and ephemeral-data configuration option.

    https://cohere.com/enterprise-data-commitments
  18. [18]

    Fireworks AI publishes HIPAA, SOC 2 Type II, GDPR compliance, ZDR by default, and multi-region deployment including airgapped EKS.

    https://fireworks.ai/enterprise
  19. [19]

    Groq Trust Center publishes Groq's current security and compliance posture.

    https://trust.groq.com/
  20. [20]

    Groq operates data centers across the US, Canada, Europe, and the Middle East with further 2026 expansion announced.

    https://groq.com/newsroom/groq-solidifies-status-as-emerging-hyperscaler-with-new-global-deployment
  21. [21]

    Groq is building inferencing infrastructure with Aramco Digital in Saudi Arabia under local data-sovereignty rules.

    https://groq.com/newsroom/aramco-digital-and-groq-announce-progress-in-building-the-worlds-largest-inferencing-data-center-in-saudi-arabia-following-leap-mou-signing
  22. [22]

    Groq secured a USD 1.5 billion Saudi Arabia investment to expand its Dammam data-center footprint.

    https://www.datacenterdynamics.com/en/news/groq-secures-15bn-from-saudi-arabia-to-expand-ai-inference-infrastructure-in-the-region/
  23. [23]

    Cerebras Systems S-1 documents its Framework Agreement with Core42 (formerly G42 Holding US LLC) for AI supercomputer deployment.

    https://www.sec.gov/Archives/edgar/data/2021728/000162828026025762/exhibit1011-sx1.htm
  24. [24]

    Core42 commissioned Maximus-01 at TeraWulf's Lake Mariner data center in Buffalo, NY in November 2025 with ~9,000 AMD Instinct MI300X GPUs.

    https://www.middleeastainews.com/p/core42-us-footprint-expands-adding
LAB · ATOMEONS · MARCO ISLAND FLÆONS RESEARCH · 12 PAPERS · CC-BY 4.0ORANGEBOX v1.0.0-beta · TURBO-OPTIMIZE CLAUDE · SHIPPED 2026-05-30B00KMAKR v3.2.0 · AI PUBLISHING COCKPIT · MAC + WINDOWSFREE LAUNCH WEEK · ENDS JUNE 6 · §4A NO-SAAS LOCKFOUNDER'S VIEW · NEXT BROADCAST IN ...CITE THE WORK · FORWARD THE LINK · NO ALGORITHMLAB · ATOMEONS · MARCO ISLAND FLÆONS RESEARCH · 12 PAPERS · CC-BY 4.0ORANGEBOX v1.0.0-beta · TURBO-OPTIMIZE CLAUDE · SHIPPED 2026-05-30B00KMAKR v3.2.0 · AI PUBLISHING COCKPIT · MAC + WINDOWSFREE LAUNCH WEEK · ENDS JUNE 6 · §4A NO-SAAS LOCKFOUNDER'S VIEW · NEXT BROADCAST IN ...CITE THE WORK · FORWARD THE LINK · NO ALGORITHM