
AI Regulation: A Practical Guide for AI Teams

AI regulation is no longer theoretical. The EU AI Act is in force. NIST published its AI Risk Management Framework. The UK, US, Canada, and China are each building their own regimes. This guide explains what regulation actually demands — in technical terms, for the people building and shipping AI systems.

Last updated: 2026 · Covers EU AI Act, NIST AI RMF, and UK AI regulation.

Why AI Regulation Now

AI systems are making consequential decisions — credit decisions, hiring screens, medical triage, bail risk scores — at scale, with no external accountability infrastructure. That gap is what regulation is closing.

The driving logic across every AI regulation regime is the same: if an AI system affects people, someone must be accountable for it, and accountability requires evidence. Evidence means documentation of training data, audit logs of system behavior, and a verifiable record that the system was built and tested as claimed.

The core compliance challenge

Most AI teams have no tamper-evident record of what their training data actually was. They may know where it came from, but they cannot prove it — certainly not in the format a regulator or auditor expects. This is the gap that dataset certification closes.
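
To make the challenge concrete: the simplest form of tamper-evident provenance is a cryptographic digest of the exact bytes used in training, captured at training time. A minimal sketch in Python, illustrative only and not CertifiedData's implementation:

import hashlib

def dataset_fingerprint(path: str) -> str:
    """Compute a SHA-256 digest of a dataset file.

    The digest changes if a single byte of the data changes, so recording
    it when a model is trained gives a minimal, reproducible provenance
    anchor that can later be compared against the file an auditor is shown.
    """
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(1 << 20), b""):  # 1 MiB chunks
            h.update(chunk)
    return h.hexdigest()

# Recorded alongside the training run, e.g. in experiment metadata:
# digest = dataset_fingerprint("train.parquet")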

The Global AI Regulation Landscape

Four major regulatory frameworks are shaping AI governance for organizations operating globally.

Framework | Jurisdiction | Binding? | Key focus
EU AI Act | European Union | Yes (regulation) | Risk-based classification, training data, logging, conformity assessment
NIST AI RMF | United States (federal) | Voluntary (mandatory for some federal agencies) | Govern, Map, Measure, Manage; process-oriented
UK AI Framework | United Kingdom | Voluntary (sector-specific enforcement) | Principles-based via sector regulators (FCA, ICO, CQC)
China AI Regulation | China | Yes (multiple decrees) | Recommendation systems, deep synthesis, AIGC labeling

For organizations with any EU exposure, the EU AI Act is the highest-priority obligation. Its extraterritorial reach means any provider whose AI system is placed on the EU market, or whose outputs are used in the EU, must comply, regardless of where the organization is incorporated.

The EU AI Act — Core Structure

The EU AI Act entered into force in August 2024. It creates a tiered risk model and assigns specific compliance obligations to each tier.

Unacceptable risk

Prohibited
  • Social scoring by governments
  • Real-time biometric surveillance in public
  • Emotion recognition in workplace/education
  • Predictive policing based on personal traits

High risk

Full compliance
  • Biometric ID systems
  • Critical infrastructure AI
  • Education and vocational training
  • Employment and worker management
  • Essential services (credit, insurance)
  • Law enforcement
  • Migration and border control

Limited risk

Transparency obligations
  • Chatbots (must disclose AI)
  • Deepfakes (must disclose synthetic origin)
  • Emotion recognition systems

Minimal risk

No specific obligations
  • AI-enabled video games
  • Spam filters
  • AI-powered recommendation systems (basic)

EU AI Act Enforcement Timeline

Aug 2024

EU AI Act enters into force

Published in the Official Journal. The 24-month transition period for most provisions begins.

Feb 2025

Prohibited AI practices banned

Unacceptable risk practices (social scoring, real-time biometric surveillance) prohibited.

Aug 2025

GPAI model obligations apply

General-Purpose AI model providers must comply: transparency, capability evaluation, incident reporting.

Aug 2026

High-risk AI system obligations apply

Article 10 (training data), Article 12 (logging), Article 13 (transparency), Article 18 (documentation keeping) — full compliance required.

Aug 2027

Legacy and extended deadlines

GPAI models placed on the market before August 2025 must be brought into compliance, and obligations begin to apply for high-risk AI systems embedded in products covered by Annex I sectoral legislation. High-risk systems already on the market before August 2026 are otherwise caught only once they undergo significant design changes.

Technical Obligations — What You Must Actually Build

For high-risk AI systems, the EU AI Act is not a checkbox exercise. It requires specific technical infrastructure. Here are the four articles that directly affect AI engineering teams.

Article 10

Training Data — Documentation and Governance

  • Document the origin and provenance of every training dataset
  • Record the data collection methodology and any pre-processing
  • Demonstrate that training data is relevant and representative
  • Identify and address known biases
  • For synthetic data: document the generation algorithm, parameters, and validation

How CertifiedData addresses this

Certified synthetic datasets include a cryptographic record of the generation algorithm, timestamp, and schema — satisfying the provenance documentation requirement.
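
As a sketch of what such a record can contain, the hypothetical certificate payload below bundles the provenance facts Article 10 asks for; the field names are illustrative, not CertifiedData's actual schema:

# Hypothetical dataset-certificate payload (field names are illustrative).
certificate = {
    "dataset_sha256": "9f2b4c...",            # digest of the certified dataset bytes
    "generation_algorithm": "ctgan",          # synthetic-data generator used
    "generation_parameters": {"epochs": 300, "seed": 42},
    "schema": {"age": "int", "income": "float", "region": "category"},
    "validation": {"constraint_checks_passed": True},
    "issued_at": "2026-01-15T09:30:00Z",      # certification timestamp
    "issuer": "certifieddata.io",
    "signature": "ed25519:...",               # signs the canonical payload above
}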

Article 10 technical reference →

Article 12

Logging — Automatic Record of AI System Operation

  • High-risk AI systems must automatically log events during operation
  • Logs must be tamper-resistant
  • Logs must capture sufficient detail for post-hoc audit
  • Retention: logs must be kept for at least six months, and longer where the intended purpose of the system requires it

How CertifiedData addresses this

The DecisionLedger component provides hash-chained, tamper-evident decision logs that satisfy Article 12 for AI systems using CertifiedData infrastructure.
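
The principle behind hash-chained logs fits in a few lines. The toy sketch below illustrates the idea and is not the DecisionLedger implementation: each entry commits to the hash of the previous one, so altering or deleting any past record invalidates every hash that follows it.

import hashlib
import json
import time

class HashChainedLog:
    """Append-only log in which each entry commits to its predecessor."""

    def __init__(self):
        self.entries = []
        self._prev = "0" * 64  # genesis value for the first entry

    def append(self, event: dict) -> dict:
        body = {"ts": time.time(), "event": event, "prev": self._prev}
        digest = hashlib.sha256(
            json.dumps(body, sort_keys=True).encode()
        ).hexdigest()
        entry = {**body, "hash": digest}
        self.entries.append(entry)
        self._prev = digest
        return entry

    def verify(self) -> bool:
        """Recompute the chain; any tampered entry breaks verification."""
        prev = "0" * 64
        for e in self.entries:
            body = {"ts": e["ts"], "event": e["event"], "prev": e["prev"]}
            digest = hashlib.sha256(
                json.dumps(body, sort_keys=True).encode()
            ).hexdigest()
            if e["prev"] != prev or e["hash"] != digest:
                return False
            prev = digest
        return True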

Article 12 technical reference →

Article 13

Transparency — Instructions for Deployers

  • Providers must supply instructions for use to deployers
  • Instructions must describe performance characteristics and limitations
  • Training data characteristics must be disclosed to deployers
  • Known limitations and foreseeable misuse cases must be documented

How CertifiedData addresses this

Dataset certificates provide a machine-readable, standardized disclosure of training data characteristics that can be included in system instructions.
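
A sketch of what that machine-readable disclosure might contain; the keys are hypothetical, not a standardized Article 13 schema:

# Hypothetical deployer-facing disclosure derived from a dataset certificate.
training_data_disclosure = {
    "certificate_id": "cert-2026-0142",
    "data_type": "synthetic",
    "source_domain": "retail credit applications",
    "known_limitations": [
        "under-represents applicants over 75",
        "no records before 2019",
    ],
    "intended_use": "pre-screening only, not final credit decisions",
}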

Article 13 technical reference →

Article 18

Documentation Keeping — Technical Documentation Retention

  • Providers must retain technical documentation for 10 years post-market
  • Documentation must cover the full development lifecycle
  • Training dataset records must be included
  • Documentation must be available to authorities on request

How CertifiedData addresses this

Certified artifacts are stored in the CertifiedData artifact registry with long-term retention. Each certificate includes a tamper-evident hash anchored to a transparency log.

Article 18 technical reference →

NIST AI Risk Management Framework

The NIST AI RMF structures AI risk management around four core functions. It is voluntary but is becoming a de facto standard for US organizations and a complement to the EU AI Act for global teams.

Govern

Establish policies, accountability structures, and organizational processes for AI risk. Define who is responsible for AI system decisions.

Map

Identify and categorize AI risks in context. Document the AI system's purpose, stakeholders, and potential impacts before deployment.

Measure

Analyze and quantify AI risks. Evaluate training data quality, model performance, bias metrics, and system reliability through testing.

Manage

Prioritize and address risks. Implement controls, document responses, and maintain audit trails of risk management decisions.

Dataset certification addresses the Measure and Manage functions by providing an objective, tamper-evident record of training data quality and provenance.

How Dataset Certification Satisfies Regulatory Requirements

Regulators don't accept assertions — they require evidence. A dataset certificate is a cryptographic artifact that constitutes evidence. Here's how it maps to specific obligations.

Regulation obligation | What regulators check | Certificate provides
Article 10 (data provenance) | Origin and collection methodology of training data | Dataset hash, generation algorithm, timestamp, issuer signature
Article 10 (data quality) | Representativeness and absence of known errors | Integrity score, schema documentation, constraint validation
Article 12 (tamper resistance) | Logs cannot be altered after the fact | Ed25519 signature verifiable against public key registry
Article 18 (retention) | Technical records available for 10 years | Immutable artifact registry with retention policy
NIST RMF (Measure) | Documented evidence of data quality assessment | Cryptographically anchored integrity report
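
The tamper-resistance row is checkable by anyone who holds the issuer's public key. The sketch below shows what that verification step could look like using the cryptography package, assuming raw 32-byte Ed25519 keys; the registry lookup itself is an assumption and is not shown:

from cryptography.exceptions import InvalidSignature
from cryptography.hazmat.primitives.asymmetric.ed25519 import Ed25519PublicKey

def verify_certificate(cert_body: bytes, signature: bytes,
                       issuer_pubkey: bytes) -> bool:
    """Check that cert_body was signed by the issuer's Ed25519 key.

    issuer_pubkey is the raw 32-byte public key, e.g. as fetched from a
    public key registry.
    """
    try:
        Ed25519PublicKey.from_public_bytes(issuer_pubkey).verify(
            signature, cert_body
        )
        return True
    except InvalidSignature:
        return False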

Frequently Asked Questions

What is the EU AI Act?

The EU AI Act is a binding EU regulation that classifies AI systems by risk level and imposes specific compliance obligations on providers and deployers of high-risk systems. It entered into force in August 2024. Most high-risk obligations — including training data documentation, logging, and record-keeping — apply from August 2026.

Does the EU AI Act apply to organizations outside the EU?

Yes. The EU AI Act has extraterritorial reach: any provider whose AI system is placed on the EU market or whose outputs are used in the EU must comply, regardless of where the organization is incorporated. This is similar to GDPR's territorial scope.

What does AI regulation require for training data?

Under EU AI Act Article 10, providers of high-risk AI systems must document the origin, collection methodology, and provenance of every training dataset. Synthetic datasets must include documentation proving they were generated using controlled, auditable processes. Dataset certification provides this documentation in machine-verifiable form.

What is a high-risk AI system?

High-risk AI systems are defined in Annex III of the EU AI Act and include AI used in biometric identification, critical infrastructure, education, employment, essential services, law enforcement, migration, and administration of justice. If your system is used in any of these domains, the full set of compliance obligations applies.

How does synthetic data help with AI regulation compliance?

Synthetic data removes real personal data from training pipelines, reducing GDPR and privacy exposure. When certified, it also provides a tamper-evident record proving the dataset's origin, algorithm, and integrity — directly satisfying the training data documentation requirements of Article 10 of the EU AI Act.

What is the NIST AI Risk Management Framework?

The NIST AI RMF is a voluntary US framework for managing AI risk across four functions: Govern, Map, Measure, and Manage. It is widely used by US federal agencies and large enterprises. Unlike the EU AI Act, it carries no direct penalty for non-compliance, but federal agencies increasingly require NIST RMF alignment for AI procurement.