Data Card
A data card is a structured document describing a dataset used to train or evaluate an AI model: its source, composition, collection process, intended use, and limitations.
What is Data Card?
Data cards extend the model card idea to datasets. They typically describe collection methodology, demographic composition, labelling process, known biases, licensing, and recommended evaluation slices. Data cards are essential for downstream users to assess fitness for purpose. The EU AI Act explicitly requires data governance documentation for high-risk AI.
How does Data Card apply to enterprise AI?
Enterprises that fine-tune on internal data should produce data cards for those datasets, both for compliance and for future maintainers who will need to know what is in them.
Related terms
Model Card
AI Risk Management
Data Residency
External references
Need help applying Data Card to your enterprise? Submit a short brief and we reply within one business day.