Data drift
ML Concept
Overview
Use casedetecting changes in data distribution over time in machine learning systems
Technical
Integrates with
Knowledge graph stats
Claims87
Avg confidence91%
Avg freshness100%
Last updatedUpdated 5 days ago
Trust distribution
100% unverified
Governance
Not assessed
Data drift
concept
Changes in input data distribution between training and production environments affecting model performance.
Compare with...is type of
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| machine learning concept | ○Unverified | High | Fresh | 1 |
| machine learning monitoring concept | ○Unverified | High | Fresh | 1 |
subcategory of
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| ML monitoring | ○Unverified | High | Fresh | 1 |
| MLOps | ○Unverified | High | Fresh | 1 |
field of study
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| machine learning | ○Unverified | High | Fresh | 1 |
category
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| machine learning monitoring concept | ○Unverified | High | Fresh | 1 |
causes problem
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| model performance degradation | ○Unverified | High | Fresh | 1 |
primary use case
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| detecting changes in data distribution over time in machine learning systems | ○Unverified | High | Fresh | 1 |
| monitoring changes in data distribution over time in machine learning systems | ○Unverified | High | Fresh | 1 |
| monitoring changes in input data distribution over time | ○Unverified | High | Fresh | 1 |
| detecting changes in input data distribution over time in machine learning systems | ○Unverified | High | Fresh | 1 |
| detecting changes in input data distribution that may degrade machine learning model performance | ○Unverified | High | Fresh | 1 |
| monitoring changes in input data distribution compared to training data | ○Unverified | High | Fresh | 1 |
| detecting changes in statistical properties of input data over time in machine learning systems | ○Unverified | High | Fresh | 1 |
addressed by tool
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| AWS SageMaker Model Monitor | ○Unverified | High | Fresh | 1 |
| Evidently AI | ○Unverified | High | Fresh | 1 |
| Azure Machine Learning | ○Unverified | High | Fresh | 1 |
| Google Cloud AI Platform | ○Unverified | Moderate | Fresh | 1 |
impacts
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| model performance | ○Unverified | High | Fresh | 1 |
causes problem for
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| machine learning model performance | ○Unverified | High | Fresh | 1 |
integrates with
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| model monitoring systems | ○Unverified | High | Fresh | 1 |
| MLOps platforms | ○Unverified | Moderate | Fresh | 1 |
| feature stores | ○Unverified | Moderate | Fresh | 1 |
occurs in domain
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| machine learning operations | ○Unverified | High | Fresh | 1 |
| production machine learning systems | ○Unverified | High | Fresh | 1 |
commonly affects
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| production machine learning systems | ○Unverified | High | Fresh | 1 |
requires
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| baseline data distribution | ○Unverified | High | Fresh | 1 |
| reference dataset | ○Unverified | High | Fresh | 1 |
| baseline reference data | ○Unverified | High | Fresh | 1 |
| continuous data collection | ○Unverified | High | Fresh | 1 |
| statistical monitoring techniques | ○Unverified | High | Fresh | 1 |
common in
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| production machine learning systems | ○Unverified | High | Fresh | 1 |
detection method
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| distribution comparison | ○Unverified | High | Fresh | 1 |
| statistical hypothesis testing | ○Unverified | High | Fresh | 1 |
| Kolmogorov-Smirnov test | ○Unverified | Moderate | Fresh | 1 |
part of
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| MLOps practices | ○Unverified | High | Fresh | 1 |
| MLOps workflow | ○Unverified | High | Fresh | 1 |
| MLOps pipeline | ○Unverified | High | Fresh | 1 |
based on
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| statistical hypothesis testing | ○Unverified | High | Fresh | 1 |
mitigation strategy
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| model retraining | ○Unverified | High | Fresh | 1 |
| online learning | ○Unverified | High | Fresh | 1 |
related concept
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| concept drift | ○Unverified | High | Fresh | 1 |
commonly measured using
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| statistical distance metrics | ○Unverified | High | Fresh | 1 |
| Kolmogorov-Smirnov test | ○Unverified | High | Fresh | 1 |
| Population Stability Index | ○Unverified | Moderate | Fresh | 1 |
related to
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| model monitoring | ○Unverified | High | Fresh | 1 |
| concept drift | ○Unverified | High | Fresh | 1 |
supports model
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| supervised learning models | ○Unverified | High | Fresh | 1 |
| unsupervised learning models | ○Unverified | High | Fresh | 1 |
monitoring approach
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| continuous data monitoring | ○Unverified | High | Fresh | 1 |
detects
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| statistical changes in feature distributions | ○Unverified | High | Fresh | 1 |
monitored by
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| AWS SageMaker Model Monitor | ○Unverified | High | Fresh | 1 |
| TensorFlow Data Validation | ○Unverified | Moderate | Fresh | 1 |
| Evidently AI | ○Unverified | Moderate | Fresh | 1 |
| Whylabs | ○Unverified | Moderate | Fresh | 1 |
part of discipline
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| MLOps | ○Unverified | High | Fresh | 1 |
requires technique
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| baseline data distribution establishment | ○Unverified | High | Fresh | 1 |
measured using
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| statistical distance metrics | ○Unverified | High | Fresh | 1 |
| KL divergence | ○Unverified | Moderate | Fresh | 1 |
| Kolmogorov-Smirnov test | ○Unverified | Moderate | Fresh | 1 |
| population stability index | ○Unverified | Moderate | Fresh | 1 |
| Wasserstein distance | ○Unverified | Moderate | Fresh | 1 |
detected by
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| statistical tests | ○Unverified | High | Fresh | 1 |
| Kolmogorov-Smirnov test | ○Unverified | Moderate | Fresh | 1 |
| Population Stability Index | ○Unverified | Moderate | Fresh | 1 |
also known as
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| dataset shift | ○Unverified | High | Fresh | 1 |
| covariate shift | ○Unverified | Moderate | Fresh | 1 |
causes
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| model performance degradation | ○Unverified | High | Fresh | 1 |
supports protocol
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| Kolmogorov-Smirnov test | ○Unverified | Moderate | Fresh | 1 |
| Population Stability Index | ○Unverified | Moderate | Fresh | 1 |
can trigger
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| model retraining | ○Unverified | Moderate | Fresh | 1 |
types include
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| covariate shift | ○Unverified | Moderate | Fresh | 1 |
| prior probability shift | ○Unverified | Moderate | Fresh | 1 |
triggers
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| model retraining | ○Unverified | Moderate | Fresh | 1 |
| model retraining workflows | ○Unverified | Moderate | Fresh | 1 |
can be detected using
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| statistical hypothesis testing | ○Unverified | Moderate | Fresh | 1 |
monitored by platform
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| Evidently AI | ○Unverified | Moderate | Fresh | 1 |
detected using
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| statistical tests | ○Unverified | Moderate | Fresh | 1 |
addressed by
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| model retraining | ○Unverified | Moderate | Fresh | 1 |
alternative to
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| concept drift | ○Unverified | Moderate | Fresh | 1 |
can be measured with
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| Kolmogorov-Smirnov test | ○Unverified | Moderate | Fresh | 1 |