LogoLogo
HomeBlogGitHub
v0.1.57
v0.1.57
  • What is Evidently?
  • Installation
  • Get Started Tutorial
  • Reports
    • Data Drift
    • Data Quality
    • Numerical Target Drift
    • Categorical Target Drift
    • Regression Performance
    • Classification Performance
    • Probabilistic Classification Performance
  • Tests
  • Examples
  • Integrations
    • Evidently and Grafana
    • Evidently and Airflow
    • Evidently and MLflow
  • Features
    • Dashboards
      • Input data
      • Column mapping
      • Generate dashboards
      • CLI
      • Colab and other environments
    • Profiling
      • Input data
      • Column mapping
      • Generate profiles
      • CLI
    • Monitoring
  • User Guide
    • Customization
      • Select Widgets
      • Custom Widgets and Tabs
      • Options for Data / Target drift
      • Options for Quality Metrics
      • Options for Statistical Tests
      • Options for Color Schema
    • Recipes
  • SUPPORT
    • Contact
    • F.A.Q.
    • Telemetry
    • Changelog
  • GitHub Page
  • Website
Powered by GitBook
On this page
  • 1. Generate model performance reports as a batch job
  • 2. Generate visual performance reports on defined conditions
  1. Integrations

Evidently and Airflow

Run model evaluation or data drift analysis as a part of Airflow DAG.

PreviousEvidently and GrafanaNextEvidently and MLflow

Last updated 2 years ago

Apache Airflow is an open-source workflow management tool.

You can use this integration to generate JSON profiles or HTML reports and as a step in the Airflow DAG.

An integration example is available as a Docker container:

Follow the readme to install and modify the example.

It contains two specific DAGs that match common batch monitoring needs.

1. Generate model performance reports as a batch job

You can generate an Evidently report (e.g. a data drift report) every time the new data arrives. You can then store it in your file system.

Here is a DAG example:

2. Generate visual performance reports on defined conditions

You might not always need to generate visual reports every time.

For example, you can run checks on the model performance and only generate the reports if a certain condition is satisfied. For example, if you detect drift or performance drop. Otherwise, you can simply log the results.

Here is a DAG example:

It works as the following:

  • Run a data drift check by generating an Evidently JSON profile

  • If the drift is not detected, log the JSON output

  • If the drift is detected, generate and store a visual HTML report

evidently/examples/integrations/airflow_drift_detection at main · evidentlyai/evidentlyGitHub
evidently/evidently_drift_dashboard.py at main · evidentlyai/evidentlyGitHub
evidently/evidently_conditional_drift_dashboard_generation.py at main · evidentlyai/evidentlyGitHub
Logo
Logo
Logo