
Apache Airflow

⬢ TIER 2 · Tech
Salary impact: High
Time to learn: 6 months
Difficulty: Medium
Careers: 5
TL;DR

Apache Airflow is the industry-standard workflow orchestration platform for data engineering: define pipelines as directed acyclic graphs (DAGs), schedule execution via cron or event triggers, and monitor thousands of tasks across clusters. Path: Junior Data Engineer (DAG basics, operators, XComs; 4-6 months → $95-120k) → Mid (custom operators, Kubernetes, Snowflake/BigQuery integration; 6-12 months → $120-155k) → Senior (dynamic DAGs, performance tuning, multi-tenant architecture; 18+ months → $155-190k). Runs on-premises, on Kubernetes, or on managed platforms (MWAA/Astronomer/Cloud Composer). Pricing: self-hosted = infra costs; managed ≈ $1-3/DAG/day.

What is Apache Airflow

Apache Airflow is the industry-standard workflow orchestration platform for data engineering: define complex data pipelines as Directed Acyclic Graphs (DAGs), schedule execution via cron or event triggers, and monitor thousands of tasks across clusters. A DAG is a set of tasks with explicit dependencies (Task A → Task B → Task C); Airflow ensures Task B runs only after Task A succeeds.

Career progression: Junior Data Engineer (DAG basics, operators, XComs; 4-6 months → $95-120k) → Mid (custom operators, Kubernetes, warehouse integration; 6-12 months → $120-155k) → Senior (dynamic DAGs, performance tuning, multi-tenant architecture; 18+ months → $155-190k). Pricing: self-hosted = infrastructure costs; managed (MWAA/Astronomer) ≈ $1-3/DAG/day.

In 2026, Airflow is the de facto standard for ETL/ELT orchestration at companies with >$100M revenue; 10k+ companies (Netflix, Uber, Adobe, Twitter) use it. Startups often start with simpler tools (dbt) before graduating to Airflow. Airflow competes with Prefect (modern API, simpler) and Dagster (asset-centric, better lineage), but Airflow's ecosystem and adoption make it the safe choice. Learning Airflow takes 3-6 months; mastery (performance tuning, multi-tenant setup) takes 18+ months.
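To make the DAG concept concrete, here is a minimal sketch of the three-task dependency chain described above (assuming Airflow 2.4+, where `schedule` replaced `schedule_interval`; the dag_id and bash commands are placeholders):

```python
from datetime import datetime

from airflow import DAG
from airflow.operators.bash import BashOperator

# A three-task pipeline: extract -> transform -> load.
with DAG(
    dag_id="daily_sales_etl",           # placeholder name
    start_date=datetime(2024, 1, 1),
    schedule="@daily",                  # cron strings like "0 6 * * *" also work
    catchup=False,
) as dag:
    extract = BashOperator(task_id="extract", bash_command="echo extract")
    transform = BashOperator(task_id="transform", bash_command="echo transform")
    load = BashOperator(task_id="load", bash_command="echo load")

    # Explicit dependencies: transform runs only after extract succeeds.
    extract >> transform >> load
```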

🔧 TOOLS & ECOSYSTEM
Apache Airflow Core · MWAA (AWS Managed Workflows) · Astronomer Platform · Google Cloud Composer · Celery Executor · Kubernetes Executor · PostgreSQL metadata DB · dbt · Snowflake · BigQuery · Apache Spark · Kafka

💰 Salary by region

Region    Junior    Mid       Senior
USA       $105k     $145k     $185k
UK        £62k      £88k      £125k
EU        €68k      €95k      €135k
Canada    C$115k    C$160k    C$210k

❓ FAQ

Airflow vs Prefect vs Dagster — when do I use each?
Airflow: Python-native, 10k+ companies, best ecosystem, steepest DAG learning curve. Prefect: modern API, better error handling, less operational overhead, smaller community. Dagster: asset-centric (not task-centric), best for data assets/lineage, younger product. For enterprise data teams: Airflow. For rapid prototyping: Prefect. For assets-first thinking: Dagster.
Self-hosted vs managed Airflow (MWAA/Astronomer/Cloud Composer) — which should I pick?
Self-hosted: full control, low cost for small teams (Postgres plus the scheduler and webserver on a single EC2 box), but you own patching, scaling, and backups. Managed: $1-3k/month base, frees you from ops, built-in monitoring, auto-scaling. For <10 DAGs or learning: self-hosted on a small EC2 instance. For >50 DAGs or mission-critical workloads: a managed platform.
What's the difference between Celery, Kubernetes, and LocalExecutor?
LocalExecutor: single-machine parallelism, good for dev/testing; throughput is bounded by one host's CPU and memory. Celery: distributed task queue, scales to 100s of workers, requires Redis/RabbitMQ, more complex ops. Kubernetes: runs each task in its own pod, scales elastically, requires a K8s cluster; best for cloud-native shops. Start with LocalExecutor, then graduate to Celery for on-prem or KubernetesExecutor for cloud.
How do I handle dynamic DAGs (DAGs that vary per tenant or config)?
The dag-factory library generates DAGs from YAML configs at parse time; more generally, a DAG factory function iterates over configs and registers one DAG object per config (see the sketch below). Common pattern: loop over 100 customer configs and register 100 DAGs each time the scheduler parses the file. TaskFlow API decorators keep the generated code cleaner than legacy operator boilerplate. Avoid hardcoding: DAGs should read their configs from a database, S3, or a config file.
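A minimal sketch of the factory pattern (assuming Airflow 2.4+; the tenant configs are hardcoded placeholders, where a real deployment would load them from a database, S3, or YAML):

```python
from datetime import datetime

from airflow import DAG
from airflow.operators.bash import BashOperator

# Placeholder tenant configs; a real deployment would read these from
# a database, S3, or a YAML file instead of hardcoding them.
TENANT_CONFIGS = [
    {"tenant": "acme", "schedule": "@daily"},
    {"tenant": "globex", "schedule": "@hourly"},
]

def make_dag(config: dict) -> DAG:
    """Build one DAG per tenant config at parse time."""
    with DAG(
        dag_id=f"etl_{config['tenant']}",
        start_date=datetime(2024, 1, 1),
        schedule=config["schedule"],
        catchup=False,
    ) as dag:
        BashOperator(
            task_id="extract",
            bash_command=f"echo extracting for {config['tenant']}",
        )
    return dag

# Register each generated DAG in the module namespace so the scheduler
# discovers it while parsing this file.
for cfg in TENANT_CONFIGS:
    globals()[f"etl_{cfg['tenant']}"] = make_dag(cfg)
```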
Why is my Airflow scheduler slow / DAGs parsing slowly?
Typical culprits: (1) heavy top-level code or imports in the DAG file (defer them into task callables; see the sketch below), (2) an overloaded Postgres metadata DB (add indexes, tune max_connections), (3) too many DAGs (>5k) on one instance (split into separate Airflow instances). To diagnose, profile DAG parse times with `airflow dags report` and keep heavy logic inside operators, not in the DAG definition.
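A minimal sketch of fixing culprit (1), assuming the Airflow 2.x TaskFlow API; `pandas` stands in for any expensive import:

```python
from datetime import datetime

from airflow.decorators import dag, task

@dag(schedule="@daily", start_date=datetime(2024, 1, 1), catchup=False)
def fast_parsing_example():
    @task
    def transform() -> dict:
        # Deferred import: the scheduler's DAG-file parser never pays
        # this cost, only the worker running the task does.
        import pandas as pd

        return pd.DataFrame({"x": [1, 2, 3]}).sum().to_dict()

    transform()

fast_parsing_example()
```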
How do I monitor Airflow in production?
Built-in: the Airflow web UI (task/DAG status, logs, Gantt chart). On MWAA, logs ship to CloudWatch automatically. Add Datadog/New Relic for APM and PagerDuty for alerting. Set alarms on: DAG failure rate, task duration p99, scheduler lag, and Postgres connection pool usage. Archive all DAG run and task logs to S3.
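One lightweight pattern that complements external monitoring is a per-task failure callback. A minimal sketch, assuming Airflow 2.x; `notify_on_failure` is a hypothetical stand-in for a real PagerDuty/Datadog call:

```python
from datetime import datetime

from airflow import DAG
from airflow.operators.bash import BashOperator

def notify_on_failure(context):
    # Hypothetical alert hook: a real version would call your PagerDuty
    # or Datadog integration with details pulled from `context`.
    ti = context["task_instance"]
    print(f"ALERT: {ti.dag_id}.{ti.task_id} failed (run {context.get('run_id')})")

with DAG(
    dag_id="monitored_pipeline",
    start_date=datetime(2024, 1, 1),
    schedule="@daily",
    catchup=False,
    default_args={"on_failure_callback": notify_on_failure},
) as dag:
    BashOperator(task_id="load", bash_command="exit 0")
```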
What's the difference between XComs, variables, and connections?
Variables: static config (e.g., a bucket name or environment flag), set in the UI or code and shared across DAGs. Connections: credentials (username + password + host), used by hooks/operators. XComs: task-to-task messaging (e.g., task A computes a path, task B reads it); small, ephemeral values stored in the metadata DB. For secrets (API keys, passwords): use a Secrets Backend (AWS Secrets Manager, Vault) instead of plain Variables.
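A minimal sketch of the XCom handoff using the TaskFlow API (assuming Airflow 2.x; the S3 path is a placeholder), where returning a value pushes it to XCom and passing it as an argument pulls it:

```python
from datetime import datetime

from airflow.decorators import dag, task

@dag(schedule=None, start_date=datetime(2024, 1, 1), catchup=False)
def xcom_handoff_example():
    @task
    def compute_path() -> str:
        # The return value is pushed to XCom automatically.
        return "s3://my-bucket/exports/latest.parquet"  # placeholder path

    @task
    def load(path: str):
        # The argument is pulled from XCom behind the scenes.
        print(f"loading {path}")

    load(compute_path())

xcom_handoff_example()
```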
