this is really cool! That said, my impression is that Airflow is a really dated ...

Hasz · 2025-03-31T23:06:02 1743462362

Dated doesn’t mean bad (usually the opposite in my experience!) What issues do you have with Airflow?

gre · 2025-04-01T00:59:14 1743469154

Here's my problems with MWAA (amazon hosted airflow.) I have about 100 dags which maxes out the scheduler thread. Airflow parses all the files every minute so it's always parsing around 94% cpu. I could run a second scheduler thread if I coordinate with my SRE team and get the terraform deployed...it's really tedious.

Related possibly, my dags get kill -9 for no apparent reason. The RAM usage is not that high, maybe 2gb out of 8gb system RAM in use. No reason is given in the logs.

I am trying to switch to dagster, not because it's awesome, but because it hasn't crashed randomly on me.

alittletooraph2 · 2025-04-01T13:54:50 1743515690

This feels like an MWAA issue but I understand how that often gets conflated with it being an Airflow issue.

gre · 2025-04-01T15:00:55 1743519655

You're right, it doesn't happen when developing locally, only in MWAA. This was the answer given by the Airflow team as well and I figured they would punt before I asked.

I realize Amazon is taking an open source project and making a ton of money on it (the instance prices are ridiculous for what you get) and the incentives are misaligned for the Airflow team to help AWS make it better unless AWS paid them to help fix it.

It's crap all around, and Airflow gets a bad rap from AWS's terrible MWAA product based on it.

mblast311 · 2025-04-01T03:04:09 1743476649

MWAA is hot garbage. I had similar issues and switched to running it on EKS instead.

mdaniel · 2025-04-01T03:20:54 1743477654

> What issues do you have with Airflow?

Their operational perspective is catastrophic; how does one view the logs for a dag through the UI[1]? Why can't it store the python in the database they have attached to their deployment, versus making me jump through 80,000 hoops to put the files in the right magic directory on disk of every worker[2]?

1: no, not <https://airflow.apache.org/docs/apache-airflow/stable/ui.htm...> I mean the log, you know, like in the old days of $(tail -f /var/log/the.thing). I'm open to the answer hiding somewhere in this gobbledygook <https://airflow.apache.org/docs/apache-airflow/stable/admini...> but who is the target audience for having such a fancy UI and omitting log viewing from it, doubly so if there's some alleged http just for viewing logs

2: https://airflow.apache.org/docs/apache-airflow/stable/core-c... and double-plus-good anytime python software mentions PYTHONPATH -- that's how you know you're in for a hot good time https://airflow.apache.org/docs/apache-airflow/stable/admini...

6LLvveMx2koXfwn · 2025-04-01T03:50:21 1743479421

We deploy on K8s in OpenStack from a scheduled GitHub Actions pipeline which aggregates DAGs into a new container build based on hashes of hashes. This works well with almost no intervention.

WRT your 1, above any DAG output to stdout/err is available via the logs tab from the graph view of the individual tasks. Almost all our DAGs leverage on the PythonOperator though, not sure if that standardises this for us and your experience is muddied by more complexity than we currently have?

WRT 2. we generate an uber requirements.txt running pyreqs from the pipeline and install everything in the container automatically. Again no issues currently - although we do need to manually add the installation of test libraries to the pipeline job as for some reason auto-discovery is flakier for unit-tests frameworks.

jedberg · 2025-04-01T02:59:05 1743476345

I'd be curious if this scratches your itch:

https://www.dbos.dev/blog/durable-execution-crashproof-ai-ag...

hbarka · 2025-04-01T08:27:55 1743496075

Pleasantly surprised to see the name Mike Stonebraker in the About Us.

jedberg · 2025-04-01T16:04:10 1743523450

About to jump into an eng meeting with him right now!

bashfulpup · 2025-04-01T00:14:45 1743466485

This space is honestly a mess. I did an in depth survey around 1.5 yrs ago and my eventual conclusion was just to build with airflow.

You either get simplicity with the caveate that your systems need to perfectly align.

Or you get complexity but will work with basically anything (airflow).

febed · 2025-04-01T00:57:55 1743469075

Would be interested to know what drawbacks you found with Dagster or Prefect.

jtbaker · 2025-04-01T01:47:52 1743472072

Prefect is amazing. Built out an ETL pipeline system with it at last job and would love to get it incorporated in the current one, but unfortunately have a lot of legacy stuff in Airflow. Being able to debug stuff locally was amazing and super clean integration with K8S.

bashfulpup · 2025-04-02T21:10:51 1743628251

Other guy said it right. These work and are fine but you lose the legacy stuff. If you know your limits and where the eventual system will end up it's great and probably better.

If you are building a expandable long term system and you want all the goodies baked in choose airflow.

Pretty much the same as any architecture choice. Ugly/hard often means control and features, pretty/easy means less of both.

On the surface the differences are not very noticable other than the learning curve of getting started.

nikolayasdf123 · 2025-04-01T02:28:18 1743474498

+1 to this. other solutions over-promise, under-deliver, poor developer relations and communication, "open-source, but pay us" style open-source, and is indeed a mess