I am reading up about DataFlow Runner but can’t seem to grok does it completely replace EmrEtlRunner or is it complimentary. If it replaces it what is the workflow to then use storage loader to get the data into Postgres or Redshift? Do you then need the DataFlow iglu schema plus the old config.yml?
Hi @BenFradet, for enriching and loading events to Redshift, is Dataflow Runner now the recommended approach? Or is using EmrEtlRunner + Storage Loader still the way to do it?
EmrEtlRunner still is the way to go. But we deprecated StorageLoader in latest R90 release. @BenFradet’s upcoming R91 release will include new generate command which should alleviate transition, but I believe EmrEtlRunner will remain default approach even after that for some more time.