<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:atom="http://www.w3.org/2005/Atom" xmlns:podcast="https://podcastindex.org/namespace/1.0" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" version="2.0">
<channel>
  <atom:link href="https://feeds.cohostpodcasting.com/yiLVF7xu" rel="self" title="MP3 Audio" type="application/atom+xml"/>
  <atom:link href="https://pubsubhubbub.appspot.com/" rel="hub" xmlns="http://www.w3.org/2005/Atom" />
  <generator>https://cohostpodcasting.com</generator>
  <title><![CDATA[The Data Flowcast: Mastering Apache Airflow ® for Data Engineering and AI]]></title>
  <description><![CDATA[Welcome to The Data Flowcast: Mastering Apache Airflow ® for Data Engineering and AI— the podcast where we keep you up to date with insights and ideas propelling the Airflow community forward.

Join us each week, as we explore the current state, future and potential of Airflow with leading thinkers in the community, and discover how best to leverage this workflow management system to meet the ever-evolving needs of data engineering and AI ecosystems. 

Podcast Webpage: https://www.astronomer.io/podcast/]]></description>
  <itunes:summary><![CDATA[Welcome to The Data Flowcast: Mastering Apache Airflow ® for Data Engineering and AI— the podcast where we keep you up to date with insights and ideas propelling the Airflow community forward.

Join us each week, as we explore the current state, future and potential of Airflow with leading thinkers in the community, and discover how best to leverage this workflow management system to meet the ever-evolving needs of data engineering and AI ecosystems. 

Podcast Webpage: https://www.astronomer.io/podcast/]]></itunes:summary>
  <language>en</language>
  <copyright><![CDATA[All rights reserved]]></copyright>
<podcast:guid>50fd8116-be73-4852-be37-94d8bc0082fa</podcast:guid>
  <pubDate>Wed, 25 Sep 2024 15:23:59 -0400</pubDate>
  <lastBuildDate>Thu, 14 May 2026 09:45:57 -0400</lastBuildDate>
  <image>
    <link>https://airflow.apache.org/</link>
    <title><![CDATA[The Data Flowcast: Mastering Apache Airflow ® for Data Engineering and AI]]></title>
    <url>https://files.cohostpodcasting.com/quill-file-prod/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/cover-art/original_435eaa7ba08d2ad36e55e0342c4f15bb.jpg</url>
  </image>
  <link>https://airflow.apache.org/</link>
  <itunes:type>episodic</itunes:type>
  <itunes:author><![CDATA[Astronomer]]></itunes:author>
  <itunes:explicit>false</itunes:explicit>
  <itunes:image href="https://files.cohostpodcasting.com/quill-file-prod/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/cover-art/original_435eaa7ba08d2ad36e55e0342c4f15bb.jpg"/>
  <itunes:new-feed-url>https://feeds.cohostpodcasting.com/yiLVF7xu</itunes:new-feed-url>
  
  <itunes:owner>
    <itunes:name><![CDATA[The Data Flowcast]]></itunes:name>
    <itunes:email>support@contentallies.com</itunes:email>
  </itunes:owner>
  <itunes:category text="Technology"/>
<item>
  <guid isPermaLink="false"><![CDATA[0e1bf992-36b8-4402-89f6-7c017753cf4d]]></guid>
  <title><![CDATA[Getting Into Data Engineering with Shrividya Hegde, Data and AI Engineer]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we take a step back from implementation-specific topics to explore what it actually takes to build a career in data engineering — and how AI is reshaping that path.</span></p><p><a href="https://www.linkedin.com/in/shrividya-hegde-shri-91562365/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Shrividya Hegde</a><span style="background-color: transparent; color: rgb(22, 14, 61);">,  a data and AI engineer and an Airflow champion in </span><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer’s </a><span style="background-color: transparent; color: rgb(22, 14, 61);">Champions program, joins us to discuss getting into data engineering, contributing to open source and why good data engineering should make AI output trustworthy rather than confidently wrong.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">00:00</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Introduction.</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">04:08</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Build fundamentals before chasing trending tools — understanding what a tool does, why it exists and what problem it solves has to come first.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">07:19</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Data engineering fundamentals mean SQL query performance under joins and aggregations, how data moves between pipelines, DAG failure recovery and idempotency — not just writing queries.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">08:10</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> The most common mistake newer data engineers make is skipping fundamentals to chase trends — it is a sequencing problem, not a talent problem.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">13:15</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> AI creates more opportunity for data engineers because AI output quality is directly determined by the quality of the data pipeline feeding it — confidently wrong output is harder to catch than obviously wrong output.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">15:06</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Airflow's supporting operators make AI outputs production-ready — orchestration is what converts experimental AI into something reliable.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">17:14</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> AI-generated DAGs help newer engineers understand underlying concepts rather than just producing working code.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">23:12</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> The Airflow open source community is more welcoming than most people expect for a project of its size — raising issues and reviewing PRs are viable entry points for first contributions.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/shrividya-hegde-shri-91562365/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Shrividya Hegde</a></p><p>https://www.linkedin.com/in/shrividya-hegde-shri-91562365/</p><p><br></p><p><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> | LinkedIn</p><p>https://www.linkedin.com/company/astronomer/</p><p><br></p><p><a href="https://www.astronomer.io" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> | Website</p><p>https://www.astronomer.io</p><p><br></p><p><a href="https://womenindata.mn.co/landing" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Women in Data</a> | Website</p><p>https://womenindata.mn.co/landing</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow Slack</a><span style="background-color: transparent; color: rgb(17, 85, 204);">&nbsp;</span></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://medium.com/@shrihegde" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Shrividya's Medium writing</a></p><p>https://medium.com/@shrihegde</p><p><br></p><p><a href="https://substack.com/@shrividyahegde" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Shrividya’ Substack writing</a></p><p>https://substack.com/@shrividyahegde</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/1a4eb490-df5e-4ed8-b4eb-74df86c3b4ec/289e64a85e.jpg" />
  <pubDate>Thu, 14 May 2026 05:45:33 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="26475115" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/1a4eb490-df5e-4ed8-b4eb-74df86c3b4ec/episode.mp3" />
  <itunes:title><![CDATA[Getting Into Data Engineering with Shrividya Hegde, Data and AI Engineer]]></itunes:title>
  <itunes:duration>27:34</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we take a step back from implementation-specific topics to explore what it actually takes to build a career in data engineering — and how AI is reshaping that path.</span></p><p><a href="https://www.linkedin.com/in/shrividya-hegde-shri-91562365/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Shrividya Hegde</a><span style="background-color: transparent; color: rgb(22, 14, 61);">,  a data and AI engineer and an Airflow champion in </span><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer’s </a><span style="background-color: transparent; color: rgb(22, 14, 61);">Champions program, joins us to discuss getting into data engineering, contributing to open source and why good data engineering should make AI output trustworthy rather than confidently wrong.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">00:00</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Introduction.</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">04:08</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Build fundamentals before chasing trending tools — understanding what a tool does, why it exists and what problem it solves has to come first.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">07:19</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Data engineering fundamentals mean SQL query performance under joins and aggregations, how data moves between pipelines, DAG failure recovery and idempotency — not just writing queries.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">08:10</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> The most common mistake newer data engineers make is skipping fundamentals to chase trends — it is a sequencing problem, not a talent problem.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">13:15</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> AI creates more opportunity for data engineers because AI output quality is directly determined by the quality of the data pipeline feeding it — confidently wrong output is harder to catch than obviously wrong output.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">15:06</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Airflow's supporting operators make AI outputs production-ready — orchestration is what converts experimental AI into something reliable.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">17:14</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> AI-generated DAGs help newer engineers understand underlying concepts rather than just producing working code.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">23:12</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> The Airflow open source community is more welcoming than most people expect for a project of its size — raising issues and reviewing PRs are viable entry points for first contributions.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/shrividya-hegde-shri-91562365/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Shrividya Hegde</a></p><p>https://www.linkedin.com/in/shrividya-hegde-shri-91562365/</p><p><br></p><p><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> | LinkedIn</p><p>https://www.linkedin.com/company/astronomer/</p><p><br></p><p><a href="https://www.astronomer.io" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> | Website</p><p>https://www.astronomer.io</p><p><br></p><p><a href="https://womenindata.mn.co/landing" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Women in Data</a> | Website</p><p>https://womenindata.mn.co/landing</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow Slack</a><span style="background-color: transparent; color: rgb(17, 85, 204);">&nbsp;</span></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://medium.com/@shrihegde" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Shrividya's Medium writing</a></p><p>https://medium.com/@shrihegde</p><p><br></p><p><a href="https://substack.com/@shrividyahegde" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Shrividya’ Substack writing</a></p><p>https://substack.com/@shrividyahegde</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we take a step back from implementation-specific topics to explore what it actually takes to build a career in data engineering — and how AI is reshaping that path.</span></p><p><a href="https://www.linkedin.com/in/shrividya-hegde-shri-91562365/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Shrividya Hegde</a><span style="background-color: transparent; color: rgb(22, 14, 61);">,  a data and AI engineer and an Airflow champion in </span><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer’s </a><span style="background-color: transparent; color: rgb(22, 14, 61);">Champions program, joins us to discuss getting into data engineering, contributing to open source and why good data engineering should make AI output trustworthy rather than confidently wrong.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">00:00</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Introduction.</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">04:08</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Build fundamentals before chasing trending tools — understanding what a tool does, why it exists and what problem it solves has to come first.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">07:19</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Data engineering fundamentals mean SQL query performance under joins and aggregations, how data moves between pipelines, DAG failure recovery and idempotency — not just writing queries.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">08:10</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> The most common mistake newer data engineers make is skipping fundamentals to chase trends — it is a sequencing problem, not a talent problem.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">13:15</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> AI creates more opportunity for data engineers because AI output quality is directly determined by the quality of the data pipeline feeding it — confidently wrong output is harder to catch than obviously wrong output.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">15:06</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Airflow's supporting operators make AI outputs production-ready — orchestration is what converts experimental AI into something reliable.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">17:14</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> AI-generated DAGs help newer engineers understand underlying concepts rather than just producing working code.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">23:12</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> The Airflow open source community is more welcoming than most people expect for a project of its size — raising issues and reviewing PRs are viable entry points for first contributions.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/shrividya-hegde-shri-91562365/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Shrividya Hegde</a></p><p>https://www.linkedin.com/in/shrividya-hegde-shri-91562365/</p><p><br></p><p><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> | LinkedIn</p><p>https://www.linkedin.com/company/astronomer/</p><p><br></p><p><a href="https://www.astronomer.io" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> | Website</p><p>https://www.astronomer.io</p><p><br></p><p><a href="https://womenindata.mn.co/landing" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Women in Data</a> | Website</p><p>https://womenindata.mn.co/landing</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow Slack</a><span style="background-color: transparent; color: rgb(17, 85, 204);">&nbsp;</span></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://medium.com/@shrihegde" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Shrividya's Medium writing</a></p><p>https://medium.com/@shrihegde</p><p><br></p><p><a href="https://substack.com/@shrividyahegde" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Shrividya’ Substack writing</a></p><p>https://substack.com/@shrividyahegde</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[In this episode, we take a step back from implementation-specific topics to explore what it actually takes to build a career in data engineering — and how AI is reshaping that path.Shrividya Hegde,  a data and AI engineer and an Airflow champion in...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>79</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[20728bb8-75b0-4ded-ac50-f104e6131651]]></guid>
  <title><![CDATA[Orchestrating DBT With Cosmos and Airflow with Filip Kunčar at ShipMonk Product Development]]></title>
  <description><![CDATA[<p><span style="background-color: transparent;">We explore how a third-party logistics platform built its entire data orchestration layer on Airflow, and what that makes possible for developer teams and merchant-facing products alike.</span></p><p><br></p><p><a href="https://www.linkedin.com/in/filipkuncar/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Filip Kunčar</a><span style="background-color: transparent;">, Platform Director at</span><a href="https://www.linkedin.com/company/shipmonk-product-development/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);"> ShipMonk Product Development</a><span style="background-color: transparent;">, discusses migrating from a closed source tool to Airflow, orchestrating dbt with both Cosmos and the BashOperator and using Airflow to power customer-facing data delivery.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">00:00</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Introduction.</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">01:07</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> ShipMonk is a third-party logistics company guaranteeing two-day delivery across the US. The data platform team's mission is to lower cognitive load for developers working with data.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">05:13</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> ShipMonk migrated to Airflow in 2022, moving away from a closed-source UI-based tool, driven by the need for a code-first approach, open source extensibility and broad cloud provider support.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">10:02</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> The team uses Cosmos for developer-facing visibility and lineage and BashOperator for internal pipelines where runtime performance matters.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">12:20</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Switching from Cosmos to the BashOperator for a frequently running pipeline reduced runtime from over 15 minutes to three minutes.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">13:14</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Because the full dbt chain runs inside Airflow, a configurable downstream DAG can deliver processed data directly to each merchant's preferred destination, with secrets management and SLA tracking already handled.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">15:03</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Per-team alerting is hooked to each DAG by owner and severity, so teams can react to SLA breaches immediately.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">18:09</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> ShipMonk uses Airflow in three ways for AI: authoring DAGs faster with skills, orchestrating AI workloads in Lambda and containers and using Astronomer's skills repo to simplify Airflow version upgrades.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/filipkuncar/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Filip Kunčar</a></p><p>https://www.linkedin.com/in/filipkuncar/</p><p><br></p><p><a href="https://www.linkedin.com/company/shipmonk-product-development/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">ShipMonk Product Development</a></p><p>https://www.linkedin.com/company/shipmonk-product-development/</p><p><br></p><p><a href="http://www.shipmonk.com" target="_blank">ShipMonk</a> | Website</p><p>http://www.shipmonk.com</p><p><br></p><p><a href="http://www.astronomer.io/cosmos" target="_blank">Astronomer Cosmos</a></p><p>http://www.astronomer.io/cosmos</p><p><br></p><p><a href="http://www.github.com/astronomer/airflow-llm-providers-demo" target="_blank">Astronomer AI Skills Repo</a></p><p>http://www.github.com/astronomer/airflow-llm-providers-demo</p><p><br></p><p><a href="http://www.datadoghq.com" target="_blank">Datadog</a></p><p>http://www.datadoghq.com</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/d5770b9c-9d92-4c3e-8b3f-c5cef3f6775e/2a8fd1810f.jpg" />
  <pubDate>Thu, 07 May 2026 07:03:41 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="23954578" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/d5770b9c-9d92-4c3e-8b3f-c5cef3f6775e/episode.mp3" />
  <itunes:title><![CDATA[Orchestrating DBT With Cosmos and Airflow with Filip Kunčar at ShipMonk Product Development]]></itunes:title>
  <itunes:duration>24:57</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent;">We explore how a third-party logistics platform built its entire data orchestration layer on Airflow, and what that makes possible for developer teams and merchant-facing products alike.</span></p><p><br></p><p><a href="https://www.linkedin.com/in/filipkuncar/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Filip Kunčar</a><span style="background-color: transparent;">, Platform Director at</span><a href="https://www.linkedin.com/company/shipmonk-product-development/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);"> ShipMonk Product Development</a><span style="background-color: transparent;">, discusses migrating from a closed source tool to Airflow, orchestrating dbt with both Cosmos and the BashOperator and using Airflow to power customer-facing data delivery.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">00:00</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Introduction.</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">01:07</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> ShipMonk is a third-party logistics company guaranteeing two-day delivery across the US. The data platform team's mission is to lower cognitive load for developers working with data.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">05:13</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> ShipMonk migrated to Airflow in 2022, moving away from a closed-source UI-based tool, driven by the need for a code-first approach, open source extensibility and broad cloud provider support.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">10:02</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> The team uses Cosmos for developer-facing visibility and lineage and BashOperator for internal pipelines where runtime performance matters.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">12:20</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Switching from Cosmos to the BashOperator for a frequently running pipeline reduced runtime from over 15 minutes to three minutes.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">13:14</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Because the full dbt chain runs inside Airflow, a configurable downstream DAG can deliver processed data directly to each merchant's preferred destination, with secrets management and SLA tracking already handled.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">15:03</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Per-team alerting is hooked to each DAG by owner and severity, so teams can react to SLA breaches immediately.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">18:09</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> ShipMonk uses Airflow in three ways for AI: authoring DAGs faster with skills, orchestrating AI workloads in Lambda and containers and using Astronomer's skills repo to simplify Airflow version upgrades.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/filipkuncar/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Filip Kunčar</a></p><p>https://www.linkedin.com/in/filipkuncar/</p><p><br></p><p><a href="https://www.linkedin.com/company/shipmonk-product-development/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">ShipMonk Product Development</a></p><p>https://www.linkedin.com/company/shipmonk-product-development/</p><p><br></p><p><a href="http://www.shipmonk.com" target="_blank">ShipMonk</a> | Website</p><p>http://www.shipmonk.com</p><p><br></p><p><a href="http://www.astronomer.io/cosmos" target="_blank">Astronomer Cosmos</a></p><p>http://www.astronomer.io/cosmos</p><p><br></p><p><a href="http://www.github.com/astronomer/airflow-llm-providers-demo" target="_blank">Astronomer AI Skills Repo</a></p><p>http://www.github.com/astronomer/airflow-llm-providers-demo</p><p><br></p><p><a href="http://www.datadoghq.com" target="_blank">Datadog</a></p><p>http://www.datadoghq.com</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent;">We explore how a third-party logistics platform built its entire data orchestration layer on Airflow, and what that makes possible for developer teams and merchant-facing products alike.</span></p><p><br></p><p><a href="https://www.linkedin.com/in/filipkuncar/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Filip Kunčar</a><span style="background-color: transparent;">, Platform Director at</span><a href="https://www.linkedin.com/company/shipmonk-product-development/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);"> ShipMonk Product Development</a><span style="background-color: transparent;">, discusses migrating from a closed source tool to Airflow, orchestrating dbt with both Cosmos and the BashOperator and using Airflow to power customer-facing data delivery.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">00:00</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Introduction.</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">01:07</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> ShipMonk is a third-party logistics company guaranteeing two-day delivery across the US. The data platform team's mission is to lower cognitive load for developers working with data.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">05:13</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> ShipMonk migrated to Airflow in 2022, moving away from a closed-source UI-based tool, driven by the need for a code-first approach, open source extensibility and broad cloud provider support.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">10:02</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> The team uses Cosmos for developer-facing visibility and lineage and BashOperator for internal pipelines where runtime performance matters.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">12:20</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Switching from Cosmos to the BashOperator for a frequently running pipeline reduced runtime from over 15 minutes to three minutes.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">13:14</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Because the full dbt chain runs inside Airflow, a configurable downstream DAG can deliver processed data directly to each merchant's preferred destination, with secrets management and SLA tracking already handled.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">15:03</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Per-team alerting is hooked to each DAG by owner and severity, so teams can react to SLA breaches immediately.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">18:09</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> ShipMonk uses Airflow in three ways for AI: authoring DAGs faster with skills, orchestrating AI workloads in Lambda and containers and using Astronomer's skills repo to simplify Airflow version upgrades.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/filipkuncar/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Filip Kunčar</a></p><p>https://www.linkedin.com/in/filipkuncar/</p><p><br></p><p><a href="https://www.linkedin.com/company/shipmonk-product-development/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">ShipMonk Product Development</a></p><p>https://www.linkedin.com/company/shipmonk-product-development/</p><p><br></p><p><a href="http://www.shipmonk.com" target="_blank">ShipMonk</a> | Website</p><p>http://www.shipmonk.com</p><p><br></p><p><a href="http://www.astronomer.io/cosmos" target="_blank">Astronomer Cosmos</a></p><p>http://www.astronomer.io/cosmos</p><p><br></p><p><a href="http://www.github.com/astronomer/airflow-llm-providers-demo" target="_blank">Astronomer AI Skills Repo</a></p><p>http://www.github.com/astronomer/airflow-llm-providers-demo</p><p><br></p><p><a href="http://www.datadoghq.com" target="_blank">Datadog</a></p><p>http://www.datadoghq.com</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[We explore how a third-party logistics platform built its entire data orchestration layer on Airflow, and what that makes possible for developer teams and merchant-facing products alike.Filip Kunčar, Platform Director at ShipMonk Product Developmen...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>80</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[1bef6309-fff6-437c-9093-e982250cc514]]></guid>
  <title><![CDATA[Building Airflow CTL with Buğra Öztürk at Mollie]]></title>
  <description><![CDATA[<p><a href="https://www.linkedin.com/in/bugraozturk93/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Buğra Öztürk</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Data Engineer at</span><a href="https://www.linkedin.com/company/mollie/" target="_blank" style="background-color: transparent; color: rgb(22, 14, 61);"> </a><a href="https://www.linkedin.com/company/mollie/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Mollie</a><span style="background-color: transparent; color: rgb(22, 14, 61);"> and Committer and PMC member on the Apache Airflow project, joins us to walk through Airflow CTL — what it is, how it differs from the existing Airflow CLI and where it is headed under AIP-94.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">00:00</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Introduction.</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">03:10</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Buğra has contributed to Airflow since 2022, from docs changes up to Committer and PMC member — a path he hopes inspires others to start small and contribute.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">04:05</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Airflow CTL solves secure user interaction by abstracting database credentials behind the public core API.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">05:13</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Airflow CLI and Airflow CTL are complementary — CLI handles administration and database management while CTL handles secure user interactions via the API.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">07:08</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Airflow CTL authenticates via the API, acquires a JWT token and stores it securely in the OS keyring — running on the user's machine and never requiring direct database access.</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">08:21</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Concrete use cases include local DAG development without the UI and CI/CD automation using headless mode with short-lived JWT tokens.</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">10:08</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> AIP-94 describes the long-term vision — decoupling all remote commands from the Airflow CLI and routing them through Airflow CTL.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">13:12</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Airflow CTL is currently at 0.X and already being used in CI and deployment automations. The move to 1.0 with full CLI parity is the next milestone under AIP-94. &nbsp; </span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);"><span class="ql-cursor">﻿﻿﻿﻿﻿</span>16:09</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Multi-team deployment becoming generally available in a future Airflow release is Buğra's most-anticipated upcoming feature beyond Airflow CTL.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/bugraozturk93/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Buğra Öztürk</a></p><p>https://www.linkedin.com/in/bugraozturk93/</p><p><br></p><p><a href="https://www.linkedin.com/company/mollie/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Mollie</a></p><p>https://www.linkedin.com/company/mollie/</p><p><br></p><p><a href="https://www.mollie.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Mollie</a> | Website</p><p>https://www.mollie.com/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow CTL</a><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;</span></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://lists.apache.org/thread/d2o1pr78wxdp1wozq519stp0pkcv6k6c" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">AIP-94 on Airflow Confluence</a></p><p>https://lists.apache.org/thread/d2o1pr78wxdp1wozq519stp0pkcv6k6c</p><p><br></p><p><a href="https://www.github.com/apache/airflow" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow GitHub</a></p><p>https://www.github.com/apache/airflow</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿﻿﻿﻿﻿﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/14526753-229f-4ac9-874b-05576fa6c8d6/8e8ac15e31.jpg" />
  <pubDate>Thu, 30 Apr 2026 00:00:00 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="18916495" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/14526753-229f-4ac9-874b-05576fa6c8d6/episode.mp3" />
  <itunes:title><![CDATA[Building Airflow CTL with Buğra Öztürk at Mollie]]></itunes:title>
  <itunes:duration>19:42</itunes:duration>
  <itunes:summary><![CDATA[<p><a href="https://www.linkedin.com/in/bugraozturk93/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Buğra Öztürk</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Data Engineer at</span><a href="https://www.linkedin.com/company/mollie/" target="_blank" style="background-color: transparent; color: rgb(22, 14, 61);"> </a><a href="https://www.linkedin.com/company/mollie/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Mollie</a><span style="background-color: transparent; color: rgb(22, 14, 61);"> and Committer and PMC member on the Apache Airflow project, joins us to walk through Airflow CTL — what it is, how it differs from the existing Airflow CLI and where it is headed under AIP-94.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">00:00</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Introduction.</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">03:10</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Buğra has contributed to Airflow since 2022, from docs changes up to Committer and PMC member — a path he hopes inspires others to start small and contribute.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">04:05</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Airflow CTL solves secure user interaction by abstracting database credentials behind the public core API.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">05:13</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Airflow CLI and Airflow CTL are complementary — CLI handles administration and database management while CTL handles secure user interactions via the API.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">07:08</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Airflow CTL authenticates via the API, acquires a JWT token and stores it securely in the OS keyring — running on the user's machine and never requiring direct database access.</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">08:21</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Concrete use cases include local DAG development without the UI and CI/CD automation using headless mode with short-lived JWT tokens.</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">10:08</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> AIP-94 describes the long-term vision — decoupling all remote commands from the Airflow CLI and routing them through Airflow CTL.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">13:12</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Airflow CTL is currently at 0.X and already being used in CI and deployment automations. The move to 1.0 with full CLI parity is the next milestone under AIP-94. &nbsp; </span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);"><span class="ql-cursor">﻿﻿﻿﻿﻿</span>16:09</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Multi-team deployment becoming generally available in a future Airflow release is Buğra's most-anticipated upcoming feature beyond Airflow CTL.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/bugraozturk93/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Buğra Öztürk</a></p><p>https://www.linkedin.com/in/bugraozturk93/</p><p><br></p><p><a href="https://www.linkedin.com/company/mollie/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Mollie</a></p><p>https://www.linkedin.com/company/mollie/</p><p><br></p><p><a href="https://www.mollie.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Mollie</a> | Website</p><p>https://www.mollie.com/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow CTL</a><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;</span></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://lists.apache.org/thread/d2o1pr78wxdp1wozq519stp0pkcv6k6c" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">AIP-94 on Airflow Confluence</a></p><p>https://lists.apache.org/thread/d2o1pr78wxdp1wozq519stp0pkcv6k6c</p><p><br></p><p><a href="https://www.github.com/apache/airflow" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow GitHub</a></p><p>https://www.github.com/apache/airflow</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿﻿﻿﻿﻿﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><a href="https://www.linkedin.com/in/bugraozturk93/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Buğra Öztürk</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Data Engineer at</span><a href="https://www.linkedin.com/company/mollie/" target="_blank" style="background-color: transparent; color: rgb(22, 14, 61);"> </a><a href="https://www.linkedin.com/company/mollie/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Mollie</a><span style="background-color: transparent; color: rgb(22, 14, 61);"> and Committer and PMC member on the Apache Airflow project, joins us to walk through Airflow CTL — what it is, how it differs from the existing Airflow CLI and where it is headed under AIP-94.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">00:00</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Introduction.</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">03:10</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Buğra has contributed to Airflow since 2022, from docs changes up to Committer and PMC member — a path he hopes inspires others to start small and contribute.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">04:05</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Airflow CTL solves secure user interaction by abstracting database credentials behind the public core API.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">05:13</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Airflow CLI and Airflow CTL are complementary — CLI handles administration and database management while CTL handles secure user interactions via the API.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">07:08</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Airflow CTL authenticates via the API, acquires a JWT token and stores it securely in the OS keyring — running on the user's machine and never requiring direct database access.</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">08:21</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Concrete use cases include local DAG development without the UI and CI/CD automation using headless mode with short-lived JWT tokens.</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">10:08</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> AIP-94 describes the long-term vision — decoupling all remote commands from the Airflow CLI and routing them through Airflow CTL.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">13:12</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Airflow CTL is currently at 0.X and already being used in CI and deployment automations. The move to 1.0 with full CLI parity is the next milestone under AIP-94. &nbsp; </span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);"><span class="ql-cursor">﻿﻿﻿﻿﻿</span>16:09</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Multi-team deployment becoming generally available in a future Airflow release is Buğra's most-anticipated upcoming feature beyond Airflow CTL.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/bugraozturk93/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Buğra Öztürk</a></p><p>https://www.linkedin.com/in/bugraozturk93/</p><p><br></p><p><a href="https://www.linkedin.com/company/mollie/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Mollie</a></p><p>https://www.linkedin.com/company/mollie/</p><p><br></p><p><a href="https://www.mollie.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Mollie</a> | Website</p><p>https://www.mollie.com/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow CTL</a><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;</span></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://lists.apache.org/thread/d2o1pr78wxdp1wozq519stp0pkcv6k6c" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">AIP-94 on Airflow Confluence</a></p><p>https://lists.apache.org/thread/d2o1pr78wxdp1wozq519stp0pkcv6k6c</p><p><br></p><p><a href="https://www.github.com/apache/airflow" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow GitHub</a></p><p>https://www.github.com/apache/airflow</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿﻿﻿﻿﻿﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Buğra Öztürk, Senior Data Engineer at Mollie and Committer and PMC member on the Apache Airflow project, joins us to walk through Airflow CTL — what it is, how it differs from the existing Airflow CLI and where it is headed under AIP-94.Key Takeawa...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>79</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[bc8f7e4d-1edb-45eb-99ba-b7938c6e05d5]]></guid>
  <title><![CDATA[Introducing Airflow’s Common AI Provider with Pavan Kumar Gopidesu and Kaxil Naik]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we explore the newly released Apache Airflow common AI provider — what problem it solves, how it was built and what's coming next.</span></p><p><br></p><p><a href="https://www.linkedin.com/in/kaxil/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kaxil Naik</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Director of Engineering at</span><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(22, 14, 61);"> </a><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a><span style="background-color: transparent; color: rgb(22, 14, 61);"> and </span><a href="https://www.linkedin.com/company/apache-airflow" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a><span style="background-color: transparent; color: rgb(22, 14, 61);"> PMC member, and</span><a href="https://www.linkedin.com/in/pavan-kumar-gopidesu/" target="_blank" style="background-color: transparent; color: rgb(22, 14, 61);"> </a><a href="https://www.linkedin.com/in/pavan-kumar-gopidesu/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Pavan Kumar Gopidesu</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Lead Data Engineer at </span><a href="https://www.linkedin.com/company/experian/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Experian</a><span style="background-color: transparent; color: rgb(22, 14, 61);"> and </span><a href="https://www.linkedin.com/company/apache-airflow" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a><span style="background-color: transparent; color: rgb(22, 14, 61);"> PMC member, join us to walk through the provider's first release and the technical decisions behind it.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">00:00</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Introduction.</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">04:05</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> The common AI provider was born from a real production problem.</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">07:10</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Airflow already had the primitives needed for durable agent execution, making it the natural foundation for AI orchestration.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">09:15</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> The LLM schema compare operator uses Apache DataFusion to fetch source schemas.</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">11:07</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Apache DataFusion was chosen for its speed.</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">13:09</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Hook tool sets expose Airflow's provider hooks to agents with an allowed methods list that blocks destructive operations.</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">15:20</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Passing durable=True to an LLM operator caches tool calls and LLM outputs mid-task.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">18:13</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> The provider offers three abstraction levels.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">21:20</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> The provider currently requires Airflow 3 — the team is open to adding Airflow 2.11 support if demand is high enough.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">24:10</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> MCP server configs can be stored as Airflow connections.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/kaxil/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kaxil Naik</a></p><p>https://www.linkedin.com/in/kaxil/</p><p><br></p><p><a href="https://www.linkedin.com/in/pavan-kumar-gopidesu/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Pavan Kumar Gopidesu</a></p><p>https://www.linkedin.com/in/pavan-kumar-gopidesu/</p><p><br></p><p><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> | LinkedIn</p><p>https://www.linkedin.com/company/astronomer/</p><p><br></p><p><a href="https://www.astronomer.io" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> | Website</p><p>https://www.astronomer.io</p><p><br></p><p><a href="https://www.linkedin.com/company/experian/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Experian</a></p><p>https://www.linkedin.com/company/experian/</p><p><br></p><p><a href="https://www.linkedin.com/company/apache-airflow" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://www.linkedin.com/company/apache-airflow</p><p><br></p><p><a href="https://airflow.apache.org/docs/apache-airflow-providers-common-ai/stable/commits.html" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow common AI provider docs</a></p><p>https://airflow.apache.org/docs/apache-airflow-providers-common-ai/stable/commits.html</p><p><br></p><p><a href="https://datafusion.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache DataFusion</a></p><p>https://datafusion.apache.org/</p><p><br></p><p><a href="https://pydantic.dev/docs/ai/overview/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Pydantic AI</a></p><p>https://pydantic.dev/docs/ai/overview/</p><p><br></p><p><a href="https://airflow.apache.org/docs/apache-airflow-providers-slack/stable/index.html" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Slack</a></p><p>https://airflow.apache.org/docs/apache-airflow-providers-slack/stable/index.html</p><p><br></p><p><a href="https://airflow.apache.org/blog/common-ai-provider/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Introducing the Common AI Provider: LLM and AI Agent Support for Apache Airflow</a></p><p>https://airflow.apache.org/blog/common-ai-provider/</p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/35c8a78b-45fe-415a-a8d0-841cf151ed42/643e7ddb75.jpg" />
  <pubDate>Thu, 23 Apr 2026 10:25:00 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="27463648" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/35c8a78b-45fe-415a-a8d0-841cf151ed42/episode.mp3" />
  <itunes:title><![CDATA[Introducing Airflow’s Common AI Provider with Pavan Kumar Gopidesu and Kaxil Naik]]></itunes:title>
  <itunes:duration>28:36</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we explore the newly released Apache Airflow common AI provider — what problem it solves, how it was built and what's coming next.</span></p><p><br></p><p><a href="https://www.linkedin.com/in/kaxil/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kaxil Naik</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Director of Engineering at</span><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(22, 14, 61);"> </a><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a><span style="background-color: transparent; color: rgb(22, 14, 61);"> and </span><a href="https://www.linkedin.com/company/apache-airflow" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a><span style="background-color: transparent; color: rgb(22, 14, 61);"> PMC member, and</span><a href="https://www.linkedin.com/in/pavan-kumar-gopidesu/" target="_blank" style="background-color: transparent; color: rgb(22, 14, 61);"> </a><a href="https://www.linkedin.com/in/pavan-kumar-gopidesu/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Pavan Kumar Gopidesu</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Lead Data Engineer at </span><a href="https://www.linkedin.com/company/experian/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Experian</a><span style="background-color: transparent; color: rgb(22, 14, 61);"> and </span><a href="https://www.linkedin.com/company/apache-airflow" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a><span style="background-color: transparent; color: rgb(22, 14, 61);"> PMC member, join us to walk through the provider's first release and the technical decisions behind it.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">00:00</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Introduction.</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">04:05</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> The common AI provider was born from a real production problem.</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">07:10</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Airflow already had the primitives needed for durable agent execution, making it the natural foundation for AI orchestration.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">09:15</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> The LLM schema compare operator uses Apache DataFusion to fetch source schemas.</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">11:07</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Apache DataFusion was chosen for its speed.</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">13:09</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Hook tool sets expose Airflow's provider hooks to agents with an allowed methods list that blocks destructive operations.</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">15:20</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Passing durable=True to an LLM operator caches tool calls and LLM outputs mid-task.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">18:13</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> The provider offers three abstraction levels.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">21:20</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> The provider currently requires Airflow 3 — the team is open to adding Airflow 2.11 support if demand is high enough.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">24:10</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> MCP server configs can be stored as Airflow connections.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/kaxil/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kaxil Naik</a></p><p>https://www.linkedin.com/in/kaxil/</p><p><br></p><p><a href="https://www.linkedin.com/in/pavan-kumar-gopidesu/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Pavan Kumar Gopidesu</a></p><p>https://www.linkedin.com/in/pavan-kumar-gopidesu/</p><p><br></p><p><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> | LinkedIn</p><p>https://www.linkedin.com/company/astronomer/</p><p><br></p><p><a href="https://www.astronomer.io" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> | Website</p><p>https://www.astronomer.io</p><p><br></p><p><a href="https://www.linkedin.com/company/experian/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Experian</a></p><p>https://www.linkedin.com/company/experian/</p><p><br></p><p><a href="https://www.linkedin.com/company/apache-airflow" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://www.linkedin.com/company/apache-airflow</p><p><br></p><p><a href="https://airflow.apache.org/docs/apache-airflow-providers-common-ai/stable/commits.html" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow common AI provider docs</a></p><p>https://airflow.apache.org/docs/apache-airflow-providers-common-ai/stable/commits.html</p><p><br></p><p><a href="https://datafusion.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache DataFusion</a></p><p>https://datafusion.apache.org/</p><p><br></p><p><a href="https://pydantic.dev/docs/ai/overview/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Pydantic AI</a></p><p>https://pydantic.dev/docs/ai/overview/</p><p><br></p><p><a href="https://airflow.apache.org/docs/apache-airflow-providers-slack/stable/index.html" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Slack</a></p><p>https://airflow.apache.org/docs/apache-airflow-providers-slack/stable/index.html</p><p><br></p><p><a href="https://airflow.apache.org/blog/common-ai-provider/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Introducing the Common AI Provider: LLM and AI Agent Support for Apache Airflow</a></p><p>https://airflow.apache.org/blog/common-ai-provider/</p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we explore the newly released Apache Airflow common AI provider — what problem it solves, how it was built and what's coming next.</span></p><p><br></p><p><a href="https://www.linkedin.com/in/kaxil/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kaxil Naik</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Director of Engineering at</span><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(22, 14, 61);"> </a><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a><span style="background-color: transparent; color: rgb(22, 14, 61);"> and </span><a href="https://www.linkedin.com/company/apache-airflow" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a><span style="background-color: transparent; color: rgb(22, 14, 61);"> PMC member, and</span><a href="https://www.linkedin.com/in/pavan-kumar-gopidesu/" target="_blank" style="background-color: transparent; color: rgb(22, 14, 61);"> </a><a href="https://www.linkedin.com/in/pavan-kumar-gopidesu/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Pavan Kumar Gopidesu</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Lead Data Engineer at </span><a href="https://www.linkedin.com/company/experian/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Experian</a><span style="background-color: transparent; color: rgb(22, 14, 61);"> and </span><a href="https://www.linkedin.com/company/apache-airflow" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a><span style="background-color: transparent; color: rgb(22, 14, 61);"> PMC member, join us to walk through the provider's first release and the technical decisions behind it.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">00:00</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Introduction.</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">04:05</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> The common AI provider was born from a real production problem.</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">07:10</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Airflow already had the primitives needed for durable agent execution, making it the natural foundation for AI orchestration.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">09:15</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> The LLM schema compare operator uses Apache DataFusion to fetch source schemas.</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">11:07</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Apache DataFusion was chosen for its speed.</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">13:09</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Hook tool sets expose Airflow's provider hooks to agents with an allowed methods list that blocks destructive operations.</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">15:20</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> Passing durable=True to an LLM operator caches tool calls and LLM outputs mid-task.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">18:13</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> The provider offers three abstraction levels.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">21:20</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> The provider currently requires Airflow 3 — the team is open to adding Airflow 2.11 support if demand is high enough.&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">24:10</strong><span style="background-color: transparent; color: rgb(22, 14, 61);"> MCP server configs can be stored as Airflow connections.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/kaxil/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kaxil Naik</a></p><p>https://www.linkedin.com/in/kaxil/</p><p><br></p><p><a href="https://www.linkedin.com/in/pavan-kumar-gopidesu/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Pavan Kumar Gopidesu</a></p><p>https://www.linkedin.com/in/pavan-kumar-gopidesu/</p><p><br></p><p><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> | LinkedIn</p><p>https://www.linkedin.com/company/astronomer/</p><p><br></p><p><a href="https://www.astronomer.io" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> | Website</p><p>https://www.astronomer.io</p><p><br></p><p><a href="https://www.linkedin.com/company/experian/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Experian</a></p><p>https://www.linkedin.com/company/experian/</p><p><br></p><p><a href="https://www.linkedin.com/company/apache-airflow" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://www.linkedin.com/company/apache-airflow</p><p><br></p><p><a href="https://airflow.apache.org/docs/apache-airflow-providers-common-ai/stable/commits.html" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow common AI provider docs</a></p><p>https://airflow.apache.org/docs/apache-airflow-providers-common-ai/stable/commits.html</p><p><br></p><p><a href="https://datafusion.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache DataFusion</a></p><p>https://datafusion.apache.org/</p><p><br></p><p><a href="https://pydantic.dev/docs/ai/overview/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Pydantic AI</a></p><p>https://pydantic.dev/docs/ai/overview/</p><p><br></p><p><a href="https://airflow.apache.org/docs/apache-airflow-providers-slack/stable/index.html" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Slack</a></p><p>https://airflow.apache.org/docs/apache-airflow-providers-slack/stable/index.html</p><p><br></p><p><a href="https://airflow.apache.org/blog/common-ai-provider/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Introducing the Common AI Provider: LLM and AI Agent Support for Apache Airflow</a></p><p>https://airflow.apache.org/blog/common-ai-provider/</p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[In this episode, we explore the newly released Apache Airflow common AI provider — what problem it solves, how it was built and what's coming next.Kaxil Naik, Senior Director of Engineering at Astronomer and Apache Airflow PMC member, and Pavan Kum...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>78</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[58235e2f-edef-47ab-b968-2a4b90b2d50f]]></guid>
  <title><![CDATA[Building AI Debugging Agents Into Airflow DAGs at Jeppesen ForeFlight with Samantha Blaney Cuevas]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Aviation data pipelines run on strict 28-day publication cycles, and the margin for error is zero. In this episode, we're joined by </span><a href="https://www.linkedin.com/in/samantha-blaney/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Samantha Blaney Cuevas</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Software Engineer at</span><a href="https://www.linkedin.com/company/jeppesen-foreflight/" target="_blank" style="background-color: transparent; color: rgb(22, 14, 61);"> </a><a href="https://www.linkedin.com/company/jeppesen-foreflight/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jeppesen ForeFlight</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, to explore how her team orchestrates a complex, time-sensitive data pipeline with Airflow and where AI is starting to fit into that picture.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">04:05 Airflow orchestrates almost all business logic and data transformations across the cycle, with custom timetables built to track busy and slow periods programmatically.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">06:10 Cycle-aware sensing tasks handle irregular source deliveries, including duplicates and early or late arrivals, without disrupting the pipeline.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">08:07 The two main AI use cases are pipeline debugging and cycle awareness — both designed to reduce the manual overhead of monitoring a complex DAG dependency graph.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">09:03 The Data Port agent is a two-task DAG that routes Slack pipeline alerts to either a predefined command list or an AI token, depending on whether the fix is already known.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">13:10 AI is still in development at Jeppesen ForeFlight — the team is focused on token efficiency and scoping how much autonomy to give agents across different environments.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">15:04 Airflow setup and MCP configuration were straightforward — the harder design work was deciding which environments agents could access across QA staging and production.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">17:06 Airflow's skills repo and agent tooling are helping onboard new developers and extend pipeline awareness to analysts who work alongside engineers on the cycle.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">19:10 Samantha would like to see single-task retries with different parameters in Airflow — resetting one task without clearing the full pipeline run.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">21:05 A future AI use case under consideration is live DAG editing and re-upload within Airflow to make one-off fixes without halting pipeline progress.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/samantha-blaney/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Samantha Blaney Cuevas</a></p><p>https://www.linkedin.com/in/samantha-blaney/</p><p><br></p><p><a href="https://www.linkedin.com/company/jeppesen-foreflight/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jeppesen ForeFlight</a> | LinkedIn</p><p>https://www.linkedin.com/company/jeppesen-foreflight/</p><p><br></p><p><a href="http://www.foreflight.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jeppesen ForeFlight</a> | Website</p><p>http://www.foreflight.com</p><p><br></p><p><a href="http://www.github.com/astronomer/airflow-llm-providers-demo" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer Airflow Skills Repo</a></p><p>http://www.github.com/astronomer/airflow-llm-providers-demo</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;</span></p><p>https://airflow.apache.org/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/1543054c-5a2f-47fa-a70d-12757ca2165f/a86b9d3d25.jpg" />
  <pubDate>Thu, 16 Apr 2026 00:00:00 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="21396547" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/1543054c-5a2f-47fa-a70d-12757ca2165f/episode.mp3" />
  <itunes:title><![CDATA[Building AI Debugging Agents Into Airflow DAGs at Jeppesen ForeFlight with Samantha Blaney Cuevas]]></itunes:title>
  <itunes:duration>22:17</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Aviation data pipelines run on strict 28-day publication cycles, and the margin for error is zero. In this episode, we're joined by </span><a href="https://www.linkedin.com/in/samantha-blaney/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Samantha Blaney Cuevas</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Software Engineer at</span><a href="https://www.linkedin.com/company/jeppesen-foreflight/" target="_blank" style="background-color: transparent; color: rgb(22, 14, 61);"> </a><a href="https://www.linkedin.com/company/jeppesen-foreflight/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jeppesen ForeFlight</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, to explore how her team orchestrates a complex, time-sensitive data pipeline with Airflow and where AI is starting to fit into that picture.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">04:05 Airflow orchestrates almost all business logic and data transformations across the cycle, with custom timetables built to track busy and slow periods programmatically.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">06:10 Cycle-aware sensing tasks handle irregular source deliveries, including duplicates and early or late arrivals, without disrupting the pipeline.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">08:07 The two main AI use cases are pipeline debugging and cycle awareness — both designed to reduce the manual overhead of monitoring a complex DAG dependency graph.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">09:03 The Data Port agent is a two-task DAG that routes Slack pipeline alerts to either a predefined command list or an AI token, depending on whether the fix is already known.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">13:10 AI is still in development at Jeppesen ForeFlight — the team is focused on token efficiency and scoping how much autonomy to give agents across different environments.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">15:04 Airflow setup and MCP configuration were straightforward — the harder design work was deciding which environments agents could access across QA staging and production.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">17:06 Airflow's skills repo and agent tooling are helping onboard new developers and extend pipeline awareness to analysts who work alongside engineers on the cycle.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">19:10 Samantha would like to see single-task retries with different parameters in Airflow — resetting one task without clearing the full pipeline run.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">21:05 A future AI use case under consideration is live DAG editing and re-upload within Airflow to make one-off fixes without halting pipeline progress.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/samantha-blaney/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Samantha Blaney Cuevas</a></p><p>https://www.linkedin.com/in/samantha-blaney/</p><p><br></p><p><a href="https://www.linkedin.com/company/jeppesen-foreflight/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jeppesen ForeFlight</a> | LinkedIn</p><p>https://www.linkedin.com/company/jeppesen-foreflight/</p><p><br></p><p><a href="http://www.foreflight.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jeppesen ForeFlight</a> | Website</p><p>http://www.foreflight.com</p><p><br></p><p><a href="http://www.github.com/astronomer/airflow-llm-providers-demo" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer Airflow Skills Repo</a></p><p>http://www.github.com/astronomer/airflow-llm-providers-demo</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;</span></p><p>https://airflow.apache.org/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Aviation data pipelines run on strict 28-day publication cycles, and the margin for error is zero. In this episode, we're joined by </span><a href="https://www.linkedin.com/in/samantha-blaney/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Samantha Blaney Cuevas</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Software Engineer at</span><a href="https://www.linkedin.com/company/jeppesen-foreflight/" target="_blank" style="background-color: transparent; color: rgb(22, 14, 61);"> </a><a href="https://www.linkedin.com/company/jeppesen-foreflight/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jeppesen ForeFlight</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, to explore how her team orchestrates a complex, time-sensitive data pipeline with Airflow and where AI is starting to fit into that picture.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">04:05 Airflow orchestrates almost all business logic and data transformations across the cycle, with custom timetables built to track busy and slow periods programmatically.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">06:10 Cycle-aware sensing tasks handle irregular source deliveries, including duplicates and early or late arrivals, without disrupting the pipeline.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">08:07 The two main AI use cases are pipeline debugging and cycle awareness — both designed to reduce the manual overhead of monitoring a complex DAG dependency graph.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">09:03 The Data Port agent is a two-task DAG that routes Slack pipeline alerts to either a predefined command list or an AI token, depending on whether the fix is already known.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">13:10 AI is still in development at Jeppesen ForeFlight — the team is focused on token efficiency and scoping how much autonomy to give agents across different environments.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">15:04 Airflow setup and MCP configuration were straightforward — the harder design work was deciding which environments agents could access across QA staging and production.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">17:06 Airflow's skills repo and agent tooling are helping onboard new developers and extend pipeline awareness to analysts who work alongside engineers on the cycle.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">19:10 Samantha would like to see single-task retries with different parameters in Airflow — resetting one task without clearing the full pipeline run.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">21:05 A future AI use case under consideration is live DAG editing and re-upload within Airflow to make one-off fixes without halting pipeline progress.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/samantha-blaney/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Samantha Blaney Cuevas</a></p><p>https://www.linkedin.com/in/samantha-blaney/</p><p><br></p><p><a href="https://www.linkedin.com/company/jeppesen-foreflight/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jeppesen ForeFlight</a> | LinkedIn</p><p>https://www.linkedin.com/company/jeppesen-foreflight/</p><p><br></p><p><a href="http://www.foreflight.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jeppesen ForeFlight</a> | Website</p><p>http://www.foreflight.com</p><p><br></p><p><a href="http://www.github.com/astronomer/airflow-llm-providers-demo" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer Airflow Skills Repo</a></p><p>http://www.github.com/astronomer/airflow-llm-providers-demo</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;</span></p><p>https://airflow.apache.org/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Aviation data pipelines run on strict 28-day publication cycles, and the margin for error is zero. In this episode, we're joined by Samantha Blaney Cuevas, Software Engineer at Jeppesen ForeFlight, to explore how her team orchestrates a complex, ti...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>77</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[87849d21-4021-45d4-9836-741e29fe6088]]></guid>
  <title><![CDATA[Introducing Airflow 3.2]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">We introduce Airflow 3.2 and its updates for teams that build and operate data pipelines.</span></p><p><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a><span style="background-color: transparent; color: rgb(22, 14, 61);">’s Head of Customer Education, </span><a href="https://www.linkedin.com/in/marclamberti/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Marc Lamberti</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, and </span><span style="background-color: transparent; color: rgb(51, 51, 51);">Senior Manager of Developer Relations, </span><a href="https://www.linkedin.com/in/kentendanas" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kenten Danas</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, break down what’s new, from asset partitioning to Async Python tasks and DAG versioning. They explore how these updates improve scheduling, performance and observability in production workflows.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);"><span class="ql-cursor">﻿</span>Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">02:10 Airflow 3 architecture separates workers from the metadata database.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">03:05 Plugin versioning and UI-based backfills simplify operations.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">06:20 Asset partitioning enables granular, partition-level scheduling.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">07:15 Triggering DAGs on partitions instead of full datasets.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">11:05 Deferrable operators reduce worker slot usage.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">12:00 Async operators reduce database pressure and overhead.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:10 Async improves throughput, not single task speed.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">22:20 Inlets and outlets improve asset lineage visibility.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">23:00 DAG version markers show changes directly in the UI.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/marclamberti/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Marc Lamberti</a></p><p>https://www.linkedin.com/in/marclamberti/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;</span></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> | LinkedIn</p><p>https://www.linkedin.com/company/astronomer/</p><p><br></p><p><a href="https://www.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> | Website</p><p>https://www.astronomer.io/</p><p><br></p><p><a href="https://www.astronomer.io/events/webinars/introducing-airflow-3-2-video" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">3.2 Webinar</a></p><p>https://www.astronomer.io/events/webinars/introducing-airflow-3-2-video</p><p><br></p><p><a href="https://www.astronomer.io/docs/learn/airflow-partitioned-runs" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Asset Partitioning Guide</a></p><p>https://www.astronomer.io/docs/learn/airflow-partitioned-runs</p><p><br></p><p><a href="https://www.astronomer.io/docs/learn/deferrable-operators" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Asynchronous Processes Guide</a></p><p>https://www.astronomer.io/docs/learn/deferrable-operators</p><p><br></p><p><a href="https://airflow.apache.org/docs/apache-airflow/stable/release_notes.html#airflow-3-2-0-2026-04-07" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Release Notes</a></p><p>https://airflow.apache.org/docs/apache-airflow/stable/release_notes.html#airflow-3-2-0-2026-04-07</p><p><br></p><p><a href="https://airflow.apache.org/registry/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Provider Registry</a></p><p>https://airflow.apache.org/registry/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/9d45ee56-8414-4873-912e-94c8f78f208c/39a5b96ea0.jpg" />
  <pubDate>Thu, 09 Apr 2026 00:00:00 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="25314196" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/9d45ee56-8414-4873-912e-94c8f78f208c/episode.mp3" />
  <itunes:title><![CDATA[Introducing Airflow 3.2]]></itunes:title>
  <itunes:duration>26:22</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">We introduce Airflow 3.2 and its updates for teams that build and operate data pipelines.</span></p><p><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a><span style="background-color: transparent; color: rgb(22, 14, 61);">’s Head of Customer Education, </span><a href="https://www.linkedin.com/in/marclamberti/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Marc Lamberti</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, and </span><span style="background-color: transparent; color: rgb(51, 51, 51);">Senior Manager of Developer Relations, </span><a href="https://www.linkedin.com/in/kentendanas" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kenten Danas</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, break down what’s new, from asset partitioning to Async Python tasks and DAG versioning. They explore how these updates improve scheduling, performance and observability in production workflows.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);"><span class="ql-cursor">﻿</span>Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">02:10 Airflow 3 architecture separates workers from the metadata database.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">03:05 Plugin versioning and UI-based backfills simplify operations.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">06:20 Asset partitioning enables granular, partition-level scheduling.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">07:15 Triggering DAGs on partitions instead of full datasets.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">11:05 Deferrable operators reduce worker slot usage.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">12:00 Async operators reduce database pressure and overhead.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:10 Async improves throughput, not single task speed.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">22:20 Inlets and outlets improve asset lineage visibility.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">23:00 DAG version markers show changes directly in the UI.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/marclamberti/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Marc Lamberti</a></p><p>https://www.linkedin.com/in/marclamberti/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;</span></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> | LinkedIn</p><p>https://www.linkedin.com/company/astronomer/</p><p><br></p><p><a href="https://www.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> | Website</p><p>https://www.astronomer.io/</p><p><br></p><p><a href="https://www.astronomer.io/events/webinars/introducing-airflow-3-2-video" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">3.2 Webinar</a></p><p>https://www.astronomer.io/events/webinars/introducing-airflow-3-2-video</p><p><br></p><p><a href="https://www.astronomer.io/docs/learn/airflow-partitioned-runs" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Asset Partitioning Guide</a></p><p>https://www.astronomer.io/docs/learn/airflow-partitioned-runs</p><p><br></p><p><a href="https://www.astronomer.io/docs/learn/deferrable-operators" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Asynchronous Processes Guide</a></p><p>https://www.astronomer.io/docs/learn/deferrable-operators</p><p><br></p><p><a href="https://airflow.apache.org/docs/apache-airflow/stable/release_notes.html#airflow-3-2-0-2026-04-07" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Release Notes</a></p><p>https://airflow.apache.org/docs/apache-airflow/stable/release_notes.html#airflow-3-2-0-2026-04-07</p><p><br></p><p><a href="https://airflow.apache.org/registry/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Provider Registry</a></p><p>https://airflow.apache.org/registry/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">We introduce Airflow 3.2 and its updates for teams that build and operate data pipelines.</span></p><p><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a><span style="background-color: transparent; color: rgb(22, 14, 61);">’s Head of Customer Education, </span><a href="https://www.linkedin.com/in/marclamberti/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Marc Lamberti</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, and </span><span style="background-color: transparent; color: rgb(51, 51, 51);">Senior Manager of Developer Relations, </span><a href="https://www.linkedin.com/in/kentendanas" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kenten Danas</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, break down what’s new, from asset partitioning to Async Python tasks and DAG versioning. They explore how these updates improve scheduling, performance and observability in production workflows.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);"><span class="ql-cursor">﻿</span>Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">02:10 Airflow 3 architecture separates workers from the metadata database.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">03:05 Plugin versioning and UI-based backfills simplify operations.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">06:20 Asset partitioning enables granular, partition-level scheduling.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">07:15 Triggering DAGs on partitions instead of full datasets.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">11:05 Deferrable operators reduce worker slot usage.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">12:00 Async operators reduce database pressure and overhead.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:10 Async improves throughput, not single task speed.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">22:20 Inlets and outlets improve asset lineage visibility.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">23:00 DAG version markers show changes directly in the UI.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/marclamberti/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Marc Lamberti</a></p><p>https://www.linkedin.com/in/marclamberti/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;</span></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> | LinkedIn</p><p>https://www.linkedin.com/company/astronomer/</p><p><br></p><p><a href="https://www.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> | Website</p><p>https://www.astronomer.io/</p><p><br></p><p><a href="https://www.astronomer.io/events/webinars/introducing-airflow-3-2-video" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">3.2 Webinar</a></p><p>https://www.astronomer.io/events/webinars/introducing-airflow-3-2-video</p><p><br></p><p><a href="https://www.astronomer.io/docs/learn/airflow-partitioned-runs" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Asset Partitioning Guide</a></p><p>https://www.astronomer.io/docs/learn/airflow-partitioned-runs</p><p><br></p><p><a href="https://www.astronomer.io/docs/learn/deferrable-operators" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Asynchronous Processes Guide</a></p><p>https://www.astronomer.io/docs/learn/deferrable-operators</p><p><br></p><p><a href="https://airflow.apache.org/docs/apache-airflow/stable/release_notes.html#airflow-3-2-0-2026-04-07" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Release Notes</a></p><p>https://airflow.apache.org/docs/apache-airflow/stable/release_notes.html#airflow-3-2-0-2026-04-07</p><p><br></p><p><a href="https://airflow.apache.org/registry/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Provider Registry</a></p><p>https://airflow.apache.org/registry/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[We introduce Airflow 3.2 and its updates for teams that build and operate data pipelines.Astronomer’s Head of Customer Education, Marc Lamberti, and Senior Manager of Developer Relations, Kenten Danas, break down what’s new, from asset partitioning...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>76</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[0b9f65a4-85da-4b01-8dc0-975611636cf7]]></guid>
  <title><![CDATA[Reflections on a Decade of Data Engineering at Seattle Data Guy]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Lessons from the past decade of data engineering reveal how much the ecosystem has changed and what has stayed surprisingly consistent.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/benjaminrogojan/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Benjamin Rogojan</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Owner and Data Consultant at </span><a href="https://www.linkedin.com/company/seattle-data-guy/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Seattle Data Guy</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins us to reflect on how the data engineering landscape has evolved alongside Apache Airflow. We explore when Airflow makes sense as an orchestrator, why batch processing is still dominant and how AI is reshaping the workflows and responsibilities of modern data engineers.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">03:00 Airflow becomes valuable when workflows involve many pipelines, teams and dependencies.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">05:00 Data engineers are still focused on making data accessible and aligning work with business needs.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">05:30 Batch pipelines remain the most common approach even as real-time use cases grow.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">07:45 Many “real-time” requests are actually event-driven batch workflows.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">09:00 Airflow replaced many custom-built pipeline systems with built-in dependency management.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">11:00 Modern orchestration tools often build on Airflow concepts or differentiate from them.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:00 AI can assist with writing SQL and pipelines but still requires experienced engineers.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">15:30 Organizations are collecting increasingly granular data creating more engineering demand.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">19:00 The data stack has shifted rapidly from Hadoop-era systems to modern cloud platforms.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/benjaminrogojan/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Benjamin Rogojan</a></p><p>https://www.linkedin.com/in/benjaminrogojan/</p><p><br></p><p><a href="https://www.linkedin.com/company/seattle-data-guy/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Seattle Data Guy</a></p><p>https://www.linkedin.com/company/seattle-data-guy/</p><p><br></p><p><a href="https://airflow.apache.org" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org</p><p><br></p><p><a href="https://airflowsummit.org" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Summit / Airflow Conference</a></p><p>https://airflowsummit.org</p><p><br></p><p><a href="https://www.snowflake.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Snowflake</a></p><p>https://www.snowflake.com</p><p><br></p><p><a href="https://developers.hubspot.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">HubSpot Data Sharing / APIs</a></p><p>https://developers.hubspot.com</p><p><br></p><p><a href="https://mlflow.org" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">MLflow</a></p><p>https://mlflow.org</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/e5b5ee60-0877-412a-9a4f-cc59514f9c40/deffb5ddb2.jpg" />
  <pubDate>Fri, 03 Apr 2026 02:06:00 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="25164570" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/e5b5ee60-0877-412a-9a4f-cc59514f9c40/episode.mp3" />
  <itunes:title><![CDATA[Reflections on a Decade of Data Engineering at Seattle Data Guy]]></itunes:title>
  <itunes:duration>26:12</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Lessons from the past decade of data engineering reveal how much the ecosystem has changed and what has stayed surprisingly consistent.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/benjaminrogojan/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Benjamin Rogojan</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Owner and Data Consultant at </span><a href="https://www.linkedin.com/company/seattle-data-guy/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Seattle Data Guy</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins us to reflect on how the data engineering landscape has evolved alongside Apache Airflow. We explore when Airflow makes sense as an orchestrator, why batch processing is still dominant and how AI is reshaping the workflows and responsibilities of modern data engineers.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">03:00 Airflow becomes valuable when workflows involve many pipelines, teams and dependencies.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">05:00 Data engineers are still focused on making data accessible and aligning work with business needs.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">05:30 Batch pipelines remain the most common approach even as real-time use cases grow.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">07:45 Many “real-time” requests are actually event-driven batch workflows.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">09:00 Airflow replaced many custom-built pipeline systems with built-in dependency management.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">11:00 Modern orchestration tools often build on Airflow concepts or differentiate from them.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:00 AI can assist with writing SQL and pipelines but still requires experienced engineers.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">15:30 Organizations are collecting increasingly granular data creating more engineering demand.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">19:00 The data stack has shifted rapidly from Hadoop-era systems to modern cloud platforms.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/benjaminrogojan/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Benjamin Rogojan</a></p><p>https://www.linkedin.com/in/benjaminrogojan/</p><p><br></p><p><a href="https://www.linkedin.com/company/seattle-data-guy/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Seattle Data Guy</a></p><p>https://www.linkedin.com/company/seattle-data-guy/</p><p><br></p><p><a href="https://airflow.apache.org" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org</p><p><br></p><p><a href="https://airflowsummit.org" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Summit / Airflow Conference</a></p><p>https://airflowsummit.org</p><p><br></p><p><a href="https://www.snowflake.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Snowflake</a></p><p>https://www.snowflake.com</p><p><br></p><p><a href="https://developers.hubspot.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">HubSpot Data Sharing / APIs</a></p><p>https://developers.hubspot.com</p><p><br></p><p><a href="https://mlflow.org" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">MLflow</a></p><p>https://mlflow.org</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Lessons from the past decade of data engineering reveal how much the ecosystem has changed and what has stayed surprisingly consistent.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/benjaminrogojan/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Benjamin Rogojan</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Owner and Data Consultant at </span><a href="https://www.linkedin.com/company/seattle-data-guy/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Seattle Data Guy</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins us to reflect on how the data engineering landscape has evolved alongside Apache Airflow. We explore when Airflow makes sense as an orchestrator, why batch processing is still dominant and how AI is reshaping the workflows and responsibilities of modern data engineers.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">03:00 Airflow becomes valuable when workflows involve many pipelines, teams and dependencies.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">05:00 Data engineers are still focused on making data accessible and aligning work with business needs.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">05:30 Batch pipelines remain the most common approach even as real-time use cases grow.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">07:45 Many “real-time” requests are actually event-driven batch workflows.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">09:00 Airflow replaced many custom-built pipeline systems with built-in dependency management.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">11:00 Modern orchestration tools often build on Airflow concepts or differentiate from them.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:00 AI can assist with writing SQL and pipelines but still requires experienced engineers.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">15:30 Organizations are collecting increasingly granular data creating more engineering demand.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">19:00 The data stack has shifted rapidly from Hadoop-era systems to modern cloud platforms.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/benjaminrogojan/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Benjamin Rogojan</a></p><p>https://www.linkedin.com/in/benjaminrogojan/</p><p><br></p><p><a href="https://www.linkedin.com/company/seattle-data-guy/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Seattle Data Guy</a></p><p>https://www.linkedin.com/company/seattle-data-guy/</p><p><br></p><p><a href="https://airflow.apache.org" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org</p><p><br></p><p><a href="https://airflowsummit.org" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Summit / Airflow Conference</a></p><p>https://airflowsummit.org</p><p><br></p><p><a href="https://www.snowflake.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Snowflake</a></p><p>https://www.snowflake.com</p><p><br></p><p><a href="https://developers.hubspot.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">HubSpot Data Sharing / APIs</a></p><p>https://developers.hubspot.com</p><p><br></p><p><a href="https://mlflow.org" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">MLflow</a></p><p>https://mlflow.org</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Lessons from the past decade of data engineering reveal how much the ecosystem has changed and what has stayed surprisingly consistent.In this episode, Benjamin Rogojan, Owner and Data Consultant at Seattle Data Guy, joins us to reflect on how the ...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>75</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[5694749b-ef0f-484f-893b-f86bc31dcea2]]></guid>
  <title><![CDATA[Managing Data Quality and Governance With Airflow at Credit Karma with Ashir Alam]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Data quality is not optional when you manage credit data at scale.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/ashir-alam/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ashir Alam</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Data Engineer at </span><a href="https://www.linkedin.com/company/intuit-credit-karma/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Credit Karma</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins us to share how his team acts as the gatekeeper for credit data ingestion, how they standardize data quality with Airflow and DAG Factory and how they scale safely across thousands of DAGs. We explore how governance, PII protection and orchestration come together inside a modern data platform.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);"><span class="ql-cursor">﻿﻿﻿﻿﻿</span></span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">00:00 Introduction.</span></p><p><span style="background-color: transparent;">01:00 Overview of Credit Karma’s products and financial data ecosystem.</span></p><p><span style="background-color: transparent;">02:00 The team acts as gatekeepers for ingesting data from TransUnion and Equifax.</span></p><p><span style="background-color: transparent;">03:00 Why PII handling and controlled downstream access led to adopting Airflow.</span></p><p><span style="background-color: transparent;">04:00 BigQuery as the warehouse and Airflow as the primary orchestrator.</span></p><p><span style="background-color: transparent;">05:00 Why data quality and governance are critical in financial systems.</span></p><p><span style="background-color: transparent;">07:00 Why Airflow was selected: ease of use and unified ETL plus data quality.</span></p><p><span style="background-color: transparent;">09:00 Introduction to DAG Factory and YAML-based DAG generation.</span></p><p><span style="background-color: transparent;">10:00 GitHub executor creates PR-driven DAG workflows with CI checks.</span></p><p><span style="background-color: transparent;">12:00 BigQuery operators, structured checks and custom Slack and PagerDuty alerts.</span></p><p><span style="background-color: transparent;">13:00 Failed checks stop ETL pipelines and trigger notifications.</span></p><p><span style="background-color: transparent;">17:00 Scaling DAG Factory across thousands of DAGs and runtime vs compile-time concerns.</span></p><p><span style="background-color: transparent;">19:00 Future improvements: better defaults, retries and GenAI workflows in Airflow.</span></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/ashir-alam/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ashir Alam</a></p><p>https://www.linkedin.com/in/ashir-alam/</p><p><br></p><p><a href="https://www.linkedin.com/company/intuit-credit-karma/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Credit Karma</a></p><p>https://www.linkedin.com/company/intuit-credit-karma/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://github.com/astronomer/dag-factory" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">DAG Factory</a></p><p>https://github.com/astronomer/dag-factory</p><p><br></p><p><a href="https://cloud.google.com/bigquery" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">BigQuery (Google Cloud)</a></p><p>https://cloud.google.com/bigquery</p><p><br></p><p><a href="https://github.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">GitHub</a></p><p>https://github.com/</p><p><br></p><p><a href="https://slack.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Slack</a></p><p>https://slack.com/</p><p><br></p><p><a href="https://www.pagerduty.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">PagerDuty</a></p><p>https://www.pagerduty.com/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/395354de-f229-4bb7-bfeb-0513e1c3f554/306cc48222.jpg" />
  <pubDate>Thu, 26 Mar 2026 05:33:00 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="21197361" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/395354de-f229-4bb7-bfeb-0513e1c3f554/episode.mp3" />
  <itunes:title><![CDATA[Managing Data Quality and Governance With Airflow at Credit Karma with Ashir Alam]]></itunes:title>
  <itunes:duration>22:04</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Data quality is not optional when you manage credit data at scale.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/ashir-alam/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ashir Alam</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Data Engineer at </span><a href="https://www.linkedin.com/company/intuit-credit-karma/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Credit Karma</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins us to share how his team acts as the gatekeeper for credit data ingestion, how they standardize data quality with Airflow and DAG Factory and how they scale safely across thousands of DAGs. We explore how governance, PII protection and orchestration come together inside a modern data platform.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);"><span class="ql-cursor">﻿﻿﻿﻿﻿</span></span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">00:00 Introduction.</span></p><p><span style="background-color: transparent;">01:00 Overview of Credit Karma’s products and financial data ecosystem.</span></p><p><span style="background-color: transparent;">02:00 The team acts as gatekeepers for ingesting data from TransUnion and Equifax.</span></p><p><span style="background-color: transparent;">03:00 Why PII handling and controlled downstream access led to adopting Airflow.</span></p><p><span style="background-color: transparent;">04:00 BigQuery as the warehouse and Airflow as the primary orchestrator.</span></p><p><span style="background-color: transparent;">05:00 Why data quality and governance are critical in financial systems.</span></p><p><span style="background-color: transparent;">07:00 Why Airflow was selected: ease of use and unified ETL plus data quality.</span></p><p><span style="background-color: transparent;">09:00 Introduction to DAG Factory and YAML-based DAG generation.</span></p><p><span style="background-color: transparent;">10:00 GitHub executor creates PR-driven DAG workflows with CI checks.</span></p><p><span style="background-color: transparent;">12:00 BigQuery operators, structured checks and custom Slack and PagerDuty alerts.</span></p><p><span style="background-color: transparent;">13:00 Failed checks stop ETL pipelines and trigger notifications.</span></p><p><span style="background-color: transparent;">17:00 Scaling DAG Factory across thousands of DAGs and runtime vs compile-time concerns.</span></p><p><span style="background-color: transparent;">19:00 Future improvements: better defaults, retries and GenAI workflows in Airflow.</span></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/ashir-alam/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ashir Alam</a></p><p>https://www.linkedin.com/in/ashir-alam/</p><p><br></p><p><a href="https://www.linkedin.com/company/intuit-credit-karma/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Credit Karma</a></p><p>https://www.linkedin.com/company/intuit-credit-karma/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://github.com/astronomer/dag-factory" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">DAG Factory</a></p><p>https://github.com/astronomer/dag-factory</p><p><br></p><p><a href="https://cloud.google.com/bigquery" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">BigQuery (Google Cloud)</a></p><p>https://cloud.google.com/bigquery</p><p><br></p><p><a href="https://github.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">GitHub</a></p><p>https://github.com/</p><p><br></p><p><a href="https://slack.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Slack</a></p><p>https://slack.com/</p><p><br></p><p><a href="https://www.pagerduty.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">PagerDuty</a></p><p>https://www.pagerduty.com/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Data quality is not optional when you manage credit data at scale.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/ashir-alam/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ashir Alam</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Data Engineer at </span><a href="https://www.linkedin.com/company/intuit-credit-karma/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Credit Karma</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins us to share how his team acts as the gatekeeper for credit data ingestion, how they standardize data quality with Airflow and DAG Factory and how they scale safely across thousands of DAGs. We explore how governance, PII protection and orchestration come together inside a modern data platform.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);"><span class="ql-cursor">﻿﻿﻿﻿﻿</span></span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">00:00 Introduction.</span></p><p><span style="background-color: transparent;">01:00 Overview of Credit Karma’s products and financial data ecosystem.</span></p><p><span style="background-color: transparent;">02:00 The team acts as gatekeepers for ingesting data from TransUnion and Equifax.</span></p><p><span style="background-color: transparent;">03:00 Why PII handling and controlled downstream access led to adopting Airflow.</span></p><p><span style="background-color: transparent;">04:00 BigQuery as the warehouse and Airflow as the primary orchestrator.</span></p><p><span style="background-color: transparent;">05:00 Why data quality and governance are critical in financial systems.</span></p><p><span style="background-color: transparent;">07:00 Why Airflow was selected: ease of use and unified ETL plus data quality.</span></p><p><span style="background-color: transparent;">09:00 Introduction to DAG Factory and YAML-based DAG generation.</span></p><p><span style="background-color: transparent;">10:00 GitHub executor creates PR-driven DAG workflows with CI checks.</span></p><p><span style="background-color: transparent;">12:00 BigQuery operators, structured checks and custom Slack and PagerDuty alerts.</span></p><p><span style="background-color: transparent;">13:00 Failed checks stop ETL pipelines and trigger notifications.</span></p><p><span style="background-color: transparent;">17:00 Scaling DAG Factory across thousands of DAGs and runtime vs compile-time concerns.</span></p><p><span style="background-color: transparent;">19:00 Future improvements: better defaults, retries and GenAI workflows in Airflow.</span></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/ashir-alam/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ashir Alam</a></p><p>https://www.linkedin.com/in/ashir-alam/</p><p><br></p><p><a href="https://www.linkedin.com/company/intuit-credit-karma/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Credit Karma</a></p><p>https://www.linkedin.com/company/intuit-credit-karma/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://github.com/astronomer/dag-factory" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">DAG Factory</a></p><p>https://github.com/astronomer/dag-factory</p><p><br></p><p><a href="https://cloud.google.com/bigquery" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">BigQuery (Google Cloud)</a></p><p>https://cloud.google.com/bigquery</p><p><br></p><p><a href="https://github.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">GitHub</a></p><p>https://github.com/</p><p><br></p><p><a href="https://slack.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Slack</a></p><p>https://slack.com/</p><p><br></p><p><a href="https://www.pagerduty.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">PagerDuty</a></p><p>https://www.pagerduty.com/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Data quality is not optional when you manage credit data at scale.In this episode, Ashir Alam, Senior Data Engineer at Credit Karma, joins us to share how his team acts as the gatekeeper for credit data ingestion, how they standardize data quality ...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>74</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[fca554c0-14c3-482a-a1f6-9c426cda6c45]]></guid>
  <title><![CDATA[Open Source Airflow Contributions and Performance Improvements at G-Research with Christos Bisias]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Modern Airflow isn’t just orchestration. It's a contribution.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);"><span class="ql-cursor">﻿﻿</span></span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we explore how open source investment drives real performance gains and deeper observability.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">We’re joined by </span><a href="https://www.linkedin.com/in/xbis/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Christos Bisias</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Open Source Software Engineer</span><span style="background-color: transparent; color: rgb(51, 51, 51);">, </span><span style="background-color: transparent;">Apache Airflow</span><span style="background-color: transparent; color: rgb(22, 14, 61);"> at </span><a href="https://www.linkedin.com/company/g-research/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">G-Research</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, to discuss how his team uses Airflow for large-scale data transformations, contributes upstream and improves scheduler throughput and OpenTelemetry support. From trace-level observability to CI-enforced metrics governance and a major scheduler optimization, this conversation spans strategy, engineering and community impact.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">00:00 Introduction.</span></p><p><span style="background-color: transparent;">01:20 </span><span style="background-color: transparent; color: rgb(22, 14, 61);">How G-Research applies machine learning and big data to predict financial market movements.</span></p><p><span style="background-color: transparent;">02:15 Contributing to open source is a business decision.</span></p><p><span style="background-color: transparent;">03:10 Maintaining a fork is costly.</span></p><p><span style="background-color: transparent;">04:30 </span><span style="background-color: transparent; color: rgb(22, 14, 61);">OpenTelemetry collects metrics, logs and traces to provide deep system visibility. </span></p><p><span style="background-color: transparent;">06:10 </span><span style="background-color: transparent; color: rgb(22, 14, 61);">Custom spans help identify bottlenecks inside tasks and enable performance optimization. </span></p><p><span style="background-color: transparent;">08:05 </span><span style="background-color: transparent; color: rgb(22, 14, 61);">OpenTelemetry integration works properly in Airflow 3.0 and above.</span></p><p><span style="background-color: transparent;">10:00 </span><span style="background-color: transparent; color: rgb(22, 14, 61);">A YAML-based metrics registry with CI enforcement ensures consistency between docs and exported metrics.</span></p><p><span style="background-color: transparent;">12:10 </span><span style="background-color: transparent; color: rgb(22, 14, 61);">Scheduler throughput improved significantly by applying concurrency limits earlier in the database query.&nbsp; </span></p><p><span style="background-color: transparent;">15:20 </span><span style="background-color: transparent; color: rgb(22, 14, 61);">Future Task SDK changes may enable language-agnostic DAG authoring beyond Python.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/xbis/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Christos Bisias</a></p><p>https://www.linkedin.com/in/xbis/</p><p><br></p><p><a href="https://www.linkedin.com/company/g-research/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">G-Research</a> </p><p>https://www.linkedin.com/company/g-research/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://opentelemetry.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">OpenTelemetry</a></p><p>https://opentelemetry.io/</p><p><br></p><p><a href="https://prometheus.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Prometheus</a></p><p>https://prometheus.io/</p><p><br></p><p><a href="https://grafana.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Grafana</a></p><p>https://grafana.com/</p><p><br></p><p><a href="https://www.jaegertracing.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jaeger</a></p><p>https://www.jaegertracing.io/</p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/aecc3072-6118-4227-8e0b-15d753a45586/304785ec5d.jpg" />
  <pubDate>Thu, 19 Mar 2026 00:00:00 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="17024055" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/aecc3072-6118-4227-8e0b-15d753a45586/episode.mp3" />
  <itunes:title><![CDATA[Open Source Airflow Contributions and Performance Improvements at G-Research with Christos Bisias]]></itunes:title>
  <itunes:duration>17:43</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Modern Airflow isn’t just orchestration. It's a contribution.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);"><span class="ql-cursor">﻿﻿</span></span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we explore how open source investment drives real performance gains and deeper observability.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">We’re joined by </span><a href="https://www.linkedin.com/in/xbis/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Christos Bisias</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Open Source Software Engineer</span><span style="background-color: transparent; color: rgb(51, 51, 51);">, </span><span style="background-color: transparent;">Apache Airflow</span><span style="background-color: transparent; color: rgb(22, 14, 61);"> at </span><a href="https://www.linkedin.com/company/g-research/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">G-Research</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, to discuss how his team uses Airflow for large-scale data transformations, contributes upstream and improves scheduler throughput and OpenTelemetry support. From trace-level observability to CI-enforced metrics governance and a major scheduler optimization, this conversation spans strategy, engineering and community impact.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">00:00 Introduction.</span></p><p><span style="background-color: transparent;">01:20 </span><span style="background-color: transparent; color: rgb(22, 14, 61);">How G-Research applies machine learning and big data to predict financial market movements.</span></p><p><span style="background-color: transparent;">02:15 Contributing to open source is a business decision.</span></p><p><span style="background-color: transparent;">03:10 Maintaining a fork is costly.</span></p><p><span style="background-color: transparent;">04:30 </span><span style="background-color: transparent; color: rgb(22, 14, 61);">OpenTelemetry collects metrics, logs and traces to provide deep system visibility. </span></p><p><span style="background-color: transparent;">06:10 </span><span style="background-color: transparent; color: rgb(22, 14, 61);">Custom spans help identify bottlenecks inside tasks and enable performance optimization. </span></p><p><span style="background-color: transparent;">08:05 </span><span style="background-color: transparent; color: rgb(22, 14, 61);">OpenTelemetry integration works properly in Airflow 3.0 and above.</span></p><p><span style="background-color: transparent;">10:00 </span><span style="background-color: transparent; color: rgb(22, 14, 61);">A YAML-based metrics registry with CI enforcement ensures consistency between docs and exported metrics.</span></p><p><span style="background-color: transparent;">12:10 </span><span style="background-color: transparent; color: rgb(22, 14, 61);">Scheduler throughput improved significantly by applying concurrency limits earlier in the database query.&nbsp; </span></p><p><span style="background-color: transparent;">15:20 </span><span style="background-color: transparent; color: rgb(22, 14, 61);">Future Task SDK changes may enable language-agnostic DAG authoring beyond Python.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/xbis/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Christos Bisias</a></p><p>https://www.linkedin.com/in/xbis/</p><p><br></p><p><a href="https://www.linkedin.com/company/g-research/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">G-Research</a> </p><p>https://www.linkedin.com/company/g-research/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://opentelemetry.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">OpenTelemetry</a></p><p>https://opentelemetry.io/</p><p><br></p><p><a href="https://prometheus.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Prometheus</a></p><p>https://prometheus.io/</p><p><br></p><p><a href="https://grafana.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Grafana</a></p><p>https://grafana.com/</p><p><br></p><p><a href="https://www.jaegertracing.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jaeger</a></p><p>https://www.jaegertracing.io/</p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Modern Airflow isn’t just orchestration. It's a contribution.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);"><span class="ql-cursor">﻿﻿</span></span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we explore how open source investment drives real performance gains and deeper observability.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">We’re joined by </span><a href="https://www.linkedin.com/in/xbis/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Christos Bisias</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Open Source Software Engineer</span><span style="background-color: transparent; color: rgb(51, 51, 51);">, </span><span style="background-color: transparent;">Apache Airflow</span><span style="background-color: transparent; color: rgb(22, 14, 61);"> at </span><a href="https://www.linkedin.com/company/g-research/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">G-Research</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, to discuss how his team uses Airflow for large-scale data transformations, contributes upstream and improves scheduler throughput and OpenTelemetry support. From trace-level observability to CI-enforced metrics governance and a major scheduler optimization, this conversation spans strategy, engineering and community impact.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">00:00 Introduction.</span></p><p><span style="background-color: transparent;">01:20 </span><span style="background-color: transparent; color: rgb(22, 14, 61);">How G-Research applies machine learning and big data to predict financial market movements.</span></p><p><span style="background-color: transparent;">02:15 Contributing to open source is a business decision.</span></p><p><span style="background-color: transparent;">03:10 Maintaining a fork is costly.</span></p><p><span style="background-color: transparent;">04:30 </span><span style="background-color: transparent; color: rgb(22, 14, 61);">OpenTelemetry collects metrics, logs and traces to provide deep system visibility. </span></p><p><span style="background-color: transparent;">06:10 </span><span style="background-color: transparent; color: rgb(22, 14, 61);">Custom spans help identify bottlenecks inside tasks and enable performance optimization. </span></p><p><span style="background-color: transparent;">08:05 </span><span style="background-color: transparent; color: rgb(22, 14, 61);">OpenTelemetry integration works properly in Airflow 3.0 and above.</span></p><p><span style="background-color: transparent;">10:00 </span><span style="background-color: transparent; color: rgb(22, 14, 61);">A YAML-based metrics registry with CI enforcement ensures consistency between docs and exported metrics.</span></p><p><span style="background-color: transparent;">12:10 </span><span style="background-color: transparent; color: rgb(22, 14, 61);">Scheduler throughput improved significantly by applying concurrency limits earlier in the database query.&nbsp; </span></p><p><span style="background-color: transparent;">15:20 </span><span style="background-color: transparent; color: rgb(22, 14, 61);">Future Task SDK changes may enable language-agnostic DAG authoring beyond Python.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/xbis/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Christos Bisias</a></p><p>https://www.linkedin.com/in/xbis/</p><p><br></p><p><a href="https://www.linkedin.com/company/g-research/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">G-Research</a> </p><p>https://www.linkedin.com/company/g-research/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://opentelemetry.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">OpenTelemetry</a></p><p>https://opentelemetry.io/</p><p><br></p><p><a href="https://prometheus.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Prometheus</a></p><p>https://prometheus.io/</p><p><br></p><p><a href="https://grafana.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Grafana</a></p><p>https://grafana.com/</p><p><br></p><p><a href="https://www.jaegertracing.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jaeger</a></p><p>https://www.jaegertracing.io/</p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Modern Airflow isn’t just orchestration. It's a contribution.﻿﻿In this episode, we explore how open source investment drives real performance gains and deeper observability.We’re joined by Christos Bisias, Open Source Software Engineer, Apache Airf...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>73</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[6f656ed3-a808-4318-aa10-2e7c0b4e15f5]]></guid>
  <title><![CDATA[Automating Threat Intelligence Using Airflow with Karan Alang]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/karan-alang-4173437" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Karan Alang</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Principal Software Engineer at </span><a href="https://www.linkedin.com/company/versa-networks" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Versa Networks</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins the conversation to discuss how Airflow can be used to automate threat intelligence in modern cybersecurity environments. He explains the growing scale of cloud computing, the profitability of hacking and the shortage of SOC analysts. Karan also outlines a novel architecture that combines Airflow, XDR, graph databases and LLMs to orchestrate automated threat detection and response.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">00:00 Introduction.</span></p><p><span style="background-color: transparent;">05:00 Organizations face massive log volumes and a shortage of SOC analysts.</span></p><p><span style="background-color: transparent;">07:00 The solution integrates Airflow, XDR, Neo4j graph databases and LLMs into one architecture.</span></p><p><span style="background-color: transparent;">08:00 MITRE ATT&amp;CK provides a global framework for mapping tactics and techniques.</span></p><p><span style="background-color: transparent;">11:00 Airflow acts as the orchestration backbone for ingestion graph transformation and LLM workflows.</span></p><p><span style="background-color: transparent;">13:00 Graph databases provide a full relationship view of attackers’ systems and entities.</span></p><p><span style="background-color: transparent;">14:00 LLMs automate mapping activity to MITRE ATT&amp;CK and assign explainable risk scores.</span></p><p><span style="background-color: transparent;">17:00 Traditional signature-based detection allows lateral movement and exfiltration before teams can react.</span></p><p><span style="background-color: transparent;">18:00 End-to-end automation is essential to mitigating modern cybersecurity threats.</span></p><p><span style="background-color: transparent;">20:00 Future opportunities include deeper LLM integration as first-class citizens within Airflow.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/karan-alang-4173437" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Karan Alang</a></p><p>https://www.linkedin.com/in/karan-alang-4173437</p><p><br></p><p><a href="https://www.linkedin.com/company/versa-networks" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Versa Networks</a> | LinkedIn</p><p>https://www.linkedin.com/company/versa-networks</p><p><br></p><p><a href="https://versa-networks.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Versa Networks</a> | Website</p><p>https://versa-networks.com</p><p><br></p><p><a href="https://cloud.google.com/composer" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Google Cloud Composer (Managed Airflow on GCP)</a></p><p>https://cloud.google.com/composer</p><p><br></p><p><a href="https://www.microsoft.com/es-es/security/business/siem-and-xdr/microsoft-defender-xdr" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Microsoft Defender XDR</a><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;</span></p><p>https://www.microsoft.com/es-es/security/business/siem-and-xdr/microsoft-defender-xdr</p><p><br></p><p><a href="https://neo4j.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Neo4j (Graph Database)</a></p><p>https://neo4j.com</p><p><br></p><p><a href="https://attack.mitre.org" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">MITRE ATT&amp;CK Framework</a></p><p>https://attack.mitre.org</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/acf691f3-eefe-444c-adce-ed6c3002c88f/29ed199e0b.jpg" />
  <pubDate>Thu, 12 Mar 2026 02:58:00 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="21352782" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/acf691f3-eefe-444c-adce-ed6c3002c88f/episode.mp3" />
  <itunes:title><![CDATA[Automating Threat Intelligence Using Airflow with Karan Alang]]></itunes:title>
  <itunes:duration>22:14</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/karan-alang-4173437" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Karan Alang</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Principal Software Engineer at </span><a href="https://www.linkedin.com/company/versa-networks" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Versa Networks</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins the conversation to discuss how Airflow can be used to automate threat intelligence in modern cybersecurity environments. He explains the growing scale of cloud computing, the profitability of hacking and the shortage of SOC analysts. Karan also outlines a novel architecture that combines Airflow, XDR, graph databases and LLMs to orchestrate automated threat detection and response.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">00:00 Introduction.</span></p><p><span style="background-color: transparent;">05:00 Organizations face massive log volumes and a shortage of SOC analysts.</span></p><p><span style="background-color: transparent;">07:00 The solution integrates Airflow, XDR, Neo4j graph databases and LLMs into one architecture.</span></p><p><span style="background-color: transparent;">08:00 MITRE ATT&amp;CK provides a global framework for mapping tactics and techniques.</span></p><p><span style="background-color: transparent;">11:00 Airflow acts as the orchestration backbone for ingestion graph transformation and LLM workflows.</span></p><p><span style="background-color: transparent;">13:00 Graph databases provide a full relationship view of attackers’ systems and entities.</span></p><p><span style="background-color: transparent;">14:00 LLMs automate mapping activity to MITRE ATT&amp;CK and assign explainable risk scores.</span></p><p><span style="background-color: transparent;">17:00 Traditional signature-based detection allows lateral movement and exfiltration before teams can react.</span></p><p><span style="background-color: transparent;">18:00 End-to-end automation is essential to mitigating modern cybersecurity threats.</span></p><p><span style="background-color: transparent;">20:00 Future opportunities include deeper LLM integration as first-class citizens within Airflow.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/karan-alang-4173437" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Karan Alang</a></p><p>https://www.linkedin.com/in/karan-alang-4173437</p><p><br></p><p><a href="https://www.linkedin.com/company/versa-networks" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Versa Networks</a> | LinkedIn</p><p>https://www.linkedin.com/company/versa-networks</p><p><br></p><p><a href="https://versa-networks.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Versa Networks</a> | Website</p><p>https://versa-networks.com</p><p><br></p><p><a href="https://cloud.google.com/composer" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Google Cloud Composer (Managed Airflow on GCP)</a></p><p>https://cloud.google.com/composer</p><p><br></p><p><a href="https://www.microsoft.com/es-es/security/business/siem-and-xdr/microsoft-defender-xdr" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Microsoft Defender XDR</a><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;</span></p><p>https://www.microsoft.com/es-es/security/business/siem-and-xdr/microsoft-defender-xdr</p><p><br></p><p><a href="https://neo4j.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Neo4j (Graph Database)</a></p><p>https://neo4j.com</p><p><br></p><p><a href="https://attack.mitre.org" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">MITRE ATT&amp;CK Framework</a></p><p>https://attack.mitre.org</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/karan-alang-4173437" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Karan Alang</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Principal Software Engineer at </span><a href="https://www.linkedin.com/company/versa-networks" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Versa Networks</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins the conversation to discuss how Airflow can be used to automate threat intelligence in modern cybersecurity environments. He explains the growing scale of cloud computing, the profitability of hacking and the shortage of SOC analysts. Karan also outlines a novel architecture that combines Airflow, XDR, graph databases and LLMs to orchestrate automated threat detection and response.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">00:00 Introduction.</span></p><p><span style="background-color: transparent;">05:00 Organizations face massive log volumes and a shortage of SOC analysts.</span></p><p><span style="background-color: transparent;">07:00 The solution integrates Airflow, XDR, Neo4j graph databases and LLMs into one architecture.</span></p><p><span style="background-color: transparent;">08:00 MITRE ATT&amp;CK provides a global framework for mapping tactics and techniques.</span></p><p><span style="background-color: transparent;">11:00 Airflow acts as the orchestration backbone for ingestion graph transformation and LLM workflows.</span></p><p><span style="background-color: transparent;">13:00 Graph databases provide a full relationship view of attackers’ systems and entities.</span></p><p><span style="background-color: transparent;">14:00 LLMs automate mapping activity to MITRE ATT&amp;CK and assign explainable risk scores.</span></p><p><span style="background-color: transparent;">17:00 Traditional signature-based detection allows lateral movement and exfiltration before teams can react.</span></p><p><span style="background-color: transparent;">18:00 End-to-end automation is essential to mitigating modern cybersecurity threats.</span></p><p><span style="background-color: transparent;">20:00 Future opportunities include deeper LLM integration as first-class citizens within Airflow.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/karan-alang-4173437" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Karan Alang</a></p><p>https://www.linkedin.com/in/karan-alang-4173437</p><p><br></p><p><a href="https://www.linkedin.com/company/versa-networks" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Versa Networks</a> | LinkedIn</p><p>https://www.linkedin.com/company/versa-networks</p><p><br></p><p><a href="https://versa-networks.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Versa Networks</a> | Website</p><p>https://versa-networks.com</p><p><br></p><p><a href="https://cloud.google.com/composer" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Google Cloud Composer (Managed Airflow on GCP)</a></p><p>https://cloud.google.com/composer</p><p><br></p><p><a href="https://www.microsoft.com/es-es/security/business/siem-and-xdr/microsoft-defender-xdr" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Microsoft Defender XDR</a><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;</span></p><p>https://www.microsoft.com/es-es/security/business/siem-and-xdr/microsoft-defender-xdr</p><p><br></p><p><a href="https://neo4j.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Neo4j (Graph Database)</a></p><p>https://neo4j.com</p><p><br></p><p><a href="https://attack.mitre.org" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">MITRE ATT&amp;CK Framework</a></p><p>https://attack.mitre.org</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[In this episode, Karan Alang, Principal Software Engineer at Versa Networks, joins the conversation to discuss how Airflow can be used to automate threat intelligence in modern cybersecurity environments. He explains the growing scale of cloud comp...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>72</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[3fc501fc-ae6a-4b8d-a7e0-1a9c0e6c721a]]></guid>
  <title><![CDATA[Using Plugins To Customize Airflow at Ponder Labs with Egor Tarasenko]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we explore how teams scale Apache Airflow in complex environments and what it takes to make orchestration work across many stakeholders. We look at real-world challenges around visibility, ownership and predictability as data platforms grow.</span></p><p><br></p><p><a href="https://www.linkedin.com/in/egorseno/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Egor Tarasenko</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Data and AI Engineer at </span><a href="https://www.linkedin.com/company/ponder-labs/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ponder Labs</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins us to share how Ponder Labs customizes Airflow for education organizations using plugins, event-driven architectures and AI-powered tooling. He explains how his team supports large charter school networks and why structure, consistency and extensibility become critical at scale.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">01:21 Ponder Labs helps education organizations bring data from many systems together so it becomes useful for teachers, school leaders and administrators.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">03:10 Airflow serves as the backbone for orchestrating ingestion, transformation and reverse ETL across client data platforms.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">05:43 Everything is triggered from Airflow to maintain dependency, visibility and a single operational picture.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">09:05 Managing hundreds of DAGs requires a focus on structure, visibility and consistency across teams.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">09:51 Treating DAGs like APIs helps teams scale without needing deep knowledge of upstream logic.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">12:00 Custom plugins like schedule insights help predict DAG run times across layered dependencies.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">15:00 AI-powered Airflow chat enables non-technical stakeholders to understand DAG ownership dependencies and cluster activity.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">22:06 Migrating plugins to Airflow 3 improves developer experience through cleaner APIs and faster extensibility.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/egorseno/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Egor Tarasenko</a></p><p>https://www.linkedin.com/in/egorseno/</p><p><br></p><p><a href="https://airflow.apache.org" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org</p><p><br></p><p><a href="https://www.getdbt.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbt</a></p><p>https://www.getdbt.com</p><p><br></p><p><a href="https://www.astronomer.io" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer Astro Platform</a></p><p>https://www.astronomer.io</p><p><br></p><p><a href="https://egortarasenko.substack.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Egor Tarasenko on Substack</a><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;</span></p><p>https://egortarasenko.substack.com</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/3e005da3-5aa8-41a0-9452-3eb11e92d842/a9e3efa03d.jpg" />
  <pubDate>Thu, 05 Mar 2026 00:15:00 -0500</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="26647071" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/3e005da3-5aa8-41a0-9452-3eb11e92d842/episode.mp3" />
  <itunes:title><![CDATA[Using Plugins To Customize Airflow at Ponder Labs with Egor Tarasenko]]></itunes:title>
  <itunes:duration>27:45</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we explore how teams scale Apache Airflow in complex environments and what it takes to make orchestration work across many stakeholders. We look at real-world challenges around visibility, ownership and predictability as data platforms grow.</span></p><p><br></p><p><a href="https://www.linkedin.com/in/egorseno/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Egor Tarasenko</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Data and AI Engineer at </span><a href="https://www.linkedin.com/company/ponder-labs/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ponder Labs</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins us to share how Ponder Labs customizes Airflow for education organizations using plugins, event-driven architectures and AI-powered tooling. He explains how his team supports large charter school networks and why structure, consistency and extensibility become critical at scale.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">01:21 Ponder Labs helps education organizations bring data from many systems together so it becomes useful for teachers, school leaders and administrators.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">03:10 Airflow serves as the backbone for orchestrating ingestion, transformation and reverse ETL across client data platforms.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">05:43 Everything is triggered from Airflow to maintain dependency, visibility and a single operational picture.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">09:05 Managing hundreds of DAGs requires a focus on structure, visibility and consistency across teams.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">09:51 Treating DAGs like APIs helps teams scale without needing deep knowledge of upstream logic.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">12:00 Custom plugins like schedule insights help predict DAG run times across layered dependencies.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">15:00 AI-powered Airflow chat enables non-technical stakeholders to understand DAG ownership dependencies and cluster activity.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">22:06 Migrating plugins to Airflow 3 improves developer experience through cleaner APIs and faster extensibility.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/egorseno/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Egor Tarasenko</a></p><p>https://www.linkedin.com/in/egorseno/</p><p><br></p><p><a href="https://airflow.apache.org" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org</p><p><br></p><p><a href="https://www.getdbt.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbt</a></p><p>https://www.getdbt.com</p><p><br></p><p><a href="https://www.astronomer.io" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer Astro Platform</a></p><p>https://www.astronomer.io</p><p><br></p><p><a href="https://egortarasenko.substack.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Egor Tarasenko on Substack</a><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;</span></p><p>https://egortarasenko.substack.com</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we explore how teams scale Apache Airflow in complex environments and what it takes to make orchestration work across many stakeholders. We look at real-world challenges around visibility, ownership and predictability as data platforms grow.</span></p><p><br></p><p><a href="https://www.linkedin.com/in/egorseno/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Egor Tarasenko</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Data and AI Engineer at </span><a href="https://www.linkedin.com/company/ponder-labs/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ponder Labs</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins us to share how Ponder Labs customizes Airflow for education organizations using plugins, event-driven architectures and AI-powered tooling. He explains how his team supports large charter school networks and why structure, consistency and extensibility become critical at scale.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">01:21 Ponder Labs helps education organizations bring data from many systems together so it becomes useful for teachers, school leaders and administrators.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">03:10 Airflow serves as the backbone for orchestrating ingestion, transformation and reverse ETL across client data platforms.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">05:43 Everything is triggered from Airflow to maintain dependency, visibility and a single operational picture.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">09:05 Managing hundreds of DAGs requires a focus on structure, visibility and consistency across teams.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">09:51 Treating DAGs like APIs helps teams scale without needing deep knowledge of upstream logic.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">12:00 Custom plugins like schedule insights help predict DAG run times across layered dependencies.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">15:00 AI-powered Airflow chat enables non-technical stakeholders to understand DAG ownership dependencies and cluster activity.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">22:06 Migrating plugins to Airflow 3 improves developer experience through cleaner APIs and faster extensibility.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/egorseno/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Egor Tarasenko</a></p><p>https://www.linkedin.com/in/egorseno/</p><p><br></p><p><a href="https://airflow.apache.org" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org</p><p><br></p><p><a href="https://www.getdbt.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbt</a></p><p>https://www.getdbt.com</p><p><br></p><p><a href="https://www.astronomer.io" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer Astro Platform</a></p><p>https://www.astronomer.io</p><p><br></p><p><a href="https://egortarasenko.substack.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Egor Tarasenko on Substack</a><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;</span></p><p>https://egortarasenko.substack.com</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[In this episode, we explore how teams scale Apache Airflow in complex environments and what it takes to make orchestration work across many stakeholders. We look at real-world challenges around visibility, ownership and predictability as data platf...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>71</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[67f5eff3-53f4-4032-8b6a-14a68d1befca]]></guid>
  <title><![CDATA[Scaling Airflow at Wix for Analytics and AI with Ethan Shalev]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Modern data orchestration at scale demands reliability, speed and thoughtful adoption of new tooling. As organizations grow, keeping pipelines efficient while supporting more teams becomes a critical challenge.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we’re joined by </span><a href="https://www.linkedin.com/in/eshalev/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ethan Shalev</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Data Engineer at </span><a href="https://www.linkedin.com/company/wix-com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Wix</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, to discuss how Wix operates Airflow at massive scale, migrates to Airflow 3 and uses AI to accelerate development.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">02:13 Wix structures data engineering across multiple product-focused organizations.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">03:40 Migrating nearly 8,000 DAGs to Airflow 3 requires careful planning.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">04:31 Migration creates an opportunity to remove long-standing legacy Airflow code.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">05:32 Internal playbooks and Cursor rules standardize and speed up DAG migrations.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">07:39 Airflow 3 introduces backfills, DAG versioning and asset-aware scheduling.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">09:16 Deferrable operators reduce scheduler congestion in large Airflow environments.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">12:54 AI-generated code still requires review and strong testing practices.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:52 Moving to managed Airflow reduces operational burden on internal platform teams.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">15:57 Improving multi-tenancy and UI personalization remains a key Airflow need.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/eshalev/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ethan Shalev</a></p><p>https://www.linkedin.com/in/eshalev/</p><p><br></p><p><a href="https://www.linkedin.com/company/wix-com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Wix</a> | LinkedIn</p><p>https://www.linkedin.com/company/wix-com/</p><p><br></p><p><a href="https://www.wix.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Wix</a> | Website</p><p>https://www.wix.com/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a></p><p>https://www.astronomer.io/</p><p><br></p><p><a href="https://trino.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Trino</a></p><p>https://trino.io/</p><p><br></p><p><a href="https://iceberg.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Iceberg</a></p><p>https://iceberg.apache.org/</p><p><br></p><p><a href="https://cursor.sh/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Cursor</a></p><p>https://cursor.sh/</p><p><br></p><p><a href="https://airflowsummit.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Summit</a></p><p>https://airflowsummit.org/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/9eb4945e-1d4d-4edc-82a9-fc4617dfe88b/413d98f7e5.jpg" />
  <pubDate>Thu, 26 Feb 2026 00:00:00 -0500</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="17288126" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/9eb4945e-1d4d-4edc-82a9-fc4617dfe88b/episode.mp3" />
  <itunes:title><![CDATA[Scaling Airflow at Wix for Analytics and AI with Ethan Shalev]]></itunes:title>
  <itunes:duration>18:00</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Modern data orchestration at scale demands reliability, speed and thoughtful adoption of new tooling. As organizations grow, keeping pipelines efficient while supporting more teams becomes a critical challenge.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we’re joined by </span><a href="https://www.linkedin.com/in/eshalev/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ethan Shalev</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Data Engineer at </span><a href="https://www.linkedin.com/company/wix-com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Wix</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, to discuss how Wix operates Airflow at massive scale, migrates to Airflow 3 and uses AI to accelerate development.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">02:13 Wix structures data engineering across multiple product-focused organizations.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">03:40 Migrating nearly 8,000 DAGs to Airflow 3 requires careful planning.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">04:31 Migration creates an opportunity to remove long-standing legacy Airflow code.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">05:32 Internal playbooks and Cursor rules standardize and speed up DAG migrations.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">07:39 Airflow 3 introduces backfills, DAG versioning and asset-aware scheduling.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">09:16 Deferrable operators reduce scheduler congestion in large Airflow environments.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">12:54 AI-generated code still requires review and strong testing practices.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:52 Moving to managed Airflow reduces operational burden on internal platform teams.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">15:57 Improving multi-tenancy and UI personalization remains a key Airflow need.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/eshalev/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ethan Shalev</a></p><p>https://www.linkedin.com/in/eshalev/</p><p><br></p><p><a href="https://www.linkedin.com/company/wix-com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Wix</a> | LinkedIn</p><p>https://www.linkedin.com/company/wix-com/</p><p><br></p><p><a href="https://www.wix.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Wix</a> | Website</p><p>https://www.wix.com/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a></p><p>https://www.astronomer.io/</p><p><br></p><p><a href="https://trino.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Trino</a></p><p>https://trino.io/</p><p><br></p><p><a href="https://iceberg.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Iceberg</a></p><p>https://iceberg.apache.org/</p><p><br></p><p><a href="https://cursor.sh/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Cursor</a></p><p>https://cursor.sh/</p><p><br></p><p><a href="https://airflowsummit.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Summit</a></p><p>https://airflowsummit.org/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Modern data orchestration at scale demands reliability, speed and thoughtful adoption of new tooling. As organizations grow, keeping pipelines efficient while supporting more teams becomes a critical challenge.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we’re joined by </span><a href="https://www.linkedin.com/in/eshalev/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ethan Shalev</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Data Engineer at </span><a href="https://www.linkedin.com/company/wix-com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Wix</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, to discuss how Wix operates Airflow at massive scale, migrates to Airflow 3 and uses AI to accelerate development.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">02:13 Wix structures data engineering across multiple product-focused organizations.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">03:40 Migrating nearly 8,000 DAGs to Airflow 3 requires careful planning.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">04:31 Migration creates an opportunity to remove long-standing legacy Airflow code.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">05:32 Internal playbooks and Cursor rules standardize and speed up DAG migrations.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">07:39 Airflow 3 introduces backfills, DAG versioning and asset-aware scheduling.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">09:16 Deferrable operators reduce scheduler congestion in large Airflow environments.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">12:54 AI-generated code still requires review and strong testing practices.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:52 Moving to managed Airflow reduces operational burden on internal platform teams.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">15:57 Improving multi-tenancy and UI personalization remains a key Airflow need.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/eshalev/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ethan Shalev</a></p><p>https://www.linkedin.com/in/eshalev/</p><p><br></p><p><a href="https://www.linkedin.com/company/wix-com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Wix</a> | LinkedIn</p><p>https://www.linkedin.com/company/wix-com/</p><p><br></p><p><a href="https://www.wix.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Wix</a> | Website</p><p>https://www.wix.com/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a></p><p>https://www.astronomer.io/</p><p><br></p><p><a href="https://trino.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Trino</a></p><p>https://trino.io/</p><p><br></p><p><a href="https://iceberg.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Iceberg</a></p><p>https://iceberg.apache.org/</p><p><br></p><p><a href="https://cursor.sh/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Cursor</a></p><p>https://cursor.sh/</p><p><br></p><p><a href="https://airflowsummit.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Summit</a></p><p>https://airflowsummit.org/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Modern data orchestration at scale demands reliability, speed and thoughtful adoption of new tooling. As organizations grow, keeping pipelines efficient while supporting more teams becomes a critical challenge.In this episode, we’re joined by Ethan...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>70</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[b2966f43-e21c-49b8-8f26-48adaeea7c33]]></guid>
  <title><![CDATA[Using Airflow To Orchestrate Billions of Events at Addi with Carlos Daniel Puerto Niño]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Strong data orchestration is as much about culture and visibility as it is about technology. As data platforms scale, teams need systems that reduce cognitive load while increasing reliability and observability.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/carlospuertoni%C3%B1o/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Carlos Daniel Puerto Niño</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Analytics Engineer and Data Analyst at </span><a href="https://www.linkedin.com/company/addicol/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Addi</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins us to share how Addi uses Airflow to support batch orchestration, manage organizational complexity and improve monitoring across its data platform.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">01:25 Changes in company strategy increase data platform complexity over time.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">04:00 Centralized data teams help manage organizational and technical change.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">06:08 Scalable architectures support growing data volumes and use cases.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">09:10 Adopting orchestration tools introduces operational and maintenance challenges.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:43 Abstraction layers lower technical barriers for onboarding new team members.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">15:36 Modularity and visibility improve the reliability of data pipelines.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">18:14 Integrated monitoring supports faster incident response and resolution.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">22:19 Limited access to orchestration metadata constrains proactive analysis.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/carlospuertoni%C3%B1o/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Carlos Daniel Puerto Niño</a></p><p>https://www.linkedin.com/in/carlospuertoni%C3%B1o/</p><p><br></p><p><a href="https://www.linkedin.com/company/addicol/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Addi</a> | LinkedIn</p><p>https://www.linkedin.com/company/addicol/</p><p><br></p><p><a href="https://www.addi.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Addi</a> | Website</p><p>https://www.addi.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a></p><p>https://www.astronomer.io/</p><p><br></p><p><a href="https://www.databricks.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Databricks</a></p><p>https://www.databricks.com/</p><p><br></p><p><a href="https://www.getdbt.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbt</a></p><p>https://www.getdbt.com/</p><p><br></p><p><a href="https://grafana.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Grafana</a></p><p>https://grafana.com/</p><p><br></p><p><a href="https://slack.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Slack</a></p><p>https://slack.com/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/b86e6e0e-c640-48ef-9e09-f9f341d65c59/b3f38c5559.jpg" />
  <pubDate>Thu, 19 Feb 2026 00:10:00 -0500</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="23825844" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/b86e6e0e-c640-48ef-9e09-f9f341d65c59/episode.mp3" />
  <itunes:title><![CDATA[Using Airflow To Orchestrate Billions of Events at Addi with Carlos Daniel Puerto Niño]]></itunes:title>
  <itunes:duration>24:49</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Strong data orchestration is as much about culture and visibility as it is about technology. As data platforms scale, teams need systems that reduce cognitive load while increasing reliability and observability.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/carlospuertoni%C3%B1o/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Carlos Daniel Puerto Niño</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Analytics Engineer and Data Analyst at </span><a href="https://www.linkedin.com/company/addicol/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Addi</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins us to share how Addi uses Airflow to support batch orchestration, manage organizational complexity and improve monitoring across its data platform.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">01:25 Changes in company strategy increase data platform complexity over time.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">04:00 Centralized data teams help manage organizational and technical change.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">06:08 Scalable architectures support growing data volumes and use cases.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">09:10 Adopting orchestration tools introduces operational and maintenance challenges.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:43 Abstraction layers lower technical barriers for onboarding new team members.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">15:36 Modularity and visibility improve the reliability of data pipelines.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">18:14 Integrated monitoring supports faster incident response and resolution.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">22:19 Limited access to orchestration metadata constrains proactive analysis.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/carlospuertoni%C3%B1o/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Carlos Daniel Puerto Niño</a></p><p>https://www.linkedin.com/in/carlospuertoni%C3%B1o/</p><p><br></p><p><a href="https://www.linkedin.com/company/addicol/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Addi</a> | LinkedIn</p><p>https://www.linkedin.com/company/addicol/</p><p><br></p><p><a href="https://www.addi.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Addi</a> | Website</p><p>https://www.addi.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a></p><p>https://www.astronomer.io/</p><p><br></p><p><a href="https://www.databricks.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Databricks</a></p><p>https://www.databricks.com/</p><p><br></p><p><a href="https://www.getdbt.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbt</a></p><p>https://www.getdbt.com/</p><p><br></p><p><a href="https://grafana.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Grafana</a></p><p>https://grafana.com/</p><p><br></p><p><a href="https://slack.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Slack</a></p><p>https://slack.com/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Strong data orchestration is as much about culture and visibility as it is about technology. As data platforms scale, teams need systems that reduce cognitive load while increasing reliability and observability.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/carlospuertoni%C3%B1o/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Carlos Daniel Puerto Niño</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Analytics Engineer and Data Analyst at </span><a href="https://www.linkedin.com/company/addicol/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Addi</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins us to share how Addi uses Airflow to support batch orchestration, manage organizational complexity and improve monitoring across its data platform.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">01:25 Changes in company strategy increase data platform complexity over time.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">04:00 Centralized data teams help manage organizational and technical change.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">06:08 Scalable architectures support growing data volumes and use cases.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">09:10 Adopting orchestration tools introduces operational and maintenance challenges.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:43 Abstraction layers lower technical barriers for onboarding new team members.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">15:36 Modularity and visibility improve the reliability of data pipelines.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">18:14 Integrated monitoring supports faster incident response and resolution.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">22:19 Limited access to orchestration metadata constrains proactive analysis.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/carlospuertoni%C3%B1o/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Carlos Daniel Puerto Niño</a></p><p>https://www.linkedin.com/in/carlospuertoni%C3%B1o/</p><p><br></p><p><a href="https://www.linkedin.com/company/addicol/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Addi</a> | LinkedIn</p><p>https://www.linkedin.com/company/addicol/</p><p><br></p><p><a href="https://www.addi.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Addi</a> | Website</p><p>https://www.addi.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a></p><p>https://www.astronomer.io/</p><p><br></p><p><a href="https://www.databricks.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Databricks</a></p><p>https://www.databricks.com/</p><p><br></p><p><a href="https://www.getdbt.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbt</a></p><p>https://www.getdbt.com/</p><p><br></p><p><a href="https://grafana.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Grafana</a></p><p>https://grafana.com/</p><p><br></p><p><a href="https://slack.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Slack</a></p><p>https://slack.com/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Strong data orchestration is as much about culture and visibility as it is about technology. As data platforms scale, teams need systems that reduce cognitive load while increasing reliability and observability.In this episode, Carlos Daniel Puerto...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>69</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[8d359b8f-dc6c-4850-9c81-05546a615c68]]></guid>
  <title><![CDATA[Building Event-Driven Data Pipelines With Airflow 3 at Astrafy with Andrea Bombino]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Real-time data expectations are reshaping how modern data teams think about orchestration and dependencies. As event-driven architectures become more common, teams need to rethink how pipelines react to data changes, rather than schedules.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/andrea-bombino/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Andrea Bombino</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Co-Founder and Head of Analytics Engineering at </span><a href="https://www.linkedin.com/company/astrafy/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astrafy</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins us to discuss how event-driven scheduling in Airflow is evolving and how </span><span style="background-color: transparent;">Astrafy</span><span style="background-color: transparent; color: rgb(22, 14, 61);"> applies it to deliver faster, more responsive data pipelines.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">02:02 </span><span style="background-color: transparent;">Astrafy</span><span style="background-color: transparent; color: rgb(22, 14, 61);">’s role in guiding clients across the modern data stack.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">03:15 Strong DAG dependencies create challenges for time-based scheduling.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">04:48 Event-driven pipelines respond to increasing real-time data demands.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">05:30 Airflow 3 introduces native support for event-driven orchestration.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">06:27 Sensor-based workflows reveal scalability and efficiency limitations.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">11:32 Event-driven assets improve efficiency and pipeline elegance.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:45 Governance and cross-instance coordination emerge as ongoing challenges.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/andrea-bombino/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Andrea Bombino</a></p><p>https://www.linkedin.com/in/andrea-bombino/</p><p><br></p><p><a href="https://www.linkedin.com/company/astrafy/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astrafy</a> | LinkedIn</p><p>https://www.linkedin.com/company/astrafy/</p><p><br></p><p><a href="https://www.astrafy.io" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astrafy</a> | Website</p><p>https://www.astrafy.io</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://cloud.google.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Google Cloud</a></p><p>https://cloud.google.com/</p><p><br></p><p><a href="https://cloud.google.com/pubsub" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Google Pub/Sub</a></p><p>https://cloud.google.com/pubsub</p><p><br></p><p><a href="https://cloud.google.com/bigquery" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Google BigQuery</a></p><p>https://cloud.google.com/bigquery</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/45bf9a27-8730-4a1d-92c8-c111823cdbb8/5e35581c2a.jpg" />
  <pubDate>Thu, 12 Feb 2026 08:34:03 -0500</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="17968982" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/45bf9a27-8730-4a1d-92c8-c111823cdbb8/episode.mp3" />
  <itunes:title><![CDATA[Building Event-Driven Data Pipelines With Airflow 3 at Astrafy with Andrea Bombino]]></itunes:title>
  <itunes:duration>18:43</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Real-time data expectations are reshaping how modern data teams think about orchestration and dependencies. As event-driven architectures become more common, teams need to rethink how pipelines react to data changes, rather than schedules.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/andrea-bombino/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Andrea Bombino</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Co-Founder and Head of Analytics Engineering at </span><a href="https://www.linkedin.com/company/astrafy/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astrafy</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins us to discuss how event-driven scheduling in Airflow is evolving and how </span><span style="background-color: transparent;">Astrafy</span><span style="background-color: transparent; color: rgb(22, 14, 61);"> applies it to deliver faster, more responsive data pipelines.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">02:02 </span><span style="background-color: transparent;">Astrafy</span><span style="background-color: transparent; color: rgb(22, 14, 61);">’s role in guiding clients across the modern data stack.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">03:15 Strong DAG dependencies create challenges for time-based scheduling.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">04:48 Event-driven pipelines respond to increasing real-time data demands.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">05:30 Airflow 3 introduces native support for event-driven orchestration.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">06:27 Sensor-based workflows reveal scalability and efficiency limitations.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">11:32 Event-driven assets improve efficiency and pipeline elegance.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:45 Governance and cross-instance coordination emerge as ongoing challenges.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/andrea-bombino/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Andrea Bombino</a></p><p>https://www.linkedin.com/in/andrea-bombino/</p><p><br></p><p><a href="https://www.linkedin.com/company/astrafy/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astrafy</a> | LinkedIn</p><p>https://www.linkedin.com/company/astrafy/</p><p><br></p><p><a href="https://www.astrafy.io" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astrafy</a> | Website</p><p>https://www.astrafy.io</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://cloud.google.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Google Cloud</a></p><p>https://cloud.google.com/</p><p><br></p><p><a href="https://cloud.google.com/pubsub" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Google Pub/Sub</a></p><p>https://cloud.google.com/pubsub</p><p><br></p><p><a href="https://cloud.google.com/bigquery" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Google BigQuery</a></p><p>https://cloud.google.com/bigquery</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Real-time data expectations are reshaping how modern data teams think about orchestration and dependencies. As event-driven architectures become more common, teams need to rethink how pipelines react to data changes, rather than schedules.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/andrea-bombino/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Andrea Bombino</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Co-Founder and Head of Analytics Engineering at </span><a href="https://www.linkedin.com/company/astrafy/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astrafy</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins us to discuss how event-driven scheduling in Airflow is evolving and how </span><span style="background-color: transparent;">Astrafy</span><span style="background-color: transparent; color: rgb(22, 14, 61);"> applies it to deliver faster, more responsive data pipelines.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">02:02 </span><span style="background-color: transparent;">Astrafy</span><span style="background-color: transparent; color: rgb(22, 14, 61);">’s role in guiding clients across the modern data stack.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">03:15 Strong DAG dependencies create challenges for time-based scheduling.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">04:48 Event-driven pipelines respond to increasing real-time data demands.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">05:30 Airflow 3 introduces native support for event-driven orchestration.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">06:27 Sensor-based workflows reveal scalability and efficiency limitations.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">11:32 Event-driven assets improve efficiency and pipeline elegance.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:45 Governance and cross-instance coordination emerge as ongoing challenges.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/andrea-bombino/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Andrea Bombino</a></p><p>https://www.linkedin.com/in/andrea-bombino/</p><p><br></p><p><a href="https://www.linkedin.com/company/astrafy/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astrafy</a> | LinkedIn</p><p>https://www.linkedin.com/company/astrafy/</p><p><br></p><p><a href="https://www.astrafy.io" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astrafy</a> | Website</p><p>https://www.astrafy.io</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://cloud.google.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Google Cloud</a></p><p>https://cloud.google.com/</p><p><br></p><p><a href="https://cloud.google.com/pubsub" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Google Pub/Sub</a></p><p>https://cloud.google.com/pubsub</p><p><br></p><p><a href="https://cloud.google.com/bigquery" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Google BigQuery</a></p><p>https://cloud.google.com/bigquery</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Real-time data expectations are reshaping how modern data teams think about orchestration and dependencies. As event-driven architectures become more common, teams need to rethink how pipelines react to data changes, rather than schedules.In this e...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>70</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[61fa3c01-76b7-4f1a-b598-d81ea9270386]]></guid>
  <title><![CDATA[Uphold’s Approach to Orchestrating Modern Data Workflows with Jaime Oliveira]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">A strong data-driven mindset underpins how fintech teams scale analytics, infrastructure and decision-making across the business.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/jaime-oliveira-b075855a/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jaime Oliveira</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Lead Data Engineer at </span><a href="https://www.linkedin.com/company/upholdinc/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Uphold</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins us to discuss how Uphold structures its data organization and orchestration strategy. Jaime shares how the team uses Airflow and dbt to support analytics, reporting and data activation while evolving their approach as the stack grows.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">01:23 A data-driven mindset supports product development and business decisions.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">02:55 Diverse ingestion pipelines enable scalable analytics.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">04:18 A single orchestration platform simplifies analytics workflows.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">05:17 Early experience with orchestration tools shapes engineering practices.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">08:16 Analytics orchestration works best when aligned with transformation workflows.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">09:25 Infrastructure choices involve tradeoffs in testing, visibility and overhead.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">16:39 More collaborative workflow tools could improve accessibility and autonomy.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/jaime-oliveira-b075855a/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jaime Oliveira</a></p><p>https://www.linkedin.com/in/jaime-oliveira-b075855a/</p><p><br></p><p><a href="https://www.linkedin.com/company/upholdinc/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Uphold</a> | LinkedIn</p><p>https://www.linkedin.com/company/upholdinc/</p><p><br></p><p><a href="https://uphold.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Uphold</a> | Website</p><p>https://uphold.com</p><p><br></p><p><a href="https://airflow.apache.org" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org</p><p><br></p><p><a href="https://www.getdbt.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbt</a></p><p>https://www.getdbt.com</p><p><br></p><p><a href="https://www.snowflake.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Snowflake</a></p><p>https://www.snowflake.com</p><p><br></p><p><a href="https://kubernetes.io" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a></p><p>https://kubernetes.io</p><p><br></p><p><a href="https://astronomer.github.io/astronomer-cosmos" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer Cosmos</a></p><p>https://astronomer.github.io/astronomer-cosmos</p><p><br></p><p><a href="https://www.astronomer.io/ebooks/orchestrating-dbt-with-airflow-using-cosmos/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Cosmos e-book</a></p><p>https://www.astronomer.io/ebooks/orchestrating-dbt-with-airflow-using-cosmos/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow </span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/5cb03ee5-9e03-497d-b807-cf95fc629896/36037af7bd.jpg" />
  <pubDate>Thu, 05 Feb 2026 00:10:00 -0500</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="18166676" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/5cb03ee5-9e03-497d-b807-cf95fc629896/episode.mp3" />
  <itunes:title><![CDATA[Uphold’s Approach to Orchestrating Modern Data Workflows with Jaime Oliveira]]></itunes:title>
  <itunes:duration>18:55</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">A strong data-driven mindset underpins how fintech teams scale analytics, infrastructure and decision-making across the business.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/jaime-oliveira-b075855a/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jaime Oliveira</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Lead Data Engineer at </span><a href="https://www.linkedin.com/company/upholdinc/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Uphold</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins us to discuss how Uphold structures its data organization and orchestration strategy. Jaime shares how the team uses Airflow and dbt to support analytics, reporting and data activation while evolving their approach as the stack grows.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">01:23 A data-driven mindset supports product development and business decisions.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">02:55 Diverse ingestion pipelines enable scalable analytics.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">04:18 A single orchestration platform simplifies analytics workflows.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">05:17 Early experience with orchestration tools shapes engineering practices.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">08:16 Analytics orchestration works best when aligned with transformation workflows.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">09:25 Infrastructure choices involve tradeoffs in testing, visibility and overhead.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">16:39 More collaborative workflow tools could improve accessibility and autonomy.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/jaime-oliveira-b075855a/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jaime Oliveira</a></p><p>https://www.linkedin.com/in/jaime-oliveira-b075855a/</p><p><br></p><p><a href="https://www.linkedin.com/company/upholdinc/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Uphold</a> | LinkedIn</p><p>https://www.linkedin.com/company/upholdinc/</p><p><br></p><p><a href="https://uphold.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Uphold</a> | Website</p><p>https://uphold.com</p><p><br></p><p><a href="https://airflow.apache.org" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org</p><p><br></p><p><a href="https://www.getdbt.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbt</a></p><p>https://www.getdbt.com</p><p><br></p><p><a href="https://www.snowflake.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Snowflake</a></p><p>https://www.snowflake.com</p><p><br></p><p><a href="https://kubernetes.io" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a></p><p>https://kubernetes.io</p><p><br></p><p><a href="https://astronomer.github.io/astronomer-cosmos" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer Cosmos</a></p><p>https://astronomer.github.io/astronomer-cosmos</p><p><br></p><p><a href="https://www.astronomer.io/ebooks/orchestrating-dbt-with-airflow-using-cosmos/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Cosmos e-book</a></p><p>https://www.astronomer.io/ebooks/orchestrating-dbt-with-airflow-using-cosmos/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow </span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">A strong data-driven mindset underpins how fintech teams scale analytics, infrastructure and decision-making across the business.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/jaime-oliveira-b075855a/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jaime Oliveira</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Lead Data Engineer at </span><a href="https://www.linkedin.com/company/upholdinc/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Uphold</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins us to discuss how Uphold structures its data organization and orchestration strategy. Jaime shares how the team uses Airflow and dbt to support analytics, reporting and data activation while evolving their approach as the stack grows.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">01:23 A data-driven mindset supports product development and business decisions.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">02:55 Diverse ingestion pipelines enable scalable analytics.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">04:18 A single orchestration platform simplifies analytics workflows.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">05:17 Early experience with orchestration tools shapes engineering practices.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">08:16 Analytics orchestration works best when aligned with transformation workflows.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">09:25 Infrastructure choices involve tradeoffs in testing, visibility and overhead.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">16:39 More collaborative workflow tools could improve accessibility and autonomy.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/jaime-oliveira-b075855a/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jaime Oliveira</a></p><p>https://www.linkedin.com/in/jaime-oliveira-b075855a/</p><p><br></p><p><a href="https://www.linkedin.com/company/upholdinc/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Uphold</a> | LinkedIn</p><p>https://www.linkedin.com/company/upholdinc/</p><p><br></p><p><a href="https://uphold.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Uphold</a> | Website</p><p>https://uphold.com</p><p><br></p><p><a href="https://airflow.apache.org" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org</p><p><br></p><p><a href="https://www.getdbt.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbt</a></p><p>https://www.getdbt.com</p><p><br></p><p><a href="https://www.snowflake.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Snowflake</a></p><p>https://www.snowflake.com</p><p><br></p><p><a href="https://kubernetes.io" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a></p><p>https://kubernetes.io</p><p><br></p><p><a href="https://astronomer.github.io/astronomer-cosmos" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer Cosmos</a></p><p>https://astronomer.github.io/astronomer-cosmos</p><p><br></p><p><a href="https://www.astronomer.io/ebooks/orchestrating-dbt-with-airflow-using-cosmos/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Cosmos e-book</a></p><p>https://www.astronomer.io/ebooks/orchestrating-dbt-with-airflow-using-cosmos/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow </span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[A strong data-driven mindset underpins how fintech teams scale analytics, infrastructure and decision-making across the business.In this episode, Jaime Oliveira, Lead Data Engineer at Uphold, joins us to discuss how Uphold structures its data organ...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>69</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[98be3cf3-6873-4417-97da-8e4271032592]]></guid>
  <title><![CDATA[Modern Airflow Best Practices for Scalable Data Pipelines with Bhavani Ravi]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Building reliable data pipelines at scale requires more than writing code. It depends on thoughtful design, infrastructure trade-offs and an understanding of how orchestration platforms evolve over time.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, Airflow best practices shaped by real-world implementation are examined. </span><a href="https://www.linkedin.com/in/bhavanicodes/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Bhavani Ravi</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Independent Software Consultant and Apache Airflow Champion, shares lessons on pipeline design, architectural decisions and the evolution of the Airflow ecosystem in modern data environments.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">01:30 Independent consulting supports effective Airflow adoption.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">02:38 Early challenges shaped modern Airflow practices.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">03:21 Airflow setup has become significantly simpler.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">04:30 New features expanded workflow capabilities.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">06:03 Frequent releases support long-term sustainability.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">07:34 Community and providers strengthen the ecosystem.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">10:03 Pipeline design should come before coding.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">10:55 Decoupling logic requires careful trade-offs.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">13:30 Plugins extend Airflow into new use cases.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/bhavanicodes/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Bhavani Ravi</a></p><p>https://www.linkedin.com/in/bhavanicodes/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://kubernetes.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a></p><p>https://kubernetes.io/</p><p><br></p><p><a href="https://learn.microsoft.com/en-us/fabric/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Azure Fabric</a></p><p>https://learn.microsoft.com/en-us/fabric/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/daf182b9-3d10-4963-b972-ffbb9bba2a02/283c1249e2.jpg" />
  <pubDate>Thu, 29 Jan 2026 00:05:00 -0500</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="16486481" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/daf182b9-3d10-4963-b972-ffbb9bba2a02/episode.mp3" />
  <itunes:title><![CDATA[Modern Airflow Best Practices for Scalable Data Pipelines with Bhavani Ravi]]></itunes:title>
  <itunes:duration>17:10</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Building reliable data pipelines at scale requires more than writing code. It depends on thoughtful design, infrastructure trade-offs and an understanding of how orchestration platforms evolve over time.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, Airflow best practices shaped by real-world implementation are examined. </span><a href="https://www.linkedin.com/in/bhavanicodes/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Bhavani Ravi</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Independent Software Consultant and Apache Airflow Champion, shares lessons on pipeline design, architectural decisions and the evolution of the Airflow ecosystem in modern data environments.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">01:30 Independent consulting supports effective Airflow adoption.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">02:38 Early challenges shaped modern Airflow practices.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">03:21 Airflow setup has become significantly simpler.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">04:30 New features expanded workflow capabilities.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">06:03 Frequent releases support long-term sustainability.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">07:34 Community and providers strengthen the ecosystem.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">10:03 Pipeline design should come before coding.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">10:55 Decoupling logic requires careful trade-offs.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">13:30 Plugins extend Airflow into new use cases.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/bhavanicodes/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Bhavani Ravi</a></p><p>https://www.linkedin.com/in/bhavanicodes/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://kubernetes.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a></p><p>https://kubernetes.io/</p><p><br></p><p><a href="https://learn.microsoft.com/en-us/fabric/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Azure Fabric</a></p><p>https://learn.microsoft.com/en-us/fabric/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Building reliable data pipelines at scale requires more than writing code. It depends on thoughtful design, infrastructure trade-offs and an understanding of how orchestration platforms evolve over time.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, Airflow best practices shaped by real-world implementation are examined. </span><a href="https://www.linkedin.com/in/bhavanicodes/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Bhavani Ravi</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Independent Software Consultant and Apache Airflow Champion, shares lessons on pipeline design, architectural decisions and the evolution of the Airflow ecosystem in modern data environments.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">01:30 Independent consulting supports effective Airflow adoption.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">02:38 Early challenges shaped modern Airflow practices.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">03:21 Airflow setup has become significantly simpler.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">04:30 New features expanded workflow capabilities.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">06:03 Frequent releases support long-term sustainability.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">07:34 Community and providers strengthen the ecosystem.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">10:03 Pipeline design should come before coding.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">10:55 Decoupling logic requires careful trade-offs.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">13:30 Plugins extend Airflow into new use cases.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/bhavanicodes/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Bhavani Ravi</a></p><p>https://www.linkedin.com/in/bhavanicodes/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://kubernetes.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a></p><p>https://kubernetes.io/</p><p><br></p><p><a href="https://learn.microsoft.com/en-us/fabric/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Azure Fabric</a></p><p>https://learn.microsoft.com/en-us/fabric/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Building reliable data pipelines at scale requires more than writing code. It depends on thoughtful design, infrastructure trade-offs and an understanding of how orchestration platforms evolve over time.In this episode, Airflow best practices shape...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>68</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[57af11f5-cab8-4bf5-8d25-bbe9ddd8d3b5]]></guid>
  <title><![CDATA[Inside Conviva’s Decision To Power Its Data Platform With Airflow with Han Zhang]]></title>
  <description><![CDATA[<p><a href="https://www.linkedin.com/company/conviva/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Conviva</a><span style="background-color: transparent; color: rgb(22, 14, 61);"> operates at a massive scale, delivering outcome-based intelligence for digital businesses through real-time and batch data processing. As new use cases emerged, the team needed a way to extend a streaming-first architecture without rebuilding core systems.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/zhanghan177" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Han Zhang</a><span style="background-color: transparent; color: rgb(22, 14, 61);"> joins us to explain how Conviva uses Apache Airflow as the orchestration backbone for its batch workloads, how the control plane is designed and what trade-offs shaped their platform decisions.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">01:17 Large-scale data platforms require low-latency processing capabilities.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">02:08 Batch workloads can complement streaming pipelines for additional use cases.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">03:45 An orchestration framework can act as the core coordination layer.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">06:12 Batch processing enables workloads that streaming alone cannot support.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">08:50 Ecosystem maturity and observability are key orchestration considerations.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">10:15 Built-in run history and logs make failures easier to diagnose.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:20 Platform users can monitor workflows without managing orchestration logic.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">17:08 Identity, secrets and scheduling present ongoing optimization challenges.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">19:59 Configuration history and change visibility improve operational reliability.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/zhanghan177" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Han Zhang</a></p><p>https://www.linkedin.com/in/zhanghan177</p><p><br></p><p><a href="http://www.conviva.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Conviva</a> | Website</p><p>http://www.conviva.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://docs.celeryq.dev/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Celery</a></p><p>https://docs.celeryq.dev/</p><p><br></p><p><a href="https://temporal.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Temporal</a></p><p>https://temporal.io/</p><p><br></p><p><a href="https://kubernetes.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a></p><p>https://kubernetes.io/</p><p><br></p><p><a href="https://ldap.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">LDAP</a></p><p>https://ldap.com/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/c5e93054-d251-43ba-8a38-fceb0e4e8a10/df64bbbe35.jpg" />
  <pubDate>Thu, 22 Jan 2026 07:05:57 -0500</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="20886337" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/c5e93054-d251-43ba-8a38-fceb0e4e8a10/episode.mp3" />
  <itunes:title><![CDATA[Inside Conviva’s Decision To Power Its Data Platform With Airflow with Han Zhang]]></itunes:title>
  <itunes:duration>21:45</itunes:duration>
  <itunes:summary><![CDATA[<p><a href="https://www.linkedin.com/company/conviva/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Conviva</a><span style="background-color: transparent; color: rgb(22, 14, 61);"> operates at a massive scale, delivering outcome-based intelligence for digital businesses through real-time and batch data processing. As new use cases emerged, the team needed a way to extend a streaming-first architecture without rebuilding core systems.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/zhanghan177" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Han Zhang</a><span style="background-color: transparent; color: rgb(22, 14, 61);"> joins us to explain how Conviva uses Apache Airflow as the orchestration backbone for its batch workloads, how the control plane is designed and what trade-offs shaped their platform decisions.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">01:17 Large-scale data platforms require low-latency processing capabilities.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">02:08 Batch workloads can complement streaming pipelines for additional use cases.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">03:45 An orchestration framework can act as the core coordination layer.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">06:12 Batch processing enables workloads that streaming alone cannot support.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">08:50 Ecosystem maturity and observability are key orchestration considerations.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">10:15 Built-in run history and logs make failures easier to diagnose.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:20 Platform users can monitor workflows without managing orchestration logic.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">17:08 Identity, secrets and scheduling present ongoing optimization challenges.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">19:59 Configuration history and change visibility improve operational reliability.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/zhanghan177" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Han Zhang</a></p><p>https://www.linkedin.com/in/zhanghan177</p><p><br></p><p><a href="http://www.conviva.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Conviva</a> | Website</p><p>http://www.conviva.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://docs.celeryq.dev/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Celery</a></p><p>https://docs.celeryq.dev/</p><p><br></p><p><a href="https://temporal.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Temporal</a></p><p>https://temporal.io/</p><p><br></p><p><a href="https://kubernetes.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a></p><p>https://kubernetes.io/</p><p><br></p><p><a href="https://ldap.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">LDAP</a></p><p>https://ldap.com/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><a href="https://www.linkedin.com/company/conviva/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Conviva</a><span style="background-color: transparent; color: rgb(22, 14, 61);"> operates at a massive scale, delivering outcome-based intelligence for digital businesses through real-time and batch data processing. As new use cases emerged, the team needed a way to extend a streaming-first architecture without rebuilding core systems.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/zhanghan177" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Han Zhang</a><span style="background-color: transparent; color: rgb(22, 14, 61);"> joins us to explain how Conviva uses Apache Airflow as the orchestration backbone for its batch workloads, how the control plane is designed and what trade-offs shaped their platform decisions.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">01:17 Large-scale data platforms require low-latency processing capabilities.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">02:08 Batch workloads can complement streaming pipelines for additional use cases.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">03:45 An orchestration framework can act as the core coordination layer.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">06:12 Batch processing enables workloads that streaming alone cannot support.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">08:50 Ecosystem maturity and observability are key orchestration considerations.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">10:15 Built-in run history and logs make failures easier to diagnose.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:20 Platform users can monitor workflows without managing orchestration logic.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">17:08 Identity, secrets and scheduling present ongoing optimization challenges.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">19:59 Configuration history and change visibility improve operational reliability.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/zhanghan177" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Han Zhang</a></p><p>https://www.linkedin.com/in/zhanghan177</p><p><br></p><p><a href="http://www.conviva.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Conviva</a> | Website</p><p>http://www.conviva.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://docs.celeryq.dev/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Celery</a></p><p>https://docs.celeryq.dev/</p><p><br></p><p><a href="https://temporal.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Temporal</a></p><p>https://temporal.io/</p><p><br></p><p><a href="https://kubernetes.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a></p><p>https://kubernetes.io/</p><p><br></p><p><a href="https://ldap.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">LDAP</a></p><p>https://ldap.com/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Conviva operates at a massive scale, delivering outcome-based intelligence for digital businesses through real-time and batch data processing. As new use cases emerged, the team needed a way to extend a streaming-first architecture without rebuildi...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>67</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[c06b5dc2-ae81-418c-af79-cebf6fa1e81a]]></guid>
  <title><![CDATA[Why Airflow Became the Scheduling Backbone at Condé Nast Technology Lab with Arun Karthik]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Data platforms are moving from batch-first pipelines to near real-time systems where orchestration, observability, scalability and governance all have to work together.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/earunkarthik/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Arun Karthik</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, </span><span style="background-color: transparent;">Director, Data Solutions Engineering</span><span style="background-color: transparent; color: rgb(22, 14, 61);"> at </span><a href="https://www.linkedin.com/company/conde-nast-technology-lab/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Condé Nast Technology Lab</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins us to share how data engineering evolves from relational databases and ETL into distributed processing, modern orchestration with Apache Airflow and managed Airflow with Astronomer.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">02:13 Early data systems rely heavily on relational databases and batch-oriented processing models.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">07:01 Scheduling requirements evolve beyond fixed time windows as dependencies increase.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">10:14 Ease of use and developer experience influence adoption of orchestration frameworks.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">13:22 Operating open source orchestration tools requires ongoing engineering effort.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:45 Managed services help teams reduce infrastructure and maintenance responsibilities.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">17:27 Observability improves confidence in pipeline execution and system health.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">19:12 Governance considerations grow in importance as data platforms mature.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">20:46 Building data systems requires balancing speed, reliability and long-term sustainability.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/earunkarthik/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Arun Karthik</a></p><p>https://www.linkedin.com/in/earunkarthik/</p><p><br></p><p><a href="https://www.linkedin.com/company/conde-nast-technology-lab/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Condé Nast Technology Lab</a> | LinkedIn</p><p>https://www.linkedin.com/company/conde-nast-technology-lab/</p><p><br></p><p><a href="https://www.condenast.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Condé Nast Technology Lab</a> | Website</p><p>https://www.condenast.com/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a></p><p>https://www.astronomer.io/</p><p><br></p><p><a href="https://spark.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Spark</a></p><p>https://spark.apache.org/</p><p><br></p><p><a href="https://hadoop.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Hadoop</a></p><p>https://hadoop.apache.org/</p><p><br></p><p><a href="https://www.jenkins.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jenkins</a></p><p>https://www.jenkins.io/</p><p><br></p><p><a href="https://www.getdbt.com/product/what-is-dbt" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbt Labs</a></p><p>https://www.getdbt.com/product/what-is-dbt</p><p><br></p><p><a href="https://www.google.com/aclk?sa=L&amp;ai=DChsSEwjb5o2u0r6RAxUe4hYFHZWfIAkYACICCAEQABoCdGw&amp;ae=2&amp;aspm=1&amp;co=1&amp;ase=2&amp;gclid=CjwKCAiA3fnJBhAgEiwAyqmY5UaBN2qkTY6SvIIu1G81mTnow5qJsHVyvXEPKxO67AEPRJiSh5RT2xoCogIQAvD_BwE&amp;cid=CAASZeRo6gPeVuY2CkZ1mkNfuAFTmkxrkfiDZByirO70Uuzz5iyatQHokcE8RcP3ZFas2Is_a6Nv22sekciKvIMg9qCIbojXjDFlaRLvkzWRVJKLTsnjrvXxXcBlhNhDiE9ei8GL1PMZ&amp;cce=2&amp;category=acrcp_v1_35&amp;sig=AOD64_1ekqox3J4psn-lr5EqcffSkWLstA&amp;q&amp;nis=4&amp;adurl&amp;ved=2ahUKEwiV9YWu0r6RAxXjh1YBHVclCO0Q0Qx6BAhZEAE" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Amazon Web Services</a></p><p>https://aws.amazon.com/free/?trk=54026797-7540-48d8-9f6b-0db2c3a0040c&amp;sc_channel=ps&amp;trk=54026797-7540-48d8-9f6b-0db2c3a0040c&amp;sc_channel=ps&amp;ef_id=CjwKCAiAmp3LBhAkEiwAJM2JUKIc3E2I-hDlF6fRWgZn5n2-RWX-kEDAVApJYd88wwlsiyosV71VixoCmRoQAvD_BwE:G:s&amp;s_kwcid=AL!4422!3!785574063524!e!!g!!amazon%20web%20services!23291338728!189486861095&amp;gad_campaignid=23291338728&amp;gbraid=0AAAAADjHtp813XNbg7azDj5QMwJPbGNqZ&amp;gclid=CjwKCAiAmp3LBhAkEiwAJM2JUKIc3E2I-hDlF6fRWgZn5n2-RWX-kEDAVApJYd88wwlsiyosV71VixoCmRoQAvD_BwE</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/9abe353f-371c-43a4-82c7-c21cb158c1b6/3bd28e60e5.jpg" />
  <pubDate>Thu, 15 Jan 2026 00:00:00 -0500</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="23298797" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/9abe353f-371c-43a4-82c7-c21cb158c1b6/episode.mp3" />
  <itunes:title><![CDATA[Why Airflow Became the Scheduling Backbone at Condé Nast Technology Lab with Arun Karthik]]></itunes:title>
  <itunes:duration>24:16</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Data platforms are moving from batch-first pipelines to near real-time systems where orchestration, observability, scalability and governance all have to work together.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/earunkarthik/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Arun Karthik</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, </span><span style="background-color: transparent;">Director, Data Solutions Engineering</span><span style="background-color: transparent; color: rgb(22, 14, 61);"> at </span><a href="https://www.linkedin.com/company/conde-nast-technology-lab/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Condé Nast Technology Lab</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins us to share how data engineering evolves from relational databases and ETL into distributed processing, modern orchestration with Apache Airflow and managed Airflow with Astronomer.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">02:13 Early data systems rely heavily on relational databases and batch-oriented processing models.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">07:01 Scheduling requirements evolve beyond fixed time windows as dependencies increase.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">10:14 Ease of use and developer experience influence adoption of orchestration frameworks.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">13:22 Operating open source orchestration tools requires ongoing engineering effort.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:45 Managed services help teams reduce infrastructure and maintenance responsibilities.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">17:27 Observability improves confidence in pipeline execution and system health.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">19:12 Governance considerations grow in importance as data platforms mature.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">20:46 Building data systems requires balancing speed, reliability and long-term sustainability.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/earunkarthik/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Arun Karthik</a></p><p>https://www.linkedin.com/in/earunkarthik/</p><p><br></p><p><a href="https://www.linkedin.com/company/conde-nast-technology-lab/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Condé Nast Technology Lab</a> | LinkedIn</p><p>https://www.linkedin.com/company/conde-nast-technology-lab/</p><p><br></p><p><a href="https://www.condenast.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Condé Nast Technology Lab</a> | Website</p><p>https://www.condenast.com/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a></p><p>https://www.astronomer.io/</p><p><br></p><p><a href="https://spark.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Spark</a></p><p>https://spark.apache.org/</p><p><br></p><p><a href="https://hadoop.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Hadoop</a></p><p>https://hadoop.apache.org/</p><p><br></p><p><a href="https://www.jenkins.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jenkins</a></p><p>https://www.jenkins.io/</p><p><br></p><p><a href="https://www.getdbt.com/product/what-is-dbt" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbt Labs</a></p><p>https://www.getdbt.com/product/what-is-dbt</p><p><br></p><p><a href="https://www.google.com/aclk?sa=L&amp;ai=DChsSEwjb5o2u0r6RAxUe4hYFHZWfIAkYACICCAEQABoCdGw&amp;ae=2&amp;aspm=1&amp;co=1&amp;ase=2&amp;gclid=CjwKCAiA3fnJBhAgEiwAyqmY5UaBN2qkTY6SvIIu1G81mTnow5qJsHVyvXEPKxO67AEPRJiSh5RT2xoCogIQAvD_BwE&amp;cid=CAASZeRo6gPeVuY2CkZ1mkNfuAFTmkxrkfiDZByirO70Uuzz5iyatQHokcE8RcP3ZFas2Is_a6Nv22sekciKvIMg9qCIbojXjDFlaRLvkzWRVJKLTsnjrvXxXcBlhNhDiE9ei8GL1PMZ&amp;cce=2&amp;category=acrcp_v1_35&amp;sig=AOD64_1ekqox3J4psn-lr5EqcffSkWLstA&amp;q&amp;nis=4&amp;adurl&amp;ved=2ahUKEwiV9YWu0r6RAxXjh1YBHVclCO0Q0Qx6BAhZEAE" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Amazon Web Services</a></p><p>https://aws.amazon.com/free/?trk=54026797-7540-48d8-9f6b-0db2c3a0040c&amp;sc_channel=ps&amp;trk=54026797-7540-48d8-9f6b-0db2c3a0040c&amp;sc_channel=ps&amp;ef_id=CjwKCAiAmp3LBhAkEiwAJM2JUKIc3E2I-hDlF6fRWgZn5n2-RWX-kEDAVApJYd88wwlsiyosV71VixoCmRoQAvD_BwE:G:s&amp;s_kwcid=AL!4422!3!785574063524!e!!g!!amazon%20web%20services!23291338728!189486861095&amp;gad_campaignid=23291338728&amp;gbraid=0AAAAADjHtp813XNbg7azDj5QMwJPbGNqZ&amp;gclid=CjwKCAiAmp3LBhAkEiwAJM2JUKIc3E2I-hDlF6fRWgZn5n2-RWX-kEDAVApJYd88wwlsiyosV71VixoCmRoQAvD_BwE</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Data platforms are moving from batch-first pipelines to near real-time systems where orchestration, observability, scalability and governance all have to work together.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/earunkarthik/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Arun Karthik</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, </span><span style="background-color: transparent;">Director, Data Solutions Engineering</span><span style="background-color: transparent; color: rgb(22, 14, 61);"> at </span><a href="https://www.linkedin.com/company/conde-nast-technology-lab/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Condé Nast Technology Lab</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins us to share how data engineering evolves from relational databases and ETL into distributed processing, modern orchestration with Apache Airflow and managed Airflow with Astronomer.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">02:13 Early data systems rely heavily on relational databases and batch-oriented processing models.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">07:01 Scheduling requirements evolve beyond fixed time windows as dependencies increase.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">10:14 Ease of use and developer experience influence adoption of orchestration frameworks.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">13:22 Operating open source orchestration tools requires ongoing engineering effort.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:45 Managed services help teams reduce infrastructure and maintenance responsibilities.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">17:27 Observability improves confidence in pipeline execution and system health.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">19:12 Governance considerations grow in importance as data platforms mature.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">20:46 Building data systems requires balancing speed, reliability and long-term sustainability.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/earunkarthik/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Arun Karthik</a></p><p>https://www.linkedin.com/in/earunkarthik/</p><p><br></p><p><a href="https://www.linkedin.com/company/conde-nast-technology-lab/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Condé Nast Technology Lab</a> | LinkedIn</p><p>https://www.linkedin.com/company/conde-nast-technology-lab/</p><p><br></p><p><a href="https://www.condenast.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Condé Nast Technology Lab</a> | Website</p><p>https://www.condenast.com/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a></p><p>https://www.astronomer.io/</p><p><br></p><p><a href="https://spark.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Spark</a></p><p>https://spark.apache.org/</p><p><br></p><p><a href="https://hadoop.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Hadoop</a></p><p>https://hadoop.apache.org/</p><p><br></p><p><a href="https://www.jenkins.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jenkins</a></p><p>https://www.jenkins.io/</p><p><br></p><p><a href="https://www.getdbt.com/product/what-is-dbt" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbt Labs</a></p><p>https://www.getdbt.com/product/what-is-dbt</p><p><br></p><p><a href="https://www.google.com/aclk?sa=L&amp;ai=DChsSEwjb5o2u0r6RAxUe4hYFHZWfIAkYACICCAEQABoCdGw&amp;ae=2&amp;aspm=1&amp;co=1&amp;ase=2&amp;gclid=CjwKCAiA3fnJBhAgEiwAyqmY5UaBN2qkTY6SvIIu1G81mTnow5qJsHVyvXEPKxO67AEPRJiSh5RT2xoCogIQAvD_BwE&amp;cid=CAASZeRo6gPeVuY2CkZ1mkNfuAFTmkxrkfiDZByirO70Uuzz5iyatQHokcE8RcP3ZFas2Is_a6Nv22sekciKvIMg9qCIbojXjDFlaRLvkzWRVJKLTsnjrvXxXcBlhNhDiE9ei8GL1PMZ&amp;cce=2&amp;category=acrcp_v1_35&amp;sig=AOD64_1ekqox3J4psn-lr5EqcffSkWLstA&amp;q&amp;nis=4&amp;adurl&amp;ved=2ahUKEwiV9YWu0r6RAxXjh1YBHVclCO0Q0Qx6BAhZEAE" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Amazon Web Services</a></p><p>https://aws.amazon.com/free/?trk=54026797-7540-48d8-9f6b-0db2c3a0040c&amp;sc_channel=ps&amp;trk=54026797-7540-48d8-9f6b-0db2c3a0040c&amp;sc_channel=ps&amp;ef_id=CjwKCAiAmp3LBhAkEiwAJM2JUKIc3E2I-hDlF6fRWgZn5n2-RWX-kEDAVApJYd88wwlsiyosV71VixoCmRoQAvD_BwE:G:s&amp;s_kwcid=AL!4422!3!785574063524!e!!g!!amazon%20web%20services!23291338728!189486861095&amp;gad_campaignid=23291338728&amp;gbraid=0AAAAADjHtp813XNbg7azDj5QMwJPbGNqZ&amp;gclid=CjwKCAiAmp3LBhAkEiwAJM2JUKIc3E2I-hDlF6fRWgZn5n2-RWX-kEDAVApJYd88wwlsiyosV71VixoCmRoQAvD_BwE</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Data platforms are moving from batch-first pipelines to near real-time systems where orchestration, observability, scalability and governance all have to work together.In this episode, Arun Karthik, Director, Data Solutions Engineering at Condé Nas...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>66</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[f813c79f-715b-4899-9132-1510e2fbee95]]></guid>
  <title><![CDATA[The Role of Airflow in Building Smarter ML Pipelines at Vivian Health with Max Calehuff]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The integration of data orchestration and machine learning is critical to operational efficiency in healthcare tech. Vivian Health leverages Airflow to power both its ETL pipelines and ML workflows while maintaining strict compliance standards.</span></p><p><br></p><p><a href="https://www.linkedin.com/in/maxwell-calehuff/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Max Calehuff</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Lead Data Engineer at </span><a href="https://www.linkedin.com/company/vivianhealth/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Vivian Health</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins us to discuss how his team uses Airflow for ML ops, regulatory compliance and large-scale data orchestration. He also shares insights into upgrading to Airflow 3 and the importance of balancing flexibility with security in a healthcare environment.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">04:21 The role of Airflow in managing ETL pipelines and ML retraining.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">06:23 Using AWS SageMaker for ML training and deployment.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">07:47 Why Airflow’s versatility makes it ideal for MLOps.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">10:50 The importance of documentation and best practices for engineering teams.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">13:44 Automating anonymization of user data for compliance.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">15:30 The benefits of remote execution in Airflow 3 for regulated industries.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">18:16 Quality-of-life improvements and desired features in future Airflow versions.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/maxwell-calehuff/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Max Calehuff</a></p><p>https://www.linkedin.com/in/maxwell-calehuff/</p><p><br></p><p><a href="https://www.linkedin.com/company/vivianhealth/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Vivian Health</a> | LinkedIn</p><p>https://www.linkedin.com/company/vivianhealth/</p><p><br></p><p><a href="https://www.vivian.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Vivian Health</a> | Website</p><p>https://www.vivian.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a></p><p>https://www.astronomer.io/</p><p><br></p><p><a href="https://www.google.com/aclk?sa=L&amp;ai=DChsSEwj3-fbz1tiQAxWXlKYDHXUBBVoYACICCAEQABoCdGI&amp;ae=2&amp;aspm=1&amp;co=1&amp;ase=2&amp;gclid=Cj0KCQiA5abIBhCaARIsAM3-zFWbfj2olUvX4dqoiYNaE3q2fMf_ZifRjmbKNQCVX7D6ZMClaUXUkFkaAuwmEALw_wcB&amp;cid=CAASQuRoMccxWhBvMq-1Uez3XOZti1ul7mTDotKvSMoDHv0q2xCsyS2FzMptO5dJf3tmfkLRu22TtD8ChTmdjvs6YetTjQ&amp;cce=2&amp;category=acrcp_v1_35&amp;sig=AOD64_2xE2xolEEVbpDb56qXQluxTzs-Aw&amp;q&amp;nis=4&amp;adurl&amp;ved=2ahUKEwj7le3z1tiQAxWXcvUHHfZePbAQ0Qx6BAgUEAE" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">AWS SageMaker</a></p><p>https://www.google.com/aclk?sa=L&amp;ai=DChsSEwj3-fbz1tiQAxWXlKYDHXUBBVoYACICCAEQABoCdGI&amp;ae=2&amp;aspm=1&amp;co=1&amp;ase=2&amp;gclid=Cj0KCQiA5abIBhCaARIsAM3-zFWbfj2olUvX4dqoiYNaE3q2fMf_ZifRjmbKNQCVX7D6ZMClaUXUkFkaAuwmEALw_wcB&amp;cid=CAASQuRoMccxWhBvMq-1Uez3XOZti1ul7mTDotKvSMoDHv0q2xCsyS2FzMptO5dJf3tmfkLRu22TtD8ChTmdjvs6YetTjQ&amp;cce=2&amp;category=acrcp_v1_35&amp;sig=AOD64_2xE2xolEEVbpDb56qXQluxTzs-Aw&amp;q&amp;nis=4&amp;adurl&amp;ved=2ahUKEwj7le3z1tiQAxWXcvUHHfZePbAQ0Qx6BAgUEAE</p><p><br></p><p><a href="https://www.getdbt.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbtLabs</a></p><p>https://www.getdbt.com/</p><p><br></p><p><a href="https://github.com/astronomer/astronomer-cosmos" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Cosmos</a></p><p>https://github.com/astronomer/astronomer-cosmos</p><p><br></p><p><a href="https://www.split.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Split</a></p><p>https://www.split.io/</p><p><br></p><p><a href="https://www.snowflake.com/en/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Snowflake</a></p><p>https://www.snowflake.com/en/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/1f649f31-f3e0-45b8-9f5a-4e66b6f1dd26/68e60f9a33.jpg" />
  <pubDate>Thu, 11 Dec 2025 04:02:06 -0500</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="18728414" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/1f649f31-f3e0-45b8-9f5a-4e66b6f1dd26/episode.mp3" />
  <itunes:title><![CDATA[The Role of Airflow in Building Smarter ML Pipelines at Vivian Health with Max Calehuff]]></itunes:title>
  <itunes:duration>19:30</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The integration of data orchestration and machine learning is critical to operational efficiency in healthcare tech. Vivian Health leverages Airflow to power both its ETL pipelines and ML workflows while maintaining strict compliance standards.</span></p><p><br></p><p><a href="https://www.linkedin.com/in/maxwell-calehuff/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Max Calehuff</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Lead Data Engineer at </span><a href="https://www.linkedin.com/company/vivianhealth/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Vivian Health</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins us to discuss how his team uses Airflow for ML ops, regulatory compliance and large-scale data orchestration. He also shares insights into upgrading to Airflow 3 and the importance of balancing flexibility with security in a healthcare environment.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">04:21 The role of Airflow in managing ETL pipelines and ML retraining.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">06:23 Using AWS SageMaker for ML training and deployment.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">07:47 Why Airflow’s versatility makes it ideal for MLOps.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">10:50 The importance of documentation and best practices for engineering teams.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">13:44 Automating anonymization of user data for compliance.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">15:30 The benefits of remote execution in Airflow 3 for regulated industries.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">18:16 Quality-of-life improvements and desired features in future Airflow versions.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/maxwell-calehuff/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Max Calehuff</a></p><p>https://www.linkedin.com/in/maxwell-calehuff/</p><p><br></p><p><a href="https://www.linkedin.com/company/vivianhealth/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Vivian Health</a> | LinkedIn</p><p>https://www.linkedin.com/company/vivianhealth/</p><p><br></p><p><a href="https://www.vivian.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Vivian Health</a> | Website</p><p>https://www.vivian.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a></p><p>https://www.astronomer.io/</p><p><br></p><p><a href="https://www.google.com/aclk?sa=L&amp;ai=DChsSEwj3-fbz1tiQAxWXlKYDHXUBBVoYACICCAEQABoCdGI&amp;ae=2&amp;aspm=1&amp;co=1&amp;ase=2&amp;gclid=Cj0KCQiA5abIBhCaARIsAM3-zFWbfj2olUvX4dqoiYNaE3q2fMf_ZifRjmbKNQCVX7D6ZMClaUXUkFkaAuwmEALw_wcB&amp;cid=CAASQuRoMccxWhBvMq-1Uez3XOZti1ul7mTDotKvSMoDHv0q2xCsyS2FzMptO5dJf3tmfkLRu22TtD8ChTmdjvs6YetTjQ&amp;cce=2&amp;category=acrcp_v1_35&amp;sig=AOD64_2xE2xolEEVbpDb56qXQluxTzs-Aw&amp;q&amp;nis=4&amp;adurl&amp;ved=2ahUKEwj7le3z1tiQAxWXcvUHHfZePbAQ0Qx6BAgUEAE" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">AWS SageMaker</a></p><p>https://www.google.com/aclk?sa=L&amp;ai=DChsSEwj3-fbz1tiQAxWXlKYDHXUBBVoYACICCAEQABoCdGI&amp;ae=2&amp;aspm=1&amp;co=1&amp;ase=2&amp;gclid=Cj0KCQiA5abIBhCaARIsAM3-zFWbfj2olUvX4dqoiYNaE3q2fMf_ZifRjmbKNQCVX7D6ZMClaUXUkFkaAuwmEALw_wcB&amp;cid=CAASQuRoMccxWhBvMq-1Uez3XOZti1ul7mTDotKvSMoDHv0q2xCsyS2FzMptO5dJf3tmfkLRu22TtD8ChTmdjvs6YetTjQ&amp;cce=2&amp;category=acrcp_v1_35&amp;sig=AOD64_2xE2xolEEVbpDb56qXQluxTzs-Aw&amp;q&amp;nis=4&amp;adurl&amp;ved=2ahUKEwj7le3z1tiQAxWXcvUHHfZePbAQ0Qx6BAgUEAE</p><p><br></p><p><a href="https://www.getdbt.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbtLabs</a></p><p>https://www.getdbt.com/</p><p><br></p><p><a href="https://github.com/astronomer/astronomer-cosmos" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Cosmos</a></p><p>https://github.com/astronomer/astronomer-cosmos</p><p><br></p><p><a href="https://www.split.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Split</a></p><p>https://www.split.io/</p><p><br></p><p><a href="https://www.snowflake.com/en/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Snowflake</a></p><p>https://www.snowflake.com/en/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The integration of data orchestration and machine learning is critical to operational efficiency in healthcare tech. Vivian Health leverages Airflow to power both its ETL pipelines and ML workflows while maintaining strict compliance standards.</span></p><p><br></p><p><a href="https://www.linkedin.com/in/maxwell-calehuff/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Max Calehuff</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Lead Data Engineer at </span><a href="https://www.linkedin.com/company/vivianhealth/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Vivian Health</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins us to discuss how his team uses Airflow for ML ops, regulatory compliance and large-scale data orchestration. He also shares insights into upgrading to Airflow 3 and the importance of balancing flexibility with security in a healthcare environment.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">04:21 The role of Airflow in managing ETL pipelines and ML retraining.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">06:23 Using AWS SageMaker for ML training and deployment.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">07:47 Why Airflow’s versatility makes it ideal for MLOps.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">10:50 The importance of documentation and best practices for engineering teams.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">13:44 Automating anonymization of user data for compliance.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">15:30 The benefits of remote execution in Airflow 3 for regulated industries.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">18:16 Quality-of-life improvements and desired features in future Airflow versions.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/maxwell-calehuff/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Max Calehuff</a></p><p>https://www.linkedin.com/in/maxwell-calehuff/</p><p><br></p><p><a href="https://www.linkedin.com/company/vivianhealth/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Vivian Health</a> | LinkedIn</p><p>https://www.linkedin.com/company/vivianhealth/</p><p><br></p><p><a href="https://www.vivian.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Vivian Health</a> | Website</p><p>https://www.vivian.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a></p><p>https://www.astronomer.io/</p><p><br></p><p><a href="https://www.google.com/aclk?sa=L&amp;ai=DChsSEwj3-fbz1tiQAxWXlKYDHXUBBVoYACICCAEQABoCdGI&amp;ae=2&amp;aspm=1&amp;co=1&amp;ase=2&amp;gclid=Cj0KCQiA5abIBhCaARIsAM3-zFWbfj2olUvX4dqoiYNaE3q2fMf_ZifRjmbKNQCVX7D6ZMClaUXUkFkaAuwmEALw_wcB&amp;cid=CAASQuRoMccxWhBvMq-1Uez3XOZti1ul7mTDotKvSMoDHv0q2xCsyS2FzMptO5dJf3tmfkLRu22TtD8ChTmdjvs6YetTjQ&amp;cce=2&amp;category=acrcp_v1_35&amp;sig=AOD64_2xE2xolEEVbpDb56qXQluxTzs-Aw&amp;q&amp;nis=4&amp;adurl&amp;ved=2ahUKEwj7le3z1tiQAxWXcvUHHfZePbAQ0Qx6BAgUEAE" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">AWS SageMaker</a></p><p>https://www.google.com/aclk?sa=L&amp;ai=DChsSEwj3-fbz1tiQAxWXlKYDHXUBBVoYACICCAEQABoCdGI&amp;ae=2&amp;aspm=1&amp;co=1&amp;ase=2&amp;gclid=Cj0KCQiA5abIBhCaARIsAM3-zFWbfj2olUvX4dqoiYNaE3q2fMf_ZifRjmbKNQCVX7D6ZMClaUXUkFkaAuwmEALw_wcB&amp;cid=CAASQuRoMccxWhBvMq-1Uez3XOZti1ul7mTDotKvSMoDHv0q2xCsyS2FzMptO5dJf3tmfkLRu22TtD8ChTmdjvs6YetTjQ&amp;cce=2&amp;category=acrcp_v1_35&amp;sig=AOD64_2xE2xolEEVbpDb56qXQluxTzs-Aw&amp;q&amp;nis=4&amp;adurl&amp;ved=2ahUKEwj7le3z1tiQAxWXcvUHHfZePbAQ0Qx6BAgUEAE</p><p><br></p><p><a href="https://www.getdbt.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbtLabs</a></p><p>https://www.getdbt.com/</p><p><br></p><p><a href="https://github.com/astronomer/astronomer-cosmos" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Cosmos</a></p><p>https://github.com/astronomer/astronomer-cosmos</p><p><br></p><p><a href="https://www.split.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Split</a></p><p>https://www.split.io/</p><p><br></p><p><a href="https://www.snowflake.com/en/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Snowflake</a></p><p>https://www.snowflake.com/en/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[The integration of data orchestration and machine learning is critical to operational efficiency in healthcare tech. Vivian Health leverages Airflow to power both its ETL pipelines and ML workflows while maintaining strict compliance standards.Max ...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>65</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[e7711863-7224-41af-95c5-e27b21bb1ce6]]></guid>
  <title><![CDATA[Scaling Airflow to 11,000 DAGs Across Three Regions at Intercom with András Gombosi and Paul Vickers]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The evolution of Intercom’s data infrastructure reveals how a well-built orchestration system can scale to serve global needs. With thousands of DAGs powering analytics, AI and customer operations, the team’s approach combines technical depth with organizational insight.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/andrasgombosi/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">András Gombosi</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, </span><span style="background-color: transparent;">Senior Engineering Manager of Data Infra and Analytics Engineering</span><span style="background-color: transparent; color: rgb(22, 14, 61);">, and </span><a href="https://www.linkedin.com/in/paul-vickers-a22b76a3/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Paul Vickers</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Principal Engineer, both at </span><a href="https://www.linkedin.com/company/intercom/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Intercom</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, share how they built one of the largest Airflow deployments in production and enabled self-serve data platforms across teams.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">04:24 Community input encourages confident adoption of a common platform.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">08:50 Self-serve workflows require consistent guardrails and review.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">09:25 Internal infrastructure support accelerates scalable deployments.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">13:26 Batch LLM processing benefits from a configuration-driven design.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">15:20 Standardized development environments enable effective AI-assisted work.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">19:58 Applied AI enhances internal analysis and operational enablement.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">27:27 Strong test coverage and staged upgrades protect stability.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">30:36 Proactive observability and on-call ownership improve outcomes.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/andrasgombosi/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">András Gombosi</a></p><p>https://www.linkedin.com/in/andrasgombosi/</p><p><br></p><p><a href="https://www.linkedin.com/in/paul-vickers-a22b76a3/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Paul Vickers</a></p><p>https://www.linkedin.com/in/paul-vickers-a22b76a3/</p><p><br></p><p><a href="https://www.linkedin.com/company/intercom/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Intercom</a> | LinkedIn</p><p>https://www.linkedin.com/company/intercom/</p><p><br></p><p><a href="https://www.intercom.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Intercom</a> | Website</p><p>https://www.intercom.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.getdbt.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbtLabs</a></p><p>https://www.getdbt.com/</p><p><br></p><p><a href="https://www.snowflake.com/en/product/features/cortex/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Snowflake Cortex AI</a></p><p>https://www.snowflake.com/en/product/features/cortex/</p><p><br></p><p><a href="https://www.datadoghq.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Datadog</a></p><p>https://www.datadoghq.com/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/606cad1e-b592-4394-9b86-d572d7562f4a/f2ec2cdc86.jpg" />
  <pubDate>Thu, 04 Dec 2025 03:33:35 -0500</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="33033067" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/606cad1e-b592-4394-9b86-d572d7562f4a/episode.mp3" />
  <itunes:title><![CDATA[Scaling Airflow to 11,000 DAGs Across Three Regions at Intercom with András Gombosi and Paul Vickers]]></itunes:title>
  <itunes:duration>34:24</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The evolution of Intercom’s data infrastructure reveals how a well-built orchestration system can scale to serve global needs. With thousands of DAGs powering analytics, AI and customer operations, the team’s approach combines technical depth with organizational insight.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/andrasgombosi/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">András Gombosi</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, </span><span style="background-color: transparent;">Senior Engineering Manager of Data Infra and Analytics Engineering</span><span style="background-color: transparent; color: rgb(22, 14, 61);">, and </span><a href="https://www.linkedin.com/in/paul-vickers-a22b76a3/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Paul Vickers</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Principal Engineer, both at </span><a href="https://www.linkedin.com/company/intercom/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Intercom</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, share how they built one of the largest Airflow deployments in production and enabled self-serve data platforms across teams.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">04:24 Community input encourages confident adoption of a common platform.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">08:50 Self-serve workflows require consistent guardrails and review.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">09:25 Internal infrastructure support accelerates scalable deployments.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">13:26 Batch LLM processing benefits from a configuration-driven design.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">15:20 Standardized development environments enable effective AI-assisted work.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">19:58 Applied AI enhances internal analysis and operational enablement.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">27:27 Strong test coverage and staged upgrades protect stability.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">30:36 Proactive observability and on-call ownership improve outcomes.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/andrasgombosi/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">András Gombosi</a></p><p>https://www.linkedin.com/in/andrasgombosi/</p><p><br></p><p><a href="https://www.linkedin.com/in/paul-vickers-a22b76a3/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Paul Vickers</a></p><p>https://www.linkedin.com/in/paul-vickers-a22b76a3/</p><p><br></p><p><a href="https://www.linkedin.com/company/intercom/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Intercom</a> | LinkedIn</p><p>https://www.linkedin.com/company/intercom/</p><p><br></p><p><a href="https://www.intercom.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Intercom</a> | Website</p><p>https://www.intercom.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.getdbt.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbtLabs</a></p><p>https://www.getdbt.com/</p><p><br></p><p><a href="https://www.snowflake.com/en/product/features/cortex/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Snowflake Cortex AI</a></p><p>https://www.snowflake.com/en/product/features/cortex/</p><p><br></p><p><a href="https://www.datadoghq.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Datadog</a></p><p>https://www.datadoghq.com/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The evolution of Intercom’s data infrastructure reveals how a well-built orchestration system can scale to serve global needs. With thousands of DAGs powering analytics, AI and customer operations, the team’s approach combines technical depth with organizational insight.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/andrasgombosi/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">András Gombosi</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, </span><span style="background-color: transparent;">Senior Engineering Manager of Data Infra and Analytics Engineering</span><span style="background-color: transparent; color: rgb(22, 14, 61);">, and </span><a href="https://www.linkedin.com/in/paul-vickers-a22b76a3/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Paul Vickers</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Principal Engineer, both at </span><a href="https://www.linkedin.com/company/intercom/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Intercom</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, share how they built one of the largest Airflow deployments in production and enabled self-serve data platforms across teams.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">04:24 Community input encourages confident adoption of a common platform.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">08:50 Self-serve workflows require consistent guardrails and review.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">09:25 Internal infrastructure support accelerates scalable deployments.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">13:26 Batch LLM processing benefits from a configuration-driven design.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">15:20 Standardized development environments enable effective AI-assisted work.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">19:58 Applied AI enhances internal analysis and operational enablement.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">27:27 Strong test coverage and staged upgrades protect stability.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">30:36 Proactive observability and on-call ownership improve outcomes.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/andrasgombosi/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">András Gombosi</a></p><p>https://www.linkedin.com/in/andrasgombosi/</p><p><br></p><p><a href="https://www.linkedin.com/in/paul-vickers-a22b76a3/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Paul Vickers</a></p><p>https://www.linkedin.com/in/paul-vickers-a22b76a3/</p><p><br></p><p><a href="https://www.linkedin.com/company/intercom/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Intercom</a> | LinkedIn</p><p>https://www.linkedin.com/company/intercom/</p><p><br></p><p><a href="https://www.intercom.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Intercom</a> | Website</p><p>https://www.intercom.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.getdbt.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbtLabs</a></p><p>https://www.getdbt.com/</p><p><br></p><p><a href="https://www.snowflake.com/en/product/features/cortex/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Snowflake Cortex AI</a></p><p>https://www.snowflake.com/en/product/features/cortex/</p><p><br></p><p><a href="https://www.datadoghq.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Datadog</a></p><p>https://www.datadoghq.com/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[The evolution of Intercom’s data infrastructure reveals how a well-built orchestration system can scale to serve global needs. With thousands of DAGs powering analytics, AI and customer operations, the team’s approach combines technical depth with ...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>65</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[776278b4-5f92-49e5-8bc5-d46cb7595caf]]></guid>
  <title><![CDATA[How Covestro Turns Airflow Into a Simulation Toolbox with Anja Mackenzie]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Building scalable, reproducible workflows for scientific computing often requires bridging the gap between research flexibility and enterprise reliability.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/anja-mackenzie/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Anja MacKenzie</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Expert for Cheminformatics at </span><a href="https://www.linkedin.com/company/covestro/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Covestro</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, explains how her team uses Airflow and Kubernetes to create a shared, self-service platform for computational chemistry.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">06:19 Custom scripts made sharing and reuse difficult.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">09:29 Workflows are manually triggered with user traceability.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">10:38 Customization supports varied compute requirements.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">12:48 Persistent volumes allow tasks to share large amounts of data.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:25 Custom operators separate logic from infrastructure.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">16:43 Modified triggers connect dependent workflows.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">18:36 UI plugins enable file uploads and secure access.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/anja-mackenzie/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Anja MacKenzie</a></p><p>https://www.linkedin.com/in/anja-mackenzie/</p><p><br></p><p><a href="https://www.linkedin.com/company/covestro/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Covestro</a> | LinkedIn</p><p>https://www.linkedin.com/company/covestro/</p><p><br></p><p><a href="https://www.covestro.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Covestro</a> | Website</p><p>https://www.covestro.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://kubernetes.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a></p><p>https://kubernetes.io/</p><p><br></p><p><a href="https://airflow.apache.org/docs/apache-airflow-providers-cncf-kubernetes/stable/operators.html" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow KubernetesPodOperator</a></p><p>https://airflow.apache.org/docs/apache-airflow-providers-cncf-kubernetes/stable/operators.html</p><p><br></p><p><a href="https://www.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a></p><p>https://www.astronomer.io/</p><p><br></p><p><a href="https://www.udemy.com/user/lockgfg/?utm_source=adwords&amp;utm_medium=udemyads&amp;utm_campaign=Search_DSA_GammaCatchall_NonP_la.EN_cc.ROW-English&amp;campaigntype=Search&amp;portfolio=ROW-English&amp;language=EN&amp;product=Course&amp;test=&amp;audience=DSA&amp;topic=&amp;priority=Gamma&amp;utm_content=deal4584&amp;utm_term=_._ag_169801645584_._ad_700876640602_._kw__._de_c_._dm__._pl__._ti_dsa-1456167871416_._li_9061346_._pd__._&amp;matchtype=&amp;gad_source=1&amp;gad_campaignid=21341313808&amp;gbraid=0AAAAADROdO1_-I2TMcVyU8F3i1jRXJ24K&amp;gclid=Cj0KCQjwvJHIBhCgARIsAEQnWlC1uYHIRm3y9Q8rPNSuVPNivsxogqfczpKHwhmNho2uKZYC-y0taNQaApU2EALw_wcB" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Academy by Marc Lamberti</a></p><p>https://www.udemy.com/user/lockgfg/?utm_source=adwords&amp;utm_medium=udemyads&amp;utm_campaign=Search_DSA_GammaCatchall_NonP_la.EN_cc.ROW-English&amp;campaigntype=Search&amp;portfolio=ROW-English&amp;language=EN&amp;product=Course&amp;test=&amp;audience=DSA&amp;topic=&amp;priority=Gamma&amp;utm_content=deal4584&amp;utm_term=_._ag_169801645584_._ad_700876640602_._kw__._de_c_._dm__._pl__._ti_dsa-1456167871416_._li_9061346_._pd__._&amp;matchtype=&amp;gad_source=1&amp;gad_campaignid=21341313808&amp;gbraid=0AAAAADROdO1_-I2TMcVyU8F3i1jRXJ24K&amp;gclid=Cj0KCQjwvJHIBhCgARIsAEQnWlC1uYHIRm3y9Q8rPNSuVPNivsxogqfczpKHwhmNho2uKZYC-y0taNQaApU2EALw_wcB</p><p><br></p><p><a href="https://airflow.apache.org/docs/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Documentation</a></p><p>https://airflow.apache.org/docs/</p><p><br></p><p><a href="https://airflow.apache.org/docs/apache-airflow/1.10.9/plugins.html" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Plugins</a></p><p>https://airflow.apache.org/docs/apache-airflow/1.10.9/plugins.html</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/d87fa102-b114-4740-b847-92c891b3361c/8184098455.jpg" />
  <pubDate>Thu, 20 Nov 2025 00:50:13 -0500</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="22250974" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/d87fa102-b114-4740-b847-92c891b3361c/episode.mp3" />
  <itunes:title><![CDATA[How Covestro Turns Airflow Into a Simulation Toolbox with Anja Mackenzie]]></itunes:title>
  <itunes:duration>23:10</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Building scalable, reproducible workflows for scientific computing often requires bridging the gap between research flexibility and enterprise reliability.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/anja-mackenzie/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Anja MacKenzie</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Expert for Cheminformatics at </span><a href="https://www.linkedin.com/company/covestro/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Covestro</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, explains how her team uses Airflow and Kubernetes to create a shared, self-service platform for computational chemistry.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">06:19 Custom scripts made sharing and reuse difficult.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">09:29 Workflows are manually triggered with user traceability.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">10:38 Customization supports varied compute requirements.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">12:48 Persistent volumes allow tasks to share large amounts of data.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:25 Custom operators separate logic from infrastructure.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">16:43 Modified triggers connect dependent workflows.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">18:36 UI plugins enable file uploads and secure access.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/anja-mackenzie/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Anja MacKenzie</a></p><p>https://www.linkedin.com/in/anja-mackenzie/</p><p><br></p><p><a href="https://www.linkedin.com/company/covestro/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Covestro</a> | LinkedIn</p><p>https://www.linkedin.com/company/covestro/</p><p><br></p><p><a href="https://www.covestro.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Covestro</a> | Website</p><p>https://www.covestro.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://kubernetes.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a></p><p>https://kubernetes.io/</p><p><br></p><p><a href="https://airflow.apache.org/docs/apache-airflow-providers-cncf-kubernetes/stable/operators.html" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow KubernetesPodOperator</a></p><p>https://airflow.apache.org/docs/apache-airflow-providers-cncf-kubernetes/stable/operators.html</p><p><br></p><p><a href="https://www.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a></p><p>https://www.astronomer.io/</p><p><br></p><p><a href="https://www.udemy.com/user/lockgfg/?utm_source=adwords&amp;utm_medium=udemyads&amp;utm_campaign=Search_DSA_GammaCatchall_NonP_la.EN_cc.ROW-English&amp;campaigntype=Search&amp;portfolio=ROW-English&amp;language=EN&amp;product=Course&amp;test=&amp;audience=DSA&amp;topic=&amp;priority=Gamma&amp;utm_content=deal4584&amp;utm_term=_._ag_169801645584_._ad_700876640602_._kw__._de_c_._dm__._pl__._ti_dsa-1456167871416_._li_9061346_._pd__._&amp;matchtype=&amp;gad_source=1&amp;gad_campaignid=21341313808&amp;gbraid=0AAAAADROdO1_-I2TMcVyU8F3i1jRXJ24K&amp;gclid=Cj0KCQjwvJHIBhCgARIsAEQnWlC1uYHIRm3y9Q8rPNSuVPNivsxogqfczpKHwhmNho2uKZYC-y0taNQaApU2EALw_wcB" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Academy by Marc Lamberti</a></p><p>https://www.udemy.com/user/lockgfg/?utm_source=adwords&amp;utm_medium=udemyads&amp;utm_campaign=Search_DSA_GammaCatchall_NonP_la.EN_cc.ROW-English&amp;campaigntype=Search&amp;portfolio=ROW-English&amp;language=EN&amp;product=Course&amp;test=&amp;audience=DSA&amp;topic=&amp;priority=Gamma&amp;utm_content=deal4584&amp;utm_term=_._ag_169801645584_._ad_700876640602_._kw__._de_c_._dm__._pl__._ti_dsa-1456167871416_._li_9061346_._pd__._&amp;matchtype=&amp;gad_source=1&amp;gad_campaignid=21341313808&amp;gbraid=0AAAAADROdO1_-I2TMcVyU8F3i1jRXJ24K&amp;gclid=Cj0KCQjwvJHIBhCgARIsAEQnWlC1uYHIRm3y9Q8rPNSuVPNivsxogqfczpKHwhmNho2uKZYC-y0taNQaApU2EALw_wcB</p><p><br></p><p><a href="https://airflow.apache.org/docs/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Documentation</a></p><p>https://airflow.apache.org/docs/</p><p><br></p><p><a href="https://airflow.apache.org/docs/apache-airflow/1.10.9/plugins.html" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Plugins</a></p><p>https://airflow.apache.org/docs/apache-airflow/1.10.9/plugins.html</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Building scalable, reproducible workflows for scientific computing often requires bridging the gap between research flexibility and enterprise reliability.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/anja-mackenzie/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Anja MacKenzie</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Expert for Cheminformatics at </span><a href="https://www.linkedin.com/company/covestro/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Covestro</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, explains how her team uses Airflow and Kubernetes to create a shared, self-service platform for computational chemistry.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">06:19 Custom scripts made sharing and reuse difficult.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">09:29 Workflows are manually triggered with user traceability.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">10:38 Customization supports varied compute requirements.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">12:48 Persistent volumes allow tasks to share large amounts of data.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:25 Custom operators separate logic from infrastructure.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">16:43 Modified triggers connect dependent workflows.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">18:36 UI plugins enable file uploads and secure access.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/anja-mackenzie/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Anja MacKenzie</a></p><p>https://www.linkedin.com/in/anja-mackenzie/</p><p><br></p><p><a href="https://www.linkedin.com/company/covestro/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Covestro</a> | LinkedIn</p><p>https://www.linkedin.com/company/covestro/</p><p><br></p><p><a href="https://www.covestro.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Covestro</a> | Website</p><p>https://www.covestro.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://kubernetes.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a></p><p>https://kubernetes.io/</p><p><br></p><p><a href="https://airflow.apache.org/docs/apache-airflow-providers-cncf-kubernetes/stable/operators.html" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow KubernetesPodOperator</a></p><p>https://airflow.apache.org/docs/apache-airflow-providers-cncf-kubernetes/stable/operators.html</p><p><br></p><p><a href="https://www.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a></p><p>https://www.astronomer.io/</p><p><br></p><p><a href="https://www.udemy.com/user/lockgfg/?utm_source=adwords&amp;utm_medium=udemyads&amp;utm_campaign=Search_DSA_GammaCatchall_NonP_la.EN_cc.ROW-English&amp;campaigntype=Search&amp;portfolio=ROW-English&amp;language=EN&amp;product=Course&amp;test=&amp;audience=DSA&amp;topic=&amp;priority=Gamma&amp;utm_content=deal4584&amp;utm_term=_._ag_169801645584_._ad_700876640602_._kw__._de_c_._dm__._pl__._ti_dsa-1456167871416_._li_9061346_._pd__._&amp;matchtype=&amp;gad_source=1&amp;gad_campaignid=21341313808&amp;gbraid=0AAAAADROdO1_-I2TMcVyU8F3i1jRXJ24K&amp;gclid=Cj0KCQjwvJHIBhCgARIsAEQnWlC1uYHIRm3y9Q8rPNSuVPNivsxogqfczpKHwhmNho2uKZYC-y0taNQaApU2EALw_wcB" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Academy by Marc Lamberti</a></p><p>https://www.udemy.com/user/lockgfg/?utm_source=adwords&amp;utm_medium=udemyads&amp;utm_campaign=Search_DSA_GammaCatchall_NonP_la.EN_cc.ROW-English&amp;campaigntype=Search&amp;portfolio=ROW-English&amp;language=EN&amp;product=Course&amp;test=&amp;audience=DSA&amp;topic=&amp;priority=Gamma&amp;utm_content=deal4584&amp;utm_term=_._ag_169801645584_._ad_700876640602_._kw__._de_c_._dm__._pl__._ti_dsa-1456167871416_._li_9061346_._pd__._&amp;matchtype=&amp;gad_source=1&amp;gad_campaignid=21341313808&amp;gbraid=0AAAAADROdO1_-I2TMcVyU8F3i1jRXJ24K&amp;gclid=Cj0KCQjwvJHIBhCgARIsAEQnWlC1uYHIRm3y9Q8rPNSuVPNivsxogqfczpKHwhmNho2uKZYC-y0taNQaApU2EALw_wcB</p><p><br></p><p><a href="https://airflow.apache.org/docs/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Documentation</a></p><p>https://airflow.apache.org/docs/</p><p><br></p><p><a href="https://airflow.apache.org/docs/apache-airflow/1.10.9/plugins.html" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Plugins</a></p><p>https://airflow.apache.org/docs/apache-airflow/1.10.9/plugins.html</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Building scalable, reproducible workflows for scientific computing often requires bridging the gap between research flexibility and enterprise reliability.In this episode, Anja MacKenzie, Expert for Cheminformatics at Covestro, explains how her tea...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>64</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[d3ddc0ca-6c19-4a25-955d-b86a848ccc33]]></guid>
  <title><![CDATA[Building Secure Financial Data Platforms at AgileEngine with Valentyn Druzhynin]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The use of Apache Airflow in financial services demands a balance between innovation and compliance. Agile Engine’s approach to orchestration showcases how secure, auditable workflows can scale even within the constraints of regulatory environments.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/valentyn-druzhynin/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Valentyn Druzhynin</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Data Engineer at </span><a href="https://www.linkedin.com/company/agileengine/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">AgileEngine</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, discusses how his team leverages Airflow for ETF calculations, data validation and workflow reliability within tightly controlled release cycles.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">03:24 The orchestrator ensures secure and auditable workflows.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">05:13 Validations before and after computation prevent errors.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">08:24 Release freezes shape prioritization and delivery plans.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">11:14 Migration plans must respect managed service constraints.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">13:04 Versioning, backfills and event triggers increase reliability.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">15:08 UI and integration improvements simplify operations.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">18:05 New contributors should start small and seek help.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/valentyn-druzhynin/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Valentyn Druzhynin</a></p><p>https://www.linkedin.com/in/valentyn-druzhynin/</p><p><br></p><p><a href="https://www.linkedin.com/company/agileengine/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">AgileEngine</a> | LinkedIn</p><p>https://www.linkedin.com/company/agileengine/</p><p><br></p><p><a href="https://agileengine.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">AgileEngine</a> | Website</p><p>https://agileengine.com/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a></p><p>https://www.astronomer.io/</p><p><br></p><p><a href="https://aws.amazon.com/managed-workflows-for-apache-airflow/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">AWS Managed Airflow</a></p><p>https://aws.amazon.com/managed-workflows-for-apache-airflow/</p><p><br></p><p><a href="https://cloud.google.com/composer" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Google Cloud Composer (Managed Airflow)</a></p><p>https://cloud.google.com/composer</p><p><br></p><p><a href="https://airflowsummit.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Summit</a></p><p>https://airflowsummit.org/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/2b42d2c0-e208-461c-a6aa-b216e93c85bf/20ea702a70.jpg" />
  <pubDate>Thu, 13 Nov 2025 06:24:37 -0500</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="20424492" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/2b42d2c0-e208-461c-a6aa-b216e93c85bf/episode.mp3" />
  <itunes:title><![CDATA[Building Secure Financial Data Platforms at AgileEngine with Valentyn Druzhynin]]></itunes:title>
  <itunes:duration>21:16</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The use of Apache Airflow in financial services demands a balance between innovation and compliance. Agile Engine’s approach to orchestration showcases how secure, auditable workflows can scale even within the constraints of regulatory environments.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/valentyn-druzhynin/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Valentyn Druzhynin</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Data Engineer at </span><a href="https://www.linkedin.com/company/agileengine/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">AgileEngine</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, discusses how his team leverages Airflow for ETF calculations, data validation and workflow reliability within tightly controlled release cycles.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">03:24 The orchestrator ensures secure and auditable workflows.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">05:13 Validations before and after computation prevent errors.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">08:24 Release freezes shape prioritization and delivery plans.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">11:14 Migration plans must respect managed service constraints.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">13:04 Versioning, backfills and event triggers increase reliability.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">15:08 UI and integration improvements simplify operations.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">18:05 New contributors should start small and seek help.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/valentyn-druzhynin/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Valentyn Druzhynin</a></p><p>https://www.linkedin.com/in/valentyn-druzhynin/</p><p><br></p><p><a href="https://www.linkedin.com/company/agileengine/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">AgileEngine</a> | LinkedIn</p><p>https://www.linkedin.com/company/agileengine/</p><p><br></p><p><a href="https://agileengine.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">AgileEngine</a> | Website</p><p>https://agileengine.com/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a></p><p>https://www.astronomer.io/</p><p><br></p><p><a href="https://aws.amazon.com/managed-workflows-for-apache-airflow/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">AWS Managed Airflow</a></p><p>https://aws.amazon.com/managed-workflows-for-apache-airflow/</p><p><br></p><p><a href="https://cloud.google.com/composer" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Google Cloud Composer (Managed Airflow)</a></p><p>https://cloud.google.com/composer</p><p><br></p><p><a href="https://airflowsummit.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Summit</a></p><p>https://airflowsummit.org/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The use of Apache Airflow in financial services demands a balance between innovation and compliance. Agile Engine’s approach to orchestration showcases how secure, auditable workflows can scale even within the constraints of regulatory environments.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/valentyn-druzhynin/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Valentyn Druzhynin</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Data Engineer at </span><a href="https://www.linkedin.com/company/agileengine/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">AgileEngine</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, discusses how his team leverages Airflow for ETF calculations, data validation and workflow reliability within tightly controlled release cycles.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">03:24 The orchestrator ensures secure and auditable workflows.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">05:13 Validations before and after computation prevent errors.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">08:24 Release freezes shape prioritization and delivery plans.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">11:14 Migration plans must respect managed service constraints.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">13:04 Versioning, backfills and event triggers increase reliability.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">15:08 UI and integration improvements simplify operations.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">18:05 New contributors should start small and seek help.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/valentyn-druzhynin/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Valentyn Druzhynin</a></p><p>https://www.linkedin.com/in/valentyn-druzhynin/</p><p><br></p><p><a href="https://www.linkedin.com/company/agileengine/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">AgileEngine</a> | LinkedIn</p><p>https://www.linkedin.com/company/agileengine/</p><p><br></p><p><a href="https://agileengine.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">AgileEngine</a> | Website</p><p>https://agileengine.com/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a></p><p>https://www.astronomer.io/</p><p><br></p><p><a href="https://aws.amazon.com/managed-workflows-for-apache-airflow/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">AWS Managed Airflow</a></p><p>https://aws.amazon.com/managed-workflows-for-apache-airflow/</p><p><br></p><p><a href="https://cloud.google.com/composer" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Google Cloud Composer (Managed Airflow)</a></p><p>https://cloud.google.com/composer</p><p><br></p><p><a href="https://airflowsummit.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Summit</a></p><p>https://airflowsummit.org/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[The use of Apache Airflow in financial services demands a balance between innovation and compliance. Agile Engine’s approach to orchestration showcases how secure, auditable workflows can scale even within the constraints of regulatory environments...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>63</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[335bfd00-bf0c-41b9-ab69-2178b2ca088f]]></guid>
  <title><![CDATA[How Redica Transformed Their Data With Airflow and Snowflake with Shankar Mahindar]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The life sciences industry relies on data accuracy, regulatory insight and quality intelligence. Building a unified system that keeps these elements aligned is no small feat.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we welcome </span><a href="https://www.linkedin.com/in/shankar-mahindar-83a61b137/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Shankar Mahindar</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Data Engineer II at </span><a href="https://www.linkedin.com/company/redicasystems/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Redica Systems</a><span style="background-color: transparent; color: rgb(22, 14, 61);">. We discuss how the team restructures its data platform with Airflow to strengthen governance, reduce compliance risk and improve customer experience.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">01:53 A focused analytics platform reduces compliance risk in life sciences.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">07:31 A centralized warehouse orchestrated by Airflow strengthens governance.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">09:12 Managed orchestration keeps attention on analytics and outcomes.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">10:32 A modern transformation stack enables scalable modeling and operations.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">11:51 Event-driven pipelines improve data freshness and responsiveness.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:13 Asset-oriented scheduling and versioning enhance reliability and change control.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">16:53 Observability and SLAs build confidence in data quality and freshness.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">21:04 Priorities include partitioned assets and streamlined developer tooling.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/shankar-mahindar-83a61b137/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Shankar Mahindar</a></p><p>https://www.linkedin.com/in/shankar-mahindar-83a61b137/</p><p><br></p><p><a href="https://www.linkedin.com/company/redicasystems/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Redica Systems</a> | LinkedIn</p><p>https://www.linkedin.com/company/redicasystems/</p><p><br></p><p><a href="https://redica.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Redica Systems</a> | Website</p><p>https://redica.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a></p><p>https://www.astronomer.io/</p><p><br></p><p><a href="https://www.snowflake.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Snowflake</a></p><p>https://www.snowflake.com/</p><p><br></p><p><a href="https://aws.amazon.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">AWS</a></p><p>https://aws.amazon.com/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/ca378f3d-5b07-412b-9e76-29f22b6a8803/ae2226a5df.jpg" />
  <pubDate>Thu, 06 Nov 2025 04:23:27 -0500</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="22863702" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/ca378f3d-5b07-412b-9e76-29f22b6a8803/episode.mp3" />
  <itunes:title><![CDATA[How Redica Transformed Their Data With Airflow and Snowflake with Shankar Mahindar]]></itunes:title>
  <itunes:duration>23:48</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The life sciences industry relies on data accuracy, regulatory insight and quality intelligence. Building a unified system that keeps these elements aligned is no small feat.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we welcome </span><a href="https://www.linkedin.com/in/shankar-mahindar-83a61b137/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Shankar Mahindar</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Data Engineer II at </span><a href="https://www.linkedin.com/company/redicasystems/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Redica Systems</a><span style="background-color: transparent; color: rgb(22, 14, 61);">. We discuss how the team restructures its data platform with Airflow to strengthen governance, reduce compliance risk and improve customer experience.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">01:53 A focused analytics platform reduces compliance risk in life sciences.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">07:31 A centralized warehouse orchestrated by Airflow strengthens governance.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">09:12 Managed orchestration keeps attention on analytics and outcomes.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">10:32 A modern transformation stack enables scalable modeling and operations.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">11:51 Event-driven pipelines improve data freshness and responsiveness.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:13 Asset-oriented scheduling and versioning enhance reliability and change control.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">16:53 Observability and SLAs build confidence in data quality and freshness.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">21:04 Priorities include partitioned assets and streamlined developer tooling.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/shankar-mahindar-83a61b137/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Shankar Mahindar</a></p><p>https://www.linkedin.com/in/shankar-mahindar-83a61b137/</p><p><br></p><p><a href="https://www.linkedin.com/company/redicasystems/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Redica Systems</a> | LinkedIn</p><p>https://www.linkedin.com/company/redicasystems/</p><p><br></p><p><a href="https://redica.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Redica Systems</a> | Website</p><p>https://redica.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a></p><p>https://www.astronomer.io/</p><p><br></p><p><a href="https://www.snowflake.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Snowflake</a></p><p>https://www.snowflake.com/</p><p><br></p><p><a href="https://aws.amazon.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">AWS</a></p><p>https://aws.amazon.com/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The life sciences industry relies on data accuracy, regulatory insight and quality intelligence. Building a unified system that keeps these elements aligned is no small feat.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we welcome </span><a href="https://www.linkedin.com/in/shankar-mahindar-83a61b137/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Shankar Mahindar</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Data Engineer II at </span><a href="https://www.linkedin.com/company/redicasystems/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Redica Systems</a><span style="background-color: transparent; color: rgb(22, 14, 61);">. We discuss how the team restructures its data platform with Airflow to strengthen governance, reduce compliance risk and improve customer experience.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">01:53 A focused analytics platform reduces compliance risk in life sciences.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">07:31 A centralized warehouse orchestrated by Airflow strengthens governance.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">09:12 Managed orchestration keeps attention on analytics and outcomes.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">10:32 A modern transformation stack enables scalable modeling and operations.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">11:51 Event-driven pipelines improve data freshness and responsiveness.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:13 Asset-oriented scheduling and versioning enhance reliability and change control.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">16:53 Observability and SLAs build confidence in data quality and freshness.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">21:04 Priorities include partitioned assets and streamlined developer tooling.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/shankar-mahindar-83a61b137/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Shankar Mahindar</a></p><p>https://www.linkedin.com/in/shankar-mahindar-83a61b137/</p><p><br></p><p><a href="https://www.linkedin.com/company/redicasystems/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Redica Systems</a> | LinkedIn</p><p>https://www.linkedin.com/company/redicasystems/</p><p><br></p><p><a href="https://redica.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Redica Systems</a> | Website</p><p>https://redica.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a></p><p>https://www.astronomer.io/</p><p><br></p><p><a href="https://www.snowflake.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Snowflake</a></p><p>https://www.snowflake.com/</p><p><br></p><p><a href="https://aws.amazon.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">AWS</a></p><p>https://aws.amazon.com/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[The life sciences industry relies on data accuracy, regulatory insight and quality intelligence. Building a unified system that keeps these elements aligned is no small feat.In this episode, we welcome Shankar Mahindar, Senior Data Engineer II at R...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>62</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[6015cd4b-53b7-49d3-98f8-28906da9ea59]]></guid>
  <title><![CDATA[How Airflow and AI Power Investigative Journalism at the Financial Times with Zdravko Hvarlingov]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The Financial Times leverages Airflow and AI to uncover powerful stories hidden within vast, unstructured data.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/zdravko-hvarlingov-3aa36016b/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Zdravko Hvarlingov</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Software Engineer at the </span><a href="https://www.linkedin.com/company/financial-times/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Financial Times</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, discusses building multi-tenant Airflow systems and AI-driven pipelines that surface stories that might otherwise be missed. Zdravko walks through entity extraction and fuzzy matching, linking the UK Register of Members’ Financial Interests with Companies House, and how this work cuts weeks of manual analysis to minutes.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">02:12 What computational journalism means for day-to-day newsroom work.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">05:22 Why a shared orchestration platform supports consistent, scalable workflows.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">08:30 Tradeoffs of one centralized platform versus many separate instances.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">11:52 Using pipelines to structure messy sources for faster analysis.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:14 Turning recurring disclosures into usable data for investigations.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">16:03 Applying lightweight ML and matching to reveal entities and links.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">18:46 How automation reduces manual effort and shortens time to insight.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">20:41 Practical improvements that make backfilling and reliability easier.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/zdravko-hvarlingov-3aa36016b/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Zdravko Hvarlingov</a></p><p>https://www.linkedin.com/in/zdravko-hvarlingov-3aa36016b/</p><p><br></p><p><a href="https://www.linkedin.com/company/financial-times/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Financial Times</a> | LinkedIn</p><p>https://www.linkedin.com/company/financial-times/</p><p><br></p><p><a href="https://www.ft.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Financial Times</a> | Website</p><p>https://www.ft.com/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.parliament.uk/mps-lords-and-offices/standards-and-financial-interests/parliamentary-commissioner-for-standards/registers-of-interests/register-of-members-financial-interests/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">UK Register of Members’ Financial Interests</a></p><p>https://www.parliament.uk/mps-lords-and-offices/standards-and-financial-interests/parliamentary-commissioner-for-standards/registers-of-interests/register-of-members-financial-interests/</p><p><br></p><p><a href="https://www.gov.uk/government/organisations/companies-house" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">UK Companies House</a></p><p>https://www.gov.uk/government/organisations/companies-house</p><p><br></p><p><a href="https://www.doppler.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Doppler</a></p><p>https://www.doppler.com/</p><p><br></p><p><a href="https://kubernetes.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a></p><p>https://kubernetes.io/</p><p><br></p><p><a href="https://airflow.apache.org/docs/apache-airflow/stable/executor/kubernetes.html" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Kubernetes Executor</a></p><p>https://airflow.apache.org/docs/apache-airflow/stable/executor/kubernetes.html</p><p><br></p><p><a href="https://github.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">GitHub</a></p><p>https://github.com/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/c25ca044-0536-4054-b12f-9097274abe0e/03ab455aa0.jpg" />
  <pubDate>Thu, 30 Oct 2025 05:48:07 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="23493566" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/c25ca044-0536-4054-b12f-9097274abe0e/episode.mp3" />
  <itunes:title><![CDATA[How Airflow and AI Power Investigative Journalism at the Financial Times with Zdravko Hvarlingov]]></itunes:title>
  <itunes:duration>24:28</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The Financial Times leverages Airflow and AI to uncover powerful stories hidden within vast, unstructured data.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/zdravko-hvarlingov-3aa36016b/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Zdravko Hvarlingov</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Software Engineer at the </span><a href="https://www.linkedin.com/company/financial-times/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Financial Times</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, discusses building multi-tenant Airflow systems and AI-driven pipelines that surface stories that might otherwise be missed. Zdravko walks through entity extraction and fuzzy matching, linking the UK Register of Members’ Financial Interests with Companies House, and how this work cuts weeks of manual analysis to minutes.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">02:12 What computational journalism means for day-to-day newsroom work.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">05:22 Why a shared orchestration platform supports consistent, scalable workflows.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">08:30 Tradeoffs of one centralized platform versus many separate instances.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">11:52 Using pipelines to structure messy sources for faster analysis.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:14 Turning recurring disclosures into usable data for investigations.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">16:03 Applying lightweight ML and matching to reveal entities and links.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">18:46 How automation reduces manual effort and shortens time to insight.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">20:41 Practical improvements that make backfilling and reliability easier.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/zdravko-hvarlingov-3aa36016b/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Zdravko Hvarlingov</a></p><p>https://www.linkedin.com/in/zdravko-hvarlingov-3aa36016b/</p><p><br></p><p><a href="https://www.linkedin.com/company/financial-times/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Financial Times</a> | LinkedIn</p><p>https://www.linkedin.com/company/financial-times/</p><p><br></p><p><a href="https://www.ft.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Financial Times</a> | Website</p><p>https://www.ft.com/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.parliament.uk/mps-lords-and-offices/standards-and-financial-interests/parliamentary-commissioner-for-standards/registers-of-interests/register-of-members-financial-interests/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">UK Register of Members’ Financial Interests</a></p><p>https://www.parliament.uk/mps-lords-and-offices/standards-and-financial-interests/parliamentary-commissioner-for-standards/registers-of-interests/register-of-members-financial-interests/</p><p><br></p><p><a href="https://www.gov.uk/government/organisations/companies-house" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">UK Companies House</a></p><p>https://www.gov.uk/government/organisations/companies-house</p><p><br></p><p><a href="https://www.doppler.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Doppler</a></p><p>https://www.doppler.com/</p><p><br></p><p><a href="https://kubernetes.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a></p><p>https://kubernetes.io/</p><p><br></p><p><a href="https://airflow.apache.org/docs/apache-airflow/stable/executor/kubernetes.html" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Kubernetes Executor</a></p><p>https://airflow.apache.org/docs/apache-airflow/stable/executor/kubernetes.html</p><p><br></p><p><a href="https://github.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">GitHub</a></p><p>https://github.com/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The Financial Times leverages Airflow and AI to uncover powerful stories hidden within vast, unstructured data.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/zdravko-hvarlingov-3aa36016b/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Zdravko Hvarlingov</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Software Engineer at the </span><a href="https://www.linkedin.com/company/financial-times/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Financial Times</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, discusses building multi-tenant Airflow systems and AI-driven pipelines that surface stories that might otherwise be missed. Zdravko walks through entity extraction and fuzzy matching, linking the UK Register of Members’ Financial Interests with Companies House, and how this work cuts weeks of manual analysis to minutes.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">02:12 What computational journalism means for day-to-day newsroom work.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">05:22 Why a shared orchestration platform supports consistent, scalable workflows.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">08:30 Tradeoffs of one centralized platform versus many separate instances.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">11:52 Using pipelines to structure messy sources for faster analysis.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:14 Turning recurring disclosures into usable data for investigations.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">16:03 Applying lightweight ML and matching to reveal entities and links.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">18:46 How automation reduces manual effort and shortens time to insight.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">20:41 Practical improvements that make backfilling and reliability easier.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/zdravko-hvarlingov-3aa36016b/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Zdravko Hvarlingov</a></p><p>https://www.linkedin.com/in/zdravko-hvarlingov-3aa36016b/</p><p><br></p><p><a href="https://www.linkedin.com/company/financial-times/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Financial Times</a> | LinkedIn</p><p>https://www.linkedin.com/company/financial-times/</p><p><br></p><p><a href="https://www.ft.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Financial Times</a> | Website</p><p>https://www.ft.com/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.parliament.uk/mps-lords-and-offices/standards-and-financial-interests/parliamentary-commissioner-for-standards/registers-of-interests/register-of-members-financial-interests/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">UK Register of Members’ Financial Interests</a></p><p>https://www.parliament.uk/mps-lords-and-offices/standards-and-financial-interests/parliamentary-commissioner-for-standards/registers-of-interests/register-of-members-financial-interests/</p><p><br></p><p><a href="https://www.gov.uk/government/organisations/companies-house" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">UK Companies House</a></p><p>https://www.gov.uk/government/organisations/companies-house</p><p><br></p><p><a href="https://www.doppler.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Doppler</a></p><p>https://www.doppler.com/</p><p><br></p><p><a href="https://kubernetes.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a></p><p>https://kubernetes.io/</p><p><br></p><p><a href="https://airflow.apache.org/docs/apache-airflow/stable/executor/kubernetes.html" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Kubernetes Executor</a></p><p>https://airflow.apache.org/docs/apache-airflow/stable/executor/kubernetes.html</p><p><br></p><p><a href="https://github.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">GitHub</a></p><p>https://github.com/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[The Financial Times leverages Airflow and AI to uncover powerful stories hidden within vast, unstructured data.In this episode, Zdravko Hvarlingov, Senior Software Engineer at the Financial Times, discusses building multi-tenant Airflow systems and...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>61</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[f22b61aa-674e-4767-b810-d5fd5a84c86c]]></guid>
  <title><![CDATA[Inside Vinted’s Code-Generated Airflow Pipelines with Oscar Ligthart and Rodrigo Loredo]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The shift from monolithic to decentralized data workflows changes how teams build, connect and scale pipelines.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we feature </span><a href="https://www.linkedin.com/in/oscar-ligthart/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Oscar Ligthart</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Lead Data Engineer, and </span><a href="https://www.linkedin.com/in/rodrigo-loredo-410a16134/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Rodrigo Loredo</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Lead Analytics Engineer, both at </span><a href="https://www.linkedin.com/company/vinted/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Vinted</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, as we unpack their YAML-driven abstraction that generates Airflow DAGs and standardizes cross-team orchestration.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">05:28 Challenges of decentralization.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">06:45 YAML-based generator standardizes pipelines and dependencies.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">12:28 Declarative assets and sensors align cross-DAG dependencies.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">17:29 Task-level callbacks enable auto-recovery and clear ownership.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">21:39 Standardized building blocks simplify upgrades and maintenance.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">24:52 Platform focus frees domain work.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">26:49 Container-only standardization prevents sprawl.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/oscar-ligthart/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Oscar Ligthart</a></p><p>https://www.linkedin.com/in/oscar-ligthart/</p><p><br></p><p><a href="https://www.linkedin.com/in/rodrigo-loredo-410a16134/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Rodrigo Loredo</a></p><p>https://www.linkedin.com/in/rodrigo-loredo-410a16134/</p><p><br></p><p><a href="https://www.linkedin.com/company/vinted/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Vinted</a> | LinkedIn</p><p>https://www.linkedin.com/company/vinted/</p><p><br></p><p><a href="https://www.vinted.com/?srsltid=AfmBOor87MGR_eLOauCO93V9A-aLDaAhGYx9cnu_oN8s1SAXMlCRuhW7" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Vinted</a> | Website</p><p>https://www.vinted.com/?srsltid=AfmBOor87MGR_eLOauCO93V9A-aLDaAhGYx9cnu_oN8s1SAXMlCRuhW7</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://kubernetes.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a></p><p>https://kubernetes.io/</p><p><br></p><p><a href="https://www.getdbt.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbt</a></p><p>https://www.getdbt.com/</p><p><br></p><p><a href="https://cloud.google.com/vertex-ai" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Google Cloud Vertex AI</a></p><p>https://cloud.google.com/vertex-ai</p><p><br></p><p><a href="https://www.astronomer.io/docs/learn/airflow-datasets" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Datasets &amp; Assets (concepts)</a></p><p>https://www.astronomer.io/docs/learn/airflow-datasets</p><p><br></p><p><a href="https://airflowsummit.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Summit</a></p><p>https://airflowsummit.org/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/a755ea53-66f6-46d4-b984-504ecbf3e97d/08f781dde5.jpg" />
  <pubDate>Thu, 23 Oct 2025 01:25:27 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="28422144" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/a755ea53-66f6-46d4-b984-504ecbf3e97d/episode.mp3" />
  <itunes:title><![CDATA[Inside Vinted’s Code-Generated Airflow Pipelines with Oscar Ligthart and Rodrigo Loredo]]></itunes:title>
  <itunes:duration>29:36</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The shift from monolithic to decentralized data workflows changes how teams build, connect and scale pipelines.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we feature </span><a href="https://www.linkedin.com/in/oscar-ligthart/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Oscar Ligthart</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Lead Data Engineer, and </span><a href="https://www.linkedin.com/in/rodrigo-loredo-410a16134/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Rodrigo Loredo</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Lead Analytics Engineer, both at </span><a href="https://www.linkedin.com/company/vinted/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Vinted</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, as we unpack their YAML-driven abstraction that generates Airflow DAGs and standardizes cross-team orchestration.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">05:28 Challenges of decentralization.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">06:45 YAML-based generator standardizes pipelines and dependencies.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">12:28 Declarative assets and sensors align cross-DAG dependencies.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">17:29 Task-level callbacks enable auto-recovery and clear ownership.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">21:39 Standardized building blocks simplify upgrades and maintenance.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">24:52 Platform focus frees domain work.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">26:49 Container-only standardization prevents sprawl.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/oscar-ligthart/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Oscar Ligthart</a></p><p>https://www.linkedin.com/in/oscar-ligthart/</p><p><br></p><p><a href="https://www.linkedin.com/in/rodrigo-loredo-410a16134/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Rodrigo Loredo</a></p><p>https://www.linkedin.com/in/rodrigo-loredo-410a16134/</p><p><br></p><p><a href="https://www.linkedin.com/company/vinted/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Vinted</a> | LinkedIn</p><p>https://www.linkedin.com/company/vinted/</p><p><br></p><p><a href="https://www.vinted.com/?srsltid=AfmBOor87MGR_eLOauCO93V9A-aLDaAhGYx9cnu_oN8s1SAXMlCRuhW7" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Vinted</a> | Website</p><p>https://www.vinted.com/?srsltid=AfmBOor87MGR_eLOauCO93V9A-aLDaAhGYx9cnu_oN8s1SAXMlCRuhW7</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://kubernetes.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a></p><p>https://kubernetes.io/</p><p><br></p><p><a href="https://www.getdbt.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbt</a></p><p>https://www.getdbt.com/</p><p><br></p><p><a href="https://cloud.google.com/vertex-ai" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Google Cloud Vertex AI</a></p><p>https://cloud.google.com/vertex-ai</p><p><br></p><p><a href="https://www.astronomer.io/docs/learn/airflow-datasets" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Datasets &amp; Assets (concepts)</a></p><p>https://www.astronomer.io/docs/learn/airflow-datasets</p><p><br></p><p><a href="https://airflowsummit.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Summit</a></p><p>https://airflowsummit.org/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The shift from monolithic to decentralized data workflows changes how teams build, connect and scale pipelines.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we feature </span><a href="https://www.linkedin.com/in/oscar-ligthart/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Oscar Ligthart</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Lead Data Engineer, and </span><a href="https://www.linkedin.com/in/rodrigo-loredo-410a16134/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Rodrigo Loredo</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Lead Analytics Engineer, both at </span><a href="https://www.linkedin.com/company/vinted/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Vinted</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, as we unpack their YAML-driven abstraction that generates Airflow DAGs and standardizes cross-team orchestration.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">05:28 Challenges of decentralization.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">06:45 YAML-based generator standardizes pipelines and dependencies.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">12:28 Declarative assets and sensors align cross-DAG dependencies.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">17:29 Task-level callbacks enable auto-recovery and clear ownership.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">21:39 Standardized building blocks simplify upgrades and maintenance.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">24:52 Platform focus frees domain work.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">26:49 Container-only standardization prevents sprawl.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/oscar-ligthart/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Oscar Ligthart</a></p><p>https://www.linkedin.com/in/oscar-ligthart/</p><p><br></p><p><a href="https://www.linkedin.com/in/rodrigo-loredo-410a16134/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Rodrigo Loredo</a></p><p>https://www.linkedin.com/in/rodrigo-loredo-410a16134/</p><p><br></p><p><a href="https://www.linkedin.com/company/vinted/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Vinted</a> | LinkedIn</p><p>https://www.linkedin.com/company/vinted/</p><p><br></p><p><a href="https://www.vinted.com/?srsltid=AfmBOor87MGR_eLOauCO93V9A-aLDaAhGYx9cnu_oN8s1SAXMlCRuhW7" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Vinted</a> | Website</p><p>https://www.vinted.com/?srsltid=AfmBOor87MGR_eLOauCO93V9A-aLDaAhGYx9cnu_oN8s1SAXMlCRuhW7</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://kubernetes.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a></p><p>https://kubernetes.io/</p><p><br></p><p><a href="https://www.getdbt.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbt</a></p><p>https://www.getdbt.com/</p><p><br></p><p><a href="https://cloud.google.com/vertex-ai" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Google Cloud Vertex AI</a></p><p>https://cloud.google.com/vertex-ai</p><p><br></p><p><a href="https://www.astronomer.io/docs/learn/airflow-datasets" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Datasets &amp; Assets (concepts)</a></p><p>https://www.astronomer.io/docs/learn/airflow-datasets</p><p><br></p><p><a href="https://airflowsummit.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Summit</a></p><p>https://airflowsummit.org/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[The shift from monolithic to decentralized data workflows changes how teams build, connect and scale pipelines.In this episode, we feature Oscar Ligthart, Lead Data Engineer, and Rodrigo Loredo, Lead Analytics Engineer, both at Vinted, as we unpack...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>60</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[84d317ee-d936-4e33-bcf0-8cf29c6810af]]></guid>
  <title><![CDATA[Transforming Data Pipelines at XENA Intelligence with Naseem Shah]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The shift from simple cron jobs to orchestrated AI-powered workflows is reshaping how startups scale. For a small team, these transitions come with unique challenges and big opportunities.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/naseemshah/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Naseem Shah</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Head of Engineering at </span><a href="https://www.linkedin.com/company/xena-intelligence/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Xena Intelligence</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, shares how he built data pipelines from scratch, adopted Apache Airflow and transformed Amazon review analysis with LLMs.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">03:28 The importance of building initial products that support growth and investment.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">06:16 The process of adopting new tools to improve reliability and efficiency.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">09:29 Approaches to learning complex technologies through practice and fundamentals.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">13:57 Trade-offs small teams face when balancing performance and costs.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">18:40 Using AI-driven approaches to generate insights from large datasets.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">22:38 How unstructured data can be transformed into actionable information.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">25:55 Moving from manual tasks to fully automated workflows.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">28:05 Orchestration as a foundation for scaling advanced use cases.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/naseemshah/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Naseem Shah</a></p><p>https://www.linkedin.com/in/naseemshah/</p><p><br></p><p><a href="https://www.linkedin.com/company/xena-intelligence/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Xena Intelligence</a> | LinkedIn</p><p>https://www.linkedin.com/company/xena-intelligence/</p><p><br></p><p><a href="https://xenaintelligence.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Xena Intelligence</a> | Website</p><p>https://xenaintelligence.com/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://cloud.google.com/composer" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Google Cloud Composer</a></p><p>https://cloud.google.com/composer</p><p><br></p><p><a href="https://www.techstars.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Techstars</a></p><p>https://www.techstars.com/</p><p><br></p><p><a href="https://www.docker.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Docker</a></p><p>https://www.docker.com/</p><p><br></p><p><a href="https://aws.amazon.com/sqs/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">AWS SQS</a></p><p>https://aws.amazon.com/sqs/</p><p><br></p><p><a href="https://www.postgresql.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">PostgreSQL</a></p><p>https://www.postgresql.org/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/059c559e-278c-49e8-9f1a-2fb768d19c77/55e46a8d92.jpg" />
  <pubDate>Thu, 16 Oct 2025 05:36:36 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="27406918" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/059c559e-278c-49e8-9f1a-2fb768d19c77/episode.mp3" />
  <itunes:title><![CDATA[Transforming Data Pipelines at XENA Intelligence with Naseem Shah]]></itunes:title>
  <itunes:duration>28:32</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The shift from simple cron jobs to orchestrated AI-powered workflows is reshaping how startups scale. For a small team, these transitions come with unique challenges and big opportunities.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/naseemshah/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Naseem Shah</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Head of Engineering at </span><a href="https://www.linkedin.com/company/xena-intelligence/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Xena Intelligence</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, shares how he built data pipelines from scratch, adopted Apache Airflow and transformed Amazon review analysis with LLMs.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">03:28 The importance of building initial products that support growth and investment.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">06:16 The process of adopting new tools to improve reliability and efficiency.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">09:29 Approaches to learning complex technologies through practice and fundamentals.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">13:57 Trade-offs small teams face when balancing performance and costs.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">18:40 Using AI-driven approaches to generate insights from large datasets.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">22:38 How unstructured data can be transformed into actionable information.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">25:55 Moving from manual tasks to fully automated workflows.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">28:05 Orchestration as a foundation for scaling advanced use cases.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/naseemshah/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Naseem Shah</a></p><p>https://www.linkedin.com/in/naseemshah/</p><p><br></p><p><a href="https://www.linkedin.com/company/xena-intelligence/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Xena Intelligence</a> | LinkedIn</p><p>https://www.linkedin.com/company/xena-intelligence/</p><p><br></p><p><a href="https://xenaintelligence.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Xena Intelligence</a> | Website</p><p>https://xenaintelligence.com/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://cloud.google.com/composer" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Google Cloud Composer</a></p><p>https://cloud.google.com/composer</p><p><br></p><p><a href="https://www.techstars.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Techstars</a></p><p>https://www.techstars.com/</p><p><br></p><p><a href="https://www.docker.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Docker</a></p><p>https://www.docker.com/</p><p><br></p><p><a href="https://aws.amazon.com/sqs/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">AWS SQS</a></p><p>https://aws.amazon.com/sqs/</p><p><br></p><p><a href="https://www.postgresql.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">PostgreSQL</a></p><p>https://www.postgresql.org/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The shift from simple cron jobs to orchestrated AI-powered workflows is reshaping how startups scale. For a small team, these transitions come with unique challenges and big opportunities.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/naseemshah/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Naseem Shah</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Head of Engineering at </span><a href="https://www.linkedin.com/company/xena-intelligence/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Xena Intelligence</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, shares how he built data pipelines from scratch, adopted Apache Airflow and transformed Amazon review analysis with LLMs.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">03:28 The importance of building initial products that support growth and investment.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">06:16 The process of adopting new tools to improve reliability and efficiency.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">09:29 Approaches to learning complex technologies through practice and fundamentals.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">13:57 Trade-offs small teams face when balancing performance and costs.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">18:40 Using AI-driven approaches to generate insights from large datasets.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">22:38 How unstructured data can be transformed into actionable information.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">25:55 Moving from manual tasks to fully automated workflows.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">28:05 Orchestration as a foundation for scaling advanced use cases.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/naseemshah/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Naseem Shah</a></p><p>https://www.linkedin.com/in/naseemshah/</p><p><br></p><p><a href="https://www.linkedin.com/company/xena-intelligence/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Xena Intelligence</a> | LinkedIn</p><p>https://www.linkedin.com/company/xena-intelligence/</p><p><br></p><p><a href="https://xenaintelligence.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Xena Intelligence</a> | Website</p><p>https://xenaintelligence.com/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://cloud.google.com/composer" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Google Cloud Composer</a></p><p>https://cloud.google.com/composer</p><p><br></p><p><a href="https://www.techstars.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Techstars</a></p><p>https://www.techstars.com/</p><p><br></p><p><a href="https://www.docker.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Docker</a></p><p>https://www.docker.com/</p><p><br></p><p><a href="https://aws.amazon.com/sqs/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">AWS SQS</a></p><p>https://aws.amazon.com/sqs/</p><p><br></p><p><a href="https://www.postgresql.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">PostgreSQL</a></p><p>https://www.postgresql.org/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[The shift from simple cron jobs to orchestrated AI-powered workflows is reshaping how startups scale. For a small team, these transitions come with unique challenges and big opportunities.In this episode, Naseem Shah, Head of Engineering at Xena In...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>59</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[1c54c88a-ca83-4431-b6a4-6bda51130ea6]]></guid>
  <title><![CDATA[Scaling Geospatial Workflows With Airflow at Overture Maps Foundation and Wherobots with Alex Iannicelli and Daniel Smith]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Using Airflow to orchestrate geospatial data pipelines unlocks powerful efficiencies for data teams. The combination of scalable processing and visual observability streamlines workflows, reduces costs and improves iteration speed.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/atiannicelli/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Alex Iannicelli</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Staff Software Engineer at </span><a href="https://www.linkedin.com/company/overture-maps-foundation/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Overture Maps Foundation</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, and </span><a href="https://www.linkedin.com/in/daniel-smith-analyst/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Daniel Smith</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Solutions Architect at </span><a href="https://www.linkedin.com/company/wherobots" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Wherobots</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, join us to discuss leveraging Apache Airflow and Apache Sedona to process massive geospatial datasets, build reproducible pipelines and orchestrate complex workflows across platforms.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">03:22 How merging multiple data sources supports comprehensive datasets.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">04:20 The value of flexible configurations for running pipelines on different platforms.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">06:35 Why orchestration tools are essential for handling continuous data streams.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">09:45 The importance of observability for monitoring progress and troubleshooting issues.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">11:30 Strategies for processing large, complex datasets efficiently.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">13:27 Expanding orchestration beyond core pipelines to automate frequent tasks.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">17:02 Advantages of using open-source operators to simplify integration and deployment.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">20:32 Desired improvements in orchestration tools for usability and workflow management.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/atiannicelli/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Alex Iannicelli</a></p><p>https://www.linkedin.com/in/atiannicelli/</p><p><br></p><p><a href="https://www.linkedin.com/company/overture-maps-foundation/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Overture Maps Foundation</a> | LinkedIn</p><p>https://www.linkedin.com/company/overture-maps-foundation/</p><p><br></p><p><a href="https://overturemaps.org" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Overture Maps Foundation</a> | Website</p><p>https://overturemaps.org</p><p><br></p><p><a href="https://www.linkedin.com/in/daniel-smith-analyst/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Daniel Smith</a></p><p>https://www.linkedin.com/in/daniel-smith-analyst/</p><p><br></p><p><a href="https://www.linkedin.com/company/wherobots" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Wherobots</a> | LinkedIn</p><p>https://www.linkedin.com/company/wherobots</p><p><br></p><p><a href="https://www.wherobots.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Wherobots</a> | Website</p><p>https://www.wherobots.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://sedona.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Sedona</a></p><p>https://sedona.apache.org/</p><p><br></p><p><a href="https://github.com/wherobots/airflow-providers-wherobots" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Github repo</a></p><p>https://github.com/wherobots/airflow-providers-wherobots</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/ee50b2df-0094-40cb-aa00-08fd510e4f3a/bf2facf316.jpg" />
  <pubDate>Thu, 09 Oct 2025 00:00:00 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="23099431" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/ee50b2df-0094-40cb-aa00-08fd510e4f3a/episode.mp3" />
  <itunes:title><![CDATA[Scaling Geospatial Workflows With Airflow at Overture Maps Foundation and Wherobots with Alex Iannicelli and Daniel Smith]]></itunes:title>
  <itunes:duration>24:03</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Using Airflow to orchestrate geospatial data pipelines unlocks powerful efficiencies for data teams. The combination of scalable processing and visual observability streamlines workflows, reduces costs and improves iteration speed.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/atiannicelli/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Alex Iannicelli</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Staff Software Engineer at </span><a href="https://www.linkedin.com/company/overture-maps-foundation/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Overture Maps Foundation</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, and </span><a href="https://www.linkedin.com/in/daniel-smith-analyst/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Daniel Smith</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Solutions Architect at </span><a href="https://www.linkedin.com/company/wherobots" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Wherobots</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, join us to discuss leveraging Apache Airflow and Apache Sedona to process massive geospatial datasets, build reproducible pipelines and orchestrate complex workflows across platforms.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">03:22 How merging multiple data sources supports comprehensive datasets.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">04:20 The value of flexible configurations for running pipelines on different platforms.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">06:35 Why orchestration tools are essential for handling continuous data streams.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">09:45 The importance of observability for monitoring progress and troubleshooting issues.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">11:30 Strategies for processing large, complex datasets efficiently.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">13:27 Expanding orchestration beyond core pipelines to automate frequent tasks.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">17:02 Advantages of using open-source operators to simplify integration and deployment.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">20:32 Desired improvements in orchestration tools for usability and workflow management.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/atiannicelli/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Alex Iannicelli</a></p><p>https://www.linkedin.com/in/atiannicelli/</p><p><br></p><p><a href="https://www.linkedin.com/company/overture-maps-foundation/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Overture Maps Foundation</a> | LinkedIn</p><p>https://www.linkedin.com/company/overture-maps-foundation/</p><p><br></p><p><a href="https://overturemaps.org" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Overture Maps Foundation</a> | Website</p><p>https://overturemaps.org</p><p><br></p><p><a href="https://www.linkedin.com/in/daniel-smith-analyst/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Daniel Smith</a></p><p>https://www.linkedin.com/in/daniel-smith-analyst/</p><p><br></p><p><a href="https://www.linkedin.com/company/wherobots" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Wherobots</a> | LinkedIn</p><p>https://www.linkedin.com/company/wherobots</p><p><br></p><p><a href="https://www.wherobots.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Wherobots</a> | Website</p><p>https://www.wherobots.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://sedona.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Sedona</a></p><p>https://sedona.apache.org/</p><p><br></p><p><a href="https://github.com/wherobots/airflow-providers-wherobots" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Github repo</a></p><p>https://github.com/wherobots/airflow-providers-wherobots</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Using Airflow to orchestrate geospatial data pipelines unlocks powerful efficiencies for data teams. The combination of scalable processing and visual observability streamlines workflows, reduces costs and improves iteration speed.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/atiannicelli/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Alex Iannicelli</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Staff Software Engineer at </span><a href="https://www.linkedin.com/company/overture-maps-foundation/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Overture Maps Foundation</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, and </span><a href="https://www.linkedin.com/in/daniel-smith-analyst/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Daniel Smith</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Solutions Architect at </span><a href="https://www.linkedin.com/company/wherobots" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Wherobots</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, join us to discuss leveraging Apache Airflow and Apache Sedona to process massive geospatial datasets, build reproducible pipelines and orchestrate complex workflows across platforms.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">03:22 How merging multiple data sources supports comprehensive datasets.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">04:20 The value of flexible configurations for running pipelines on different platforms.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">06:35 Why orchestration tools are essential for handling continuous data streams.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">09:45 The importance of observability for monitoring progress and troubleshooting issues.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">11:30 Strategies for processing large, complex datasets efficiently.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">13:27 Expanding orchestration beyond core pipelines to automate frequent tasks.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">17:02 Advantages of using open-source operators to simplify integration and deployment.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">20:32 Desired improvements in orchestration tools for usability and workflow management.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/atiannicelli/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Alex Iannicelli</a></p><p>https://www.linkedin.com/in/atiannicelli/</p><p><br></p><p><a href="https://www.linkedin.com/company/overture-maps-foundation/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Overture Maps Foundation</a> | LinkedIn</p><p>https://www.linkedin.com/company/overture-maps-foundation/</p><p><br></p><p><a href="https://overturemaps.org" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Overture Maps Foundation</a> | Website</p><p>https://overturemaps.org</p><p><br></p><p><a href="https://www.linkedin.com/in/daniel-smith-analyst/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Daniel Smith</a></p><p>https://www.linkedin.com/in/daniel-smith-analyst/</p><p><br></p><p><a href="https://www.linkedin.com/company/wherobots" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Wherobots</a> | LinkedIn</p><p>https://www.linkedin.com/company/wherobots</p><p><br></p><p><a href="https://www.wherobots.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Wherobots</a> | Website</p><p>https://www.wherobots.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://sedona.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Sedona</a></p><p>https://sedona.apache.org/</p><p><br></p><p><a href="https://github.com/wherobots/airflow-providers-wherobots" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Github repo</a></p><p>https://github.com/wherobots/airflow-providers-wherobots</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Using Airflow to orchestrate geospatial data pipelines unlocks powerful efficiencies for data teams. The combination of scalable processing and visual observability streamlines workflows, reduces costs and improves iteration speed.In this episode, ...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>58</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[05c8a950-867a-4eaf-9301-e74cc65f00fc]]></guid>
  <title><![CDATA[Scaling Airflow for Enterprise Data Platforms at PepsiCo with Kunal Bhattacharya]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">PepsiCo’s data platform drives insights across finance, marketing and data science. Delivering stability, scalability and developer delight is central to its success, and engineering leadership plays a key role in making this possible.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/kunaljubce/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kunal Bhattacharya</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Manager of Data Platform Engineering at </span><a href="https://www.linkedin.com/company/pepsico/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">PepsiCo</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, shares how his team manages Airflow at scale while ensuring security, performance and cost efficiency.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">02:31 Enabling developer delight by extending platform capabilities.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">03:56 Role of Snowflake, dbt and Airflow in PepsiCo’s data stack.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">06:10 Local developer environments built using official Airflow Helm charts.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">07:13 Pre-staging and PR environments as testing playgrounds.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">08:08 Automating labeling and resource allocation via DAG factories.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">12:16 Cost optimization through pod labeling and Datadog insights.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:01 Isolating dbt engines to improve performance across teams.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">16:12 Wishlist for Airflow 3: Improved role-based grants and database modeling.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/kunaljubce/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kunal Bhattacharya</a></p><p>https://www.linkedin.com/in/kunaljubce/</p><p><br></p><p><a href="https://www.linkedin.com/company/pepsico/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">PepsiCo</a> | LinkedIn</p><p>https://www.linkedin.com/company/pepsico/</p><p><br></p><p><a href="https://www.pepsico.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">PepsiCo</a> | Website</p><p>https://www.pepsico.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.snowflake.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Snowflake</a></p><p>https://www.snowflake.com</p><p><br></p><p><a href="https://www.getdbt.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbt</a></p><p>https://www.getdbt.com</p><p><br></p><p><a href="https://kubernetes.io" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a></p><p>https://kubernetes.io</p><p><br></p><p><a href="https://greatexpectations.io" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Great Expectations</a></p><p>https://greatexpectations.io</p><p><br></p><p><a href="https://www.montecarlodata.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Monte Carlo</a></p><p>https://www.montecarlodata.com</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/d80ba09d-a3d3-4315-ba89-da059e19c985/93b4020f5a.jpg" />
  <pubDate>Thu, 02 Oct 2025 00:00:00 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="18317142" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/d80ba09d-a3d3-4315-ba89-da059e19c985/episode.mp3" />
  <itunes:title><![CDATA[Scaling Airflow for Enterprise Data Platforms at PepsiCo with Kunal Bhattacharya]]></itunes:title>
  <itunes:duration>19:04</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">PepsiCo’s data platform drives insights across finance, marketing and data science. Delivering stability, scalability and developer delight is central to its success, and engineering leadership plays a key role in making this possible.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/kunaljubce/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kunal Bhattacharya</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Manager of Data Platform Engineering at </span><a href="https://www.linkedin.com/company/pepsico/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">PepsiCo</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, shares how his team manages Airflow at scale while ensuring security, performance and cost efficiency.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">02:31 Enabling developer delight by extending platform capabilities.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">03:56 Role of Snowflake, dbt and Airflow in PepsiCo’s data stack.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">06:10 Local developer environments built using official Airflow Helm charts.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">07:13 Pre-staging and PR environments as testing playgrounds.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">08:08 Automating labeling and resource allocation via DAG factories.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">12:16 Cost optimization through pod labeling and Datadog insights.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:01 Isolating dbt engines to improve performance across teams.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">16:12 Wishlist for Airflow 3: Improved role-based grants and database modeling.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/kunaljubce/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kunal Bhattacharya</a></p><p>https://www.linkedin.com/in/kunaljubce/</p><p><br></p><p><a href="https://www.linkedin.com/company/pepsico/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">PepsiCo</a> | LinkedIn</p><p>https://www.linkedin.com/company/pepsico/</p><p><br></p><p><a href="https://www.pepsico.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">PepsiCo</a> | Website</p><p>https://www.pepsico.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.snowflake.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Snowflake</a></p><p>https://www.snowflake.com</p><p><br></p><p><a href="https://www.getdbt.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbt</a></p><p>https://www.getdbt.com</p><p><br></p><p><a href="https://kubernetes.io" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a></p><p>https://kubernetes.io</p><p><br></p><p><a href="https://greatexpectations.io" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Great Expectations</a></p><p>https://greatexpectations.io</p><p><br></p><p><a href="https://www.montecarlodata.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Monte Carlo</a></p><p>https://www.montecarlodata.com</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">PepsiCo’s data platform drives insights across finance, marketing and data science. Delivering stability, scalability and developer delight is central to its success, and engineering leadership plays a key role in making this possible.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/kunaljubce/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kunal Bhattacharya</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Manager of Data Platform Engineering at </span><a href="https://www.linkedin.com/company/pepsico/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">PepsiCo</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, shares how his team manages Airflow at scale while ensuring security, performance and cost efficiency.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">02:31 Enabling developer delight by extending platform capabilities.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">03:56 Role of Snowflake, dbt and Airflow in PepsiCo’s data stack.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">06:10 Local developer environments built using official Airflow Helm charts.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">07:13 Pre-staging and PR environments as testing playgrounds.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">08:08 Automating labeling and resource allocation via DAG factories.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">12:16 Cost optimization through pod labeling and Datadog insights.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:01 Isolating dbt engines to improve performance across teams.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">16:12 Wishlist for Airflow 3: Improved role-based grants and database modeling.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/kunaljubce/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kunal Bhattacharya</a></p><p>https://www.linkedin.com/in/kunaljubce/</p><p><br></p><p><a href="https://www.linkedin.com/company/pepsico/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">PepsiCo</a> | LinkedIn</p><p>https://www.linkedin.com/company/pepsico/</p><p><br></p><p><a href="https://www.pepsico.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">PepsiCo</a> | Website</p><p>https://www.pepsico.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.snowflake.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Snowflake</a></p><p>https://www.snowflake.com</p><p><br></p><p><a href="https://www.getdbt.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbt</a></p><p>https://www.getdbt.com</p><p><br></p><p><a href="https://kubernetes.io" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a></p><p>https://kubernetes.io</p><p><br></p><p><a href="https://greatexpectations.io" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Great Expectations</a></p><p>https://greatexpectations.io</p><p><br></p><p><a href="https://www.montecarlodata.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Monte Carlo</a></p><p>https://www.montecarlodata.com</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[PepsiCo’s data platform drives insights across finance, marketing and data science. Delivering stability, scalability and developer delight is central to its success, and engineering leadership plays a key role in making this possible.In this episo...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>57</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[ceaaa1c3-3790-4e24-b9da-fe56a3a119ad]]></guid>
  <title><![CDATA[Building a Unified Data Platform at Pattern with William Graham]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The orchestration of data workflows at scale requires both flexibility and security. At Pattern, decoupling scheduling from orchestration has reshaped how data teams manage large-scale pipelines.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we are joined by </span><a href="https://www.linkedin.com/in/willgraham2/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">William Graham</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Data Engineer at </span><a href="https://www.linkedin.com/company/pattern-hq/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Pattern</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, who explains how his team leverages Apache Airflow alongside their open-source tool Heimdall to streamline scheduling, orchestration and access management.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">02:44 Structure of Pattern’s data teams across acquisition, engineering and platform.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">04:27 How Airflow became the central scheduler for batch jobs.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">08:57 Credential management challenges that led to decoupling scheduling and orchestration.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">12:21 Heimdall simplifies multi-application access through a unified interface.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">13:15 Standardized operators in Airflow using Heimdall integration.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">17:13 Open-source contributions and early adoption of Heimdall within Pattern.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">21:01 Community support for Airflow and satisfaction with scheduling flexibility.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/willgraham2/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">William Graham</a></p><p>https://www.linkedin.com/in/willgraham2/</p><p><br></p><p><a href="https://www.linkedin.com/company/pattern-hq/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Pattern</a> | LinkedIn</p><p>https://www.linkedin.com/company/pattern-hq/</p><p><br></p><p><a href="https://pattern.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Pattern</a> | Website</p><p>https://pattern.com</p><p><br></p><p><a href="https://airflow.apache.org" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org</p><p><br></p><p><a href="https://github.com/Rev4N1/Heimdall" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Heimdall on GitHub</a></p><p>https://github.com/patterninc/heimdall</p><p><br></p><p><a href="https://netflix.github.io/genie/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Netflix Genie</a></p><p>https://netflix.github.io/genie/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/7faab114-f1f2-49ba-9b51-52a46508161c/de4d93e215.jpg" />
  <pubDate>Thu, 25 Sep 2025 00:00:00 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="23185948" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/7faab114-f1f2-49ba-9b51-52a46508161c/episode.mp3" />
  <itunes:title><![CDATA[Building a Unified Data Platform at Pattern with William Graham]]></itunes:title>
  <itunes:duration>24:09</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The orchestration of data workflows at scale requires both flexibility and security. At Pattern, decoupling scheduling from orchestration has reshaped how data teams manage large-scale pipelines.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we are joined by </span><a href="https://www.linkedin.com/in/willgraham2/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">William Graham</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Data Engineer at </span><a href="https://www.linkedin.com/company/pattern-hq/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Pattern</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, who explains how his team leverages Apache Airflow alongside their open-source tool Heimdall to streamline scheduling, orchestration and access management.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">02:44 Structure of Pattern’s data teams across acquisition, engineering and platform.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">04:27 How Airflow became the central scheduler for batch jobs.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">08:57 Credential management challenges that led to decoupling scheduling and orchestration.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">12:21 Heimdall simplifies multi-application access through a unified interface.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">13:15 Standardized operators in Airflow using Heimdall integration.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">17:13 Open-source contributions and early adoption of Heimdall within Pattern.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">21:01 Community support for Airflow and satisfaction with scheduling flexibility.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/willgraham2/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">William Graham</a></p><p>https://www.linkedin.com/in/willgraham2/</p><p><br></p><p><a href="https://www.linkedin.com/company/pattern-hq/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Pattern</a> | LinkedIn</p><p>https://www.linkedin.com/company/pattern-hq/</p><p><br></p><p><a href="https://pattern.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Pattern</a> | Website</p><p>https://pattern.com</p><p><br></p><p><a href="https://airflow.apache.org" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org</p><p><br></p><p><a href="https://github.com/Rev4N1/Heimdall" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Heimdall on GitHub</a></p><p>https://github.com/patterninc/heimdall</p><p><br></p><p><a href="https://netflix.github.io/genie/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Netflix Genie</a></p><p>https://netflix.github.io/genie/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The orchestration of data workflows at scale requires both flexibility and security. At Pattern, decoupling scheduling from orchestration has reshaped how data teams manage large-scale pipelines.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we are joined by </span><a href="https://www.linkedin.com/in/willgraham2/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">William Graham</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Data Engineer at </span><a href="https://www.linkedin.com/company/pattern-hq/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Pattern</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, who explains how his team leverages Apache Airflow alongside their open-source tool Heimdall to streamline scheduling, orchestration and access management.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">02:44 Structure of Pattern’s data teams across acquisition, engineering and platform.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">04:27 How Airflow became the central scheduler for batch jobs.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">08:57 Credential management challenges that led to decoupling scheduling and orchestration.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">12:21 Heimdall simplifies multi-application access through a unified interface.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">13:15 Standardized operators in Airflow using Heimdall integration.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">17:13 Open-source contributions and early adoption of Heimdall within Pattern.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">21:01 Community support for Airflow and satisfaction with scheduling flexibility.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/willgraham2/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">William Graham</a></p><p>https://www.linkedin.com/in/willgraham2/</p><p><br></p><p><a href="https://www.linkedin.com/company/pattern-hq/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Pattern</a> | LinkedIn</p><p>https://www.linkedin.com/company/pattern-hq/</p><p><br></p><p><a href="https://pattern.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Pattern</a> | Website</p><p>https://pattern.com</p><p><br></p><p><a href="https://airflow.apache.org" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org</p><p><br></p><p><a href="https://github.com/Rev4N1/Heimdall" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Heimdall on GitHub</a></p><p>https://github.com/patterninc/heimdall</p><p><br></p><p><a href="https://netflix.github.io/genie/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Netflix Genie</a></p><p>https://netflix.github.io/genie/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[The orchestration of data workflows at scale requires both flexibility and security. At Pattern, decoupling scheduling from orchestration has reshaped how data teams manage large-scale pipelines.In this episode, we are joined by William Graham, Sen...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>56</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[b509a1de-383c-4b68-b2e0-c0753df241f0]]></guid>
  <title><![CDATA[How Astronomer Turns Proactive Monitoring Into Customer Success with Collin McNulty]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The evolution of Airflow continues to shape data orchestration and monitoring strategies. Leveraging it beyond traditional ETL use cases opens powerful new possibilities for proactive support and internal operations.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we are joined by </span><a href="https://www.linkedin.com/in/collin-mcnulty/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Collin McNulty</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Sr. Director of Global Support at </span><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, who shares insights from his journey into data engineering and the lessons learned from leading Astronomer’s Customer Reliability Engineering (CRE) team.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">03:07 Lessons learned in adapting to major platform transitions.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">05:18 How proactive monitoring improves reliability and customer experience.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">08:10 Using automation to enhance internal support processes.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">12:09 Why keeping systems current helps avoid unnecessary issues.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">15:14 Approaches that strengthen system reliability and efficiency.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">18:46 Best practices for simplifying complex orchestration dependencies.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">23:24 Anticipated innovations that expand orchestration capabilities.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/collin-mcnulty/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Collin McNulty</a></p><p>https://www.linkedin.com/in/collin-mcnulty/</p><p><br></p><p><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> | LinkedIn</p><p>https://www.linkedin.com/company/astronomer/</p><p><br></p><p><a href="https://www.astronomer.io" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> | Website</p><p>https://www.astronomer.io</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://prometheus.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Prometheus</a></p><p>https://prometheus.io/</p><p><br></p><p><a href="https://www.splunk.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Splunk</a></p><p>https://www.splunk.com/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/e475903c-28a4-4111-bc28-fa60b4d5b2c0/d01ede0b98.jpg" />
  <pubDate>Thu, 18 Sep 2025 00:00:00 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="24545988" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/e475903c-28a4-4111-bc28-fa60b4d5b2c0/episode.mp3" />
  <itunes:title><![CDATA[How Astronomer Turns Proactive Monitoring Into Customer Success with Collin McNulty]]></itunes:title>
  <itunes:duration>25:34</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The evolution of Airflow continues to shape data orchestration and monitoring strategies. Leveraging it beyond traditional ETL use cases opens powerful new possibilities for proactive support and internal operations.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we are joined by </span><a href="https://www.linkedin.com/in/collin-mcnulty/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Collin McNulty</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Sr. Director of Global Support at </span><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, who shares insights from his journey into data engineering and the lessons learned from leading Astronomer’s Customer Reliability Engineering (CRE) team.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">03:07 Lessons learned in adapting to major platform transitions.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">05:18 How proactive monitoring improves reliability and customer experience.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">08:10 Using automation to enhance internal support processes.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">12:09 Why keeping systems current helps avoid unnecessary issues.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">15:14 Approaches that strengthen system reliability and efficiency.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">18:46 Best practices for simplifying complex orchestration dependencies.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">23:24 Anticipated innovations that expand orchestration capabilities.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/collin-mcnulty/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Collin McNulty</a></p><p>https://www.linkedin.com/in/collin-mcnulty/</p><p><br></p><p><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> | LinkedIn</p><p>https://www.linkedin.com/company/astronomer/</p><p><br></p><p><a href="https://www.astronomer.io" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> | Website</p><p>https://www.astronomer.io</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://prometheus.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Prometheus</a></p><p>https://prometheus.io/</p><p><br></p><p><a href="https://www.splunk.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Splunk</a></p><p>https://www.splunk.com/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The evolution of Airflow continues to shape data orchestration and monitoring strategies. Leveraging it beyond traditional ETL use cases opens powerful new possibilities for proactive support and internal operations.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we are joined by </span><a href="https://www.linkedin.com/in/collin-mcnulty/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Collin McNulty</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Sr. Director of Global Support at </span><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, who shares insights from his journey into data engineering and the lessons learned from leading Astronomer’s Customer Reliability Engineering (CRE) team.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">03:07 Lessons learned in adapting to major platform transitions.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">05:18 How proactive monitoring improves reliability and customer experience.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">08:10 Using automation to enhance internal support processes.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">12:09 Why keeping systems current helps avoid unnecessary issues.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">15:14 Approaches that strengthen system reliability and efficiency.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">18:46 Best practices for simplifying complex orchestration dependencies.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">23:24 Anticipated innovations that expand orchestration capabilities.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/collin-mcnulty/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Collin McNulty</a></p><p>https://www.linkedin.com/in/collin-mcnulty/</p><p><br></p><p><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> | LinkedIn</p><p>https://www.linkedin.com/company/astronomer/</p><p><br></p><p><a href="https://www.astronomer.io" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> | Website</p><p>https://www.astronomer.io</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://prometheus.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Prometheus</a></p><p>https://prometheus.io/</p><p><br></p><p><a href="https://www.splunk.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Splunk</a></p><p>https://www.splunk.com/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[The evolution of Airflow continues to shape data orchestration and monitoring strategies. Leveraging it beyond traditional ETL use cases opens powerful new possibilities for proactive support and internal operations.In this episode, we are joined b...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>55</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[0eb6542e-4481-4bad-bf4a-90005cfe9cac]]></guid>
  <title><![CDATA[Overcoming Data Engineering Challenges at Daiichi Sankyo Europe GmbH with Evgenii Prusov]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The shift to a unified data platform is reshaping how pharmaceutical companies manage and orchestrate data. Establishing standards across regions and teams ensures scalability and efficiency in handling large-scale analytics.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/prusov/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Evgenii Prusov</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Data Platform Engineer of </span><a href="https://www.linkedin.com/company/daiichi-sankyo-europe-gmbh/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Daiichi Sankyo Europe GmbH</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins us to discuss building and scaling a centralized data platform with Airflow and Astronomer.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">02:49 Building a centralized data platform for 15 European countries.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">05:19 Adopting SaaS to manage Airflow from day one.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">07:01 Leveraging Airflow for data orchestration across products.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">08:16 Teaching non-Python users how to work with Airflow is challenging.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">12:25 Creating a global data community across Europe, the US and Japan.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:04 Monthly calls help share knowledge and align regional teams.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">15:47 Contributing to the open-source Airflow project as a way to deepen expertise.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">16:32 Desire for more guidelines, debugging tutorials and testing best practices in Airflow.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:&nbsp;</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/prusov/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Evgenii Prusov</a></p><p>https://www.linkedin.com/in/prusov/</p><p><br></p><p><a href="https://www.linkedin.com/company/daiichi-sankyo-europe-gmbh/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Daiichi Sankyo Europe GmbH</a> | LinkedIn</p><p>https://www.linkedin.com/company/daiichi-sankyo-europe-gmbh/</p><p><br></p><p><a href="https://www.daiichi-sankyo.eu" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Daiichi Sankyo Europe GmbH</a> | Website</p><p>https://www.daiichi-sankyo.eu</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a></p><p>https://www.astronomer.io/</p><p><br></p><p><a href="https://www.snowflake.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Snowflake</a></p><p>https://www.snowflake.com/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/f74b2b8e-0a5d-4819-8ff8-d43296d41979/7221495a21.jpg" />
  <pubDate>Thu, 11 Sep 2025 06:58:49 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="18667810" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/f74b2b8e-0a5d-4819-8ff8-d43296d41979/episode.mp3" />
  <itunes:title><![CDATA[Overcoming Data Engineering Challenges at Daiichi Sankyo Europe GmbH with Evgenii Prusov]]></itunes:title>
  <itunes:duration>19:26</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The shift to a unified data platform is reshaping how pharmaceutical companies manage and orchestrate data. Establishing standards across regions and teams ensures scalability and efficiency in handling large-scale analytics.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/prusov/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Evgenii Prusov</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Data Platform Engineer of </span><a href="https://www.linkedin.com/company/daiichi-sankyo-europe-gmbh/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Daiichi Sankyo Europe GmbH</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins us to discuss building and scaling a centralized data platform with Airflow and Astronomer.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">02:49 Building a centralized data platform for 15 European countries.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">05:19 Adopting SaaS to manage Airflow from day one.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">07:01 Leveraging Airflow for data orchestration across products.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">08:16 Teaching non-Python users how to work with Airflow is challenging.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">12:25 Creating a global data community across Europe, the US and Japan.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:04 Monthly calls help share knowledge and align regional teams.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">15:47 Contributing to the open-source Airflow project as a way to deepen expertise.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">16:32 Desire for more guidelines, debugging tutorials and testing best practices in Airflow.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:&nbsp;</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/prusov/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Evgenii Prusov</a></p><p>https://www.linkedin.com/in/prusov/</p><p><br></p><p><a href="https://www.linkedin.com/company/daiichi-sankyo-europe-gmbh/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Daiichi Sankyo Europe GmbH</a> | LinkedIn</p><p>https://www.linkedin.com/company/daiichi-sankyo-europe-gmbh/</p><p><br></p><p><a href="https://www.daiichi-sankyo.eu" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Daiichi Sankyo Europe GmbH</a> | Website</p><p>https://www.daiichi-sankyo.eu</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a></p><p>https://www.astronomer.io/</p><p><br></p><p><a href="https://www.snowflake.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Snowflake</a></p><p>https://www.snowflake.com/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The shift to a unified data platform is reshaping how pharmaceutical companies manage and orchestrate data. Establishing standards across regions and teams ensures scalability and efficiency in handling large-scale analytics.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/prusov/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Evgenii Prusov</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Data Platform Engineer of </span><a href="https://www.linkedin.com/company/daiichi-sankyo-europe-gmbh/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Daiichi Sankyo Europe GmbH</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins us to discuss building and scaling a centralized data platform with Airflow and Astronomer.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">02:49 Building a centralized data platform for 15 European countries.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">05:19 Adopting SaaS to manage Airflow from day one.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">07:01 Leveraging Airflow for data orchestration across products.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">08:16 Teaching non-Python users how to work with Airflow is challenging.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">12:25 Creating a global data community across Europe, the US and Japan.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:04 Monthly calls help share knowledge and align regional teams.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">15:47 Contributing to the open-source Airflow project as a way to deepen expertise.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">16:32 Desire for more guidelines, debugging tutorials and testing best practices in Airflow.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:&nbsp;</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/prusov/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Evgenii Prusov</a></p><p>https://www.linkedin.com/in/prusov/</p><p><br></p><p><a href="https://www.linkedin.com/company/daiichi-sankyo-europe-gmbh/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Daiichi Sankyo Europe GmbH</a> | LinkedIn</p><p>https://www.linkedin.com/company/daiichi-sankyo-europe-gmbh/</p><p><br></p><p><a href="https://www.daiichi-sankyo.eu" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Daiichi Sankyo Europe GmbH</a> | Website</p><p>https://www.daiichi-sankyo.eu</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a></p><p>https://www.astronomer.io/</p><p><br></p><p><a href="https://www.snowflake.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Snowflake</a></p><p>https://www.snowflake.com/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[The shift to a unified data platform is reshaping how pharmaceutical companies manage and orchestrate data. Establishing standards across regions and teams ensures scalability and efficiency in handling large-scale analytics.In this episode, Evgeni...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>54</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[9c27584d-e162-4cca-9145-55a4fc628363]]></guid>
  <title><![CDATA[Building a Data-Driven Beauty and Wellness Marketplace at StyleSeat with Paschal Onuorah]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">StyleSeat is revolutionizing how beauty and wellness professionals grow their businesses through data-driven tools. From streamlining scheduling to optimizing marketing, their platform empowers professionals to focus on their craft while expanding their client base.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/onuorah-paschal/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Paschal Onuorah</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Data Engineer at </span><a href="https://www.linkedin.com/company/styleseat/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">StyleSeat</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, shares how the company leverages Airflow, dbt, and Cosmos to drive marketplace intelligence, improve client connections and deliver measurable growth for professionals.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">05:44 The role of the data engineering team in driving business success.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">08:52 Leveraging technology for real-time business intelligence.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">10:52 Data-driven strategies for improving marketing outcomes.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">13:05 How adopting the right tools can increase revenue growth.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:25 Advantages of simplifying and integrating technical workflows.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">18:45 Benefits of multi-environment configurations for development and production.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">20:17 Foundational skills and best practices for learning Airflow effectively.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">22:33 Opportunities for deeper tool integration and improved data visualization.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/onuorah-paschal/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Paschal Onuorah</a></p><p>https://www.linkedin.com/in/onuorah-paschal/</p><p><br></p><p><a href="https://www.linkedin.com/company/styleseat/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">StyleSeat</a> | LinkedIn</p><p>https://www.linkedin.com/company/styleseat/</p><p><br></p><p><a href="https://www.styleseat.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">StyleSeat</a> | Website</p><p>https://www.styleseat.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.getdbt.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbt</a></p><p>https://www.getdbt.com/</p><p><br></p><p><a href="https://www.astronomer.io/cosmos/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer Cosmos</a></p><p>https://www.astronomer.io/cosmos/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/3d5a1c6e-f3e2-46eb-b2be-4dfa55abc2c5/33c5704fd9.jpg" />
  <pubDate>Thu, 04 Sep 2025 05:54:20 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="22162784" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/3d5a1c6e-f3e2-46eb-b2be-4dfa55abc2c5/episode.mp3" />
  <itunes:title><![CDATA[Building a Data-Driven Beauty and Wellness Marketplace at StyleSeat with Paschal Onuorah]]></itunes:title>
  <itunes:duration>23:05</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">StyleSeat is revolutionizing how beauty and wellness professionals grow their businesses through data-driven tools. From streamlining scheduling to optimizing marketing, their platform empowers professionals to focus on their craft while expanding their client base.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/onuorah-paschal/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Paschal Onuorah</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Data Engineer at </span><a href="https://www.linkedin.com/company/styleseat/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">StyleSeat</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, shares how the company leverages Airflow, dbt, and Cosmos to drive marketplace intelligence, improve client connections and deliver measurable growth for professionals.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">05:44 The role of the data engineering team in driving business success.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">08:52 Leveraging technology for real-time business intelligence.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">10:52 Data-driven strategies for improving marketing outcomes.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">13:05 How adopting the right tools can increase revenue growth.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:25 Advantages of simplifying and integrating technical workflows.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">18:45 Benefits of multi-environment configurations for development and production.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">20:17 Foundational skills and best practices for learning Airflow effectively.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">22:33 Opportunities for deeper tool integration and improved data visualization.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/onuorah-paschal/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Paschal Onuorah</a></p><p>https://www.linkedin.com/in/onuorah-paschal/</p><p><br></p><p><a href="https://www.linkedin.com/company/styleseat/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">StyleSeat</a> | LinkedIn</p><p>https://www.linkedin.com/company/styleseat/</p><p><br></p><p><a href="https://www.styleseat.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">StyleSeat</a> | Website</p><p>https://www.styleseat.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.getdbt.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbt</a></p><p>https://www.getdbt.com/</p><p><br></p><p><a href="https://www.astronomer.io/cosmos/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer Cosmos</a></p><p>https://www.astronomer.io/cosmos/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">StyleSeat is revolutionizing how beauty and wellness professionals grow their businesses through data-driven tools. From streamlining scheduling to optimizing marketing, their platform empowers professionals to focus on their craft while expanding their client base.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/onuorah-paschal/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Paschal Onuorah</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Data Engineer at </span><a href="https://www.linkedin.com/company/styleseat/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">StyleSeat</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, shares how the company leverages Airflow, dbt, and Cosmos to drive marketplace intelligence, improve client connections and deliver measurable growth for professionals.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">05:44 The role of the data engineering team in driving business success.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">08:52 Leveraging technology for real-time business intelligence.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">10:52 Data-driven strategies for improving marketing outcomes.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">13:05 How adopting the right tools can increase revenue growth.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:25 Advantages of simplifying and integrating technical workflows.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">18:45 Benefits of multi-environment configurations for development and production.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">20:17 Foundational skills and best practices for learning Airflow effectively.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">22:33 Opportunities for deeper tool integration and improved data visualization.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/onuorah-paschal/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Paschal Onuorah</a></p><p>https://www.linkedin.com/in/onuorah-paschal/</p><p><br></p><p><a href="https://www.linkedin.com/company/styleseat/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">StyleSeat</a> | LinkedIn</p><p>https://www.linkedin.com/company/styleseat/</p><p><br></p><p><a href="https://www.styleseat.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">StyleSeat</a> | Website</p><p>https://www.styleseat.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.getdbt.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbt</a></p><p>https://www.getdbt.com/</p><p><br></p><p><a href="https://www.astronomer.io/cosmos/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer Cosmos</a></p><p>https://www.astronomer.io/cosmos/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[StyleSeat is revolutionizing how beauty and wellness professionals grow their businesses through data-driven tools. From streamlining scheduling to optimizing marketing, their platform empowers professionals to focus on their craft while expanding ...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>53</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[db4b7301-bab7-4734-b4ec-fa1aee1f8da5]]></guid>
  <title><![CDATA[Building the Future of Airflow Execution at Astronomer with Ian Buss and Piotr Chomiak]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The evolution of orchestration in Airflow continues with innovations that address both scalability and security. From improving executor reliability to enabling remote execution, these advancements reshape how organizations manage data pipelines.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we’re joined by </span><a href="https://www.linkedin.com/in/ian-buss/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ian Buss</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Principal Software Engineer at Astronomer, and </span><a href="https://www.linkedin.com/in/piotr-chomiak-b1955624/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Piotr Chomiak</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Principal Product Manager at </span><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, who share insights into the Astro Executor and remote execution.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">04:13 How product leadership drives scalability for enterprise needs.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">08:23 Architectural changes that improve reliability and remove bottlenecks.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">10:15 Metrics that enhance visibility into system performance.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">12:54 The role of remote execution in addressing security requirements.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">15:56 Differences between open-source solutions and managed offerings.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">19:04 Broad industry adoption and applicability of remote execution.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">20:39 Future advancements in language support and multi-tenancy.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/ian-buss/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ian Buss</a></p><p>https://www.linkedin.com/in/ian-buss/</p><p><br></p><p><a href="https://www.linkedin.com/in/piotr-chomiak-b1955624/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Piotr Chomiak</a></p><p>https://www.linkedin.com/in/piotr-chomiak-b1955624/</p><p><br></p><p><a href="https://www.astronomer.io" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> | Website</p><p>https://www.astronomer.io</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://airflow.apache.org/community/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Slack Community</a></p><p>https://airflow.apache.org/community/</p><p><br></p><p><a href="https://astronomer.io/beyond/dataflowcast" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Beyond Analytics conference</a></p><p>https://astronomer.io/beyond/dataflowcast</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/1bd9a540-3cb5-4da8-9334-da133d2ac3af/9b3b56b3b3.jpg" />
  <pubDate>Thu, 28 Aug 2025 00:00:00 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="21521635" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/1bd9a540-3cb5-4da8-9334-da133d2ac3af/episode.mp3" />
  <itunes:title><![CDATA[Building the Future of Airflow Execution at Astronomer with Ian Buss and Piotr Chomiak]]></itunes:title>
  <itunes:duration>22:25</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The evolution of orchestration in Airflow continues with innovations that address both scalability and security. From improving executor reliability to enabling remote execution, these advancements reshape how organizations manage data pipelines.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we’re joined by </span><a href="https://www.linkedin.com/in/ian-buss/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ian Buss</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Principal Software Engineer at Astronomer, and </span><a href="https://www.linkedin.com/in/piotr-chomiak-b1955624/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Piotr Chomiak</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Principal Product Manager at </span><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, who share insights into the Astro Executor and remote execution.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">04:13 How product leadership drives scalability for enterprise needs.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">08:23 Architectural changes that improve reliability and remove bottlenecks.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">10:15 Metrics that enhance visibility into system performance.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">12:54 The role of remote execution in addressing security requirements.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">15:56 Differences between open-source solutions and managed offerings.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">19:04 Broad industry adoption and applicability of remote execution.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">20:39 Future advancements in language support and multi-tenancy.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/ian-buss/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ian Buss</a></p><p>https://www.linkedin.com/in/ian-buss/</p><p><br></p><p><a href="https://www.linkedin.com/in/piotr-chomiak-b1955624/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Piotr Chomiak</a></p><p>https://www.linkedin.com/in/piotr-chomiak-b1955624/</p><p><br></p><p><a href="https://www.astronomer.io" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> | Website</p><p>https://www.astronomer.io</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://airflow.apache.org/community/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Slack Community</a></p><p>https://airflow.apache.org/community/</p><p><br></p><p><a href="https://astronomer.io/beyond/dataflowcast" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Beyond Analytics conference</a></p><p>https://astronomer.io/beyond/dataflowcast</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The evolution of orchestration in Airflow continues with innovations that address both scalability and security. From improving executor reliability to enabling remote execution, these advancements reshape how organizations manage data pipelines.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we’re joined by </span><a href="https://www.linkedin.com/in/ian-buss/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ian Buss</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Principal Software Engineer at Astronomer, and </span><a href="https://www.linkedin.com/in/piotr-chomiak-b1955624/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Piotr Chomiak</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Principal Product Manager at </span><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, who share insights into the Astro Executor and remote execution.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">04:13 How product leadership drives scalability for enterprise needs.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">08:23 Architectural changes that improve reliability and remove bottlenecks.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">10:15 Metrics that enhance visibility into system performance.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">12:54 The role of remote execution in addressing security requirements.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">15:56 Differences between open-source solutions and managed offerings.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">19:04 Broad industry adoption and applicability of remote execution.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">20:39 Future advancements in language support and multi-tenancy.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/ian-buss/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ian Buss</a></p><p>https://www.linkedin.com/in/ian-buss/</p><p><br></p><p><a href="https://www.linkedin.com/in/piotr-chomiak-b1955624/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Piotr Chomiak</a></p><p>https://www.linkedin.com/in/piotr-chomiak-b1955624/</p><p><br></p><p><a href="https://www.astronomer.io" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> | Website</p><p>https://www.astronomer.io</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://airflow.apache.org/community/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Slack Community</a></p><p>https://airflow.apache.org/community/</p><p><br></p><p><a href="https://astronomer.io/beyond/dataflowcast" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Beyond Analytics conference</a></p><p>https://astronomer.io/beyond/dataflowcast</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[The evolution of orchestration in Airflow continues with innovations that address both scalability and security. From improving executor reliability to enabling remote execution, these advancements reshape how organizations manage data pipelines.In...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>54</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[96848ae0-def1-441b-9ee0-dccb68b2f6bf]]></guid>
  <title><![CDATA[Scaling On-Prem Airflow With 2,000 DAGs at Numberly with Sébastien Crocquevieille]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Scaling 2,000+ data pipelines isn’t easy. But with the right tools and a self-hosted mindset, it becomes achievable.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/scroc/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Sébastien Crocquevieille</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Data Engineer at </span><a href="https://www.linkedin.com/company/numberly/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Numberly</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, unpacks how the team scaled their on-prem Airflow setup using open-source tooling and Kubernetes. We explore orchestration strategies, UI-driven stakeholder access and Airflow’s evolving features.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">02:13 Overview of the company’s operations and global presence.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">04:00 The tech stack and structure of the data engineering team.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">04:24 Running nearly 2,000 DAGs in production using Airflow.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">05:42 How Airflow’s UI empowers stakeholders to self-serve and troubleshoot.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">07:05 Details on the Kubernetes-based Airflow setup using Helm charts.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">09:31 Transition from GitSync to NFS for DAG syncing due to performance issues.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:11 Making every team member Airflow-literate through local installation.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">17:56 Using custom libraries and plugins to extend Airflow functionality.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/scroc/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Sébastien Crocquevieille</a></p><p>https://www.linkedin.com/in/scroc/</p><p><br></p><p><a href="https://www.linkedin.com/company/numberly/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Numberly</a> | LinkedIn</p><p>https://www.linkedin.com/company/numberly/</p><p><br></p><p><a href="https://numberly.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Numberly</a> | Website</p><p>https://numberly.com/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://grafana.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Grafana</a></p><p>https://grafana.com/</p><p><br></p><p><a href="https://kafka.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Kafka</a></p><p>https://kafka.apache.org/</p><p><br></p><p><a href="https://airflow.apache.org/docs/helm-chart/stable/index.html" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Helm Chart for Apache Airflow</a></p><p>https://airflow.apache.org/docs/helm-chart/stable/index.html</p><p><br></p><p><a href="https://kubernetes.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a></p><p>https://kubernetes.io/</p><p><br></p><p><a href="https://about.gitlab.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">GitLab</a></p><p>https://about.gitlab.com/</p><p><br></p><p><a href="https://airflow.apache.org/docs/apache-airflow-providers-cncf-kubernetes/stable/operators.html" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">KubernetesPodOperator – Airflow</a></p><p>https://airflow.apache.org/docs/apache-airflow-providers-cncf-kubernetes/stable/operators.html</p><p><br></p><p><a href="https://astronomer.io/beyond/dataflowcast" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Beyond Analytics Conference</a></p><p>https://astronomer.io/beyond/dataflowcast</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/44764ab8-9d37-4f20-be48-3fd0f7787ae7/7f098f99e0.jpg" />
  <pubDate>Thu, 21 Aug 2025 00:00:00 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="23327219" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/44764ab8-9d37-4f20-be48-3fd0f7787ae7/episode.mp3" />
  <itunes:title><![CDATA[Scaling On-Prem Airflow With 2,000 DAGs at Numberly with Sébastien Crocquevieille]]></itunes:title>
  <itunes:duration>24:17</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Scaling 2,000+ data pipelines isn’t easy. But with the right tools and a self-hosted mindset, it becomes achievable.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/scroc/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Sébastien Crocquevieille</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Data Engineer at </span><a href="https://www.linkedin.com/company/numberly/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Numberly</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, unpacks how the team scaled their on-prem Airflow setup using open-source tooling and Kubernetes. We explore orchestration strategies, UI-driven stakeholder access and Airflow’s evolving features.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">02:13 Overview of the company’s operations and global presence.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">04:00 The tech stack and structure of the data engineering team.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">04:24 Running nearly 2,000 DAGs in production using Airflow.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">05:42 How Airflow’s UI empowers stakeholders to self-serve and troubleshoot.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">07:05 Details on the Kubernetes-based Airflow setup using Helm charts.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">09:31 Transition from GitSync to NFS for DAG syncing due to performance issues.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:11 Making every team member Airflow-literate through local installation.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">17:56 Using custom libraries and plugins to extend Airflow functionality.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/scroc/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Sébastien Crocquevieille</a></p><p>https://www.linkedin.com/in/scroc/</p><p><br></p><p><a href="https://www.linkedin.com/company/numberly/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Numberly</a> | LinkedIn</p><p>https://www.linkedin.com/company/numberly/</p><p><br></p><p><a href="https://numberly.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Numberly</a> | Website</p><p>https://numberly.com/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://grafana.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Grafana</a></p><p>https://grafana.com/</p><p><br></p><p><a href="https://kafka.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Kafka</a></p><p>https://kafka.apache.org/</p><p><br></p><p><a href="https://airflow.apache.org/docs/helm-chart/stable/index.html" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Helm Chart for Apache Airflow</a></p><p>https://airflow.apache.org/docs/helm-chart/stable/index.html</p><p><br></p><p><a href="https://kubernetes.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a></p><p>https://kubernetes.io/</p><p><br></p><p><a href="https://about.gitlab.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">GitLab</a></p><p>https://about.gitlab.com/</p><p><br></p><p><a href="https://airflow.apache.org/docs/apache-airflow-providers-cncf-kubernetes/stable/operators.html" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">KubernetesPodOperator – Airflow</a></p><p>https://airflow.apache.org/docs/apache-airflow-providers-cncf-kubernetes/stable/operators.html</p><p><br></p><p><a href="https://astronomer.io/beyond/dataflowcast" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Beyond Analytics Conference</a></p><p>https://astronomer.io/beyond/dataflowcast</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Scaling 2,000+ data pipelines isn’t easy. But with the right tools and a self-hosted mindset, it becomes achievable.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/scroc/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Sébastien Crocquevieille</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Data Engineer at </span><a href="https://www.linkedin.com/company/numberly/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Numberly</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, unpacks how the team scaled their on-prem Airflow setup using open-source tooling and Kubernetes. We explore orchestration strategies, UI-driven stakeholder access and Airflow’s evolving features.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">00:00 Introduction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">02:13 Overview of the company’s operations and global presence.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">04:00 The tech stack and structure of the data engineering team.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">04:24 Running nearly 2,000 DAGs in production using Airflow.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">05:42 How Airflow’s UI empowers stakeholders to self-serve and troubleshoot.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">07:05 Details on the Kubernetes-based Airflow setup using Helm charts.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">09:31 Transition from GitSync to NFS for DAG syncing due to performance issues.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">14:11 Making every team member Airflow-literate through local installation.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">17:56 Using custom libraries and plugins to extend Airflow functionality.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/scroc/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Sébastien Crocquevieille</a></p><p>https://www.linkedin.com/in/scroc/</p><p><br></p><p><a href="https://www.linkedin.com/company/numberly/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Numberly</a> | LinkedIn</p><p>https://www.linkedin.com/company/numberly/</p><p><br></p><p><a href="https://numberly.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Numberly</a> | Website</p><p>https://numberly.com/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://grafana.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Grafana</a></p><p>https://grafana.com/</p><p><br></p><p><a href="https://kafka.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Kafka</a></p><p>https://kafka.apache.org/</p><p><br></p><p><a href="https://airflow.apache.org/docs/helm-chart/stable/index.html" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Helm Chart for Apache Airflow</a></p><p>https://airflow.apache.org/docs/helm-chart/stable/index.html</p><p><br></p><p><a href="https://kubernetes.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a></p><p>https://kubernetes.io/</p><p><br></p><p><a href="https://about.gitlab.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">GitLab</a></p><p>https://about.gitlab.com/</p><p><br></p><p><a href="https://airflow.apache.org/docs/apache-airflow-providers-cncf-kubernetes/stable/operators.html" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">KubernetesPodOperator – Airflow</a></p><p>https://airflow.apache.org/docs/apache-airflow-providers-cncf-kubernetes/stable/operators.html</p><p><br></p><p><a href="https://astronomer.io/beyond/dataflowcast" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Beyond Analytics Conference</a></p><p>https://astronomer.io/beyond/dataflowcast</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Scaling 2,000+ data pipelines isn’t easy. But with the right tools and a self-hosted mindset, it becomes achievable.In this episode, Sébastien Crocquevieille, Data Engineer at Numberly, unpacks how the team scaled their on-prem Airflow setup using ...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>53</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[0a3621da-e19b-4dc3-b362-a6c184ed90c3]]></guid>
  <title><![CDATA[How Moniepoint Group Uses Airflow for Exposure Monitoring with Adeolu Adegboye]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Managing financial data at scale requires precise orchestration and proactive monitoring to maintain operational efficiency.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we are joined by </span><a href="https://www.linkedin.com/in/adeolu-adegboye/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Adeolu Adegboye</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Data Engineer at </span><a href="https://www.linkedin.com/company/moniepoint-inc/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Moniepoint Group</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, who shares how his team uses data pipelines and workflow automation to manage high volumes of transactions, ensure timely alerts and support diverse stakeholders across the business.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(00:00) Introduction.&nbsp;</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(02:48) The role of data engineering in supporting all business operations.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(04:17) Leveraging workflow orchestration to manage daily processes.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(05:20) Proactively monitoring for anomalies to prevent potential issues.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(08:12) Simplifying complex insights for non-technical teams.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(13:01) Improving efficiency through dynamic and parallel workflows.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(14:19) Optimizing system performance to handle large-scale operations.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(17:19) Exploring creative and innovative uses for workflow automation.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/adeolu-adegboye/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Adeolu Adegboye</a></p><p>https://www.linkedin.com/in/adeolu-adegboye/</p><p><br></p><p><a href="https://www.linkedin.com/company/moniepoint-inc/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Moniepoint Group</a> | LinkedIn</p><p>https://www.linkedin.com/company/moniepoint-inc/</p><p><br></p><p><a href="https://www.moniepoint.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Moniepoint Group</a> | Website</p><p>https://www.moniepoint.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://clickhouse.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">ClickHouse</a></p><p>https://clickhouse.com/</p><p><br></p><p><a href="https://grafana.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Grafana</a></p><p>https://grafana.com/</p><p><br></p><p><a href="http://astronomer.io/beyond/dataflowcast" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Beyond Analytics Conference</a></p><p>https://astronomer.io/beyond/dataflowcast</p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/01cf6965-95b4-4894-8d7f-ea7234d28ceb/e79b85c850.jpg" />
  <pubDate>Thu, 14 Aug 2025 04:50:00 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="20680701" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/01cf6965-95b4-4894-8d7f-ea7234d28ceb/episode.mp3" />
  <itunes:title><![CDATA[How Moniepoint Group Uses Airflow for Exposure Monitoring with Adeolu Adegboye]]></itunes:title>
  <itunes:duration>21:32</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Managing financial data at scale requires precise orchestration and proactive monitoring to maintain operational efficiency.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we are joined by </span><a href="https://www.linkedin.com/in/adeolu-adegboye/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Adeolu Adegboye</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Data Engineer at </span><a href="https://www.linkedin.com/company/moniepoint-inc/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Moniepoint Group</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, who shares how his team uses data pipelines and workflow automation to manage high volumes of transactions, ensure timely alerts and support diverse stakeholders across the business.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(00:00) Introduction.&nbsp;</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(02:48) The role of data engineering in supporting all business operations.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(04:17) Leveraging workflow orchestration to manage daily processes.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(05:20) Proactively monitoring for anomalies to prevent potential issues.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(08:12) Simplifying complex insights for non-technical teams.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(13:01) Improving efficiency through dynamic and parallel workflows.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(14:19) Optimizing system performance to handle large-scale operations.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(17:19) Exploring creative and innovative uses for workflow automation.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/adeolu-adegboye/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Adeolu Adegboye</a></p><p>https://www.linkedin.com/in/adeolu-adegboye/</p><p><br></p><p><a href="https://www.linkedin.com/company/moniepoint-inc/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Moniepoint Group</a> | LinkedIn</p><p>https://www.linkedin.com/company/moniepoint-inc/</p><p><br></p><p><a href="https://www.moniepoint.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Moniepoint Group</a> | Website</p><p>https://www.moniepoint.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://clickhouse.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">ClickHouse</a></p><p>https://clickhouse.com/</p><p><br></p><p><a href="https://grafana.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Grafana</a></p><p>https://grafana.com/</p><p><br></p><p><a href="http://astronomer.io/beyond/dataflowcast" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Beyond Analytics Conference</a></p><p>https://astronomer.io/beyond/dataflowcast</p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Managing financial data at scale requires precise orchestration and proactive monitoring to maintain operational efficiency.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we are joined by </span><a href="https://www.linkedin.com/in/adeolu-adegboye/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Adeolu Adegboye</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Data Engineer at </span><a href="https://www.linkedin.com/company/moniepoint-inc/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Moniepoint Group</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, who shares how his team uses data pipelines and workflow automation to manage high volumes of transactions, ensure timely alerts and support diverse stakeholders across the business.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(00:00) Introduction.&nbsp;</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(02:48) The role of data engineering in supporting all business operations.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(04:17) Leveraging workflow orchestration to manage daily processes.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(05:20) Proactively monitoring for anomalies to prevent potential issues.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(08:12) Simplifying complex insights for non-technical teams.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(13:01) Improving efficiency through dynamic and parallel workflows.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(14:19) Optimizing system performance to handle large-scale operations.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(17:19) Exploring creative and innovative uses for workflow automation.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/adeolu-adegboye/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Adeolu Adegboye</a></p><p>https://www.linkedin.com/in/adeolu-adegboye/</p><p><br></p><p><a href="https://www.linkedin.com/company/moniepoint-inc/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Moniepoint Group</a> | LinkedIn</p><p>https://www.linkedin.com/company/moniepoint-inc/</p><p><br></p><p><a href="https://www.moniepoint.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Moniepoint Group</a> | Website</p><p>https://www.moniepoint.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://clickhouse.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">ClickHouse</a></p><p>https://clickhouse.com/</p><p><br></p><p><a href="https://grafana.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Grafana</a></p><p>https://grafana.com/</p><p><br></p><p><a href="http://astronomer.io/beyond/dataflowcast" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Beyond Analytics Conference</a></p><p>https://astronomer.io/beyond/dataflowcast</p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Managing financial data at scale requires precise orchestration and proactive monitoring to maintain operational efficiency.In this episode, we are joined by Adeolu Adegboye, Data Engineer at Moniepoint Group, who shares how his team uses data pipe...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>52</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[4bdd3ba5-5975-4a3b-9ecc-e777078b11f4]]></guid>
  <title><![CDATA[Inside Bosch’s Airflow 3 Revolution: Remote Execution with Jens Scheffler]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The evolution of Airflow has reached a milestone with the introduction of remote execution in Airflow 3, enabling flexible orchestration across distributed environments.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/jens-scheffler/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jens Scheffler</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Test Execution Cluster Technical Architect at </span><a href="https://www.linkedin.com/company/bosch/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Bosch</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, shares insights on how his team’s need for large-scale, cross-environment testing influenced the development of the Edge Executor and shaped this major release.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(02:39) The role of remote execution in supporting large-scale testing needs.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(04:44) How community support contributed to the Edge Executor’s development.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(08:41) Navigating network and infrastructure limitations within secure environments.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(13:25) Transitioning from database-heavy processes to an API-driven model.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(14:16) How the new task SDK in Airflow 3 improves distributed task execution.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(16:54) What is required to set up and configure the Edge Executor.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(19:36) Managing multiple queues to optimize tasks across different environments.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(23:30) Examples of extreme distance use cases for edge execution.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/jens-scheffler/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jens Scheffler</a></p><p>https://www.linkedin.com/in/jens-scheffler/</p><p><br></p><p><a href="https://www.linkedin.com/company/bosch/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Bosch</a> | LinkedIn</p><p>https://www.linkedin.com/company/bosch/</p><p><br></p><p><a href="https://www.bosch.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Bosch</a> | Website</p><p>https://www.bosch.com/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://airflow.apache.org/docs/apache-airflow/stable/core-concepts/executor/index.html" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Edge Executor (Edge3 Provider Package)</a></p><p>https://airflow.apache.org/docs/apache-airflow/stable/core-concepts/executor/index.html</p><p><br></p><p><a href="https://www.astronomer.io/docs/astro/astro-executor/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer’s Astro Executor</a></p><p>https://www.astronomer.io/docs/astro/astro-executor/</p><p><br></p><p><a href="http://astronomer.io/beyond/dataflowcast" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Beyond Analytics Conference</a></p><p>https://astronomer.io/beyond/dataflowcast</p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/9c5b21b6-1673-4a74-a0ad-371b0dbbff1d/12c3965aa2.jpg" />
  <pubDate>Thu, 07 Aug 2025 00:00:00 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="26917488" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/9c5b21b6-1673-4a74-a0ad-371b0dbbff1d/episode.mp3" />
  <itunes:title><![CDATA[Inside Bosch’s Airflow 3 Revolution: Remote Execution with Jens Scheffler]]></itunes:title>
  <itunes:duration>28:02</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The evolution of Airflow has reached a milestone with the introduction of remote execution in Airflow 3, enabling flexible orchestration across distributed environments.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/jens-scheffler/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jens Scheffler</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Test Execution Cluster Technical Architect at </span><a href="https://www.linkedin.com/company/bosch/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Bosch</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, shares insights on how his team’s need for large-scale, cross-environment testing influenced the development of the Edge Executor and shaped this major release.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(02:39) The role of remote execution in supporting large-scale testing needs.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(04:44) How community support contributed to the Edge Executor’s development.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(08:41) Navigating network and infrastructure limitations within secure environments.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(13:25) Transitioning from database-heavy processes to an API-driven model.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(14:16) How the new task SDK in Airflow 3 improves distributed task execution.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(16:54) What is required to set up and configure the Edge Executor.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(19:36) Managing multiple queues to optimize tasks across different environments.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(23:30) Examples of extreme distance use cases for edge execution.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/jens-scheffler/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jens Scheffler</a></p><p>https://www.linkedin.com/in/jens-scheffler/</p><p><br></p><p><a href="https://www.linkedin.com/company/bosch/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Bosch</a> | LinkedIn</p><p>https://www.linkedin.com/company/bosch/</p><p><br></p><p><a href="https://www.bosch.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Bosch</a> | Website</p><p>https://www.bosch.com/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://airflow.apache.org/docs/apache-airflow/stable/core-concepts/executor/index.html" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Edge Executor (Edge3 Provider Package)</a></p><p>https://airflow.apache.org/docs/apache-airflow/stable/core-concepts/executor/index.html</p><p><br></p><p><a href="https://www.astronomer.io/docs/astro/astro-executor/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer’s Astro Executor</a></p><p>https://www.astronomer.io/docs/astro/astro-executor/</p><p><br></p><p><a href="http://astronomer.io/beyond/dataflowcast" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Beyond Analytics Conference</a></p><p>https://astronomer.io/beyond/dataflowcast</p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The evolution of Airflow has reached a milestone with the introduction of remote execution in Airflow 3, enabling flexible orchestration across distributed environments.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/jens-scheffler/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jens Scheffler</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Test Execution Cluster Technical Architect at </span><a href="https://www.linkedin.com/company/bosch/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Bosch</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, shares insights on how his team’s need for large-scale, cross-environment testing influenced the development of the Edge Executor and shaped this major release.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(02:39) The role of remote execution in supporting large-scale testing needs.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(04:44) How community support contributed to the Edge Executor’s development.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(08:41) Navigating network and infrastructure limitations within secure environments.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(13:25) Transitioning from database-heavy processes to an API-driven model.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(14:16) How the new task SDK in Airflow 3 improves distributed task execution.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(16:54) What is required to set up and configure the Edge Executor.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(19:36) Managing multiple queues to optimize tasks across different environments.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(23:30) Examples of extreme distance use cases for edge execution.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/jens-scheffler/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jens Scheffler</a></p><p>https://www.linkedin.com/in/jens-scheffler/</p><p><br></p><p><a href="https://www.linkedin.com/company/bosch/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Bosch</a> | LinkedIn</p><p>https://www.linkedin.com/company/bosch/</p><p><br></p><p><a href="https://www.bosch.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Bosch</a> | Website</p><p>https://www.bosch.com/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://airflow.apache.org/docs/apache-airflow/stable/core-concepts/executor/index.html" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Edge Executor (Edge3 Provider Package)</a></p><p>https://airflow.apache.org/docs/apache-airflow/stable/core-concepts/executor/index.html</p><p><br></p><p><a href="https://www.astronomer.io/docs/astro/astro-executor/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer’s Astro Executor</a></p><p>https://www.astronomer.io/docs/astro/astro-executor/</p><p><br></p><p><a href="http://astronomer.io/beyond/dataflowcast" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Beyond Analytics Conference</a></p><p>https://astronomer.io/beyond/dataflowcast</p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿﻿</span>Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[The evolution of Airflow has reached a milestone with the introduction of remote execution in Airflow 3, enabling flexible orchestration across distributed environments.In this episode, Jens Scheffler, Test Execution Cluster Technical Architect at ...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>51</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[e74573a6-59e3-4529-a456-76b950228544]]></guid>
  <title><![CDATA[Inside Modern Data Infrastructure at Massdriver with Cory O’Daniel and Jake Ferriero]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Managing modern data platforms means navigating a web of complex infrastructure, competing team needs and evolving security standards. For data teams to truly thrive, infrastructure must become both accessible and compliant without sacrificing velocity or reliability.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we’re joined by </span><a href="https://www.linkedin.com/in/coryodaniel/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Cory O’Daniel</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, CEO and Co-Founder at </span><a href="https://www.linkedin.com/company/massdriver/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Massdriver</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, and </span><a href="https://www.linkedin.com/in/jacob-ferriero/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jacob Ferriero</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Software Engineer at </span><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, to unpack what it takes to make data platform engineering scalable, sustainable and secure. They share lessons from years of experience working with DevOps, ML teams and platform engineers and discuss how Airflow fits into the orchestration layer of today’s data stacks.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(03:27) Making infrastructure accessible without deep ops knowledge.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(07:23) Distinct personas and responsibilities across data teams.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(09:53) Infrastructure hurdles specific to ML workloads.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(11:13) Compliance and governance shaping platform design.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(13:27) Tooling mismatches between teams cause friction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(15:13) Airflow’s orchestration role within broader system architecture.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(22:10) Creating reusable infrastructure patterns for consistency.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(24:13) Enabling secure access without slowing down development.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(26:55) Opportunities to improve Airflow with event-driven and reliability tooling.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/coryodaniel/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Cory O’Daniel</a></p><p>https://www.linkedin.com/in/coryodaniel/</p><p><br></p><p><a href="https://www.linkedin.com/company/massdriver/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Massdriver</a> | LinkedIn</p><p>https://www.linkedin.com/company/massdriver/</p><p><br></p><p><a href="https://www.massdriver.cloud/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Massdriver</a> | Website</p><p>https://www.massdriver.cloud/</p><p><br></p><p><a href="https://www.linkedin.com/in/jacob-ferriero/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jacob Ferriero</a></p><p>https://www.linkedin.com/in/jacob-ferriero/</p><p><br></p><p><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a></p><p>https://www.linkedin.com/company/astronomer/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.prequel.co/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Prequel</a></p><p>https://www.prequel.co/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/9b719df5-58cb-4cc6-a833-b6bd5afd6bac/c368695965.jpg" />
  <pubDate>Thu, 31 Jul 2025 04:50:00 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="30155000" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/9b719df5-58cb-4cc6-a833-b6bd5afd6bac/episode.mp3" />
  <itunes:title><![CDATA[Inside Modern Data Infrastructure at Massdriver with Cory O’Daniel and Jake Ferriero]]></itunes:title>
  <itunes:duration>31:24</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Managing modern data platforms means navigating a web of complex infrastructure, competing team needs and evolving security standards. For data teams to truly thrive, infrastructure must become both accessible and compliant without sacrificing velocity or reliability.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we’re joined by </span><a href="https://www.linkedin.com/in/coryodaniel/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Cory O’Daniel</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, CEO and Co-Founder at </span><a href="https://www.linkedin.com/company/massdriver/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Massdriver</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, and </span><a href="https://www.linkedin.com/in/jacob-ferriero/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jacob Ferriero</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Software Engineer at </span><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, to unpack what it takes to make data platform engineering scalable, sustainable and secure. They share lessons from years of experience working with DevOps, ML teams and platform engineers and discuss how Airflow fits into the orchestration layer of today’s data stacks.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(03:27) Making infrastructure accessible without deep ops knowledge.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(07:23) Distinct personas and responsibilities across data teams.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(09:53) Infrastructure hurdles specific to ML workloads.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(11:13) Compliance and governance shaping platform design.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(13:27) Tooling mismatches between teams cause friction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(15:13) Airflow’s orchestration role within broader system architecture.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(22:10) Creating reusable infrastructure patterns for consistency.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(24:13) Enabling secure access without slowing down development.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(26:55) Opportunities to improve Airflow with event-driven and reliability tooling.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/coryodaniel/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Cory O’Daniel</a></p><p>https://www.linkedin.com/in/coryodaniel/</p><p><br></p><p><a href="https://www.linkedin.com/company/massdriver/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Massdriver</a> | LinkedIn</p><p>https://www.linkedin.com/company/massdriver/</p><p><br></p><p><a href="https://www.massdriver.cloud/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Massdriver</a> | Website</p><p>https://www.massdriver.cloud/</p><p><br></p><p><a href="https://www.linkedin.com/in/jacob-ferriero/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jacob Ferriero</a></p><p>https://www.linkedin.com/in/jacob-ferriero/</p><p><br></p><p><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a></p><p>https://www.linkedin.com/company/astronomer/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.prequel.co/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Prequel</a></p><p>https://www.prequel.co/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Managing modern data platforms means navigating a web of complex infrastructure, competing team needs and evolving security standards. For data teams to truly thrive, infrastructure must become both accessible and compliant without sacrificing velocity or reliability.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we’re joined by </span><a href="https://www.linkedin.com/in/coryodaniel/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Cory O’Daniel</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, CEO and Co-Founder at </span><a href="https://www.linkedin.com/company/massdriver/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Massdriver</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, and </span><a href="https://www.linkedin.com/in/jacob-ferriero/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jacob Ferriero</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Software Engineer at </span><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, to unpack what it takes to make data platform engineering scalable, sustainable and secure. They share lessons from years of experience working with DevOps, ML teams and platform engineers and discuss how Airflow fits into the orchestration layer of today’s data stacks.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(03:27) Making infrastructure accessible without deep ops knowledge.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(07:23) Distinct personas and responsibilities across data teams.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(09:53) Infrastructure hurdles specific to ML workloads.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(11:13) Compliance and governance shaping platform design.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(13:27) Tooling mismatches between teams cause friction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(15:13) Airflow’s orchestration role within broader system architecture.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(22:10) Creating reusable infrastructure patterns for consistency.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(24:13) Enabling secure access without slowing down development.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(26:55) Opportunities to improve Airflow with event-driven and reliability tooling.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/coryodaniel/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Cory O’Daniel</a></p><p>https://www.linkedin.com/in/coryodaniel/</p><p><br></p><p><a href="https://www.linkedin.com/company/massdriver/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Massdriver</a> | LinkedIn</p><p>https://www.linkedin.com/company/massdriver/</p><p><br></p><p><a href="https://www.massdriver.cloud/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Massdriver</a> | Website</p><p>https://www.massdriver.cloud/</p><p><br></p><p><a href="https://www.linkedin.com/in/jacob-ferriero/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jacob Ferriero</a></p><p>https://www.linkedin.com/in/jacob-ferriero/</p><p><br></p><p><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a></p><p>https://www.linkedin.com/company/astronomer/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.prequel.co/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Prequel</a></p><p>https://www.prequel.co/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Managing modern data platforms means navigating a web of complex infrastructure, competing team needs and evolving security standards. For data teams to truly thrive, infrastructure must become both accessible and compliant without sacrificing velo...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>50</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[6f0d9bb3-11de-4166-b7ef-d92a2a82ccdd]]></guid>
  <title><![CDATA[The Future of Airflow Telemetry with Bolke de Bruin]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Telemetry has the potential to guide the future of Airflow, but only if it’s implemented transparently and with community trust.&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we’re joined by </span><a href="https://www.linkedin.com/in/bolke/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Bolke de Bruin</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Director at </span><a href="https://www.linkedin.com/company/metyis/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Metyis</a><span style="background-color: transparent; color: rgb(22, 14, 61);"> and a long-time Airflow PMC member. Bolke discusses how telemetry has been handled in the past, why it matters now and what it will take to get it right.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(03:20) The role of foundations in establishing credibility and sustainability.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(04:52) Why data collection is critical to open-source project direction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(07:24) Lessons learned from previous approaches to user data collection.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(10:23) The current state of telemetry in the project.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(10:53) Community trust as a prerequisite for technical implementation.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(12:54) The importance of managing sensitive data within trusted ecosystems.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(16:37) Ethical considerations in balancing participation and access.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(18:45) Forward-looking ideas for improving workflow design and usability.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/bolke/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Bolke de Bruin</a></p><p>https://www.linkedin.com/in/bolke/</p><p><br></p><p><a href="https://www.linkedin.com/company/metyis/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Metyis</a> | LinkedIn</p><p>https://www.linkedin.com/company/metyis/</p><p><br></p><p><a href="http://www.metyis.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Metyis</a> | Website</p><p>http://www.metyis.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://airflowsummit.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Summit</a></p><p>https://airflowsummit.org/</p><p><br></p><p><a href="https://lists.apache.org/list.html?dev@airflow.apache.org" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Dev List</a></p><p>https://lists.apache.org/list.html?dev@airflow.apache.org</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/london/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/london/</a></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;&nbsp;&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/new-york/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/new-york/</a><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/sydney/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/sydney/</a><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;&nbsp;&nbsp;</span></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/san-francisco/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/san-francisco/</a><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;&nbsp;</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/chicago/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/chicago/</a><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/cde9c8cb-0771-458a-8f59-cc623bc79f8a/4ab768757c.jpg" />
  <pubDate>Thu, 17 Jul 2025 07:40:00 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="21050177" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/cde9c8cb-0771-458a-8f59-cc623bc79f8a/episode.mp3" />
  <itunes:title><![CDATA[The Future of Airflow Telemetry with Bolke de Bruin]]></itunes:title>
  <itunes:duration>21:55</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Telemetry has the potential to guide the future of Airflow, but only if it’s implemented transparently and with community trust.&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we’re joined by </span><a href="https://www.linkedin.com/in/bolke/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Bolke de Bruin</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Director at </span><a href="https://www.linkedin.com/company/metyis/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Metyis</a><span style="background-color: transparent; color: rgb(22, 14, 61);"> and a long-time Airflow PMC member. Bolke discusses how telemetry has been handled in the past, why it matters now and what it will take to get it right.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(03:20) The role of foundations in establishing credibility and sustainability.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(04:52) Why data collection is critical to open-source project direction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(07:24) Lessons learned from previous approaches to user data collection.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(10:23) The current state of telemetry in the project.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(10:53) Community trust as a prerequisite for technical implementation.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(12:54) The importance of managing sensitive data within trusted ecosystems.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(16:37) Ethical considerations in balancing participation and access.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(18:45) Forward-looking ideas for improving workflow design and usability.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/bolke/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Bolke de Bruin</a></p><p>https://www.linkedin.com/in/bolke/</p><p><br></p><p><a href="https://www.linkedin.com/company/metyis/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Metyis</a> | LinkedIn</p><p>https://www.linkedin.com/company/metyis/</p><p><br></p><p><a href="http://www.metyis.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Metyis</a> | Website</p><p>http://www.metyis.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://airflowsummit.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Summit</a></p><p>https://airflowsummit.org/</p><p><br></p><p><a href="https://lists.apache.org/list.html?dev@airflow.apache.org" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Dev List</a></p><p>https://lists.apache.org/list.html?dev@airflow.apache.org</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/london/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/london/</a></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;&nbsp;&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/new-york/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/new-york/</a><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/sydney/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/sydney/</a><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;&nbsp;&nbsp;</span></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/san-francisco/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/san-francisco/</a><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;&nbsp;</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/chicago/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/chicago/</a><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Telemetry has the potential to guide the future of Airflow, but only if it’s implemented transparently and with community trust.&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we’re joined by </span><a href="https://www.linkedin.com/in/bolke/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Bolke de Bruin</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Director at </span><a href="https://www.linkedin.com/company/metyis/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Metyis</a><span style="background-color: transparent; color: rgb(22, 14, 61);"> and a long-time Airflow PMC member. Bolke discusses how telemetry has been handled in the past, why it matters now and what it will take to get it right.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(03:20) The role of foundations in establishing credibility and sustainability.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(04:52) Why data collection is critical to open-source project direction.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(07:24) Lessons learned from previous approaches to user data collection.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(10:23) The current state of telemetry in the project.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(10:53) Community trust as a prerequisite for technical implementation.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(12:54) The importance of managing sensitive data within trusted ecosystems.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(16:37) Ethical considerations in balancing participation and access.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(18:45) Forward-looking ideas for improving workflow design and usability.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/bolke/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Bolke de Bruin</a></p><p>https://www.linkedin.com/in/bolke/</p><p><br></p><p><a href="https://www.linkedin.com/company/metyis/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Metyis</a> | LinkedIn</p><p>https://www.linkedin.com/company/metyis/</p><p><br></p><p><a href="http://www.metyis.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Metyis</a> | Website</p><p>http://www.metyis.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://airflowsummit.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Summit</a></p><p>https://airflowsummit.org/</p><p><br></p><p><a href="https://lists.apache.org/list.html?dev@airflow.apache.org" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Dev List</a></p><p>https://lists.apache.org/list.html?dev@airflow.apache.org</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/london/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/london/</a></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;&nbsp;&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/new-york/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/new-york/</a><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/sydney/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/sydney/</a><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;&nbsp;&nbsp;</span></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/san-francisco/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/san-francisco/</a><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;&nbsp;</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/chicago/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/chicago/</a><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Telemetry has the potential to guide the future of Airflow, but only if it’s implemented transparently and with community trust. In this episode, we’re joined by Bolke de Bruin, Director at Metyis and a long-time Airflow PMC member. Bolke discusses...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>49</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[a27bf596-e73f-49ac-82d8-f0cae4167aab]]></guid>
  <title><![CDATA[Transforming the Airflow UI for Cloudera’s Users with Shubham Raj]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Contributing to open-source projects can be daunting, but it can also unlock unexpected innovation. This episode showcases how one engineer’s journey with Apache Airflow led to impactful UI enhancements and infrastructure solutions at scale. </span><a href="https://www.linkedin.com/in/shubhamrajofficial/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Shubham Raj</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Software Engineer II at </span><a href="https://www.linkedin.com/company/cloudera/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Cloudera</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, shares how his team built a drag-and-drop DAG editor for non-coders, contributions which helped shape the Airflow 3.0 Ul and introduced features like </span>external XCom control<span style="background-color: transparent; color: rgb(22, 14, 61);"> and bulk APls.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);"><span class="ql-cursor">﻿﻿</span></span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(02:30) Day-to-day responsibilities building platforms that simplify orchestration.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(05:27) Factors that make onboarding into large open-source projects accessible.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(07:35) The value of improved user interfaces for task state visibility and control.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(09:49) Enabling faster debugging by exposing internal data through APIs.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(13:00) Balancing frontend design goals with backend functionality.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(14:19) Creating workflow editors that lower the barrier to entry.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(16:54) Supporting a variety of task types within a visual DAG builder.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(19:32) Common infrastructure challenges faced by orchestration users.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(20:37) Addressing dependency management across distributed environments.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/shubhamrajofficial/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Shubham Raj</a></p><p>https://www.linkedin.com/in/shubhamrajofficial/</p><p><br></p><p><a href="https://www.linkedin.com/company/cloudera/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Cloudera</a> | LinkedIn</p><p>https://www.linkedin.com/company/cloudera/</p><p><br></p><p><a href="https://www.cloudera.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Cloudera</a> | Website</p><p>https://www.cloudera.com/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://airflowsummit.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">2023 Airflow Summit</a></p><p>https://airflowsummit.org/</p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/london/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/new-york/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/sydney/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/san-francisco/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/chicago/</span></p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/01def055-b4d0-4317-a747-b0062fd51af1/08754beb93.jpg" />
  <pubDate>Thu, 10 Jul 2025 00:00:00 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="21577223" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/01def055-b4d0-4317-a747-b0062fd51af1/episode.mp3" />
  <itunes:title><![CDATA[Transforming the Airflow UI for Cloudera’s Users with Shubham Raj]]></itunes:title>
  <itunes:duration>22:28</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Contributing to open-source projects can be daunting, but it can also unlock unexpected innovation. This episode showcases how one engineer’s journey with Apache Airflow led to impactful UI enhancements and infrastructure solutions at scale. </span><a href="https://www.linkedin.com/in/shubhamrajofficial/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Shubham Raj</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Software Engineer II at </span><a href="https://www.linkedin.com/company/cloudera/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Cloudera</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, shares how his team built a drag-and-drop DAG editor for non-coders, contributions which helped shape the Airflow 3.0 Ul and introduced features like </span>external XCom control<span style="background-color: transparent; color: rgb(22, 14, 61);"> and bulk APls.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);"><span class="ql-cursor">﻿﻿</span></span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(02:30) Day-to-day responsibilities building platforms that simplify orchestration.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(05:27) Factors that make onboarding into large open-source projects accessible.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(07:35) The value of improved user interfaces for task state visibility and control.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(09:49) Enabling faster debugging by exposing internal data through APIs.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(13:00) Balancing frontend design goals with backend functionality.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(14:19) Creating workflow editors that lower the barrier to entry.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(16:54) Supporting a variety of task types within a visual DAG builder.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(19:32) Common infrastructure challenges faced by orchestration users.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(20:37) Addressing dependency management across distributed environments.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/shubhamrajofficial/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Shubham Raj</a></p><p>https://www.linkedin.com/in/shubhamrajofficial/</p><p><br></p><p><a href="https://www.linkedin.com/company/cloudera/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Cloudera</a> | LinkedIn</p><p>https://www.linkedin.com/company/cloudera/</p><p><br></p><p><a href="https://www.cloudera.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Cloudera</a> | Website</p><p>https://www.cloudera.com/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://airflowsummit.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">2023 Airflow Summit</a></p><p>https://airflowsummit.org/</p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/london/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/new-york/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/sydney/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/san-francisco/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/chicago/</span></p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Contributing to open-source projects can be daunting, but it can also unlock unexpected innovation. This episode showcases how one engineer’s journey with Apache Airflow led to impactful UI enhancements and infrastructure solutions at scale. </span><a href="https://www.linkedin.com/in/shubhamrajofficial/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Shubham Raj</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Software Engineer II at </span><a href="https://www.linkedin.com/company/cloudera/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Cloudera</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, shares how his team built a drag-and-drop DAG editor for non-coders, contributions which helped shape the Airflow 3.0 Ul and introduced features like </span>external XCom control<span style="background-color: transparent; color: rgb(22, 14, 61);"> and bulk APls.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);"><span class="ql-cursor">﻿﻿</span></span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(02:30) Day-to-day responsibilities building platforms that simplify orchestration.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(05:27) Factors that make onboarding into large open-source projects accessible.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(07:35) The value of improved user interfaces for task state visibility and control.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(09:49) Enabling faster debugging by exposing internal data through APIs.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(13:00) Balancing frontend design goals with backend functionality.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(14:19) Creating workflow editors that lower the barrier to entry.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(16:54) Supporting a variety of task types within a visual DAG builder.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(19:32) Common infrastructure challenges faced by orchestration users.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(20:37) Addressing dependency management across distributed environments.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/shubhamrajofficial/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Shubham Raj</a></p><p>https://www.linkedin.com/in/shubhamrajofficial/</p><p><br></p><p><a href="https://www.linkedin.com/company/cloudera/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Cloudera</a> | LinkedIn</p><p>https://www.linkedin.com/company/cloudera/</p><p><br></p><p><a href="https://www.cloudera.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Cloudera</a> | Website</p><p>https://www.cloudera.com/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://airflowsummit.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">2023 Airflow Summit</a></p><p>https://airflowsummit.org/</p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/london/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/new-york/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/sydney/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/san-francisco/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/chicago/</span></p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Contributing to open-source projects can be daunting, but it can also unlock unexpected innovation. This episode showcases how one engineer’s journey with Apache Airflow led to impactful UI enhancements and infrastructure solutions at scale. Shubha...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>48</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[74dbd93c-1be1-48af-8121-63059da869e3]]></guid>
  <title><![CDATA[Streamlining Thousands of Data Pipelines at Lyft with Yunhao Qing]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Managing data pipelines at scale is not just a technical challenge. It is also an organizational one. At Lyft, success means empowering dozens of teams to build with autonomy while enforcing governance and best practices across thousands of workflows.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we speak with </span><a href="https://www.linkedin.com/in/yunhao-qing" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Yunhao Qing</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Software Engineer at </span><a href="https://www.linkedin.com/company/lyft/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Lyft</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, about building a governed data-engineering platform powered by Airflow that balances flexibility, standardization and scale.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(03:17) Supporting internal teams with a centralized orchestration platform.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(04:54) Migrating to a managed service to reduce infrastructure overhead.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(06:04) Embedding platform-level governance into custom components.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(08:02) Consolidating and regulating the creation of custom code.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(09:48) Identifying and correcting inefficient workflow patterns.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(11:17) Replacing manual workarounds with native platform features.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(14:32) Preparing teams for major version upgrades.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(16:03) Leveraging asset-based scheduling for smarter triggers.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(18:13) Envisioning GenAI and semantic search for future productivity.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/yunhao-qing" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Yunhao Qing</a></p><p>https://www.linkedin.com/in/yunhao-qing</p><p><br></p><p><a href="https://www.linkedin.com/company/lyft/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Lyft</a> | LinkedIn</p><p>https://www.linkedin.com/company/lyft/</p><p><br></p><p><a href="https://www.lyft.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Lyft</a> | Website</p><p>https://www.lyft.com/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a></p><p>https://www.astronomer.io/</p><p><br></p><p><a href="https://kubernetes.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a></p><p>https://kubernetes.io/</p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/london/</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;&nbsp;</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/new-york/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/sydney/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/san-francisco/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/chicago/</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/654ee397-0b3d-475c-b927-eaa2e35a76aa/f2581c5f52.jpg" />
  <pubDate>Mon, 07 Jul 2025 00:00:00 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="18797377" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/654ee397-0b3d-475c-b927-eaa2e35a76aa/episode.mp3" />
  <itunes:title><![CDATA[Streamlining Thousands of Data Pipelines at Lyft with Yunhao Qing]]></itunes:title>
  <itunes:duration>19:34</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Managing data pipelines at scale is not just a technical challenge. It is also an organizational one. At Lyft, success means empowering dozens of teams to build with autonomy while enforcing governance and best practices across thousands of workflows.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we speak with </span><a href="https://www.linkedin.com/in/yunhao-qing" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Yunhao Qing</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Software Engineer at </span><a href="https://www.linkedin.com/company/lyft/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Lyft</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, about building a governed data-engineering platform powered by Airflow that balances flexibility, standardization and scale.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(03:17) Supporting internal teams with a centralized orchestration platform.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(04:54) Migrating to a managed service to reduce infrastructure overhead.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(06:04) Embedding platform-level governance into custom components.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(08:02) Consolidating and regulating the creation of custom code.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(09:48) Identifying and correcting inefficient workflow patterns.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(11:17) Replacing manual workarounds with native platform features.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(14:32) Preparing teams for major version upgrades.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(16:03) Leveraging asset-based scheduling for smarter triggers.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(18:13) Envisioning GenAI and semantic search for future productivity.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/yunhao-qing" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Yunhao Qing</a></p><p>https://www.linkedin.com/in/yunhao-qing</p><p><br></p><p><a href="https://www.linkedin.com/company/lyft/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Lyft</a> | LinkedIn</p><p>https://www.linkedin.com/company/lyft/</p><p><br></p><p><a href="https://www.lyft.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Lyft</a> | Website</p><p>https://www.lyft.com/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a></p><p>https://www.astronomer.io/</p><p><br></p><p><a href="https://kubernetes.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a></p><p>https://kubernetes.io/</p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/london/</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;&nbsp;</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/new-york/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/sydney/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/san-francisco/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/chicago/</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Managing data pipelines at scale is not just a technical challenge. It is also an organizational one. At Lyft, success means empowering dozens of teams to build with autonomy while enforcing governance and best practices across thousands of workflows.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we speak with </span><a href="https://www.linkedin.com/in/yunhao-qing" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Yunhao Qing</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Software Engineer at </span><a href="https://www.linkedin.com/company/lyft/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Lyft</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, about building a governed data-engineering platform powered by Airflow that balances flexibility, standardization and scale.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(03:17) Supporting internal teams with a centralized orchestration platform.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(04:54) Migrating to a managed service to reduce infrastructure overhead.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(06:04) Embedding platform-level governance into custom components.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(08:02) Consolidating and regulating the creation of custom code.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(09:48) Identifying and correcting inefficient workflow patterns.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(11:17) Replacing manual workarounds with native platform features.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(14:32) Preparing teams for major version upgrades.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(16:03) Leveraging asset-based scheduling for smarter triggers.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(18:13) Envisioning GenAI and semantic search for future productivity.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/yunhao-qing" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Yunhao Qing</a></p><p>https://www.linkedin.com/in/yunhao-qing</p><p><br></p><p><a href="https://www.linkedin.com/company/lyft/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Lyft</a> | LinkedIn</p><p>https://www.linkedin.com/company/lyft/</p><p><br></p><p><a href="https://www.lyft.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Lyft</a> | Website</p><p>https://www.lyft.com/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a></p><p>https://www.astronomer.io/</p><p><br></p><p><a href="https://kubernetes.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a></p><p>https://kubernetes.io/</p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/london/</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;&nbsp;</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/new-york/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/sydney/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/san-francisco/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/chicago/</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Managing data pipelines at scale is not just a technical challenge. It is also an organizational one. At Lyft, success means empowering dozens of teams to build with autonomy while enforcing governance and best practices across thousands of workflo...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>47</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[f9a2d7bf-b988-4651-8e81-3b9804163263]]></guid>
  <title><![CDATA[Transforming Customer Education in Data Engineering at Astronomer with Marc Lamberti]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Understanding the complexities of Apache Airflow can be daunting for newcomers and seasoned data engineers. But with the right guidance, mastering the tool becomes an achievable milestone.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/marclamberti/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Marc Lamberti</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Head of Customer Education at </span><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins us to share his journey from Udemy instructor to driving education at Astronomer, and how he's helping over 100,000 learners demystify Airflow.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(02:36) Early exposure to Airflow while addressing inefficiencies in data workflows.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(04:10) Common barriers to implementing open source tools in enterprise settings.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(06:18) The shift from part-time teaching to a full-time focus on Airflow education.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(07:53) A modular, guided approach to structuring educational content.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(09:57) The value of highlighting underused Airflow features for broader adoption.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(12:35) Certifications as a method to assess readiness and uncover knowledge gaps.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(13:25) Coverage of essential Airflow concepts in the Fundamentals exam.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(16:07) The DAG Authoring exam’s emphasis on practical, advanced features.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(20:08) A call for more visible integration of Airflow with AI workflows.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/marclamberti/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Marc Lamberti</a></p><p>https://www.linkedin.com/in/marclamberti/</p><p><br></p><p><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> | LinkedIn</p><p>https://www.linkedin.com/company/astronomer/</p><p><br></p><p><a href="https://academy.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer Academy</a></p><p>https://academy.astronomer.io/</p><p><br></p><p><a href="https://www.astronomer.io/certification/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Fundamentals Certification</a></p><p>https://www.astronomer.io/certification/</p><p><br></p><p><a href="https://academy.astronomer.io/plan/astronomer-certification-dag-authoring-for-apache-airflow-exam" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">DAG Authoring Certification</a></p><p>https://academy.astronomer.io/plan/astronomer-certification-dag-authoring-for-apache-airflow-exam</p><p><br></p><p><a href="https://www.udemy.com/course/the-complete-hands-on-course-to-master-apache-airflow/?utm_source=adwords&amp;utm_medium=udemyads&amp;utm_campaign=Search_DSA_Beta_Prof_la.EN_cc.ROW-English&amp;campaigntype=Search&amp;portfolio=ROW-English&amp;language=EN&amp;product=Course&amp;test=&amp;audience=DSA&amp;topic=&amp;priority=Beta&amp;utm_content=deal4584&amp;utm_term=_._ag_162511579404_._ad_696197165418_._kw__._de_c_._dm__._pl__._ti_dsa-1677053911088_._li_9061346_._pd__._&amp;matchtype=&amp;gad_source=1&amp;gad_campaignid=21168154305&amp;gbraid=0AAAAADROdO3MpljfP-gssiYSmDEPdhZV9&amp;gclid=Cj0KCQjw097CBhDIARIsAJ3-nxdjZA6G5-Y0-akk6Huksy2PLb04t92J4iNfUSIbMdrSAla_tb-o2N8aArOeEALw_wcB&amp;couponCode=PMNVD3025" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">The Complete Hands-On Introduction to Airflow</a></p><p>https://www.udemy.com/course/the-complete-hands-on-course-to-master-apache-airflow/?utm_source=adwords&amp;utm_medium=udemyads&amp;utm_campaign=Search_DSA_Beta_Prof_la.EN_cc.ROW-English&amp;campaigntype=Search&amp;portfolio=ROW-English&amp;language=EN&amp;product=Course&amp;test=&amp;audience=DSA&amp;topic=&amp;priority=Beta&amp;utm_content=deal4584&amp;utm_term=_._ag_162511579404_._ad_696197165418_._kw__._de_c_._dm__._pl__._ti_dsa-1677053911088_._li_9061346_._pd__._&amp;matchtype=&amp;gad_source=1&amp;gad_campaignid=21168154305&amp;gbraid=0AAAAADROdO3MpljfP-gssiYSmDEPdhZV9&amp;gclid=Cj0KCQjw097CBhDIARIsAJ3-nxdjZA6G5-Y0-akk6Huksy2PLb04t92J4iNfUSIbMdrSAla_tb-o2N8aArOeEALw_wcB&amp;couponCode=PMNVD3025</p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/london/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/new-york/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/sydney/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/san-francisco/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/chicago/</span></p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/b021b34d-1286-40a0-90ff-fb244ee189c6/9a19987593.jpg" />
  <pubDate>Thu, 26 Jun 2025 03:44:47 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="21424668" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/b021b34d-1286-40a0-90ff-fb244ee189c6/episode.mp3" />
  <itunes:title><![CDATA[Transforming Customer Education in Data Engineering at Astronomer with Marc Lamberti]]></itunes:title>
  <itunes:duration>22:19</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Understanding the complexities of Apache Airflow can be daunting for newcomers and seasoned data engineers. But with the right guidance, mastering the tool becomes an achievable milestone.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/marclamberti/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Marc Lamberti</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Head of Customer Education at </span><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins us to share his journey from Udemy instructor to driving education at Astronomer, and how he's helping over 100,000 learners demystify Airflow.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(02:36) Early exposure to Airflow while addressing inefficiencies in data workflows.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(04:10) Common barriers to implementing open source tools in enterprise settings.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(06:18) The shift from part-time teaching to a full-time focus on Airflow education.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(07:53) A modular, guided approach to structuring educational content.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(09:57) The value of highlighting underused Airflow features for broader adoption.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(12:35) Certifications as a method to assess readiness and uncover knowledge gaps.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(13:25) Coverage of essential Airflow concepts in the Fundamentals exam.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(16:07) The DAG Authoring exam’s emphasis on practical, advanced features.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(20:08) A call for more visible integration of Airflow with AI workflows.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/marclamberti/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Marc Lamberti</a></p><p>https://www.linkedin.com/in/marclamberti/</p><p><br></p><p><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> | LinkedIn</p><p>https://www.linkedin.com/company/astronomer/</p><p><br></p><p><a href="https://academy.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer Academy</a></p><p>https://academy.astronomer.io/</p><p><br></p><p><a href="https://www.astronomer.io/certification/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Fundamentals Certification</a></p><p>https://www.astronomer.io/certification/</p><p><br></p><p><a href="https://academy.astronomer.io/plan/astronomer-certification-dag-authoring-for-apache-airflow-exam" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">DAG Authoring Certification</a></p><p>https://academy.astronomer.io/plan/astronomer-certification-dag-authoring-for-apache-airflow-exam</p><p><br></p><p><a href="https://www.udemy.com/course/the-complete-hands-on-course-to-master-apache-airflow/?utm_source=adwords&amp;utm_medium=udemyads&amp;utm_campaign=Search_DSA_Beta_Prof_la.EN_cc.ROW-English&amp;campaigntype=Search&amp;portfolio=ROW-English&amp;language=EN&amp;product=Course&amp;test=&amp;audience=DSA&amp;topic=&amp;priority=Beta&amp;utm_content=deal4584&amp;utm_term=_._ag_162511579404_._ad_696197165418_._kw__._de_c_._dm__._pl__._ti_dsa-1677053911088_._li_9061346_._pd__._&amp;matchtype=&amp;gad_source=1&amp;gad_campaignid=21168154305&amp;gbraid=0AAAAADROdO3MpljfP-gssiYSmDEPdhZV9&amp;gclid=Cj0KCQjw097CBhDIARIsAJ3-nxdjZA6G5-Y0-akk6Huksy2PLb04t92J4iNfUSIbMdrSAla_tb-o2N8aArOeEALw_wcB&amp;couponCode=PMNVD3025" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">The Complete Hands-On Introduction to Airflow</a></p><p>https://www.udemy.com/course/the-complete-hands-on-course-to-master-apache-airflow/?utm_source=adwords&amp;utm_medium=udemyads&amp;utm_campaign=Search_DSA_Beta_Prof_la.EN_cc.ROW-English&amp;campaigntype=Search&amp;portfolio=ROW-English&amp;language=EN&amp;product=Course&amp;test=&amp;audience=DSA&amp;topic=&amp;priority=Beta&amp;utm_content=deal4584&amp;utm_term=_._ag_162511579404_._ad_696197165418_._kw__._de_c_._dm__._pl__._ti_dsa-1677053911088_._li_9061346_._pd__._&amp;matchtype=&amp;gad_source=1&amp;gad_campaignid=21168154305&amp;gbraid=0AAAAADROdO3MpljfP-gssiYSmDEPdhZV9&amp;gclid=Cj0KCQjw097CBhDIARIsAJ3-nxdjZA6G5-Y0-akk6Huksy2PLb04t92J4iNfUSIbMdrSAla_tb-o2N8aArOeEALw_wcB&amp;couponCode=PMNVD3025</p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/london/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/new-york/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/sydney/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/san-francisco/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/chicago/</span></p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Understanding the complexities of Apache Airflow can be daunting for newcomers and seasoned data engineers. But with the right guidance, mastering the tool becomes an achievable milestone.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/marclamberti/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Marc Lamberti</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Head of Customer Education at </span><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins us to share his journey from Udemy instructor to driving education at Astronomer, and how he's helping over 100,000 learners demystify Airflow.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(02:36) Early exposure to Airflow while addressing inefficiencies in data workflows.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(04:10) Common barriers to implementing open source tools in enterprise settings.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(06:18) The shift from part-time teaching to a full-time focus on Airflow education.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(07:53) A modular, guided approach to structuring educational content.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(09:57) The value of highlighting underused Airflow features for broader adoption.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(12:35) Certifications as a method to assess readiness and uncover knowledge gaps.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(13:25) Coverage of essential Airflow concepts in the Fundamentals exam.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(16:07) The DAG Authoring exam’s emphasis on practical, advanced features.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(20:08) A call for more visible integration of Airflow with AI workflows.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/marclamberti/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Marc Lamberti</a></p><p>https://www.linkedin.com/in/marclamberti/</p><p><br></p><p><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> | LinkedIn</p><p>https://www.linkedin.com/company/astronomer/</p><p><br></p><p><a href="https://academy.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer Academy</a></p><p>https://academy.astronomer.io/</p><p><br></p><p><a href="https://www.astronomer.io/certification/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Fundamentals Certification</a></p><p>https://www.astronomer.io/certification/</p><p><br></p><p><a href="https://academy.astronomer.io/plan/astronomer-certification-dag-authoring-for-apache-airflow-exam" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">DAG Authoring Certification</a></p><p>https://academy.astronomer.io/plan/astronomer-certification-dag-authoring-for-apache-airflow-exam</p><p><br></p><p><a href="https://www.udemy.com/course/the-complete-hands-on-course-to-master-apache-airflow/?utm_source=adwords&amp;utm_medium=udemyads&amp;utm_campaign=Search_DSA_Beta_Prof_la.EN_cc.ROW-English&amp;campaigntype=Search&amp;portfolio=ROW-English&amp;language=EN&amp;product=Course&amp;test=&amp;audience=DSA&amp;topic=&amp;priority=Beta&amp;utm_content=deal4584&amp;utm_term=_._ag_162511579404_._ad_696197165418_._kw__._de_c_._dm__._pl__._ti_dsa-1677053911088_._li_9061346_._pd__._&amp;matchtype=&amp;gad_source=1&amp;gad_campaignid=21168154305&amp;gbraid=0AAAAADROdO3MpljfP-gssiYSmDEPdhZV9&amp;gclid=Cj0KCQjw097CBhDIARIsAJ3-nxdjZA6G5-Y0-akk6Huksy2PLb04t92J4iNfUSIbMdrSAla_tb-o2N8aArOeEALw_wcB&amp;couponCode=PMNVD3025" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">The Complete Hands-On Introduction to Airflow</a></p><p>https://www.udemy.com/course/the-complete-hands-on-course-to-master-apache-airflow/?utm_source=adwords&amp;utm_medium=udemyads&amp;utm_campaign=Search_DSA_Beta_Prof_la.EN_cc.ROW-English&amp;campaigntype=Search&amp;portfolio=ROW-English&amp;language=EN&amp;product=Course&amp;test=&amp;audience=DSA&amp;topic=&amp;priority=Beta&amp;utm_content=deal4584&amp;utm_term=_._ag_162511579404_._ad_696197165418_._kw__._de_c_._dm__._pl__._ti_dsa-1677053911088_._li_9061346_._pd__._&amp;matchtype=&amp;gad_source=1&amp;gad_campaignid=21168154305&amp;gbraid=0AAAAADROdO3MpljfP-gssiYSmDEPdhZV9&amp;gclid=Cj0KCQjw097CBhDIARIsAJ3-nxdjZA6G5-Y0-akk6Huksy2PLb04t92J4iNfUSIbMdrSAla_tb-o2N8aArOeEALw_wcB&amp;couponCode=PMNVD3025</p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/london/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/new-york/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/sydney/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/san-francisco/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/chicago/</span></p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Understanding the complexities of Apache Airflow can be daunting for newcomers and seasoned data engineers. But with the right guidance, mastering the tool becomes an achievable milestone.In this episode, Marc Lamberti, Head of Customer Education a...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>46</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[1b224376-6342-4b56-9318-8ac164287a2f]]></guid>
  <title><![CDATA[Embracing Data Mesh and SQL Sensors for Scalable Workflows at lastminute.com with Alberto Crespi]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The flexibility of Airflow plays a pivotal role in enabling decentralized data architectures and empowering cross-functional teams.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we speak with </span><a href="https://www.linkedin.com/in/crespialberto/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Alberto Crespi</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Data Architect at </span><a href="http://lastminute.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">lastminute.com</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, who shares how his team scales Airflow across 12 teams while supporting both vertical and horizontal structures under a data mesh approach.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(02:17) Defining responsibilities within data architecture teams.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(04:15) Consolidating multiple orchestrators into a single solution.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(07:00) Scaling Airflow environments with shared infrastructure and DevOps practices.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(10:59) Managing dependencies and readiness using SQL sensors.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(14:23) Enhancing visibility and response through Slack-integrated monitoring.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(19:28) Extending Airflow’s flexibility to run legacy systems.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(22:28) Integrating transformation tools into orchestrated pipelines.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(25:54) Enabling non-engineers to contribute to pipeline development.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(27:33) Fostering adoption through collaboration and communication.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/crespialberto/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Alberto Crespi</a></p><p>https://www.linkedin.com/in/crespialberto/</p><p><br></p><p><a href="https://lastminute.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">lastminute.com</a> | Website</p><p>https://lastminute.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.getdbt.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbt Labs</a></p><p>https://www.getdbt.com/</p><p><br></p><p><a href="https://github.com/astronomer/astronomer-cosmos" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer Cosmos</a></p><p>https://github.com/astronomer/astronomer-cosmos</p><p><br></p><p><a href="https://about.gitlab.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">GitLab</a><a href="https://slack.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Slack</a></p><p>https://slack.com/</p><p><br></p><p><a href="https://kubernetes.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a></p><p>https://kubernetes.io/</p><p><br></p><p><a href="https://www.atlassian.com/software/confluence" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Confluence</a></p><p>https://www.atlassian.com/software/confluence</p><p><br></p><p><a href="https://slack.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Slack</a></p><p>https://slack.com/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/london/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/london/</a></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;&nbsp;&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/new-york/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/new-york/</a></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/sydney/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/sydney/</a></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/san-francisco/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/san-francisco/</a></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/chicago/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/chicago/</a></p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/c00a8a35-54be-4a16-8fac-60522ae1fff1/cdecc72a99.jpg" />
  <pubDate>Fri, 20 Jun 2025 02:18:19 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="28949605" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/c00a8a35-54be-4a16-8fac-60522ae1fff1/episode.mp3" />
  <itunes:title><![CDATA[Embracing Data Mesh and SQL Sensors for Scalable Workflows at lastminute.com with Alberto Crespi]]></itunes:title>
  <itunes:duration>30:09</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The flexibility of Airflow plays a pivotal role in enabling decentralized data architectures and empowering cross-functional teams.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we speak with </span><a href="https://www.linkedin.com/in/crespialberto/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Alberto Crespi</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Data Architect at </span><a href="http://lastminute.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">lastminute.com</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, who shares how his team scales Airflow across 12 teams while supporting both vertical and horizontal structures under a data mesh approach.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(02:17) Defining responsibilities within data architecture teams.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(04:15) Consolidating multiple orchestrators into a single solution.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(07:00) Scaling Airflow environments with shared infrastructure and DevOps practices.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(10:59) Managing dependencies and readiness using SQL sensors.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(14:23) Enhancing visibility and response through Slack-integrated monitoring.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(19:28) Extending Airflow’s flexibility to run legacy systems.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(22:28) Integrating transformation tools into orchestrated pipelines.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(25:54) Enabling non-engineers to contribute to pipeline development.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(27:33) Fostering adoption through collaboration and communication.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/crespialberto/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Alberto Crespi</a></p><p>https://www.linkedin.com/in/crespialberto/</p><p><br></p><p><a href="https://lastminute.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">lastminute.com</a> | Website</p><p>https://lastminute.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.getdbt.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbt Labs</a></p><p>https://www.getdbt.com/</p><p><br></p><p><a href="https://github.com/astronomer/astronomer-cosmos" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer Cosmos</a></p><p>https://github.com/astronomer/astronomer-cosmos</p><p><br></p><p><a href="https://about.gitlab.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">GitLab</a><a href="https://slack.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Slack</a></p><p>https://slack.com/</p><p><br></p><p><a href="https://kubernetes.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a></p><p>https://kubernetes.io/</p><p><br></p><p><a href="https://www.atlassian.com/software/confluence" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Confluence</a></p><p>https://www.atlassian.com/software/confluence</p><p><br></p><p><a href="https://slack.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Slack</a></p><p>https://slack.com/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/london/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/london/</a></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;&nbsp;&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/new-york/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/new-york/</a></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/sydney/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/sydney/</a></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/san-francisco/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/san-francisco/</a></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/chicago/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/chicago/</a></p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The flexibility of Airflow plays a pivotal role in enabling decentralized data architectures and empowering cross-functional teams.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we speak with </span><a href="https://www.linkedin.com/in/crespialberto/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Alberto Crespi</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Data Architect at </span><a href="http://lastminute.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">lastminute.com</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, who shares how his team scales Airflow across 12 teams while supporting both vertical and horizontal structures under a data mesh approach.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(02:17) Defining responsibilities within data architecture teams.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(04:15) Consolidating multiple orchestrators into a single solution.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(07:00) Scaling Airflow environments with shared infrastructure and DevOps practices.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(10:59) Managing dependencies and readiness using SQL sensors.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(14:23) Enhancing visibility and response through Slack-integrated monitoring.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(19:28) Extending Airflow’s flexibility to run legacy systems.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(22:28) Integrating transformation tools into orchestrated pipelines.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(25:54) Enabling non-engineers to contribute to pipeline development.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(27:33) Fostering adoption through collaboration and communication.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/crespialberto/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Alberto Crespi</a></p><p>https://www.linkedin.com/in/crespialberto/</p><p><br></p><p><a href="https://lastminute.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">lastminute.com</a> | Website</p><p>https://lastminute.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.getdbt.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbt Labs</a></p><p>https://www.getdbt.com/</p><p><br></p><p><a href="https://github.com/astronomer/astronomer-cosmos" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer Cosmos</a></p><p>https://github.com/astronomer/astronomer-cosmos</p><p><br></p><p><a href="https://about.gitlab.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">GitLab</a><a href="https://slack.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Slack</a></p><p>https://slack.com/</p><p><br></p><p><a href="https://kubernetes.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a></p><p>https://kubernetes.io/</p><p><br></p><p><a href="https://www.atlassian.com/software/confluence" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Confluence</a></p><p>https://www.atlassian.com/software/confluence</p><p><br></p><p><a href="https://slack.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Slack</a></p><p>https://slack.com/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/london/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/london/</a></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;&nbsp;&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/new-york/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/new-york/</a></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/sydney/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/sydney/</a></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/san-francisco/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/san-francisco/</a></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/chicago/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/chicago/</a></p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[The flexibility of Airflow plays a pivotal role in enabling decentralized data architectures and empowering cross-functional teams.In this episode, we speak with Alberto Crespi, Data Architect at lastminute.com, who shares how his team scales Airfl...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>45</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[69a6710a-747d-48ba-adf8-413c5a4b523a]]></guid>
  <title><![CDATA[The AI-Ready Pipeline: Reimagining Airflow at Veyer® Logistics with Anu Pabla]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Innovation in orchestration is redefining how engineers approach both traditional ETL pipelines and emerging AI workloads. Understanding how to harness Airflow’s flexibility and observability is essential for teams navigating today’s evolving data landscape.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/atomicap/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Anu Pabla</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Principal Engineer at </span><a href="https://www.linkedin.com/company/the-odp-corporation/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">The ODP Corporation</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins us to discuss her journey from legacy orchestration patterns to AI-native pipelines and why she sees Airflow as the future of AI workload orchestration.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(03:43) Engaging with external technology communities fosters innovation.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(05:05) Mentoring early-career engineers builds confidence in a complex tech landscape.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(07:51) Orchestration patterns continue to evolve with modern data needs.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(08:41) Managing AI workflows requires structured and flexible orchestration.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(10:35) High-quality, meaningful data remains foundational across use cases.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(15:08) Community-driven open source tools offer lasting value.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(16:59) Self-healing systems support both legacy and AI pipelines.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(20:20) Orchestration platforms can drive future AI-native workloads.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/atomicap/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Anu Pabla</a></p><p>https://www.linkedin.com/in/atomicap/</p><p><br></p><p><a href="https://www.linkedin.com/company/the-odp-corporation/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">The ODP Corporation</a></p><p>https://www.linkedin.com/company/the-odp-corporation/</p><p><br></p><p><a href="https://www.theodpcorp.com/homepage" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">The ODP Corporation</a> | Website</p><p>https://www.theodpcorp.com/homepage</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.llamaindex.ai/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">LlamaIndex</a></p><p>https://www.llamaindex.ai/</p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/london/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/new-york/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/sydney/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/san-francisco/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/chicago/</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/9209aa62-5b90-4f3f-ab40-bb4a12888f06/13e9fcfcb0.jpg" />
  <pubDate>Thu, 12 Jun 2025 06:48:03 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="22431532" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/9209aa62-5b90-4f3f-ab40-bb4a12888f06/episode.mp3" />
  <itunes:title><![CDATA[The AI-Ready Pipeline: Reimagining Airflow at Veyer® Logistics with Anu Pabla]]></itunes:title>
  <itunes:duration>23:21</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Innovation in orchestration is redefining how engineers approach both traditional ETL pipelines and emerging AI workloads. Understanding how to harness Airflow’s flexibility and observability is essential for teams navigating today’s evolving data landscape.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/atomicap/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Anu Pabla</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Principal Engineer at </span><a href="https://www.linkedin.com/company/the-odp-corporation/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">The ODP Corporation</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins us to discuss her journey from legacy orchestration patterns to AI-native pipelines and why she sees Airflow as the future of AI workload orchestration.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(03:43) Engaging with external technology communities fosters innovation.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(05:05) Mentoring early-career engineers builds confidence in a complex tech landscape.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(07:51) Orchestration patterns continue to evolve with modern data needs.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(08:41) Managing AI workflows requires structured and flexible orchestration.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(10:35) High-quality, meaningful data remains foundational across use cases.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(15:08) Community-driven open source tools offer lasting value.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(16:59) Self-healing systems support both legacy and AI pipelines.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(20:20) Orchestration platforms can drive future AI-native workloads.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/atomicap/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Anu Pabla</a></p><p>https://www.linkedin.com/in/atomicap/</p><p><br></p><p><a href="https://www.linkedin.com/company/the-odp-corporation/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">The ODP Corporation</a></p><p>https://www.linkedin.com/company/the-odp-corporation/</p><p><br></p><p><a href="https://www.theodpcorp.com/homepage" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">The ODP Corporation</a> | Website</p><p>https://www.theodpcorp.com/homepage</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.llamaindex.ai/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">LlamaIndex</a></p><p>https://www.llamaindex.ai/</p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/london/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/new-york/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/sydney/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/san-francisco/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/chicago/</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Innovation in orchestration is redefining how engineers approach both traditional ETL pipelines and emerging AI workloads. Understanding how to harness Airflow’s flexibility and observability is essential for teams navigating today’s evolving data landscape.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, </span><a href="https://www.linkedin.com/in/atomicap/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Anu Pabla</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Principal Engineer at </span><a href="https://www.linkedin.com/company/the-odp-corporation/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">The ODP Corporation</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins us to discuss her journey from legacy orchestration patterns to AI-native pipelines and why she sees Airflow as the future of AI workload orchestration.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(03:43) Engaging with external technology communities fosters innovation.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(05:05) Mentoring early-career engineers builds confidence in a complex tech landscape.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(07:51) Orchestration patterns continue to evolve with modern data needs.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(08:41) Managing AI workflows requires structured and flexible orchestration.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(10:35) High-quality, meaningful data remains foundational across use cases.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(15:08) Community-driven open source tools offer lasting value.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(16:59) Self-healing systems support both legacy and AI pipelines.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(20:20) Orchestration platforms can drive future AI-native workloads.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/atomicap/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Anu Pabla</a></p><p>https://www.linkedin.com/in/atomicap/</p><p><br></p><p><a href="https://www.linkedin.com/company/the-odp-corporation/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">The ODP Corporation</a></p><p>https://www.linkedin.com/company/the-odp-corporation/</p><p><br></p><p><a href="https://www.theodpcorp.com/homepage" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">The ODP Corporation</a> | Website</p><p>https://www.theodpcorp.com/homepage</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.llamaindex.ai/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">LlamaIndex</a></p><p>https://www.llamaindex.ai/</p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/london/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/new-york/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/sydney/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/san-francisco/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/chicago/</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Innovation in orchestration is redefining how engineers approach both traditional ETL pipelines and emerging AI workloads. Understanding how to harness Airflow’s flexibility and observability is essential for teams navigating today’s evolving data ...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>44</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[ec1ab1c3-97e7-4840-8e8c-ba5f6561d4ee]]></guid>
  <title><![CDATA[Streamlining AI and ML Operations at IBM with BJ Adesoji and Ryan Yackel]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The orchestration layer is foundational to building robust AI- and ML-powered data pipelines, especially in complex hybrid enterprise environments. IBM’s partnership with Astronomer reflects a strategic alignment to simplify and scale Airflow-based workflows across industries.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we’re joined by </span><a href="https://www.linkedin.com/company/databand-ai/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">IBM</a><span style="background-color: transparent; color: rgb(22, 14, 61);">’s Senior Product Manager, </span><a href="https://www.linkedin.com/in/bj-soji/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">BJ Adesoji</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, and </span>GTM PM and Growth Leader<span style="background-color: transparent; color: rgb(22, 14, 61);">, </span><a href="https://www.linkedin.com/in/ryanyackel/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ryan Yackel</a><span style="background-color: transparent; color: rgb(22, 14, 61);">. We discuss how IBM customers are using Airflow in production, the challenges they face at scale and what the new IBM–Astronomer collaboration unlocks.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(03:09) The growing importance of orchestration tools in enterprise environments.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(04:48) How organizations are expanding orchestration beyond traditional use cases.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(05:24) Common patterns across industries adopting orchestration platforms.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(07:16) Why orchestration is essential for supporting business-critical workloads.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(10:00) The role of orchestration in compliance and regulatory processes.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(13:02) Challenges enterprises face when managing orchestration infrastructure.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(14:58) Opportunities to simplify and centralize orchestration at scale.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(19:11) The value of integrating orchestration with broader data toolchains.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(20:54) How AI is shaping the future of orchestrated data workflows.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/bj-soji/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">BJ Adesoji</a></p><p>https://www.linkedin.com/in/bj-soji/</p><p><br></p><p><a href="https://www.linkedin.com/in/ryanyackel/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ryan Yackel</a></p><p>https://www.linkedin.com/in/ryanyackel/</p><p><br></p><p><a href="https://www.linkedin.com/company/databand-ai/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">IBM</a> | LinkedIn</p><p>https://www.linkedin.com/company/databand-ai/</p><p><br></p><p><a href="https://www.ibm.com/products/databand" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">IBM Databand</a></p><p>https://www.ibm.com/products/databand</p><p><br></p><p><a href="https://www.ibm.com/products/datastage" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">IBM DataStage</a></p><p>https://www.ibm.com/products/datastage</p><p><br></p><p><a href="https://www.ibm.com/products/watsonx-governance" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">IBM watsonx.governance</a></p><p>https://www.ibm.com/products/watsonx-governance</p><p><br></p><p><a href="https://www.ibm.com/products/knowledge-catalog" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">IBM Knowledge Catalog</a></p><p>https://www.ibm.com/products/knowledge-catalog</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.ibm.com/products/watsonx-orchestrate" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">watsonx Orchestrate</a></p><p>https://www.ibm.com/products/watsonx-orchestrate</p><p><br></p><p><a href="https://domino.ai/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Domino</a></p><p>https://domino.ai/</p><p><br></p><p><a href="https://www.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a></p><p>https://www.astronomer.io/</p><p><br></p><p><a href="https://www.snowflake.com/en/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Snowflake</a></p><p>https://www.snowflake.com/en/</p><p><br></p><p><a href="https://www.getdbt.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbt Labs</a></p><p>https://www.getdbt.com/</p><p><br></p><p><a href="https://aws.amazon.com/sagemaker/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Amazon SageMaker</a></p><p>https://aws.amazon.com/sagemaker/</p><p><br></p><p><a href="https://www.cloudera.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Cloudera</a></p><p>https://www.cloudera.com/</p><p><br></p><p><a href="https://www.mongodb.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">MongoDB</a></p><p>https://www.mongodb.com/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/london/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/london/</a><span style="background-color: transparent;">&nbsp;</span></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/new-york/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/new-york/</a><span style="background-color: transparent;">&nbsp;</span></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/sydney/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/sydney/</a><span style="background-color: transparent;">&nbsp;</span></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/san-francisco/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/san-francisco/</a><span style="background-color: transparent;">&nbsp;</span></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/chicago/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/chicago/</a><span style="background-color: transparent;">&nbsp;</span></p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/3a310dd3-4e75-4f68-8dca-e99e9b52fa31/af4467012a.jpg" />
  <pubDate>Thu, 05 Jun 2025 00:00:00 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="23750193" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/3a310dd3-4e75-4f68-8dca-e99e9b52fa31/episode.mp3" />
  <itunes:title><![CDATA[Streamlining AI and ML Operations at IBM with BJ Adesoji and Ryan Yackel]]></itunes:title>
  <itunes:duration>24:44</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The orchestration layer is foundational to building robust AI- and ML-powered data pipelines, especially in complex hybrid enterprise environments. IBM’s partnership with Astronomer reflects a strategic alignment to simplify and scale Airflow-based workflows across industries.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we’re joined by </span><a href="https://www.linkedin.com/company/databand-ai/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">IBM</a><span style="background-color: transparent; color: rgb(22, 14, 61);">’s Senior Product Manager, </span><a href="https://www.linkedin.com/in/bj-soji/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">BJ Adesoji</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, and </span>GTM PM and Growth Leader<span style="background-color: transparent; color: rgb(22, 14, 61);">, </span><a href="https://www.linkedin.com/in/ryanyackel/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ryan Yackel</a><span style="background-color: transparent; color: rgb(22, 14, 61);">. We discuss how IBM customers are using Airflow in production, the challenges they face at scale and what the new IBM–Astronomer collaboration unlocks.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(03:09) The growing importance of orchestration tools in enterprise environments.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(04:48) How organizations are expanding orchestration beyond traditional use cases.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(05:24) Common patterns across industries adopting orchestration platforms.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(07:16) Why orchestration is essential for supporting business-critical workloads.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(10:00) The role of orchestration in compliance and regulatory processes.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(13:02) Challenges enterprises face when managing orchestration infrastructure.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(14:58) Opportunities to simplify and centralize orchestration at scale.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(19:11) The value of integrating orchestration with broader data toolchains.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(20:54) How AI is shaping the future of orchestrated data workflows.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/bj-soji/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">BJ Adesoji</a></p><p>https://www.linkedin.com/in/bj-soji/</p><p><br></p><p><a href="https://www.linkedin.com/in/ryanyackel/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ryan Yackel</a></p><p>https://www.linkedin.com/in/ryanyackel/</p><p><br></p><p><a href="https://www.linkedin.com/company/databand-ai/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">IBM</a> | LinkedIn</p><p>https://www.linkedin.com/company/databand-ai/</p><p><br></p><p><a href="https://www.ibm.com/products/databand" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">IBM Databand</a></p><p>https://www.ibm.com/products/databand</p><p><br></p><p><a href="https://www.ibm.com/products/datastage" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">IBM DataStage</a></p><p>https://www.ibm.com/products/datastage</p><p><br></p><p><a href="https://www.ibm.com/products/watsonx-governance" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">IBM watsonx.governance</a></p><p>https://www.ibm.com/products/watsonx-governance</p><p><br></p><p><a href="https://www.ibm.com/products/knowledge-catalog" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">IBM Knowledge Catalog</a></p><p>https://www.ibm.com/products/knowledge-catalog</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.ibm.com/products/watsonx-orchestrate" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">watsonx Orchestrate</a></p><p>https://www.ibm.com/products/watsonx-orchestrate</p><p><br></p><p><a href="https://domino.ai/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Domino</a></p><p>https://domino.ai/</p><p><br></p><p><a href="https://www.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a></p><p>https://www.astronomer.io/</p><p><br></p><p><a href="https://www.snowflake.com/en/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Snowflake</a></p><p>https://www.snowflake.com/en/</p><p><br></p><p><a href="https://www.getdbt.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbt Labs</a></p><p>https://www.getdbt.com/</p><p><br></p><p><a href="https://aws.amazon.com/sagemaker/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Amazon SageMaker</a></p><p>https://aws.amazon.com/sagemaker/</p><p><br></p><p><a href="https://www.cloudera.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Cloudera</a></p><p>https://www.cloudera.com/</p><p><br></p><p><a href="https://www.mongodb.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">MongoDB</a></p><p>https://www.mongodb.com/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/london/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/london/</a><span style="background-color: transparent;">&nbsp;</span></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/new-york/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/new-york/</a><span style="background-color: transparent;">&nbsp;</span></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/sydney/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/sydney/</a><span style="background-color: transparent;">&nbsp;</span></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/san-francisco/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/san-francisco/</a><span style="background-color: transparent;">&nbsp;</span></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/chicago/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/chicago/</a><span style="background-color: transparent;">&nbsp;</span></p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">The orchestration layer is foundational to building robust AI- and ML-powered data pipelines, especially in complex hybrid enterprise environments. IBM’s partnership with Astronomer reflects a strategic alignment to simplify and scale Airflow-based workflows across industries.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, we’re joined by </span><a href="https://www.linkedin.com/company/databand-ai/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">IBM</a><span style="background-color: transparent; color: rgb(22, 14, 61);">’s Senior Product Manager, </span><a href="https://www.linkedin.com/in/bj-soji/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">BJ Adesoji</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, and </span>GTM PM and Growth Leader<span style="background-color: transparent; color: rgb(22, 14, 61);">, </span><a href="https://www.linkedin.com/in/ryanyackel/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ryan Yackel</a><span style="background-color: transparent; color: rgb(22, 14, 61);">. We discuss how IBM customers are using Airflow in production, the challenges they face at scale and what the new IBM–Astronomer collaboration unlocks.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(03:09) The growing importance of orchestration tools in enterprise environments.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(04:48) How organizations are expanding orchestration beyond traditional use cases.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(05:24) Common patterns across industries adopting orchestration platforms.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(07:16) Why orchestration is essential for supporting business-critical workloads.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(10:00) The role of orchestration in compliance and regulatory processes.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(13:02) Challenges enterprises face when managing orchestration infrastructure.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(14:58) Opportunities to simplify and centralize orchestration at scale.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(19:11) The value of integrating orchestration with broader data toolchains.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(20:54) How AI is shaping the future of orchestrated data workflows.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/bj-soji/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">BJ Adesoji</a></p><p>https://www.linkedin.com/in/bj-soji/</p><p><br></p><p><a href="https://www.linkedin.com/in/ryanyackel/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ryan Yackel</a></p><p>https://www.linkedin.com/in/ryanyackel/</p><p><br></p><p><a href="https://www.linkedin.com/company/databand-ai/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">IBM</a> | LinkedIn</p><p>https://www.linkedin.com/company/databand-ai/</p><p><br></p><p><a href="https://www.ibm.com/products/databand" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">IBM Databand</a></p><p>https://www.ibm.com/products/databand</p><p><br></p><p><a href="https://www.ibm.com/products/datastage" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">IBM DataStage</a></p><p>https://www.ibm.com/products/datastage</p><p><br></p><p><a href="https://www.ibm.com/products/watsonx-governance" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">IBM watsonx.governance</a></p><p>https://www.ibm.com/products/watsonx-governance</p><p><br></p><p><a href="https://www.ibm.com/products/knowledge-catalog" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">IBM Knowledge Catalog</a></p><p>https://www.ibm.com/products/knowledge-catalog</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.ibm.com/products/watsonx-orchestrate" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">watsonx Orchestrate</a></p><p>https://www.ibm.com/products/watsonx-orchestrate</p><p><br></p><p><a href="https://domino.ai/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Domino</a></p><p>https://domino.ai/</p><p><br></p><p><a href="https://www.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a></p><p>https://www.astronomer.io/</p><p><br></p><p><a href="https://www.snowflake.com/en/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Snowflake</a></p><p>https://www.snowflake.com/en/</p><p><br></p><p><a href="https://www.getdbt.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbt Labs</a></p><p>https://www.getdbt.com/</p><p><br></p><p><a href="https://aws.amazon.com/sagemaker/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Amazon SageMaker</a></p><p>https://aws.amazon.com/sagemaker/</p><p><br></p><p><a href="https://www.cloudera.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Cloudera</a></p><p>https://www.cloudera.com/</p><p><br></p><p><a href="https://www.mongodb.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">MongoDB</a></p><p>https://www.mongodb.com/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/london/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/london/</a><span style="background-color: transparent;">&nbsp;</span></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/new-york/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/new-york/</a><span style="background-color: transparent;">&nbsp;</span></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/sydney/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/sydney/</a><span style="background-color: transparent;">&nbsp;</span></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/san-francisco/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/san-francisco/</a><span style="background-color: transparent;">&nbsp;</span></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/chicago/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/chicago/</a><span style="background-color: transparent;">&nbsp;</span></p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[The orchestration layer is foundational to building robust AI- and ML-powered data pipelines, especially in complex hybrid enterprise environments. IBM’s partnership with Astronomer reflects a strategic alignment to simplify and scale Airflow-based...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>43</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[e9412b45-e1f5-4f3d-8f5f-d4ce478b8379]]></guid>
  <title><![CDATA[Inside the Custom Framework for Managing Airflow Code at Wix with Gil Reich]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Efficient orchestration and maintainability are crucial for data engineering at scale. </span><a href="https://www.linkedin.com/in/gilreich/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Gil Reich</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Data Developer for Data Science at </span><a href="https://www.linkedin.com/company/wix-com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Wix</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, shares how his team reduced code duplication, standardized pipelines, and improved Airflow task orchestration using a Python-based framework built within the data science team.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, Gil explains how this internal framework simplifies DAG creation, improves documentation accuracy, and enables consistent task generation for machine learning pipelines. He also shares lessons from complex DAG optimization and maintaining testable code.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(03:23) Code duplication creates long-term problems.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(08:16) Frameworks bring order to complex pipelines.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(09:41) Shared functions cut down repetitive code.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(17:18) Auto-generated docs stay accurate by design.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(22:40) On-demand DAGs support real-time workflows.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(25:08) Task-level sensors improve run efficiency.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(27:40) Combine local runs with automated tests.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(30:09) Clean code helps teams scale faster.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/gilreich/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Gil Reich</a></p><p>https://www.linkedin.com/in/gilreich/</p><p><br></p><p><a href="https://www.linkedin.com/company/wix-com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Wix</a> | LinkedIn</p><p>https://www.linkedin.com/company/wix-com/</p><p><br></p><p><a href="https://www.wix.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Wix</a> | Website</p><p>https://www.wix.com/</p><p><br></p><p><a href="https://airflowsummit.org/slides/2024/92-refactoring-dags.pdf" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">DS DAG Framework</a></p><p>https://airflowsummit.org/slides/2024/92-refactoring-dags.pdf</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/london/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/new-york/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/sydney/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/san-francisco/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/chicago/</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/7df6dbf0-3b70-498b-9473-e29d2442e204/dd2fdb0d00.jpg" />
  <pubDate>Thu, 29 May 2025 00:00:00 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="29794301" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/7df6dbf0-3b70-498b-9473-e29d2442e204/episode.mp3" />
  <itunes:title><![CDATA[Inside the Custom Framework for Managing Airflow Code at Wix with Gil Reich]]></itunes:title>
  <itunes:duration>31:02</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Efficient orchestration and maintainability are crucial for data engineering at scale. </span><a href="https://www.linkedin.com/in/gilreich/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Gil Reich</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Data Developer for Data Science at </span><a href="https://www.linkedin.com/company/wix-com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Wix</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, shares how his team reduced code duplication, standardized pipelines, and improved Airflow task orchestration using a Python-based framework built within the data science team.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, Gil explains how this internal framework simplifies DAG creation, improves documentation accuracy, and enables consistent task generation for machine learning pipelines. He also shares lessons from complex DAG optimization and maintaining testable code.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(03:23) Code duplication creates long-term problems.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(08:16) Frameworks bring order to complex pipelines.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(09:41) Shared functions cut down repetitive code.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(17:18) Auto-generated docs stay accurate by design.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(22:40) On-demand DAGs support real-time workflows.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(25:08) Task-level sensors improve run efficiency.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(27:40) Combine local runs with automated tests.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(30:09) Clean code helps teams scale faster.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/gilreich/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Gil Reich</a></p><p>https://www.linkedin.com/in/gilreich/</p><p><br></p><p><a href="https://www.linkedin.com/company/wix-com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Wix</a> | LinkedIn</p><p>https://www.linkedin.com/company/wix-com/</p><p><br></p><p><a href="https://www.wix.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Wix</a> | Website</p><p>https://www.wix.com/</p><p><br></p><p><a href="https://airflowsummit.org/slides/2024/92-refactoring-dags.pdf" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">DS DAG Framework</a></p><p>https://airflowsummit.org/slides/2024/92-refactoring-dags.pdf</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/london/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/new-york/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/sydney/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/san-francisco/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/chicago/</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Efficient orchestration and maintainability are crucial for data engineering at scale. </span><a href="https://www.linkedin.com/in/gilreich/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Gil Reich</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Data Developer for Data Science at </span><a href="https://www.linkedin.com/company/wix-com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Wix</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, shares how his team reduced code duplication, standardized pipelines, and improved Airflow task orchestration using a Python-based framework built within the data science team.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">In this episode, Gil explains how this internal framework simplifies DAG creation, improves documentation accuracy, and enables consistent task generation for machine learning pipelines. He also shares lessons from complex DAG optimization and maintaining testable code.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(03:23) Code duplication creates long-term problems.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(08:16) Frameworks bring order to complex pipelines.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(09:41) Shared functions cut down repetitive code.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(17:18) Auto-generated docs stay accurate by design.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(22:40) On-demand DAGs support real-time workflows.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(25:08) Task-level sensors improve run efficiency.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(27:40) Combine local runs with automated tests.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(30:09) Clean code helps teams scale faster.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/gilreich/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Gil Reich</a></p><p>https://www.linkedin.com/in/gilreich/</p><p><br></p><p><a href="https://www.linkedin.com/company/wix-com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Wix</a> | LinkedIn</p><p>https://www.linkedin.com/company/wix-com/</p><p><br></p><p><a href="https://www.wix.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Wix</a> | Website</p><p>https://www.wix.com/</p><p><br></p><p><a href="https://airflowsummit.org/slides/2024/92-refactoring-dags.pdf" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">DS DAG Framework</a></p><p>https://airflowsummit.org/slides/2024/92-refactoring-dags.pdf</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/london/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/new-york/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/sydney/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/san-francisco/&nbsp;&nbsp;</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/chicago/</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Efficient orchestration and maintainability are crucial for data engineering at scale. Gil Reich, Data Developer for Data Science at Wix, shares how his team reduced code duplication, standardized pipelines, and improved Airflow task orchestration ...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>42</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[569743f1-2c08-4c37-a04e-5b186a5b7de3]]></guid>
  <title><![CDATA[Modernizing Legacy Data Systems With Airflow at Procter & Gamble with Adonis Castillo Cordero]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Legacy architecture and AI workloads pose unique challenges at scale, especially in a global enterprise with complex data systems. In this episode, we explore strategies to proactively monitor and optimize pipelines while minimizing downstream failures.</span></p><p><br></p><p><a href="https://www.linkedin.com/in/adoniscc/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Adonis Castillo Cordero</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Automation Manager at </span><a href="https://www.linkedin.com/company/procter-and-gamble/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Procter &amp; Gamble</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins us to share actionable best practices for dependency mapping, anomaly detection and architecture simplification using Apache Airflow.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(03:13) Integrating legacy data systems into modern architecture.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(05:51) Designing workflows for real-time data processing.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(07:57) Mapping dependencies early to avoid pipeline failures.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(09:02) Building automated monitoring into orchestration frameworks.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(12:09) Detecting anomalies to prevent performance bottlenecks.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(15:24) Monitoring data quality to catch silent failures.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(17:02) Prioritizing responses based on impact severity.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(18:55) Simplifying dashboards to highlight critical metrics.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/adoniscc/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Adonis Castillo Cordero</a></p><p>https://www.linkedin.com/in/adoniscc/</p><p><br></p><p><a href="https://www.linkedin.com/company/procter-and-gamble/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Procter &amp; Gamble</a> | LinkedIn</p><p>https://www.linkedin.com/company/procter-and-gamble/</p><p><br></p><p><a href="http://www.pg.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Procter &amp; Gamble</a> | Website</p><p>http://www.pg.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://openlineage.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">OpenLineage</a></p><p>https://openlineage.io/</p><p><br></p><p><a href="https://azure.microsoft.com/en-us/products/monitor/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Azure Monitor</a></p><p>https://azure.microsoft.com/en-us/products/monitor/</p><p><br></p><p><a href="https://aws.amazon.com/lookout-for-metrics/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">AWS Lookout for Metrics</a></p><p>https://aws.amazon.com/lookout-for-metrics/</p><p><br></p><p><a href="https://www.montecarlodata.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Monte Carlo</a></p><p>https://www.montecarlodata.com/</p><p><br></p><p><a href="https://greatexpectations.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Great Expectations</a></p><p>https://greatexpectations.io/</p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/london/</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;&nbsp;</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/new-york/</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;&nbsp;</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/sydney/&nbsp;</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/san-francisco/&nbsp;</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/chicago/</span></p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/1cc5225f-2d2e-4a34-9454-e92410c2c6cb/167f613ab1.jpg" />
  <pubDate>Thu, 22 May 2025 00:00:00 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="21334389" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/1cc5225f-2d2e-4a34-9454-e92410c2c6cb/episode.mp3" />
  <itunes:title><![CDATA[Modernizing Legacy Data Systems With Airflow at Procter & Gamble with Adonis Castillo Cordero]]></itunes:title>
  <itunes:duration>22:13</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Legacy architecture and AI workloads pose unique challenges at scale, especially in a global enterprise with complex data systems. In this episode, we explore strategies to proactively monitor and optimize pipelines while minimizing downstream failures.</span></p><p><br></p><p><a href="https://www.linkedin.com/in/adoniscc/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Adonis Castillo Cordero</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Automation Manager at </span><a href="https://www.linkedin.com/company/procter-and-gamble/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Procter &amp; Gamble</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins us to share actionable best practices for dependency mapping, anomaly detection and architecture simplification using Apache Airflow.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(03:13) Integrating legacy data systems into modern architecture.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(05:51) Designing workflows for real-time data processing.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(07:57) Mapping dependencies early to avoid pipeline failures.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(09:02) Building automated monitoring into orchestration frameworks.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(12:09) Detecting anomalies to prevent performance bottlenecks.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(15:24) Monitoring data quality to catch silent failures.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(17:02) Prioritizing responses based on impact severity.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(18:55) Simplifying dashboards to highlight critical metrics.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/adoniscc/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Adonis Castillo Cordero</a></p><p>https://www.linkedin.com/in/adoniscc/</p><p><br></p><p><a href="https://www.linkedin.com/company/procter-and-gamble/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Procter &amp; Gamble</a> | LinkedIn</p><p>https://www.linkedin.com/company/procter-and-gamble/</p><p><br></p><p><a href="http://www.pg.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Procter &amp; Gamble</a> | Website</p><p>http://www.pg.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://openlineage.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">OpenLineage</a></p><p>https://openlineage.io/</p><p><br></p><p><a href="https://azure.microsoft.com/en-us/products/monitor/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Azure Monitor</a></p><p>https://azure.microsoft.com/en-us/products/monitor/</p><p><br></p><p><a href="https://aws.amazon.com/lookout-for-metrics/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">AWS Lookout for Metrics</a></p><p>https://aws.amazon.com/lookout-for-metrics/</p><p><br></p><p><a href="https://www.montecarlodata.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Monte Carlo</a></p><p>https://www.montecarlodata.com/</p><p><br></p><p><a href="https://greatexpectations.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Great Expectations</a></p><p>https://greatexpectations.io/</p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/london/</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;&nbsp;</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/new-york/</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;&nbsp;</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/sydney/&nbsp;</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/san-francisco/&nbsp;</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/chicago/</span></p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Legacy architecture and AI workloads pose unique challenges at scale, especially in a global enterprise with complex data systems. In this episode, we explore strategies to proactively monitor and optimize pipelines while minimizing downstream failures.</span></p><p><br></p><p><a href="https://www.linkedin.com/in/adoniscc/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Adonis Castillo Cordero</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Automation Manager at </span><a href="https://www.linkedin.com/company/procter-and-gamble/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Procter &amp; Gamble</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins us to share actionable best practices for dependency mapping, anomaly detection and architecture simplification using Apache Airflow.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(03:13) Integrating legacy data systems into modern architecture.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(05:51) Designing workflows for real-time data processing.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(07:57) Mapping dependencies early to avoid pipeline failures.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(09:02) Building automated monitoring into orchestration frameworks.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(12:09) Detecting anomalies to prevent performance bottlenecks.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(15:24) Monitoring data quality to catch silent failures.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(17:02) Prioritizing responses based on impact severity.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(18:55) Simplifying dashboards to highlight critical metrics.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/adoniscc/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Adonis Castillo Cordero</a></p><p>https://www.linkedin.com/in/adoniscc/</p><p><br></p><p><a href="https://www.linkedin.com/company/procter-and-gamble/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Procter &amp; Gamble</a> | LinkedIn</p><p>https://www.linkedin.com/company/procter-and-gamble/</p><p><br></p><p><a href="http://www.pg.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Procter &amp; Gamble</a> | Website</p><p>http://www.pg.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://openlineage.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">OpenLineage</a></p><p>https://openlineage.io/</p><p><br></p><p><a href="https://azure.microsoft.com/en-us/products/monitor/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Azure Monitor</a></p><p>https://azure.microsoft.com/en-us/products/monitor/</p><p><br></p><p><a href="https://aws.amazon.com/lookout-for-metrics/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">AWS Lookout for Metrics</a></p><p>https://aws.amazon.com/lookout-for-metrics/</p><p><br></p><p><a href="https://www.montecarlodata.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Monte Carlo</a></p><p>https://www.montecarlodata.com/</p><p><br></p><p><a href="https://greatexpectations.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Great Expectations</a></p><p>https://greatexpectations.io/</p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/london/</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;&nbsp;</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/new-york/</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;&nbsp;</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/sydney/&nbsp;</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/san-francisco/&nbsp;</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">https://www.astronomer.io/events/roadshow/chicago/</span></p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Legacy architecture and AI workloads pose unique challenges at scale, especially in a global enterprise with complex data systems. In this episode, we explore strategies to proactively monitor and optimize pipelines while minimizing downstream fail...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>41</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[ecd018eb-1b18-4c9e-9e95-0c99214770b6]]></guid>
  <title><![CDATA[Building an End-to-End Data Observability System at Netflix with Joseph Machado]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Building reliable data pipelines starts with maintaining strong data quality standards and creating efficient systems for auditing, publishing and monitoring. In this episode, we explore the real-world patterns and best practices for ensuring data pipelines stay accurate, scalable and trustworthy.</span></p><p><br></p><p><a href="https://www.linkedin.com/in/josephmachado1991/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Joseph Machado</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Data Engineer at </span><a href="https://www.linkedin.com/company/netflix/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Netflix</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins us to share practical insights gleaned from supporting Netflix’s Ads business as well as over a decade of experience in the data engineering space. He discusses implementing audit publish patterns, building observability dashboards, defining in-band and separate data quality checks, and optimizing data validation across large-scale systems.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(03:14) Supporting data privacy and engineering efficiency within data systems.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(10:41) Validating outputs with reconciliation checks to catch transformation issues.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(16:06) Applying standardized patterns for auditing, validating and publishing data.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(19:28) Capturing historical check results to monitor system health and improvements.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(21:29) Treating data quality and availability as separate monitoring concerns.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(26:26) Using containerization strategies to streamline pipeline executions.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(29:47) Leveraging orchestration platforms for better visibility and retry capability.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(31:59) Managing business pressure without sacrificing data quality practices.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(35:46) Starting simple with quality checks and evolving toward more complex frameworks.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/josephmachado1991/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Joseph Machado</a></p><p>https://www.linkedin.com/in/josephmachado1991/</p><p><br></p><p><a href="https://www.linkedin.com/company/netflix/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Netflix</a> | LinkedIn</p><p>https://www.linkedin.com/company/netflix/</p><p><br></p><p><a href="https://www.netflix.com/browse" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Netflix</a> | Website</p><p>https://www.netflix.com/browse</p><p><br></p><p><a href="https://www.startdataengineering.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Start Data Engineering</a></p><p>https://www.startdataengineering.com/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.getdbt.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbt Labs</a></p><p>https://www.getdbt.com/</p><p><br></p><p><a href="https://greatexpectations.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Great Expectations</a></p><p>https://greatexpectations.io/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/london/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/london/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/new-york/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/new-york/</a></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/sydney/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/sydney/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/san-francisco/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/san-francisco/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/chicago/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/chicago/</a></p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/159862ea-d95c-47e3-a96f-acdbfeb94897/b2d0c1885d.jpg" />
  <pubDate>Thu, 15 May 2025 00:00:00 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="37356854" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/159862ea-d95c-47e3-a96f-acdbfeb94897/episode.mp3" />
  <itunes:title><![CDATA[Building an End-to-End Data Observability System at Netflix with Joseph Machado]]></itunes:title>
  <itunes:duration>38:54</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Building reliable data pipelines starts with maintaining strong data quality standards and creating efficient systems for auditing, publishing and monitoring. In this episode, we explore the real-world patterns and best practices for ensuring data pipelines stay accurate, scalable and trustworthy.</span></p><p><br></p><p><a href="https://www.linkedin.com/in/josephmachado1991/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Joseph Machado</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Data Engineer at </span><a href="https://www.linkedin.com/company/netflix/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Netflix</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins us to share practical insights gleaned from supporting Netflix’s Ads business as well as over a decade of experience in the data engineering space. He discusses implementing audit publish patterns, building observability dashboards, defining in-band and separate data quality checks, and optimizing data validation across large-scale systems.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(03:14) Supporting data privacy and engineering efficiency within data systems.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(10:41) Validating outputs with reconciliation checks to catch transformation issues.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(16:06) Applying standardized patterns for auditing, validating and publishing data.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(19:28) Capturing historical check results to monitor system health and improvements.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(21:29) Treating data quality and availability as separate monitoring concerns.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(26:26) Using containerization strategies to streamline pipeline executions.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(29:47) Leveraging orchestration platforms for better visibility and retry capability.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(31:59) Managing business pressure without sacrificing data quality practices.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(35:46) Starting simple with quality checks and evolving toward more complex frameworks.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/josephmachado1991/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Joseph Machado</a></p><p>https://www.linkedin.com/in/josephmachado1991/</p><p><br></p><p><a href="https://www.linkedin.com/company/netflix/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Netflix</a> | LinkedIn</p><p>https://www.linkedin.com/company/netflix/</p><p><br></p><p><a href="https://www.netflix.com/browse" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Netflix</a> | Website</p><p>https://www.netflix.com/browse</p><p><br></p><p><a href="https://www.startdataengineering.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Start Data Engineering</a></p><p>https://www.startdataengineering.com/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.getdbt.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbt Labs</a></p><p>https://www.getdbt.com/</p><p><br></p><p><a href="https://greatexpectations.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Great Expectations</a></p><p>https://greatexpectations.io/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/london/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/london/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/new-york/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/new-york/</a></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/sydney/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/sydney/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/san-francisco/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/san-francisco/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/chicago/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/chicago/</a></p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">Building reliable data pipelines starts with maintaining strong data quality standards and creating efficient systems for auditing, publishing and monitoring. In this episode, we explore the real-world patterns and best practices for ensuring data pipelines stay accurate, scalable and trustworthy.</span></p><p><br></p><p><a href="https://www.linkedin.com/in/josephmachado1991/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Joseph Machado</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Data Engineer at </span><a href="https://www.linkedin.com/company/netflix/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Netflix</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, joins us to share practical insights gleaned from supporting Netflix’s Ads business as well as over a decade of experience in the data engineering space. He discusses implementing audit publish patterns, building observability dashboards, defining in-band and separate data quality checks, and optimizing data validation across large-scale systems.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(03:14) Supporting data privacy and engineering efficiency within data systems.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(10:41) Validating outputs with reconciliation checks to catch transformation issues.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(16:06) Applying standardized patterns for auditing, validating and publishing data.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(19:28) Capturing historical check results to monitor system health and improvements.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(21:29) Treating data quality and availability as separate monitoring concerns.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(26:26) Using containerization strategies to streamline pipeline executions.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(29:47) Leveraging orchestration platforms for better visibility and retry capability.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(31:59) Managing business pressure without sacrificing data quality practices.</span></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">(35:46) Starting simple with quality checks and evolving toward more complex frameworks.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/josephmachado1991/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Joseph Machado</a></p><p>https://www.linkedin.com/in/josephmachado1991/</p><p><br></p><p><a href="https://www.linkedin.com/company/netflix/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Netflix</a> | LinkedIn</p><p>https://www.linkedin.com/company/netflix/</p><p><br></p><p><a href="https://www.netflix.com/browse" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Netflix</a> | Website</p><p>https://www.netflix.com/browse</p><p><br></p><p><a href="https://www.startdataengineering.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Start Data Engineering</a></p><p>https://www.startdataengineering.com/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.getdbt.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbt Labs</a></p><p>https://www.getdbt.com/</p><p><br></p><p><a href="https://greatexpectations.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Great Expectations</a></p><p>https://greatexpectations.io/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/london/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/london/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/new-york/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/new-york/</a></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/sydney/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/sydney/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/san-francisco/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/san-francisco/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/chicago/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/chicago/</a></p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Building reliable data pipelines starts with maintaining strong data quality standards and creating efficient systems for auditing, publishing and monitoring. In this episode, we explore the real-world patterns and best practices for ensuring data ...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>40</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[02e6e034-6ffe-489f-8109-dc83387c30b0]]></guid>
  <title><![CDATA[Why Developer Experience Shapes Data Pipeline Standards at Next Insurance with Snir Israeli]]></title>
  <description><![CDATA[<p><span style="background-color: transparent;">Creating consistency across data pipelines is critical for scaling engineering teams and ensuring long-term maintainability.</span></p><p><br></p><p><span style="background-color: transparent;">In this episode, </span><a href="https://www.linkedin.com/in/snir-israeli/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Snir Israeli</a><span style="background-color: transparent;">, Senior Data Engineer at </span><a href="https://www.linkedin.com/company/nextinsurance/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Next Insurance</a><span style="background-color: transparent;">, shares how enforcing coding standards and investing in developer experience transformed their approach to data engineering. He explains how implementing automated code checks, clear documentation practices and a scoring system helped drive alignment across teams, improve collaboration and reduce technical debt in a fast-growing data environment.</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(02:59) Inconsistencies in code style create challenges for collaboration and maintenance.</span></p><p><span style="background-color: transparent;">(04:22) Programmatically enforcing rules helps teams scale their best practices.</span></p><p><span style="background-color: transparent;">(08:55) Performance improvements in data pipelines lead to infrastructure cost savings.</span></p><p><span style="background-color: transparent;">(13:22) Developer experience is essential for driving adoption of internal tools.</span></p><p><span style="background-color: transparent;">(19:44) Dashboards can operationalize standards enforcement and track progress over time.</span></p><p><span style="background-color: transparent;">(22:49) Standardization accelerates onboarding and reduces friction in code reviews.</span></p><p><span style="background-color: transparent;">(25:39) Linting rules require ongoing maintenance as tools and platforms evolve.</span></p><p><span style="background-color: transparent;">(27:47) Starting small and involving the team leads to better adoption and long-term success.</span></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/snir-israeli/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Snir Israeli</a></p><p>https://www.linkedin.com/in/snir-israeli/</p><p><br></p><p><a href="https://www.linkedin.com/company/nextinsurance/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Next Insurance</a> | LinkedIn</p><p>https://www.linkedin.com/company/nextinsurance/</p><p><br></p><p><a href="https://www.nextinsurance.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Next Insurance</a> | Website</p><p>https://www.nextinsurance.com/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/london/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/london/</a></p><p><span style="background-color: transparent;">&nbsp;&nbsp;&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/new-york/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/new-york/</a><span style="background-color: transparent;">&nbsp;&nbsp;&nbsp;</span></p><p><span style="background-color: transparent;">&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/sydney/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/sydney/</a><span style="background-color: transparent;">&nbsp;</span></p><p><span style="background-color: transparent;">&nbsp;&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/san-francisco/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/san-francisco/</a><span style="background-color: transparent;">&nbsp;</span></p><p><span style="background-color: transparent;">&nbsp;&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/chicago/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/chicago/</a><span style="background-color: transparent;">&nbsp;</span></p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/8b78903f-b956-4713-a265-28c59e19c830/299209e2a1.jpg" />
  <pubDate>Thu, 08 May 2025 01:50:06 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="29253880" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/8b78903f-b956-4713-a265-28c59e19c830/episode.mp3" />
  <itunes:title><![CDATA[Why Developer Experience Shapes Data Pipeline Standards at Next Insurance with Snir Israeli]]></itunes:title>
  <itunes:duration>30:28</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent;">Creating consistency across data pipelines is critical for scaling engineering teams and ensuring long-term maintainability.</span></p><p><br></p><p><span style="background-color: transparent;">In this episode, </span><a href="https://www.linkedin.com/in/snir-israeli/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Snir Israeli</a><span style="background-color: transparent;">, Senior Data Engineer at </span><a href="https://www.linkedin.com/company/nextinsurance/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Next Insurance</a><span style="background-color: transparent;">, shares how enforcing coding standards and investing in developer experience transformed their approach to data engineering. He explains how implementing automated code checks, clear documentation practices and a scoring system helped drive alignment across teams, improve collaboration and reduce technical debt in a fast-growing data environment.</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(02:59) Inconsistencies in code style create challenges for collaboration and maintenance.</span></p><p><span style="background-color: transparent;">(04:22) Programmatically enforcing rules helps teams scale their best practices.</span></p><p><span style="background-color: transparent;">(08:55) Performance improvements in data pipelines lead to infrastructure cost savings.</span></p><p><span style="background-color: transparent;">(13:22) Developer experience is essential for driving adoption of internal tools.</span></p><p><span style="background-color: transparent;">(19:44) Dashboards can operationalize standards enforcement and track progress over time.</span></p><p><span style="background-color: transparent;">(22:49) Standardization accelerates onboarding and reduces friction in code reviews.</span></p><p><span style="background-color: transparent;">(25:39) Linting rules require ongoing maintenance as tools and platforms evolve.</span></p><p><span style="background-color: transparent;">(27:47) Starting small and involving the team leads to better adoption and long-term success.</span></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/snir-israeli/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Snir Israeli</a></p><p>https://www.linkedin.com/in/snir-israeli/</p><p><br></p><p><a href="https://www.linkedin.com/company/nextinsurance/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Next Insurance</a> | LinkedIn</p><p>https://www.linkedin.com/company/nextinsurance/</p><p><br></p><p><a href="https://www.nextinsurance.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Next Insurance</a> | Website</p><p>https://www.nextinsurance.com/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/london/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/london/</a></p><p><span style="background-color: transparent;">&nbsp;&nbsp;&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/new-york/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/new-york/</a><span style="background-color: transparent;">&nbsp;&nbsp;&nbsp;</span></p><p><span style="background-color: transparent;">&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/sydney/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/sydney/</a><span style="background-color: transparent;">&nbsp;</span></p><p><span style="background-color: transparent;">&nbsp;&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/san-francisco/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/san-francisco/</a><span style="background-color: transparent;">&nbsp;</span></p><p><span style="background-color: transparent;">&nbsp;&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/chicago/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/chicago/</a><span style="background-color: transparent;">&nbsp;</span></p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent;">Creating consistency across data pipelines is critical for scaling engineering teams and ensuring long-term maintainability.</span></p><p><br></p><p><span style="background-color: transparent;">In this episode, </span><a href="https://www.linkedin.com/in/snir-israeli/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Snir Israeli</a><span style="background-color: transparent;">, Senior Data Engineer at </span><a href="https://www.linkedin.com/company/nextinsurance/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Next Insurance</a><span style="background-color: transparent;">, shares how enforcing coding standards and investing in developer experience transformed their approach to data engineering. He explains how implementing automated code checks, clear documentation practices and a scoring system helped drive alignment across teams, improve collaboration and reduce technical debt in a fast-growing data environment.</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(02:59) Inconsistencies in code style create challenges for collaboration and maintenance.</span></p><p><span style="background-color: transparent;">(04:22) Programmatically enforcing rules helps teams scale their best practices.</span></p><p><span style="background-color: transparent;">(08:55) Performance improvements in data pipelines lead to infrastructure cost savings.</span></p><p><span style="background-color: transparent;">(13:22) Developer experience is essential for driving adoption of internal tools.</span></p><p><span style="background-color: transparent;">(19:44) Dashboards can operationalize standards enforcement and track progress over time.</span></p><p><span style="background-color: transparent;">(22:49) Standardization accelerates onboarding and reduces friction in code reviews.</span></p><p><span style="background-color: transparent;">(25:39) Linting rules require ongoing maintenance as tools and platforms evolve.</span></p><p><span style="background-color: transparent;">(27:47) Starting small and involving the team leads to better adoption and long-term success.</span></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/snir-israeli/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Snir Israeli</a></p><p>https://www.linkedin.com/in/snir-israeli/</p><p><br></p><p><a href="https://www.linkedin.com/company/nextinsurance/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Next Insurance</a> | LinkedIn</p><p>https://www.linkedin.com/company/nextinsurance/</p><p><br></p><p><a href="https://www.nextinsurance.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Next Insurance</a> | Website</p><p>https://www.nextinsurance.com/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/london/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/london/</a></p><p><span style="background-color: transparent;">&nbsp;&nbsp;&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/new-york/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/new-york/</a><span style="background-color: transparent;">&nbsp;&nbsp;&nbsp;</span></p><p><span style="background-color: transparent;">&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/sydney/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/sydney/</a><span style="background-color: transparent;">&nbsp;</span></p><p><span style="background-color: transparent;">&nbsp;&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/san-francisco/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/san-francisco/</a><span style="background-color: transparent;">&nbsp;</span></p><p><span style="background-color: transparent;">&nbsp;&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/chicago/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/chicago/</a><span style="background-color: transparent;">&nbsp;</span></p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Creating consistency across data pipelines is critical for scaling engineering teams and ensuring long-term maintainability.In this episode, Snir Israeli, Senior Data Engineer at Next Insurance, shares how enforcing coding standards and investing i...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>39</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[e00b0c49-324f-4328-aa8f-23a070950cf1]]></guid>
  <title><![CDATA[Data Quality and Observability at Tekmetric with Ipsa Trivedi]]></title>
  <description><![CDATA[<p><span style="background-color: transparent;">Airflow’s adaptability is driving Tekmetric’s ability to unify complex data workflows, deliver accurate insights and support both internal operations and customer-facing services — all within a rapidly growing startup environment.</span></p><p><br></p><p><span style="background-color: transparent;">In this episode, </span><a href="https://www.linkedin.com/in/ipsatrivedi/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ipsa Trivedi</a><span style="background-color: transparent;">, Lead Data Engineer at</span><span style="background-color: transparent; color: rgb(22, 14, 61);"> </span><a href="https://www.linkedin.com/company/tekmetric/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Tekmetric</a><span style="background-color: transparent;">, shares how her team is standardizing pipelines while supporting unique customer needs. She explains how Airflow enables end-to-end data services, simplifies orchestration across varied sources and supports scalable customization. Ipsa also highlights early wins with Airflow, its intuitive UI and the team's roadmap toward data quality, observability and a future self-serve data platform.</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(02:26) Powering auto shops nationwide with a unified platform.</span></p><p><span style="background-color: transparent;">(05:17) A new data team was formed to centralize and scale insights.</span></p><p><span style="background-color: transparent;">(07:23) Flexible, open source and made to fit — Airflow wins.</span></p><p><span style="background-color: transparent;">(10:42) Pipelines handle anything from email to AWS.</span></p><p><span style="background-color: transparent;">(12:15) Custom DAGs fit every team’s unique needs.</span></p><p><span style="background-color: transparent;">(17:01) Data quality checks are built into the plan.</span></p><p><span style="background-color: transparent;">(18:17) Self-serve data mesh is the end goal.</span></p><p><span style="background-color: transparent;">(19:59) Airflow now fits so well, there's nothing left on the wishlist.</span></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/ipsatrivedi/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ipsa Trivedi</a></p><p>https://www.linkedin.com/in/ipsatrivedi/</p><p><br></p><p><a href="https://www.linkedin.com/company/tekmetric/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Tekmetric</a> | LinkedIn</p><p>https://www.linkedin.com/company/tekmetric/</p><p><br></p><p><a href="https://www.tekmetric.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Tekmetric</a> | Website</p><p>https://www.tekmetric.com/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://aws.amazon.com/free/database/?trk=fc551e06-56b0-418c-9ddd-5c9dba18569b&amp;sc_channel=ps&amp;ef_id=CjwKCAjwzMi_BhACEiwAX4YZULS4jV2Xpnpcac_Q3eS9BAg-klKUDyCt6XSdOul8BLHkmWzFFh4NXRoCGhQQAvD_BwE:G:s&amp;s_kwcid=AL!4422!3!548989592596!e!!g!!amazon%20sql%20database!11543056228!112002958549&amp;gclid=CjwKCAjwzMi_BhACEiwAX4YZULS4jV2Xpnpcac_Q3eS9BAg-klKUDyCt6XSdOul8BLHkmWzFFh4NXRoCGhQQAvD_BwE" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">AWS RDS</a></p><p>https://aws.amazon.com/free/database/?trk=fc551e06-56b0-418c-9ddd-5c9dba18569b&amp;sc_channel=ps&amp;ef_id=CjwKCAjwzMi_BhACEiwAX4YZULS4jV2Xpnpcac_Q3eS9BAg-klKUDyCt6XSdOul8BLHkmWzFFh4NXRoCGhQQAvD_BwE:G:s&amp;s_kwcid=AL!4422!3!548989592596!e!!g!!amazon%20sql%20database!11543056228!112002958549&amp;gclid=CjwKCAjwzMi_BhACEiwAX4YZULS4jV2Xpnpcac_Q3eS9BAg-klKUDyCt6XSdOul8BLHkmWzFFh4NXRoCGhQQAvD_BwE</p><p><br></p><p><a href="https://www.astronomer.io/product/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astro by Astronomer</a></p><p>https://www.astronomer.io/product/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/london/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/london/</a></p><p><span style="background-color: transparent;">&nbsp;&nbsp;&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/new-york/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/new-york/</a><span style="background-color: transparent;">&nbsp;</span></p><p><span style="background-color: transparent;">&nbsp;&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/sydney/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/sydney/</a><span style="background-color: transparent;">&nbsp;&nbsp;&nbsp;</span></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/san-francisco/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/san-francisco/</a><span style="background-color: transparent;">&nbsp;</span></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿</span>&nbsp;&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/chicago/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/chicago/</a><span style="background-color: transparent;">&nbsp;</span></p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent;">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/b5783b64-f50b-4bbc-810f-68333febf247/739a141de8.jpg" />
  <pubDate>Thu, 01 May 2025 00:00:00 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="21910337" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/b5783b64-f50b-4bbc-810f-68333febf247/episode.mp3" />
  <itunes:title><![CDATA[Data Quality and Observability at Tekmetric with Ipsa Trivedi]]></itunes:title>
  <itunes:duration>22:49</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent;">Airflow’s adaptability is driving Tekmetric’s ability to unify complex data workflows, deliver accurate insights and support both internal operations and customer-facing services — all within a rapidly growing startup environment.</span></p><p><br></p><p><span style="background-color: transparent;">In this episode, </span><a href="https://www.linkedin.com/in/ipsatrivedi/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ipsa Trivedi</a><span style="background-color: transparent;">, Lead Data Engineer at</span><span style="background-color: transparent; color: rgb(22, 14, 61);"> </span><a href="https://www.linkedin.com/company/tekmetric/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Tekmetric</a><span style="background-color: transparent;">, shares how her team is standardizing pipelines while supporting unique customer needs. She explains how Airflow enables end-to-end data services, simplifies orchestration across varied sources and supports scalable customization. Ipsa also highlights early wins with Airflow, its intuitive UI and the team's roadmap toward data quality, observability and a future self-serve data platform.</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(02:26) Powering auto shops nationwide with a unified platform.</span></p><p><span style="background-color: transparent;">(05:17) A new data team was formed to centralize and scale insights.</span></p><p><span style="background-color: transparent;">(07:23) Flexible, open source and made to fit — Airflow wins.</span></p><p><span style="background-color: transparent;">(10:42) Pipelines handle anything from email to AWS.</span></p><p><span style="background-color: transparent;">(12:15) Custom DAGs fit every team’s unique needs.</span></p><p><span style="background-color: transparent;">(17:01) Data quality checks are built into the plan.</span></p><p><span style="background-color: transparent;">(18:17) Self-serve data mesh is the end goal.</span></p><p><span style="background-color: transparent;">(19:59) Airflow now fits so well, there's nothing left on the wishlist.</span></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/ipsatrivedi/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ipsa Trivedi</a></p><p>https://www.linkedin.com/in/ipsatrivedi/</p><p><br></p><p><a href="https://www.linkedin.com/company/tekmetric/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Tekmetric</a> | LinkedIn</p><p>https://www.linkedin.com/company/tekmetric/</p><p><br></p><p><a href="https://www.tekmetric.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Tekmetric</a> | Website</p><p>https://www.tekmetric.com/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://aws.amazon.com/free/database/?trk=fc551e06-56b0-418c-9ddd-5c9dba18569b&amp;sc_channel=ps&amp;ef_id=CjwKCAjwzMi_BhACEiwAX4YZULS4jV2Xpnpcac_Q3eS9BAg-klKUDyCt6XSdOul8BLHkmWzFFh4NXRoCGhQQAvD_BwE:G:s&amp;s_kwcid=AL!4422!3!548989592596!e!!g!!amazon%20sql%20database!11543056228!112002958549&amp;gclid=CjwKCAjwzMi_BhACEiwAX4YZULS4jV2Xpnpcac_Q3eS9BAg-klKUDyCt6XSdOul8BLHkmWzFFh4NXRoCGhQQAvD_BwE" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">AWS RDS</a></p><p>https://aws.amazon.com/free/database/?trk=fc551e06-56b0-418c-9ddd-5c9dba18569b&amp;sc_channel=ps&amp;ef_id=CjwKCAjwzMi_BhACEiwAX4YZULS4jV2Xpnpcac_Q3eS9BAg-klKUDyCt6XSdOul8BLHkmWzFFh4NXRoCGhQQAvD_BwE:G:s&amp;s_kwcid=AL!4422!3!548989592596!e!!g!!amazon%20sql%20database!11543056228!112002958549&amp;gclid=CjwKCAjwzMi_BhACEiwAX4YZULS4jV2Xpnpcac_Q3eS9BAg-klKUDyCt6XSdOul8BLHkmWzFFh4NXRoCGhQQAvD_BwE</p><p><br></p><p><a href="https://www.astronomer.io/product/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astro by Astronomer</a></p><p>https://www.astronomer.io/product/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/london/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/london/</a></p><p><span style="background-color: transparent;">&nbsp;&nbsp;&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/new-york/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/new-york/</a><span style="background-color: transparent;">&nbsp;</span></p><p><span style="background-color: transparent;">&nbsp;&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/sydney/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/sydney/</a><span style="background-color: transparent;">&nbsp;&nbsp;&nbsp;</span></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/san-francisco/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/san-francisco/</a><span style="background-color: transparent;">&nbsp;</span></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿</span>&nbsp;&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/chicago/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/chicago/</a><span style="background-color: transparent;">&nbsp;</span></p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent;">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent;">Airflow’s adaptability is driving Tekmetric’s ability to unify complex data workflows, deliver accurate insights and support both internal operations and customer-facing services — all within a rapidly growing startup environment.</span></p><p><br></p><p><span style="background-color: transparent;">In this episode, </span><a href="https://www.linkedin.com/in/ipsatrivedi/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ipsa Trivedi</a><span style="background-color: transparent;">, Lead Data Engineer at</span><span style="background-color: transparent; color: rgb(22, 14, 61);"> </span><a href="https://www.linkedin.com/company/tekmetric/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Tekmetric</a><span style="background-color: transparent;">, shares how her team is standardizing pipelines while supporting unique customer needs. She explains how Airflow enables end-to-end data services, simplifies orchestration across varied sources and supports scalable customization. Ipsa also highlights early wins with Airflow, its intuitive UI and the team's roadmap toward data quality, observability and a future self-serve data platform.</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(02:26) Powering auto shops nationwide with a unified platform.</span></p><p><span style="background-color: transparent;">(05:17) A new data team was formed to centralize and scale insights.</span></p><p><span style="background-color: transparent;">(07:23) Flexible, open source and made to fit — Airflow wins.</span></p><p><span style="background-color: transparent;">(10:42) Pipelines handle anything from email to AWS.</span></p><p><span style="background-color: transparent;">(12:15) Custom DAGs fit every team’s unique needs.</span></p><p><span style="background-color: transparent;">(17:01) Data quality checks are built into the plan.</span></p><p><span style="background-color: transparent;">(18:17) Self-serve data mesh is the end goal.</span></p><p><span style="background-color: transparent;">(19:59) Airflow now fits so well, there's nothing left on the wishlist.</span></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/ipsatrivedi/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ipsa Trivedi</a></p><p>https://www.linkedin.com/in/ipsatrivedi/</p><p><br></p><p><a href="https://www.linkedin.com/company/tekmetric/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Tekmetric</a> | LinkedIn</p><p>https://www.linkedin.com/company/tekmetric/</p><p><br></p><p><a href="https://www.tekmetric.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Tekmetric</a> | Website</p><p>https://www.tekmetric.com/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://aws.amazon.com/free/database/?trk=fc551e06-56b0-418c-9ddd-5c9dba18569b&amp;sc_channel=ps&amp;ef_id=CjwKCAjwzMi_BhACEiwAX4YZULS4jV2Xpnpcac_Q3eS9BAg-klKUDyCt6XSdOul8BLHkmWzFFh4NXRoCGhQQAvD_BwE:G:s&amp;s_kwcid=AL!4422!3!548989592596!e!!g!!amazon%20sql%20database!11543056228!112002958549&amp;gclid=CjwKCAjwzMi_BhACEiwAX4YZULS4jV2Xpnpcac_Q3eS9BAg-klKUDyCt6XSdOul8BLHkmWzFFh4NXRoCGhQQAvD_BwE" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">AWS RDS</a></p><p>https://aws.amazon.com/free/database/?trk=fc551e06-56b0-418c-9ddd-5c9dba18569b&amp;sc_channel=ps&amp;ef_id=CjwKCAjwzMi_BhACEiwAX4YZULS4jV2Xpnpcac_Q3eS9BAg-klKUDyCt6XSdOul8BLHkmWzFFh4NXRoCGhQQAvD_BwE:G:s&amp;s_kwcid=AL!4422!3!548989592596!e!!g!!amazon%20sql%20database!11543056228!112002958549&amp;gclid=CjwKCAjwzMi_BhACEiwAX4YZULS4jV2Xpnpcac_Q3eS9BAg-klKUDyCt6XSdOul8BLHkmWzFFh4NXRoCGhQQAvD_BwE</p><p><br></p><p><a href="https://www.astronomer.io/product/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astro by Astronomer</a></p><p>https://www.astronomer.io/product/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/london/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/london/</a></p><p><span style="background-color: transparent;">&nbsp;&nbsp;&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/new-york/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/new-york/</a><span style="background-color: transparent;">&nbsp;</span></p><p><span style="background-color: transparent;">&nbsp;&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/sydney/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/sydney/</a><span style="background-color: transparent;">&nbsp;&nbsp;&nbsp;</span></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/san-francisco/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/san-francisco/</a><span style="background-color: transparent;">&nbsp;</span></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿</span>&nbsp;&nbsp;</span></p><p><a href="https://www.astronomer.io/events/roadshow/chicago/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/chicago/</a><span style="background-color: transparent;">&nbsp;</span></p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent;">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Airflow’s adaptability is driving Tekmetric’s ability to unify complex data workflows, deliver accurate insights and support both internal operations and customer-facing services — all within a rapidly growing startup environment.In this episode, I...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>38</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[ac88168d-9a56-4590-9f58-2016ae6dcaf2]]></guid>
  <title><![CDATA[Introducing Apache Airflow® 3 with Vikram Koka and Jed Cunningham]]></title>
  <description><![CDATA[<p><span style="background-color: transparent;">The Airflow 3.0 release marks a significant leap forward in modern data orchestration, introducing architectural upgrades that improve scalability, flexibility and long-term maintainability.</span></p><p><br></p><p><span style="background-color: transparent;">In this episode, we welcome</span><span style="background-color: transparent; color: rgb(22, 14, 61);"> </span><a href="https://www.linkedin.com/in/vikramkoka/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Vikram Koka</a><span style="background-color: transparent;">, Chief Strategy Officer at</span><span style="background-color: transparent; color: rgb(22, 14, 61);"> </span><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a><span style="background-color: transparent;">, and </span><a href="https://www.linkedin.com/in/jedidiah-cunningham/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jed Cunningham</a><span style="background-color: transparent;">, Principal Software Engineer at</span><span style="background-color: transparent; color: rgb(22, 14, 61);"> </span><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a><span style="background-color: transparent;">, to discuss the architectural </span>foundations, new features and future implications of this milestone release. They unpack the rationale behind DAG versioning and task execution interface, explain how Airflow now integrates more seamlessly within broader data ecosystems and share how these changes lay the groundwork for multi-cloud deployments, language-agnostic workflows and stronger enterprise security.</p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(02:28) Modern orchestration demands new infrastructure approaches.</span></p><p><span style="background-color: transparent;">(05:02) Removing legacy components strengthens system stability.</span></p><p><span style="background-color: transparent;">(06:26) Major releases provide the opportunity to reduce technical debt.</span></p><p><span style="background-color: transparent;">(08:31) Frontend and API modernization enable long-term adaptability.</span></p><p><span style="background-color: transparent;">(09:36) Event-based triggers expand integration possibilities.</span></p><p><span style="background-color: transparent;">(11:54) Version control improves visibility and execution reliability.</span></p><p><span style="background-color: transparent;">(14:57) Centralized access to workflow definitions increases flexibility.</span></p><p><span style="background-color: transparent;">(21:49) Decoupled architecture supports distributed and secure deployments.</span></p><p><span style="background-color: transparent;">(26:17) Community collaboration is essential for sustainable growth.</span></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.astronomer.io" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> Website</p><p>https://www.astronomer.io</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://git-scm.com/book/en/v2/Git-Tools-Bundling" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Git Bundle</a></p><p>https://git-scm.com/book/en/v2/Git-Tools-Bundling</p><p><br></p><p><a href="https://fastapi.tiangolo.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">FastAPI</a></p><p>https://fastapi.tiangolo.com/</p><p><br></p><p><a href="https://react.dev/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">React</a></p><p>https://react.dev/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/london/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/london/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/new-york/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/new-york/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/sydney/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/sydney/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/san-francisco/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/san-francisco/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/chicago/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/chicago/</a></p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/98d3176f-d3a9-4cdf-a4c9-f7734982d239/e0928303c3.jpg" />
  <pubDate>Thu, 24 Apr 2025 00:00:00 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="26373723" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/98d3176f-d3a9-4cdf-a4c9-f7734982d239/episode.mp3" />
  <itunes:title><![CDATA[Introducing Apache Airflow® 3 with Vikram Koka and Jed Cunningham]]></itunes:title>
  <itunes:duration>27:28</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent;">The Airflow 3.0 release marks a significant leap forward in modern data orchestration, introducing architectural upgrades that improve scalability, flexibility and long-term maintainability.</span></p><p><br></p><p><span style="background-color: transparent;">In this episode, we welcome</span><span style="background-color: transparent; color: rgb(22, 14, 61);"> </span><a href="https://www.linkedin.com/in/vikramkoka/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Vikram Koka</a><span style="background-color: transparent;">, Chief Strategy Officer at</span><span style="background-color: transparent; color: rgb(22, 14, 61);"> </span><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a><span style="background-color: transparent;">, and </span><a href="https://www.linkedin.com/in/jedidiah-cunningham/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jed Cunningham</a><span style="background-color: transparent;">, Principal Software Engineer at</span><span style="background-color: transparent; color: rgb(22, 14, 61);"> </span><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a><span style="background-color: transparent;">, to discuss the architectural </span>foundations, new features and future implications of this milestone release. They unpack the rationale behind DAG versioning and task execution interface, explain how Airflow now integrates more seamlessly within broader data ecosystems and share how these changes lay the groundwork for multi-cloud deployments, language-agnostic workflows and stronger enterprise security.</p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(02:28) Modern orchestration demands new infrastructure approaches.</span></p><p><span style="background-color: transparent;">(05:02) Removing legacy components strengthens system stability.</span></p><p><span style="background-color: transparent;">(06:26) Major releases provide the opportunity to reduce technical debt.</span></p><p><span style="background-color: transparent;">(08:31) Frontend and API modernization enable long-term adaptability.</span></p><p><span style="background-color: transparent;">(09:36) Event-based triggers expand integration possibilities.</span></p><p><span style="background-color: transparent;">(11:54) Version control improves visibility and execution reliability.</span></p><p><span style="background-color: transparent;">(14:57) Centralized access to workflow definitions increases flexibility.</span></p><p><span style="background-color: transparent;">(21:49) Decoupled architecture supports distributed and secure deployments.</span></p><p><span style="background-color: transparent;">(26:17) Community collaboration is essential for sustainable growth.</span></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.astronomer.io" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> Website</p><p>https://www.astronomer.io</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://git-scm.com/book/en/v2/Git-Tools-Bundling" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Git Bundle</a></p><p>https://git-scm.com/book/en/v2/Git-Tools-Bundling</p><p><br></p><p><a href="https://fastapi.tiangolo.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">FastAPI</a></p><p>https://fastapi.tiangolo.com/</p><p><br></p><p><a href="https://react.dev/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">React</a></p><p>https://react.dev/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/london/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/london/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/new-york/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/new-york/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/sydney/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/sydney/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/san-francisco/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/san-francisco/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/chicago/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/chicago/</a></p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent;">The Airflow 3.0 release marks a significant leap forward in modern data orchestration, introducing architectural upgrades that improve scalability, flexibility and long-term maintainability.</span></p><p><br></p><p><span style="background-color: transparent;">In this episode, we welcome</span><span style="background-color: transparent; color: rgb(22, 14, 61);"> </span><a href="https://www.linkedin.com/in/vikramkoka/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Vikram Koka</a><span style="background-color: transparent;">, Chief Strategy Officer at</span><span style="background-color: transparent; color: rgb(22, 14, 61);"> </span><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a><span style="background-color: transparent;">, and </span><a href="https://www.linkedin.com/in/jedidiah-cunningham/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jed Cunningham</a><span style="background-color: transparent;">, Principal Software Engineer at</span><span style="background-color: transparent; color: rgb(22, 14, 61);"> </span><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a><span style="background-color: transparent;">, to discuss the architectural </span>foundations, new features and future implications of this milestone release. They unpack the rationale behind DAG versioning and task execution interface, explain how Airflow now integrates more seamlessly within broader data ecosystems and share how these changes lay the groundwork for multi-cloud deployments, language-agnostic workflows and stronger enterprise security.</p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(02:28) Modern orchestration demands new infrastructure approaches.</span></p><p><span style="background-color: transparent;">(05:02) Removing legacy components strengthens system stability.</span></p><p><span style="background-color: transparent;">(06:26) Major releases provide the opportunity to reduce technical debt.</span></p><p><span style="background-color: transparent;">(08:31) Frontend and API modernization enable long-term adaptability.</span></p><p><span style="background-color: transparent;">(09:36) Event-based triggers expand integration possibilities.</span></p><p><span style="background-color: transparent;">(11:54) Version control improves visibility and execution reliability.</span></p><p><span style="background-color: transparent;">(14:57) Centralized access to workflow definitions increases flexibility.</span></p><p><span style="background-color: transparent;">(21:49) Decoupled architecture supports distributed and secure deployments.</span></p><p><span style="background-color: transparent;">(26:17) Community collaboration is essential for sustainable growth.</span></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.astronomer.io" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> Website</p><p>https://www.astronomer.io</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://git-scm.com/book/en/v2/Git-Tools-Bundling" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Git Bundle</a></p><p>https://git-scm.com/book/en/v2/Git-Tools-Bundling</p><p><br></p><p><a href="https://fastapi.tiangolo.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">FastAPI</a></p><p>https://fastapi.tiangolo.com/</p><p><br></p><p><a href="https://react.dev/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">React</a></p><p>https://react.dev/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/london/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/london/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/new-york/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/new-york/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/sydney/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/sydney/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/san-francisco/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/san-francisco/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/chicago/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/chicago/</a></p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “</span><span style="background-color: transparent; color: rgb(29, 28, 29);">The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI</span><span style="background-color: transparent;">.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[The Airflow 3.0 release marks a significant leap forward in modern data orchestration, introducing architectural upgrades that improve scalability, flexibility and long-term maintainability.In this episode, we welcome Vikram Koka, Chief Strategy Of...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>37</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[73a928f0-df79-4f7f-96a4-7b8e06e73033]]></guid>
  <title><![CDATA[Airflow in Action: Powering Instacart's Complex Ecosystem]]></title>
  <description><![CDATA[<p><span style="background-color: transparent;">The evolution of data orchestration at Instacart highlights the journey from fragmented systems to robust, standardized infrastructure. This transformation has enabled scalability, reliability and democratization of tools for diverse user personas.</span></p><p><br></p><p><span style="background-color: transparent;">In this episode, we’re joined by</span><span style="background-color: transparent; color: rgb(22, 14, 61);"> </span><a href="https://www.linkedin.com/in/anantag/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Anant Agarwal</a><span style="background-color: transparent;">, Software Engineer at</span><span style="background-color: transparent; color: rgb(22, 14, 61);"> </span><a href="https://www.linkedin.com/company/instacart/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Instacart</a><span style="background-color: transparent;">, who shares insights into Instacart's Airflow journey, from its early adoption in 2019 to the present-day centralized cluster approach. Anant discusses the challenges of managing disparate clusters, the implementation of remote executors, and the strategic standardization of infrastructure and DAG patterns to streamline workflows.</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(03:49) The impact of external events on business growth and technological evolution.</span></p><p><span style="background-color: transparent;">(04:31) Challenges of managing decentralized systems across multiple teams.</span></p><p><span style="background-color: transparent;">(06:14) The importance of standardizing infrastructure and processes for scalability.</span></p><p><span style="background-color: transparent;">(09:51) Strategies for implementing efficient and repeatable deployment practices.</span></p><p><span style="background-color: transparent;">(12:17) Addressing diverse user personas with tailored solutions.</span></p><p><span style="background-color: transparent;">(14:47) Leveraging remote execution to enhance flexibility and scalability.</span></p><p><span style="background-color: transparent;">(18:36) Benefits of transitioning to a centralized system for organization-wide use.</span></p><p><span style="background-color: transparent;">(20:57) Maintaining an upgrade cadence to stay aligned with the latest advancements.</span></p><p><span style="background-color: transparent;">(23:35) Anticipation for new features and improvements in upcoming software versions.</span></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/anantag/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Anant Agarwal</a></p><p>https://www.linkedin.com/in/anantag/</p><p><br></p><p><a href="https://www.linkedin.com/company/instacart/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Instacart</a> | LinkedIn</p><p>https://www.linkedin.com/company/instacart/</p><p><br></p><p><a href="https://www.instacart.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Instacart</a> | Website</p><p>https://www.instacart.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://aws.amazon.com/ecs/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">AWS Amazon</a></p><p>https://aws.amazon.com/ecs/</p><p><br></p><p><a href="https://www.terraform.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Terraform</a></p><p>https://www.terraform.io/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/london/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/london/</a></p><p><a href="https://www.astronomer.io/events/roadshow/new-york/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/new-york/</a><span style="background-color: transparent;"> </span><a href="https://www.astronomer.io/events/roadshow/sydney/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/sydney/</a></p><p><a href="https://www.astronomer.io/events/roadshow/san-francisco/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/san-francisco/</a></p><p><a href="https://www.astronomer.io/events/roadshow/chicago/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/chicago/</a></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><span style="background-color: transparent;">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/98e53389-2a7f-43eb-aa48-f34595cd76e3/9c61284b60.jpg" />
  <pubDate>Thu, 17 Apr 2025 00:00:00 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="24229592" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/98e53389-2a7f-43eb-aa48-f34595cd76e3/episode.mp3" />
  <itunes:title><![CDATA[Airflow in Action: Powering Instacart's Complex Ecosystem]]></itunes:title>
  <itunes:duration>25:14</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent;">The evolution of data orchestration at Instacart highlights the journey from fragmented systems to robust, standardized infrastructure. This transformation has enabled scalability, reliability and democratization of tools for diverse user personas.</span></p><p><br></p><p><span style="background-color: transparent;">In this episode, we’re joined by</span><span style="background-color: transparent; color: rgb(22, 14, 61);"> </span><a href="https://www.linkedin.com/in/anantag/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Anant Agarwal</a><span style="background-color: transparent;">, Software Engineer at</span><span style="background-color: transparent; color: rgb(22, 14, 61);"> </span><a href="https://www.linkedin.com/company/instacart/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Instacart</a><span style="background-color: transparent;">, who shares insights into Instacart's Airflow journey, from its early adoption in 2019 to the present-day centralized cluster approach. Anant discusses the challenges of managing disparate clusters, the implementation of remote executors, and the strategic standardization of infrastructure and DAG patterns to streamline workflows.</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(03:49) The impact of external events on business growth and technological evolution.</span></p><p><span style="background-color: transparent;">(04:31) Challenges of managing decentralized systems across multiple teams.</span></p><p><span style="background-color: transparent;">(06:14) The importance of standardizing infrastructure and processes for scalability.</span></p><p><span style="background-color: transparent;">(09:51) Strategies for implementing efficient and repeatable deployment practices.</span></p><p><span style="background-color: transparent;">(12:17) Addressing diverse user personas with tailored solutions.</span></p><p><span style="background-color: transparent;">(14:47) Leveraging remote execution to enhance flexibility and scalability.</span></p><p><span style="background-color: transparent;">(18:36) Benefits of transitioning to a centralized system for organization-wide use.</span></p><p><span style="background-color: transparent;">(20:57) Maintaining an upgrade cadence to stay aligned with the latest advancements.</span></p><p><span style="background-color: transparent;">(23:35) Anticipation for new features and improvements in upcoming software versions.</span></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/anantag/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Anant Agarwal</a></p><p>https://www.linkedin.com/in/anantag/</p><p><br></p><p><a href="https://www.linkedin.com/company/instacart/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Instacart</a> | LinkedIn</p><p>https://www.linkedin.com/company/instacart/</p><p><br></p><p><a href="https://www.instacart.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Instacart</a> | Website</p><p>https://www.instacart.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://aws.amazon.com/ecs/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">AWS Amazon</a></p><p>https://aws.amazon.com/ecs/</p><p><br></p><p><a href="https://www.terraform.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Terraform</a></p><p>https://www.terraform.io/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/london/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/london/</a></p><p><a href="https://www.astronomer.io/events/roadshow/new-york/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/new-york/</a><span style="background-color: transparent;"> </span><a href="https://www.astronomer.io/events/roadshow/sydney/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/sydney/</a></p><p><a href="https://www.astronomer.io/events/roadshow/san-francisco/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/san-francisco/</a></p><p><a href="https://www.astronomer.io/events/roadshow/chicago/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/chicago/</a></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><span style="background-color: transparent;">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent;">The evolution of data orchestration at Instacart highlights the journey from fragmented systems to robust, standardized infrastructure. This transformation has enabled scalability, reliability and democratization of tools for diverse user personas.</span></p><p><br></p><p><span style="background-color: transparent;">In this episode, we’re joined by</span><span style="background-color: transparent; color: rgb(22, 14, 61);"> </span><a href="https://www.linkedin.com/in/anantag/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Anant Agarwal</a><span style="background-color: transparent;">, Software Engineer at</span><span style="background-color: transparent; color: rgb(22, 14, 61);"> </span><a href="https://www.linkedin.com/company/instacart/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Instacart</a><span style="background-color: transparent;">, who shares insights into Instacart's Airflow journey, from its early adoption in 2019 to the present-day centralized cluster approach. Anant discusses the challenges of managing disparate clusters, the implementation of remote executors, and the strategic standardization of infrastructure and DAG patterns to streamline workflows.</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(03:49) The impact of external events on business growth and technological evolution.</span></p><p><span style="background-color: transparent;">(04:31) Challenges of managing decentralized systems across multiple teams.</span></p><p><span style="background-color: transparent;">(06:14) The importance of standardizing infrastructure and processes for scalability.</span></p><p><span style="background-color: transparent;">(09:51) Strategies for implementing efficient and repeatable deployment practices.</span></p><p><span style="background-color: transparent;">(12:17) Addressing diverse user personas with tailored solutions.</span></p><p><span style="background-color: transparent;">(14:47) Leveraging remote execution to enhance flexibility and scalability.</span></p><p><span style="background-color: transparent;">(18:36) Benefits of transitioning to a centralized system for organization-wide use.</span></p><p><span style="background-color: transparent;">(20:57) Maintaining an upgrade cadence to stay aligned with the latest advancements.</span></p><p><span style="background-color: transparent;">(23:35) Anticipation for new features and improvements in upcoming software versions.</span></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/anantag/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Anant Agarwal</a></p><p>https://www.linkedin.com/in/anantag/</p><p><br></p><p><a href="https://www.linkedin.com/company/instacart/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Instacart</a> | LinkedIn</p><p>https://www.linkedin.com/company/instacart/</p><p><br></p><p><a href="https://www.instacart.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Instacart</a> | Website</p><p>https://www.instacart.com</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://aws.amazon.com/ecs/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">AWS Amazon</a></p><p>https://aws.amazon.com/ecs/</p><p><br></p><p><a href="https://www.terraform.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Terraform</a></p><p>https://www.terraform.io/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/london/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/london/</a></p><p><a href="https://www.astronomer.io/events/roadshow/new-york/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/new-york/</a><span style="background-color: transparent;"> </span><a href="https://www.astronomer.io/events/roadshow/sydney/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/sydney/</a></p><p><a href="https://www.astronomer.io/events/roadshow/san-francisco/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/san-francisco/</a></p><p><a href="https://www.astronomer.io/events/roadshow/chicago/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/chicago/</a></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><span style="background-color: transparent;">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[The evolution of data orchestration at Instacart highlights the journey from fragmented systems to robust, standardized infrastructure. This transformation has enabled scalability, reliability and democratization of tools for diverse user personas....]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>36</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[abcecdbb-2db1-4db2-b800-5b68c33da9df]]></guid>
  <title><![CDATA[From ETL to Airflow: Transforming Data Engineering at Deloitte Digital with Raviteja Tholupunoori]]></title>
  <description><![CDATA[<p><span style="background-color: transparent;">Data orchestration at scale presents unique challenges, especially when aiming for flexibility and efficiency across cloud environments. Choosing the right tools and frameworks can make all the difference.&nbsp;</span></p><p><span style="background-color: transparent;">In this episode, </span><a href="https://www.linkedin.com/in/raviteja0096/?originalSubdomain=in" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Raviteja Tholupunoori,</a><span style="background-color: transparent;"> Senior Engineer at </span><a href="https://www.linkedin.com/company/deloitte-digital/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Deloitte Digital</a><span style="background-color: transparent;">, joins us to explore how Airflow enhances orchestration, scalability and cost efficiency in enterprise data workflows.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(01:45) Early challenges in data orchestration before implementing Airflow.</span></p><p><span style="background-color: transparent;">(02:42) Comparing Airflow with ETL tools like Talend and why flexibility matters.</span></p><p><span style="background-color: transparent;">(04:24) The role of Airflow in enabling cloud-agnostic data processing.</span></p><p><span style="background-color: transparent;">(05:45) Key lessons from managing dynamic DAGs at scale.</span></p><p><span style="background-color: transparent;">(13:15) How hybrid executors improve performance and efficiency.</span></p><p><span style="background-color: transparent;">(14:13) Best practices for testing and monitoring workflows with Airflow.</span></p><p><span style="background-color: transparent;">(15:13) The importance of mocking mechanisms when testing DAGs.</span></p><p><span style="background-color: transparent;">(17:57) How Prometheus, Grafana and Loki support Airflow monitoring.</span></p><p><span style="background-color: transparent;">(22:03) Cost considerations when running Airflow on self-managed infrastructure.</span></p><p><span style="background-color: transparent;">(23:14) Airflow’s latest features, including hybrid executors and dark mode.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/raviteja0096/?originalSubdomain=in" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Raviteja Tholupunoori</a></p><p>https://www.linkedin.com/in/raviteja0096/?originalSubdomain=in</p><p><br></p><p><a href="https://www.linkedin.com/company/deloitte-digital/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Deloitte Digital</a></p><p>https://www.linkedin.com/company/deloitte-digital/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://grafana.com/solutions/apache-airflow/monitor/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Grafana</a></p><p>https://grafana.com/solutions/apache-airflow/monitor/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer Presents: Exploring Apache Airflow® 3 Roadshows</a></p><p>https://www.astronomer.io/events/roadshow/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/london/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/london/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/new-york/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/new-york/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/sydney/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/sydney/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/san-francisco/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/san-francisco/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/chicago/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/chicago/</a></p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/dda921f4-06fd-4231-835d-4368e29f469e/4ee867dd7c.jpg" />
  <pubDate>Thu, 10 Apr 2025 00:00:00 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="26594823" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/dda921f4-06fd-4231-835d-4368e29f469e/episode.mp3" />
  <itunes:title><![CDATA[From ETL to Airflow: Transforming Data Engineering at Deloitte Digital with Raviteja Tholupunoori]]></itunes:title>
  <itunes:duration>27:42</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent;">Data orchestration at scale presents unique challenges, especially when aiming for flexibility and efficiency across cloud environments. Choosing the right tools and frameworks can make all the difference.&nbsp;</span></p><p><span style="background-color: transparent;">In this episode, </span><a href="https://www.linkedin.com/in/raviteja0096/?originalSubdomain=in" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Raviteja Tholupunoori,</a><span style="background-color: transparent;"> Senior Engineer at </span><a href="https://www.linkedin.com/company/deloitte-digital/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Deloitte Digital</a><span style="background-color: transparent;">, joins us to explore how Airflow enhances orchestration, scalability and cost efficiency in enterprise data workflows.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(01:45) Early challenges in data orchestration before implementing Airflow.</span></p><p><span style="background-color: transparent;">(02:42) Comparing Airflow with ETL tools like Talend and why flexibility matters.</span></p><p><span style="background-color: transparent;">(04:24) The role of Airflow in enabling cloud-agnostic data processing.</span></p><p><span style="background-color: transparent;">(05:45) Key lessons from managing dynamic DAGs at scale.</span></p><p><span style="background-color: transparent;">(13:15) How hybrid executors improve performance and efficiency.</span></p><p><span style="background-color: transparent;">(14:13) Best practices for testing and monitoring workflows with Airflow.</span></p><p><span style="background-color: transparent;">(15:13) The importance of mocking mechanisms when testing DAGs.</span></p><p><span style="background-color: transparent;">(17:57) How Prometheus, Grafana and Loki support Airflow monitoring.</span></p><p><span style="background-color: transparent;">(22:03) Cost considerations when running Airflow on self-managed infrastructure.</span></p><p><span style="background-color: transparent;">(23:14) Airflow’s latest features, including hybrid executors and dark mode.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/raviteja0096/?originalSubdomain=in" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Raviteja Tholupunoori</a></p><p>https://www.linkedin.com/in/raviteja0096/?originalSubdomain=in</p><p><br></p><p><a href="https://www.linkedin.com/company/deloitte-digital/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Deloitte Digital</a></p><p>https://www.linkedin.com/company/deloitte-digital/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://grafana.com/solutions/apache-airflow/monitor/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Grafana</a></p><p>https://grafana.com/solutions/apache-airflow/monitor/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer Presents: Exploring Apache Airflow® 3 Roadshows</a></p><p>https://www.astronomer.io/events/roadshow/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/london/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/london/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/new-york/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/new-york/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/sydney/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/sydney/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/san-francisco/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/san-francisco/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/chicago/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/chicago/</a></p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent;">Data orchestration at scale presents unique challenges, especially when aiming for flexibility and efficiency across cloud environments. Choosing the right tools and frameworks can make all the difference.&nbsp;</span></p><p><span style="background-color: transparent;">In this episode, </span><a href="https://www.linkedin.com/in/raviteja0096/?originalSubdomain=in" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Raviteja Tholupunoori,</a><span style="background-color: transparent;"> Senior Engineer at </span><a href="https://www.linkedin.com/company/deloitte-digital/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Deloitte Digital</a><span style="background-color: transparent;">, joins us to explore how Airflow enhances orchestration, scalability and cost efficiency in enterprise data workflows.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(01:45) Early challenges in data orchestration before implementing Airflow.</span></p><p><span style="background-color: transparent;">(02:42) Comparing Airflow with ETL tools like Talend and why flexibility matters.</span></p><p><span style="background-color: transparent;">(04:24) The role of Airflow in enabling cloud-agnostic data processing.</span></p><p><span style="background-color: transparent;">(05:45) Key lessons from managing dynamic DAGs at scale.</span></p><p><span style="background-color: transparent;">(13:15) How hybrid executors improve performance and efficiency.</span></p><p><span style="background-color: transparent;">(14:13) Best practices for testing and monitoring workflows with Airflow.</span></p><p><span style="background-color: transparent;">(15:13) The importance of mocking mechanisms when testing DAGs.</span></p><p><span style="background-color: transparent;">(17:57) How Prometheus, Grafana and Loki support Airflow monitoring.</span></p><p><span style="background-color: transparent;">(22:03) Cost considerations when running Airflow on self-managed infrastructure.</span></p><p><span style="background-color: transparent;">(23:14) Airflow’s latest features, including hybrid executors and dark mode.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/raviteja0096/?originalSubdomain=in" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Raviteja Tholupunoori</a></p><p>https://www.linkedin.com/in/raviteja0096/?originalSubdomain=in</p><p><br></p><p><a href="https://www.linkedin.com/company/deloitte-digital/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Deloitte Digital</a></p><p>https://www.linkedin.com/company/deloitte-digital/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://grafana.com/solutions/apache-airflow/monitor/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Grafana</a></p><p>https://grafana.com/solutions/apache-airflow/monitor/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer Presents: Exploring Apache Airflow® 3 Roadshows</a></p><p>https://www.astronomer.io/events/roadshow/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/london/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/london/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/new-york/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/new-york/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/sydney/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/sydney/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/san-francisco/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/san-francisco/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/chicago/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/chicago/</a></p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Data orchestration at scale presents unique challenges, especially when aiming for flexibility and efficiency across cloud environments. Choosing the right tools and frameworks can make all the difference. In this episode, Raviteja Tholupunoori, Se...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>35</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[1798eebf-31fd-48e2-9436-02099a76742a]]></guid>
  <title><![CDATA[A Deep Dive Into the 2025 State of Airflow Survey Results with Tamara Fingerlin of Astronomer]]></title>
  <description><![CDATA[<p><span style="background-color: transparent;">The 2025 State of Airflow report sheds light on how global users are adopting, evolving and innovating with Apache Airflow. With over 5,000 responses from 116 countries, the survey reveals critical insights into Airflows’ role in business operations, new use cases and what’s ahead for the community.</span></p><p><br></p><p><span style="background-color: transparent;">In this episode, </span><a href="https://www.linkedin.com/in/tamara-janina-fingerlin/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Tamara Fingerlin</a><span style="background-color: transparent;">, Developer Advocate at</span><span style="background-color: transparent; color: rgb(22, 14, 61);"> </span><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a><span style="background-color: transparent;">, walks us through her process of analyzing survey data, key trends from the report and what to expect from Airflow 3.0.</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(02:14) The State of Airflow report combines anonymized telemetry and survey results.</span></p><p><span style="background-color: transparent;">(03:25) The survey received thousands of responses from many countries, showcasing global reach.</span></p><p><span style="background-color: transparent;">(04:49) The survey process involves multiple steps, from question selection to report creation.</span></p><p><span style="background-color: transparent;">(09:00) Many users expect to increase Airflow usage for revenue-generating or external use cases.</span></p><p><span style="background-color: transparent;">(11:04) Experienced users tend to utilize Airflow more for advanced use cases like MLOps.</span></p><p><span style="background-color: transparent;">(15:13) UI improvements offer enhanced navigation and error visibility.</span></p><p><span style="background-color: transparent;">(18:15) Architectural changes enable new capabilities like remote execution and language support.</span></p><p><span style="background-color: transparent;">(19:40) Long-requested features will be available in the new major release.</span></p><p><span style="background-color: transparent;">(21:00) Future aspirations include integrating data visualization capabilities into the UI.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/tamara-janina-fingerlin/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Tamara Fingerlin</a></p><p>https://www.linkedin.com/in/tamara-janina-fingerlin/</p><p><br></p><p><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> | LinkedIn </p><p>https://www.linkedin.com/company/astronomer/</p><p><br></p><p><a href="https://www.astronomer.io" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> | Website</p><p>https://www.astronomer.io</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.astronomer.io/airflow/state-of-airflow/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">2025 State of Airflow Webinar</a></p><p>https://www.astronomer.io/airflow/state-of-airflow/</p><p><br></p><p><a href="https://apache-airflow-slack.herokuapp.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Slack</a></p><p>https://apache-airflow-slack.herokuapp.com/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer Presents: Exploring Apache Airflow® 3 Roadshows</a></p><p>https://www.astronomer.io/events/roadshow/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/london/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/london/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/new-york/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/new-york/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/sydney/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/sydney/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/san-francisco/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/san-francisco/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/chicago/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/chicago/</a></p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/2069a2a9-dae5-4c2e-a079-76ee42591c69/32da1e0b26.jpg" />
  <pubDate>Thu, 03 Apr 2025 01:24:00 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="22499659" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/2069a2a9-dae5-4c2e-a079-76ee42591c69/episode.mp3" />
  <itunes:title><![CDATA[A Deep Dive Into the 2025 State of Airflow Survey Results with Tamara Fingerlin of Astronomer]]></itunes:title>
  <itunes:duration>23:26</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent;">The 2025 State of Airflow report sheds light on how global users are adopting, evolving and innovating with Apache Airflow. With over 5,000 responses from 116 countries, the survey reveals critical insights into Airflows’ role in business operations, new use cases and what’s ahead for the community.</span></p><p><br></p><p><span style="background-color: transparent;">In this episode, </span><a href="https://www.linkedin.com/in/tamara-janina-fingerlin/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Tamara Fingerlin</a><span style="background-color: transparent;">, Developer Advocate at</span><span style="background-color: transparent; color: rgb(22, 14, 61);"> </span><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a><span style="background-color: transparent;">, walks us through her process of analyzing survey data, key trends from the report and what to expect from Airflow 3.0.</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(02:14) The State of Airflow report combines anonymized telemetry and survey results.</span></p><p><span style="background-color: transparent;">(03:25) The survey received thousands of responses from many countries, showcasing global reach.</span></p><p><span style="background-color: transparent;">(04:49) The survey process involves multiple steps, from question selection to report creation.</span></p><p><span style="background-color: transparent;">(09:00) Many users expect to increase Airflow usage for revenue-generating or external use cases.</span></p><p><span style="background-color: transparent;">(11:04) Experienced users tend to utilize Airflow more for advanced use cases like MLOps.</span></p><p><span style="background-color: transparent;">(15:13) UI improvements offer enhanced navigation and error visibility.</span></p><p><span style="background-color: transparent;">(18:15) Architectural changes enable new capabilities like remote execution and language support.</span></p><p><span style="background-color: transparent;">(19:40) Long-requested features will be available in the new major release.</span></p><p><span style="background-color: transparent;">(21:00) Future aspirations include integrating data visualization capabilities into the UI.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/tamara-janina-fingerlin/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Tamara Fingerlin</a></p><p>https://www.linkedin.com/in/tamara-janina-fingerlin/</p><p><br></p><p><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> | LinkedIn </p><p>https://www.linkedin.com/company/astronomer/</p><p><br></p><p><a href="https://www.astronomer.io" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> | Website</p><p>https://www.astronomer.io</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.astronomer.io/airflow/state-of-airflow/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">2025 State of Airflow Webinar</a></p><p>https://www.astronomer.io/airflow/state-of-airflow/</p><p><br></p><p><a href="https://apache-airflow-slack.herokuapp.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Slack</a></p><p>https://apache-airflow-slack.herokuapp.com/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer Presents: Exploring Apache Airflow® 3 Roadshows</a></p><p>https://www.astronomer.io/events/roadshow/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/london/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/london/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/new-york/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/new-york/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/sydney/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/sydney/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/san-francisco/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/san-francisco/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/chicago/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/chicago/</a></p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent;">The 2025 State of Airflow report sheds light on how global users are adopting, evolving and innovating with Apache Airflow. With over 5,000 responses from 116 countries, the survey reveals critical insights into Airflows’ role in business operations, new use cases and what’s ahead for the community.</span></p><p><br></p><p><span style="background-color: transparent;">In this episode, </span><a href="https://www.linkedin.com/in/tamara-janina-fingerlin/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Tamara Fingerlin</a><span style="background-color: transparent;">, Developer Advocate at</span><span style="background-color: transparent; color: rgb(22, 14, 61);"> </span><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a><span style="background-color: transparent;">, walks us through her process of analyzing survey data, key trends from the report and what to expect from Airflow 3.0.</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(02:14) The State of Airflow report combines anonymized telemetry and survey results.</span></p><p><span style="background-color: transparent;">(03:25) The survey received thousands of responses from many countries, showcasing global reach.</span></p><p><span style="background-color: transparent;">(04:49) The survey process involves multiple steps, from question selection to report creation.</span></p><p><span style="background-color: transparent;">(09:00) Many users expect to increase Airflow usage for revenue-generating or external use cases.</span></p><p><span style="background-color: transparent;">(11:04) Experienced users tend to utilize Airflow more for advanced use cases like MLOps.</span></p><p><span style="background-color: transparent;">(15:13) UI improvements offer enhanced navigation and error visibility.</span></p><p><span style="background-color: transparent;">(18:15) Architectural changes enable new capabilities like remote execution and language support.</span></p><p><span style="background-color: transparent;">(19:40) Long-requested features will be available in the new major release.</span></p><p><span style="background-color: transparent;">(21:00) Future aspirations include integrating data visualization capabilities into the UI.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/tamara-janina-fingerlin/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Tamara Fingerlin</a></p><p>https://www.linkedin.com/in/tamara-janina-fingerlin/</p><p><br></p><p><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> | LinkedIn </p><p>https://www.linkedin.com/company/astronomer/</p><p><br></p><p><a href="https://www.astronomer.io" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> | Website</p><p>https://www.astronomer.io</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a></p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.astronomer.io/airflow/state-of-airflow/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">2025 State of Airflow Webinar</a></p><p>https://www.astronomer.io/airflow/state-of-airflow/</p><p><br></p><p><a href="https://apache-airflow-slack.herokuapp.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Slack</a></p><p>https://apache-airflow-slack.herokuapp.com/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer Presents: Exploring Apache Airflow® 3 Roadshows</a></p><p>https://www.astronomer.io/events/roadshow/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/london/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/london/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/new-york/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/new-york/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/sydney/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/sydney/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/san-francisco/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/san-francisco/</a></p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/chicago/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">https://www.astronomer.io/events/roadshow/chicago/</a></p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[The 2025 State of Airflow report sheds light on how global users are adopting, evolving and innovating with Apache Airflow. With over 5,000 responses from 116 countries, the survey reveals critical insights into Airflows’ role in business operation...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>34</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[0e220ca3-25da-4723-9adc-6ffb3ef49521]]></guid>
  <title><![CDATA[The Software Risk That Affects Everyone and How To Address It with Michael Winser and Jarek Potiuk]]></title>
  <description><![CDATA[<p><span style="background-color: transparent;">The security of open-source software is a growing concern, especially as dependencies and regulations become more complex, making it essential to understand how to manage software supply chains effectively.&nbsp;</span></p><p><span style="background-color: transparent;">In this episode, we sit down with </span><a href="https://www.linkedin.com/in/michaelw/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Michael Winser</a><span style="background-color: transparent;">, Co-Founder at Alpha-Omega and Security Strategy Ambassador at </span><a href="https://www.linkedin.com/company/eclipse-foundation/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Eclipse Foundation</a><span style="background-color: transparent;">, and </span><a href="https://www.linkedin.com/in/jarekpotiuk/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jarek Potiuk</a><span style="background-color: transparent;">, Member of the Security Committee at the </span><a href="https://www.linkedin.com/company/the-apache-software-foundation/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Software Foundation</a><span style="background-color: transparent;">, to discuss the challenges of securing Airflow’s dependencies, the evolving landscape of open-source security and how contributors can help strengthen the ecosystem.</span></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿﻿﻿﻿﻿</span>&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(02:43) Jarek quit his full-time engineer position and uses Airflow as a freelancer.&nbsp;</span></p><p><span style="background-color: transparent;">(04:32) Michael finds happiness in having meaningful work with open-source security.</span></p><p><span style="background-color: transparent;">(07:01) Software supply chain security focuses on correctness, integrity and availability.</span></p><p><span style="background-color: transparent;">(08:44) Airflow’s 790 dependencies present a unique security challenge.</span></p><p><span style="background-color: transparent;">(09:43) Airflow’s security team has significantly improved its vulnerability response.</span></p><p><span style="background-color: transparent;">(10:22) The transition to Airflow 3 emphasizes enterprise security readiness.</span></p><p><span style="background-color: transparent;">(16:20) The ‘Three Fs’ approach: fix it, fork it, or forget it.</span></p><p><span style="background-color: transparent;">(18:45) Dependency health is often more critical than fixing known vulnerabilities.</span></p><p><span style="background-color: transparent;">(23:32) The ‘Three Fs’ in action.&nbsp;</span></p><p><span style="background-color: transparent;">(26:26) Open-source contributors play a key role in supply chain security.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/michaelw/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Michael Winser</a> -</p><p>https://www.linkedin.com/in/michaelw/</p><p><br></p><p><a href="https://www.linkedin.com/in/jarekpotiuk/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jarek Potiuk</a> - </p><p>https://www.linkedin.com/in/jarekpotiuk/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.linkedin.com/company/the-apache-software-foundation/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Software Foundation</a> | LinkedIn -</p><p>https://www.linkedin.com/company/the-apache-software-foundation/</p><p><br></p><p><a href="https://www.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Software Foundation</a> | Website -</p><p>https://www.apache.org/</p><p><br></p><p><a href="https://www.linkedin.com/company/eclipse-foundation/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Eclipse Foundation</a> | LinkedIn -</p><p>https://www.linkedin.com/company/eclipse-foundation/</p><p><br></p><p><a href="https://www.eclipse.org/org/foundation/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Eclipse Foundation</a> | Website -</p><p>https://www.eclipse.org/org/foundation/</p><p><br></p><p><a href="https://openssf.org/community/openssf-working-groups/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">OpenSSF Working Groups</a> -</p><p>https://openssf.org/community/openssf-working-groups/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/london/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer Roadshow: Exploring Apache Airflow 3 | London</a></p><p>https://www.astronomer.io/events/roadshow/london/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/new-york/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer Roadshow: Exploring Apache Airflow 3 | New York</a></p><p>https://www.astronomer.io/events/roadshow/new-york/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/sydney/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer Roadshow: Exploring Apache Airflow 3 | Sydney</a></p><p>https://www.astronomer.io/events/roadshow/sydney/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/san-francisco/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer Roadshow: Exploring Apache Airflow 3 | San Francisco</a></p><p>https://www.astronomer.io/events/roadshow/san-francisco/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/chicago/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer Roadshow: Exploring Apache Airflow 3 | Chicago</a></p><p>https://www.astronomer.io/events/roadshow/chicago/</p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/9c5a3e03-8a21-4d6f-9a78-9f28d49a2995/471f10b02c.jpg" />
  <pubDate>Thu, 20 Mar 2025 00:37:00 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="27318311" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/9c5a3e03-8a21-4d6f-9a78-9f28d49a2995/episode.mp3" />
  <itunes:title><![CDATA[The Software Risk That Affects Everyone and How To Address It with Michael Winser and Jarek Potiuk]]></itunes:title>
  <itunes:duration>28:27</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent;">The security of open-source software is a growing concern, especially as dependencies and regulations become more complex, making it essential to understand how to manage software supply chains effectively.&nbsp;</span></p><p><span style="background-color: transparent;">In this episode, we sit down with </span><a href="https://www.linkedin.com/in/michaelw/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Michael Winser</a><span style="background-color: transparent;">, Co-Founder at Alpha-Omega and Security Strategy Ambassador at </span><a href="https://www.linkedin.com/company/eclipse-foundation/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Eclipse Foundation</a><span style="background-color: transparent;">, and </span><a href="https://www.linkedin.com/in/jarekpotiuk/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jarek Potiuk</a><span style="background-color: transparent;">, Member of the Security Committee at the </span><a href="https://www.linkedin.com/company/the-apache-software-foundation/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Software Foundation</a><span style="background-color: transparent;">, to discuss the challenges of securing Airflow’s dependencies, the evolving landscape of open-source security and how contributors can help strengthen the ecosystem.</span></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿﻿﻿﻿﻿</span>&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(02:43) Jarek quit his full-time engineer position and uses Airflow as a freelancer.&nbsp;</span></p><p><span style="background-color: transparent;">(04:32) Michael finds happiness in having meaningful work with open-source security.</span></p><p><span style="background-color: transparent;">(07:01) Software supply chain security focuses on correctness, integrity and availability.</span></p><p><span style="background-color: transparent;">(08:44) Airflow’s 790 dependencies present a unique security challenge.</span></p><p><span style="background-color: transparent;">(09:43) Airflow’s security team has significantly improved its vulnerability response.</span></p><p><span style="background-color: transparent;">(10:22) The transition to Airflow 3 emphasizes enterprise security readiness.</span></p><p><span style="background-color: transparent;">(16:20) The ‘Three Fs’ approach: fix it, fork it, or forget it.</span></p><p><span style="background-color: transparent;">(18:45) Dependency health is often more critical than fixing known vulnerabilities.</span></p><p><span style="background-color: transparent;">(23:32) The ‘Three Fs’ in action.&nbsp;</span></p><p><span style="background-color: transparent;">(26:26) Open-source contributors play a key role in supply chain security.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/michaelw/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Michael Winser</a> -</p><p>https://www.linkedin.com/in/michaelw/</p><p><br></p><p><a href="https://www.linkedin.com/in/jarekpotiuk/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jarek Potiuk</a> - </p><p>https://www.linkedin.com/in/jarekpotiuk/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.linkedin.com/company/the-apache-software-foundation/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Software Foundation</a> | LinkedIn -</p><p>https://www.linkedin.com/company/the-apache-software-foundation/</p><p><br></p><p><a href="https://www.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Software Foundation</a> | Website -</p><p>https://www.apache.org/</p><p><br></p><p><a href="https://www.linkedin.com/company/eclipse-foundation/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Eclipse Foundation</a> | LinkedIn -</p><p>https://www.linkedin.com/company/eclipse-foundation/</p><p><br></p><p><a href="https://www.eclipse.org/org/foundation/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Eclipse Foundation</a> | Website -</p><p>https://www.eclipse.org/org/foundation/</p><p><br></p><p><a href="https://openssf.org/community/openssf-working-groups/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">OpenSSF Working Groups</a> -</p><p>https://openssf.org/community/openssf-working-groups/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/london/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer Roadshow: Exploring Apache Airflow 3 | London</a></p><p>https://www.astronomer.io/events/roadshow/london/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/new-york/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer Roadshow: Exploring Apache Airflow 3 | New York</a></p><p>https://www.astronomer.io/events/roadshow/new-york/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/sydney/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer Roadshow: Exploring Apache Airflow 3 | Sydney</a></p><p>https://www.astronomer.io/events/roadshow/sydney/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/san-francisco/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer Roadshow: Exploring Apache Airflow 3 | San Francisco</a></p><p>https://www.astronomer.io/events/roadshow/san-francisco/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/chicago/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer Roadshow: Exploring Apache Airflow 3 | Chicago</a></p><p>https://www.astronomer.io/events/roadshow/chicago/</p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent;">The security of open-source software is a growing concern, especially as dependencies and regulations become more complex, making it essential to understand how to manage software supply chains effectively.&nbsp;</span></p><p><span style="background-color: transparent;">In this episode, we sit down with </span><a href="https://www.linkedin.com/in/michaelw/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Michael Winser</a><span style="background-color: transparent;">, Co-Founder at Alpha-Omega and Security Strategy Ambassador at </span><a href="https://www.linkedin.com/company/eclipse-foundation/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Eclipse Foundation</a><span style="background-color: transparent;">, and </span><a href="https://www.linkedin.com/in/jarekpotiuk/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jarek Potiuk</a><span style="background-color: transparent;">, Member of the Security Committee at the </span><a href="https://www.linkedin.com/company/the-apache-software-foundation/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Software Foundation</a><span style="background-color: transparent;">, to discuss the challenges of securing Airflow’s dependencies, the evolving landscape of open-source security and how contributors can help strengthen the ecosystem.</span></p><p><span style="background-color: transparent;"><span class="ql-cursor">﻿﻿﻿﻿﻿</span>&nbsp;</span></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(02:43) Jarek quit his full-time engineer position and uses Airflow as a freelancer.&nbsp;</span></p><p><span style="background-color: transparent;">(04:32) Michael finds happiness in having meaningful work with open-source security.</span></p><p><span style="background-color: transparent;">(07:01) Software supply chain security focuses on correctness, integrity and availability.</span></p><p><span style="background-color: transparent;">(08:44) Airflow’s 790 dependencies present a unique security challenge.</span></p><p><span style="background-color: transparent;">(09:43) Airflow’s security team has significantly improved its vulnerability response.</span></p><p><span style="background-color: transparent;">(10:22) The transition to Airflow 3 emphasizes enterprise security readiness.</span></p><p><span style="background-color: transparent;">(16:20) The ‘Three Fs’ approach: fix it, fork it, or forget it.</span></p><p><span style="background-color: transparent;">(18:45) Dependency health is often more critical than fixing known vulnerabilities.</span></p><p><span style="background-color: transparent;">(23:32) The ‘Three Fs’ in action.&nbsp;</span></p><p><span style="background-color: transparent;">(26:26) Open-source contributors play a key role in supply chain security.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/michaelw/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Michael Winser</a> -</p><p>https://www.linkedin.com/in/michaelw/</p><p><br></p><p><a href="https://www.linkedin.com/in/jarekpotiuk/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jarek Potiuk</a> - </p><p>https://www.linkedin.com/in/jarekpotiuk/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.linkedin.com/company/the-apache-software-foundation/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Software Foundation</a> | LinkedIn -</p><p>https://www.linkedin.com/company/the-apache-software-foundation/</p><p><br></p><p><a href="https://www.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Software Foundation</a> | Website -</p><p>https://www.apache.org/</p><p><br></p><p><a href="https://www.linkedin.com/company/eclipse-foundation/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Eclipse Foundation</a> | LinkedIn -</p><p>https://www.linkedin.com/company/eclipse-foundation/</p><p><br></p><p><a href="https://www.eclipse.org/org/foundation/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Eclipse Foundation</a> | Website -</p><p>https://www.eclipse.org/org/foundation/</p><p><br></p><p><a href="https://openssf.org/community/openssf-working-groups/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">OpenSSF Working Groups</a> -</p><p>https://openssf.org/community/openssf-working-groups/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/london/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer Roadshow: Exploring Apache Airflow 3 | London</a></p><p>https://www.astronomer.io/events/roadshow/london/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/new-york/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer Roadshow: Exploring Apache Airflow 3 | New York</a></p><p>https://www.astronomer.io/events/roadshow/new-york/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/sydney/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer Roadshow: Exploring Apache Airflow 3 | Sydney</a></p><p>https://www.astronomer.io/events/roadshow/sydney/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/san-francisco/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer Roadshow: Exploring Apache Airflow 3 | San Francisco</a></p><p>https://www.astronomer.io/events/roadshow/san-francisco/</p><p><br></p><p><a href="https://www.astronomer.io/events/roadshow/chicago/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer Roadshow: Exploring Apache Airflow 3 | Chicago</a></p><p>https://www.astronomer.io/events/roadshow/chicago/</p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[The security of open-source software is a growing concern, especially as dependencies and regulations become more complex, making it essential to understand how to manage software supply chains effectively. In this episode, we sit down with Michael...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>32</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[d817db27-4730-453f-92a1-5eaad6c9f72d]]></guid>
  <title><![CDATA[Building Scalable ML Infrastructure at Outerbounds with Savin Goyal ]]></title>
  <description><![CDATA[<p><span style="background-color: transparent;">Machine learning is changing fast, and companies need better tools to handle AI workloads. The right infrastructure helps data scientists focus on solving problems instead of managing complex systems. In this episode, we talk with </span><a href="https://www.linkedin.com/in/savingoyal/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Savin Goyal</a><span style="background-color: transparent;">, Co-Founder and CTO at </span><a href="https://www.linkedin.com/company/outerbounds/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Outerbounds</a><span style="background-color: transparent;">, about building ML infrastructure, how orchestration makes workflows easier and how Metaflow and Airflow work together to simplify data science.&nbsp;</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(02:02) Savin spent years building AI and ML infrastructure, including at Netflix.</span></p><p><span style="background-color: transparent;">(04:05) ML engineering was not a defined role a decade ago.</span></p><p><span style="background-color: transparent;">(08:17) Modernizing AI and ML requires balancing new tools with existing strengths.</span></p><p><span style="background-color: transparent;">(10:28) ML workloads can be long-running or require heavy computation.</span></p><p><span style="background-color: transparent;">(15:29) Different teams at Netflix used multiple orchestration systems for specific needs.</span></p><p><span style="background-color: transparent;">(20:10) Stable APIs prevent rework and keep projects moving.</span></p><p><span style="background-color: transparent;">(21:07) Metaflow simplifies ML workflows by optimizing data and compute interactions.</span></p><p><span style="background-color: transparent;">(25:53) Limited local computing power makes running ML workloads challenging.</span></p><p><span style="background-color: transparent;">(27:43) Airflow UI monitors pipelines, while Metaflow UI gives ML insights.</span></p><p><span style="background-color: transparent;">(33:13) The most successful data professionals focus on business impact, not just technology.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/savingoyal/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Savin Goyal</a> -</p><p>https://www.linkedin.com/in/savingoyal/</p><p><br></p><p><a href="https://www.linkedin.com/company/outerbounds/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Outerbounds</a> -</p><p>https://www.linkedin.com/company/outerbounds/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://metaflow.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Metaflow</a> - </p><p>https://metaflow.org/</p><p><br></p><p><a href="https://netflixtechblog.com/maestro-netflixs-workflow-orchestrator-ee13a06f9c78?gi=8e6a067a92e9#:~:text=Maestro%20is%20a%20fully%20managed,data%20between%20different%20storages%2C%20etc." target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Netflix’s Maestro Orchestration System</a> -</p><p>https://netflixtechblog.com/maestro-netflixs-workflow-orchestrator-ee13a06f9c78?gi=8e6a067a92e9#:~:text=Maestro%20is%20a%20fully%20managed,data%20between%20different%20storages%2C%20etc.</p><p><br></p><p><a href="https://www.tensorflow.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">TensorFlow</a> -</p><p>https://www.tensorflow.org/</p><p><br></p><p><a href="https://pytorch.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">PyTorch</a> -</p><p>https://pytorch.org/</p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/e8b8f1d9-f15c-41fb-88e4-dea312f52d89/b3984b8040.jpg" />
  <pubDate>Thu, 13 Mar 2025 00:37:34 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="35298405" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/e8b8f1d9-f15c-41fb-88e4-dea312f52d89/episode.mp3" />
  <itunes:title><![CDATA[Building Scalable ML Infrastructure at Outerbounds with Savin Goyal ]]></itunes:title>
  <itunes:duration>36:46</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent;">Machine learning is changing fast, and companies need better tools to handle AI workloads. The right infrastructure helps data scientists focus on solving problems instead of managing complex systems. In this episode, we talk with </span><a href="https://www.linkedin.com/in/savingoyal/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Savin Goyal</a><span style="background-color: transparent;">, Co-Founder and CTO at </span><a href="https://www.linkedin.com/company/outerbounds/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Outerbounds</a><span style="background-color: transparent;">, about building ML infrastructure, how orchestration makes workflows easier and how Metaflow and Airflow work together to simplify data science.&nbsp;</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(02:02) Savin spent years building AI and ML infrastructure, including at Netflix.</span></p><p><span style="background-color: transparent;">(04:05) ML engineering was not a defined role a decade ago.</span></p><p><span style="background-color: transparent;">(08:17) Modernizing AI and ML requires balancing new tools with existing strengths.</span></p><p><span style="background-color: transparent;">(10:28) ML workloads can be long-running or require heavy computation.</span></p><p><span style="background-color: transparent;">(15:29) Different teams at Netflix used multiple orchestration systems for specific needs.</span></p><p><span style="background-color: transparent;">(20:10) Stable APIs prevent rework and keep projects moving.</span></p><p><span style="background-color: transparent;">(21:07) Metaflow simplifies ML workflows by optimizing data and compute interactions.</span></p><p><span style="background-color: transparent;">(25:53) Limited local computing power makes running ML workloads challenging.</span></p><p><span style="background-color: transparent;">(27:43) Airflow UI monitors pipelines, while Metaflow UI gives ML insights.</span></p><p><span style="background-color: transparent;">(33:13) The most successful data professionals focus on business impact, not just technology.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/savingoyal/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Savin Goyal</a> -</p><p>https://www.linkedin.com/in/savingoyal/</p><p><br></p><p><a href="https://www.linkedin.com/company/outerbounds/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Outerbounds</a> -</p><p>https://www.linkedin.com/company/outerbounds/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://metaflow.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Metaflow</a> - </p><p>https://metaflow.org/</p><p><br></p><p><a href="https://netflixtechblog.com/maestro-netflixs-workflow-orchestrator-ee13a06f9c78?gi=8e6a067a92e9#:~:text=Maestro%20is%20a%20fully%20managed,data%20between%20different%20storages%2C%20etc." target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Netflix’s Maestro Orchestration System</a> -</p><p>https://netflixtechblog.com/maestro-netflixs-workflow-orchestrator-ee13a06f9c78?gi=8e6a067a92e9#:~:text=Maestro%20is%20a%20fully%20managed,data%20between%20different%20storages%2C%20etc.</p><p><br></p><p><a href="https://www.tensorflow.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">TensorFlow</a> -</p><p>https://www.tensorflow.org/</p><p><br></p><p><a href="https://pytorch.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">PyTorch</a> -</p><p>https://pytorch.org/</p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent;">Machine learning is changing fast, and companies need better tools to handle AI workloads. The right infrastructure helps data scientists focus on solving problems instead of managing complex systems. In this episode, we talk with </span><a href="https://www.linkedin.com/in/savingoyal/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Savin Goyal</a><span style="background-color: transparent;">, Co-Founder and CTO at </span><a href="https://www.linkedin.com/company/outerbounds/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Outerbounds</a><span style="background-color: transparent;">, about building ML infrastructure, how orchestration makes workflows easier and how Metaflow and Airflow work together to simplify data science.&nbsp;</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(02:02) Savin spent years building AI and ML infrastructure, including at Netflix.</span></p><p><span style="background-color: transparent;">(04:05) ML engineering was not a defined role a decade ago.</span></p><p><span style="background-color: transparent;">(08:17) Modernizing AI and ML requires balancing new tools with existing strengths.</span></p><p><span style="background-color: transparent;">(10:28) ML workloads can be long-running or require heavy computation.</span></p><p><span style="background-color: transparent;">(15:29) Different teams at Netflix used multiple orchestration systems for specific needs.</span></p><p><span style="background-color: transparent;">(20:10) Stable APIs prevent rework and keep projects moving.</span></p><p><span style="background-color: transparent;">(21:07) Metaflow simplifies ML workflows by optimizing data and compute interactions.</span></p><p><span style="background-color: transparent;">(25:53) Limited local computing power makes running ML workloads challenging.</span></p><p><span style="background-color: transparent;">(27:43) Airflow UI monitors pipelines, while Metaflow UI gives ML insights.</span></p><p><span style="background-color: transparent;">(33:13) The most successful data professionals focus on business impact, not just technology.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/savingoyal/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Savin Goyal</a> -</p><p>https://www.linkedin.com/in/savingoyal/</p><p><br></p><p><a href="https://www.linkedin.com/company/outerbounds/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Outerbounds</a> -</p><p>https://www.linkedin.com/company/outerbounds/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://metaflow.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Metaflow</a> - </p><p>https://metaflow.org/</p><p><br></p><p><a href="https://netflixtechblog.com/maestro-netflixs-workflow-orchestrator-ee13a06f9c78?gi=8e6a067a92e9#:~:text=Maestro%20is%20a%20fully%20managed,data%20between%20different%20storages%2C%20etc." target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Netflix’s Maestro Orchestration System</a> -</p><p>https://netflixtechblog.com/maestro-netflixs-workflow-orchestrator-ee13a06f9c78?gi=8e6a067a92e9#:~:text=Maestro%20is%20a%20fully%20managed,data%20between%20different%20storages%2C%20etc.</p><p><br></p><p><a href="https://www.tensorflow.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">TensorFlow</a> -</p><p>https://www.tensorflow.org/</p><p><br></p><p><a href="https://pytorch.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">PyTorch</a> -</p><p>https://pytorch.org/</p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Machine learning is changing fast, and companies need better tools to handle AI workloads. The right infrastructure helps data scientists focus on solving problems instead of managing complex systems. In this episode, we talk with Savin Goyal, Co-F...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>31</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[42bfe9b3-f256-4f08-a4ad-92efcc0bea64]]></guid>
  <title><![CDATA[Customizing Airflow for Complex Data Environments at Stripe with Nick Bilozerov and Sharadh Krishnamurthy]]></title>
  <description><![CDATA[<p><span style="background-color: transparent;">Keeping data pipelines reliable at scale requires more than just the right tools — it demands constant innovation. In this episode, </span><a href="https://www.linkedin.com/in/nick-bilozerov/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Nick Bilozerov</a><span style="background-color: transparent;">, Senior Data Engineer at </span><a href="https://www.linkedin.com/company/stripe/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Stripe</a><span style="background-color: transparent;">, and </span><a href="https://www.linkedin.com/in/sharadhk/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Sharadh Krishnamurthy</a><span style="background-color: transparent;">, Engineering Manager at Stripe, discuss how Stripe customizes Airflow for its needs, the evolution of its data orchestration framework and the transition to Airflow 2. They also share insights on scaling data workflows while maintaining performance, reliability and developer experience.&nbsp;</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><strong style="background-color: transparent;"><span class="ql-cursor">﻿</span></strong></p><p><span style="background-color: transparent;">(02:04) Stripe’s mission is to grow the GDP of the internet by supporting businesses with payments and data.</span></p><p><span style="background-color: transparent;">(05:08) 80% of Stripe engineers use data orchestration, making scalability critical.</span></p><p><span style="background-color: transparent;">(06:06) Airflow powers business reports, regulatory needs and ML workflows.</span></p><p><span style="background-color: transparent;">(08:02) Custom task frameworks improve dependencies and validation.</span></p><p><span style="background-color: transparent;">(08:50) "User scope mode" enables local testing without production impact.</span></p><p><span style="background-color: transparent;">(10:39) Migrating to Airflow 2 improves isolation, safety and scalability.</span></p><p><span style="background-color: transparent;">(16:40) Monolithic DAGs caused database issues, prompting a service-based shift.</span></p><p><span style="background-color: transparent;">(19:24) Frequent Airflow upgrades ensure stability and access to new features.</span></p><p><span style="background-color: transparent;">(21:38) DAG versioning and backfill improvements enhance developer experience.</span></p><p><span style="background-color: transparent;">(23:38) Greater UI customization would offer more flexibility.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/nick-bilozerov/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Nick Bilozerov</a> -</p><p>https://www.linkedin.com/in/nick-bilozerov/</p><p><br></p><p><a href="https://www.linkedin.com/in/sharadhk/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Sharadh Krishnamurthy</a> -</p><p>https://www.linkedin.com/in/sharadhk/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.linkedin.com/company/stripe/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Stripe</a> | LinkedIn -</p><p>https://www.linkedin.com/company/stripe/</p><p><br></p><p><a href="https://stripe.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Stripe</a> | Website -</p><p>https://stripe.com/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/c9a51c98-276e-43e4-b15d-d8adf4c53452/4f75a6f25d.jpg" />
  <pubDate>Thu, 06 Mar 2025 02:20:37 -0500</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="26570582" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/c9a51c98-276e-43e4-b15d-d8adf4c53452/episode.mp3" />
  <itunes:title><![CDATA[Customizing Airflow for Complex Data Environments at Stripe with Nick Bilozerov and Sharadh Krishnamurthy]]></itunes:title>
  <itunes:duration>27:40</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent;">Keeping data pipelines reliable at scale requires more than just the right tools — it demands constant innovation. In this episode, </span><a href="https://www.linkedin.com/in/nick-bilozerov/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Nick Bilozerov</a><span style="background-color: transparent;">, Senior Data Engineer at </span><a href="https://www.linkedin.com/company/stripe/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Stripe</a><span style="background-color: transparent;">, and </span><a href="https://www.linkedin.com/in/sharadhk/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Sharadh Krishnamurthy</a><span style="background-color: transparent;">, Engineering Manager at Stripe, discuss how Stripe customizes Airflow for its needs, the evolution of its data orchestration framework and the transition to Airflow 2. They also share insights on scaling data workflows while maintaining performance, reliability and developer experience.&nbsp;</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><strong style="background-color: transparent;"><span class="ql-cursor">﻿</span></strong></p><p><span style="background-color: transparent;">(02:04) Stripe’s mission is to grow the GDP of the internet by supporting businesses with payments and data.</span></p><p><span style="background-color: transparent;">(05:08) 80% of Stripe engineers use data orchestration, making scalability critical.</span></p><p><span style="background-color: transparent;">(06:06) Airflow powers business reports, regulatory needs and ML workflows.</span></p><p><span style="background-color: transparent;">(08:02) Custom task frameworks improve dependencies and validation.</span></p><p><span style="background-color: transparent;">(08:50) "User scope mode" enables local testing without production impact.</span></p><p><span style="background-color: transparent;">(10:39) Migrating to Airflow 2 improves isolation, safety and scalability.</span></p><p><span style="background-color: transparent;">(16:40) Monolithic DAGs caused database issues, prompting a service-based shift.</span></p><p><span style="background-color: transparent;">(19:24) Frequent Airflow upgrades ensure stability and access to new features.</span></p><p><span style="background-color: transparent;">(21:38) DAG versioning and backfill improvements enhance developer experience.</span></p><p><span style="background-color: transparent;">(23:38) Greater UI customization would offer more flexibility.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/nick-bilozerov/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Nick Bilozerov</a> -</p><p>https://www.linkedin.com/in/nick-bilozerov/</p><p><br></p><p><a href="https://www.linkedin.com/in/sharadhk/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Sharadh Krishnamurthy</a> -</p><p>https://www.linkedin.com/in/sharadhk/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.linkedin.com/company/stripe/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Stripe</a> | LinkedIn -</p><p>https://www.linkedin.com/company/stripe/</p><p><br></p><p><a href="https://stripe.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Stripe</a> | Website -</p><p>https://stripe.com/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent;">Keeping data pipelines reliable at scale requires more than just the right tools — it demands constant innovation. In this episode, </span><a href="https://www.linkedin.com/in/nick-bilozerov/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Nick Bilozerov</a><span style="background-color: transparent;">, Senior Data Engineer at </span><a href="https://www.linkedin.com/company/stripe/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Stripe</a><span style="background-color: transparent;">, and </span><a href="https://www.linkedin.com/in/sharadhk/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Sharadh Krishnamurthy</a><span style="background-color: transparent;">, Engineering Manager at Stripe, discuss how Stripe customizes Airflow for its needs, the evolution of its data orchestration framework and the transition to Airflow 2. They also share insights on scaling data workflows while maintaining performance, reliability and developer experience.&nbsp;</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><strong style="background-color: transparent;"><span class="ql-cursor">﻿</span></strong></p><p><span style="background-color: transparent;">(02:04) Stripe’s mission is to grow the GDP of the internet by supporting businesses with payments and data.</span></p><p><span style="background-color: transparent;">(05:08) 80% of Stripe engineers use data orchestration, making scalability critical.</span></p><p><span style="background-color: transparent;">(06:06) Airflow powers business reports, regulatory needs and ML workflows.</span></p><p><span style="background-color: transparent;">(08:02) Custom task frameworks improve dependencies and validation.</span></p><p><span style="background-color: transparent;">(08:50) "User scope mode" enables local testing without production impact.</span></p><p><span style="background-color: transparent;">(10:39) Migrating to Airflow 2 improves isolation, safety and scalability.</span></p><p><span style="background-color: transparent;">(16:40) Monolithic DAGs caused database issues, prompting a service-based shift.</span></p><p><span style="background-color: transparent;">(19:24) Frequent Airflow upgrades ensure stability and access to new features.</span></p><p><span style="background-color: transparent;">(21:38) DAG versioning and backfill improvements enhance developer experience.</span></p><p><span style="background-color: transparent;">(23:38) Greater UI customization would offer more flexibility.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/nick-bilozerov/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Nick Bilozerov</a> -</p><p>https://www.linkedin.com/in/nick-bilozerov/</p><p><br></p><p><a href="https://www.linkedin.com/in/sharadhk/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Sharadh Krishnamurthy</a> -</p><p>https://www.linkedin.com/in/sharadhk/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.linkedin.com/company/stripe/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Stripe</a> | LinkedIn -</p><p>https://www.linkedin.com/company/stripe/</p><p><br></p><p><a href="https://stripe.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Stripe</a> | Website -</p><p>https://stripe.com/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Keeping data pipelines reliable at scale requires more than just the right tools — it demands constant innovation. In this episode, Nick Bilozerov, Senior Data Engineer at Stripe, and Sharadh Krishnamurthy, Engineering Manager at Stripe, discuss ho...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>30</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[22aa5591-5434-40ab-8fee-738a1a773500]]></guid>
  <title><![CDATA[Harnessing Airflow for Data-Driven Policy Research at CSET with Jennifer Melot]]></title>
  <description><![CDATA[<p><span style="background-color: transparent;">Turning complex datasets into meaningful analysis requires robust data infrastructure and seamless orchestration. In this episode, we’re joined by </span><a href="https://www.linkedin.com/in/jennifer-melot-aa710144/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jennifer Melot</a><span style="background-color: transparent;">, Technical Lead </span><span style="background-color: transparent; color: rgb(22, 14, 61);">at the </span><a href="https://www.linkedin.com/company/georgetown-cset/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Center for Security and Emerging Technology</a><span style="background-color: transparent; color: rgb(22, 14, 61);"> (CSET) at Georgetown University,</span><span style="background-color: transparent;"> to explore how Airflow powers data-driven insights in technology policy research. Jennifer shares how her team automates workflows to support analysts in navigating complex datasets.&nbsp;</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><strong style="background-color: transparent;"><span class="ql-cursor">﻿﻿</span></strong></p><p><span style="background-color: transparent;">(02:04) CSET provides data-driven analysis to inform government decision-makers.</span></p><p><span style="background-color: transparent;">(03:54) ETL pipelines merge multiple data sources for more comprehensive insights.</span></p><p><span style="background-color: transparent;">(04:20) Airflow is central to automating and streamlining large-scale data ingestion.</span></p><p><span style="background-color: transparent;">(05:11) Larger-scale databases create challenges that require scalable solutions.</span></p><p><span style="background-color: transparent;">(07:20) Dynamic DAG generation simplifies Airflow adoption for non-engineers.</span></p><p><span style="background-color: transparent;">(12:13) DAG Factory and dynamic task mapping can improve workflow efficiency.</span></p><p><span style="background-color: transparent;">(15:46) Tracking data lineage helps teams understand dependencies across DAGs.</span></p><p><span style="background-color: transparent;">(16:14) New Airflow features enhance visibility and debugging for complex pipelines.</span></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/jennifer-melot-aa710144/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jennifer Melot</a> -</p><p>https://www.linkedin.com/in/jennifer-melot-aa710144/</p><p><br></p><p><a href="https://www.linkedin.com/company/georgetown-cset/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Center for Security and Emerging Technology</a><span style="background-color: transparent; color: rgb(22, 14, 61);"> (CSET) -</span></p><p>https://www.linkedin.com/company/georgetown-cset/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://zenodo.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Zenodo</a> -</p><p>https://zenodo.org/</p><p><br></p><p><a href="https://openlineage.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">OpenLineage</a> -</p><p>https://openlineage.io/</p><p><br></p><p><a href="https://cloud.google.com/dataplex" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Cloud Dataplex</a> -</p><p>https://cloud.google.com/dataplex</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/d73f36e9-f650-447c-bef2-c5f61211aae9/b61f58fce2.jpg" />
  <pubDate>Thu, 27 Feb 2025 13:15:00 -0500</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="17196593" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/d73f36e9-f650-447c-bef2-c5f61211aae9/episode.mp3" />
  <itunes:title><![CDATA[Harnessing Airflow for Data-Driven Policy Research at CSET with Jennifer Melot]]></itunes:title>
  <itunes:duration>17:54</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent;">Turning complex datasets into meaningful analysis requires robust data infrastructure and seamless orchestration. In this episode, we’re joined by </span><a href="https://www.linkedin.com/in/jennifer-melot-aa710144/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jennifer Melot</a><span style="background-color: transparent;">, Technical Lead </span><span style="background-color: transparent; color: rgb(22, 14, 61);">at the </span><a href="https://www.linkedin.com/company/georgetown-cset/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Center for Security and Emerging Technology</a><span style="background-color: transparent; color: rgb(22, 14, 61);"> (CSET) at Georgetown University,</span><span style="background-color: transparent;"> to explore how Airflow powers data-driven insights in technology policy research. Jennifer shares how her team automates workflows to support analysts in navigating complex datasets.&nbsp;</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><strong style="background-color: transparent;"><span class="ql-cursor">﻿﻿</span></strong></p><p><span style="background-color: transparent;">(02:04) CSET provides data-driven analysis to inform government decision-makers.</span></p><p><span style="background-color: transparent;">(03:54) ETL pipelines merge multiple data sources for more comprehensive insights.</span></p><p><span style="background-color: transparent;">(04:20) Airflow is central to automating and streamlining large-scale data ingestion.</span></p><p><span style="background-color: transparent;">(05:11) Larger-scale databases create challenges that require scalable solutions.</span></p><p><span style="background-color: transparent;">(07:20) Dynamic DAG generation simplifies Airflow adoption for non-engineers.</span></p><p><span style="background-color: transparent;">(12:13) DAG Factory and dynamic task mapping can improve workflow efficiency.</span></p><p><span style="background-color: transparent;">(15:46) Tracking data lineage helps teams understand dependencies across DAGs.</span></p><p><span style="background-color: transparent;">(16:14) New Airflow features enhance visibility and debugging for complex pipelines.</span></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/jennifer-melot-aa710144/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jennifer Melot</a> -</p><p>https://www.linkedin.com/in/jennifer-melot-aa710144/</p><p><br></p><p><a href="https://www.linkedin.com/company/georgetown-cset/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Center for Security and Emerging Technology</a><span style="background-color: transparent; color: rgb(22, 14, 61);"> (CSET) -</span></p><p>https://www.linkedin.com/company/georgetown-cset/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://zenodo.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Zenodo</a> -</p><p>https://zenodo.org/</p><p><br></p><p><a href="https://openlineage.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">OpenLineage</a> -</p><p>https://openlineage.io/</p><p><br></p><p><a href="https://cloud.google.com/dataplex" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Cloud Dataplex</a> -</p><p>https://cloud.google.com/dataplex</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent;">Turning complex datasets into meaningful analysis requires robust data infrastructure and seamless orchestration. In this episode, we’re joined by </span><a href="https://www.linkedin.com/in/jennifer-melot-aa710144/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jennifer Melot</a><span style="background-color: transparent;">, Technical Lead </span><span style="background-color: transparent; color: rgb(22, 14, 61);">at the </span><a href="https://www.linkedin.com/company/georgetown-cset/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Center for Security and Emerging Technology</a><span style="background-color: transparent; color: rgb(22, 14, 61);"> (CSET) at Georgetown University,</span><span style="background-color: transparent;"> to explore how Airflow powers data-driven insights in technology policy research. Jennifer shares how her team automates workflows to support analysts in navigating complex datasets.&nbsp;</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><strong style="background-color: transparent;"><span class="ql-cursor">﻿﻿</span></strong></p><p><span style="background-color: transparent;">(02:04) CSET provides data-driven analysis to inform government decision-makers.</span></p><p><span style="background-color: transparent;">(03:54) ETL pipelines merge multiple data sources for more comprehensive insights.</span></p><p><span style="background-color: transparent;">(04:20) Airflow is central to automating and streamlining large-scale data ingestion.</span></p><p><span style="background-color: transparent;">(05:11) Larger-scale databases create challenges that require scalable solutions.</span></p><p><span style="background-color: transparent;">(07:20) Dynamic DAG generation simplifies Airflow adoption for non-engineers.</span></p><p><span style="background-color: transparent;">(12:13) DAG Factory and dynamic task mapping can improve workflow efficiency.</span></p><p><span style="background-color: transparent;">(15:46) Tracking data lineage helps teams understand dependencies across DAGs.</span></p><p><span style="background-color: transparent;">(16:14) New Airflow features enhance visibility and debugging for complex pipelines.</span></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/jennifer-melot-aa710144/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jennifer Melot</a> -</p><p>https://www.linkedin.com/in/jennifer-melot-aa710144/</p><p><br></p><p><a href="https://www.linkedin.com/company/georgetown-cset/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Center for Security and Emerging Technology</a><span style="background-color: transparent; color: rgb(22, 14, 61);"> (CSET) -</span></p><p>https://www.linkedin.com/company/georgetown-cset/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://zenodo.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Zenodo</a> -</p><p>https://zenodo.org/</p><p><br></p><p><a href="https://openlineage.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">OpenLineage</a> -</p><p>https://openlineage.io/</p><p><br></p><p><a href="https://cloud.google.com/dataplex" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Cloud Dataplex</a> -</p><p>https://cloud.google.com/dataplex</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Turning complex datasets into meaningful analysis requires robust data infrastructure and seamless orchestration. In this episode, we’re joined by Jennifer Melot, Technical Lead at the Center for Security and Emerging Technology (CSET) at Georgetow...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>29</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[b37f3abe-5376-41a7-9481-7fcbdb361bd4]]></guid>
  <title><![CDATA[Hybrid Testing Solutions for Autonomous Driving at Bosch with Jens Scheffler and Christian Schilling]]></title>
  <description><![CDATA[<p><span style="background-color: transparent;">Testing autonomous vehicles demands precision, scalability and powerful orchestration tools — enter Apache Airflow, a key component of Bosch’s cutting-edge testing framework. In this episode, we sit down with </span><a href="https://www.linkedin.com/in/jens-scheffler/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jens Scheffler</a><span style="background-color: transparent;">, Test Execution Cluster Technical Architect, and </span><a href="https://www.linkedin.com/in/christian-schilling-a5078831a/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Christian Schilling</a><span style="background-color: transparent;">, Product Owner Open Loop Testing Automated Driving, both at </span><a href="https://www.linkedin.com/company/bosch/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Bosch</a><span style="background-color: transparent;">, to explore how Bosch harnesses Airflow to streamline complex testing scenarios. They share insights on scaling workflows, integrating hybrid infrastructures and ensuring vehicle safety through rigorous automated testing.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(01:35) Airflow orchestrates millions of test hours for autonomous systems.</span></p><p><span style="background-color: transparent;">(03:15) Jens scales distributed systems with Kubernetes for job orchestration.</span></p><p><span style="background-color: transparent;">(06:02) Airflow runs hundreds of tests simultaneously.</span></p><p><span style="background-color: transparent;">(06:44) Virtual testing reduces costs and on-road trials.</span></p><p><span style="background-color: transparent;">(12:19) Unified APIs and GUIs streamline operations.</span></p><p><span style="background-color: transparent;">(15:05) Self-service setups empower Bosch teams.</span></p><p><span style="background-color: transparent;">(18:00) Physical hardware integration ensures real-world timing.</span></p><p><span style="background-color: transparent;">(20:30) Dynamic task mapping scales workflows efficiently.</span></p><p><span style="background-color: transparent;">(25:22) Open-source contributions improve stability.</span></p><p><span style="background-color: transparent;">(31:06) Edge and Celery executors power Bosch's hybrid scheduling.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/jens-scheffler/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jens Scheffler</a> -</p><p>https://www.linkedin.com/in/jens-scheffler/</p><p><br></p><p><a href="https://www.linkedin.com/in/christian-schilling-a5078831a/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Christian Schilling</a> -</p><p>https://www.linkedin.com/in/christian-schilling-a5078831a/</p><p><br></p><p><a href="https://www.linkedin.com/company/bosch/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Bosch</a> -</p><p>https://www.linkedin.com/company/bosch/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://kubernetes.io" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a> -</p><p>https://kubernetes.io</p><p><br></p><p><a href="https://github.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">GitHub</a> -</p><p>https://github.com</p><p><br></p><p><a href="https://airflow.apache.org/docs/apache-airflow/stable/core-concepts/executor/index.html" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Edge Executor</a> -</p><p>https://airflow.apache.org/docs/apache-airflow/stable/core-concepts/executor/index.html</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/3227cd40-edba-4319-8a22-02c7f8d4fc9a/c196519ddf.jpg" />
  <pubDate>Thu, 13 Feb 2025 00:10:00 -0500</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="32401530" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/3227cd40-edba-4319-8a22-02c7f8d4fc9a/episode.mp3" />
  <itunes:title><![CDATA[Hybrid Testing Solutions for Autonomous Driving at Bosch with Jens Scheffler and Christian Schilling]]></itunes:title>
  <itunes:duration>33:45</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent;">Testing autonomous vehicles demands precision, scalability and powerful orchestration tools — enter Apache Airflow, a key component of Bosch’s cutting-edge testing framework. In this episode, we sit down with </span><a href="https://www.linkedin.com/in/jens-scheffler/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jens Scheffler</a><span style="background-color: transparent;">, Test Execution Cluster Technical Architect, and </span><a href="https://www.linkedin.com/in/christian-schilling-a5078831a/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Christian Schilling</a><span style="background-color: transparent;">, Product Owner Open Loop Testing Automated Driving, both at </span><a href="https://www.linkedin.com/company/bosch/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Bosch</a><span style="background-color: transparent;">, to explore how Bosch harnesses Airflow to streamline complex testing scenarios. They share insights on scaling workflows, integrating hybrid infrastructures and ensuring vehicle safety through rigorous automated testing.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(01:35) Airflow orchestrates millions of test hours for autonomous systems.</span></p><p><span style="background-color: transparent;">(03:15) Jens scales distributed systems with Kubernetes for job orchestration.</span></p><p><span style="background-color: transparent;">(06:02) Airflow runs hundreds of tests simultaneously.</span></p><p><span style="background-color: transparent;">(06:44) Virtual testing reduces costs and on-road trials.</span></p><p><span style="background-color: transparent;">(12:19) Unified APIs and GUIs streamline operations.</span></p><p><span style="background-color: transparent;">(15:05) Self-service setups empower Bosch teams.</span></p><p><span style="background-color: transparent;">(18:00) Physical hardware integration ensures real-world timing.</span></p><p><span style="background-color: transparent;">(20:30) Dynamic task mapping scales workflows efficiently.</span></p><p><span style="background-color: transparent;">(25:22) Open-source contributions improve stability.</span></p><p><span style="background-color: transparent;">(31:06) Edge and Celery executors power Bosch's hybrid scheduling.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/jens-scheffler/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jens Scheffler</a> -</p><p>https://www.linkedin.com/in/jens-scheffler/</p><p><br></p><p><a href="https://www.linkedin.com/in/christian-schilling-a5078831a/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Christian Schilling</a> -</p><p>https://www.linkedin.com/in/christian-schilling-a5078831a/</p><p><br></p><p><a href="https://www.linkedin.com/company/bosch/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Bosch</a> -</p><p>https://www.linkedin.com/company/bosch/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://kubernetes.io" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a> -</p><p>https://kubernetes.io</p><p><br></p><p><a href="https://github.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">GitHub</a> -</p><p>https://github.com</p><p><br></p><p><a href="https://airflow.apache.org/docs/apache-airflow/stable/core-concepts/executor/index.html" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Edge Executor</a> -</p><p>https://airflow.apache.org/docs/apache-airflow/stable/core-concepts/executor/index.html</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent;">Testing autonomous vehicles demands precision, scalability and powerful orchestration tools — enter Apache Airflow, a key component of Bosch’s cutting-edge testing framework. In this episode, we sit down with </span><a href="https://www.linkedin.com/in/jens-scheffler/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jens Scheffler</a><span style="background-color: transparent;">, Test Execution Cluster Technical Architect, and </span><a href="https://www.linkedin.com/in/christian-schilling-a5078831a/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Christian Schilling</a><span style="background-color: transparent;">, Product Owner Open Loop Testing Automated Driving, both at </span><a href="https://www.linkedin.com/company/bosch/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Bosch</a><span style="background-color: transparent;">, to explore how Bosch harnesses Airflow to streamline complex testing scenarios. They share insights on scaling workflows, integrating hybrid infrastructures and ensuring vehicle safety through rigorous automated testing.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(01:35) Airflow orchestrates millions of test hours for autonomous systems.</span></p><p><span style="background-color: transparent;">(03:15) Jens scales distributed systems with Kubernetes for job orchestration.</span></p><p><span style="background-color: transparent;">(06:02) Airflow runs hundreds of tests simultaneously.</span></p><p><span style="background-color: transparent;">(06:44) Virtual testing reduces costs and on-road trials.</span></p><p><span style="background-color: transparent;">(12:19) Unified APIs and GUIs streamline operations.</span></p><p><span style="background-color: transparent;">(15:05) Self-service setups empower Bosch teams.</span></p><p><span style="background-color: transparent;">(18:00) Physical hardware integration ensures real-world timing.</span></p><p><span style="background-color: transparent;">(20:30) Dynamic task mapping scales workflows efficiently.</span></p><p><span style="background-color: transparent;">(25:22) Open-source contributions improve stability.</span></p><p><span style="background-color: transparent;">(31:06) Edge and Celery executors power Bosch's hybrid scheduling.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/jens-scheffler/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jens Scheffler</a> -</p><p>https://www.linkedin.com/in/jens-scheffler/</p><p><br></p><p><a href="https://www.linkedin.com/in/christian-schilling-a5078831a/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Christian Schilling</a> -</p><p>https://www.linkedin.com/in/christian-schilling-a5078831a/</p><p><br></p><p><a href="https://www.linkedin.com/company/bosch/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Bosch</a> -</p><p>https://www.linkedin.com/company/bosch/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://kubernetes.io" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a> -</p><p>https://kubernetes.io</p><p><br></p><p><a href="https://github.com" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">GitHub</a> -</p><p>https://github.com</p><p><br></p><p><a href="https://airflow.apache.org/docs/apache-airflow/stable/core-concepts/executor/index.html" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Edge Executor</a> -</p><p>https://airflow.apache.org/docs/apache-airflow/stable/core-concepts/executor/index.html</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Testing autonomous vehicles demands precision, scalability and powerful orchestration tools — enter Apache Airflow, a key component of Bosch’s cutting-edge testing framework. In this episode, we sit down with Jens Scheffler, Test Execution Cluster ...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>27</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[b078bf1e-b83f-446a-b16d-926628ba0cdd]]></guid>
  <title><![CDATA[Overcoming Airflow Scaling Challenges at Monzo Bank with Jonathan Rainer]]></title>
  <description><![CDATA[<p><span style="background-color: transparent;">Scaling a data orchestration platform to manage thousands of tasks daily demands innovative solutions and strategic problem-solving. In this episode, we explore the complexities of scaling Airflow and the challenges of orchestrating thousands of tasks in dynamic data environments. </span><a href="https://www.linkedin.com/in/jonathan-rainer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jonathan Rainer</a><span style="background-color: transparent;">, Former Platform Engineer at </span><a href="https://www.linkedin.com/company/monzo-bank/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Monzo Bank</a><span style="background-color: transparent;">, joins us to share his journey optimizing data pipelines, overcoming UI limitations and ensuring DAG consistency in high-stakes scenarios.&nbsp;</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><span style="background-color: transparent;">(03:11) Using Airflow to schedule computation in BigQuery.</span></p><p><span style="background-color: transparent;">(07:02) How DAGs with 8,000+ tasks were managed nightly.</span></p><p><span style="background-color: transparent;">(08:18) Ensuring accuracy in regulatory reporting for banking.</span></p><p><span style="background-color: transparent;">(11:35) Handling task inconsistency and DAG failures with automation.</span></p><p><span style="background-color: transparent;">(16:09) Building a service to resolve DAG consistency issues in Airflow.</span></p><p><span style="background-color: transparent;">(25:05) Challenges with scaling the Airflow UI for thousands of tasks.</span></p><p><span style="background-color: transparent;">(27:03) The role of upstream and downstream task management in Airflow.</span></p><p><span style="background-color: transparent;">(37:33) The importance of operational metrics for monitoring Airflow health.</span></p><p><span style="background-color: transparent;">(39:19) Balancing new tools with root cause analysis to address scaling issues.</span></p><p><span style="background-color: transparent;">(41:35) Why scaling solutions require both technical and leadership buy-in</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/jonathan-rainer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jonathan Rainer</a> -</p><p>https://www.linkedin.com/in/jonathan-rainer/</p><p><br></p><p><a href="https://www.linkedin.com/company/monzo-bank/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Monzo Bank</a> -</p><p>https://www.linkedin.com/company/monzo-bank/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://airflow.apache.org/docs/apache-airflow-providers-google/stable/operators/cloud/bigquery.html" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">BigQuery</a> -</p><p>https://airflow.apache.org/docs/apache-airflow-providers-google/stable/operators/cloud/bigquery.html</p><p><br></p><p><a href="https://kubernetes.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a> -</p><p>https://kubernetes.io/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/9d43c88a-1c59-4530-814b-1ebca905c018/e48f863bd8.jpg" />
  <pubDate>Thu, 06 Feb 2025 22:09:00 -0500</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="41912192" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/9d43c88a-1c59-4530-814b-1ebca905c018/episode.mp3" />
  <itunes:title><![CDATA[Overcoming Airflow Scaling Challenges at Monzo Bank with Jonathan Rainer]]></itunes:title>
  <itunes:duration>43:39</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent;">Scaling a data orchestration platform to manage thousands of tasks daily demands innovative solutions and strategic problem-solving. In this episode, we explore the complexities of scaling Airflow and the challenges of orchestrating thousands of tasks in dynamic data environments. </span><a href="https://www.linkedin.com/in/jonathan-rainer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jonathan Rainer</a><span style="background-color: transparent;">, Former Platform Engineer at </span><a href="https://www.linkedin.com/company/monzo-bank/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Monzo Bank</a><span style="background-color: transparent;">, joins us to share his journey optimizing data pipelines, overcoming UI limitations and ensuring DAG consistency in high-stakes scenarios.&nbsp;</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><span style="background-color: transparent;">(03:11) Using Airflow to schedule computation in BigQuery.</span></p><p><span style="background-color: transparent;">(07:02) How DAGs with 8,000+ tasks were managed nightly.</span></p><p><span style="background-color: transparent;">(08:18) Ensuring accuracy in regulatory reporting for banking.</span></p><p><span style="background-color: transparent;">(11:35) Handling task inconsistency and DAG failures with automation.</span></p><p><span style="background-color: transparent;">(16:09) Building a service to resolve DAG consistency issues in Airflow.</span></p><p><span style="background-color: transparent;">(25:05) Challenges with scaling the Airflow UI for thousands of tasks.</span></p><p><span style="background-color: transparent;">(27:03) The role of upstream and downstream task management in Airflow.</span></p><p><span style="background-color: transparent;">(37:33) The importance of operational metrics for monitoring Airflow health.</span></p><p><span style="background-color: transparent;">(39:19) Balancing new tools with root cause analysis to address scaling issues.</span></p><p><span style="background-color: transparent;">(41:35) Why scaling solutions require both technical and leadership buy-in</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/jonathan-rainer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jonathan Rainer</a> -</p><p>https://www.linkedin.com/in/jonathan-rainer/</p><p><br></p><p><a href="https://www.linkedin.com/company/monzo-bank/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Monzo Bank</a> -</p><p>https://www.linkedin.com/company/monzo-bank/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://airflow.apache.org/docs/apache-airflow-providers-google/stable/operators/cloud/bigquery.html" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">BigQuery</a> -</p><p>https://airflow.apache.org/docs/apache-airflow-providers-google/stable/operators/cloud/bigquery.html</p><p><br></p><p><a href="https://kubernetes.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a> -</p><p>https://kubernetes.io/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent;">Scaling a data orchestration platform to manage thousands of tasks daily demands innovative solutions and strategic problem-solving. In this episode, we explore the complexities of scaling Airflow and the challenges of orchestrating thousands of tasks in dynamic data environments. </span><a href="https://www.linkedin.com/in/jonathan-rainer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jonathan Rainer</a><span style="background-color: transparent;">, Former Platform Engineer at </span><a href="https://www.linkedin.com/company/monzo-bank/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Monzo Bank</a><span style="background-color: transparent;">, joins us to share his journey optimizing data pipelines, overcoming UI limitations and ensuring DAG consistency in high-stakes scenarios.&nbsp;</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><span style="background-color: transparent;">(03:11) Using Airflow to schedule computation in BigQuery.</span></p><p><span style="background-color: transparent;">(07:02) How DAGs with 8,000+ tasks were managed nightly.</span></p><p><span style="background-color: transparent;">(08:18) Ensuring accuracy in regulatory reporting for banking.</span></p><p><span style="background-color: transparent;">(11:35) Handling task inconsistency and DAG failures with automation.</span></p><p><span style="background-color: transparent;">(16:09) Building a service to resolve DAG consistency issues in Airflow.</span></p><p><span style="background-color: transparent;">(25:05) Challenges with scaling the Airflow UI for thousands of tasks.</span></p><p><span style="background-color: transparent;">(27:03) The role of upstream and downstream task management in Airflow.</span></p><p><span style="background-color: transparent;">(37:33) The importance of operational metrics for monitoring Airflow health.</span></p><p><span style="background-color: transparent;">(39:19) Balancing new tools with root cause analysis to address scaling issues.</span></p><p><span style="background-color: transparent;">(41:35) Why scaling solutions require both technical and leadership buy-in</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/jonathan-rainer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Jonathan Rainer</a> -</p><p>https://www.linkedin.com/in/jonathan-rainer/</p><p><br></p><p><a href="https://www.linkedin.com/company/monzo-bank/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Monzo Bank</a> -</p><p>https://www.linkedin.com/company/monzo-bank/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://airflow.apache.org/docs/apache-airflow-providers-google/stable/operators/cloud/bigquery.html" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">BigQuery</a> -</p><p>https://airflow.apache.org/docs/apache-airflow-providers-google/stable/operators/cloud/bigquery.html</p><p><br></p><p><a href="https://kubernetes.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a> -</p><p>https://kubernetes.io/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Scaling a data orchestration platform to manage thousands of tasks daily demands innovative solutions and strategic problem-solving. In this episode, we explore the complexities of scaling Airflow and the challenges of orchestrating thousands of ta...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>26</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[7f2c816f-d12e-4539-a733-0133824a2844]]></guid>
  <title><![CDATA[Orchestrating Analytics and AI Workflows at Telia with Arjun Anandkumar]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">T</span><span style="background-color: transparent;">he future of data engineering lies in seamless orchestration and automation. In this episode, </span><a href="https://www.linkedin.com/in/arjunanand1/?originalSubdomain=dk" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Arjun Anandkumar</a><span style="background-color: transparent;">, Data Engineer at </span><a href="https://www.linkedin.com/company/teliacompany/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Telia</a><span style="background-color: transparent;">, shares how his team uses Airflow to drive analytics and AI workflows. He highlights the challenges of scaling data platforms and how adopting best practices can simplify complex processes for teams across the organization. Arjun also discusses the transformative role of tools like Cosmos and Terraform in enhancing efficiency and collaboration.&nbsp;</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(02:16) Telia operates across the Nordics and Baltics, focusing on telecom and energy services.</span></p><p><span style="background-color: transparent;">(03:45) Airflow runs dbt models seamlessly with Cosmos on AWS MWAA.</span></p><p><span style="background-color: transparent;">(05:47) Cosmos improves visibility and orchestration in Airflow.</span></p><p><span style="background-color: transparent;">(07:00) Medallion Architecture organizes data into bronze, silver and gold layers.</span></p><p><span style="background-color: transparent;">(08:34) Task group challenges highlight the need for adaptable workflows.</span></p><p><span style="background-color: transparent;">(15:04) Scaling managed services requires trial, error and tailored tweaks.</span></p><p><span style="background-color: transparent;">(19:46) Terraform scales infrastructure, while YAML templates manage DAGs efficiently.</span></p><p><span style="background-color: transparent;">(20:00) Templated DAGs and robust testing enhance platform management.</span></p><p><span style="background-color: transparent;">(24:15) Open-source resources drive innovation in Airflow practices.</span></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/arjunanand1/?originalSubdomain=dk" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Arjun Anandkumar</a> -</p><p>https://www.linkedin.com/in/arjunanand1/?originalSubdomain=dk</p><p><br></p><p><a href="https://www.linkedin.com/company/teliacompany/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Telia</a> -</p><p>https://www.linkedin.com/company/teliacompany/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.astronomer.io/cosmos/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Cosmos by Astronomer</a> -</p><p>https://www.astronomer.io/cosmos/</p><p><br></p><p><a href="https://www.terraform.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Terraform</a> -</p><p>https://www.terraform.io/</p><p><br></p><p><a href="https://www.databricks.com/glossary/medallion-architecture" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Medallion Architecture by Databricks</a> -</p><p>https://www.databricks.com/glossary/medallion-architecture</p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);"><span class="ql-cursor">﻿</span>Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/b313286e-c941-4a26-a5cf-e7c4f0205229/85a0844e7f.jpg" />
  <pubDate>Thu, 30 Jan 2025 00:00:00 -0500</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="24962693" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/b313286e-c941-4a26-a5cf-e7c4f0205229/episode.mp3" />
  <itunes:title><![CDATA[Orchestrating Analytics and AI Workflows at Telia with Arjun Anandkumar]]></itunes:title>
  <itunes:duration>26:00</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">T</span><span style="background-color: transparent;">he future of data engineering lies in seamless orchestration and automation. In this episode, </span><a href="https://www.linkedin.com/in/arjunanand1/?originalSubdomain=dk" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Arjun Anandkumar</a><span style="background-color: transparent;">, Data Engineer at </span><a href="https://www.linkedin.com/company/teliacompany/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Telia</a><span style="background-color: transparent;">, shares how his team uses Airflow to drive analytics and AI workflows. He highlights the challenges of scaling data platforms and how adopting best practices can simplify complex processes for teams across the organization. Arjun also discusses the transformative role of tools like Cosmos and Terraform in enhancing efficiency and collaboration.&nbsp;</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(02:16) Telia operates across the Nordics and Baltics, focusing on telecom and energy services.</span></p><p><span style="background-color: transparent;">(03:45) Airflow runs dbt models seamlessly with Cosmos on AWS MWAA.</span></p><p><span style="background-color: transparent;">(05:47) Cosmos improves visibility and orchestration in Airflow.</span></p><p><span style="background-color: transparent;">(07:00) Medallion Architecture organizes data into bronze, silver and gold layers.</span></p><p><span style="background-color: transparent;">(08:34) Task group challenges highlight the need for adaptable workflows.</span></p><p><span style="background-color: transparent;">(15:04) Scaling managed services requires trial, error and tailored tweaks.</span></p><p><span style="background-color: transparent;">(19:46) Terraform scales infrastructure, while YAML templates manage DAGs efficiently.</span></p><p><span style="background-color: transparent;">(20:00) Templated DAGs and robust testing enhance platform management.</span></p><p><span style="background-color: transparent;">(24:15) Open-source resources drive innovation in Airflow practices.</span></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/arjunanand1/?originalSubdomain=dk" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Arjun Anandkumar</a> -</p><p>https://www.linkedin.com/in/arjunanand1/?originalSubdomain=dk</p><p><br></p><p><a href="https://www.linkedin.com/company/teliacompany/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Telia</a> -</p><p>https://www.linkedin.com/company/teliacompany/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.astronomer.io/cosmos/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Cosmos by Astronomer</a> -</p><p>https://www.astronomer.io/cosmos/</p><p><br></p><p><a href="https://www.terraform.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Terraform</a> -</p><p>https://www.terraform.io/</p><p><br></p><p><a href="https://www.databricks.com/glossary/medallion-architecture" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Medallion Architecture by Databricks</a> -</p><p>https://www.databricks.com/glossary/medallion-architecture</p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);"><span class="ql-cursor">﻿</span>Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">T</span><span style="background-color: transparent;">he future of data engineering lies in seamless orchestration and automation. In this episode, </span><a href="https://www.linkedin.com/in/arjunanand1/?originalSubdomain=dk" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Arjun Anandkumar</a><span style="background-color: transparent;">, Data Engineer at </span><a href="https://www.linkedin.com/company/teliacompany/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Telia</a><span style="background-color: transparent;">, shares how his team uses Airflow to drive analytics and AI workflows. He highlights the challenges of scaling data platforms and how adopting best practices can simplify complex processes for teams across the organization. Arjun also discusses the transformative role of tools like Cosmos and Terraform in enhancing efficiency and collaboration.&nbsp;</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(02:16) Telia operates across the Nordics and Baltics, focusing on telecom and energy services.</span></p><p><span style="background-color: transparent;">(03:45) Airflow runs dbt models seamlessly with Cosmos on AWS MWAA.</span></p><p><span style="background-color: transparent;">(05:47) Cosmos improves visibility and orchestration in Airflow.</span></p><p><span style="background-color: transparent;">(07:00) Medallion Architecture organizes data into bronze, silver and gold layers.</span></p><p><span style="background-color: transparent;">(08:34) Task group challenges highlight the need for adaptable workflows.</span></p><p><span style="background-color: transparent;">(15:04) Scaling managed services requires trial, error and tailored tweaks.</span></p><p><span style="background-color: transparent;">(19:46) Terraform scales infrastructure, while YAML templates manage DAGs efficiently.</span></p><p><span style="background-color: transparent;">(20:00) Templated DAGs and robust testing enhance platform management.</span></p><p><span style="background-color: transparent;">(24:15) Open-source resources drive innovation in Airflow practices.</span></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/arjunanand1/?originalSubdomain=dk" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Arjun Anandkumar</a> -</p><p>https://www.linkedin.com/in/arjunanand1/?originalSubdomain=dk</p><p><br></p><p><a href="https://www.linkedin.com/company/teliacompany/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Telia</a> -</p><p>https://www.linkedin.com/company/teliacompany/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.astronomer.io/cosmos/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Cosmos by Astronomer</a> -</p><p>https://www.astronomer.io/cosmos/</p><p><br></p><p><a href="https://www.terraform.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Terraform</a> -</p><p>https://www.terraform.io/</p><p><br></p><p><a href="https://www.databricks.com/glossary/medallion-architecture" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Medallion Architecture by Databricks</a> -</p><p>https://www.databricks.com/glossary/medallion-architecture</p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);"><span class="ql-cursor">﻿</span>Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[The future of data engineering lies in seamless orchestration and automation. In this episode, Arjun Anandkumar, Data Engineer at Telia, shares how his team uses Airflow to drive analytics and AI workflows. He highlights the challenges of scaling d...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>25</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[2d7d6ae7-5732-4d28-9923-65ec37898aa6]]></guid>
  <title><![CDATA[The Role of Airflow in Finance Transformation at Etraveli Group with Mihir Samant]]></title>
  <description><![CDATA[<p><span style="background-color: transparent;">Transforming bottlenecked finance processes into streamlined, automated systems requires the right tools and a forward-thinking approach. In this episode, </span><a href="https://www.linkedin.com/in/misamant/?originalSubdomain=ca" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Mihir Samant</a><span style="background-color: transparent;">, Senior Data Analyst at </span><a href="https://www.linkedin.com/company/etraveli-group/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Etraveli Group</a><span style="background-color: transparent;">, joins us to share how his team leverages Airflow to revolutionize finance automation. With extensive experience in data workflows and a passion for open-source tools, Mihir provides valuable insights into building efficient, scalable systems. We explore the transformative power of Airflow in automating workflows and enhancing data orchestration within the finance domain.&nbsp;</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(02:14) Etraveli Group specializes in selling affordable flight tickets and ancillary services.</span></p><p><span style="background-color: transparent;">(03:56) Mihir’s finance automation team uses Airflow to tackle month-end bottlenecks.</span></p><p><span style="background-color: transparent;">(06:00) Airflow's flexibility enables end-to-end automation for finance workflows.</span></p><p><span style="background-color: transparent;">(07:00) Open-source Airflow tools offer cost-effective solutions for new teams.</span></p><p><span style="background-color: transparent;">(08:46) Sensors and dynamic DAGs are pivotal features for optimizing tasks.</span></p><p><span style="background-color: transparent;">(13:30) GitSync simplifies development by syncing environments seamlessly.</span></p><p><span style="background-color: transparent;">(16:27) Plans include integrating Databricks for more advanced data handling.</span></p><p><span style="background-color: transparent;">(17:58) Airflow and Databricks offer multiple flexible methods to trigger workflows and execute SQL queries seamlessly.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/misamant/?originalSubdomain=ca" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Mihir Samant</a> -</p><p>https://www.linkedin.com/in/misamant/?originalSubdomain=ca</p><p><br></p><p><a href="https://www.linkedin.com/company/etraveli-group/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Etraveli Group</a> -</p><p>https://www.linkedin.com/company/etraveli-group/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.docker.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Docker</a> -</p><p>https://www.docker.com/</p><p><br></p><p><a href="https://www.databricks.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Databricks</a> -</p><p>https://www.databricks.com/</p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/6624eb44-4afc-41ec-ab45-43ecf06b46fc/a4462411ed.jpg" />
  <pubDate>Thu, 23 Jan 2025 05:47:09 -0500</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="20464616" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/6624eb44-4afc-41ec-ab45-43ecf06b46fc/episode.mp3" />
  <itunes:title><![CDATA[The Role of Airflow in Finance Transformation at Etraveli Group with Mihir Samant]]></itunes:title>
  <itunes:duration>21:19</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent;">Transforming bottlenecked finance processes into streamlined, automated systems requires the right tools and a forward-thinking approach. In this episode, </span><a href="https://www.linkedin.com/in/misamant/?originalSubdomain=ca" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Mihir Samant</a><span style="background-color: transparent;">, Senior Data Analyst at </span><a href="https://www.linkedin.com/company/etraveli-group/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Etraveli Group</a><span style="background-color: transparent;">, joins us to share how his team leverages Airflow to revolutionize finance automation. With extensive experience in data workflows and a passion for open-source tools, Mihir provides valuable insights into building efficient, scalable systems. We explore the transformative power of Airflow in automating workflows and enhancing data orchestration within the finance domain.&nbsp;</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(02:14) Etraveli Group specializes in selling affordable flight tickets and ancillary services.</span></p><p><span style="background-color: transparent;">(03:56) Mihir’s finance automation team uses Airflow to tackle month-end bottlenecks.</span></p><p><span style="background-color: transparent;">(06:00) Airflow's flexibility enables end-to-end automation for finance workflows.</span></p><p><span style="background-color: transparent;">(07:00) Open-source Airflow tools offer cost-effective solutions for new teams.</span></p><p><span style="background-color: transparent;">(08:46) Sensors and dynamic DAGs are pivotal features for optimizing tasks.</span></p><p><span style="background-color: transparent;">(13:30) GitSync simplifies development by syncing environments seamlessly.</span></p><p><span style="background-color: transparent;">(16:27) Plans include integrating Databricks for more advanced data handling.</span></p><p><span style="background-color: transparent;">(17:58) Airflow and Databricks offer multiple flexible methods to trigger workflows and execute SQL queries seamlessly.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/misamant/?originalSubdomain=ca" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Mihir Samant</a> -</p><p>https://www.linkedin.com/in/misamant/?originalSubdomain=ca</p><p><br></p><p><a href="https://www.linkedin.com/company/etraveli-group/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Etraveli Group</a> -</p><p>https://www.linkedin.com/company/etraveli-group/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.docker.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Docker</a> -</p><p>https://www.docker.com/</p><p><br></p><p><a href="https://www.databricks.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Databricks</a> -</p><p>https://www.databricks.com/</p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent;">Transforming bottlenecked finance processes into streamlined, automated systems requires the right tools and a forward-thinking approach. In this episode, </span><a href="https://www.linkedin.com/in/misamant/?originalSubdomain=ca" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Mihir Samant</a><span style="background-color: transparent;">, Senior Data Analyst at </span><a href="https://www.linkedin.com/company/etraveli-group/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Etraveli Group</a><span style="background-color: transparent;">, joins us to share how his team leverages Airflow to revolutionize finance automation. With extensive experience in data workflows and a passion for open-source tools, Mihir provides valuable insights into building efficient, scalable systems. We explore the transformative power of Airflow in automating workflows and enhancing data orchestration within the finance domain.&nbsp;</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(02:14) Etraveli Group specializes in selling affordable flight tickets and ancillary services.</span></p><p><span style="background-color: transparent;">(03:56) Mihir’s finance automation team uses Airflow to tackle month-end bottlenecks.</span></p><p><span style="background-color: transparent;">(06:00) Airflow's flexibility enables end-to-end automation for finance workflows.</span></p><p><span style="background-color: transparent;">(07:00) Open-source Airflow tools offer cost-effective solutions for new teams.</span></p><p><span style="background-color: transparent;">(08:46) Sensors and dynamic DAGs are pivotal features for optimizing tasks.</span></p><p><span style="background-color: transparent;">(13:30) GitSync simplifies development by syncing environments seamlessly.</span></p><p><span style="background-color: transparent;">(16:27) Plans include integrating Databricks for more advanced data handling.</span></p><p><span style="background-color: transparent;">(17:58) Airflow and Databricks offer multiple flexible methods to trigger workflows and execute SQL queries seamlessly.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/misamant/?originalSubdomain=ca" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Mihir Samant</a> -</p><p>https://www.linkedin.com/in/misamant/?originalSubdomain=ca</p><p><br></p><p><a href="https://www.linkedin.com/company/etraveli-group/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Etraveli Group</a> -</p><p>https://www.linkedin.com/company/etraveli-group/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.docker.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Docker</a> -</p><p>https://www.docker.com/</p><p><br></p><p><a href="https://www.databricks.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Databricks</a> -</p><p>https://www.databricks.com/</p><p><br></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Transforming bottlenecked finance processes into streamlined, automated systems requires the right tools and a forward-thinking approach. In this episode, Mihir Samant, Senior Data Analyst at Etraveli Group, joins us to share how his team leverages...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>24</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[97312390-383c-4762-8f01-65efd22a9fe3]]></guid>
  <title><![CDATA[Inside Ford’s Data Transformation: Advanced Orchestration Strategies with Vasantha Kosuri-Marshall]]></title>
  <description><![CDATA[<p><span style="background-color: transparent;">Data engineering is entering a new era, where orchestration and automation are redefining how large-scale projects operate. This episode features </span><a href="https://www.linkedin.com/in/vasantha-kosuri-marshall-0b0aab188/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Vasantha Kosuri-Marshall</a><span style="background-color: transparent;">, Data and ML Ops Engineer at </span><a href="https://www.linkedin.com/company/ford-motor-company/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ford Motor Company</a><span style="background-color: transparent;">. Vasantha shares her expertise in managing complex data pipelines. She takes us through Ford's transition to cloud platforms, the adoption of Airflow and the intricate challenges of orchestrating data in a diverse environment.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(03:10) Vasantha’s transition to the Advanced Driving Assist Systems team at Ford.</span></p><p><span style="background-color: transparent;">(05:42) Early adoption of Airflow to orchestrate complex data pipelines.</span></p><p><span style="background-color: transparent;">(09:29) Ford's move from on-premise data solutions to Google Cloud Platform.</span></p><p><span style="background-color: transparent;">(12:03) The importance of Airflow's scheduling capabilities for efficient data management.</span></p><p><span style="background-color: transparent;">(16:12) Using Kubernetes to scale Airflow for large-scale data processing.</span></p><p><span style="background-color: transparent;">(19:59) Vasantha’s experience in overcoming challenges with legacy orchestration tools.</span></p><p><span style="background-color: transparent;">(22:22) Integration of data engineering and data science pipelines at Ford.</span></p><p><span style="background-color: transparent;">(28:03) How deferrable operators in Airflow improve performance and save costs.</span></p><p><span style="background-color: transparent;">(32:12) Vasantha’s insights into tuning Airflow properties for thousands of DAGs.</span></p><p><span style="background-color: transparent;">(36:09) The significance of monitoring and observability in managing Airflow instances.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/vasantha-kosuri-marshall-0b0aab188/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Vasantha Kosuri-Marshall</a> -</p><p>https://www.linkedin.com/in/vasantha-kosuri-marshall-0b0aab188/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://cloud.google.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Google Cloud Platform (GCP)</a> - </p><p>https://cloud.google.com/</p><p><br></p><p><a href="https://www.linkedin.com/company/ford-motor-company/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ford Motor Company</a> | LinkedIn -</p><p>https://www.linkedin.com/company/ford-motor-company/</p><p><br></p><p><a href="https://www.ford.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ford Motor Company</a> | Website -</p><p>https://www.ford.com/</p><p><br></p><p><a href="https://www.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> -</p><p>https://www.astronomer.io/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/6a53bdec-b579-49fc-88c3-8e64b7d105d4/589d7269e3.jpg" />
  <pubDate>Thu, 16 Jan 2025 00:15:00 -0500</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="37359780" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/6a53bdec-b579-49fc-88c3-8e64b7d105d4/episode.mp3" />
  <itunes:title><![CDATA[Inside Ford’s Data Transformation: Advanced Orchestration Strategies with Vasantha Kosuri-Marshall]]></itunes:title>
  <itunes:duration>38:54</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent;">Data engineering is entering a new era, where orchestration and automation are redefining how large-scale projects operate. This episode features </span><a href="https://www.linkedin.com/in/vasantha-kosuri-marshall-0b0aab188/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Vasantha Kosuri-Marshall</a><span style="background-color: transparent;">, Data and ML Ops Engineer at </span><a href="https://www.linkedin.com/company/ford-motor-company/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ford Motor Company</a><span style="background-color: transparent;">. Vasantha shares her expertise in managing complex data pipelines. She takes us through Ford's transition to cloud platforms, the adoption of Airflow and the intricate challenges of orchestrating data in a diverse environment.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(03:10) Vasantha’s transition to the Advanced Driving Assist Systems team at Ford.</span></p><p><span style="background-color: transparent;">(05:42) Early adoption of Airflow to orchestrate complex data pipelines.</span></p><p><span style="background-color: transparent;">(09:29) Ford's move from on-premise data solutions to Google Cloud Platform.</span></p><p><span style="background-color: transparent;">(12:03) The importance of Airflow's scheduling capabilities for efficient data management.</span></p><p><span style="background-color: transparent;">(16:12) Using Kubernetes to scale Airflow for large-scale data processing.</span></p><p><span style="background-color: transparent;">(19:59) Vasantha’s experience in overcoming challenges with legacy orchestration tools.</span></p><p><span style="background-color: transparent;">(22:22) Integration of data engineering and data science pipelines at Ford.</span></p><p><span style="background-color: transparent;">(28:03) How deferrable operators in Airflow improve performance and save costs.</span></p><p><span style="background-color: transparent;">(32:12) Vasantha’s insights into tuning Airflow properties for thousands of DAGs.</span></p><p><span style="background-color: transparent;">(36:09) The significance of monitoring and observability in managing Airflow instances.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/vasantha-kosuri-marshall-0b0aab188/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Vasantha Kosuri-Marshall</a> -</p><p>https://www.linkedin.com/in/vasantha-kosuri-marshall-0b0aab188/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://cloud.google.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Google Cloud Platform (GCP)</a> - </p><p>https://cloud.google.com/</p><p><br></p><p><a href="https://www.linkedin.com/company/ford-motor-company/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ford Motor Company</a> | LinkedIn -</p><p>https://www.linkedin.com/company/ford-motor-company/</p><p><br></p><p><a href="https://www.ford.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ford Motor Company</a> | Website -</p><p>https://www.ford.com/</p><p><br></p><p><a href="https://www.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> -</p><p>https://www.astronomer.io/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent;">Data engineering is entering a new era, where orchestration and automation are redefining how large-scale projects operate. This episode features </span><a href="https://www.linkedin.com/in/vasantha-kosuri-marshall-0b0aab188/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Vasantha Kosuri-Marshall</a><span style="background-color: transparent;">, Data and ML Ops Engineer at </span><a href="https://www.linkedin.com/company/ford-motor-company/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ford Motor Company</a><span style="background-color: transparent;">. Vasantha shares her expertise in managing complex data pipelines. She takes us through Ford's transition to cloud platforms, the adoption of Airflow and the intricate challenges of orchestrating data in a diverse environment.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(03:10) Vasantha’s transition to the Advanced Driving Assist Systems team at Ford.</span></p><p><span style="background-color: transparent;">(05:42) Early adoption of Airflow to orchestrate complex data pipelines.</span></p><p><span style="background-color: transparent;">(09:29) Ford's move from on-premise data solutions to Google Cloud Platform.</span></p><p><span style="background-color: transparent;">(12:03) The importance of Airflow's scheduling capabilities for efficient data management.</span></p><p><span style="background-color: transparent;">(16:12) Using Kubernetes to scale Airflow for large-scale data processing.</span></p><p><span style="background-color: transparent;">(19:59) Vasantha’s experience in overcoming challenges with legacy orchestration tools.</span></p><p><span style="background-color: transparent;">(22:22) Integration of data engineering and data science pipelines at Ford.</span></p><p><span style="background-color: transparent;">(28:03) How deferrable operators in Airflow improve performance and save costs.</span></p><p><span style="background-color: transparent;">(32:12) Vasantha’s insights into tuning Airflow properties for thousands of DAGs.</span></p><p><span style="background-color: transparent;">(36:09) The significance of monitoring and observability in managing Airflow instances.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/vasantha-kosuri-marshall-0b0aab188/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Vasantha Kosuri-Marshall</a> -</p><p>https://www.linkedin.com/in/vasantha-kosuri-marshall-0b0aab188/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://cloud.google.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Google Cloud Platform (GCP)</a> - </p><p>https://cloud.google.com/</p><p><br></p><p><a href="https://www.linkedin.com/company/ford-motor-company/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ford Motor Company</a> | LinkedIn -</p><p>https://www.linkedin.com/company/ford-motor-company/</p><p><br></p><p><a href="https://www.ford.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ford Motor Company</a> | Website -</p><p>https://www.ford.com/</p><p><br></p><p><a href="https://www.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> -</p><p>https://www.astronomer.io/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent;">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Data engineering is entering a new era, where orchestration and automation are redefining how large-scale projects operate. This episode features Vasantha Kosuri-Marshall, Data and ML Ops Engineer at Ford Motor Company. Vasantha shares her expertis...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>23</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[5921f932-6c74-4dcb-9722-e0dc62c08ec6]]></guid>
  <title><![CDATA[Powering Finance With Advanced Data Solutions at Ramp with Ryan Delgado]]></title>
  <description><![CDATA[<p><span style="background-color: transparent;">Data is the backbone of every modern business, but unlocking its full potential requires the right tools and strategies. In this episode, </span><a href="https://www.linkedin.com/in/ryan-delgado-69544568/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ryan Delgado</a><span style="background-color: transparent;">, Director of Engineering at </span><a href="https://www.linkedin.com/company/ramp/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ramp</a><span style="background-color: transparent;">, joins us to explore how innovative data platforms can transform business operations and fuel growth. He shares insights on integrating Apache Airflow, optimizing data workflows and leveraging analytics to enhance customer experiences.</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><strong style="background-color: transparent;"><span class="ql-cursor">﻿</span></strong></p><p><span style="background-color: transparent;">(01:52) Data is the lifeblood of Ramp, touching every vertical in the company.</span></p><p><span style="background-color: transparent;">(03:18) Ramp’s data platform team enables high-velocity scaling through tailored tools.</span></p><p><span style="background-color: transparent;">(05:27) Airflow powers Ramp’s enterprise data warehouse integrations for advanced analytics.</span></p><p><span style="background-color: transparent;">(07:55) Centralizing data in Snowflake simplifies storage and analytics pipelines.</span></p><p><span style="background-color: transparent;">(12:08) Machine learning models at Ramp integrate seamlessly with Airflow for operational excellence.</span></p><p><span style="background-color: transparent;">(14:11) Leveraging Airflow datasets eliminates inefficiencies in DAG dependencies.</span></p><p><span style="background-color: transparent;">(17:22) Platforms evolve from solving narrow business problems to scaling organizationally.</span></p><p><span style="background-color: transparent;">(18:55) ClickHouse enhances Ramp’s OLAP capabilities with 100x performance improvements.</span></p><p><span style="background-color: transparent;">(19:47) Ramp’s OLAP platform improves performance by reducing joins and leveraging ClickHouse.</span></p><p><span style="background-color: transparent;">(21:46) Ryan envisions a lighter-weight, more Python-native future for Airflow.</span></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/ryan-delgado-69544568/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ryan Delgado</a> - </p><p>https://www.linkedin.com/in/ryan-delgado-69544568/</p><p><br></p><p><a href="https://www.linkedin.com/company/ramp/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ramp</a> - </p><p>https://www.linkedin.com/company/ramp/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> - </p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.snowflake.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Snowflake</a> - </p><p>https://www.snowflake.com/</p><p><br></p><p><a href="https://clickhouse.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">ClickHouse</a> -</p><p>https://clickhouse.com/</p><p><br></p><p><a href="https://www.getdbt.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbt</a> -</p><p>https://www.getdbt.com/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/64125731-37fd-49dd-9743-d05fcd4f88e6/023d18d233.jpg" />
  <pubDate>Fri, 10 Jan 2025 11:15:02 -0500</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="23605161" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/64125731-37fd-49dd-9743-d05fcd4f88e6/episode.mp3" />
  <itunes:title><![CDATA[Powering Finance With Advanced Data Solutions at Ramp with Ryan Delgado]]></itunes:title>
  <itunes:duration>24:35</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent;">Data is the backbone of every modern business, but unlocking its full potential requires the right tools and strategies. In this episode, </span><a href="https://www.linkedin.com/in/ryan-delgado-69544568/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ryan Delgado</a><span style="background-color: transparent;">, Director of Engineering at </span><a href="https://www.linkedin.com/company/ramp/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ramp</a><span style="background-color: transparent;">, joins us to explore how innovative data platforms can transform business operations and fuel growth. He shares insights on integrating Apache Airflow, optimizing data workflows and leveraging analytics to enhance customer experiences.</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><strong style="background-color: transparent;"><span class="ql-cursor">﻿</span></strong></p><p><span style="background-color: transparent;">(01:52) Data is the lifeblood of Ramp, touching every vertical in the company.</span></p><p><span style="background-color: transparent;">(03:18) Ramp’s data platform team enables high-velocity scaling through tailored tools.</span></p><p><span style="background-color: transparent;">(05:27) Airflow powers Ramp’s enterprise data warehouse integrations for advanced analytics.</span></p><p><span style="background-color: transparent;">(07:55) Centralizing data in Snowflake simplifies storage and analytics pipelines.</span></p><p><span style="background-color: transparent;">(12:08) Machine learning models at Ramp integrate seamlessly with Airflow for operational excellence.</span></p><p><span style="background-color: transparent;">(14:11) Leveraging Airflow datasets eliminates inefficiencies in DAG dependencies.</span></p><p><span style="background-color: transparent;">(17:22) Platforms evolve from solving narrow business problems to scaling organizationally.</span></p><p><span style="background-color: transparent;">(18:55) ClickHouse enhances Ramp’s OLAP capabilities with 100x performance improvements.</span></p><p><span style="background-color: transparent;">(19:47) Ramp’s OLAP platform improves performance by reducing joins and leveraging ClickHouse.</span></p><p><span style="background-color: transparent;">(21:46) Ryan envisions a lighter-weight, more Python-native future for Airflow.</span></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/ryan-delgado-69544568/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ryan Delgado</a> - </p><p>https://www.linkedin.com/in/ryan-delgado-69544568/</p><p><br></p><p><a href="https://www.linkedin.com/company/ramp/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ramp</a> - </p><p>https://www.linkedin.com/company/ramp/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> - </p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.snowflake.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Snowflake</a> - </p><p>https://www.snowflake.com/</p><p><br></p><p><a href="https://clickhouse.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">ClickHouse</a> -</p><p>https://clickhouse.com/</p><p><br></p><p><a href="https://www.getdbt.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbt</a> -</p><p>https://www.getdbt.com/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent;">Data is the backbone of every modern business, but unlocking its full potential requires the right tools and strategies. In this episode, </span><a href="https://www.linkedin.com/in/ryan-delgado-69544568/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ryan Delgado</a><span style="background-color: transparent;">, Director of Engineering at </span><a href="https://www.linkedin.com/company/ramp/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ramp</a><span style="background-color: transparent;">, joins us to explore how innovative data platforms can transform business operations and fuel growth. He shares insights on integrating Apache Airflow, optimizing data workflows and leveraging analytics to enhance customer experiences.</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><strong style="background-color: transparent;"><span class="ql-cursor">﻿</span></strong></p><p><span style="background-color: transparent;">(01:52) Data is the lifeblood of Ramp, touching every vertical in the company.</span></p><p><span style="background-color: transparent;">(03:18) Ramp’s data platform team enables high-velocity scaling through tailored tools.</span></p><p><span style="background-color: transparent;">(05:27) Airflow powers Ramp’s enterprise data warehouse integrations for advanced analytics.</span></p><p><span style="background-color: transparent;">(07:55) Centralizing data in Snowflake simplifies storage and analytics pipelines.</span></p><p><span style="background-color: transparent;">(12:08) Machine learning models at Ramp integrate seamlessly with Airflow for operational excellence.</span></p><p><span style="background-color: transparent;">(14:11) Leveraging Airflow datasets eliminates inefficiencies in DAG dependencies.</span></p><p><span style="background-color: transparent;">(17:22) Platforms evolve from solving narrow business problems to scaling organizationally.</span></p><p><span style="background-color: transparent;">(18:55) ClickHouse enhances Ramp’s OLAP capabilities with 100x performance improvements.</span></p><p><span style="background-color: transparent;">(19:47) Ramp’s OLAP platform improves performance by reducing joins and leveraging ClickHouse.</span></p><p><span style="background-color: transparent;">(21:46) Ryan envisions a lighter-weight, more Python-native future for Airflow.</span></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/ryan-delgado-69544568/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ryan Delgado</a> - </p><p>https://www.linkedin.com/in/ryan-delgado-69544568/</p><p><br></p><p><a href="https://www.linkedin.com/company/ramp/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Ramp</a> - </p><p>https://www.linkedin.com/company/ramp/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> - </p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.snowflake.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Snowflake</a> - </p><p>https://www.snowflake.com/</p><p><br></p><p><a href="https://clickhouse.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">ClickHouse</a> -</p><p>https://clickhouse.com/</p><p><br></p><p><a href="https://www.getdbt.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbt</a> -</p><p>https://www.getdbt.com/</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Data is the backbone of every modern business, but unlocking its full potential requires the right tools and strategies. In this episode, Ryan Delgado, Director of Engineering at Ramp, joins us to explore how innovative data platforms can transform...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>22</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[cd004074-02f2-4816-b187-596a495ed8c1]]></guid>
  <title><![CDATA[Exploring the Power of Airflow 3 at Astronomer with Amogh Desai]]></title>
  <description><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">What does it take to go from fixing a broken link to becoming a committer for one of the world’s leading open-source projects?&nbsp;</span></p><p><br></p><p><a href="https://www.linkedin.com/in/amogh-desai-385141157/?originalSubdomain=in%20%20https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Amogh Desai</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Software Engineer at </span><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, takes us through his journey with Apache Airflow. From small contributions to building meaningful connections in the open-source community, Amogh’s story provides actionable insights for anyone on the cusp of their open-source journey.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(02:09) Building data engineering platforms at Cloudera with Kubernetes.</span></p><p><span style="background-color: transparent;">(04:00) Brainstorming led to contributing to Apache Airflow.</span></p><p><span style="background-color: transparent;">(05:17) Starting small with link fixes, progressing to Breeze development.</span></p><p><span style="background-color: transparent;">(07:00) Becoming a committer for Apache Airflow in September 2023.</span></p><p><span style="background-color: transparent;">(09:51) The steep learning curve for contributing to Airflow.</span></p><p><span style="background-color: transparent;">(16:30) Using GitHub’s “good-first-issue” label to get started.</span></p><p><span style="background-color: transparent;">(18:15) Setting up a development environment with Breeze.</span></p><p><span style="background-color: transparent;">(22:00) Open-source contributions enhance your resume and career.</span></p><p><span style="background-color: transparent;">(24:51) Amogh’s advice: Start small and stay consistent.</span></p><p><span style="background-color: transparent;">(28:12) Engage with the community via Slack, email lists and meetups.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/amogh-desai-385141157/?originalSubdomain=in%20%20https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Amogh Desai</a> -</p><p>https://www.linkedin.com/in/amogh-desai-385141157/?originalSubdomain=in%20%20https://www.linkedin.com/company/astronomer/</p><p><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> -</p><p>https://www.linkedin.com/company/astronomer/</p><p><a href="https://github.com/apache/airflow" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow GitHub Repository</a> -</p><p>https://github.com/apache/airflow</p><p><a href="https://github.com/apache/airflow/blob/main/CONTRIBUTING.rst" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Contributors Quick Guide</a> -</p><p>https://github.com/apache/airflow/blob/main/CONTRIBUTING.rst</p><p><a href="https://github.com/apache/airflow/tree/main/dev/breeze" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Breeze Development Tool</a> -</p><p>https://github.com/apache/airflow/tree/main/dev/breeze</p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/f405a575-71b2-4f19-8bdd-7901023974fa/e519a7c888.jpg" />
  <pubDate>Fri, 20 Dec 2024 13:41:00 -0500</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="29190350" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/f405a575-71b2-4f19-8bdd-7901023974fa/episode.mp3" />
  <itunes:title><![CDATA[Exploring the Power of Airflow 3 at Astronomer with Amogh Desai]]></itunes:title>
  <itunes:duration>30:24</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">What does it take to go from fixing a broken link to becoming a committer for one of the world’s leading open-source projects?&nbsp;</span></p><p><br></p><p><a href="https://www.linkedin.com/in/amogh-desai-385141157/?originalSubdomain=in%20%20https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Amogh Desai</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Software Engineer at </span><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, takes us through his journey with Apache Airflow. From small contributions to building meaningful connections in the open-source community, Amogh’s story provides actionable insights for anyone on the cusp of their open-source journey.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(02:09) Building data engineering platforms at Cloudera with Kubernetes.</span></p><p><span style="background-color: transparent;">(04:00) Brainstorming led to contributing to Apache Airflow.</span></p><p><span style="background-color: transparent;">(05:17) Starting small with link fixes, progressing to Breeze development.</span></p><p><span style="background-color: transparent;">(07:00) Becoming a committer for Apache Airflow in September 2023.</span></p><p><span style="background-color: transparent;">(09:51) The steep learning curve for contributing to Airflow.</span></p><p><span style="background-color: transparent;">(16:30) Using GitHub’s “good-first-issue” label to get started.</span></p><p><span style="background-color: transparent;">(18:15) Setting up a development environment with Breeze.</span></p><p><span style="background-color: transparent;">(22:00) Open-source contributions enhance your resume and career.</span></p><p><span style="background-color: transparent;">(24:51) Amogh’s advice: Start small and stay consistent.</span></p><p><span style="background-color: transparent;">(28:12) Engage with the community via Slack, email lists and meetups.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/amogh-desai-385141157/?originalSubdomain=in%20%20https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Amogh Desai</a> -</p><p>https://www.linkedin.com/in/amogh-desai-385141157/?originalSubdomain=in%20%20https://www.linkedin.com/company/astronomer/</p><p><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> -</p><p>https://www.linkedin.com/company/astronomer/</p><p><a href="https://github.com/apache/airflow" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow GitHub Repository</a> -</p><p>https://github.com/apache/airflow</p><p><a href="https://github.com/apache/airflow/blob/main/CONTRIBUTING.rst" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Contributors Quick Guide</a> -</p><p>https://github.com/apache/airflow/blob/main/CONTRIBUTING.rst</p><p><a href="https://github.com/apache/airflow/tree/main/dev/breeze" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Breeze Development Tool</a> -</p><p>https://github.com/apache/airflow/tree/main/dev/breeze</p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent; color: rgb(22, 14, 61);">What does it take to go from fixing a broken link to becoming a committer for one of the world’s leading open-source projects?&nbsp;</span></p><p><br></p><p><a href="https://www.linkedin.com/in/amogh-desai-385141157/?originalSubdomain=in%20%20https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Amogh Desai</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, Senior Software Engineer at </span><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a><span style="background-color: transparent; color: rgb(22, 14, 61);">, takes us through his journey with Apache Airflow. From small contributions to building meaningful connections in the open-source community, Amogh’s story provides actionable insights for anyone on the cusp of their open-source journey.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(02:09) Building data engineering platforms at Cloudera with Kubernetes.</span></p><p><span style="background-color: transparent;">(04:00) Brainstorming led to contributing to Apache Airflow.</span></p><p><span style="background-color: transparent;">(05:17) Starting small with link fixes, progressing to Breeze development.</span></p><p><span style="background-color: transparent;">(07:00) Becoming a committer for Apache Airflow in September 2023.</span></p><p><span style="background-color: transparent;">(09:51) The steep learning curve for contributing to Airflow.</span></p><p><span style="background-color: transparent;">(16:30) Using GitHub’s “good-first-issue” label to get started.</span></p><p><span style="background-color: transparent;">(18:15) Setting up a development environment with Breeze.</span></p><p><span style="background-color: transparent;">(22:00) Open-source contributions enhance your resume and career.</span></p><p><span style="background-color: transparent;">(24:51) Amogh’s advice: Start small and stay consistent.</span></p><p><span style="background-color: transparent;">(28:12) Engage with the community via Slack, email lists and meetups.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/amogh-desai-385141157/?originalSubdomain=in%20%20https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Amogh Desai</a> -</p><p>https://www.linkedin.com/in/amogh-desai-385141157/?originalSubdomain=in%20%20https://www.linkedin.com/company/astronomer/</p><p><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> -</p><p>https://www.linkedin.com/company/astronomer/</p><p><a href="https://github.com/apache/airflow" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow GitHub Repository</a> -</p><p>https://github.com/apache/airflow</p><p><a href="https://github.com/apache/airflow/blob/main/CONTRIBUTING.rst" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Contributors Quick Guide</a> -</p><p>https://github.com/apache/airflow/blob/main/CONTRIBUTING.rst</p><p><a href="https://github.com/apache/airflow/tree/main/dev/breeze" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Breeze Development Tool</a> -</p><p>https://github.com/apache/airflow/tree/main/dev/breeze</p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[What does it take to go from fixing a broken link to becoming a committer for one of the world’s leading open-source projects? Amogh Desai, Senior Software Engineer at Astronomer, takes us through his journey with Apache Airflow. From small contrib...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>21</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[5a3328c2-a882-4c84-b0a1-406515373bb3]]></guid>
  <title><![CDATA[Using Airflow To Power Machine Learning Pipelines at Optimove with Vasyl Vasyuta]]></title>
  <description><![CDATA[<p><span style="background-color: transparent;">Data orchestration and machine learning are shaping how organizations handle massive datasets and drive customer-focused strategies. Tools like Apache Airflow are central to this transformation. In this episode, </span><a href="https://www.linkedin.com/in/vasyl-vasyuta-3270b54a/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Vasyl Vasyuta</a><span style="background-color: transparent;">, R&amp;D Team Leader at </span><a href="https://www.linkedin.com/company/optimove/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Optimove</a><span style="background-color: transparent;">, joins us to discuss how his team leverages Airflow to optimize data processing, orchestrate machine learning models and create personalized customer experiences.</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><strong style="background-color: transparent;"><span class="ql-cursor">﻿</span></strong></p><p><span style="background-color: transparent;">(01:59) Optimove tailors marketing notifications with personalized customer journeys.</span></p><p><span style="background-color: transparent;">(04:25) Airflow orchestrates Snowflake procedures for massive datasets.</span></p><p><span style="background-color: transparent;">(05:11) DAGs manage workflows with branching and replay plugins.</span></p><p><span style="background-color: transparent;">(05:41) The "Joystick" plugin enables seamless data replays.</span></p><p><span style="background-color: transparent;">(09:33) Airflow supports MLOps for customer data grouping.</span></p><p><span style="background-color: transparent;">(11:15) Machine learning predicts customer behavior for better campaigns.</span></p><p><span style="background-color: transparent;">(13:20) Thousands of DAGs run every five minutes for data processing.</span></p><p><span style="background-color: transparent;">(15:36) Custom versioning allows rollbacks and gradual rollouts.</span></p><p><span style="background-color: transparent;">(18:00) Airflow logs enhance operational observability.</span></p><p><span style="background-color: transparent;">(23:00) DAG versioning in Airflow 3.0 could boost efficiency.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/vasyl-vasyuta-3270b54a/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Vasyl Vasyuta</a> -</p><p>https://www.linkedin.com/in/vasyl-vasyuta-3270b54a/</p><p><br></p><p><a href="https://www.linkedin.com/company/optimove/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Optimove</a> -</p><p>https://www.linkedin.com/company/optimove/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.snowflake.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Snowflake</a> -</p><p>https://www.snowflake.com/</p><p><br></p><p><a href="https://www.datadoghq.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Datadog</a> -</p><p>https://www.datadoghq.com/</p><p><br></p><p><a href="https://astronomer.typeform.com/airflowsurvey24" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow Survey</a> -</p><p>https://astronomer.typeform.com/airflowsurvey24</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/9305765e-7452-48ef-9afe-647b054d3913/adfb9b5798.jpg" />
  <pubDate>Thu, 12 Dec 2024 11:28:03 -0500</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="23219385" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/9305765e-7452-48ef-9afe-647b054d3913/episode.mp3" />
  <itunes:title><![CDATA[Using Airflow To Power Machine Learning Pipelines at Optimove with Vasyl Vasyuta]]></itunes:title>
  <itunes:duration>24:11</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent;">Data orchestration and machine learning are shaping how organizations handle massive datasets and drive customer-focused strategies. Tools like Apache Airflow are central to this transformation. In this episode, </span><a href="https://www.linkedin.com/in/vasyl-vasyuta-3270b54a/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Vasyl Vasyuta</a><span style="background-color: transparent;">, R&amp;D Team Leader at </span><a href="https://www.linkedin.com/company/optimove/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Optimove</a><span style="background-color: transparent;">, joins us to discuss how his team leverages Airflow to optimize data processing, orchestrate machine learning models and create personalized customer experiences.</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><strong style="background-color: transparent;"><span class="ql-cursor">﻿</span></strong></p><p><span style="background-color: transparent;">(01:59) Optimove tailors marketing notifications with personalized customer journeys.</span></p><p><span style="background-color: transparent;">(04:25) Airflow orchestrates Snowflake procedures for massive datasets.</span></p><p><span style="background-color: transparent;">(05:11) DAGs manage workflows with branching and replay plugins.</span></p><p><span style="background-color: transparent;">(05:41) The "Joystick" plugin enables seamless data replays.</span></p><p><span style="background-color: transparent;">(09:33) Airflow supports MLOps for customer data grouping.</span></p><p><span style="background-color: transparent;">(11:15) Machine learning predicts customer behavior for better campaigns.</span></p><p><span style="background-color: transparent;">(13:20) Thousands of DAGs run every five minutes for data processing.</span></p><p><span style="background-color: transparent;">(15:36) Custom versioning allows rollbacks and gradual rollouts.</span></p><p><span style="background-color: transparent;">(18:00) Airflow logs enhance operational observability.</span></p><p><span style="background-color: transparent;">(23:00) DAG versioning in Airflow 3.0 could boost efficiency.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/vasyl-vasyuta-3270b54a/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Vasyl Vasyuta</a> -</p><p>https://www.linkedin.com/in/vasyl-vasyuta-3270b54a/</p><p><br></p><p><a href="https://www.linkedin.com/company/optimove/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Optimove</a> -</p><p>https://www.linkedin.com/company/optimove/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.snowflake.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Snowflake</a> -</p><p>https://www.snowflake.com/</p><p><br></p><p><a href="https://www.datadoghq.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Datadog</a> -</p><p>https://www.datadoghq.com/</p><p><br></p><p><a href="https://astronomer.typeform.com/airflowsurvey24" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow Survey</a> -</p><p>https://astronomer.typeform.com/airflowsurvey24</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent;">Data orchestration and machine learning are shaping how organizations handle massive datasets and drive customer-focused strategies. Tools like Apache Airflow are central to this transformation. In this episode, </span><a href="https://www.linkedin.com/in/vasyl-vasyuta-3270b54a/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Vasyl Vasyuta</a><span style="background-color: transparent;">, R&amp;D Team Leader at </span><a href="https://www.linkedin.com/company/optimove/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Optimove</a><span style="background-color: transparent;">, joins us to discuss how his team leverages Airflow to optimize data processing, orchestrate machine learning models and create personalized customer experiences.</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><strong style="background-color: transparent;"><span class="ql-cursor">﻿</span></strong></p><p><span style="background-color: transparent;">(01:59) Optimove tailors marketing notifications with personalized customer journeys.</span></p><p><span style="background-color: transparent;">(04:25) Airflow orchestrates Snowflake procedures for massive datasets.</span></p><p><span style="background-color: transparent;">(05:11) DAGs manage workflows with branching and replay plugins.</span></p><p><span style="background-color: transparent;">(05:41) The "Joystick" plugin enables seamless data replays.</span></p><p><span style="background-color: transparent;">(09:33) Airflow supports MLOps for customer data grouping.</span></p><p><span style="background-color: transparent;">(11:15) Machine learning predicts customer behavior for better campaigns.</span></p><p><span style="background-color: transparent;">(13:20) Thousands of DAGs run every five minutes for data processing.</span></p><p><span style="background-color: transparent;">(15:36) Custom versioning allows rollbacks and gradual rollouts.</span></p><p><span style="background-color: transparent;">(18:00) Airflow logs enhance operational observability.</span></p><p><span style="background-color: transparent;">(23:00) DAG versioning in Airflow 3.0 could boost efficiency.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/vasyl-vasyuta-3270b54a/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Vasyl Vasyuta</a> -</p><p>https://www.linkedin.com/in/vasyl-vasyuta-3270b54a/</p><p><br></p><p><a href="https://www.linkedin.com/company/optimove/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Optimove</a> -</p><p>https://www.linkedin.com/company/optimove/</p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><br></p><p><a href="https://www.snowflake.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Snowflake</a> -</p><p>https://www.snowflake.com/</p><p><br></p><p><a href="https://www.datadoghq.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Datadog</a> -</p><p>https://www.datadoghq.com/</p><p><br></p><p><a href="https://astronomer.typeform.com/airflowsurvey24" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow Survey</a> -</p><p>https://astronomer.typeform.com/airflowsurvey24</p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Data orchestration and machine learning are shaping how organizations handle massive datasets and drive customer-focused strategies. Tools like Apache Airflow are central to this transformation. In this episode, Vasyl Vasyuta, R&D Team Leader at Op...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>20</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[e8ea0543-ae88-4981-bdd7-6992f4a87401]]></guid>
  <title><![CDATA[Maximizing Business Impact Through Data at GlossGenius with Katie Bauer]]></title>
  <description><![CDATA[<p><span style="background-color: transparent;">Bridging the gap between data teams and business priorities is essential for maximizing impact and building value-driven workflows. </span><a href="https://www.linkedin.com/in/mkatiebauer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Katie Bauer</a><span style="background-color: transparent;">, Senior Director of Data at </span><a href="https://www.linkedin.com/company/glossgenius/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">GlossGenius</a><span style="background-color: transparent;">, joins us to share her principles for creating effective, aligned data teams. In this episode, Katie draws from her experience at GlossGenius, Reddit and Twitter to highlight the common pitfalls data teams face and how to overcome them. She offers practical strategies for aligning team efforts with organizational goals and fostering collaboration with stakeholders.</span></p><p><br></p><p><span style="background-color: transparent;">&nbsp;</span><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(02:36) GlossGenius provides an all-in-one platform for beauty professionals.</span></p><p><span style="background-color: transparent;">(03:59) Airflow orchestrates data and MLOps workflows at GlossGenius.</span></p><p><span style="background-color: transparent;">(04:41) Focusing on value helps data teams achieve greater impact.</span></p><p><span style="background-color: transparent;">(06:23) Aligning team priorities with company goals minimizes friction.</span></p><p><span style="background-color: transparent;">(08:44) Building strong stakeholder relationships requires curiosity.</span></p><p><span style="background-color: transparent;">(12:46) Treating roles as flexible fosters team innovation.</span></p><p><span style="background-color: transparent;">(13:21) Adapting to new technologies improves effectiveness.</span></p><p><span style="background-color: transparent;">(18:28) Acting like your time is valuable earns respect.</span></p><p><span style="background-color: transparent;">(23:38) Proactive data initiatives drive strategic value.</span></p><p><span style="background-color: transparent;">(24:20) Usage data offers critical insights into tool effectiveness.</span></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/mkatiebauer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Katie Bauer</a> -</p><p>https://www.linkedin.com/in/mkatiebauer/</p><p><a href="https://www.linkedin.com/company/glossgenius/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">GlossGenius</a> -</p><p>https://www.linkedin.com/company/glossgenius/</p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><a href="https://www.getdbt.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">DBT</a> -</p><p>https://www.getdbt.com/</p><p><a href="https://cosmos.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Cosmos</a> -</p><p>https://cosmos.apache.org/</p><p><a href="https://astronomer.typeform.com/airflowsurvey24" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow Survey</a> -</p><p>https://astronomer.typeform.com/airflowsurvey24</p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/55f26613-29f4-42a8-bf49-b2ef2882a39e/2be745e146.jpg" />
  <pubDate>Thu, 05 Dec 2024 04:19:38 -0500</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="24788822" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/55f26613-29f4-42a8-bf49-b2ef2882a39e/episode.mp3" />
  <itunes:title><![CDATA[Maximizing Business Impact Through Data at GlossGenius with Katie Bauer]]></itunes:title>
  <itunes:duration>25:49</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent;">Bridging the gap between data teams and business priorities is essential for maximizing impact and building value-driven workflows. </span><a href="https://www.linkedin.com/in/mkatiebauer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Katie Bauer</a><span style="background-color: transparent;">, Senior Director of Data at </span><a href="https://www.linkedin.com/company/glossgenius/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">GlossGenius</a><span style="background-color: transparent;">, joins us to share her principles for creating effective, aligned data teams. In this episode, Katie draws from her experience at GlossGenius, Reddit and Twitter to highlight the common pitfalls data teams face and how to overcome them. She offers practical strategies for aligning team efforts with organizational goals and fostering collaboration with stakeholders.</span></p><p><br></p><p><span style="background-color: transparent;">&nbsp;</span><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(02:36) GlossGenius provides an all-in-one platform for beauty professionals.</span></p><p><span style="background-color: transparent;">(03:59) Airflow orchestrates data and MLOps workflows at GlossGenius.</span></p><p><span style="background-color: transparent;">(04:41) Focusing on value helps data teams achieve greater impact.</span></p><p><span style="background-color: transparent;">(06:23) Aligning team priorities with company goals minimizes friction.</span></p><p><span style="background-color: transparent;">(08:44) Building strong stakeholder relationships requires curiosity.</span></p><p><span style="background-color: transparent;">(12:46) Treating roles as flexible fosters team innovation.</span></p><p><span style="background-color: transparent;">(13:21) Adapting to new technologies improves effectiveness.</span></p><p><span style="background-color: transparent;">(18:28) Acting like your time is valuable earns respect.</span></p><p><span style="background-color: transparent;">(23:38) Proactive data initiatives drive strategic value.</span></p><p><span style="background-color: transparent;">(24:20) Usage data offers critical insights into tool effectiveness.</span></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/mkatiebauer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Katie Bauer</a> -</p><p>https://www.linkedin.com/in/mkatiebauer/</p><p><a href="https://www.linkedin.com/company/glossgenius/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">GlossGenius</a> -</p><p>https://www.linkedin.com/company/glossgenius/</p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><a href="https://www.getdbt.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">DBT</a> -</p><p>https://www.getdbt.com/</p><p><a href="https://cosmos.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Cosmos</a> -</p><p>https://cosmos.apache.org/</p><p><a href="https://astronomer.typeform.com/airflowsurvey24" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow Survey</a> -</p><p>https://astronomer.typeform.com/airflowsurvey24</p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent;">Bridging the gap between data teams and business priorities is essential for maximizing impact and building value-driven workflows. </span><a href="https://www.linkedin.com/in/mkatiebauer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Katie Bauer</a><span style="background-color: transparent;">, Senior Director of Data at </span><a href="https://www.linkedin.com/company/glossgenius/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">GlossGenius</a><span style="background-color: transparent;">, joins us to share her principles for creating effective, aligned data teams. In this episode, Katie draws from her experience at GlossGenius, Reddit and Twitter to highlight the common pitfalls data teams face and how to overcome them. She offers practical strategies for aligning team efforts with organizational goals and fostering collaboration with stakeholders.</span></p><p><br></p><p><span style="background-color: transparent;">&nbsp;</span><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(02:36) GlossGenius provides an all-in-one platform for beauty professionals.</span></p><p><span style="background-color: transparent;">(03:59) Airflow orchestrates data and MLOps workflows at GlossGenius.</span></p><p><span style="background-color: transparent;">(04:41) Focusing on value helps data teams achieve greater impact.</span></p><p><span style="background-color: transparent;">(06:23) Aligning team priorities with company goals minimizes friction.</span></p><p><span style="background-color: transparent;">(08:44) Building strong stakeholder relationships requires curiosity.</span></p><p><span style="background-color: transparent;">(12:46) Treating roles as flexible fosters team innovation.</span></p><p><span style="background-color: transparent;">(13:21) Adapting to new technologies improves effectiveness.</span></p><p><span style="background-color: transparent;">(18:28) Acting like your time is valuable earns respect.</span></p><p><span style="background-color: transparent;">(23:38) Proactive data initiatives drive strategic value.</span></p><p><span style="background-color: transparent;">(24:20) Usage data offers critical insights into tool effectiveness.</span></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/mkatiebauer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Katie Bauer</a> -</p><p>https://www.linkedin.com/in/mkatiebauer/</p><p><a href="https://www.linkedin.com/company/glossgenius/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">GlossGenius</a> -</p><p>https://www.linkedin.com/company/glossgenius/</p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><a href="https://www.getdbt.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">DBT</a> -</p><p>https://www.getdbt.com/</p><p><a href="https://cosmos.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Cosmos</a> -</p><p>https://cosmos.apache.org/</p><p><a href="https://astronomer.typeform.com/airflowsurvey24" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow Survey</a> -</p><p>https://astronomer.typeform.com/airflowsurvey24</p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Bridging the gap between data teams and business priorities is essential for maximizing impact and building value-driven workflows. Katie Bauer, Senior Director of Data at GlossGenius, joins us to share her principles for creating effective, aligne...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>19</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[6739664a-6543-4340-9a94-e344d67b5405]]></guid>
  <title><![CDATA[Optimizing Large-Scale Deployments at LinkedIn with Rahul Gade]]></title>
  <description><![CDATA[<p><span style="background-color: transparent;">Scaling deployments for a billion users demands innovation, precision and resilience. In this episode, we dive into how LinkedIn optimizes its continuous deployment process using Apache Airflow. </span><a href="https://www.linkedin.com/in/rahul-gade-68666818/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Rahul Gade</a><span style="background-color: transparent;">, Staff Software Engineer at </span><a href="https://www.linkedin.com/company/linkedin/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">LinkedIn</a><span style="background-color: transparent;">, shares his insights on building scalable systems and democratizing deployments for over 10,000 engineers.&nbsp;</span></p><p><span style="background-color: transparent;">Rahul discusses the challenges of managing large-scale deployments across 6,000 services and how his team leverages Airflow to enhance efficiency, reliability and user accessibility.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(01:36) LinkedIn minimizes human involvement in production to reduce errors.</span></p><p><span style="background-color: transparent;">(02:00) Airflow powers LinkedIn’s Continuous Deployment platform.</span></p><p><span style="background-color: transparent;">(05:43) Continuous deployment adoption grew from 8% to a targeted 80%.</span></p><p><span style="background-color: transparent;">(11:25) Kubernetes ensures scalability and flexibility for deployments.</span></p><p><span style="background-color: transparent;">(12:04) A custom UI offers real-time deployment transparency.</span></p><p><span style="background-color: transparent;">(16:23) No-code YAML workflows simplify deployment tasks.</span></p><p><span style="background-color: transparent;">(17:18) Canaries and metrics ensure safe deployments across fabrics.</span></p><p><span style="background-color: transparent;">(20:45) A gateway service ensures redundancy across Airflow clusters.</span></p><p><span style="background-color: transparent;">(24:22) Abstractions let engineers focus on development, not logistics.</span></p><p><span style="background-color: transparent;">(25:20) Multi-language support in Airflow 3.0 simplifies adoption.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/rahul-gade-68666818/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Rahul Gade</a> -</p><p>https://www.linkedin.com/in/rahul-gade-68666818/</p><p><a href="https://www.linkedin.com/company/linkedin/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">LinkedIn</a> -</p><p>https://www.linkedin.com/company/linkedin/</p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><a href="https://kubernetes.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a> -</p><p>https://kubernetes.io/</p><p><a href="https://www.openpolicyagent.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Open Policy Agent (OPA)</a> -</p><p>https://www.openpolicyagent.org/</p><p><a href="https://backstage.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Backstage</a> -</p><p>https://backstage.io/</p><p><a href="https://astronomer.typeform.com/airflowsurvey24" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow Survey</a> -</p><p>https://astronomer.typeform.com/airflowsurvey24</p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/826ac8fa-d92c-405c-a6a6-d2c2194481af/672e3150bb.jpg" />
  <pubDate>Mon, 02 Dec 2024 13:03:19 -0500</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="26683849" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/826ac8fa-d92c-405c-a6a6-d2c2194481af/episode.mp3" />
  <itunes:title><![CDATA[Optimizing Large-Scale Deployments at LinkedIn with Rahul Gade]]></itunes:title>
  <itunes:duration>27:47</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent;">Scaling deployments for a billion users demands innovation, precision and resilience. In this episode, we dive into how LinkedIn optimizes its continuous deployment process using Apache Airflow. </span><a href="https://www.linkedin.com/in/rahul-gade-68666818/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Rahul Gade</a><span style="background-color: transparent;">, Staff Software Engineer at </span><a href="https://www.linkedin.com/company/linkedin/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">LinkedIn</a><span style="background-color: transparent;">, shares his insights on building scalable systems and democratizing deployments for over 10,000 engineers.&nbsp;</span></p><p><span style="background-color: transparent;">Rahul discusses the challenges of managing large-scale deployments across 6,000 services and how his team leverages Airflow to enhance efficiency, reliability and user accessibility.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(01:36) LinkedIn minimizes human involvement in production to reduce errors.</span></p><p><span style="background-color: transparent;">(02:00) Airflow powers LinkedIn’s Continuous Deployment platform.</span></p><p><span style="background-color: transparent;">(05:43) Continuous deployment adoption grew from 8% to a targeted 80%.</span></p><p><span style="background-color: transparent;">(11:25) Kubernetes ensures scalability and flexibility for deployments.</span></p><p><span style="background-color: transparent;">(12:04) A custom UI offers real-time deployment transparency.</span></p><p><span style="background-color: transparent;">(16:23) No-code YAML workflows simplify deployment tasks.</span></p><p><span style="background-color: transparent;">(17:18) Canaries and metrics ensure safe deployments across fabrics.</span></p><p><span style="background-color: transparent;">(20:45) A gateway service ensures redundancy across Airflow clusters.</span></p><p><span style="background-color: transparent;">(24:22) Abstractions let engineers focus on development, not logistics.</span></p><p><span style="background-color: transparent;">(25:20) Multi-language support in Airflow 3.0 simplifies adoption.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/rahul-gade-68666818/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Rahul Gade</a> -</p><p>https://www.linkedin.com/in/rahul-gade-68666818/</p><p><a href="https://www.linkedin.com/company/linkedin/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">LinkedIn</a> -</p><p>https://www.linkedin.com/company/linkedin/</p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><a href="https://kubernetes.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a> -</p><p>https://kubernetes.io/</p><p><a href="https://www.openpolicyagent.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Open Policy Agent (OPA)</a> -</p><p>https://www.openpolicyagent.org/</p><p><a href="https://backstage.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Backstage</a> -</p><p>https://backstage.io/</p><p><a href="https://astronomer.typeform.com/airflowsurvey24" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow Survey</a> -</p><p>https://astronomer.typeform.com/airflowsurvey24</p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent;">Scaling deployments for a billion users demands innovation, precision and resilience. In this episode, we dive into how LinkedIn optimizes its continuous deployment process using Apache Airflow. </span><a href="https://www.linkedin.com/in/rahul-gade-68666818/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Rahul Gade</a><span style="background-color: transparent;">, Staff Software Engineer at </span><a href="https://www.linkedin.com/company/linkedin/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">LinkedIn</a><span style="background-color: transparent;">, shares his insights on building scalable systems and democratizing deployments for over 10,000 engineers.&nbsp;</span></p><p><span style="background-color: transparent;">Rahul discusses the challenges of managing large-scale deployments across 6,000 services and how his team leverages Airflow to enhance efficiency, reliability and user accessibility.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(01:36) LinkedIn minimizes human involvement in production to reduce errors.</span></p><p><span style="background-color: transparent;">(02:00) Airflow powers LinkedIn’s Continuous Deployment platform.</span></p><p><span style="background-color: transparent;">(05:43) Continuous deployment adoption grew from 8% to a targeted 80%.</span></p><p><span style="background-color: transparent;">(11:25) Kubernetes ensures scalability and flexibility for deployments.</span></p><p><span style="background-color: transparent;">(12:04) A custom UI offers real-time deployment transparency.</span></p><p><span style="background-color: transparent;">(16:23) No-code YAML workflows simplify deployment tasks.</span></p><p><span style="background-color: transparent;">(17:18) Canaries and metrics ensure safe deployments across fabrics.</span></p><p><span style="background-color: transparent;">(20:45) A gateway service ensures redundancy across Airflow clusters.</span></p><p><span style="background-color: transparent;">(24:22) Abstractions let engineers focus on development, not logistics.</span></p><p><span style="background-color: transparent;">(25:20) Multi-language support in Airflow 3.0 simplifies adoption.</span></p><p><br></p><p><strong style="background-color: transparent; color: rgb(22, 14, 61);">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/rahul-gade-68666818/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Rahul Gade</a> -</p><p>https://www.linkedin.com/in/rahul-gade-68666818/</p><p><a href="https://www.linkedin.com/company/linkedin/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">LinkedIn</a> -</p><p>https://www.linkedin.com/company/linkedin/</p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><a href="https://kubernetes.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a> -</p><p>https://kubernetes.io/</p><p><a href="https://www.openpolicyagent.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Open Policy Agent (OPA)</a> -</p><p>https://www.openpolicyagent.org/</p><p><a href="https://backstage.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Backstage</a> -</p><p>https://backstage.io/</p><p><a href="https://astronomer.typeform.com/airflowsurvey24" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow Survey</a> -</p><p>https://astronomer.typeform.com/airflowsurvey24</p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Scaling deployments for a billion users demands innovation, precision and resilience. In this episode, we dive into how LinkedIn optimizes its continuous deployment process using Apache Airflow. Rahul Gade, Staff Software Engineer at LinkedIn, shar...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>18</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[9ce80b41-338d-4c5e-bcc7-a182b9eb479f]]></guid>
  <title><![CDATA[How Uber Manages 1 Million Daily Tasks Using Airflow, with Shobhit Shah and Sumit Maheshwari]]></title>
  <description><![CDATA[<p><span style="background-color: transparent;">When data orchestration reaches Uber’s scale, innovation becomes a necessity, not a luxury. In this episode, we discuss the innovations behind Uber’s unique Airflow setup. With our guests </span><a href="https://www.linkedin.com/in/shahshobhit/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Shobhit Shah</a><span style="background-color: transparent;"> and </span><a href="https://www.linkedin.com/in/maheshwarisumit/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Sumit Maheshwari</a><span style="background-color: transparent;">, both Staff Software Engineers at </span><a href="https://www.linkedin.com/company/uber-com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Uber</a><span style="background-color: transparent;">, we explore how their team manages one of the largest data workflow systems in the world. Shobhit and Sumit walk us through the evolution of Uber’s Airflow implementation, detailing the custom solutions that support 200,000 daily pipelines. They discuss Uber's approach to tackling complex challenges in data orchestration, disaster recovery and scaling to meet the company’s extensive data needs.</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><span style="background-color: transparent;">(02:03) Airflow as a service streamlines Uber’s data workflows.</span></p><p><span style="background-color: transparent;">(06:16) Serialization boosts security and reduces errors.</span></p><p><span style="background-color: transparent;">(10:05) Java-based scheduler improves system reliability.</span></p><p><span style="background-color: transparent;">(13:40) Custom recovery model supports emergency pipeline switching.</span></p><p><span style="background-color: transparent;">(15:58) No-code UI allows easy pipeline creation for non-coders.</span></p><p><span style="background-color: transparent;">(18:12) Backfill feature enables historical data processing.</span></p><p><span style="background-color: transparent;">(22:06) Regular updates keep Uber aligned with Airflow advancements.</span></p><p><span style="background-color: transparent;">(26:07) Plans to leverage Airflow’s latest features.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/shahshobhit/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Shobhit Shah</a> -</p><p>https://www.linkedin.com/in/shahshobhit/</p><p><a href="https://www.linkedin.com/in/maheshwarisumit/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Sumit Maheshwar</a> -</p><p>https://www.linkedin.com/in/maheshwarisumit/</p><p><a href="https://www.linkedin.com/company/uber-com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Uber</a> -</p><p>https://www.linkedin.com/company/uber-com/</p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><a href="https://airflowsummit.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Summit</a> -</p><p>https://airflowsummit.org/</p><p><a href="https://www.uber.com/tw/en/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Uber</a> -</p><p>https://www.uber.com/tw/en/</p><p><a href="https://astronomer.typeform.com/airflowsurvey24" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow Survey</a> -</p><p>https://astronomer.typeform.com/airflowsurvey24</p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/2da04614-e3d7-4bc3-b989-194b51976e77/9096e68c77.jpg" />
  <pubDate>Thu, 14 Nov 2024 01:00:00 -0500</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="27584551" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/2da04614-e3d7-4bc3-b989-194b51976e77/episode.mp3" />
  <itunes:title><![CDATA[How Uber Manages 1 Million Daily Tasks Using Airflow, with Shobhit Shah and Sumit Maheshwari]]></itunes:title>
  <itunes:duration>28:44</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent;">When data orchestration reaches Uber’s scale, innovation becomes a necessity, not a luxury. In this episode, we discuss the innovations behind Uber’s unique Airflow setup. With our guests </span><a href="https://www.linkedin.com/in/shahshobhit/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Shobhit Shah</a><span style="background-color: transparent;"> and </span><a href="https://www.linkedin.com/in/maheshwarisumit/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Sumit Maheshwari</a><span style="background-color: transparent;">, both Staff Software Engineers at </span><a href="https://www.linkedin.com/company/uber-com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Uber</a><span style="background-color: transparent;">, we explore how their team manages one of the largest data workflow systems in the world. Shobhit and Sumit walk us through the evolution of Uber’s Airflow implementation, detailing the custom solutions that support 200,000 daily pipelines. They discuss Uber's approach to tackling complex challenges in data orchestration, disaster recovery and scaling to meet the company’s extensive data needs.</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><span style="background-color: transparent;">(02:03) Airflow as a service streamlines Uber’s data workflows.</span></p><p><span style="background-color: transparent;">(06:16) Serialization boosts security and reduces errors.</span></p><p><span style="background-color: transparent;">(10:05) Java-based scheduler improves system reliability.</span></p><p><span style="background-color: transparent;">(13:40) Custom recovery model supports emergency pipeline switching.</span></p><p><span style="background-color: transparent;">(15:58) No-code UI allows easy pipeline creation for non-coders.</span></p><p><span style="background-color: transparent;">(18:12) Backfill feature enables historical data processing.</span></p><p><span style="background-color: transparent;">(22:06) Regular updates keep Uber aligned with Airflow advancements.</span></p><p><span style="background-color: transparent;">(26:07) Plans to leverage Airflow’s latest features.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/shahshobhit/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Shobhit Shah</a> -</p><p>https://www.linkedin.com/in/shahshobhit/</p><p><a href="https://www.linkedin.com/in/maheshwarisumit/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Sumit Maheshwar</a> -</p><p>https://www.linkedin.com/in/maheshwarisumit/</p><p><a href="https://www.linkedin.com/company/uber-com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Uber</a> -</p><p>https://www.linkedin.com/company/uber-com/</p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><a href="https://airflowsummit.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Summit</a> -</p><p>https://airflowsummit.org/</p><p><a href="https://www.uber.com/tw/en/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Uber</a> -</p><p>https://www.uber.com/tw/en/</p><p><a href="https://astronomer.typeform.com/airflowsurvey24" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow Survey</a> -</p><p>https://astronomer.typeform.com/airflowsurvey24</p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent;">When data orchestration reaches Uber’s scale, innovation becomes a necessity, not a luxury. In this episode, we discuss the innovations behind Uber’s unique Airflow setup. With our guests </span><a href="https://www.linkedin.com/in/shahshobhit/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Shobhit Shah</a><span style="background-color: transparent;"> and </span><a href="https://www.linkedin.com/in/maheshwarisumit/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Sumit Maheshwari</a><span style="background-color: transparent;">, both Staff Software Engineers at </span><a href="https://www.linkedin.com/company/uber-com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Uber</a><span style="background-color: transparent;">, we explore how their team manages one of the largest data workflow systems in the world. Shobhit and Sumit walk us through the evolution of Uber’s Airflow implementation, detailing the custom solutions that support 200,000 daily pipelines. They discuss Uber's approach to tackling complex challenges in data orchestration, disaster recovery and scaling to meet the company’s extensive data needs.</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><span style="background-color: transparent;">(02:03) Airflow as a service streamlines Uber’s data workflows.</span></p><p><span style="background-color: transparent;">(06:16) Serialization boosts security and reduces errors.</span></p><p><span style="background-color: transparent;">(10:05) Java-based scheduler improves system reliability.</span></p><p><span style="background-color: transparent;">(13:40) Custom recovery model supports emergency pipeline switching.</span></p><p><span style="background-color: transparent;">(15:58) No-code UI allows easy pipeline creation for non-coders.</span></p><p><span style="background-color: transparent;">(18:12) Backfill feature enables historical data processing.</span></p><p><span style="background-color: transparent;">(22:06) Regular updates keep Uber aligned with Airflow advancements.</span></p><p><span style="background-color: transparent;">(26:07) Plans to leverage Airflow’s latest features.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/shahshobhit/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Shobhit Shah</a> -</p><p>https://www.linkedin.com/in/shahshobhit/</p><p><a href="https://www.linkedin.com/in/maheshwarisumit/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Sumit Maheshwar</a> -</p><p>https://www.linkedin.com/in/maheshwarisumit/</p><p><a href="https://www.linkedin.com/company/uber-com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Uber</a> -</p><p>https://www.linkedin.com/company/uber-com/</p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><a href="https://airflowsummit.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Airflow Summit</a> -</p><p>https://airflowsummit.org/</p><p><a href="https://www.uber.com/tw/en/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Uber</a> -</p><p>https://www.uber.com/tw/en/</p><p><a href="https://astronomer.typeform.com/airflowsurvey24" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow Survey</a> -</p><p>https://astronomer.typeform.com/airflowsurvey24</p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[When data orchestration reaches Uber’s scale, innovation becomes a necessity, not a luxury. In this episode, we discuss the innovations behind Uber’s unique Airflow setup. With our guests Shobhit Shah and Sumit Maheshwari, both Staff Software Engin...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>17</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[141a5166-e67c-4d6f-a98a-6b5f527f0f22]]></guid>
  <title><![CDATA[Building Resilient Data Systems for Modern Enterprises at Astrafy with Andrea Bombino]]></title>
  <description><![CDATA[<p><span style="background-color: transparent;">Efficient data orchestration is the backbone of modern analytics and AI-driven workflows. Without the right tools, even the best data can fall short of its potential. In this episode, </span><a href="https://www.linkedin.com/in/andrea-bombino/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Andrea Bombino</a><span style="background-color: transparent;">, Co-Founder and Head of Analytics Engineering at </span><a href="https://www.linkedin.com/company/astrafy/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astrafy</a><span style="background-color: transparent;">, shares insights into his team’s approach to optimizing data transformation and orchestration using tools like datasets and Pub/Sub to drive real-time processing. Andrea explains how they leverage Apache Airflow and Google Cloud to power dynamic data workflows.&nbsp;</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(01:55) Astrafy helps companies manage data using Google Cloud.</span></p><p><span style="background-color: transparent;">(04:36) Airflow is central to Astrafy’s data engineering efforts.</span></p><p><span style="background-color: transparent;">(07:17) Datasets and Pub/Sub are used for real-time workflows.</span></p><p><span style="background-color: transparent;">(09:59) Pub/Sub links multiple Airflow environments.</span></p><p><span style="background-color: transparent;">(12:40) Datasets eliminate the need for constant monitoring.</span></p><p><span style="background-color: transparent;">(15:22) Airflow updates have improved large-scale data operations.</span></p><p><span style="background-color: transparent;">(18:03) New Airflow API features make dataset updates easier.</span></p><p><span style="background-color: transparent;">(20:45) Real-time orchestration speeds up data processing for clients.</span></p><p><span style="background-color: transparent;">(23:26) Pub/Sub enhances flexibility across cloud environments.</span></p><p><span style="background-color: transparent;">(26:08) Future Airflow features will offer more control over data workflows.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/andrea-bombino/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Andrea Bombino</a> -</p><p>https://www.linkedin.com/in/andrea-bombino/</p><p><a href="https://www.linkedin.com/company/astrafy/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astrafy</a> -</p><p>https://www.linkedin.com/company/astrafy/</p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><a href="https://cloud.google.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Google Cloud</a> -</p><p>https://cloud.google.com/</p><p><a href="https://www.getdbt.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbt</a> -</p><p>https://www.getdbt.com/</p><p><a href="https://astronomer.typeform.com/airflowsurvey24" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow Survey</a> -</p><p>https://astronomer.typeform.com/airflowsurvey24</p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/c752d1ca-58ca-4bdc-910e-1dfa389bffd0/40b27f5805.jpg" />
  <pubDate>Thu, 07 Nov 2024 03:27:47 -0500</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="27355927" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/c752d1ca-58ca-4bdc-910e-1dfa389bffd0/episode.mp3" />
  <itunes:title><![CDATA[Building Resilient Data Systems for Modern Enterprises at Astrafy with Andrea Bombino]]></itunes:title>
  <itunes:duration>28:29</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent;">Efficient data orchestration is the backbone of modern analytics and AI-driven workflows. Without the right tools, even the best data can fall short of its potential. In this episode, </span><a href="https://www.linkedin.com/in/andrea-bombino/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Andrea Bombino</a><span style="background-color: transparent;">, Co-Founder and Head of Analytics Engineering at </span><a href="https://www.linkedin.com/company/astrafy/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astrafy</a><span style="background-color: transparent;">, shares insights into his team’s approach to optimizing data transformation and orchestration using tools like datasets and Pub/Sub to drive real-time processing. Andrea explains how they leverage Apache Airflow and Google Cloud to power dynamic data workflows.&nbsp;</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(01:55) Astrafy helps companies manage data using Google Cloud.</span></p><p><span style="background-color: transparent;">(04:36) Airflow is central to Astrafy’s data engineering efforts.</span></p><p><span style="background-color: transparent;">(07:17) Datasets and Pub/Sub are used for real-time workflows.</span></p><p><span style="background-color: transparent;">(09:59) Pub/Sub links multiple Airflow environments.</span></p><p><span style="background-color: transparent;">(12:40) Datasets eliminate the need for constant monitoring.</span></p><p><span style="background-color: transparent;">(15:22) Airflow updates have improved large-scale data operations.</span></p><p><span style="background-color: transparent;">(18:03) New Airflow API features make dataset updates easier.</span></p><p><span style="background-color: transparent;">(20:45) Real-time orchestration speeds up data processing for clients.</span></p><p><span style="background-color: transparent;">(23:26) Pub/Sub enhances flexibility across cloud environments.</span></p><p><span style="background-color: transparent;">(26:08) Future Airflow features will offer more control over data workflows.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/andrea-bombino/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Andrea Bombino</a> -</p><p>https://www.linkedin.com/in/andrea-bombino/</p><p><a href="https://www.linkedin.com/company/astrafy/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astrafy</a> -</p><p>https://www.linkedin.com/company/astrafy/</p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><a href="https://cloud.google.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Google Cloud</a> -</p><p>https://cloud.google.com/</p><p><a href="https://www.getdbt.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbt</a> -</p><p>https://www.getdbt.com/</p><p><a href="https://astronomer.typeform.com/airflowsurvey24" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow Survey</a> -</p><p>https://astronomer.typeform.com/airflowsurvey24</p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent;">Efficient data orchestration is the backbone of modern analytics and AI-driven workflows. Without the right tools, even the best data can fall short of its potential. In this episode, </span><a href="https://www.linkedin.com/in/andrea-bombino/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Andrea Bombino</a><span style="background-color: transparent;">, Co-Founder and Head of Analytics Engineering at </span><a href="https://www.linkedin.com/company/astrafy/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astrafy</a><span style="background-color: transparent;">, shares insights into his team’s approach to optimizing data transformation and orchestration using tools like datasets and Pub/Sub to drive real-time processing. Andrea explains how they leverage Apache Airflow and Google Cloud to power dynamic data workflows.&nbsp;</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(01:55) Astrafy helps companies manage data using Google Cloud.</span></p><p><span style="background-color: transparent;">(04:36) Airflow is central to Astrafy’s data engineering efforts.</span></p><p><span style="background-color: transparent;">(07:17) Datasets and Pub/Sub are used for real-time workflows.</span></p><p><span style="background-color: transparent;">(09:59) Pub/Sub links multiple Airflow environments.</span></p><p><span style="background-color: transparent;">(12:40) Datasets eliminate the need for constant monitoring.</span></p><p><span style="background-color: transparent;">(15:22) Airflow updates have improved large-scale data operations.</span></p><p><span style="background-color: transparent;">(18:03) New Airflow API features make dataset updates easier.</span></p><p><span style="background-color: transparent;">(20:45) Real-time orchestration speeds up data processing for clients.</span></p><p><span style="background-color: transparent;">(23:26) Pub/Sub enhances flexibility across cloud environments.</span></p><p><span style="background-color: transparent;">(26:08) Future Airflow features will offer more control over data workflows.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/andrea-bombino/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Andrea Bombino</a> -</p><p>https://www.linkedin.com/in/andrea-bombino/</p><p><a href="https://www.linkedin.com/company/astrafy/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astrafy</a> -</p><p>https://www.linkedin.com/company/astrafy/</p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><a href="https://cloud.google.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Google Cloud</a> -</p><p>https://cloud.google.com/</p><p><a href="https://www.getdbt.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">dbt</a> -</p><p>https://www.getdbt.com/</p><p><a href="https://astronomer.typeform.com/airflowsurvey24" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow Survey</a> -</p><p>https://astronomer.typeform.com/airflowsurvey24</p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Efficient data orchestration is the backbone of modern analytics and AI-driven workflows. Without the right tools, even the best data can fall short of its potential. In this episode, Andrea Bombino, Co-Founder and Head of Analytics Engineering at ...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>16</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[0c2ed4c0-a230-4e83-9b02-c7cd87dca2b0]]></guid>
  <title><![CDATA[Inside Airflow 3: Redefining Data Engineering with Vikram Koka]]></title>
  <description><![CDATA[<p><span style="background-color: transparent;">Data orchestration is evolving faster than ever and Apache Airflow 3 is set to revolutionize how enterprises handle complex workflows. In this episode, we dive into the exciting advancements with </span><a href="https://www.linkedin.com/in/vikramkoka/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Vikram Koka</a><span style="background-color: transparent;">, Chief Strategy Officer at </span><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a><span style="background-color: transparent;"> and PMC Member at </span><a href="https://www.linkedin.com/company/the-apache-software-foundation/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">The Apache Software Foundation</a><span style="background-color: transparent;">. Vikram shares his insights on the evolution of Airflow and its pivotal role in shaping modern data-driven workflows, particularly with the upcoming release of Airflow 3.</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(02:36) Vikram leads Astronomer’s engineering and open-source teams for Airflow.</span></p><p><span style="background-color: transparent;">(05:26) Airflow enables reliable data ingestion and curation.</span></p><p><span style="background-color: transparent;">(08:17) Enterprises use Airflow for mission-critical data pipelines.</span></p><p><span style="background-color: transparent;">(11:08) Airflow 3 introduces major architectural updates.</span></p><p><span style="background-color: transparent;">(13:58) Multi-cloud and edge deployments are supported in Airflow 3.</span></p><p><span style="background-color: transparent;">(16:49) Event-driven scheduling makes Airflow more dynamic.</span></p><p><span style="background-color: transparent;">(19:40) Tasks in Airflow 3 can run in any language.</span></p><p><span style="background-color: transparent;">(22:30) Multilingual task support is crucial for enterprises.</span></p><p><span style="background-color: transparent;">(25:21) Data assets and event-based integration enhance orchestration.</span></p><p><span style="background-color: transparent;">(28:12) Community feedback plays a vital role in Airflow 3.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/vikramkoka/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Vikram Koka</a> -</p><p>https://www.linkedin.com/in/vikramkoka/</p><p><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> -</p><p>https://www.linkedin.com/company/astronomer/</p><p><a href="https://www.linkedin.com/company/the-apache-software-foundation/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">The Apache Software Foundation</a><a href="https://www.linkedin.com/company/the-apache-software-foundation/" target="_blank"> LinkedIn</a> -</p><p>https://www.linkedin.com/company/the-apache-software-foundation/</p><p><a href="https://www.linkedin.com/company/apache-airflow/" target="_blank" style="color: rgba(0, 0, 0, 0.9);">Apache Airflow LinkedIn</a><span style="color: rgba(0, 0, 0, 0.9);"> - </span></p><p>https://www.linkedin.com/company/apache-airflow/</p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><a href="https://www.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> -</p><p>https://www.astronomer.io/</p><p><a href="https://www.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">The Apache Software Foundation</a> -</p><p>https://www.apache.org/</p><p><a href="https://airflow.apache.org/community/" target="_blank">Join the Airflow slack and/or Dev list</a> -</p><p>https://airflow.apache.org/community/</p><p><a href="https://astronomer.typeform.com/airflowsurvey24" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow Survey</a> -</p><p>https://astronomer.typeform.com/airflowsurvey24</p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/72ef2b0b-cdce-4bd8-84a9-5f0355fb4499/bd6d5b491e.jpg" />
  <pubDate>Thu, 31 Oct 2024 06:01:00 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="28931215" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/72ef2b0b-cdce-4bd8-84a9-5f0355fb4499/episode.mp3" />
  <itunes:title><![CDATA[Inside Airflow 3: Redefining Data Engineering with Vikram Koka]]></itunes:title>
  <itunes:duration>30:08</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent;">Data orchestration is evolving faster than ever and Apache Airflow 3 is set to revolutionize how enterprises handle complex workflows. In this episode, we dive into the exciting advancements with </span><a href="https://www.linkedin.com/in/vikramkoka/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Vikram Koka</a><span style="background-color: transparent;">, Chief Strategy Officer at </span><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a><span style="background-color: transparent;"> and PMC Member at </span><a href="https://www.linkedin.com/company/the-apache-software-foundation/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">The Apache Software Foundation</a><span style="background-color: transparent;">. Vikram shares his insights on the evolution of Airflow and its pivotal role in shaping modern data-driven workflows, particularly with the upcoming release of Airflow 3.</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(02:36) Vikram leads Astronomer’s engineering and open-source teams for Airflow.</span></p><p><span style="background-color: transparent;">(05:26) Airflow enables reliable data ingestion and curation.</span></p><p><span style="background-color: transparent;">(08:17) Enterprises use Airflow for mission-critical data pipelines.</span></p><p><span style="background-color: transparent;">(11:08) Airflow 3 introduces major architectural updates.</span></p><p><span style="background-color: transparent;">(13:58) Multi-cloud and edge deployments are supported in Airflow 3.</span></p><p><span style="background-color: transparent;">(16:49) Event-driven scheduling makes Airflow more dynamic.</span></p><p><span style="background-color: transparent;">(19:40) Tasks in Airflow 3 can run in any language.</span></p><p><span style="background-color: transparent;">(22:30) Multilingual task support is crucial for enterprises.</span></p><p><span style="background-color: transparent;">(25:21) Data assets and event-based integration enhance orchestration.</span></p><p><span style="background-color: transparent;">(28:12) Community feedback plays a vital role in Airflow 3.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/vikramkoka/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Vikram Koka</a> -</p><p>https://www.linkedin.com/in/vikramkoka/</p><p><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> -</p><p>https://www.linkedin.com/company/astronomer/</p><p><a href="https://www.linkedin.com/company/the-apache-software-foundation/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">The Apache Software Foundation</a><a href="https://www.linkedin.com/company/the-apache-software-foundation/" target="_blank"> LinkedIn</a> -</p><p>https://www.linkedin.com/company/the-apache-software-foundation/</p><p><a href="https://www.linkedin.com/company/apache-airflow/" target="_blank" style="color: rgba(0, 0, 0, 0.9);">Apache Airflow LinkedIn</a><span style="color: rgba(0, 0, 0, 0.9);"> - </span></p><p>https://www.linkedin.com/company/apache-airflow/</p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><a href="https://www.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> -</p><p>https://www.astronomer.io/</p><p><a href="https://www.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">The Apache Software Foundation</a> -</p><p>https://www.apache.org/</p><p><a href="https://airflow.apache.org/community/" target="_blank">Join the Airflow slack and/or Dev list</a> -</p><p>https://airflow.apache.org/community/</p><p><a href="https://astronomer.typeform.com/airflowsurvey24" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow Survey</a> -</p><p>https://astronomer.typeform.com/airflowsurvey24</p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent;">Data orchestration is evolving faster than ever and Apache Airflow 3 is set to revolutionize how enterprises handle complex workflows. In this episode, we dive into the exciting advancements with </span><a href="https://www.linkedin.com/in/vikramkoka/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Vikram Koka</a><span style="background-color: transparent;">, Chief Strategy Officer at </span><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a><span style="background-color: transparent;"> and PMC Member at </span><a href="https://www.linkedin.com/company/the-apache-software-foundation/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">The Apache Software Foundation</a><span style="background-color: transparent;">. Vikram shares his insights on the evolution of Airflow and its pivotal role in shaping modern data-driven workflows, particularly with the upcoming release of Airflow 3.</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(02:36) Vikram leads Astronomer’s engineering and open-source teams for Airflow.</span></p><p><span style="background-color: transparent;">(05:26) Airflow enables reliable data ingestion and curation.</span></p><p><span style="background-color: transparent;">(08:17) Enterprises use Airflow for mission-critical data pipelines.</span></p><p><span style="background-color: transparent;">(11:08) Airflow 3 introduces major architectural updates.</span></p><p><span style="background-color: transparent;">(13:58) Multi-cloud and edge deployments are supported in Airflow 3.</span></p><p><span style="background-color: transparent;">(16:49) Event-driven scheduling makes Airflow more dynamic.</span></p><p><span style="background-color: transparent;">(19:40) Tasks in Airflow 3 can run in any language.</span></p><p><span style="background-color: transparent;">(22:30) Multilingual task support is crucial for enterprises.</span></p><p><span style="background-color: transparent;">(25:21) Data assets and event-based integration enhance orchestration.</span></p><p><span style="background-color: transparent;">(28:12) Community feedback plays a vital role in Airflow 3.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/vikramkoka/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Vikram Koka</a> -</p><p>https://www.linkedin.com/in/vikramkoka/</p><p><a href="https://www.linkedin.com/company/astronomer/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> -</p><p>https://www.linkedin.com/company/astronomer/</p><p><a href="https://www.linkedin.com/company/the-apache-software-foundation/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">The Apache Software Foundation</a><a href="https://www.linkedin.com/company/the-apache-software-foundation/" target="_blank"> LinkedIn</a> -</p><p>https://www.linkedin.com/company/the-apache-software-foundation/</p><p><a href="https://www.linkedin.com/company/apache-airflow/" target="_blank" style="color: rgba(0, 0, 0, 0.9);">Apache Airflow LinkedIn</a><span style="color: rgba(0, 0, 0, 0.9);"> - </span></p><p>https://www.linkedin.com/company/apache-airflow/</p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><a href="https://www.astronomer.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Astronomer</a> -</p><p>https://www.astronomer.io/</p><p><a href="https://www.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">The Apache Software Foundation</a> -</p><p>https://www.apache.org/</p><p><a href="https://airflow.apache.org/community/" target="_blank">Join the Airflow slack and/or Dev list</a> -</p><p>https://airflow.apache.org/community/</p><p><a href="https://astronomer.typeform.com/airflowsurvey24" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow Survey</a> -</p><p>https://astronomer.typeform.com/airflowsurvey24</p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Data orchestration is evolving faster than ever and Apache Airflow 3 is set to revolutionize how enterprises handle complex workflows. In this episode, we dive into the exciting advancements with Vikram Koka, Chief Strategy Officer at Astronomer an...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>15</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[a0f4a479-f143-4230-9db0-f0f25ed889dd]]></guid>
  <title><![CDATA[Building a Data-Driven HR Platform at 15Five with Guy Dassa]]></title>
  <description><![CDATA[<p><span style="background-color: transparent;">Data and AI are revolutionizing HR, empowering leaders to measure performance and drive strategic decisions like never before.&nbsp;</span></p><p><span style="background-color: transparent;">In this episode, we explore the transformation of HR technology with </span><a href="https://www.linkedin.com/in/guydassa/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Guy Dassa</a><span style="background-color: transparent;">, Chief Technology Officer at </span><a href="https://www.linkedin.com/company/15five/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">15Five</a><span style="background-color: transparent;">, as he shares insights into their evolving data platform. Guy discusses how 15Five equips HR leaders with tools to measure and take action on team performance, engagement and retention. He explains their data-driven approach, highlighting how Apache Airflow supports their data ingestion, transformation, and AI integration.</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(01:54) 15Five acts as a command center for HR leaders.</span></p><p><span style="background-color: transparent;">(03:40) Tools like performance reviews, engagement surveys, and an insights dashboard guide actionable HR steps.</span></p><p><span style="background-color: transparent;">(05:33) Data visualization, insights, and action recommendations enhance HR effectiveness to improve their people's outcomes.</span></p><p><span style="background-color: transparent;">(07:08) Strict data confidentiality and sanitized AI model training.</span></p><p><span style="background-color: transparent;">(09:21) Airflow is central to data transformation and enrichment.</span></p><p><span style="background-color: transparent;">(11:15) Airflow enrichment DAGs integrate AI models.</span></p><p><span style="background-color: transparent;">(13:33) Integration of Airflow and DBT enables efficient data transformation.</span></p><p><span style="background-color: transparent;">(15:28) Synchronization challenges arise with reverse ETL processes.</span></p><p><span style="background-color: transparent;">(17:10) Future plans include deeper Airflow integration with AI.</span></p><p><span style="background-color: transparent;">(19:31) Emphasizing the need for DAG versioning and improved dependency visibility.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/guydassa/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Guy Dassa</a> -</p><p>https://www.linkedin.com/in/guydassa/</p><p><a href="https://www.linkedin.com/company/15five/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">15Five</a> -</p><p>https://www.linkedin.com/company/15five/</p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><a href="https://mlflow.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">MLflow</a> -</p><p>https://mlflow.org/</p><p><a href="https://www.getdbt.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">DBT</a> -</p><p>https://www.getdbt.com/</p><p><a href="https://kubernetes.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a> -</p><p>https://kubernetes.io/</p><p><a href="https://aws.amazon.com/redshift/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">RedShift</a> -</p><p>https://aws.amazon.com/redshift/</p><p><a href="https://www.15five.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">15Five</a> -</p><p>https://www.15five.com/</p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/93a422f0-6b98-4828-95f7-cf95a76aa9bb/46ca9ca528.jpg" />
  <pubDate>Thu, 24 Oct 2024 06:00:00 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="19603620" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/93a422f0-6b98-4828-95f7-cf95a76aa9bb/episode.mp3" />
  <itunes:title><![CDATA[Building a Data-Driven HR Platform at 15Five with Guy Dassa]]></itunes:title>
  <itunes:duration>20:25</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent;">Data and AI are revolutionizing HR, empowering leaders to measure performance and drive strategic decisions like never before.&nbsp;</span></p><p><span style="background-color: transparent;">In this episode, we explore the transformation of HR technology with </span><a href="https://www.linkedin.com/in/guydassa/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Guy Dassa</a><span style="background-color: transparent;">, Chief Technology Officer at </span><a href="https://www.linkedin.com/company/15five/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">15Five</a><span style="background-color: transparent;">, as he shares insights into their evolving data platform. Guy discusses how 15Five equips HR leaders with tools to measure and take action on team performance, engagement and retention. He explains their data-driven approach, highlighting how Apache Airflow supports their data ingestion, transformation, and AI integration.</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(01:54) 15Five acts as a command center for HR leaders.</span></p><p><span style="background-color: transparent;">(03:40) Tools like performance reviews, engagement surveys, and an insights dashboard guide actionable HR steps.</span></p><p><span style="background-color: transparent;">(05:33) Data visualization, insights, and action recommendations enhance HR effectiveness to improve their people's outcomes.</span></p><p><span style="background-color: transparent;">(07:08) Strict data confidentiality and sanitized AI model training.</span></p><p><span style="background-color: transparent;">(09:21) Airflow is central to data transformation and enrichment.</span></p><p><span style="background-color: transparent;">(11:15) Airflow enrichment DAGs integrate AI models.</span></p><p><span style="background-color: transparent;">(13:33) Integration of Airflow and DBT enables efficient data transformation.</span></p><p><span style="background-color: transparent;">(15:28) Synchronization challenges arise with reverse ETL processes.</span></p><p><span style="background-color: transparent;">(17:10) Future plans include deeper Airflow integration with AI.</span></p><p><span style="background-color: transparent;">(19:31) Emphasizing the need for DAG versioning and improved dependency visibility.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/guydassa/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Guy Dassa</a> -</p><p>https://www.linkedin.com/in/guydassa/</p><p><a href="https://www.linkedin.com/company/15five/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">15Five</a> -</p><p>https://www.linkedin.com/company/15five/</p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><a href="https://mlflow.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">MLflow</a> -</p><p>https://mlflow.org/</p><p><a href="https://www.getdbt.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">DBT</a> -</p><p>https://www.getdbt.com/</p><p><a href="https://kubernetes.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a> -</p><p>https://kubernetes.io/</p><p><a href="https://aws.amazon.com/redshift/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">RedShift</a> -</p><p>https://aws.amazon.com/redshift/</p><p><a href="https://www.15five.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">15Five</a> -</p><p>https://www.15five.com/</p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent;">Data and AI are revolutionizing HR, empowering leaders to measure performance and drive strategic decisions like never before.&nbsp;</span></p><p><span style="background-color: transparent;">In this episode, we explore the transformation of HR technology with </span><a href="https://www.linkedin.com/in/guydassa/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Guy Dassa</a><span style="background-color: transparent;">, Chief Technology Officer at </span><a href="https://www.linkedin.com/company/15five/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">15Five</a><span style="background-color: transparent;">, as he shares insights into their evolving data platform. Guy discusses how 15Five equips HR leaders with tools to measure and take action on team performance, engagement and retention. He explains their data-driven approach, highlighting how Apache Airflow supports their data ingestion, transformation, and AI integration.</span></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(01:54) 15Five acts as a command center for HR leaders.</span></p><p><span style="background-color: transparent;">(03:40) Tools like performance reviews, engagement surveys, and an insights dashboard guide actionable HR steps.</span></p><p><span style="background-color: transparent;">(05:33) Data visualization, insights, and action recommendations enhance HR effectiveness to improve their people's outcomes.</span></p><p><span style="background-color: transparent;">(07:08) Strict data confidentiality and sanitized AI model training.</span></p><p><span style="background-color: transparent;">(09:21) Airflow is central to data transformation and enrichment.</span></p><p><span style="background-color: transparent;">(11:15) Airflow enrichment DAGs integrate AI models.</span></p><p><span style="background-color: transparent;">(13:33) Integration of Airflow and DBT enables efficient data transformation.</span></p><p><span style="background-color: transparent;">(15:28) Synchronization challenges arise with reverse ETL processes.</span></p><p><span style="background-color: transparent;">(17:10) Future plans include deeper Airflow integration with AI.</span></p><p><span style="background-color: transparent;">(19:31) Emphasizing the need for DAG versioning and improved dependency visibility.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://www.linkedin.com/in/guydassa/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Guy Dassa</a> -</p><p>https://www.linkedin.com/in/guydassa/</p><p><a href="https://www.linkedin.com/company/15five/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">15Five</a> -</p><p>https://www.linkedin.com/company/15five/</p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> -</p><p>https://airflow.apache.org/</p><p><a href="https://mlflow.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">MLflow</a> -</p><p>https://mlflow.org/</p><p><a href="https://www.getdbt.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">DBT</a> -</p><p>https://www.getdbt.com/</p><p><a href="https://kubernetes.io/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Kubernetes</a> -</p><p>https://kubernetes.io/</p><p><a href="https://aws.amazon.com/redshift/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">RedShift</a> -</p><p>https://aws.amazon.com/redshift/</p><p><a href="https://www.15five.com/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">15Five</a> -</p><p>https://www.15five.com/</p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Data and AI are revolutionizing HR, empowering leaders to measure performance and drive strategic decisions like never before. In this episode, we explore the transformation of HR technology with Guy Dassa, Chief Technology Officer at 15Five, as he...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>14</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[829c9df0-718a-4a0a-acd3-83d6d9bf903e]]></guid>
  <title><![CDATA[The Intersection of AI and Data Management at Dosu with Devin Stein]]></title>
  <description><![CDATA[<p><span style="background-color: transparent;">Unlocking engineering productivity goes beyond coding — it’s about managing knowledge efficiently. In this episode, we explore the innovative ways in which Dosu leverages Airflow for data orchestration and supports the Airflow project.&nbsp;</span></p><p><br></p><p><a href="https://www.linkedin.com/in/devstein/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Devin Stein</a><span style="background-color: transparent;">, Founder of </span><a href="https://dosu.dev/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Dosu</a><span style="background-color: transparent;">, shares his insights on how engineering teams can focus on value-added work by automating knowledge management. Devin dives into Dosu’s purpose, the significance of AI in their product, and why they chose Airflow as the backbone for scheduling and data management.&nbsp;</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(01:33) Dosu's mission to democratize engineering knowledge.</span></p><p><span style="background-color: transparent;">(05:00) AI is central to Dosu's product for structuring engineering knowledge.</span></p><p><span style="background-color: transparent;">(06:23) The importance of maintaining up-to-date data for AI effectiveness.</span></p><p><span style="background-color: transparent;">(07:55) How Airflow supports Dosu’s data ingestion and automation processes.</span></p><p><span style="background-color: transparent;">(08:45) The reasoning behind choosing Airflow over other orchestrators.</span></p><p><span style="background-color: transparent;">(11:00) Airflow enables Dosu to manage both traditional ETL and dynamic workflows.</span></p><p><span style="background-color: transparent;">(13:04) Dosu assists the Airflow project by auto-labeling issues and discussions.</span></p><p><span style="background-color: transparent;">(14:56) Thoughtful collaboration with the Airflow community to introduce AI tools.</span></p><p><span style="background-color: transparent;">(16:37) The potential of Airflow to handle more dynamic, scheduled workflows in the future.</span></p><p><span style="background-color: transparent;">(18:00) Challenges and custom solutions for implementing dynamic workflows in Airflow.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> - https://airflow.apache.org/</p><p><a href="https://dosu.dev/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Dosu Website</a> - https://dosu.dev/</p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/cohost/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/c8512979-d057-4d75-9a70-968599884b02/8f6999a7f6.jpg" />
  <pubDate>Fri, 04 Oct 2024 06:53:05 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="19499966" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/c8512979-d057-4d75-9a70-968599884b02/episode.mp3" />
  <itunes:title><![CDATA[The Intersection of AI and Data Management at Dosu with Devin Stein]]></itunes:title>
  <itunes:duration>20:18</itunes:duration>
  <itunes:summary><![CDATA[<p><span style="background-color: transparent;">Unlocking engineering productivity goes beyond coding — it’s about managing knowledge efficiently. In this episode, we explore the innovative ways in which Dosu leverages Airflow for data orchestration and supports the Airflow project.&nbsp;</span></p><p><br></p><p><a href="https://www.linkedin.com/in/devstein/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Devin Stein</a><span style="background-color: transparent;">, Founder of </span><a href="https://dosu.dev/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Dosu</a><span style="background-color: transparent;">, shares his insights on how engineering teams can focus on value-added work by automating knowledge management. Devin dives into Dosu’s purpose, the significance of AI in their product, and why they chose Airflow as the backbone for scheduling and data management.&nbsp;</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(01:33) Dosu's mission to democratize engineering knowledge.</span></p><p><span style="background-color: transparent;">(05:00) AI is central to Dosu's product for structuring engineering knowledge.</span></p><p><span style="background-color: transparent;">(06:23) The importance of maintaining up-to-date data for AI effectiveness.</span></p><p><span style="background-color: transparent;">(07:55) How Airflow supports Dosu’s data ingestion and automation processes.</span></p><p><span style="background-color: transparent;">(08:45) The reasoning behind choosing Airflow over other orchestrators.</span></p><p><span style="background-color: transparent;">(11:00) Airflow enables Dosu to manage both traditional ETL and dynamic workflows.</span></p><p><span style="background-color: transparent;">(13:04) Dosu assists the Airflow project by auto-labeling issues and discussions.</span></p><p><span style="background-color: transparent;">(14:56) Thoughtful collaboration with the Airflow community to introduce AI tools.</span></p><p><span style="background-color: transparent;">(16:37) The potential of Airflow to handle more dynamic, scheduled workflows in the future.</span></p><p><span style="background-color: transparent;">(18:00) Challenges and custom solutions for implementing dynamic workflows in Airflow.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> - https://airflow.apache.org/</p><p><a href="https://dosu.dev/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Dosu Website</a> - https://dosu.dev/</p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></itunes:summary>
  <content:encoded><![CDATA[<p><span style="background-color: transparent;">Unlocking engineering productivity goes beyond coding — it’s about managing knowledge efficiently. In this episode, we explore the innovative ways in which Dosu leverages Airflow for data orchestration and supports the Airflow project.&nbsp;</span></p><p><br></p><p><a href="https://www.linkedin.com/in/devstein/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Devin Stein</a><span style="background-color: transparent;">, Founder of </span><a href="https://dosu.dev/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Dosu</a><span style="background-color: transparent;">, shares his insights on how engineering teams can focus on value-added work by automating knowledge management. Devin dives into Dosu’s purpose, the significance of AI in their product, and why they chose Airflow as the backbone for scheduling and data management.&nbsp;</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent;">Key Takeaways:</strong></p><p><br></p><p><span style="background-color: transparent;">(01:33) Dosu's mission to democratize engineering knowledge.</span></p><p><span style="background-color: transparent;">(05:00) AI is central to Dosu's product for structuring engineering knowledge.</span></p><p><span style="background-color: transparent;">(06:23) The importance of maintaining up-to-date data for AI effectiveness.</span></p><p><span style="background-color: transparent;">(07:55) How Airflow supports Dosu’s data ingestion and automation processes.</span></p><p><span style="background-color: transparent;">(08:45) The reasoning behind choosing Airflow over other orchestrators.</span></p><p><span style="background-color: transparent;">(11:00) Airflow enables Dosu to manage both traditional ETL and dynamic workflows.</span></p><p><span style="background-color: transparent;">(13:04) Dosu assists the Airflow project by auto-labeling issues and discussions.</span></p><p><span style="background-color: transparent;">(14:56) Thoughtful collaboration with the Airflow community to introduce AI tools.</span></p><p><span style="background-color: transparent;">(16:37) The potential of Airflow to handle more dynamic, scheduled workflows in the future.</span></p><p><span style="background-color: transparent;">(18:00) Challenges and custom solutions for implementing dynamic workflows in Airflow.</span></p><p><br></p><p><br></p><p><strong style="background-color: transparent;">Resources Mentioned:</strong></p><p><br></p><p><a href="https://airflow.apache.org/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Apache Airflow</a> - https://airflow.apache.org/</p><p><a href="https://dosu.dev/" target="_blank" style="background-color: transparent; color: rgb(17, 85, 204);">Dosu Website</a> - https://dosu.dev/</p><p><br></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering &amp; AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.</span></p><p><br></p><p><span style="background-color: transparent; color: rgb(22, 14, 61);">#AI #Automation #Airflow #MachineLearning</span></p>]]></content:encoded>
  <itunes:subtitle><![CDATA[Unlocking engineering productivity goes beyond coding — it’s about managing knowledge efficiently. In this episode, we explore the innovative ways in which Dosu leverages Airflow for data orchestration and supports the Airflow project. Devin Stein,...]]></itunes:subtitle>
 <itunes:keywords><![CDATA[]]></itunes:keywords>
  <itunes:explicit>false</itunes:explicit>
  <itunes:episodeType>full</itunes:episodeType>
  <itunes:episode>13</itunes:episode>
  <itunes:season>1</itunes:season>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[tag:soundcloud,2010:tracks/1915402430]]></guid>
  <title><![CDATA[AI-Powered Vehicle Automation at Ford Motor Company with Serjesh Sharma]]></title>
  <description><![CDATA[Harnessing data at scale is the key to driving innovation in autonomous vehicle technology. In this episode, we uncover how advanced orchestration tools are transforming machine learning operations in the automotive industry. Serjesh Sharma, Supervisor ADAS Machine Learning Operations (MLOps) at Ford Motor Company, joins us to discuss the challenges and innovations his team faces working to enhance vehicle safety and automation. Serjesh shares insights into the intricate data processes that support Ford’s Advanced Driver Assistance Systems (ADAS) and how his team leverages Apache Airflow to manage massive data loads efficiently.

Key Takeaways:

(01:44) ADAS involves advanced features like pre-collision assist and self-driving capabilities.
(04:47) Ensuring sensor accuracy and vehicle safety requires extensive data processing.
(05:08) The combination of on-prem and cloud infrastructure optimizes data handling.
(09:27) Ford processes around one petabyte of data per week, using both CPUs and GPUs.
(10:33) Implementing software engineering best practices to improve scalability and reliability.
(15:18) GitHub Issues streamline onboarding and infrastructure provisioning.
(17:00) Airflow's modular design allows Ford to manage complex data pipelines.
(19:00) Kubernetes pod operators help optimize resource usage for CPU-intensive tasks.
(20:35) Ford's scale challenges led to customized Airflow configurations for high concurrency.
(21:02) Advanced orchestration tools are pivotal in managing vast data landscapes in automotive innovation.

Resources Mentioned:

Serjesh Sharma - www.linkedin.com/in/serjeshsharma/
Ford Motor Company - www.linkedin.com/company/ford-motor-company/
Apache Airflow - airflow.apache.org/
Kubernetes - kubernetes.io/

Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering & AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.

#AI #Automation #Airflow #MachineLearning]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/quill-file-prod/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/a5f595a1-84c4-4369-ac8c-ef5f3f3b2cc5/cover-art/original_a4fec5305e0e714b9f2bcd0bc69fd5eb.jpg" />
  <pubDate>Thu, 12 Sep 2024 16:49:35 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="25143670" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/a5f595a1-84c4-4369-ac8c-ef5f3f3b2cc5/episode.mp3" />
  <itunes:title><![CDATA[AI-Powered Vehicle Automation at Ford Motor Company with Serjesh Sharma]]></itunes:title>
  <itunes:duration>26:11</itunes:duration>
  <itunes:summary><![CDATA[Harnessing data at scale is the key to driving innovation in autonomous vehicle technology. In this episode, we uncover how advanced orchestration tools are transforming machine learning operations in the automotive industry. Serjesh Sharma, Supervisor ADAS Machine Learning Operations (MLOps) at Ford Motor Company, joins us to discuss the challenges and innovations his team faces working to enhance vehicle safety and automation. Serjesh shares insights into the intricate data processes that support Ford’s Advanced Driver Assistance Systems (ADAS) and how his team leverages Apache Airflow to manage massive data loads efficiently.

Key Takeaways:

(01:44) ADAS involves advanced features like pre-collision assist and self-driving capabilities.
(04:47) Ensuring sensor accuracy and vehicle safety requires extensive data processing.
(05:08) The combination of on-prem and cloud infrastructure optimizes data handling.
(09:27) Ford processes around one petabyte of data per week, using both CPUs and GPUs.
(10:33) Implementing software engineering best practices to improve scalability and reliability.
(15:18) GitHub Issues streamline onboarding and infrastructure provisioning.
(17:00) Airflow's modular design allows Ford to manage complex data pipelines.
(19:00) Kubernetes pod operators help optimize resource usage for CPU-intensive tasks.
(20:35) Ford's scale challenges led to customized Airflow configurations for high concurrency.
(21:02) Advanced orchestration tools are pivotal in managing vast data landscapes in automotive innovation.

Resources Mentioned:

Serjesh Sharma - www.linkedin.com/in/serjeshsharma/
Ford Motor Company - www.linkedin.com/company/ford-motor-company/
Apache Airflow - airflow.apache.org/
Kubernetes - kubernetes.io/

Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering & AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.

#AI #Automation #Airflow #MachineLearning]]></itunes:summary>
  <content:encoded><![CDATA[Harnessing data at scale is the key to driving innovation in autonomous vehicle technology. In this episode, we uncover how advanced orchestration tools are transforming machine learning operations in the automotive industry. Serjesh Sharma, Supervisor ADAS Machine Learning Operations (MLOps) at Ford Motor Company, joins us to discuss the challenges and innovations his team faces working to enhance vehicle safety and automation. Serjesh shares insights into the intricate data processes that support Ford’s Advanced Driver Assistance Systems (ADAS) and how his team leverages Apache Airflow to manage massive data loads efficiently.

Key Takeaways:

(01:44) ADAS involves advanced features like pre-collision assist and self-driving capabilities.
(04:47) Ensuring sensor accuracy and vehicle safety requires extensive data processing.
(05:08) The combination of on-prem and cloud infrastructure optimizes data handling.
(09:27) Ford processes around one petabyte of data per week, using both CPUs and GPUs.
(10:33) Implementing software engineering best practices to improve scalability and reliability.
(15:18) GitHub Issues streamline onboarding and infrastructure provisioning.
(17:00) Airflow's modular design allows Ford to manage complex data pipelines.
(19:00) Kubernetes pod operators help optimize resource usage for CPU-intensive tasks.
(20:35) Ford's scale challenges led to customized Airflow configurations for high concurrency.
(21:02) Advanced orchestration tools are pivotal in managing vast data landscapes in automotive innovation.

Resources Mentioned:

Serjesh Sharma - www.linkedin.com/in/serjeshsharma/
Ford Motor Company - www.linkedin.com/company/ford-motor-company/
Apache Airflow - airflow.apache.org/
Kubernetes - kubernetes.io/

Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering & AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.

#AI #Automation #Airflow #MachineLearning]]></content:encoded>
  <itunes:subtitle><![CDATA[Harnessing data at scale is the key to driving innovation in autonomous vehicle technology. In this episode, we uncover how advanced orchestration tools are transforming machine learning operations in the automotive industry. Serjesh Sharma, Superv...]]></itunes:subtitle>
  <itunes:explicit>false</itunes:explicit>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[tag:soundcloud,2010:tracks/1911310895]]></guid>
  <title><![CDATA[From Task Failures to Operational Excellence at GumGum with Brendan Frick]]></title>
  <description><![CDATA[Data failures are inevitable but how you manage them can define the success of your operations. In this episode, we dive deep into the challenges of data engineering and AI with Brendan Frick, Senior Engineering Manager, Data at GumGum. Brendan shares his unique approach to managing task failures and DAG issues in a high-stakes ad-tech environment.

Brendan discusses how GumGum leverages Apache Airflow to streamline data processes, ensuring efficient data movement and orchestration while minimizing disruptions in their operations.

Key Takeaways:

(02:02) Brendan’s role at GumGum and its approach to ad tech.
(04:27) How GumGum uses Airflow for daily data orchestration, moving data from S3 to warehouses.
(07:02) Handling task failures in Airflow using Jira for actionable, developer-friendly responses.
(09:13) Transitioning from email alerts to a more structured system with Jira and PagerDuty.
(11:40) Monitoring task retry rates as a key metric to identify potential issues early.
(14:15) Utilizing Looker dashboards to track and analyze task performance and retry rates.
(16:39) Transitioning from Kubernetes operator to a more reliable system for data processing.
(19:25) The importance of automating stakeholder communication with data lineage tools like Atlan.
(20:48) Implementing data contracts to ensure SLAs are met across all data processes.
(22:01) The role of scalable SLAs in Airflow to ensure data reliability and meet business needs.

Resources Mentioned:

Brendan Frick -
https://www.linkedin.com/in/brendan-frick-399345107/
GumGum -
https://www.linkedin.com/company/gumgum/
Apache Airflow -
https://airflow.apache.org/
Jira -
https://www.atlassian.com/software/jira
Atlan -
https://atlan.com/
Kubernetes -
https://kubernetes.io/


Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering & AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.


#AI #Automation #Airflow #MachineLearning]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/quill-file-prod/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/e7eee171-ffa2-4a6a-b11d-dc0f960cd32b/cover-art/original_ab670c00874ed213905644959c070da1.jpg" />
  <pubDate>Fri, 06 Sep 2024 06:32:20 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="46295121" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/e7eee171-ffa2-4a6a-b11d-dc0f960cd32b/episode.mp3" />
  <itunes:title><![CDATA[From Task Failures to Operational Excellence at GumGum with Brendan Frick]]></itunes:title>
  <itunes:duration>24:06</itunes:duration>
  <itunes:summary><![CDATA[Data failures are inevitable but how you manage them can define the success of your operations. In this episode, we dive deep into the challenges of data engineering and AI with Brendan Frick, Senior Engineering Manager, Data at GumGum. Brendan shares his unique approach to managing task failures and DAG issues in a high-stakes ad-tech environment.

Brendan discusses how GumGum leverages Apache Airflow to streamline data processes, ensuring efficient data movement and orchestration while minimizing disruptions in their operations.

Key Takeaways:

(02:02) Brendan’s role at GumGum and its approach to ad tech.
(04:27) How GumGum uses Airflow for daily data orchestration, moving data from S3 to warehouses.
(07:02) Handling task failures in Airflow using Jira for actionable, developer-friendly responses.
(09:13) Transitioning from email alerts to a more structured system with Jira and PagerDuty.
(11:40) Monitoring task retry rates as a key metric to identify potential issues early.
(14:15) Utilizing Looker dashboards to track and analyze task performance and retry rates.
(16:39) Transitioning from Kubernetes operator to a more reliable system for data processing.
(19:25) The importance of automating stakeholder communication with data lineage tools like Atlan.
(20:48) Implementing data contracts to ensure SLAs are met across all data processes.
(22:01) The role of scalable SLAs in Airflow to ensure data reliability and meet business needs.

Resources Mentioned:

Brendan Frick -
https://www.linkedin.com/in/brendan-frick-399345107/
GumGum -
https://www.linkedin.com/company/gumgum/
Apache Airflow -
https://airflow.apache.org/
Jira -
https://www.atlassian.com/software/jira
Atlan -
https://atlan.com/
Kubernetes -
https://kubernetes.io/


Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering & AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.


#AI #Automation #Airflow #MachineLearning]]></itunes:summary>
  <content:encoded><![CDATA[Data failures are inevitable but how you manage them can define the success of your operations. In this episode, we dive deep into the challenges of data engineering and AI with Brendan Frick, Senior Engineering Manager, Data at GumGum. Brendan shares his unique approach to managing task failures and DAG issues in a high-stakes ad-tech environment.

Brendan discusses how GumGum leverages Apache Airflow to streamline data processes, ensuring efficient data movement and orchestration while minimizing disruptions in their operations.

Key Takeaways:

(02:02) Brendan’s role at GumGum and its approach to ad tech.
(04:27) How GumGum uses Airflow for daily data orchestration, moving data from S3 to warehouses.
(07:02) Handling task failures in Airflow using Jira for actionable, developer-friendly responses.
(09:13) Transitioning from email alerts to a more structured system with Jira and PagerDuty.
(11:40) Monitoring task retry rates as a key metric to identify potential issues early.
(14:15) Utilizing Looker dashboards to track and analyze task performance and retry rates.
(16:39) Transitioning from Kubernetes operator to a more reliable system for data processing.
(19:25) The importance of automating stakeholder communication with data lineage tools like Atlan.
(20:48) Implementing data contracts to ensure SLAs are met across all data processes.
(22:01) The role of scalable SLAs in Airflow to ensure data reliability and meet business needs.

Resources Mentioned:

Brendan Frick -
https://www.linkedin.com/in/brendan-frick-399345107/
GumGum -
https://www.linkedin.com/company/gumgum/
Apache Airflow -
https://airflow.apache.org/
Jira -
https://www.atlassian.com/software/jira
Atlan -
https://atlan.com/
Kubernetes -
https://kubernetes.io/


Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering & AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.


#AI #Automation #Airflow #MachineLearning]]></content:encoded>
  <itunes:subtitle><![CDATA[Data failures are inevitable but how you manage them can define the success of your operations. In this episode, we dive deep into the challenges of data engineering and AI with Brendan Frick, Senior Engineering Manager, Data at GumGum. Brendan sha...]]></itunes:subtitle>
  <itunes:explicit>false</itunes:explicit>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[tag:soundcloud,2010:tracks/1906370054]]></guid>
  <title><![CDATA[From Sensors to Datasets: Enhancing Airflow at Astronomer with Maggie Stark and Marion Azoulai]]></title>
  <description><![CDATA[A 13% reduction in failure rates — this is how two data scientists at Astronomer revolutionized their data pipelines using Apache Airflow.

In this episode, we enter the world of data orchestration and AI with Maggie Stark and Marion Azoulai, both Senior Data Scientists at Astronomer. Maggie and Marion discuss how their team re-architected their use of Airflow to improve scalability, reliability and efficiency in data processing. They share insights on overcoming challenges with sensors and how moving to datasets transformed their workflows.

Key Takeaways:

(02:23) The data team’s role as a centralized hub within Astronomer.
(05:11) Airflow is the backbone of all data processes, running 60,000 tasks daily.
(07:13) Custom task groups enable efficient code reuse and adherence to best practices.
(11:33) Sensor-heavy architectures can lead to cascading failures and resource issues.
(12:09) Switching to datasets has improved reliability and scalability.
(14:19) Building a control DAG provides end-to-end visibility of pipelines.
(16:42) Breaking down DAGs into smaller units minimizes failures and improves management.
(19:02) Failure rates improved from 16% to 3% with the new architecture.

Resources Mentioned:

Maggie Stark -
https://www.linkedin.com/in/margaretstark/
Marion Azoulai -
https://www.linkedin.com/in/marionazoulai/
Astronomer | LinkedIn -
https://www.linkedin.com/company/astronomer/
Apache Airflow -
https://airflow.apache.org/
Astronomer | Website -
https://www.astronomer.io/


Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering & AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.


#AI #Automation #Airflow #MachineLearning]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/quill-file-prod/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/35f1eecc-e084-44d1-a39f-3c7e03e0518f/cover-art/original_6758da5b279197db8187015edab27fcd.jpg" />
  <pubDate>Thu, 29 Aug 2024 15:44:33 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="21523306" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/35f1eecc-e084-44d1-a39f-3c7e03e0518f/episode.mp3" />
  <itunes:title><![CDATA[From Sensors to Datasets: Enhancing Airflow at Astronomer with Maggie Stark and Marion Azoulai]]></itunes:title>
  <itunes:duration>22:25</itunes:duration>
  <itunes:summary><![CDATA[A 13% reduction in failure rates — this is how two data scientists at Astronomer revolutionized their data pipelines using Apache Airflow.

In this episode, we enter the world of data orchestration and AI with Maggie Stark and Marion Azoulai, both Senior Data Scientists at Astronomer. Maggie and Marion discuss how their team re-architected their use of Airflow to improve scalability, reliability and efficiency in data processing. They share insights on overcoming challenges with sensors and how moving to datasets transformed their workflows.

Key Takeaways:

(02:23) The data team’s role as a centralized hub within Astronomer.
(05:11) Airflow is the backbone of all data processes, running 60,000 tasks daily.
(07:13) Custom task groups enable efficient code reuse and adherence to best practices.
(11:33) Sensor-heavy architectures can lead to cascading failures and resource issues.
(12:09) Switching to datasets has improved reliability and scalability.
(14:19) Building a control DAG provides end-to-end visibility of pipelines.
(16:42) Breaking down DAGs into smaller units minimizes failures and improves management.
(19:02) Failure rates improved from 16% to 3% with the new architecture.

Resources Mentioned:

Maggie Stark -
https://www.linkedin.com/in/margaretstark/
Marion Azoulai -
https://www.linkedin.com/in/marionazoulai/
Astronomer | LinkedIn -
https://www.linkedin.com/company/astronomer/
Apache Airflow -
https://airflow.apache.org/
Astronomer | Website -
https://www.astronomer.io/


Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering & AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.


#AI #Automation #Airflow #MachineLearning]]></itunes:summary>
  <content:encoded><![CDATA[A 13% reduction in failure rates — this is how two data scientists at Astronomer revolutionized their data pipelines using Apache Airflow.

In this episode, we enter the world of data orchestration and AI with Maggie Stark and Marion Azoulai, both Senior Data Scientists at Astronomer. Maggie and Marion discuss how their team re-architected their use of Airflow to improve scalability, reliability and efficiency in data processing. They share insights on overcoming challenges with sensors and how moving to datasets transformed their workflows.

Key Takeaways:

(02:23) The data team’s role as a centralized hub within Astronomer.
(05:11) Airflow is the backbone of all data processes, running 60,000 tasks daily.
(07:13) Custom task groups enable efficient code reuse and adherence to best practices.
(11:33) Sensor-heavy architectures can lead to cascading failures and resource issues.
(12:09) Switching to datasets has improved reliability and scalability.
(14:19) Building a control DAG provides end-to-end visibility of pipelines.
(16:42) Breaking down DAGs into smaller units minimizes failures and improves management.
(19:02) Failure rates improved from 16% to 3% with the new architecture.

Resources Mentioned:

Maggie Stark -
https://www.linkedin.com/in/margaretstark/
Marion Azoulai -
https://www.linkedin.com/in/marionazoulai/
Astronomer | LinkedIn -
https://www.linkedin.com/company/astronomer/
Apache Airflow -
https://airflow.apache.org/
Astronomer | Website -
https://www.astronomer.io/


Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering & AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.


#AI #Automation #Airflow #MachineLearning]]></content:encoded>
  <itunes:subtitle><![CDATA[A 13% reduction in failure rates — this is how two data scientists at Astronomer revolutionized their data pipelines using Apache Airflow.

In this episode, we enter the world of data orchestration and AI with Maggie Stark and Marion Azoulai, both ...]]></itunes:subtitle>
  <itunes:explicit>false</itunes:explicit>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[tag:soundcloud,2010:tracks/1904353025]]></guid>
  <title><![CDATA[Mastering Data Orchestration with Airflow at M Science with Ben Tallman]]></title>
  <description><![CDATA[Mastering the flow of data is essential for driving innovation and efficiency in today’s competitive landscape. In this episode, we explore the evolution of data orchestration and the pivotal role of Apache Airflow in modern data workflows.

Ben Tallman, Chief Technology Officer at M Science, joins us and shares his extensive experience with Airflow, detailing its early adoption, evolution and the profound impact it has had on data engineering practices. His insights reveal how leveraging Airflow can streamline complex data processes, enhance observability and ultimately drive business success.

Key Takeaways:

(02:31) Benjamin’s journey with Airflow and its early adoption.
(05:36) The transition from legacy schedulers to Airflow at Apigee and later Google.
(08:52) The challenges and benefits of running production-grade Airflow instances.
(10:46) How Airflow facilitates the management of large-scale data at M Science.
(11:56) The importance of reducing time to value for customers using data products.
(13:32) Airflow’s role in ensuring observability and reliability in data workflows.
(17:00) Managing petabytes of data and billions of records efficiently.
(19:08) Integration of various data sources and ensuring data product quality.
(20:04) Leveraging Airflow for data observability and reducing time to value.
(22:04) Benjamin’s vision for the future development of Airflow, including audit trails for variables.

Resources Mentioned:

Ben Tallman -
https://www.linkedin.com/in/btallman/
M Science -
https://www.linkedin.com/company/m-science-llc/
Apache Airflow -
https://airflow.apache.org/
Astronomer -
https://www.astronomer.io/
Databricks -
https://databricks.com/
Snowflake -
https://www.snowflake.com/

Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering & AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.


#AI #Automation #Airflow #MachineLearning]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/quill-file-prod/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/80d0267a-4e82-4d23-a2b5-a8a1d5b70a46/cover-art/original_48e148938b4c703f7deaaaee5385641f.jpg" />
  <pubDate>Mon, 26 Aug 2024 16:18:40 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="23617704" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/80d0267a-4e82-4d23-a2b5-a8a1d5b70a46/episode.mp3" />
  <itunes:title><![CDATA[Mastering Data Orchestration with Airflow at M Science with Ben Tallman]]></itunes:title>
  <itunes:duration>24:36</itunes:duration>
  <itunes:summary><![CDATA[Mastering the flow of data is essential for driving innovation and efficiency in today’s competitive landscape. In this episode, we explore the evolution of data orchestration and the pivotal role of Apache Airflow in modern data workflows.

Ben Tallman, Chief Technology Officer at M Science, joins us and shares his extensive experience with Airflow, detailing its early adoption, evolution and the profound impact it has had on data engineering practices. His insights reveal how leveraging Airflow can streamline complex data processes, enhance observability and ultimately drive business success.

Key Takeaways:

(02:31) Benjamin’s journey with Airflow and its early adoption.
(05:36) The transition from legacy schedulers to Airflow at Apigee and later Google.
(08:52) The challenges and benefits of running production-grade Airflow instances.
(10:46) How Airflow facilitates the management of large-scale data at M Science.
(11:56) The importance of reducing time to value for customers using data products.
(13:32) Airflow’s role in ensuring observability and reliability in data workflows.
(17:00) Managing petabytes of data and billions of records efficiently.
(19:08) Integration of various data sources and ensuring data product quality.
(20:04) Leveraging Airflow for data observability and reducing time to value.
(22:04) Benjamin’s vision for the future development of Airflow, including audit trails for variables.

Resources Mentioned:

Ben Tallman -
https://www.linkedin.com/in/btallman/
M Science -
https://www.linkedin.com/company/m-science-llc/
Apache Airflow -
https://airflow.apache.org/
Astronomer -
https://www.astronomer.io/
Databricks -
https://databricks.com/
Snowflake -
https://www.snowflake.com/

Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering & AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.


#AI #Automation #Airflow #MachineLearning]]></itunes:summary>
  <content:encoded><![CDATA[Mastering the flow of data is essential for driving innovation and efficiency in today’s competitive landscape. In this episode, we explore the evolution of data orchestration and the pivotal role of Apache Airflow in modern data workflows.

Ben Tallman, Chief Technology Officer at M Science, joins us and shares his extensive experience with Airflow, detailing its early adoption, evolution and the profound impact it has had on data engineering practices. His insights reveal how leveraging Airflow can streamline complex data processes, enhance observability and ultimately drive business success.

Key Takeaways:

(02:31) Benjamin’s journey with Airflow and its early adoption.
(05:36) The transition from legacy schedulers to Airflow at Apigee and later Google.
(08:52) The challenges and benefits of running production-grade Airflow instances.
(10:46) How Airflow facilitates the management of large-scale data at M Science.
(11:56) The importance of reducing time to value for customers using data products.
(13:32) Airflow’s role in ensuring observability and reliability in data workflows.
(17:00) Managing petabytes of data and billions of records efficiently.
(19:08) Integration of various data sources and ensuring data product quality.
(20:04) Leveraging Airflow for data observability and reducing time to value.
(22:04) Benjamin’s vision for the future development of Airflow, including audit trails for variables.

Resources Mentioned:

Ben Tallman -
https://www.linkedin.com/in/btallman/
M Science -
https://www.linkedin.com/company/m-science-llc/
Apache Airflow -
https://airflow.apache.org/
Astronomer -
https://www.astronomer.io/
Databricks -
https://databricks.com/
Snowflake -
https://www.snowflake.com/

Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering & AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.


#AI #Automation #Airflow #MachineLearning]]></content:encoded>
  <itunes:subtitle><![CDATA[Mastering the flow of data is essential for driving innovation and efficiency in today’s competitive landscape. In this episode, we explore the evolution of data orchestration and the pivotal role of Apache Airflow in modern data workflows.

Ben Ta...]]></itunes:subtitle>
  <itunes:explicit>false</itunes:explicit>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[tag:soundcloud,2010:tracks/1900165410]]></guid>
  <title><![CDATA[Welcome to The Data Flowcast]]></title>
  <description><![CDATA[Welcome to The Data Flowcast: Mastering Airflow for Data Engineering & AI — the podcast where we keep you up to date with insights and ideas propelling the Airflow community forward.

Join us each week, as we explore the current state, future and potential of Airflow with leading thinkers in the community, and discover how best to leverage this workflow management system to meet the ever-evolving needs of data engineering and AI ecosystems.


#AI #Automation #Airflow #MachineLearning]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/quill-file-prod/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/ab46badf-f0a3-4586-a6a6-185311a31ee9/cover-art/original_6a30b0d9d0c22762ffcb824c5d174556.jpg" />
  <pubDate>Mon, 19 Aug 2024 16:19:41 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="1949897" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/ab46badf-f0a3-4586-a6a6-185311a31ee9/episode.mp3" />
  <itunes:title><![CDATA[Welcome to The Data Flowcast]]></itunes:title>
  <itunes:duration>2:01</itunes:duration>
  <itunes:summary><![CDATA[Welcome to The Data Flowcast: Mastering Airflow for Data Engineering & AI — the podcast where we keep you up to date with insights and ideas propelling the Airflow community forward.

Join us each week, as we explore the current state, future and potential of Airflow with leading thinkers in the community, and discover how best to leverage this workflow management system to meet the ever-evolving needs of data engineering and AI ecosystems.


#AI #Automation #Airflow #MachineLearning]]></itunes:summary>
  <content:encoded><![CDATA[Welcome to The Data Flowcast: Mastering Airflow for Data Engineering & AI — the podcast where we keep you up to date with insights and ideas propelling the Airflow community forward.

Join us each week, as we explore the current state, future and potential of Airflow with leading thinkers in the community, and discover how best to leverage this workflow management system to meet the ever-evolving needs of data engineering and AI ecosystems.


#AI #Automation #Airflow #MachineLearning]]></content:encoded>
  <itunes:subtitle><![CDATA[Welcome to The Data Flowcast: Mastering Airflow for Data Engineering & AI — the podcast where we keep you up to date with insights and ideas propelling the Airflow community forward.

Join us each week, as we explore the current state, future and p...]]></itunes:subtitle>
  <itunes:explicit>false</itunes:explicit>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[tag:soundcloud,2010:tracks/1897164294]]></guid>
  <title><![CDATA[Enhancing Business Metrics With Airflow at Artlist with Hannan Kravitz]]></title>
  <description><![CDATA[Data orchestration is revolutionizing the way companies manage and process data. In this episode, we explore the critical role of data orchestration in modern data workflows and how Apache Airflow is used to enhance data processing and AI model deployment.

Hannan Kravitz, Data Engineering Team Leader at Artlist, joins us to share his insights on leveraging Airflow for data engineering and its impact on their business operations.

Key Takeaways:

(01:00) Hannan introduces Artlist and its mission to empower content creators.
(04:27) The importance of collecting and modeling data to support business insights.
(06:40) Using Airflow to connect multiple data sources and create dashboards.
(09:40) Implementing a monitoring DAG for proactive alerts within Airflow​​.
(12:31) Customizing Airflow for business metric KPI monitoring and setting thresholds​​.
(15:00) Addressing decreases in purchases due to technical issues with proactive alerts​​.
(17:45) Customizing data quality checks with dynamic task mapping in Airflow​​.
(20:00) Desired improvements in Airflow UI and logging capabilities​​.
(21:00) Enabling business stakeholders to change thresholds using Streamlit​​.
(22:26) Future improvements desired in the Airflow project​.


Resources Mentioned:

Hannan Kravitz -
https://www.linkedin.com/in/hannan-kravitz-60563112/
Artlist -
https://www.linkedin.com/company/art-list/
Apache Airflow -
https://airflow.apache.org/
Snowflake -
https://www.snowflake.com/
Streamlit -
https://streamlit.io/

Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering & AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.


#AI #Automation #Airflow #MachineLearning]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/quill-file-prod/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/bfac559e-51f4-4564-89b6-9ca20e3e1929/cover-art/original_2b95daf03dcfd7e3534046f68c8b1119.jpg" />
  <pubDate>Thu, 15 Aug 2024 10:00:27 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="22902605" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/bfac559e-51f4-4564-89b6-9ca20e3e1929/episode.mp3" />
  <itunes:title><![CDATA[Enhancing Business Metrics With Airflow at Artlist with Hannan Kravitz]]></itunes:title>
  <itunes:duration>23:51</itunes:duration>
  <itunes:summary><![CDATA[Data orchestration is revolutionizing the way companies manage and process data. In this episode, we explore the critical role of data orchestration in modern data workflows and how Apache Airflow is used to enhance data processing and AI model deployment.

Hannan Kravitz, Data Engineering Team Leader at Artlist, joins us to share his insights on leveraging Airflow for data engineering and its impact on their business operations.

Key Takeaways:

(01:00) Hannan introduces Artlist and its mission to empower content creators.
(04:27) The importance of collecting and modeling data to support business insights.
(06:40) Using Airflow to connect multiple data sources and create dashboards.
(09:40) Implementing a monitoring DAG for proactive alerts within Airflow​​.
(12:31) Customizing Airflow for business metric KPI monitoring and setting thresholds​​.
(15:00) Addressing decreases in purchases due to technical issues with proactive alerts​​.
(17:45) Customizing data quality checks with dynamic task mapping in Airflow​​.
(20:00) Desired improvements in Airflow UI and logging capabilities​​.
(21:00) Enabling business stakeholders to change thresholds using Streamlit​​.
(22:26) Future improvements desired in the Airflow project​.


Resources Mentioned:

Hannan Kravitz -
https://www.linkedin.com/in/hannan-kravitz-60563112/
Artlist -
https://www.linkedin.com/company/art-list/
Apache Airflow -
https://airflow.apache.org/
Snowflake -
https://www.snowflake.com/
Streamlit -
https://streamlit.io/

Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering & AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.


#AI #Automation #Airflow #MachineLearning]]></itunes:summary>
  <content:encoded><![CDATA[Data orchestration is revolutionizing the way companies manage and process data. In this episode, we explore the critical role of data orchestration in modern data workflows and how Apache Airflow is used to enhance data processing and AI model deployment.

Hannan Kravitz, Data Engineering Team Leader at Artlist, joins us to share his insights on leveraging Airflow for data engineering and its impact on their business operations.

Key Takeaways:

(01:00) Hannan introduces Artlist and its mission to empower content creators.
(04:27) The importance of collecting and modeling data to support business insights.
(06:40) Using Airflow to connect multiple data sources and create dashboards.
(09:40) Implementing a monitoring DAG for proactive alerts within Airflow​​.
(12:31) Customizing Airflow for business metric KPI monitoring and setting thresholds​​.
(15:00) Addressing decreases in purchases due to technical issues with proactive alerts​​.
(17:45) Customizing data quality checks with dynamic task mapping in Airflow​​.
(20:00) Desired improvements in Airflow UI and logging capabilities​​.
(21:00) Enabling business stakeholders to change thresholds using Streamlit​​.
(22:26) Future improvements desired in the Airflow project​.


Resources Mentioned:

Hannan Kravitz -
https://www.linkedin.com/in/hannan-kravitz-60563112/
Artlist -
https://www.linkedin.com/company/art-list/
Apache Airflow -
https://airflow.apache.org/
Snowflake -
https://www.snowflake.com/
Streamlit -
https://streamlit.io/

Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering & AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.


#AI #Automation #Airflow #MachineLearning]]></content:encoded>
  <itunes:subtitle><![CDATA[Data orchestration is revolutionizing the way companies manage and process data. In this episode, we explore the critical role of data orchestration in modern data workflows and how Apache Airflow is used to enhance data processing and AI model dep...]]></itunes:subtitle>
  <itunes:explicit>false</itunes:explicit>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[tag:soundcloud,2010:tracks/1891199046]]></guid>
  <title><![CDATA[Cutting-Edge Data Engineering at Teya with Alexandre Magno Lima Martins]]></title>
  <description><![CDATA[Data engineering is constantly evolving and staying ahead means mastering tools like Apache Airflow. In this episode, we explore the world of data engineering with Alexandre Magno Lima Martins, Senior Data Engineer at Teya. Alexandre talks about optimizing data workflows and the smart solutions they've created at Teya to make data processing easier and more efficient.

Key Takeaways:

(02:01) Alexandre explains his role at Teya and the responsibilities of a data platform engineer.
(02:40) The primary use cases of Airflow at Teya, especially with dbt and machine learning projects.
(04:14) How Teya creates self-service DAGs for dbt models.
(05:58) Automating DAG creation with CI/CD pipelines.
(09:04) Switching to a multi-file method for better Airflow performance.
(12:48) Challenges faced with Kubernetes Executor vs. Celery Executor.
(16:13) Using Celery Executor to handle fast tasks efficiently.
(17:02) Implementing KEDA autoscaler for better scaling of Celery workers.
(19:05) Reasons for not using Cosmos for DAG generation and cross-DAG dependencies.
(21:16) Alexandre's wish list for future Airflow features, focusing on multi-tenancy.


Resources Mentioned:

Alexandre Magno Lima Martins -
https://www.linkedin.com/in/alex-magno/
Teya -
https://www.linkedin.com/company/teya-global/
Apache Airflow -
https://airflow.apache.org/
dbt -
https://www.getdbt.com/
Kubernetes -
https://kubernetes.io/
KEDA -
https://keda.sh/

Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering & AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.


#AI #Automation #Airflow #MachineLearning]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/quill-file-prod/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/a1815a39-dd26-4f4e-a6fe-169d00bd8afd/cover-art/original_ba842d0eb8f4a49fbf360fa3fe739d5a.jpg" />
  <pubDate>Thu, 08 Aug 2024 10:00:10 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="45657946" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/a1815a39-dd26-4f4e-a6fe-169d00bd8afd/episode.mp3" />
  <itunes:title><![CDATA[Cutting-Edge Data Engineering at Teya with Alexandre Magno Lima Martins]]></itunes:title>
  <itunes:duration>23:46</itunes:duration>
  <itunes:summary><![CDATA[Data engineering is constantly evolving and staying ahead means mastering tools like Apache Airflow. In this episode, we explore the world of data engineering with Alexandre Magno Lima Martins, Senior Data Engineer at Teya. Alexandre talks about optimizing data workflows and the smart solutions they've created at Teya to make data processing easier and more efficient.

Key Takeaways:

(02:01) Alexandre explains his role at Teya and the responsibilities of a data platform engineer.
(02:40) The primary use cases of Airflow at Teya, especially with dbt and machine learning projects.
(04:14) How Teya creates self-service DAGs for dbt models.
(05:58) Automating DAG creation with CI/CD pipelines.
(09:04) Switching to a multi-file method for better Airflow performance.
(12:48) Challenges faced with Kubernetes Executor vs. Celery Executor.
(16:13) Using Celery Executor to handle fast tasks efficiently.
(17:02) Implementing KEDA autoscaler for better scaling of Celery workers.
(19:05) Reasons for not using Cosmos for DAG generation and cross-DAG dependencies.
(21:16) Alexandre's wish list for future Airflow features, focusing on multi-tenancy.


Resources Mentioned:

Alexandre Magno Lima Martins -
https://www.linkedin.com/in/alex-magno/
Teya -
https://www.linkedin.com/company/teya-global/
Apache Airflow -
https://airflow.apache.org/
dbt -
https://www.getdbt.com/
Kubernetes -
https://kubernetes.io/
KEDA -
https://keda.sh/

Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering & AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.


#AI #Automation #Airflow #MachineLearning]]></itunes:summary>
  <content:encoded><![CDATA[Data engineering is constantly evolving and staying ahead means mastering tools like Apache Airflow. In this episode, we explore the world of data engineering with Alexandre Magno Lima Martins, Senior Data Engineer at Teya. Alexandre talks about optimizing data workflows and the smart solutions they've created at Teya to make data processing easier and more efficient.

Key Takeaways:

(02:01) Alexandre explains his role at Teya and the responsibilities of a data platform engineer.
(02:40) The primary use cases of Airflow at Teya, especially with dbt and machine learning projects.
(04:14) How Teya creates self-service DAGs for dbt models.
(05:58) Automating DAG creation with CI/CD pipelines.
(09:04) Switching to a multi-file method for better Airflow performance.
(12:48) Challenges faced with Kubernetes Executor vs. Celery Executor.
(16:13) Using Celery Executor to handle fast tasks efficiently.
(17:02) Implementing KEDA autoscaler for better scaling of Celery workers.
(19:05) Reasons for not using Cosmos for DAG generation and cross-DAG dependencies.
(21:16) Alexandre's wish list for future Airflow features, focusing on multi-tenancy.


Resources Mentioned:

Alexandre Magno Lima Martins -
https://www.linkedin.com/in/alex-magno/
Teya -
https://www.linkedin.com/company/teya-global/
Apache Airflow -
https://airflow.apache.org/
dbt -
https://www.getdbt.com/
Kubernetes -
https://kubernetes.io/
KEDA -
https://keda.sh/

Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering & AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.


#AI #Automation #Airflow #MachineLearning]]></content:encoded>
  <itunes:subtitle><![CDATA[Data engineering is constantly evolving and staying ahead means mastering tools like Apache Airflow. In this episode, we explore the world of data engineering with Alexandre Magno Lima Martins, Senior Data Engineer at Teya. Alexandre talks about op...]]></itunes:subtitle>
  <itunes:explicit>false</itunes:explicit>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[tag:soundcloud,2010:tracks/1880844150]]></guid>
  <title><![CDATA[Airflow Strategies for Business Efficiency at Campbell with Larry Komenda]]></title>
  <description><![CDATA[Managing data workflows well can change the game for any company. In this episode, we talk about how Airflow makes this possible. Larry Komenda, Chief Technology Officer at Campbell, shares how Airflow supports their operations and improves efficiency.

Larry discusses his role at Campbell, their switch to Airflow, and its impact. We look at their strategies for testing and maintaining reliable workflows and how these help their business.

Key Takeaways:

(02:26) Strong technology and data systems are crucial for Campbell’s investment process.
(05:03) Airflow manages data pipelines efficiently in the market data team.
(07:39) Airflow supports various departments, including trading and operations.
(09:22) Machine learning models run on dedicated Airflow instances.
(11:12) Reliable workflows are ensured through thorough testing and development.
(13:45) Business tasks are organized separately from Airflow for easier testing.
(15:30) Non-technical teams have access to Airflow for better efficiency.
(17:20) Thorough testing before deploying to Airflow is essential.
(19:10) Non-technical users can interact with Airflow DAGs to solve their issues.
(21:55) Airflow improves efficiency and reliability in trading and operations.
(24:40) Enhancing the Airflow UI for non-technical users is important for accessibility.


Resources Mentioned:

Larry Komenda -
https://www.linkedin.com/in/larrykomenda/
Campbell -
https://www.linkedin.com/company/campbell-and-company/
30% off Airflow Summit Ticket -
https://ti.to/airflowsummit/2024/discount/30DISC_ASTRONOMER
Apache Airflow -
https://airflow.apache.org/
NumPy -
https://numpy.org/
Python -
https://www.python.org/


Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering & AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.


#AI #Automation #Airflow #MachineLearning]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/quill-file-prod/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/ac1c4e2d-d0c2-45ce-bb1f-e70628b42459/cover-art/original_c4acbb90cebb085bc76fbab2c2fc88d5.jpg" />
  <pubDate>Thu, 25 Jul 2024 20:11:35 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="50258005" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/ac1c4e2d-d0c2-45ce-bb1f-e70628b42459/episode.mp3" />
  <itunes:title><![CDATA[Airflow Strategies for Business Efficiency at Campbell with Larry Komenda]]></itunes:title>
  <itunes:duration>26:10</itunes:duration>
  <itunes:summary><![CDATA[Managing data workflows well can change the game for any company. In this episode, we talk about how Airflow makes this possible. Larry Komenda, Chief Technology Officer at Campbell, shares how Airflow supports their operations and improves efficiency.

Larry discusses his role at Campbell, their switch to Airflow, and its impact. We look at their strategies for testing and maintaining reliable workflows and how these help their business.

Key Takeaways:

(02:26) Strong technology and data systems are crucial for Campbell’s investment process.
(05:03) Airflow manages data pipelines efficiently in the market data team.
(07:39) Airflow supports various departments, including trading and operations.
(09:22) Machine learning models run on dedicated Airflow instances.
(11:12) Reliable workflows are ensured through thorough testing and development.
(13:45) Business tasks are organized separately from Airflow for easier testing.
(15:30) Non-technical teams have access to Airflow for better efficiency.
(17:20) Thorough testing before deploying to Airflow is essential.
(19:10) Non-technical users can interact with Airflow DAGs to solve their issues.
(21:55) Airflow improves efficiency and reliability in trading and operations.
(24:40) Enhancing the Airflow UI for non-technical users is important for accessibility.


Resources Mentioned:

Larry Komenda -
https://www.linkedin.com/in/larrykomenda/
Campbell -
https://www.linkedin.com/company/campbell-and-company/
30% off Airflow Summit Ticket -
https://ti.to/airflowsummit/2024/discount/30DISC_ASTRONOMER
Apache Airflow -
https://airflow.apache.org/
NumPy -
https://numpy.org/
Python -
https://www.python.org/


Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering & AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.


#AI #Automation #Airflow #MachineLearning]]></itunes:summary>
  <content:encoded><![CDATA[Managing data workflows well can change the game for any company. In this episode, we talk about how Airflow makes this possible. Larry Komenda, Chief Technology Officer at Campbell, shares how Airflow supports their operations and improves efficiency.

Larry discusses his role at Campbell, their switch to Airflow, and its impact. We look at their strategies for testing and maintaining reliable workflows and how these help their business.

Key Takeaways:

(02:26) Strong technology and data systems are crucial for Campbell’s investment process.
(05:03) Airflow manages data pipelines efficiently in the market data team.
(07:39) Airflow supports various departments, including trading and operations.
(09:22) Machine learning models run on dedicated Airflow instances.
(11:12) Reliable workflows are ensured through thorough testing and development.
(13:45) Business tasks are organized separately from Airflow for easier testing.
(15:30) Non-technical teams have access to Airflow for better efficiency.
(17:20) Thorough testing before deploying to Airflow is essential.
(19:10) Non-technical users can interact with Airflow DAGs to solve their issues.
(21:55) Airflow improves efficiency and reliability in trading and operations.
(24:40) Enhancing the Airflow UI for non-technical users is important for accessibility.


Resources Mentioned:

Larry Komenda -
https://www.linkedin.com/in/larrykomenda/
Campbell -
https://www.linkedin.com/company/campbell-and-company/
30% off Airflow Summit Ticket -
https://ti.to/airflowsummit/2024/discount/30DISC_ASTRONOMER
Apache Airflow -
https://airflow.apache.org/
NumPy -
https://numpy.org/
Python -
https://www.python.org/


Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering & AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.


#AI #Automation #Airflow #MachineLearning]]></content:encoded>
  <itunes:subtitle><![CDATA[Managing data workflows well can change the game for any company. In this episode, we talk about how Airflow makes this possible. Larry Komenda, Chief Technology Officer at Campbell, shares how Airflow supports their operations and improves efficie...]]></itunes:subtitle>
  <itunes:explicit>false</itunes:explicit>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[tag:soundcloud,2010:tracks/1874450295]]></guid>
  <title><![CDATA[How Laurel Uses Airflow To Enhance Machine Learning Pipelines with Vincent La and Jim Howard]]></title>
  <description><![CDATA[The world of timekeeping for knowledge workers is transforming through the use of AI and machine learning. Understanding how to leverage these technologies is crucial for improving efficiency and productivity.

In this episode, we’re joined by Vincent La, Principal Data Scientist at Laurel, and Jim Howard, Principal Machine Learning Engineer at Laurel, to explore the implementation of AI in automating timekeeping and its impact on legal and accounting firms.

Key Takeaways:

(01:54) Laurel's mission in time automation. 
(03:39) Solving clustering, prediction and summarization with AI. 
(06:30) Daily batch jobs for user time generation. 
(08:22) Knowledge workers touch 300 items daily. 
(09:01) Mapping 300 activities to seven billable items. 
(11:38) Retraining models for better performance. 
(14:00) Using Airflow for retraining and backfills. 
(17:06) RAG-based summarization for user-specific tone. 
(18:58) Testing Airflow DAGs for cost-effective summarization. 
(22:00) Enhancing Airflow for long-running DAGs.

Resources Mentioned:

Vincent La -
https://www.linkedin.com/in/vincentla/
Jim Howard -
https://www.linkedin.com/in/jameswhowardml/
Laurel -
https://www.linkedin.com/company/laurel-ai/
Apache Airflow -
https://airflow.apache.org/
Ernst & Young -
https://www.ey.com/


Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering & AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.


#AI #Automation #Airflow #MachineLearning]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/quill-file-prod/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/fc2e5bc0-12d6-40ab-a218-82da8c4b92ef/cover-art/original_2088dee35b0c3a1a6a074776078992c6.jpg" />
  <pubDate>Thu, 18 Jul 2024 07:20:44 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="23024200" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/fc2e5bc0-12d6-40ab-a218-82da8c4b92ef/episode.mp3" />
  <itunes:title><![CDATA[How Laurel Uses Airflow To Enhance Machine Learning Pipelines with Vincent La and Jim Howard]]></itunes:title>
  <itunes:duration>23:58</itunes:duration>
  <itunes:summary><![CDATA[The world of timekeeping for knowledge workers is transforming through the use of AI and machine learning. Understanding how to leverage these technologies is crucial for improving efficiency and productivity.

In this episode, we’re joined by Vincent La, Principal Data Scientist at Laurel, and Jim Howard, Principal Machine Learning Engineer at Laurel, to explore the implementation of AI in automating timekeeping and its impact on legal and accounting firms.

Key Takeaways:

(01:54) Laurel's mission in time automation. 
(03:39) Solving clustering, prediction and summarization with AI. 
(06:30) Daily batch jobs for user time generation. 
(08:22) Knowledge workers touch 300 items daily. 
(09:01) Mapping 300 activities to seven billable items. 
(11:38) Retraining models for better performance. 
(14:00) Using Airflow for retraining and backfills. 
(17:06) RAG-based summarization for user-specific tone. 
(18:58) Testing Airflow DAGs for cost-effective summarization. 
(22:00) Enhancing Airflow for long-running DAGs.

Resources Mentioned:

Vincent La -
https://www.linkedin.com/in/vincentla/
Jim Howard -
https://www.linkedin.com/in/jameswhowardml/
Laurel -
https://www.linkedin.com/company/laurel-ai/
Apache Airflow -
https://airflow.apache.org/
Ernst & Young -
https://www.ey.com/


Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering & AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.


#AI #Automation #Airflow #MachineLearning]]></itunes:summary>
  <content:encoded><![CDATA[The world of timekeeping for knowledge workers is transforming through the use of AI and machine learning. Understanding how to leverage these technologies is crucial for improving efficiency and productivity.

In this episode, we’re joined by Vincent La, Principal Data Scientist at Laurel, and Jim Howard, Principal Machine Learning Engineer at Laurel, to explore the implementation of AI in automating timekeeping and its impact on legal and accounting firms.

Key Takeaways:

(01:54) Laurel's mission in time automation. 
(03:39) Solving clustering, prediction and summarization with AI. 
(06:30) Daily batch jobs for user time generation. 
(08:22) Knowledge workers touch 300 items daily. 
(09:01) Mapping 300 activities to seven billable items. 
(11:38) Retraining models for better performance. 
(14:00) Using Airflow for retraining and backfills. 
(17:06) RAG-based summarization for user-specific tone. 
(18:58) Testing Airflow DAGs for cost-effective summarization. 
(22:00) Enhancing Airflow for long-running DAGs.

Resources Mentioned:

Vincent La -
https://www.linkedin.com/in/vincentla/
Jim Howard -
https://www.linkedin.com/in/jameswhowardml/
Laurel -
https://www.linkedin.com/company/laurel-ai/
Apache Airflow -
https://airflow.apache.org/
Ernst & Young -
https://www.ey.com/


Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering & AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.


#AI #Automation #Airflow #MachineLearning]]></content:encoded>
  <itunes:subtitle><![CDATA[The world of timekeeping for knowledge workers is transforming through the use of AI and machine learning. Understanding how to leverage these technologies is crucial for improving efficiency and productivity.

In this episode, we’re joined by Vinc...]]></itunes:subtitle>
  <itunes:explicit>false</itunes:explicit>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[tag:soundcloud,2010:tracks/1857855858]]></guid>
  <title><![CDATA[How Vibrant Planet's Self-Healing Pipelines Revolutionize Data Processing]]></title>
  <description><![CDATA[Discover the cutting-edge methods Vibrant Planet uses to revolutionize geospatial data processing and resource management.

In this episode, we delve into the intricacies of scaling geospatial data processing and resource allocation with experts from Vibrant Planet. Joining us are Cyrus Dukart, Engineering Lead, and David Sacerdote, Staff Software Engineer, who share their innovative approaches to handling large datasets and optimizing resource use in Airflow.

Key Takeaways:

(00:00) Inefficiencies in resource allocation. 
(03:00) Scientific validity of sharded results. 
(05:53) Tech-based solutions for resource management. 
(06:11) Retry callback process for resource allocation.
(08:00) Running database queries for resource needs. 
(10:05) Importance of remembering resource usage. 
(13:51) Generating resource predictions. 
(14:44) Custom task decorator for resource management. 
(20:28) Massive resource usage gap in sharded data. 
(21:14) Fail-fast model for long-running tasks.

Resources Mentioned:

Cyrus Dukart -
https://www.linkedin.com/in/cyrus-dukart-6561482/
David Sacerdote -
https://www.linkedin.com/in/davidsacerdote/
Vibrant Planet -
https://www.linkedin.com/company/vibrant-planet/
Apache Airflow -
https://airflow.apache.org/
Kubernetes -
https://kubernetes.io/
Vibrant Planet -
https://vibrantplanet.net/


Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering & AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.


#AI #Automation #Airflow #MachineLearning]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/quill-file-prod/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/ae392564-5b86-444f-a6e9-e68a9c4185f4/cover-art/original_441d838d24bf653c367571bef3427627.jpg" />
  <pubDate>Thu, 27 Jun 2024 15:29:16 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="22911771" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/ae392564-5b86-444f-a6e9-e68a9c4185f4/episode.mp3" />
  <itunes:title><![CDATA[How Vibrant Planet's Self-Healing Pipelines Revolutionize Data Processing]]></itunes:title>
  <itunes:duration>23:51</itunes:duration>
  <itunes:summary><![CDATA[Discover the cutting-edge methods Vibrant Planet uses to revolutionize geospatial data processing and resource management.

In this episode, we delve into the intricacies of scaling geospatial data processing and resource allocation with experts from Vibrant Planet. Joining us are Cyrus Dukart, Engineering Lead, and David Sacerdote, Staff Software Engineer, who share their innovative approaches to handling large datasets and optimizing resource use in Airflow.

Key Takeaways:

(00:00) Inefficiencies in resource allocation. 
(03:00) Scientific validity of sharded results. 
(05:53) Tech-based solutions for resource management. 
(06:11) Retry callback process for resource allocation.
(08:00) Running database queries for resource needs. 
(10:05) Importance of remembering resource usage. 
(13:51) Generating resource predictions. 
(14:44) Custom task decorator for resource management. 
(20:28) Massive resource usage gap in sharded data. 
(21:14) Fail-fast model for long-running tasks.

Resources Mentioned:

Cyrus Dukart -
https://www.linkedin.com/in/cyrus-dukart-6561482/
David Sacerdote -
https://www.linkedin.com/in/davidsacerdote/
Vibrant Planet -
https://www.linkedin.com/company/vibrant-planet/
Apache Airflow -
https://airflow.apache.org/
Kubernetes -
https://kubernetes.io/
Vibrant Planet -
https://vibrantplanet.net/


Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering & AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.


#AI #Automation #Airflow #MachineLearning]]></itunes:summary>
  <content:encoded><![CDATA[Discover the cutting-edge methods Vibrant Planet uses to revolutionize geospatial data processing and resource management.

In this episode, we delve into the intricacies of scaling geospatial data processing and resource allocation with experts from Vibrant Planet. Joining us are Cyrus Dukart, Engineering Lead, and David Sacerdote, Staff Software Engineer, who share their innovative approaches to handling large datasets and optimizing resource use in Airflow.

Key Takeaways:

(00:00) Inefficiencies in resource allocation. 
(03:00) Scientific validity of sharded results. 
(05:53) Tech-based solutions for resource management. 
(06:11) Retry callback process for resource allocation.
(08:00) Running database queries for resource needs. 
(10:05) Importance of remembering resource usage. 
(13:51) Generating resource predictions. 
(14:44) Custom task decorator for resource management. 
(20:28) Massive resource usage gap in sharded data. 
(21:14) Fail-fast model for long-running tasks.

Resources Mentioned:

Cyrus Dukart -
https://www.linkedin.com/in/cyrus-dukart-6561482/
David Sacerdote -
https://www.linkedin.com/in/davidsacerdote/
Vibrant Planet -
https://www.linkedin.com/company/vibrant-planet/
Apache Airflow -
https://airflow.apache.org/
Kubernetes -
https://kubernetes.io/
Vibrant Planet -
https://vibrantplanet.net/


Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering & AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.


#AI #Automation #Airflow #MachineLearning]]></content:encoded>
  <itunes:subtitle><![CDATA[Discover the cutting-edge methods Vibrant Planet uses to revolutionize geospatial data processing and resource management.

In this episode, we delve into the intricacies of scaling geospatial data processing and resource allocation with experts fr...]]></itunes:subtitle>
  <itunes:explicit>false</itunes:explicit>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[tag:soundcloud,2010:tracks/1833134607]]></guid>
  <title><![CDATA[The Future of AI in Data Engineering With Astronomer’s Julian LaNeve and David Xue]]></title>
  <description><![CDATA[The world of data orchestration and machine learning is rapidly evolving, and tools like Apache Airflow are at the forefront of these changes. Understanding how to effectively utilize these tools can significantly enhance data processing and AI model deployment.
This episode features Julian LaNeve, CTO at Astronomer, and David Xue, Machine Learning Engineer at Astronomer. They delve into the intricacies of data orchestration, generative AI and the practical applications of these technologies in modern data workflows.
Key Takeaways:
(01:51) The pressure to engage in the generative AI space.
(02:02) Generative AI can elevate data utilization to the next level.
(02:43) The transparency issues with commercial AI models.
(04:27) High-quality data in model performance is crucial.
(06:40) Running new models on smaller devices, like phones.
(12:19) Fine-tuning LLMs to handle millions of task failures.
(16:54) Teaching AI to understand specific logs, not general passages, is a goal.
(21:56) Using Airflow as a general-purpose orchestration tool.
(22:00) Airflow is adaptable for various use cases, including ETL and ML systems.


Resources Mentioned:

Julian LaNeve - https://www.linkedin.com/in/julianlaneve/
Atronomer - https://www.linkedin.com/company/astronomer/
David Xue - https://www.linkedin.com/in/david-xue-uva/
Apache Airflow - https://airflow.apache.org/
Meta’s Open Source Llama 3 model: https://ai.meta.com/blog/meta-llama-3/https://ai.meta.com/blog/meta-llama-3/
Microsoft’s Phi-3 model: https://www.microsoft.com/en-us/research/publication/phi-3-technical-report-a-highly-capable-language-model-locally-on-your-phone/
GPT-4 - https://www.openai.com/research/gpt-4




Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering & AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.


#ai #automation #airflow #machinelearning]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/quill-file-prod/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/ec1a04d1-8c0f-4054-93d0-da35b3607ba4/cover-art/original_578ee38b643fb997225949c4348e454f.jpg" />
  <pubDate>Wed, 29 May 2024 19:49:32 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="22660157" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/ec1a04d1-8c0f-4054-93d0-da35b3607ba4/episode.mp3" />
  <itunes:title><![CDATA[The Future of AI in Data Engineering With Astronomer’s Julian LaNeve and David Xue]]></itunes:title>
  <itunes:duration>23:36</itunes:duration>
  <itunes:summary><![CDATA[The world of data orchestration and machine learning is rapidly evolving, and tools like Apache Airflow are at the forefront of these changes. Understanding how to effectively utilize these tools can significantly enhance data processing and AI model deployment.
This episode features Julian LaNeve, CTO at Astronomer, and David Xue, Machine Learning Engineer at Astronomer. They delve into the intricacies of data orchestration, generative AI and the practical applications of these technologies in modern data workflows.
Key Takeaways:
(01:51) The pressure to engage in the generative AI space.
(02:02) Generative AI can elevate data utilization to the next level.
(02:43) The transparency issues with commercial AI models.
(04:27) High-quality data in model performance is crucial.
(06:40) Running new models on smaller devices, like phones.
(12:19) Fine-tuning LLMs to handle millions of task failures.
(16:54) Teaching AI to understand specific logs, not general passages, is a goal.
(21:56) Using Airflow as a general-purpose orchestration tool.
(22:00) Airflow is adaptable for various use cases, including ETL and ML systems.


Resources Mentioned:

Julian LaNeve - https://www.linkedin.com/in/julianlaneve/
Atronomer - https://www.linkedin.com/company/astronomer/
David Xue - https://www.linkedin.com/in/david-xue-uva/
Apache Airflow - https://airflow.apache.org/
Meta’s Open Source Llama 3 model: https://ai.meta.com/blog/meta-llama-3/https://ai.meta.com/blog/meta-llama-3/
Microsoft’s Phi-3 model: https://www.microsoft.com/en-us/research/publication/phi-3-technical-report-a-highly-capable-language-model-locally-on-your-phone/
GPT-4 - https://www.openai.com/research/gpt-4




Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering & AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.


#ai #automation #airflow #machinelearning]]></itunes:summary>
  <content:encoded><![CDATA[The world of data orchestration and machine learning is rapidly evolving, and tools like Apache Airflow are at the forefront of these changes. Understanding how to effectively utilize these tools can significantly enhance data processing and AI model deployment.
This episode features Julian LaNeve, CTO at Astronomer, and David Xue, Machine Learning Engineer at Astronomer. They delve into the intricacies of data orchestration, generative AI and the practical applications of these technologies in modern data workflows.
Key Takeaways:
(01:51) The pressure to engage in the generative AI space.
(02:02) Generative AI can elevate data utilization to the next level.
(02:43) The transparency issues with commercial AI models.
(04:27) High-quality data in model performance is crucial.
(06:40) Running new models on smaller devices, like phones.
(12:19) Fine-tuning LLMs to handle millions of task failures.
(16:54) Teaching AI to understand specific logs, not general passages, is a goal.
(21:56) Using Airflow as a general-purpose orchestration tool.
(22:00) Airflow is adaptable for various use cases, including ETL and ML systems.


Resources Mentioned:

Julian LaNeve - https://www.linkedin.com/in/julianlaneve/
Atronomer - https://www.linkedin.com/company/astronomer/
David Xue - https://www.linkedin.com/in/david-xue-uva/
Apache Airflow - https://airflow.apache.org/
Meta’s Open Source Llama 3 model: https://ai.meta.com/blog/meta-llama-3/https://ai.meta.com/blog/meta-llama-3/
Microsoft’s Phi-3 model: https://www.microsoft.com/en-us/research/publication/phi-3-technical-report-a-highly-capable-language-model-locally-on-your-phone/
GPT-4 - https://www.openai.com/research/gpt-4




Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering & AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.


#ai #automation #airflow #machinelearning]]></content:encoded>
  <itunes:subtitle><![CDATA[The world of data orchestration and machine learning is rapidly evolving, and tools like Apache Airflow are at the forefront of these changes. Understanding how to effectively utilize these tools can significantly enhance data processing and AI mod...]]></itunes:subtitle>
  <itunes:explicit>false</itunes:explicit>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[tag:soundcloud,2010:tracks/1832448933]]></guid>
  <title><![CDATA[The Power of Airflow in Modern Data Environments at Wynn Las Vegas with Siva Krishna Yetukuri]]></title>
  <description><![CDATA[Understanding the critical role of data integration and management is essential for driving business success, particularly in a dynamic environment like a luxury casino resort.

In this episode, we sit down with Siva Krishna Yetukuri, Cloud Data Architect at Wynn Las Vegas, to explore how Airflow and other tools are transforming data workflows and customer experiences at Wynn Las Vegas. 

Key Takeaways:

(02:00) Siva designs and builds cutting-edge data pipelines and architectures.
(02:54) Wynn is building a data platform to drive surveys and marketing strategies.
(05:00) Airflow is the backbone of data ingestion, curation and integration.
(07:00) Custom operators in Airflow enhance monitoring and reporting.
(09:00) Excitement surrounds the use of Airflow 2.9 and its new features.
(08:32) A metadata database drives Airflow workflows and captures metrics.
(12:31) Understanding Airflow fundamentals in layman’s terms simplifies complexity.
(16:33) Transitioning from Control-M to Airflow eases building complex workflows.
(24:06) ML models for volume and freshness anomalies improve data quality.
(20:15) DAGs are often auto-generated, simplifying the process for engineers.


Resources Mentioned:

Apache Airflow -
https://airflow.apache.org/
Snowflake -
https://www.snowflake.com/
Databricks -
https://databricks.com/
Great Expectations -
https://greatexpectations.io/


Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering & AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.


#ai #automation #airflow #machinelearning]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/quill-file-prod/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/65a8a7ab-2ab7-4760-85b5-039de87cf773/cover-art/original_c526e8daad79beb9760326ef43c9b554.jpg" />
  <pubDate>Wed, 29 May 2024 18:30:43 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="47083187" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/65a8a7ab-2ab7-4760-85b5-039de87cf773/episode.mp3" />
  <itunes:title><![CDATA[The Power of Airflow in Modern Data Environments at Wynn Las Vegas with Siva Krishna Yetukuri]]></itunes:title>
  <itunes:duration>24:31</itunes:duration>
  <itunes:summary><![CDATA[Understanding the critical role of data integration and management is essential for driving business success, particularly in a dynamic environment like a luxury casino resort.

In this episode, we sit down with Siva Krishna Yetukuri, Cloud Data Architect at Wynn Las Vegas, to explore how Airflow and other tools are transforming data workflows and customer experiences at Wynn Las Vegas. 

Key Takeaways:

(02:00) Siva designs and builds cutting-edge data pipelines and architectures.
(02:54) Wynn is building a data platform to drive surveys and marketing strategies.
(05:00) Airflow is the backbone of data ingestion, curation and integration.
(07:00) Custom operators in Airflow enhance monitoring and reporting.
(09:00) Excitement surrounds the use of Airflow 2.9 and its new features.
(08:32) A metadata database drives Airflow workflows and captures metrics.
(12:31) Understanding Airflow fundamentals in layman’s terms simplifies complexity.
(16:33) Transitioning from Control-M to Airflow eases building complex workflows.
(24:06) ML models for volume and freshness anomalies improve data quality.
(20:15) DAGs are often auto-generated, simplifying the process for engineers.


Resources Mentioned:

Apache Airflow -
https://airflow.apache.org/
Snowflake -
https://www.snowflake.com/
Databricks -
https://databricks.com/
Great Expectations -
https://greatexpectations.io/


Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering & AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.


#ai #automation #airflow #machinelearning]]></itunes:summary>
  <content:encoded><![CDATA[Understanding the critical role of data integration and management is essential for driving business success, particularly in a dynamic environment like a luxury casino resort.

In this episode, we sit down with Siva Krishna Yetukuri, Cloud Data Architect at Wynn Las Vegas, to explore how Airflow and other tools are transforming data workflows and customer experiences at Wynn Las Vegas. 

Key Takeaways:

(02:00) Siva designs and builds cutting-edge data pipelines and architectures.
(02:54) Wynn is building a data platform to drive surveys and marketing strategies.
(05:00) Airflow is the backbone of data ingestion, curation and integration.
(07:00) Custom operators in Airflow enhance monitoring and reporting.
(09:00) Excitement surrounds the use of Airflow 2.9 and its new features.
(08:32) A metadata database drives Airflow workflows and captures metrics.
(12:31) Understanding Airflow fundamentals in layman’s terms simplifies complexity.
(16:33) Transitioning from Control-M to Airflow eases building complex workflows.
(24:06) ML models for volume and freshness anomalies improve data quality.
(20:15) DAGs are often auto-generated, simplifying the process for engineers.


Resources Mentioned:

Apache Airflow -
https://airflow.apache.org/
Snowflake -
https://www.snowflake.com/
Databricks -
https://databricks.com/
Great Expectations -
https://greatexpectations.io/


Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering & AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.


#ai #automation #airflow #machinelearning]]></content:encoded>
  <itunes:subtitle><![CDATA[Understanding the critical role of data integration and management is essential for driving business success, particularly in a dynamic environment like a luxury casino resort.

In this episode, we sit down with Siva Krishna Yetukuri, Cloud Data Ar...]]></itunes:subtitle>
  <itunes:explicit>false</itunes:explicit>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[tag:soundcloud,2010:tracks/1832444358]]></guid>
  <title><![CDATA[Powering the Texas Rangers World Series Win With AI on Airflow with Alexander Booth]]></title>
  <description><![CDATA[The integration of data and AI in sports is transforming how teams strategize and perform. Understanding how to harness this technology is key to staying competitive in the rapidly evolving landscape of baseball.

In this episode, we sit down with Alexander Booth, Assistant Director of Research and Development at Texas Rangers Baseball Club, to explore the intersection of big data, AI and baseball strategy.

Key Takeaways:

(03:00) Alexander Booth's role and responsibilities at the Texas Rangers. 
(03:33) The implementation of multiple cameras and pose tracking in stadiums. 
(06:16) The importance of Airflow in organizing data orchestrations. 
(06:22) The demand for faster data among modern baseball players. 
(11:01) The necessity of scalable solutions for handling large data sets. 
(15:00) How weather data influences game strategy. 
(15:46) The impact of advanced technology on decision-making in baseball. 
(18:00) The role of AI and machine learning in player and game analysis. 
(22:26) The use of dynamic tasks in Airflow for better data management.


Resources Mentioned:

Apache Airflow -
https://airflow.apache.org/
Statcast -
https://www.mlb.com/statcast
Google BigQuery -
https://cloud.google.com/bigquery/
Databricks -
https://databricks.com/


Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering & AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.


#ai #automation #airflow #machinelearning]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/quill-file-prod/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/0d06e156-03f9-4cde-aa98-f67e27614c56/cover-art/original_7e1a0a6290e7865bdf5260f0a024916f.jpg" />
  <pubDate>Wed, 29 May 2024 18:28:37 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="22688581" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/0d06e156-03f9-4cde-aa98-f67e27614c56/episode.mp3" />
  <itunes:title><![CDATA[Powering the Texas Rangers World Series Win With AI on Airflow with Alexander Booth]]></itunes:title>
  <itunes:duration>23:38</itunes:duration>
  <itunes:summary><![CDATA[The integration of data and AI in sports is transforming how teams strategize and perform. Understanding how to harness this technology is key to staying competitive in the rapidly evolving landscape of baseball.

In this episode, we sit down with Alexander Booth, Assistant Director of Research and Development at Texas Rangers Baseball Club, to explore the intersection of big data, AI and baseball strategy.

Key Takeaways:

(03:00) Alexander Booth's role and responsibilities at the Texas Rangers. 
(03:33) The implementation of multiple cameras and pose tracking in stadiums. 
(06:16) The importance of Airflow in organizing data orchestrations. 
(06:22) The demand for faster data among modern baseball players. 
(11:01) The necessity of scalable solutions for handling large data sets. 
(15:00) How weather data influences game strategy. 
(15:46) The impact of advanced technology on decision-making in baseball. 
(18:00) The role of AI and machine learning in player and game analysis. 
(22:26) The use of dynamic tasks in Airflow for better data management.


Resources Mentioned:

Apache Airflow -
https://airflow.apache.org/
Statcast -
https://www.mlb.com/statcast
Google BigQuery -
https://cloud.google.com/bigquery/
Databricks -
https://databricks.com/


Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering & AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.


#ai #automation #airflow #machinelearning]]></itunes:summary>
  <content:encoded><![CDATA[The integration of data and AI in sports is transforming how teams strategize and perform. Understanding how to harness this technology is key to staying competitive in the rapidly evolving landscape of baseball.

In this episode, we sit down with Alexander Booth, Assistant Director of Research and Development at Texas Rangers Baseball Club, to explore the intersection of big data, AI and baseball strategy.

Key Takeaways:

(03:00) Alexander Booth's role and responsibilities at the Texas Rangers. 
(03:33) The implementation of multiple cameras and pose tracking in stadiums. 
(06:16) The importance of Airflow in organizing data orchestrations. 
(06:22) The demand for faster data among modern baseball players. 
(11:01) The necessity of scalable solutions for handling large data sets. 
(15:00) How weather data influences game strategy. 
(15:46) The impact of advanced technology on decision-making in baseball. 
(18:00) The role of AI and machine learning in player and game analysis. 
(22:26) The use of dynamic tasks in Airflow for better data management.


Resources Mentioned:

Apache Airflow -
https://airflow.apache.org/
Statcast -
https://www.mlb.com/statcast
Google BigQuery -
https://cloud.google.com/bigquery/
Databricks -
https://databricks.com/


Thanks for listening to The Data Flowcast: Mastering Airflow for Data Engineering & AI. If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.


#ai #automation #airflow #machinelearning]]></content:encoded>
  <itunes:subtitle><![CDATA[The integration of data and AI in sports is transforming how teams strategize and perform. Understanding how to harness this technology is key to staying competitive in the rapidly evolving landscape of baseball.

In this episode, we sit down with ...]]></itunes:subtitle>
  <itunes:explicit>false</itunes:explicit>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[tag:soundcloud,2010:tracks/979278949]]></guid>
  <title><![CDATA[Expanding the Data Engineering Toolkit at Reddit]]></title>
  <description><![CDATA[Welcome back to the Airflow Podcast.

This week, we met up with Ben Wisegarver,  a staff data scientist at Reddit who runs their data warehousing and data engineering functions.

Reddit users generate petabytes of data every day that needs to be processed, stored, and analyzed by a wide breadth of backend services. Our conversation with Ben touches on everything from Airflow as a tool for career mobility across the data stack to scaling out a self-service data architecture across many teams.

For folks interested, our team at Astronomer is growing rapidly and we're on the hunt for new folks to join in a variety of different roles. If you're passionate about Airflow and interested in building the future of data engineering, please get in touch. You can check our current job postings at careers.astronomer.io, but we're constantly updating our listings to accommodate new hiring needs. Please feel free to email me directly at pete@astronomer.io if you're passionate about what we're doing and think you'd be a good addition to the team.

Mentioned Resources:

Careers: https://careers.astronomer.io

Guest Profile:

Ben Wisegarver: https://www.linkedin.com/in/ben-wisegarver-54566576]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/quill-file-prod/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/7faf4a20-d2bd-4e9c-b64c-dc1c642e9b78/cover-art/original_ba2596fa087ea1b0b6b968af27f9a9a4.jpg" />
  <pubDate>Thu, 04 Feb 2021 21:48:17 -0500</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="43975620" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/7faf4a20-d2bd-4e9c-b64c-dc1c642e9b78/episode.mp3" />
  <itunes:title><![CDATA[Expanding the Data Engineering Toolkit at Reddit]]></itunes:title>
  <itunes:duration>45:48</itunes:duration>
  <itunes:summary><![CDATA[Welcome back to the Airflow Podcast.

This week, we met up with Ben Wisegarver,  a staff data scientist at Reddit who runs their data warehousing and data engineering functions.

Reddit users generate petabytes of data every day that needs to be processed, stored, and analyzed by a wide breadth of backend services. Our conversation with Ben touches on everything from Airflow as a tool for career mobility across the data stack to scaling out a self-service data architecture across many teams.

For folks interested, our team at Astronomer is growing rapidly and we're on the hunt for new folks to join in a variety of different roles. If you're passionate about Airflow and interested in building the future of data engineering, please get in touch. You can check our current job postings at careers.astronomer.io, but we're constantly updating our listings to accommodate new hiring needs. Please feel free to email me directly at pete@astronomer.io if you're passionate about what we're doing and think you'd be a good addition to the team.

Mentioned Resources:

Careers: https://careers.astronomer.io

Guest Profile:

Ben Wisegarver: https://www.linkedin.com/in/ben-wisegarver-54566576]]></itunes:summary>
  <content:encoded><![CDATA[Welcome back to the Airflow Podcast.

This week, we met up with Ben Wisegarver,  a staff data scientist at Reddit who runs their data warehousing and data engineering functions.

Reddit users generate petabytes of data every day that needs to be processed, stored, and analyzed by a wide breadth of backend services. Our conversation with Ben touches on everything from Airflow as a tool for career mobility across the data stack to scaling out a self-service data architecture across many teams.

For folks interested, our team at Astronomer is growing rapidly and we're on the hunt for new folks to join in a variety of different roles. If you're passionate about Airflow and interested in building the future of data engineering, please get in touch. You can check our current job postings at careers.astronomer.io, but we're constantly updating our listings to accommodate new hiring needs. Please feel free to email me directly at pete@astronomer.io if you're passionate about what we're doing and think you'd be a good addition to the team.

Mentioned Resources:

Careers: https://careers.astronomer.io

Guest Profile:

Ben Wisegarver: https://www.linkedin.com/in/ben-wisegarver-54566576]]></content:encoded>
  <itunes:subtitle><![CDATA[Welcome back to the Airflow Podcast.

This week, we met up with Ben Wisegarver,  a staff data scientist at Reddit who runs their data warehousing and data engineering functions.

Reddit users generate petabytes of data every day that needs to be pr...]]></itunes:subtitle>
  <itunes:explicit>false</itunes:explicit>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[tag:soundcloud,2010:tracks/931274455]]></guid>
  <title><![CDATA[GDPR, Self-Service Data, and Infrastructure Automation with Typeform]]></title>
  <description><![CDATA[Welcome back to the Airflow Podcast.

This week, we met up with Albert Franzi and Carlos Escura from Typeform. Typeform is a tool that allows you to build beautiful interactive forms that you can use for a wide variety of use cases, including customer surveys, employee engagement, product feedback, and market research to name a few.  In our conversation, we discussed Airflow as a tool for GDPR compliance, the concept of self-service data and how it allows your data operations team to function as a data platform team, and some of the more specialized infrastructure tooling that the Typeform team has built out to support their internal teams.

For folks interested, our team at Astronomer is growing rapidly and we're on the hunt for new folks to join in a variety of different roles. If you're passionate about Airflow and interested in building the future of data engineering, please get in touch. You can check our current job postings at careers.astronomer.io, but we're constantly updating our listings to accommodate new hiring needs. Please feel free to email me directly at pete@astronomer.io if you're passionate about what we're doing and think you'd be a good addition to the team.

Mentioned Resources:
Dag Factory: https://github.com/ajbosco/dag-factory
Astronomer Careers: https://careers.astronomer.io

Guest Profiles:
Albert Franzi: https://www.linkedin.com/in/albertfranzi/?originalSubdomain=es
Carlos Escura: https://www.linkedin.com/in/carlosescura/en-us/]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/quill-file-prod/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/ee7d3ff8-3466-4f4b-90df-fcf6c36db32c/cover-art/original_5384435cf48689006abaffad90bdd835.jpg" />
  <pubDate>Wed, 18 Nov 2020 02:57:42 -0500</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="30186285" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/ee7d3ff8-3466-4f4b-90df-fcf6c36db32c/episode.mp3" />
  <itunes:title><![CDATA[GDPR, Self-Service Data, and Infrastructure Automation with Typeform]]></itunes:title>
  <itunes:duration>31:26</itunes:duration>
  <itunes:summary><![CDATA[Welcome back to the Airflow Podcast.

This week, we met up with Albert Franzi and Carlos Escura from Typeform. Typeform is a tool that allows you to build beautiful interactive forms that you can use for a wide variety of use cases, including customer surveys, employee engagement, product feedback, and market research to name a few.  In our conversation, we discussed Airflow as a tool for GDPR compliance, the concept of self-service data and how it allows your data operations team to function as a data platform team, and some of the more specialized infrastructure tooling that the Typeform team has built out to support their internal teams.

For folks interested, our team at Astronomer is growing rapidly and we're on the hunt for new folks to join in a variety of different roles. If you're passionate about Airflow and interested in building the future of data engineering, please get in touch. You can check our current job postings at careers.astronomer.io, but we're constantly updating our listings to accommodate new hiring needs. Please feel free to email me directly at pete@astronomer.io if you're passionate about what we're doing and think you'd be a good addition to the team.

Mentioned Resources:
Dag Factory: https://github.com/ajbosco/dag-factory
Astronomer Careers: https://careers.astronomer.io

Guest Profiles:
Albert Franzi: https://www.linkedin.com/in/albertfranzi/?originalSubdomain=es
Carlos Escura: https://www.linkedin.com/in/carlosescura/en-us/]]></itunes:summary>
  <content:encoded><![CDATA[Welcome back to the Airflow Podcast.

This week, we met up with Albert Franzi and Carlos Escura from Typeform. Typeform is a tool that allows you to build beautiful interactive forms that you can use for a wide variety of use cases, including customer surveys, employee engagement, product feedback, and market research to name a few.  In our conversation, we discussed Airflow as a tool for GDPR compliance, the concept of self-service data and how it allows your data operations team to function as a data platform team, and some of the more specialized infrastructure tooling that the Typeform team has built out to support their internal teams.

For folks interested, our team at Astronomer is growing rapidly and we're on the hunt for new folks to join in a variety of different roles. If you're passionate about Airflow and interested in building the future of data engineering, please get in touch. You can check our current job postings at careers.astronomer.io, but we're constantly updating our listings to accommodate new hiring needs. Please feel free to email me directly at pete@astronomer.io if you're passionate about what we're doing and think you'd be a good addition to the team.

Mentioned Resources:
Dag Factory: https://github.com/ajbosco/dag-factory
Astronomer Careers: https://careers.astronomer.io

Guest Profiles:
Albert Franzi: https://www.linkedin.com/in/albertfranzi/?originalSubdomain=es
Carlos Escura: https://www.linkedin.com/in/carlosescura/en-us/]]></content:encoded>
  <itunes:subtitle><![CDATA[Welcome back to the Airflow Podcast.

This week, we met up with Albert Franzi and Carlos Escura from Typeform. Typeform is a tool that allows you to build beautiful interactive forms that you can use for a wide variety of use cases, including custo...]]></itunes:subtitle>
  <itunes:explicit>false</itunes:explicit>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[tag:soundcloud,2010:tracks/917635843]]></guid>
  <title><![CDATA[Adopting Airflow at Netlify]]></title>
  <description><![CDATA[After a bit of a break, we're back with the third official episode bundle of The Airflow Podcast. In this batch, we'll get a little bit deeper with current Airflow users and maintainers on core fundamental concepts in data engineering, architectures for operating modern data platforms at scale, and the process of maintaining and operating Airflow, specifically as we go through the release process of Airflow 2.0.

This week, we met up with Brian de la Motte and Florian Hines at Netlify. Netlify provides an extremely popular toolset for building and deploying JAMstack sites. They provide hosting services, CI, DNS, authentication, and managed backend tools that help users run and operate static sites at scale.  The team over there recently adopted Airflow to help decouple orchestration logic from a complex collection Spark jobs and are currently in the process of expanding their Airflow footprint to accommodate a broader group of interesting use-cases.

Disclaimer: we get a bit of a surprise about halfway through the episode when Brian tells us that they had recently signed up for Astronomer- we promise that it wasn't a planted ad :).

Please contact pete@astronomer.io if you'd like to get in touch regarding future episodes. Hope you enjoy!

Guest Profiles:
Brian de la Motte: https://www.linkedin.com/in/brian-de-la-motte/
Florian Hines: https://www.linkedin.com/in/florianhines/]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/quill-file-prod/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/9a3b401a-4de3-45f5-8660-e4518a3ff7dc/cover-art/original_57375f70613bfaed7f879b8c60def23e.jpg" />
  <pubDate>Mon, 26 Oct 2020 03:42:01 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="27449096" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/9a3b401a-4de3-45f5-8660-e4518a3ff7dc/episode.mp3" />
  <itunes:title><![CDATA[Adopting Airflow at Netlify]]></itunes:title>
  <itunes:duration>28:35</itunes:duration>
  <itunes:summary><![CDATA[After a bit of a break, we're back with the third official episode bundle of The Airflow Podcast. In this batch, we'll get a little bit deeper with current Airflow users and maintainers on core fundamental concepts in data engineering, architectures for operating modern data platforms at scale, and the process of maintaining and operating Airflow, specifically as we go through the release process of Airflow 2.0.

This week, we met up with Brian de la Motte and Florian Hines at Netlify. Netlify provides an extremely popular toolset for building and deploying JAMstack sites. They provide hosting services, CI, DNS, authentication, and managed backend tools that help users run and operate static sites at scale.  The team over there recently adopted Airflow to help decouple orchestration logic from a complex collection Spark jobs and are currently in the process of expanding their Airflow footprint to accommodate a broader group of interesting use-cases.

Disclaimer: we get a bit of a surprise about halfway through the episode when Brian tells us that they had recently signed up for Astronomer- we promise that it wasn't a planted ad :).

Please contact pete@astronomer.io if you'd like to get in touch regarding future episodes. Hope you enjoy!

Guest Profiles:
Brian de la Motte: https://www.linkedin.com/in/brian-de-la-motte/
Florian Hines: https://www.linkedin.com/in/florianhines/]]></itunes:summary>
  <content:encoded><![CDATA[After a bit of a break, we're back with the third official episode bundle of The Airflow Podcast. In this batch, we'll get a little bit deeper with current Airflow users and maintainers on core fundamental concepts in data engineering, architectures for operating modern data platforms at scale, and the process of maintaining and operating Airflow, specifically as we go through the release process of Airflow 2.0.

This week, we met up with Brian de la Motte and Florian Hines at Netlify. Netlify provides an extremely popular toolset for building and deploying JAMstack sites. They provide hosting services, CI, DNS, authentication, and managed backend tools that help users run and operate static sites at scale.  The team over there recently adopted Airflow to help decouple orchestration logic from a complex collection Spark jobs and are currently in the process of expanding their Airflow footprint to accommodate a broader group of interesting use-cases.

Disclaimer: we get a bit of a surprise about halfway through the episode when Brian tells us that they had recently signed up for Astronomer- we promise that it wasn't a planted ad :).

Please contact pete@astronomer.io if you'd like to get in touch regarding future episodes. Hope you enjoy!

Guest Profiles:
Brian de la Motte: https://www.linkedin.com/in/brian-de-la-motte/
Florian Hines: https://www.linkedin.com/in/florianhines/]]></content:encoded>
  <itunes:subtitle><![CDATA[After a bit of a break, we're back with the third official episode bundle of The Airflow Podcast. In this batch, we'll get a little bit deeper with current Airflow users and maintainers on core fundamental concepts in data engineering, architecture...]]></itunes:subtitle>
  <itunes:explicit>false</itunes:explicit>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[tag:soundcloud,2010:tracks/716910052]]></guid>
  <title><![CDATA[The Road to Airflow 2.0]]></title>
  <description><![CDATA[This week, we linked up with Airflow release manager, core committer, and Astronomer platform engineer Ash Berlin-Taylor to discuss the Airflow 2.0 roadmap [1]. There is some great stuff in the works around performance, autoscaling, and usability that we're excited about. In this episode, Ash lends his thoughts on the design, implementation, and value-add around all of the upcoming features, including:
 - The Knative Executor
 - A modern and real-time UI
 - A production-grade API
 - Improved scheduler and webserver performance
 - An official production Docker image for Airflow

We hope you enjoy! Please email pete@astronomer.io if you have thoughts on topics you'd like to see covered in future episodes.

Separately, some good folks from the Airflow community are running a user survey that will help collect some useful information around the Airflow UX. If you have five minutes to spare, filling out the following form will help the core Airflow committers to shape the project roadmap: https://forms.gle/XAzR1pQBZiftvPQM7 

[1] https://cwiki.apache.org/confluence/display/AIRFLOW/Airflow+2.0]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/quill-file-prod/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/fb4b2489-4643-40bc-90d8-8d1f7ba851a3/cover-art/original_f9a9661b1fdc1e84d13ac6f3b63392c0.jpg" />
  <pubDate>Fri, 22 Nov 2019 15:17:19 -0500</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="53956283" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/fb4b2489-4643-40bc-90d8-8d1f7ba851a3/episode.m4a" />
  <itunes:title><![CDATA[The Road to Airflow 2.0]]></itunes:title>
  <itunes:duration>32:28</itunes:duration>
  <itunes:summary><![CDATA[This week, we linked up with Airflow release manager, core committer, and Astronomer platform engineer Ash Berlin-Taylor to discuss the Airflow 2.0 roadmap [1]. There is some great stuff in the works around performance, autoscaling, and usability that we're excited about. In this episode, Ash lends his thoughts on the design, implementation, and value-add around all of the upcoming features, including:
 - The Knative Executor
 - A modern and real-time UI
 - A production-grade API
 - Improved scheduler and webserver performance
 - An official production Docker image for Airflow

We hope you enjoy! Please email pete@astronomer.io if you have thoughts on topics you'd like to see covered in future episodes.

Separately, some good folks from the Airflow community are running a user survey that will help collect some useful information around the Airflow UX. If you have five minutes to spare, filling out the following form will help the core Airflow committers to shape the project roadmap: https://forms.gle/XAzR1pQBZiftvPQM7 

[1] https://cwiki.apache.org/confluence/display/AIRFLOW/Airflow+2.0]]></itunes:summary>
  <content:encoded><![CDATA[This week, we linked up with Airflow release manager, core committer, and Astronomer platform engineer Ash Berlin-Taylor to discuss the Airflow 2.0 roadmap [1]. There is some great stuff in the works around performance, autoscaling, and usability that we're excited about. In this episode, Ash lends his thoughts on the design, implementation, and value-add around all of the upcoming features, including:
 - The Knative Executor
 - A modern and real-time UI
 - A production-grade API
 - Improved scheduler and webserver performance
 - An official production Docker image for Airflow

We hope you enjoy! Please email pete@astronomer.io if you have thoughts on topics you'd like to see covered in future episodes.

Separately, some good folks from the Airflow community are running a user survey that will help collect some useful information around the Airflow UX. If you have five minutes to spare, filling out the following form will help the core Airflow committers to shape the project roadmap: https://forms.gle/XAzR1pQBZiftvPQM7 

[1] https://cwiki.apache.org/confluence/display/AIRFLOW/Airflow+2.0]]></content:encoded>
  <itunes:subtitle><![CDATA[This week, we linked up with Airflow release manager, core committer, and Astronomer platform engineer Ash Berlin-Taylor to discuss the Airflow 2.0 roadmap [1]. There is some great stuff in the works around performance, autoscaling, and usability t...]]></itunes:subtitle>
  <itunes:explicit>false</itunes:explicit>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[tag:soundcloud,2010:tracks/697700581]]></guid>
  <title><![CDATA[Airflow Breeze]]></title>
  <description><![CDATA[This week, we had the pleasure of meeting up with Jarek Potiuk, Principal Software Engineer at Polidea and Apache Airflow committer, to discuss his most recent contribution to the community, Airflow Breeze. Jarek deeply values developer productivity and realized while building a team of Airflow committers that, in order to open a PR on the project, passing unit tests and waiting for the CI build was a cumbersome process that could take up to a few hours. Breeze seeks to improve that experience for Airflow committers and lower the barrier-to-entry of contribution for folks that are new to the open-source community.

You can read more about Airflow Breeze here: https://www.polidea.com/blog/its-a-breeze-to-develop-apache-airflow/#the-apache-airflow-projects-setup]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/quill-file-prod/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/88889364-59f9-4883-908a-de3b66cc83f5/cover-art/original_695c0cf5307bbb0a3e6e5fff6c5ce448.jpg" />
  <pubDate>Thu, 17 Oct 2019 13:04:56 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="84090099" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/88889364-59f9-4883-908a-de3b66cc83f5/episode.m4a" />
  <itunes:title><![CDATA[Airflow Breeze]]></itunes:title>
  <itunes:duration>46:57</itunes:duration>
  <itunes:summary><![CDATA[This week, we had the pleasure of meeting up with Jarek Potiuk, Principal Software Engineer at Polidea and Apache Airflow committer, to discuss his most recent contribution to the community, Airflow Breeze. Jarek deeply values developer productivity and realized while building a team of Airflow committers that, in order to open a PR on the project, passing unit tests and waiting for the CI build was a cumbersome process that could take up to a few hours. Breeze seeks to improve that experience for Airflow committers and lower the barrier-to-entry of contribution for folks that are new to the open-source community.

You can read more about Airflow Breeze here: https://www.polidea.com/blog/its-a-breeze-to-develop-apache-airflow/#the-apache-airflow-projects-setup]]></itunes:summary>
  <content:encoded><![CDATA[This week, we had the pleasure of meeting up with Jarek Potiuk, Principal Software Engineer at Polidea and Apache Airflow committer, to discuss his most recent contribution to the community, Airflow Breeze. Jarek deeply values developer productivity and realized while building a team of Airflow committers that, in order to open a PR on the project, passing unit tests and waiting for the CI build was a cumbersome process that could take up to a few hours. Breeze seeks to improve that experience for Airflow committers and lower the barrier-to-entry of contribution for folks that are new to the open-source community.

You can read more about Airflow Breeze here: https://www.polidea.com/blog/its-a-breeze-to-develop-apache-airflow/#the-apache-airflow-projects-setup]]></content:encoded>
  <itunes:subtitle><![CDATA[This week, we had the pleasure of meeting up with Jarek Potiuk, Principal Software Engineer at Polidea and Apache Airflow committer, to discuss his most recent contribution to the community, Airflow Breeze. Jarek deeply values developer productivit...]]></itunes:subtitle>
  <itunes:explicit>false</itunes:explicit>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[tag:soundcloud,2010:tracks/666160796]]></guid>
  <title><![CDATA[Open Source and Airflow at Google]]></title>
  <description><![CDATA[This episode kicks off season 2 of The Airflow Podcast. In this next season, we'll focus on the future of Airflow and chat with leading members of the community to paint a picture of what's to come. We're pumped to be diving back into this project and look forward to the great conversations we have lined up.

This week, we chatted with James Malone, Product Manager of Google's Cloud Composer. James had some interesting things to say about open source at Google and where his team plans on contributing most to the project going forward.

As always, thanks for listening and please email pete@astronomer.io if you have any feedback or would like to be considered as a guest.]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/quill-file-prod/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/159974ee-5cf9-47c8-8703-cfbeb78e4d4a/cover-art/original_55270132e5aa1d78ef3450c436abb34c.jpg" />
  <pubDate>Thu, 15 Aug 2019 14:22:53 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="70292509" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/159974ee-5cf9-47c8-8703-cfbeb78e4d4a/episode.m4a" />
  <itunes:title><![CDATA[Open Source and Airflow at Google]]></itunes:title>
  <itunes:duration>39:03</itunes:duration>
  <itunes:summary><![CDATA[This episode kicks off season 2 of The Airflow Podcast. In this next season, we'll focus on the future of Airflow and chat with leading members of the community to paint a picture of what's to come. We're pumped to be diving back into this project and look forward to the great conversations we have lined up.

This week, we chatted with James Malone, Product Manager of Google's Cloud Composer. James had some interesting things to say about open source at Google and where his team plans on contributing most to the project going forward.

As always, thanks for listening and please email pete@astronomer.io if you have any feedback or would like to be considered as a guest.]]></itunes:summary>
  <content:encoded><![CDATA[This episode kicks off season 2 of The Airflow Podcast. In this next season, we'll focus on the future of Airflow and chat with leading members of the community to paint a picture of what's to come. We're pumped to be diving back into this project and look forward to the great conversations we have lined up.

This week, we chatted with James Malone, Product Manager of Google's Cloud Composer. James had some interesting things to say about open source at Google and where his team plans on contributing most to the project going forward.

As always, thanks for listening and please email pete@astronomer.io if you have any feedback or would like to be considered as a guest.]]></content:encoded>
  <itunes:subtitle><![CDATA[This episode kicks off season 2 of The Airflow Podcast. In this next season, we'll focus on the future of Airflow and chat with leading members of the community to paint a picture of what's to come. We're pumped to be diving back into this project ...]]></itunes:subtitle>
  <itunes:explicit>false</itunes:explicit>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[tag:soundcloud,2010:tracks/546216096]]></guid>
  <title><![CDATA[Airflow 1.10 Release]]></title>
  <description><![CDATA[This week, we met up with Ash Berlin-Taylor to discuss the recent 1.10 release, what it's like to be a release manager for an open source project, Airflow's bid to graduate from incubating status, and the next phase of Airflow project development.

As mentioned in our podcast intro, we at Astronomer are hiring Data Engineers who are passionate about contributing to open source and making Airflow great. Please shoot us an email at humans@astronomer.io if you're interested in hearing more about the fully-remote opportunity.

Check us out at www.astronomer.io if you're interested in seeing a demo of our platform.]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/quill-file-prod/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/8dd7729a-46de-43b9-8a1e-1bc490132c47/cover-art/original_caaebf062a946e9d75cc9059fbc2033d.jpg" />
  <pubDate>Mon, 17 Dec 2018 14:46:16 -0500</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="56606547" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/8dd7729a-46de-43b9-8a1e-1bc490132c47/episode.m4a" />
  <itunes:title><![CDATA[Airflow 1.10 Release]]></itunes:title>
  <itunes:duration>29:03</itunes:duration>
  <itunes:summary><![CDATA[This week, we met up with Ash Berlin-Taylor to discuss the recent 1.10 release, what it's like to be a release manager for an open source project, Airflow's bid to graduate from incubating status, and the next phase of Airflow project development.

As mentioned in our podcast intro, we at Astronomer are hiring Data Engineers who are passionate about contributing to open source and making Airflow great. Please shoot us an email at humans@astronomer.io if you're interested in hearing more about the fully-remote opportunity.

Check us out at www.astronomer.io if you're interested in seeing a demo of our platform.]]></itunes:summary>
  <content:encoded><![CDATA[This week, we met up with Ash Berlin-Taylor to discuss the recent 1.10 release, what it's like to be a release manager for an open source project, Airflow's bid to graduate from incubating status, and the next phase of Airflow project development.

As mentioned in our podcast intro, we at Astronomer are hiring Data Engineers who are passionate about contributing to open source and making Airflow great. Please shoot us an email at humans@astronomer.io if you're interested in hearing more about the fully-remote opportunity.

Check us out at www.astronomer.io if you're interested in seeing a demo of our platform.]]></content:encoded>
  <itunes:subtitle><![CDATA[This week, we met up with Ash Berlin-Taylor to discuss the recent 1.10 release, what it's like to be a release manager for an open source project, Airflow's bid to graduate from incubating status, and the next phase of Airflow project development.
...]]></itunes:subtitle>
  <itunes:explicit>false</itunes:explicit>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[tag:soundcloud,2010:tracks/485906751]]></guid>
  <title><![CDATA[Role-Based Access Control (RBAC)]]></title>
  <description><![CDATA[This time, we met up with WePay's Joy Gao to talk through her work on the RBAC components in the recent Airflow 1.10 release. We dove deep into what inspired her work and took some time to discuss what it's like to be a woman contributing to a predominately male open-source community. Hope you enjoy!

If you'd like to get started using Airflow in your org, check out our recently-launched Spacecamp program here: https://www.astronomer.io/spacecamp

Feel free to email me at pete@astronomer.io with any feedback or if you'd like to be considered as a guest!]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/quill-file-prod/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/0b1b1480-0e87-48e2-aa61-76d85eaeab1d/cover-art/original_717f2d54abc8cae905b7e5e72441f331.jpg" />
  <pubDate>Wed, 15 Aug 2018 15:06:32 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="67227865" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/0b1b1480-0e87-48e2-aa61-76d85eaeab1d/episode.m4a" />
  <itunes:title><![CDATA[Role-Based Access Control (RBAC)]]></itunes:title>
  <itunes:duration>37:28</itunes:duration>
  <itunes:summary><![CDATA[This time, we met up with WePay's Joy Gao to talk through her work on the RBAC components in the recent Airflow 1.10 release. We dove deep into what inspired her work and took some time to discuss what it's like to be a woman contributing to a predominately male open-source community. Hope you enjoy!

If you'd like to get started using Airflow in your org, check out our recently-launched Spacecamp program here: https://www.astronomer.io/spacecamp

Feel free to email me at pete@astronomer.io with any feedback or if you'd like to be considered as a guest!]]></itunes:summary>
  <content:encoded><![CDATA[This time, we met up with WePay's Joy Gao to talk through her work on the RBAC components in the recent Airflow 1.10 release. We dove deep into what inspired her work and took some time to discuss what it's like to be a woman contributing to a predominately male open-source community. Hope you enjoy!

If you'd like to get started using Airflow in your org, check out our recently-launched Spacecamp program here: https://www.astronomer.io/spacecamp

Feel free to email me at pete@astronomer.io with any feedback or if you'd like to be considered as a guest!]]></content:encoded>
  <itunes:subtitle><![CDATA[This time, we met up with WePay's Joy Gao to talk through her work on the RBAC components in the recent Airflow 1.10 release. We dove deep into what inspired her work and took some time to discuss what it's like to be a woman contributing to a pred...]]></itunes:subtitle>
  <itunes:explicit>false</itunes:explicit>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[tag:soundcloud,2010:tracks/465172242]]></guid>
  <title><![CDATA[Airflow on Kubernetes]]></title>
  <description><![CDATA[In this episode, we dove into the relationship between Airflow and Kuberenetes and interviewed Daniel Imberman, Senior Software Engineer at Bloomberg (1:30), and Greg Neiheisel, CTO here at Astronomer (37:31). Daniel has done most of the work on the Kubernetes executor for Airflow and Greg plans to take on a chunk of the development going forward, so it was really interesting to hear both of their perspectives on the project. Enjoy!]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/quill-file-prod/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/6bfd96e9-1d0d-4bc0-92e5-96beee7df84b/cover-art/original_e80bf1eab1e31fffbf4dce847831eaf3.jpg" />
  <pubDate>Fri, 29 Jun 2018 17:00:24 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="99924134" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/6bfd96e9-1d0d-4bc0-92e5-96beee7df84b/episode.m4a" />
  <itunes:title><![CDATA[Airflow on Kubernetes]]></itunes:title>
  <itunes:duration>55:11</itunes:duration>
  <itunes:summary><![CDATA[In this episode, we dove into the relationship between Airflow and Kuberenetes and interviewed Daniel Imberman, Senior Software Engineer at Bloomberg (1:30), and Greg Neiheisel, CTO here at Astronomer (37:31). Daniel has done most of the work on the Kubernetes executor for Airflow and Greg plans to take on a chunk of the development going forward, so it was really interesting to hear both of their perspectives on the project. Enjoy!]]></itunes:summary>
  <content:encoded><![CDATA[In this episode, we dove into the relationship between Airflow and Kuberenetes and interviewed Daniel Imberman, Senior Software Engineer at Bloomberg (1:30), and Greg Neiheisel, CTO here at Astronomer (37:31). Daniel has done most of the work on the Kubernetes executor for Airflow and Greg plans to take on a chunk of the development going forward, so it was really interesting to hear both of their perspectives on the project. Enjoy!]]></content:encoded>
  <itunes:subtitle><![CDATA[In this episode, we dove into the relationship between Airflow and Kuberenetes and interviewed Daniel Imberman, Senior Software Engineer at Bloomberg (1:30), and Greg Neiheisel, CTO here at Astronomer (37:31). Daniel has done most of the work on th...]]></itunes:subtitle>
  <itunes:explicit>false</itunes:explicit>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[tag:soundcloud,2010:tracks/451774209]]></guid>
  <title><![CDATA[Pain Points]]></title>
  <description><![CDATA[This week, we’ll examine conversations with both old guests and new to paint a comprehensive picture of Airflow’s pain points. While we still undoubtedly believe that Airflow is the future of ETL, it’s important to acknowledge that any incubating project will have issues, and bringing those issues to the forefront of the community’s attention will help shape the future of the project.

We’ll talk with Thomas La Piana (1:36), Data Engineer at OrderMyGear, Frank Hsu (14:20), Data Engineer at mines.io, and Alan Cruickshank (27:41), business insights and data manager at tails.com. 

Check out our open-source library of Airflow plugins at github.com/airflow-plugins, and feel free to contribute anything that you've been working on!

If you're interested in being on the podcast or have any feedback on how you think we could make it better, shoot me an email at pete@astronomer.io]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/quill-file-prod/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/86cbae72-5d07-494c-be97-d55aea3f3911/cover-art/original_82c49857bea4202a4a1752015e17134b.jpg" />
  <pubDate>Thu, 31 May 2018 15:45:13 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="68883771" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/86cbae72-5d07-494c-be97-d55aea3f3911/episode.m4a" />
  <itunes:title><![CDATA[Pain Points]]></itunes:title>
  <itunes:duration>38:57</itunes:duration>
  <itunes:summary><![CDATA[This week, we’ll examine conversations with both old guests and new to paint a comprehensive picture of Airflow’s pain points. While we still undoubtedly believe that Airflow is the future of ETL, it’s important to acknowledge that any incubating project will have issues, and bringing those issues to the forefront of the community’s attention will help shape the future of the project.

We’ll talk with Thomas La Piana (1:36), Data Engineer at OrderMyGear, Frank Hsu (14:20), Data Engineer at mines.io, and Alan Cruickshank (27:41), business insights and data manager at tails.com. 

Check out our open-source library of Airflow plugins at github.com/airflow-plugins, and feel free to contribute anything that you've been working on!

If you're interested in being on the podcast or have any feedback on how you think we could make it better, shoot me an email at pete@astronomer.io]]></itunes:summary>
  <content:encoded><![CDATA[This week, we’ll examine conversations with both old guests and new to paint a comprehensive picture of Airflow’s pain points. While we still undoubtedly believe that Airflow is the future of ETL, it’s important to acknowledge that any incubating project will have issues, and bringing those issues to the forefront of the community’s attention will help shape the future of the project.

We’ll talk with Thomas La Piana (1:36), Data Engineer at OrderMyGear, Frank Hsu (14:20), Data Engineer at mines.io, and Alan Cruickshank (27:41), business insights and data manager at tails.com. 

Check out our open-source library of Airflow plugins at github.com/airflow-plugins, and feel free to contribute anything that you've been working on!

If you're interested in being on the podcast or have any feedback on how you think we could make it better, shoot me an email at pete@astronomer.io]]></content:encoded>
  <itunes:subtitle><![CDATA[This week, we’ll examine conversations with both old guests and new to paint a comprehensive picture of Airflow’s pain points. While we still undoubtedly believe that Airflow is the future of ETL, it’s important to acknowledge that any incubating p...]]></itunes:subtitle>
  <itunes:explicit>false</itunes:explicit>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[tag:soundcloud,2010:tracks/429177501]]></guid>
  <title><![CDATA[Competitors: Luigi]]></title>
  <description><![CDATA[On this episode, we linked up with Erik Bernhardsson (@erikbern), creator of Luigi and CTO of Better Mortgage. We chatted about everything from the motivations behind Luigi's creation and his current thoughts on Airflow- we hope you enjoy!

Check out:
- Erik's blog at erikbern.com
- Our open-source library of Airflow plugins at github.com/airflow-plugins

All podcast feedback is hugely appreciated- feel free to email me at pete@astronomer.io if you have any thoughts.]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/quill-file-prod/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/df5e858a-9106-44ed-b779-df30c5ddedd8/cover-art/original_a450bbd315a07fb3cb577b7df7e490b3.jpg" />
  <pubDate>Fri, 13 Apr 2018 14:41:01 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="48061883" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/df5e858a-9106-44ed-b779-df30c5ddedd8/episode.m4a" />
  <itunes:title><![CDATA[Competitors: Luigi]]></itunes:title>
  <itunes:duration>27:39</itunes:duration>
  <itunes:summary><![CDATA[On this episode, we linked up with Erik Bernhardsson (@erikbern), creator of Luigi and CTO of Better Mortgage. We chatted about everything from the motivations behind Luigi's creation and his current thoughts on Airflow- we hope you enjoy!

Check out:
- Erik's blog at erikbern.com
- Our open-source library of Airflow plugins at github.com/airflow-plugins

All podcast feedback is hugely appreciated- feel free to email me at pete@astronomer.io if you have any thoughts.]]></itunes:summary>
  <content:encoded><![CDATA[On this episode, we linked up with Erik Bernhardsson (@erikbern), creator of Luigi and CTO of Better Mortgage. We chatted about everything from the motivations behind Luigi's creation and his current thoughts on Airflow- we hope you enjoy!

Check out:
- Erik's blog at erikbern.com
- Our open-source library of Airflow plugins at github.com/airflow-plugins

All podcast feedback is hugely appreciated- feel free to email me at pete@astronomer.io if you have any thoughts.]]></content:encoded>
  <itunes:subtitle><![CDATA[On this episode, we linked up with Erik Bernhardsson (@erikbern), creator of Luigi and CTO of Better Mortgage. We chatted about everything from the motivations behind Luigi's creation and his current thoughts on Airflow- we hope you enjoy!

Check o...]]></itunes:subtitle>
  <itunes:explicit>false</itunes:explicit>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[tag:soundcloud,2010:tracks/417039171]]></guid>
  <title><![CDATA[Best Practices]]></title>
  <description><![CDATA[In this episode, we dive into Airflow Best Practices and include longer portions of interviews with Alan Cruickshank (1:30), Business Insights and Data Manager at Tails.com, Chris Riccomini (7:27), Principal Software Engineer at WePay, and Bolke de Bruin(31:45), Head of Advanced Analytics Technology at ING. Hope you enjoy!

We're still working to get better at podcasting, so please send over any feedback to pete@astronomer.io.  We really appreciate hearing what the community has to say, and your feedback is hugely helpful in making us better.

If you're interested in Astronomer Spacecamp, a guided Airflow development course, you can find more info on that here:
https://www.astronomer.io/blog/announcing-astronomer-spacecamp/

We also launched our Managed Airflow on Product Hunt last week- you can check that out here:
https://www.producthunt.com/posts/apache-airflow-on-astronomer

Thanks so much for listening!]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/quill-file-prod/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/b3658851-ac24-4f0e-92c9-86498d967222/cover-art/original_c1240e6d5142678822c65027e4989bdb.jpg" />
  <pubDate>Wed, 21 Mar 2018 13:33:17 -0400</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="102729409" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/b3658851-ac24-4f0e-92c9-86498d967222/episode.m4a" />
  <itunes:title><![CDATA[Best Practices]]></itunes:title>
  <itunes:duration>58:31</itunes:duration>
  <itunes:summary><![CDATA[In this episode, we dive into Airflow Best Practices and include longer portions of interviews with Alan Cruickshank (1:30), Business Insights and Data Manager at Tails.com, Chris Riccomini (7:27), Principal Software Engineer at WePay, and Bolke de Bruin(31:45), Head of Advanced Analytics Technology at ING. Hope you enjoy!

We're still working to get better at podcasting, so please send over any feedback to pete@astronomer.io.  We really appreciate hearing what the community has to say, and your feedback is hugely helpful in making us better.

If you're interested in Astronomer Spacecamp, a guided Airflow development course, you can find more info on that here:
https://www.astronomer.io/blog/announcing-astronomer-spacecamp/

We also launched our Managed Airflow on Product Hunt last week- you can check that out here:
https://www.producthunt.com/posts/apache-airflow-on-astronomer

Thanks so much for listening!]]></itunes:summary>
  <content:encoded><![CDATA[In this episode, we dive into Airflow Best Practices and include longer portions of interviews with Alan Cruickshank (1:30), Business Insights and Data Manager at Tails.com, Chris Riccomini (7:27), Principal Software Engineer at WePay, and Bolke de Bruin(31:45), Head of Advanced Analytics Technology at ING. Hope you enjoy!

We're still working to get better at podcasting, so please send over any feedback to pete@astronomer.io.  We really appreciate hearing what the community has to say, and your feedback is hugely helpful in making us better.

If you're interested in Astronomer Spacecamp, a guided Airflow development course, you can find more info on that here:
https://www.astronomer.io/blog/announcing-astronomer-spacecamp/

We also launched our Managed Airflow on Product Hunt last week- you can check that out here:
https://www.producthunt.com/posts/apache-airflow-on-astronomer

Thanks so much for listening!]]></content:encoded>
  <itunes:subtitle><![CDATA[In this episode, we dive into Airflow Best Practices and include longer portions of interviews with Alan Cruickshank (1:30), Business Insights and Data Manager at Tails.com, Chris Riccomini (7:27), Principal Software Engineer at WePay, and Bolke de...]]></itunes:subtitle>
  <itunes:explicit>false</itunes:explicit>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[tag:soundcloud,2010:tracks/403744467]]></guid>
  <title><![CDATA[Use Cases]]></title>
  <description><![CDATA[Episode 2 of The Airflow Podcast is here to discuss six specific use cases that we’ve seen for Apache Airflow. Here’s the lineup:

Patrick Atwater (@patwater), Water Data Projects Manager at ARGO Labs: 2:03-5:35
Maksime Pecherskiy (@mrmaksimize), CDO of San Diego: 5:35-23:06
Scott Halgrim (@shalgrim), Data Engineer at Zapier: 23:06-27:27
Bolke de Bruin (@bolke2028), Head of Advanced Analytics at ING: 27:27-39:46
Chris Riccomini (@criccomini), Principal Software Engineer at WePay: 39:46-54:20
Ben Gregory (@benbeingbin), Data Engineer (and noted craft soda enthusiast) at Astronomer: 54:20-1:14:38

Contribute to our open-source library of Airflow plugins at github.com/airflow-plugins
Contact us at www.astronomer.io if you’re interested in Spacecamp: A guided development program to get your team up and running on Airflow.]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/quill-file-prod/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/af2cc87a-3872-4c5c-a022-7a9b3b44778a/cover-art/original_0f1589f41c90691deed4b348347a37ef.jpg" />
  <pubDate>Thu, 22 Feb 2018 22:41:25 -0500</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="127294941" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/af2cc87a-3872-4c5c-a022-7a9b3b44778a/episode.m4a" />
  <itunes:title><![CDATA[Use Cases]]></itunes:title>
  <itunes:duration>1:15:47</itunes:duration>
  <itunes:summary><![CDATA[Episode 2 of The Airflow Podcast is here to discuss six specific use cases that we’ve seen for Apache Airflow. Here’s the lineup:

Patrick Atwater (@patwater), Water Data Projects Manager at ARGO Labs: 2:03-5:35
Maksime Pecherskiy (@mrmaksimize), CDO of San Diego: 5:35-23:06
Scott Halgrim (@shalgrim), Data Engineer at Zapier: 23:06-27:27
Bolke de Bruin (@bolke2028), Head of Advanced Analytics at ING: 27:27-39:46
Chris Riccomini (@criccomini), Principal Software Engineer at WePay: 39:46-54:20
Ben Gregory (@benbeingbin), Data Engineer (and noted craft soda enthusiast) at Astronomer: 54:20-1:14:38

Contribute to our open-source library of Airflow plugins at github.com/airflow-plugins
Contact us at www.astronomer.io if you’re interested in Spacecamp: A guided development program to get your team up and running on Airflow.]]></itunes:summary>
  <content:encoded><![CDATA[Episode 2 of The Airflow Podcast is here to discuss six specific use cases that we’ve seen for Apache Airflow. Here’s the lineup:

Patrick Atwater (@patwater), Water Data Projects Manager at ARGO Labs: 2:03-5:35
Maksime Pecherskiy (@mrmaksimize), CDO of San Diego: 5:35-23:06
Scott Halgrim (@shalgrim), Data Engineer at Zapier: 23:06-27:27
Bolke de Bruin (@bolke2028), Head of Advanced Analytics at ING: 27:27-39:46
Chris Riccomini (@criccomini), Principal Software Engineer at WePay: 39:46-54:20
Ben Gregory (@benbeingbin), Data Engineer (and noted craft soda enthusiast) at Astronomer: 54:20-1:14:38

Contribute to our open-source library of Airflow plugins at github.com/airflow-plugins
Contact us at www.astronomer.io if you’re interested in Spacecamp: A guided development program to get your team up and running on Airflow.]]></content:encoded>
  <itunes:subtitle><![CDATA[Episode 2 of The Airflow Podcast is here to discuss six specific use cases that we’ve seen for Apache Airflow. Here’s the lineup:

Patrick Atwater (@patwater), Water Data Projects Manager at ARGO Labs: 2:03-5:35
Maksime Pecherskiy (@mrmaksimize), C...]]></itunes:subtitle>
  <itunes:explicit>false</itunes:explicit>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[tag:soundcloud,2010:tracks/395373555]]></guid>
  <title><![CDATA[The Origins of Airflow]]></title>
  <description><![CDATA[For the first episode of the Airflow Podcast, we met up with Maxime Beauchemin, creator of Airflow, to explore the motivations behind its creation and the problems it was designed to solve. We asked Maxime for his definition of Airflow, the design principles behind hook/operator use, and his vision for the project.

Speaker list:
Pete DeJoy - Product at Astronomer
Viraj Parekh - Data Engineer at Astronomer
Maxime Beauchemin - Software Engineer at Lyft, creator of Airflow

Talk mentioned at the end of the podcast- Advanced Data Engineering Patterns with Apache Airflow: http://www.ustream.tv/recorded/109227704

Maxime's Blog: https://medium.com/@maximebeauchemin]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/quill-file-prod/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/f6583e55-5c27-44ec-b6d7-b3c037882f08/cover-art/original_f41c8d0cf7d1a9a231a0d75ae563fe83.jpg" />
  <pubDate>Tue, 06 Feb 2018 16:15:58 -0500</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="80556172" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/f6583e55-5c27-44ec-b6d7-b3c037882f08/episode.m4a" />
  <itunes:title><![CDATA[The Origins of Airflow]]></itunes:title>
  <itunes:duration>45:09</itunes:duration>
  <itunes:summary><![CDATA[For the first episode of the Airflow Podcast, we met up with Maxime Beauchemin, creator of Airflow, to explore the motivations behind its creation and the problems it was designed to solve. We asked Maxime for his definition of Airflow, the design principles behind hook/operator use, and his vision for the project.

Speaker list:
Pete DeJoy - Product at Astronomer
Viraj Parekh - Data Engineer at Astronomer
Maxime Beauchemin - Software Engineer at Lyft, creator of Airflow

Talk mentioned at the end of the podcast- Advanced Data Engineering Patterns with Apache Airflow: http://www.ustream.tv/recorded/109227704

Maxime's Blog: https://medium.com/@maximebeauchemin]]></itunes:summary>
  <content:encoded><![CDATA[For the first episode of the Airflow Podcast, we met up with Maxime Beauchemin, creator of Airflow, to explore the motivations behind its creation and the problems it was designed to solve. We asked Maxime for his definition of Airflow, the design principles behind hook/operator use, and his vision for the project.

Speaker list:
Pete DeJoy - Product at Astronomer
Viraj Parekh - Data Engineer at Astronomer
Maxime Beauchemin - Software Engineer at Lyft, creator of Airflow

Talk mentioned at the end of the podcast- Advanced Data Engineering Patterns with Apache Airflow: http://www.ustream.tv/recorded/109227704

Maxime's Blog: https://medium.com/@maximebeauchemin]]></content:encoded>
  <itunes:subtitle><![CDATA[For the first episode of the Airflow Podcast, we met up with Maxime Beauchemin, creator of Airflow, to explore the motivations behind its creation and the problems it was designed to solve. We asked Maxime for his definition of Airflow, the design ...]]></itunes:subtitle>
  <itunes:explicit>false</itunes:explicit>
</item>
<item>
  <guid isPermaLink="false"><![CDATA[tag:soundcloud,2010:tracks/385626638]]></guid>
  <title><![CDATA[Season One Teaser]]></title>
  <description><![CDATA[A sneak peek at our upcoming podcast about Apache Airflow.

Featured in this clip (in order of appearance):
Pete DeJoy - Product Specialist at Astronomer
Patrick Atwater - Water Data Projects Manager at ARGO Labs
Maksime Pecherskiy - Chief Data Officer of the City of San Diego
Bolke de Bruin - Head of Advanced Analytics at ING]]></description>
  <itunes:image href="https://files.cohostpodcasting.com/quill-file-prod/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/shows/50fd8116-be73-4852-be37-94d8bc0082fa/episodes/fcc3d4fb-dbc0-4b38-9c2f-18b0dfcd0579/cover-art/original_d1f3ffb74d2f0f4e64ca93c247ffde2a.jpg" />
  <pubDate>Thu, 18 Jan 2018 20:25:34 -0500</pubDate>
  <link>https://airflow.apache.org/</link>
  <author><![CDATA[support@contentallies.com (Astronomer)]]></author>
  <enclosure length="5889301" type="audio/mpeg" url="https://audio-delivery.cohostpodcasting.com/audio/dbcdfaae-58e3-4bce-9cf7-fd6dbc27a8f5/episodes/fcc3d4fb-dbc0-4b38-9c2f-18b0dfcd0579/episode.m4a" />
  <itunes:title><![CDATA[Season One Teaser]]></itunes:title>
  <itunes:duration>3:02</itunes:duration>
  <itunes:summary><![CDATA[A sneak peek at our upcoming podcast about Apache Airflow.

Featured in this clip (in order of appearance):
Pete DeJoy - Product Specialist at Astronomer
Patrick Atwater - Water Data Projects Manager at ARGO Labs
Maksime Pecherskiy - Chief Data Officer of the City of San Diego
Bolke de Bruin - Head of Advanced Analytics at ING]]></itunes:summary>
  <content:encoded><![CDATA[A sneak peek at our upcoming podcast about Apache Airflow.

Featured in this clip (in order of appearance):
Pete DeJoy - Product Specialist at Astronomer
Patrick Atwater - Water Data Projects Manager at ARGO Labs
Maksime Pecherskiy - Chief Data Officer of the City of San Diego
Bolke de Bruin - Head of Advanced Analytics at ING]]></content:encoded>
  <itunes:subtitle><![CDATA[A sneak peek at our upcoming podcast about Apache Airflow.

Featured in this clip (in order of appearance):
Pete DeJoy - Product Specialist at Astronomer
Patrick Atwater - Water Data Projects Manager at ARGO Labs
Maksime Pecherskiy - Chief Data Off...]]></itunes:subtitle>
  <itunes:explicit>false</itunes:explicit>
</item>
</channel>
</rss>