Launch, Monitor, and Share Data Pipelines In a Matter of Minutes
Listen now
Description
In this episode, we speak with Blake Burch, co-founder of Shipyard, a data orchestrator tool that allows you to create powerful workflows in a matter of minutes. Top 3 Value Bombs:  Data tests are often for the assumptions we already know. There's a lot of unknowns that can crop up and cause issues that tests are not catching. Start analyzing job metadata to alert on potential anomalies.Store your raw data to allow the most flexibility when it comes to re-transforming the data.Don’t settle for scatter shot troubleshooting. Have a clear lineage of how your data is being used from the source to the various consumers. 
More Episodes
In this episode we speak with Justin Borgman, Chairman & CEO at Starburst, which is based on open source Trino (formerly PrestoSQL) and was recently valued at $3.35 billion after securing their series D funding.  In this episode we discuss convergence of DW’s / DL's, why data lakes fail and...
Published 03/15/22
In this episode we speak with Paul Singman Developer Advocate at Treeverse / LakeFS. LakeFS is an open source project  that allows you to transform your object storage into a Git-like repository.  Top 3 takeaways LakeFS enables use cases like debugging to quickly view historical versions of your...
Published 03/01/22