Episodes
I'm sitting in the Amsterdam Airport (Schipol) and wrote some of my book on the flight over to Europe. In this episode, I'll talk briefly about my book writing process, and how it differs today from when I wrote Fundamentals of Data Engineering.
Published 05/03/24
Jarod Santo and Adam Stacoviak from The Changelog join me for 1.5 hours of free-flowing chats about planned obscelescene, old school vs new school consumer tech, the XZ Backdoor incident, the job market doldrums (plus tips for finding work and starting a biz), and being unemployable. Jarod and Adam are two of my favorite people to talk with, since we can literally chat about anything for hours. Enjoy! Changelog: https://changelog.com/
Published 05/02/24
Published 05/02/24
In today's Practical Data Modeling group discussion, we chatted about how to get buy-in for data modeling. The question was intentionally vague, because context is key. I give some thoughts on this topic, and how you can generalize this to most situations where you need to get buy-in. Practical Data Modeling: https://practicaldatamodeling.substack.com/
Published 04/26/24
Vishnu Vasanth (e6Data) and I chat about what's next for analytical query engines, shifting left, the Indian tech scene, and much more. Vishnu is very wise and has a very deep technical vision for where the industry needs to go. I very much agree with his vision. Enjoy! e6Data: https://www.e6data.com/ LinkedIn: https://www.linkedin.com/in/vishnu-vasanth-5329233/
Published 04/23/24
There's the interview you think you're going to have, then there's the interview you get. This is one of those, in the best way possible. I expected to chat about his time at Snowflake. We didn't even get past his early days building data warehouses because it was so fascinating. Did you know Kent is arguably one of the very first practitioners (probably an accidental inventor) of DataOps? This is sort of a "prequel" episode. Kent Graziano and I chat about his early days as a data...
Published 04/16/24
Sometimes I feel like the data world is stuck in a world of tabular data (rows and columns). This has been the data world for decades. Let's think bigger. We've moved beyond data fitting into lakes. With the capability of AI to unlock the power of unstructured data (audio, images, video), it's time to start thinking about data oceans...
Published 04/12/24
Keith Belanger is an OG data modeling practitioner, having been in the game for decades. We chat about a wide range of data modeling topics. What's changed and what's stayed the same? How to model data to fit the business's needs. Agile data modeling. When it works, when it doesn't. Data modeling for data mesh and decentralization. The art of data modeling How to teach conceptual data modeling to new practitioners Keith brings a wealth of experience and a practical, no-nonsense...
Published 04/10/24
This morning, the Practical Data Modeling Community held its first group discussion (to be posted very soon). People from all sorts of organizations (biggest companies in the world, universities, small companies) discussed how the approach analytical data modeling. My major takeaway - your mileage will vary. There's the ideal way of data modeling we're taught, and there's reality. Everyone's situation is different and there's no one-size-fits-all approach that will work for everyone. The...
Published 04/05/24
Kishore Aradhya and I both teach, and we agree this is a very difficult landscape to determine what and how to teach. Against the backdrop of generative AI, we discuss the role of universities in teaching tech and data, the role of a teacher, how to teach data, and much more. DSPY - https://github.com/stanfordnlp/dspy
Published 04/04/24
Toby Mao started his data tooling company, SQLMesh, in 2022, when investing in data tools was unfashionable. Yet, he's managed to get traction with SQLMesh and is on a mission to simplify data transformations and make data easier to work with. We also chat about experimentation best practices, which he learned at some of the biggest tech companies in the world. This is definitely a great episode if you're interested in startups, data tools, experiments, driving cars, and much more.
Published 04/03/24
Matt Housley hangs out at my house, and we have a random chat about all sorts of stuff - fads in data, data and ML engineering, tech hubs, and more. If you want a glimpse into the sorts of chats that Matt and I have all the time, here you go.
Published 04/02/24
There's an inverse relationship between the value you add and how much you need to tell people about it. If you're adding value, you'll know - you don't need to talk about it. You're doing it. Also, the same goes with "data." If you're putting "data" as the center of the conversation, you just lost the game.
Published 03/29/24
Angel Narciso and I hung out at LEAP Riyadh, alongside 215K attendees (wtf?). We chat about all sorts of stuff in the data world, including some blunt convos on the modern data stack and AI, among other things.
Published 03/28/24
I often get questions about how I write and advice on how one might go about becoming a "writer." In this episode, I talk a bit about my writing process and why you (yes you) should also write. This will be the first in a few episodes and blog posts where I talk about the writing and content creation process, as I get a ton of questions about this. Thanks for your questions and support!
Published 03/22/24
Jess Haberman and I chat about how to negotiate a book deal. She's been in publishing for ages and knows her stuff! Also, I wish I had this episode handy while I was shopping around Fundamentals of Data Engineering, because Jess agreed to publish my book while she was at O'Reilly ;) We also talk about how AI will change publishing.
Published 03/20/24
Had a great chat with Keith Belanger yesterday (podcast dropping soon) about how conceptual data modeling fell by the wayside. All too often, people seem focused on physical data modeling. This is a shame, because conceptual is the art and lifeblood of data modeling. As an industry, we need to learn to see (again).
Published 03/14/24
Sadie St. Lawrence chat about all sorts of stuff - mind and machines, community building, optimizing time, focus, and social capital, and much more. Women in Data: https://www.womenindata.org/ LinkedIn: https://www.linkedin.com/in/sadiestlawrence/
Published 03/12/24
Is it better to make a perfect or good enough data model? It depends…
Published 03/08/24
Christian Bourdeau and I chat about all things data careers - getting hired, getting fired, and finding your gig. We also chat about 75 Hard, lifting weights (we're bros), hackathons, teaching, and much more. LinkedIn: https://www.linkedin.com/in/christianbourdeau/
Published 03/07/24
Zach Zeus and I chat about trust architecture and how it can work to improve ESG impacts in supply chain. This is an incredibly important topic with massive global impact, cuz climate change. LinkedIn: https://www.linkedin.com/in/zachary-zeus/ Recommendation 49: https://unece.org/circular-economy/news/unece-support-scaling-transparency-sustainable-value-chains
Published 03/05/24
I'm chilling in Verbier, Switzerland at Skiers in Data (SKID). In this episode, I chat about the various types of debt - technical, data, and organizational debt.
Published 03/01/24
Annie Nelson and I chat about her path to data analytics, writing her new book, "How to Become a Data Analyst", bad career advice, rock climbing, and more. LinkedIn: https://www.linkedin.com/in/annie-nelson-analyst/ TikTok: https://www.tiktok.com/discover/annie-nelson-data-analytics Book: https://www.amazon.com/How-Become-Data-Analyst-Low-Cost/dp/1394202237
Published 02/29/24
Christophe Blefari and I chat about why teaching data engineering is so damn hard, how generative AI will change technology and data education, and more. Site: https://www.blef.fr/ LinkedIn: https://www.linkedin.com/in/christopheblefari
Published 02/27/24
Imagine you're dropped into the middle of a failed data project - no data team, no documentation, and other horrific things - and have to figure out a way to make it work. What would you do? Gordon Wong and I chat about various aspects of how we'd handle this scenario.
Published 02/23/24