Description
Join us for this episode of the broadcast, where we bring back Ryan Blue, the creator of Iceberg, to discuss the latest happenings in the Iceberg community. We also discuss and demo a bunch of new features that have come out in the Trino Iceberg connector. Joining us as a new guest is Tabular Developer Advocate Sam Redai, who sheds light on this incredible community as well!
Since the first episodes, Iceberg has finalized the v2 spec and added a lot of new features along the way. Likewise, we've improved Trino's write capabilities for Iceberg, so much so that you can use Trino as the sole query engine atop Iceberg to support your data lake. We'll talk about all of this and more, so don't miss it!
- Intro Music: 0:00
- Intro: 0:32
- Releases: 6:27
- Concept of the episode: What is Iceberg?: 11:27
- Concept of the episode: Why Iceberg over other formats?: 16:50
- Concept of the episode: Metadata catalogs: 35:40
- Concept of the episode: Branching, tagging, and auditing, oh my!: 43:54
- Concept of the episode: The Puffin format: 50:53
- Concept of the episode: Trino Iceberg connector updates: 1:01:38
- Pull request of the episode: PR 13111: Scale table writers per task based on throughput: 1:11:37
- Demo of the episode: DML operations on Iceberg using Trino: 1:15:31
Show Notes: https://trino.io/episodes/40.html
Show Page: https://trino.io/broadcast/