003. Hadoop is Dead and Kylin is Not
Description
This episode was delayed due to the ongoing situation in Ukraine. Thank you for understanding.

Hot updates
- Pulsar is updated
- Apache Kylin, "Extreme OLAP Engine for Big Data"
  - Three main versions: 2.4, 3.0, and the most recent, 4.0.1
  - v4 was released in autumn 2021
  - Brings OLAP back to data
  - Been around since 2015, brought to you by eBay
  - Not a friend to HBase, but likes Parquet
  - Web interface for all data steps
  - Official Python client with pandas support
- Ambari is killed (put in the Attic)
- Apache Hop 1.1
  - https://www.leanwithdata.com/blog/2022/02/hop-1.1.0/
  - Graduated from the Incubator on January 18th
  - Apache Hop Sucks!
- Dolphin Scheduler

Lightning news
- Apache Arrow for Rust
- Apache Iceberg 0.13.0
- Hudi 0.10.1
- Apache HBase 2.4.9
- Apache SeaTunnel: easy-to-use, ultra-high-performance distributed data integration platform that supports real-time synchronization of massive data
- Apache ORC 1.6.13
- Apache Beam 2.35.0
- Apache Airflow 2.2.3

Discussion: DataSecOps
- OWASP
- Data debiasing
- Data anonymization

Dr. Igor Mosyagin
Data Engineer @ Klarna
Igor identifies himself as a pragmatic engineer with a strong academic background. A theoretical physicist by training, he eventually assumed he had enough PhDs and left academia to work with Data-* related things. As of 2022, Igor works as a Data Platform Engineer at Klarna. On top of that, he's a huge fan of cephalopods, math rock, and quantum mechanics. He also hates baked carrots so much he decided to mention it in this bio.

Pasha Finkelshteyn
Developer advocate @ JetBrains
With 14 years of experience in IT, Pasha has been through fire and water, from technical support to developer, team lead, and data engineer. Now Pasha works as a developer advocate for Data Engineering at JetBrains. He helps develop the Big Data Tools plugin and gives talks on Kotlin, various aspects of data engineering, and working with data. He is also the author and maintainer of the Kotlin API for Apache Spark.
More Episodes
Published 04/15/22
Hot updates
- dbt 1.0.0 released
  - dbt is gaining popularity
  - A great tool that solves a real, existing problem
- RedisJSON is out for public preview
  - https://redis.com/blog/redisjson-public-preview-performance-benchmarking/
  - Need to have Redis 6.x or later
  - probably a good point to talk once again...
Published 01/31/22
A few hot updates
- Apache Geode 1.12.5
  - enterprise edition is known as GemFire
  - geo-distributed storage
  - has native clients in Java, C#, and C++ (!)
  - JTA-compliant transaction support
- Pinot released 0.9.0
  - Added Segment Merge and Rollup
  - Rollup is a technique for tree-like groupby
  - example: city, streets,...
Published 12/25/21