Episodes
Developers hate monitoring, but we need it. We need it in many points of the software development lifecycle: before deprecating an API, before launching a new feature, after launching the feature, and more. In fact, monitoring needs can vary much more than the classic Ops monitoring. In this episode I’ll host Liran Haimovitch to discuss how to determine what developers should be monitoring, the difference between observability for Dev and for Ops, and how observability fits into our current...
Published 05/26/22
OpenSearch is a community-driven, open-source search and analytics suite derived from Apache 2.0 licensed Elasticsearch 7.10.2 & Kibana 7.10.2. The OpenSearch project started just over a year ago and is now the open-source alternative to ELK, which is no longer open source. The team has spent much of the last year getting the project going, but there was innovation as well. We will cover and discuss what OpenSearch has accomplished, but more importantly what’s coming next, including a big...
Published 04/28/22
Google’s SRE Book popularized the concept of Service Level Objective (SLO) and the SLO-driven approach. But what does it really mean to make SLO driven decisions? How can we generate observability and synchronize teams around joint SLOs? And how can we automate SLOs and integrate them into the software release pipeline? In this episode I’ll host Andreas Grabner. We’ll discuss the SRE practices, and how to automate SLO from dev all the way to prod. We’ll talk about the open source efforts to...
Published 03/30/22
What does it take to build observability in a web-scale company such as Slack, Pinterest and Twitter? On this episode of OpenObsevability Talks I'll host Suman Karumuri to hear how he built these systems from the ground up on these #BigTech co's, about his recent research papers and more. Suman Karumuri is a Sr. Staff Software Engineer and the tech lead for Observability at Slack. Suman Karumuri is an expert in distributed tracing and was a tech lead of Zipkin and a co-author of OpenTracing...
Published 02/27/22
SaaS (software as a service) is a popular model for many businesses today. SaaS businesses need agility to move fast and remain competitive. This means agility in the software IT stack, but also agility in the business models and product-led growth (PLG). Observability plays a key role in enabling SaaS organizations to move fast. Achieving this agility, however, raises specific observability requirements. On this episode of OpenObservability Talks we’ll host Aviad Mizrachi, the CTO and...
Published 01/31/22
We’ve grown to rely on “the three pillars” for observability - logs, metrics and traces. Popular frameworks such as Prometheus have helped popularize these practices. But now people are starting to realize that it’s not enough. On this episode Dotan Horovits will host Frederic Branczyk for a discussion about the unspoken pitfalls of Prometheus and the challenges of current observability coverage. We will also discuss the rise of Continuous Profiling as a new observability signal, what it’s...
Published 12/21/21
OpenObservability Talks S2E06: Hosting Steve McCanne   We hear a lot about BPF in the industry today, applying this flexible technology to solve so many problems from routing, proxying, and of course observability. Correlating events and data from the operating system level across distributed systems is a key problem for the industry and community to solve. I am thrilled to announce Steve McCanne joining us for this episode. I have been lucky enough to spend time with Steve in my career and...
Published 11/23/21
Have you ever wondered how services are operated at Google’s scale? Here’s your opportunity to find out. Ramón will share how his SRE team runs Google’s identity services, and the elaborate end-to-end observability they use to achieve it with strict SLA. We’ll also get a glimpse at the birthplace of Kubernetes, OpenCensus, Dapper, Monarch and other cornerstones of today’s cloud-native DevOps and observability. Ramón Medrano Llamas (@rmedranollamas) is a staff site reliability engineer at...
Published 10/27/21
Observability is becoming a common practice for DevOps teams monitoring and troubleshooting IT systems. But Observability can offer much more than that. More advanced usage of telemetry, and in particular distributed tracing and its context propagation mechanism, can uncover insights into your business performance and can help solve business and FinOps problems. On this episode of OpenObservability Talks we’ll host Yuri Shkuro, a champion of Distributed Tracing, to discuss how tracing and...
Published 09/19/21
In this episode, we’ll talk with industry veteran and product manager Anurag Gupta who has been working in open source observability for over 4 years. We will go into depth on his background, and how he views the ecosystem of open source. Then we will dig into the Fluentd and Fluent Bit projects and discuss some of the amazing innovations coming from this project. Learn what’s next for logging, and how a consolidated data collection plane is being driven by the Fluentd project.
Published 08/26/21
The CNCF has a rich suite to address monitoring Kubernetes and cloud-native workloads. First of which is Prometheus, which is widely adopted, with great out-of-the-box compatibility with Kubernetes. But under the CNCF you can also find OpenMetrics that offers standardization of the metrics format, Thanos and Cortex which offer long-term storage for Prometheus, and other complimentary solutions and integrations.    On this episode of OpenObservability Talks we’ll host “RichiH” Hartmann and...
Published 07/25/21
Current observability practice is largely based on manual instrumentation, which creates a barrier to entry for many wishing to implement observability in their environment. This is especially true in Kubernetes environments and microservices architecture. eBPF (extended Berkeley Packet Filter) is an exciting new technology for Linux kernel level instrumentation, which bears the promise of no-code instrumentation and easier observability into Kubernetes environments (alongside other benefits...
Published 06/30/21
OpenSearch project was born out of the passion for Elasticsearch and Kibana and the desire to keep them open source in the face of Elastic’s decision to close-source them. After a couple of months of hard work led by AWS, the Beta release was announced earlier this month under Apache2 license. On this episode of OpenObservability Talks we hosted Kyle Davis, Senior Developer Advocate for OpenSearch at AWS. We talked about how OpenSearch came to be, what it took to fork Elasticsearch and...
Published 05/27/21
We are thrilled to have Juraci Kröhling a Software Engineer at Red Hat; CNCF, Maintainer for Jaeger, and OpenTelemetry. He will be live and in-person this month on the podcast in a discussion with Jonah Kowall who is the CTO at logz.io and contributor to Jaeger, OpenTelemetry, and OpenSearch.
Published 04/30/21
Join Jonah Kowall and Bartek Plotka for a discussion on the latest happening topics on open source observability. Bartek works on many projects in open source and is Principal Software Engineer at Red Hat; CNCF SIG Observability Tech Lead. He is very active in the community as one of the leaders of Prometheus, Thanos, OpenMetrics, and many other projects.
Published 03/29/21
The ninth of our OpenObservability Talks has Jujhar Singh, Global DevSecOps Practice Lead at The Economist. How much observability is enough? What is the investment required to achieve it? How can we drive observability in the company in a measured and pragmatic way?  This was first streamed at https://www.twitch.tv/openobservability on February 25th 2021 and the video is available at https://www.youtube.com/channel/UCLKOtaBdQAJVRJqhJDuOlPg The live recording of the OpenObservability talks...
Published 02/28/21
The eighth of our OpenObservability Talks has Tomer Levy, CEO & Founder of Logz.io. The community is in turmoil around Elastic's announced plan to take Elasticsearch and Kibana off open source. In this episode, both Dotan and Mike have the pleasure of hosting Tomer where we discuss the recent news of Elastic moving Elasticsearch and Kibana to a dual non-OSS license - SSPL and Elastic License - and the implications that have on the open source community around it, including plans to fork...
Published 01/28/21
The seventh of our OpenObservability Talks has Andy Thurai, Senior Analyst at GigaOM. Jonah Kowall, CTO and Andy Thurai will talk about Andy's career and journey as a vendor and an analyst. We will discuss the observability market along with APM and other aspects of monitoring. We will then dive into the open-source ecosystem and how this is changing vendor thinking. Of course, we'll also be discussing OpenTelemetry! Andy Thurai is the Founder & Principal at the FieldCTO providing...
Published 12/31/20
The sixth of our OpenObservability Talks has Avishai Ish-Shalom, Developer Advocate at ScyllaDB. We trust our metrics to show us the status of our system and where it misbehaves. But do our metrics show us what really happened? You'd be surprised how often it's not the case.  On this episode we discussed the math behind metrics, some common misconceptions, what it take to have accurate metrics, and if there even is such a thing. Avishai Ish-Shalom has served as Engineer in Residence in...
Published 11/29/20
The fifth of our OpenObservability Talks has Michael Haberman, CTO & Co-Founder of Aspecto. There's a lot of discussion on how to achieve observability in microservices. Most of the discussion revolves around production workflows such as granular monitoring or debugging of complex environments. A much less discussed aspect is the development stage, before even reaching production. In this talk we will discuss the challenges of microservices environments, and how to leverage tracing for...
Published 10/31/20
The fourth of our OpenObservability Talks has Albert Teoh from the Jaeger project. Distributed tracing has been gaining momentum with the growing popularity of microservices. Jaeger is a popular open source tool originally developed at Uber and now part of the CNCF (Cloud Native Computing Foundation). Albert had been working at Uber for the past 2.5 years, where he got hands-on with Jaeger. On this talk Albert will share with us his experience with distributed tracing, from introducing it...
Published 09/30/20
Episode 3 of OpenObservability Talks, where we'll be hosting Matt Schallert of  Chronosphere. The topic for this episode is Long-Term Metrics with M3 and Prometheus. Prometheus is the de facto solution when getting started with metrics, supporting use cases where a single node can store the entire dataset. When it comes to scaling out workloads or storing long-term data, Prometheus offers the Remote Write and Read interfaces to integrate with other storage solutions.  In this talk, Matt...
Published 08/27/20
Episode 2 of OpenObservability Talks, where we'll be hosting Paul Bruce of  Neotys and Jonah Kowall of Logz.io. Our topic of discussion today is OpenTelemetry and the thriving community around it. This was first streamed at https://www.twitch.tv/openobservability on July 30th and the full video is available at https://www.youtube.com/watch?v=OtxYgT-mopY If you have a talk or a subject you'd like to talk about send it here https://forms.gle/pTVwDMCP1fK32tAF9 Socials: Twitter:...
Published 07/30/20
Welcome to the first-ever OpenObservability Talks! This was first streamed at https://www.twitch.tv/openobservability on June 25th and the full video is available at https://www.youtube.com/watch?v=Otb57MNZfKA You'll get an introduction to the series, a talk by Jack Neely of 42 Lines on Five Tips on Application Monitoring with Prometheus, and a run-through of current events in the world of Open Source Observability and the things that affect it. If you'd like to submit a session for a...
Published 06/29/20