AI Agents and Long Context Windows with Mark Huang - Listen -

AI Agents and Long Context Windows with Mark Huang

Listen now

Description

Today we have Mark Huang on the show. Mark has previously held roles in Data Science and ML at companies like Box and Splunk and is now the co-founder and chief architect of Gradient, an enterprise AI platform to build and deploy autonomous assistants. In our chat, we get into some of the stuff he’s seeing around autonomous AI agents and why people are so excited about that space. Mark and his team has also recently been working on a project to extend the Llama-3 context window. They were able to extend the model from 8K tokens all the way to 1 million through a technique called theta-scaling. He walks us through the details of this project and how longer context windows will impact the types of use cases we can serve with LLMs. Follow Mark: https://x.com/markatgradient Follow Sean: https://x.com/seanfalconer

More Episodes

See all »

Building + Evolving Sentry's Architecture and Funding Open Source with David Cramer

Today, we have David Cramer on the show. David is one of the co-founders of Sentry, an application monitoring tool that's one of the most widely-adopted tools for developers. Sentry does over 300,000 events per second on average, and there's a lot of fancy work to process these application...

Published 11/12/24

Software Huddle

Published 11/12/24

Deep Dive into Inference Optimization for LLMs with Philip Kiely

Today we have Philip Kiely from Baseten on the show. Baseten is a Series B startup focused on providing infrastructure for AI workloads. We go deep on Inference Optimization. We cover choosing a model, discuss the hype around Compound AI, choosing an Inference Engine, Optimization Techniques...

Published 11/05/24