Description
This week we’re talking to Lin Qiao, former PyTorch lead at Meta and current CEO of Fireworks AI. We discuss the evolution of AI frameworks, the challenges of optimizing inference for generative AI, the future of AI hardware, and open-source models. Lin shares insights on PyTorch design philosophy, how to achieve low latency, and the potential for AI to become as ubiquitous as electricity in our daily lives.
Chapters: 00:00 - Introduction and PyTorch Background04:28 - PyTorch's Success and Design Philosophy08:20 - Lessons from PyTorch and Transition to Fireworks AI14:52 - Challenges in Gen AI Application Development22:03 - Fireworks AI's Approach24:24 - Technical Deep Dive: How to Achieve Low Latency29:32 - Hardware Competition and Future Outlook31:21 - Open Source vs. Proprietary Models37:54 - Future of AI and Conclusion
I hope you enjoy the conversation and if you do, please subscribe!
--------------------------------------------------------------------------------------------------------------------------------------------------Humanloop is an Integrated Development Environment for Large Language Models. It enables product teams to develop LLM-based applications that are reliable and scalable. To find out more go to humanloop.com
In this episode, we explore how Replicate is breaking down barriers in AI development through its open-source platform. CEO Ben Firshman shares how Replicate enables developers without machine learning expertise to run AI models in the cloud.
00:00 Introduction 00:29 Overview of Replicate 03:13...
Published 11/13/24
How do you build AI tools that actually meet users’ needs? In this episode of High Agency, Raza speaks with Lorilyn McCue, the driving force behind Superhuman’s AI-powered features. Lorilyn lays out the principles that guide her team’s work, from continuous learning to prioritizing user feedback....
Published 11/07/24