Training Data Locality and Chain-of-Thought Reasoning in LLMs with Ben Prystawski
Description
Today we’re joined by Ben Prystawski, a PhD student in the Department of Psychology at Stanford University working at the intersection of cognitive science and machine learning. Our conversation centers on Ben’s recent paper, “Why think step by step? Reasoning emerges from the locality of experience,” which he presented at NeurIPS 2023. We begin by exploring basic questions about LLM reasoning, including whether it exists, how we can define it, and how techniques like chain-of-thought prompting appear to strengthen it. We then dig into the details of Ben’s paper, which investigates why thinking step by step is effective and demonstrates that local structure in the training data is the key property that enables it. The complete show notes for this episode can be found at twimlai.com/go/673.