AI on your phone? Tim Dettmers on quantization of neural networks

AI on your phone? Tim Dettmers on quantization of neural networks — #41

Description

Tim Dettmers develops computationally efficient methods for deep learning. He is a leader in quantization: coarse graining of large neural networks to increase speed and reduce hardware requirements. Tim developed 4- and 8-bit quantizations enabling training and inference with large language models on affordable GPUs and CPUs, such as those commonly found in home gaming rigs.

Tim and Steve discuss: Tim's background and current research program, large language models, quantization and performance, democratization of AI technology, the open source Cambrian explosion in AI, and the future of AI.

0:00 Introduction and Tim's background
18:02 Tim's interest in the efficiency and accessibility of large language models
38:05 Inference, speed, and the potential for using consumer GPUs for running large language models
45:55 Model training and the benefits of quantization with QLoRA
57:14 The future of AI and large language models in the next 3-5 years and beyond

Tim's site: https://timdettmers.com/
Tim on GitHub: https://github.com/TimDettmers

Music used with permission from Blade Runner Blues Livestream improvisation by State Azure.

--

Steve Hsu is Professor of Theoretical Physics and of Computational Mathematics, Science, and Engineering at Michigan State University. Previously, he was Senior Vice President for Research and Innovation at MSU and Director of the Institute of Theoretical Science at the University of Oregon. Hsu is a startup founder (Superfocus.ai, SafeWeb, Genomic Prediction) and advisor to venture capital and other investment firms. He was educated at Caltech and Berkeley, was a Harvard Junior Fellow, and has held faculty positions at Yale, the University of Oregon, and MSU.

Please send any questions or suggestions to [email protected] or Steve on Twitter @hsu_steve.

More Episodes

Letter from Shanghai: Reflections on China in 2024 — #73

(00:00) - Overview: 3 weeks in China (02:33) - The China knowledge problem: Grappling with Reality (06:54) - Physics seminars in Shanghai and Beijing (15:54) - Chinese academia, challenges in scientific culture (22:43) - Yu Min: Two Bombs, One Satellite (27:02) - He Jiankui and gene editing, plus...

Published 11/21/24

Manifold

Letter from Beijing, with Han Feizi — #72

Han Feizi is the pseudonym of a columnist for Asia Times, who covers the Chinese economy, technology, and US-China competition. The author lives in Beijing, and has an extensive background in finance and investment banking. Han Feizi's articles for Asia Times:...

Published 11/07/24