Life with AI - Listen - Life with AI

Life with AI

Listen now

More Episodes

See all »

#81- Llama 3.

Extra episode about Llama 3.

Published 04/19/24

#80- Layer pruning and Mixture of Depths.

Hey guys, continuing the series of episodes about PEFT, in this episode I talk about inference optimization techniques for LLMs. I talk about layer pruning, where we prune consecutive layers of the LLM without almost not losing model performance. I also talk about Mixture of Depths, a...

Published 04/18/24

#79- LoRA and QLoRA.

Hey guys, this is the first episode in a series of episodes about PEFT, Parameter Efficient Fine Tuning. In this episode I talk about LoRA and QLoRA, two widely used methods that allowed us to fine tune LLMs way faster and in a single GPU without losing performance. Video sobre...

Published 04/11/24