Episodes
Extra episode about Llama 3.
Published 04/19/24
Hey guys, continuing the series of episodes about PEFT, in this episode I talk about inference optimization techniques for LLMs. I talk about layer pruning, where we prune consecutive layers of the LLM with almost no loss in model performance. I also talk about Mixture of Depths, a technique similar to Mixture of Experts, where a router chooses which tokens will be processed in which layer of the LLM. Paper MoD: https://arxiv.org/pdf/2404.02258.pdf Paper layer...
Published 04/18/24
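To make the Mixture of Depths routing from the episode above a bit more concrete, here is a minimal sketch in plain Python. It is a toy under stated assumptions, not the paper's implementation: `mod_layer`, `toy_block`, `router_w` and `capacity` are hypothetical names, the router is a simple dot product, and the block acts on each token independently (a real Transformer block would also run attention among the selected tokens).

```python
from typing import Callable, List

Vector = List[float]


def mod_layer(
    tokens: List[Vector],
    router_w: Vector,
    block: Callable[[Vector], Vector],
    capacity: int,
) -> List[Vector]:
    """Send only the `capacity` highest-scoring tokens through `block`."""
    # Router: one scalar score per token (dot product with a weight vector).
    scores = [sum(x * w for x, w in zip(tok, router_w)) for tok in tokens]
    # Indices of the top-`capacity` tokens by router score.
    ranked = sorted(range(len(tokens)), key=lambda i: scores[i], reverse=True)
    selected = set(ranked[:capacity])

    out = []
    for i, tok in enumerate(tokens):
        if i in selected:
            # Selected token: residual update scaled by its router score.
            update = block(tok)
            out.append([x + scores[i] * u for x, u in zip(tok, update)])
        else:
            # Skipped token: flows through the residual path unchanged.
            out.append(list(tok))
    return out


def toy_block(tok: Vector) -> Vector:
    """Stand-in for an expensive Transformer block (assumption: doubles values)."""
    return [2.0 * x for x in tok]


tokens = [[1.0, 0.0], [0.0, 1.0], [3.0, 0.0], [0.5, 0.5]]
router_w = [1.0, 0.0]  # hypothetical learned router weights
result = mod_layer(tokens, router_w, toy_block, capacity=2)
print(result)
```

With `capacity=2`, only the two tokens with the highest router scores pay the cost of the block, which is where the compute saving comes from.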
Published 04/18/24
Hey guys, this is the first episode in a series of episodes about PEFT, Parameter Efficient Fine Tuning. In this episode I talk about LoRA and QLoRA, two widely used methods that allow us to fine-tune LLMs much faster and on a single GPU without losing performance. Video about QLoRA: https://www.youtube.com/watch?v=6l8GZDPbFn8 LoRA paper: https://arxiv.org/pdf/2106.09685.pdf QLoRA paper: https://arxiv.org/pdf/2305.14314.pdf Instagram of the podcast:...
Published 04/11/24
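As a rough illustration of the LoRA idea from the episode above, the sketch below keeps a weight matrix `W` frozen and adds a trainable low-rank update `B @ A`, scaled by `alpha / r`. It is a plain-Python toy, not the API of any library: `matvec`, `lora_forward` and `trainable_params` are hypothetical helper names, and the matrix sizes are chosen only for illustration. QLoRA additionally stores the frozen base weights in 4-bit precision, which is not shown here.

```python
from typing import List

Matrix = List[List[float]]
Vector = List[float]


def matvec(M: Matrix, v: Vector) -> Vector:
    """Multiply matrix M (rows x cols) by vector v (cols)."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]


def lora_forward(x: Vector, W: Matrix, A: Matrix, B: Matrix, alpha: float) -> Vector:
    """Compute W x + (alpha / r) * B (A x), where r is the LoRA rank."""
    r = len(A)                       # A has shape r x d_in
    base = matvec(W, x)              # frozen pretrained path
    delta = matvec(B, matvec(A, x))  # trainable low-rank path, B is d_out x r
    scale = alpha / r
    return [b + scale * d for b, d in zip(base, delta)]


def trainable_params(d_out: int, d_in: int, r: int) -> int:
    """Number of trainable values in A (r x d_in) plus B (d_out x r)."""
    return r * (d_in + d_out)


W = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]  # frozen weights
A = [[1.0, 1.0, 1.0]]                                    # rank r = 1
B_zero = [[0.0], [0.0], [0.0]]   # B starts at zero, so the update is a no-op
B_trained = [[1.0], [0.0], [0.0]]
x = [1.0, 2.0, 3.0]

print(lora_forward(x, W, A, B_zero, alpha=2.0))
print(lora_forward(x, W, A, B_trained, alpha=2.0))
print(trainable_params(4096, 4096, 8), "vs", 4096 * 4096)
```

The last line shows why this is parameter efficient: for a hypothetical 4096 x 4096 layer at rank 8, the adapter has 65,536 trainable values against 16,777,216 in the full matrix.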
Hello, in this episode I talk about Retrieval Aware Fine Tuning (RAFT), a paper that proposes a new technique combining domain-specific fine-tuning and RAG to improve the retrieval capabilities of LLMs. In the episode I also talk about another paper that is also called RAFT, but this time it stands for Reward rAnking Fine Tuning, which proposes a new technique to perform RLHF without the convergence problems of Reinforcement Learning. Retrieval Aware Fine Tuning: https://arxiv.org/abs/2403.10131v1 Reward...
Published 03/21/24
Hello guys, in this episode I explain how we can scale the context window of an LLM to more than 1M tokens using Ring Attention. In the episode, I also discuss whether RAG is dead given these advancements in the context window. Paper Lost in the Middle: https://arxiv.org/pdf/2307.03172.pdf Gemini technical report: https://storage.googleapis.com/deepmind-media/gemini/gemini_v1_5_report.pdf Paper Ring Attention: https://arxiv.org/pdf/2310.01889.pdf Instagram of the podcast:...
Published 03/07/24
Hey guys, in the Brazilian version of the podcast I interviewed Andre, an AI expert at IBM, and we talked a lot about how to solve problems using AI. Brains website: https://brains.dev/ Andre's Linkedin: https://www.linkedin.com/in/andrefelipelopes/ Brains' Linkedin: https://www.linkedin.com/company/brains-brazilian-ai-networks/ Instagram of the podcast: https://www.instagram.com/podcast.lifewithai Linkedin of the podcast: https://www.linkedin.com/company/life-with-ai
Published 02/22/24
Hey guys, in this episode I talk about Mixture of Experts, more specifically about Mixtral, which today is the best open-source LLM available and is also better than ChatGPT 3.5 and Gemini Pro. Mixtral paper: https://arxiv.org/pdf/2401.04088.pdf Mixtral model: https://huggingface.co/mistralai/Mixtral-8x7B-v0.1 Mixtral YouTube: https://www.youtube.com/watch?v=mwO6v4BlgZQ Instagram: https://www.instagram.com/podcast.lifewithai Linkedin: https://www.linkedin.com/company/life-with-ai
Published 01/18/24
Hey guys, in this episode I have Sergei as a guest and we talked a lot about NLP and Named Entity Recognition (NER). Sergei and his colleagues at NuMind have the current state-of-the-art model for NER and we discussed it at length during the episode. English model: https://huggingface.co/numind/generic-entity_recognition_NER-v1 Multi-language model: https://huggingface.co/numind/generic-entity_recognition_NER-multilingual-v1 Sergei's...
Published 12/07/23
Hey guys, in this episode I explain most of what I know about Transformers. I talk about the architecture, the attention formula, encoder, decoder, self-supervised learning, positional encoding, tokenization, inductive bias, Vision Transformers, receptive fields... It was the most technical episode I've recorded so far, and I hope you like it! By the way, it is worth listening to this episode alongside the Transformers paper. Paper Transformers: https://arxiv.org/pdf/1706.03762.pdf Link of...
Published 11/30/23
Hey guys, in this episode I explain RAG (Retrieval Augmented Generation) and the concept of agents executing different tasks. Hope you like it! Instagram: https://www.instagram.com/podcast.lifewithai/   Linkedin: https://www.linkedin.com/company/life-with-ai
Published 11/16/23
Hey guys, in the Brazilian version of the podcast I interviewed Daniel, CTO of WeClever, a company that uses AI to improve the chatbot experience. In the episode we talked about fine-tuning ChatGPT, LoRA, RAG and more! WeClever: https://www.linkedin.com/company/wecleverco/ Daniel's Linkedin: https://www.linkedin.com/in/dmerlimorais/ Instagram: https://www.instagram.com/podcast.lifewithai/ Linkedin: https://www.linkedin.com/company/life-with-ai
Published 11/02/23
Hey guys, this episode was really great! I have tips for both technical and behavioral interviews. In the technical interview part, I talk about the topics that always come up in interviews and are always good to know! I also talk about the biggest technical mistakes that people make in interviews and explain them. In the behavioral part, I talk a little about the interviewer's point of view and what is important to know! Instagram: https://www.instagram.com/podcast.lifewithai/ Linkedin:...
Published 08/31/23
Hey guys, in this episode I talk about how AI algorithms are trained using supervised and self-supervised learning, how text tokenization works, how ChatGPT was trained, and I also talk about document intelligence. This was a technically heavy episode and I hope you enjoy it! Instagram: https://www.instagram.com/podcast.lifewithai/ Linkedin: https://www.linkedin.com/company/life-with-ai
Published 08/24/23
Hey guys, in this episode I talk about how to better use ChatGPT. In the episode I talk about Chain of Thought, zero shot, few shot and more! Instagram: https://www.instagram.com/podcast.lifewithai/  Linkedin: https://www.linkedin.com/company/life-with-ai
Published 08/04/23
Hello guys, in this episode I go through the leaked document about GPT-4, explaining the different points of the architecture, training, inference and dataset. It's a very nice, intriguing and slightly technical episode! Instagram: https://www.instagram.com/podcast.lifewithai/ Linkedin: https://www.linkedin.com/company/life-with-ai
Published 07/27/23
Hello everyone, in the Brazilian version of the podcast I interviewed Juliemar from Amicci and we discussed private label products, marketplaces and how to leverage the OpenAI API to build their own products. They have a lot of very nice applications of AI at Amicci and Juliemar talked more about them during the episode. Juliemar's Linkedin: https://www.linkedin.com/in/juliemarberri/ Amicci's website (only in Portuguese): https://amicci.com.br/ Podcast LinkedIn:...
Published 06/22/23
Hi guys, in this episode of the Brazilian podcast my guest is Vinicius, CEO of MadeinWeb. In the episode we discussed the use of AI in agriculture and the power of the cloud for digital transformation. Vinicius' LinkedIn: https://www.linkedin.com/in/vgallafrio/ MadeinWeb LinkedIn: https://www.linkedin.com/company/madeinweb-mobile/ Podcast LinkedIn: https://www.linkedin.com/company/life-with-ai Podcast Instagram: https://www.instagram.com/podcast.lifewithai/
Published 04/20/23
Hi everyone, in the Portuguese version of the podcast my guest was Rafael Lanna, CRO of Ewally, a Brazilian fintech for B2B and B2C. In the episode we discussed at length how to provide a credit score for people who don't have a transaction history. Rafael's LinkedIn: https://www.linkedin.com/in/rafaellanna/ Ewally's website: https://www.ewally.com.br/ Podcast LinkedIn: https://www.linkedin.com/company/life-with-ai Podcast Instagram: https://www.instagram.com/podcast.lifewithai/
Published 03/30/23
In this episode I have Krish Ramineni, Co-Founder and CEO of Fireflies, a meeting recording tool that transcribes your meetings and has a search engine with AI superpowers that lets you keep track of what was discussed. Their engine allows you to perform different tasks like summarisation, keyword search, topic search... Fireflies will soon launch Fred, their ChatGPT-like algorithm that will be able to answer everything you want to know about your meetings. During the episode we discussed a lot...
Published 02/09/23
Hey guys, in this episode I have Eric Olson, Co-Founder and CEO of Consensus, an evidence-based search engine. In the episode we discussed at length the technical aspects of building a search engine, going through its different steps, like keyword matching, vector similarity search and also Large Language Models for Q&A. We also discussed his life as an entrepreneur, going from a technical position as a data scientist to a CEO position, his new challenges and also his day-to-day...
Published 01/19/23
Hello everyone, in this episode I explain the famous algorithm ChatGPT. ChatGPT is a chatbot developed by OpenAI that is able to answer almost any question. These can be open-ended questions, scientific questions or even coding questions. ChatGPT uses a model from the GPT-3 family as its backbone and is trained with supervised learning along with reinforcement learning using the PPO algorithm. ChatGPT: https://chat.openai.com Instagram: https://www.instagram.com/podcast.lifewithai/ Linkedin: https://www.linkedin.com/company/life-with-ai
Published 12/15/22
Hello everyone, in this episode I explain how tokenizers work. They are basically what enables us to feed text into an NLP algorithm like BERT or GPT. In the episode I explain 3 types of tokenizers: word-based, character-based and sub-word-based. Instagram: https://www.instagram.com/podcast.lifewithai/ Linkedin: https://www.linkedin.com/company/life-with-ai Hugging Face blog about tokenizers: https://huggingface.co/docs/transformers/tokenizer_summary
Published 12/01/22
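The three tokenizer families from the episode above can be contrasted with a small toy example. This is only a sketch under simplifying assumptions: the function names and the tiny vocabulary are made up for illustration, the sub-word splitter uses a greedy longest-match rule, and real tokenizers such as BPE or WordPiece learn their vocabulary from data and handle details like continuation markers that are left out here.

```python
from typing import List, Set


def word_tokenize(text: str) -> List[str]:
    """Word-based: split on whitespace, one token per word."""
    return text.split()


def char_tokenize(text: str) -> List[str]:
    """Character-based: one token per character."""
    return list(text)


def subword_tokenize(word: str, vocab: Set[str]) -> List[str]:
    """Sub-word-based: greedy longest-match split of one word into vocabulary pieces."""
    pieces = []
    start = 0
    while start < len(word):
        end = len(word)
        # Shrink the candidate piece until it is found in the vocabulary.
        while end > start and word[start:end] not in vocab:
            end -= 1
        if end == start:
            # No piece matches at this position: give up on the whole word.
            return ["[UNK]"]
        pieces.append(word[start:end])
        start = end
    return pieces


toy_vocab = {"token", "izer", "s", "un", "happy"}  # hypothetical vocabulary

print(word_tokenize("tokenizers are fun"))
print(char_tokenize("fun"))
print(subword_tokenize("tokenizers", toy_vocab))
print(subword_tokenize("unhappy", toy_vocab))
print(subword_tokenize("xyz", toy_vocab))
```

The trade-off shows up directly: word-based tokens need a huge vocabulary and fail on unseen words, character-based tokens give very long sequences, and sub-word pieces sit in between by reusing fragments like `token` and `izer`.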
Hey guys, in this episode of the podcast my guest was Guillaume, COO of Waalaxy. Waalaxy is a CRM focused on prospecting any type of client on LinkedIn. In the episode, besides explaining the product, we discussed possible use cases of AI in the product and how they managed to grow from zero to more than 6 million in Annual Recurring Revenue and to hire and retain talent, especially in the tech field. Link to get 2 months for free on Waalaxy: https://waal.ink/Zgh3CR Linkedin of Guillaume:...
Published 10/20/22
Hey guys, in this episode I talk about the document intelligence algorithms that we have at Qantev, so basically what I do in my job! We have two main document intelligence algorithms: information extraction, where we want to retrieve some specific information from a document, and table extraction, where we want to extract a table from a document into CSV format. Instagram of the podcast: https://www.instagram.com/podcast.lifewithai/ Linkedin of the podcast:...
Published 10/07/22