All episodes of Data Science at Home

Episodes

AI’s Impact on Software Engineering: Killing Old Principles? [RB] (Ep. 229)

In this episode, we dive into the ways in which AI and machine learning are disrupting traditional software engineering principles. With the advent of automation and intelligent systems, developers are increasingly relying on algorithms to create efficient and effective code. However, this reliance on AI can come at a cost to the tried-and-true methods of software engineering. Join us as we explore the pros and cons of this paradigm shift and discuss what it means for the future of software...

Published 05/25/23

Warning! Mathematical Mayhem Ahead: Demystifying Liquid Time-Constant Networks (Ep. 228)

Hold on to your calculators and buckle up for a wild mathematical ride in this episode! Brace yourself as we dive into the fascinating realm of Liquid Time-Constant Networks (LTCs), where mathematical content reaches new heights of excitement. In this mind-bending adventure, we demystify the intricacies of LTCs, from complex equations to mind-boggling mathematical concepts, we break them down into digestible...

Published 05/16/23

Efficiently Retraining Language Models: How to Level Up Without Breaking the Bank (Ep. 227)

Get ready for an eye-opening episode! 🎙️ In our latest podcast episode, we dive deep into the world of LoRa (Low-Rank Adaptation) for large language models (LLMs). This groundbreaking technique is revolutionizing the way we approach language model training by leveraging low-rank approximations. Join us as we unravel the mysteries of LoRa and discover how it enables us to retrain LLMs with minimal expenditure of money and resources. We'll explore the ingenious strategies and practical methods...

Published 05/11/23

Revolutionize Your AI Game: How Running Large Language Models Locally Gives You an Unfair Advantage Over Big Tech Giants (Ep. 226)

This is the first episode about the latest trend in artificial intelligence that's shaking up the industry - running large language models locally on your machine. This new approach allows you to bypass the limitations and constraints of cloud-based models controlled by big tech companies, and take control of your own AI journey. We'll delve into the benefits of running models locally, such as increased speed, improved privacy and security, and greater customization and flexibility. We'll...

Published 05/03/23

Rust: A Journey to High-Performance and Confidence in Code at Amethix Technologies (Ep. 225)

The journey of porting our projects to Rust was intense, but it was a decision we made to improve the quality of our software. The migration was not an easy task, as it required a considerable amount of time and resources. However, it was worth the effort as we have seen significant improvements in code reusability, code cleanliness, and performance.In this episode I will tell you why you should consider taking that journey too.

Published 04/26/23

The Power of Graph Neural Networks: Understanding the Future of AI - Part 2/2 (Ep.224)

In this episode of our podcast, we dive deep into the fascinating world of Graph Neural Networks. First, we explore Hierarchical Networks, which allow for the efficient representation and analysis of complex graph structures by breaking them down into smaller, more manageable components. Next, we turn our attention to Generative Graph Models, which enable the creation of new graph structures that are similar to those in a given dataset. We discuss the inner workings of these models and their...

Published 04/18/23

The Power of Graph Neural Networks: Understanding the Future of AI - Part 1/2 (Ep.223)

In this episode, I explore the cutting-edge technology of graph neural networks (GNNs) and how they are revolutionizing the field of artificial intelligence. I break down the complex concepts behind GNNs and explain how they work by modeling the relationships between data points in a graph structure. I also delve into the various real-world applications of GNNs, from drug discovery to recommendation systems, and how they are outperforming traditional machine learning models. Join me and...

Published 04/11/23

Leveling Up AI: Reinforcement Learning with Human Feedback (Ep. 222)

In this episode, we dive into the not-so-secret sauce of ChatGPT, and what makes it a different model than its predecessors in the field of NLP and Large Language Models. We explore how human feedback can be used to speed up the learning process in reinforcement learning, making it more efficient and effective. Whether you're a machine learning practitioner, researcher, or simply curious about how machines learn, this episode will give you a fascinating glimpse into the world of reinforcement...

Published 04/04/23

The promise and pitfalls of GPT-4 (Ep. 221)

In this episode, we explore the potential of the highly anticipated GPT-4 language model and the challenges that come with its development. From its ability to generate highly coherent and creative text to concerns about ethical considerations and the potential misuse of such technology, we delve into the promise and pitfalls of GPT-4. Join us as we speak with experts in the field to gain insights into the latest developments and the impact that GPT-4 could have on the future of natural...

Published 03/30/23

AI’s Impact on Software Engineering: Killing Old Principles? (Ep. 220)

Published 03/14/23

Edge AI applications for military and space [RB] (Ep. 219)

Published 03/09/23

Prove It Without Revealing It: Exploring the Power of Zero-Knowledge Proofs in Data Science (Ep. 218)

In this episode, we dive into the fascinating world of zero-knowledge proofs and their impact on data science. Zero-knowledge proofs allow one party to prove to another that they know a secret without revealing the secret itself. This powerful concept has numerous applications in data science, from ensuring data privacy and security, to facilitating secure transactions and identity verification. We explore the mechanics of zero-knowledge proofs, its real-world applications, and how it is...

Published 02/27/23

Deep learning vs tabular models (Ep. 217)

Deep learning methods are not as effective with tabular data. Here is why, and what to do about it. Sponsors If you're ready to take your WiFi game to the next level, head over to asus.click/ZenWiFi_XD5 or check out the show notes for this episode. Trust me, with ASUS ZenWiFi XD5, you'll get the best WiFi experience ever! References https://paperswithcode.com/methods/category/deep-tabular-learning https://m-clark.github.io/posts/2022-04-01-more-dl-for-tabular/

Published 02/21/23

[RB] Online learning is better than batch, right? Wrong! (Ep. 216)

In this episode I speak about online learning systems and why blindly choosing such a paradigm can lead to very unpredictable and expensive outcomes.Also in this episode, I have to deal with an intruder :) Links Birman, K.; Joseph, T. (1987). "Exploiting virtual synchrony in distributed systems". Proceedings of the Eleventh ACM Symposium on Operating Systems Principles - SOSP '87. pp. 123–138. doi:10.1145/41457.37515. ISBN 089791242X. S2CID 7739589.

Published 02/15/23

Chatting with ChatGPT: Pros and Cons of Advanced Language AI (Ep. 215)

In this episode, I'll be discussing the capabilities and limitations of ChatGPT, an advanced language AI model. I'll go over its power to understand and respond to natural language, and its applications in tasks such as language translation and text summarization. However, I'll also touch on the challenges that still need to be overcome such as bias and data privacy concerns. Tune in for a comprehensive look at the current state of advanced language...

Published 01/26/23

Accelerating Perception Development with Synthetic Data (Ep. 214)

In this episode I am with Kevin McNamara, founder and CEO of Parallel Domain. We speak about a very effective method to generate synthetic data that is currently in production at Parallel Domain. Enjoy the show! References Parallel Domain Synthetic Data Improves Cyclist Detection (blog post): https://paralleldomain.com/parallel-domain-synthetic-data-improves-cyclist-detection/ Beating the State of the Art in Object Tracking with Synthetic...

Published 01/14/23

Edge AI applications for military and space [RB] (Ep. 213)

Our Sponsors NordPass Business has developed a password manager, that will save you a lot of time and energy whenever you need access to business accounts, work across devices, even with the other members of your team, or whenever you need to share sensitive data with your colleagues, or make payments efficiently. All this with the highest standard of cyber secure technology. See NordPass Business in action now with a 3-month free trial herehttps://nordpass.com/DATASCIENCE with code...

Published 12/13/22

From image to 3D model (Ep. 212)

Is it possible to reconstruct a 3D model from a simple image? Under certain constraints, it is! In this episode I tell you how. Our Sponsors Explore the Complex World of Regulations. Compliance can be overwhelming. Multiple frameworks. Overlapping requirements. Let Arctic Wolf be your guide.Check it out at https://arcticwolf.com/datascience Amethix works to create and maximize the impact of the world’s leading corporations and startups, so they can create a better future for everyone they...

Published 12/08/22

Machine learning is physics (Ep. 211)

What if we borrowed from physics some theories that would interpret deep learning and machine learning in general?Here is a list of plausible ways to interpret our beloved ML models and understand why they works, or they don't.Enjoy the show! Our Sponsors NordPass Business has developed a password manager, that will save you a lot of time and energy whenever you need access to business accounts, work across devices, even with the other members of your team, or whenever you need to share...

Published 12/02/22

Autonomous cars cannot drive. Here is why. (Ep. 210)

If you think that the problem of self-driving cars has been solved, think twice.As a matter of fact, the problem of self-driving cars cannot be solved with the technical solutions that companies are currently considering. Don't get fooled by marketing and PR on social media. Whoever is telling you they solved the problem of driving a vehicle fully autonomously, they are lying.Here is why. Our Sponsors Explore the Complex World of Regulations. Compliance can be overwhelming. Multiple...

Published 11/21/22

Evolution of data platforms (Ep. 209)

Let's look at the history of data platforms. How did they evolve? Why? Shall I switch to the latest architecture? Enjoy the show! Our Sponsors Explore the Complex World of Regulations. Compliance can be overwhelming. Multiple frameworks. Overlapping requirements. Let Arctic Wolf be your guide.Check it out at https://arcticwolf.com/datascience Amethix works to create and maximize the impact of the world’s leading corporations and startups, so they can create a better future for everyone...

Published 11/08/22

[RB] Is studying AI in academia a waste of time? (Ep. 208)

Companies and other business entities are actively involved in defining data products and applied research every year. Academia has always played a role in creating new methods and solutions/algorithms in the fields of machine learning and artificial intelligence. However, there is doubt about how powerful and effective such research efforts are. Is studying AI in academia a waste of time? Our Sponsors Ready to advance your career in data science? University of Cincinnati Online offers...

Published 11/02/22

Private machine learning done right (Ep. 207)

There are many solutions to private machine learning. I am pretty confident when I say that the one we are speaking in this episode is probably one of the most feasible and reliable. I am with Daniel Huynh, CEO of Mithril Security, a graduate from Ecole Polytechnique with a specialisation in AI and data science. He worked at Microsoft on Privacy Enhancing Technologies under the office of the CTO of Microsoft France. He has written articles on Homomorphic Encryptions with the CKKS explained...

Published 10/25/22

Edge AI for applications in military and space (Ep. 206)

Our Sponsors Ready to advance your career in data science? University of Cincinnati Online offers nationally recognized educational programs in business analytics and information systems. Predictive Analytics Today named UC as the No.1 MS Data Science school in the country and is nationally recognized with a proven track record of placing students at high-profile companies such as Google, Amazon and P&G. Discover more about the University of Cincinnati’s 100% online master’s degree...

Published 10/15/22

[RB] What are generalist agents and why they can change the AI game (Ep. 205)

That deep learning alone is not sufficient to solve artificial general intelligence, is more and more accepted statement. Generalist agents have great properties that can overcome some of the limitations of single-task deep learning models. Be aware, we are still far from AGI, though. So what are generalist agents? References https://arxiv.org/pdf/2205.06175

Published 10/05/22