Episode 31: Rethinking Data Science, Machine Learning, and AI
Listen now
Description
Hugo speaks with Vincent Warmerdam, a senior data professional and machine learning engineer at :probabl, the exclusive brand operator of scikit-learn. Vincent is known for challenging common assumptions and exploring innovative approaches in data science and machine learning. In this episode, they dive deep into rethinking established methods in data science, machine learning, and AI. We explore Vincent's principled approach to the field, including: The critical importance of exposing yourself to real-world problems before applying ML solutions Framing problems correctly and understanding the data generating process The power of visualization and human intuition in data analysis Questioning whether algorithms truly meet the actual problem at hand The value of simple, interpretable models and when to consider more complex approaches The importance of UI and user experience in data science tools Strategies for preventing algorithmic failures by rethinking evaluation metrics and data quality The potential and limitations of LLMs in the current data science landscape The benefits of open-source collaboration and knowledge sharing in the community Throughout the conversation, Vincent illustrates these principles with vivid, real-world examples from his extensive experience in the field. They also discuss Vincent's thoughts on the future of data science and his call to action for more knowledge sharing in the community through blogging and open dialogue. LINKS The livestream on YouTube Vincent's blog CalmCode scikit-lego Vincent's book Data Science Fiction (WIP) The Deon Checklist, an ethics checklist for data scientists Of oaths and checklists, by DJ Patil, Hilary Mason and Mike Loukides Vincent's Getting Started with NLP and spaCy Course course on Talk Python Vincent on twitter :probabl. on twitter Vincent's PyData Amsterdam Keynote "Natural Intelligence is All You Need [tm]" Vincent's PyData Amsterdam 2019 talk: The profession of solving (the wrong problem) Vanishing Gradients on Twitter Hugo on Twitter Check out and subcribe to our lu.ma calendar for upcoming livestreams!
More Episodes
Hugo speaks with Jason Liu, an independent AI consultant with experience at Meta and Stitch Fix. At Stitch Fix, Jason developed impactful AI systems, like a $50 million product similarity search and the widely adopted Flight recommendation framework. Now, he helps startups and enterprises design...
Published 11/04/24
Published 11/04/24
Hugo speaks with three leading figures from the world of AI research: Sander Schulhoff, a recent University of Maryland graduate and lead contributor to the Learn Prompting initiative; Philip Resnik, professor at the University of Maryland, known for his pioneering work in computational...
Published 10/08/24