Description
Hugo speaks with Vincent Warmerdam, a senior data professional and machine learning engineer at :probabl, the exclusive brand operator of scikit-learn. Vincent is known for challenging common assumptions and exploring innovative approaches in data science and machine learning.
In this episode, they dive deep into rethinking established methods in data science, machine learning, and AI. We explore Vincent's principled approach to the field, including:
The critical importance of exposing yourself to real-world problems before applying ML solutions
Framing problems correctly and understanding the data generating process
The power of visualization and human intuition in data analysis
Questioning whether algorithms truly meet the actual problem at hand
The value of simple, interpretable models and when to consider more complex approaches
The importance of UI and user experience in data science tools
Strategies for preventing algorithmic failures by rethinking evaluation metrics and data quality
The potential and limitations of LLMs in the current data science landscape
The benefits of open-source collaboration and knowledge sharing in the community
Throughout the conversation, Vincent illustrates these principles with vivid, real-world examples from his extensive experience in the field. They also discuss Vincent's thoughts on the future of data science and his call to action for more knowledge sharing in the community through blogging and open dialogue.
LINKS
The livestream on YouTube
Vincent's blog
CalmCode
scikit-lego
Vincent's book Data Science Fiction (WIP)
The Deon Checklist, an ethics checklist for data scientists
Of oaths and checklists, by DJ Patil, Hilary Mason and Mike Loukides
Vincent's Getting Started with NLP and spaCy Course course on Talk Python
Vincent on twitter
:probabl. on twitter
Vincent's PyData Amsterdam Keynote "Natural Intelligence is All You Need [tm]"
Vincent's PyData Amsterdam 2019 talk: The profession of solving (the wrong problem)
Vanishing Gradients on Twitter
Hugo on Twitter
Check out and subcribe to our lu.ma calendar for upcoming livestreams!
Hugo speaks with Jason Liu, an independent AI consultant with experience at Meta and Stitch Fix. At Stitch Fix, Jason developed impactful AI systems, like a $50 million product similarity search and the widely adopted Flight recommendation framework. Now, he helps startups and enterprises design...
Published 11/04/24
Hugo speaks with three leading figures from the world of AI research: Sander Schulhoff, a recent University of Maryland graduate and lead contributor to the Learn Prompting initiative; Philip Resnik, professor at the University of Maryland, known for his pioneering work in computational...
Published 10/08/24