Dear Analyst #123: Telling data stories about rugby and the NBA with Ben Wylie
Listen now
Description
When you think of data journalism, you might think of The New York Times' nifty data visualizations and the Times' embrace of data literacy for all their journalists. Outside of The New York Times, I haven't met anyone who does data journalism and data storytelling full-time until I spoke with Ben Wylie. Ben is the lead financial journalist at a financial publication in London. Like many data analysts, he cut his teeth in Excel, got his equivalent of a CPA in the UK, and received his master's degree in journalism. In this episode, we discuss how his side passion (sports analytics) led him to pursue a career in data journalism and how he approaches building sports data visualizations. Playing with rugby data on lunch breaks When Ben worked for an accounting firm, he would pull rugby data during his lunch breaks and just analyze it for fun. One might say this started Ben's passion in data storytelling because he started a blog called The Chase Rubgy to share his findings. The blog was a labor of love, and at the end of 2019 he had only focused on rugby. After building an audience, he realized data journalism could be a promising career path so he did some freelance sports journalism at the end of his master's course. At the end of 2022, he started Plot the Ball (still a side project) where the tagline is "Using data to tell better stories about sport." Learning new data skills from writing a newsletter Ben spoke about how writing Plot the Ball forced him to learn new tools and techniques for cleaning and visualizing data. All the visualizations on the blog are done in R. A specific R package Ben uses to scrape data from websites is rvest. Through the blog, Ben learned how to scrape, import, and clean data before he even started doing any data visualizations. Sports data all came from Wikipedia. I've spoken before about how the best way to show an employer you want a job in analytics is to create a portfolio of your data explorations. Nothing is better than starting a blog where you can just showcase stuff you're interested in. How the NBA became a global sport One of my favorite posts from Plot the Ball is this post entitled Wide net. It's a short post but the visualization tells a captivating story on how the NBA became global over the last 30 years. Here's the main visualization from the post: Source: Plot the Ball Ben first published a post about NBA phenom Victor Wembanyama in June 2023 (see the post for another great visualization). Ben talks about this post being a good data exercise because there is no good NBA data in tabular form. This "waffle" chart was Ben's preferred visualization since it allows you to better see the change in the subgroups. A stacked bar chart would've been fine as well, but since each "row" of data represents a roster of 15 players, the individual squares abstracts the team composition each year. Home Nations closing the gap with Tri Nations in rugby
More Episodes
When you think of your data warehouse, the "semantic layer" may not be the first thing that pops in your mind. Prior to reading Frances O'Rafferty's blog post on this topic, I didn't even know this was a concept that mattered in the data stack. To be honest, the concept is still a bit confusing...
Published 09/10/24
Published 09/10/24
If you could only learn one programming language for the rest of your career, what would be it be? You could Google the most popular programming languages and just pick the one of the top 3 and off you go (FYI they are Python, C++, and C). Or, you could pick measly #10 and build a thriving career...
Published 08/05/24