Programming
Learn about data exploration tools in the commercial space.
Most of the work we have done in this class has been centered around manually building data analyses in Jupyter notebooks.
This is a wonderful way to do work when you are given freedom and the analysis may not follow a specific target output – when your focus is on getting very preliminary exploration underway.
As we learned in this course data cleaning is a significant task and requires care to get right.
There are a multitude of commercial tools that provide more structure and flexibility to pursue data cleaning and analyses which may be more appropriate, especially when the data is stored in Excel spreadsheets, the grandfather of tabular data formats .
You will be listening to a podcast about a product that makes cleaning and exploring data stroed in Excel sheets more efficient, especially for non-programmers, but also can be placed in a pipeline leading up to more complex analyses which might happen in the Jupyter (or other) space.”
This podcast is an episode from the IBMs Make Data Simple Podcast:
Aug. 3, 2022. Make Data Simple Podcast: “Of course you’re tired of spreadsheets. Data exploration is better. Ryan
Buick, Co-Founder of Canvas.”
You can listen to the podcast from these links:
Spotify
Player.FM
direct from player.fm download of MP3
Apple Podcast (DRM required)
Task: Listen to the podcast and answer the following 4 questions:
List 3 things you learned from this podcast and relate them to things you have experienced in this course.
What motivated the core Canvas company business, and what product space did it emerge from? Your answer should include mentioning data and management team concerns.
What problem does Canvas solve? include details from what Ryan mentions in his interview.
What or who are the target market for the Canvas project?
In two or three sentences relate what you learned about data analysis among non-data scientists with what you learned in the podcast. Be complete, but brief .