Michael Chow

We want GREAT tables! | Richard Iannone & Michael Chow | Data Science Hangout

To join future data science hangouts, add it to your calendar here: https://pos.it/dsh - All are welcome! We’d love to see you!

We were recently joined by Rich Iannone and Michael Chow, software engineers at Posit, to chat about their experiences building GT and Great Tables, how they drive community engagement around their packages, and their career advice for package developers.

GT and Great Tables are packages for creating static tables in R and Python, respectively. They were created to fill the need for a well-maintained way to generate tables in different output formats from data frames. They’ve gained a lot of popularity and have an active community of users!

Resources mentioned in the video and zoom chat:
Great Tables Blog → https://posit-dev.github.io/great-tables/blog/
2024 Table Contest Winners → https://posit.co/blog/2024-table-contest-winners/
Contributing to Public Transit Data Analysis and Tooling → https://posit-dev.github.io/great-tables/blog/open-transit-tools/
Tables as Powerful Representational Tools → https://www.researchgate.net/publication/363345970_Tables_as_Powerful_Representational_Tools
The MockUp - 10+ Guidelines for Better Tables in R → https://themockup.blog/posts/2021-01-13-10-guidelines-for-better-tables-in-r/
Show Me the Numbers by Stephen Few → https://analyticspress.com/smtn.php
Excel spreadsheets to gt package called {forgts} → https://github.com/luisDVA/forgts
What They Forgot to Teach You About R → https://rstats.wtf/
David Robinson Talk called “The unreasonable effectiveness of public work” → https://www.youtube.com/watch?v=th79W4rv67g
Publication quality tables in 2024 → https://posit.co/blog/what-we-did-with-publication-quality-tables-in-2024/
Great Tables Design Philosophy → https://posit-dev.github.io/great-tables/blog/design-philosophy/
gtsummary-to-excel → https://www.pipinghotdata.com/posts/2024-07-26-gtsummary-to-excel/
happy git with R → https://happygitwithr.com/
freeCodeCamp - how to contribute to open source → https://github.com/freeCodeCamp/how-to-contribute-to-open-source/

If you didn’t join live, one great discussion you missed from the zoom chat was about how people manage their personal and work GitHub accounts, and whether to have one account for all their work or separate accounts for each employer. Let us know YOUR thoughts on this topic below!

► Subscribe to Our Channel Here: https://bit.ly/2TzgcOu

Follow Us Here: Website: https://www.posit.co Hangout: https://pos.it/dsh LinkedIn: https://www.linkedin.com/company/posit-software Bluesky: https://bsky.app/profile/posit.co

Thanks for hanging out with us!

#pythoncontent

Michael Chow, Rich Iannone

Build Captivating Display Tables in Python With Great Tables | Real Python Podcast #214

Do you need help making data tables in Python look interesting and attractive? How can you create beautiful display-ready tables as easily as charts and graphs in Python? This week on the show, we speak with Richard Iannone and Michael Chow from Posit about the Great Tables Python library.

Links from the show: https://realpython.com/podcasts/rpp/214/

Michael and Richard discuss the design philosophy and history behind creating display tables. We dig into the grammar of tables, the background of the project, and an ingenious way to build a collection of examples for a library.

We briefly cover how Richard and Michael started contributing to open source. We also discuss practicing data skills with challenges and resources like Tidy Tuesday.

This episode is sponsored by Mailtrap.

Topics:

00:00:00 – Introduction
00:02:00 – Michael’s background in open source
00:04:07 – Rich’s background in open source
00:05:27 – Advice for someone starting out
00:08:55 – What do you mean by the term “display” table
00:11:32 – What components were missing from other tables?
00:13:31 – Using examples to explain features
00:16:09 – Why was there an absence of this functionality in Python?
00:19:35 – A progressive approach and the grammar of tables
00:21:26 – Sponsor: Mailtrap
00:22:01 – The design philosophy of great tables
00:25:31 – Nanoplots, spark lines, and column spanners
00:27:06 – Building a gallery of examples
00:28:56 – Heat mapping cells and automatically adjusting text color
00:32:54 – Output formats for the tables
00:34:46 – Building in accessibility
00:36:55 – Dependencies
00:37:42 – What is the common workflow?
00:41:39 – Video Course Spotlight
00:43:15 – Adding graphics
00:46:41 – Using a table contest to get examples
00:49:47 – quartodoc and documenting the project
00:55:00 – Tidy Tuesday and data science community
01:00:29 – What are you excited about in the world of Python?
01:03:46 – What do you want to learn next?
01:08:05 – How can people follow the work you do online?
01:09:57 – Thanks and goodbye


Michael Chow

Wrangling data for a Shiny app in Python || Michael Chow || Posit

Shiny makes it easy to build interactive web applications with the power of Python’s data and scientific stack.

Learn more about Shiny for Python: https://shiny.rstudio.com/py/ Check out our interactive Shiny for Python examples: https://shinylive.io/py/examples/

Content: Michael Chow (@chowthedog) Producer: Jesse Mostipak (@kierisi) Editing and Motion Design: Tony Pelleriti (@TonyPelleriti)

Michael Chow


Data Science Hangout | Michael Chow, Posit | Exploring Team Structure w/ Data Scientists & Engineers

We were joined by Michael Chow, Data Scientist and Software Engineer at RStudio. Michael also previously led a team at the California Integrated Travel Project.

On this week’s hangout there were a lot of thoughts shared on structuring a data science team from both Michael and the broader group:

⬢ Jacqueline Nolis shared her thoughts on this in an earlier Data Science Hangout: she saw virtues in the different models, but ended up sold on the decentralized model, where data scientists are embedded in teams: https://youtu.be/CcPE29bYGVo?t=325

⬢ Michael agreed that data scientists and analysts should sit with the teams they’re producing reports for. Otherwise, you end up sending people into those teams to figure out their priorities.

⬢ A data scientist should work with a Project Manager or whoever’s leading the team to push up metrics but also help change the roadmap.

⬢ It leaves a tricky question of where data engineers should be and how they should interact with the team. Today data engineers are often doing more tooling empowerment, so it can be okay to have them a bit more centralized and connect to the data scientists to enforce best practices or enable new pieces for them.

⬢ I think a nice model is for data scientists/analysts to live in the teams and for data engineers to be like the spokes of a wheel: the data scientists connect with them and work closely with them to enforce best practices and enable important new things.

⬢ Tatsu shared that in thinking of the structure, it’s also important to find your translators and to use the power of feedback. Reach out to those people to start to put that feedback into action.

⬢ George shared that insurance companies have come from a really traditional landscape where they have lots of actuaries working on lots of Excel spreadsheets, and there can be a lack of knowledge sharing and tool sharing. This is where the data science element comes in. To me, within the organization, you need to have this team, which is a mini-spoke if you will, because they are central to the actuarial team. If they are too far removed and they’re back with the IT team, you end up with the old problems because they may not get the business concept communicated back. It’s all about getting enough skills so they can get stuff done, especially proofs of concept. Maybe after that you can take a step back and then start to look at the centralized model again.

⬢ A central team can help converge to what they see as best practice, but if you’re pushing out something new, exploring a new line of work or area it can be important to set the data engineer there to actually do whatever they need to. Make sure that the converging doesn’t stifle creativity or prevent a team from doing the right thing.

⬢ Manny jumped in to share the perspective of data science sitting with IT. Data science is a new field for their company (in real estate), and there’s a question of identity: where does data science fall? The IT team is fantastic and very structured. Data science is so fluid, creative, and unstructured at the moment that you have to look at where it should actually fall.

Please note that some of the points above are summarized and not 100% actual quotes.

Resources shared:

⬢ Tatsu shared in the chat a few projects that Michael is working on: vetiver: https://vetiver.tidymodels.org/articles/vetiver.html , siuba: https://github.com/machow/siuba
⬢ Libby shared a helpful tip on creating a 2-minute YouTube video with a cover letter to get the attention of a hiring manager
⬢ Javier shared an example Shiny app used in an interview: https://javierorraca.shinyapps.io/Bloomreach_Shiny_App/
⬢ Michael mentioned David Robinson’s screencasts: https://www.youtube.com/channel/UCeiiqmVK07qhY-wvg3IZiZQ
⬢ Michael mentioned an article on “What data scientists really do according to 35 data scientists”: https://hbr.org/2018/08/what-data-scientists-really-do-according-to-35-data-scientists
⬢ Rachael shared a blog post link where Jacqueline Nolis talked about team structure as well: https://www.rstudio.com/blog/building-effective-data-science-team-answering-your-questions/#Structure

► Subscribe to Our Channel Here: https://bit.ly/2TzgcOu
► Add the Data Science Hangout to your calendar: rstd.io/datasciencehangout
► View the Data Science Hangout site here: rstudio.com/data-science-hangout

Follow Us Here: Website: https://www.rstudio.com LinkedIn: https://www.linkedin.com/company/rstudio-pbc Twitter: https://twitter.com/rstudio

Michael Chow

Michael Chow | Bringing the Tidyverse to Python with Siuba | RStudio

Last January I left my job to spend a year developing siuba, a Python port of dplyr. At its core, this decision was driven by a decade of watching Python and R users produce similar analyses, but in very different ways.

In this talk, I’ll discuss 3 ways siuba enables R users to transfer their hard-earned programming knowledge to Python: (1) leveraging the power of dplyr syntax, (2) options to generate SQL code, and (3) working with the plotnine plotting library.

Looking back, I’ll consider two critical pieces that have helped me develop siuba: using it to livecode TidyTuesday analyses, and building an interactive tutorial for absolute beginners.

About Michael: Michael Chow is a data scientist and learning researcher. He serves as a co-director at Code for Philly. In past lives, he worked on adaptive assessment tools in ed tech, and received a PhD in cognitive psychology from Princeton University.

Michael Chow


Michael Chow

Software by Michael Chow

pointblank

Great Tables

plotnine

Shiny for Python

pins-python

py-shinyswatch

pins-r

vetiver-python

Events attended by Michael Chow

R/Pharma 2025

SciPy 2025

Posts and resources by Michael Chow

Recreating Septa Transit Timetables in Python

Sleeping Rats and Sociopathic Agents — with Phillip Cloud

Polars: The Blazing Fast Python Framework for Modern Clinical Trial Data Exploration

What even is dbt? An Analytics engineer explains | Laurie Merrell & Michael Chow | Data Science Lab

Michael Chow: From psychology and Python to constrained creativity

The Curse of Documentation (Michael Chow, Posit) | posit::conf(2025)

Michael Chow - User guides: engaging new users, delighting old ones | SciPy 2025

Visualizing Gas Prices | PydyTuesday Uncut #2

Adding Plots to Great Tables

Exploring Web APIs | PydyTuesday Uncut #1

The Test Set: A Posit Podcast Trailer

Overhauling Pointblank’s User Guide

Great Tables: Becoming the Polars .style Property

Great Tables 3: Data Color and Polishing

Great Tables 2: Introducing Units Notation

Great Tables 1: Structure, Format, and Style

Tables in Python with Great Tables

We want GREAT tables! | Richard Iannone & Michael Chow | Data Science Hangout

Contributing to Public Transit Data Analysis and Tooling

Great Tables v0.13.0: Applying styles to all table locations

So You Think You Can ANALYZE? (Data Content Creator Hackathon)

Talks - Michael Chow, Richard Iannone: Making Beautiful, Publication Quality Tables in Python…

Great Tables: Make beautiful, publication quality tables in Python | Rich Iannone & Michael Chow

Build Captivating Display Tables in Python With Great Tables | Real Python Podcast #214

PyCon 2024: Making Beautiful, Publication Quality Tables is Possible in 2024

Great Tables is now BYODF (Bring Your Own DataFrame)

The Design Philosophy of Great Tables

Using Polars to Win at Super Bowl Squares

Great Tables: The Polars DataFrame Styler of Your Dreams

Siuba and duckdb: Analyzing Everything Everywhere All at Once - posit::conf(2023)

Presented at Posit Conference, between Sept 19-20 2023. Learn more at posit.co/conference.

The accidental analytics engineer

Wrangling data for a Shiny app in Python || Michael Chow || Posit

Hey Shiny Team, what are some of your biggest learnings from 2022? || Shiny Developers || RStudio

Data Science Hangout | Michael Chow, Posit | Exploring Team Structure w/ Data Scientists & Engineers

Michael Chow | Bringing the Tidyverse to Python with Siuba | RStudio
