Jon Krohn
Resources tagged Jon Krohn#
885: Python Polars: The Definitive Guide — with Jeroen Janssens and Thijs Nieuwdorp
#Python #Polars #Pandas
Jeroen Janssens and Thijs Nieuwdorp are data frame library Polars’ greatest advocates in this episode with @JonKrohnLearns where they discuss their book, Python Polars: The Definitive Guide, best practice for using Polars, why Pandas users are switching to Polars for data frame operations in Python, and how the library reduces memory usage and compute time up to 10x more than Pandas. Listen to the episode to be a part of an O’Reilly giveaway!
This episode is brought to you by: • Trainium2, the latest AI chip from AWS: https://aws.amazon.com/ai/machine-learning/trainium/ • Adverity, the conversational analytics platform: https://eu1.hubs.ly/H0jxK210 • Dell AI Factory with NVIDIA: https://www.dell.com/superdatascience
Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
In this episode you will learn:
• (00:00:00) Introduction
• (00:04:46) Why Jeroen and Thijs wrote Python Polars: The Definitive Guide
• (00:18:18) Best practices in Polars
• (00:25:08) Why Polars has so many users
• (00:32:37) The benefits of the Great Tables package
• (00:50:05) Jeroen and Thijs’ partnership with NVIDIA and Dell for Python Polars: The Definitive Guide
Additional materials: https://www.superdatascience.com/885

Positron: An IDE Specialized For Data Science
Dr. Julia Silge, Engineering Manager at Posit, joins @JonKrohnLearns to introduce Positron, a fresh open-source IDE that’s perfect for exploratory data analysis and visualization. She also lays out her top picks for LLMs that boost coding efficiency and discusses when traditional NLP methods might be the smarter choice over LLMs. Plus, Julia highlights some must-know open-source libraries that make managing MLOps easier than ever. Tune in for insights that every data scientist, ML engineer, and developer will find useful.
Watch the full interview “817: The Positron IDE, Tidy NLP and MLOps — with Dr. Julia Silge” here: https://www.superdatascience.com/817

817: The Positron IDE, Tidy NLP and MLOps — with Dr. @JuliaSilge
#PositronIDE #Tidyverse #MLOps
Dr. Julia Silge, Engineering Manager at Posit, joins @JonKrohnLearns to introduce the brand-new Positron IDE, perfect for exploratory data analysis and visualization. She also lays out her top picks for LLMs that boost coding efficiency and discusses when traditional NLP methods might be the smarter choice over LLMs. Plus, Julia highlights some must-know open-source libraries that make managing MLOps easier than ever. Tune in for insights that every data scientist, ML engineer, and developer will find useful.
This episode is brought to you by Gurobi (https://www.gurobi.com/personas/optimization-for-data-scientists/) , the Decision Intelligence Leader, and by ODSC (https://odsc.com/california) , the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
In this episode you will learn: • [00:00:00] Introduction • [00:03:23] Overview of Posit and Positron IDE • [00:08:33] How the needs of a data scientist differ from those of a software developer • [00:17:56] How to contribute to the open-source Positron • [00:34:52] MLOps and Vetiver: Tools for deploying and maintaining ML models • [00:48:34] Natural Language Processing (NLP) and the Tidyverse approach • [01:22:18] The role of AI and LLMs in data science education
Additional materials: https://www.superdatascience.com/817

Hadley Wickham on R vs Python
Learn about tidyverse, ggplot2, and the secret to a tech company’s longevity as Hadley Wickham joins @JonKrohnLearns in this episode. He talks about Posit’s rebrand, why tidyverse needs to be in every data scientist’s toolkit, and why getting your hands dirty with open-source projects can be so lucrative for your career.
Watch the full interview “779: The Tidyverse of Essential R Libraries and their Python Analogues — with Dr. Hadley Wickham” here: https://www.superdatascience.com/779

779: The Tidyverse of Essential R Libraries and their Python Analogues — with Dr. Hadley Wickham
#Tidyverse #RProgramming #RLibraries
Tidyverse, ggplot2, and the secret to a tech company’s longevity: Hadley Wickham talks to @JonKrohnLearns about Posit’s rebrand, Tidyverse and why it needs to be in every data scientist’s toolkit, and why getting your hands dirty with open-source projects can be so lucrative for your career.
This episode is brought to you by Intel and HPE Ezmeral Software (https://bit.ly/hpeintel) . Interested in sponsoring a SuperDataScience Podcast episode? Visit https://passionfroot.me/superdatascience for sponsorship information.
In this episode you will learn: • [00:00:00] Introduction • [00:02:55] All about the Tidyverse • [00:15:19] Hadley’s favorite R libraries • [00:28:39] The goal of Posit • [00:34:12] On bringing multiple programming languages together • [00:50:19] The principles for a long-lasting tech company • [00:53:34] How Hadley developed ggplot2 • [01:03:52] How to contribute to the open-source community
Additional materials: https://www.superdatascience.com/779
