tidymodels

Data Science in the Energy Industry | Frank Hull | Data Science Hangout

To join future data science hangouts, add it to your calendar here: https://pos.it/dsh - All are welcome! We’d love to see you!

We were recently joined by Frank Hull, Director of Data Science and Analytics at ACES, to chat about forecasting energy demand and prices, managing over a thousand data models, full-stack data science, and advanced machine learning techniques for time series analysis.

In this Hangout, we explore the necessity of managing a vast number of data models in the energy industry. Frank’s team at ACES oversees over a thousand models (nearly 2,000, actually!), a staggering number explained by the complexity and fragmentation of the wholesale energy market. The United States is divided into various Independent System Operators (ISOs), each possessing unique regulations and diverse resource mixes. Each of ACES’s 40+ portfolios can operate in different geographical areas within an ISO, presenting distinct challenges that necessitate individual modeling. These models are used to simulate a wide range of time horizons, from the next hour or day-ahead market to long-term financial planning and infrastructure decisions spanning 25 years. This intricate modeling helps in understanding hourly price shapes, demand patterns, supply mixes, and evaluating the effectiveness of new energy generators or hedging strategies, all with the goal of lowering variable costs for cooperatives and mitigating critical risks like blackouts during peak demand.

Resources mentioned in the video and zoom chat: Tidymodels → https://www.tidymodels.org/ Orbital Project → https://orbital.tidymodels.org/ U.S. Energy Information Administration (EIA) Open Data → https://www.eia.gov/opendata/ Kuzco R Package → https://posit.co/blog/kuzco-computer-vision-with-llms-in-r/

If you didn’t join live, one great discussion you missed from the zoom chat was about handling imbalanced binary classification models. Participants discussed why techniques like SMOTE might not perform well in production with real-world data, shared experiences with alternative methods such as standard up/downsampling, and highlighted challenges in maintaining prediction accuracy in deployment despite strong training results. Let us know below if you’d like to hear more about this topic!

► Subscribe to Our Channel Here: https://bit.ly/2TzgcOu

Follow Us Here: Website: https://www.posit.co Hangout: https://pos.it/dsh LinkedIn: https://www.linkedin.com/company/posit-software Bluesky: https://bsky.app/profile/posit.co

Thanks for hanging out with us!

Timestamps: 00:00 Introduction 04:05 “What’s ISO?” 08:20 “What are your go to models for analysis in the energy field?” 10:48 “Do you tend to use traditional stochastic models for time series analysis or more of the recent ML methods?” 13:30 “What is a full stack data scientist? What’s the overlap between a full stack data scientist and something like an ML engineer or a data engineer?” 18:38 “Is there a specific data science skill set that’s needed to get into energy analysis?” 19:59 “What is the portfolio model?” 23:36 “How have you found convincing regulators and other stats oriented stakeholders to trust and believe your AI fancy machine learning models that they can’t really dive in and and prove to themselves that that’s being statistically valid? Or have you found some good ways to demonstrate that?” 26:50 “Are there any good examples of open data in energy?” 27:54 “How are you keeping on top of the documentation for all of these models? Over a thousand models is a lot. Is there any learning you could share from that experience to help other people keep on top of their documentation?” 30:33 “How would you suggest handling missing data in time series forecasting?” 33:10 “Do you see long term electricity prices decreasing in the next twenty five years due to the abundance of renewables like wind and solar in lower population areas?” 35:14 “Do you have any career advice?” 36:50 “How do you see data science evolving within the energy industry?” 38:39 “How do you keep up to date on new packages?”

Simon Couch: Fair machine learning

Simon Couch Fair machine learning Cascadia R Conf 2024 Regular talk, 10:25-10:40

In recent years, high-profile analyses have called attention to many contexts where the use of machine learning deepened inequities in our communities. A machine learning model resulted in wealthy homeowners being taxed at a significantly lower rate than poorer homeowners; a model used in criminal sentencing disproportionately predicted black defendants would commit a crime in the future compared to white defendants; a recruiting and hiring model penalized feminine-coded words—like the names of historically women’s colleges—when evaluating résumés. In late 2022, a group of Posit employees across teams, roles, and technical backgrounds formed a reading group to engage with literature on machine learning fairness, a research field that aims to define what it means for a statistical model to act unfairly and take measures to address that unfairness. We then designed functionality and resources to help data scientists measure and critique the ways in which the machine learning models they’ve built might disparately impact people affected by that model. This talk will introduce the research field of machine learning fairness and demonstrate a fairness-oriented analysis of a model with tidymodels, a framework for machine learning in R.

Pronouns: he/him Chicago, IL Simon Couch is a software engineer at Posit PBC (formerly RStudio) where he works on open source statistical software. With an academic background in statistics and sociology, Simon believes that principled tooling has a profound impact on our ability to think rigorously about data. He authors and maintains a number of R packages and blogs about the process at simonpcouch.com

Simon Couch

How to train, evaluate, and deploy a machine learning workflow with tidymodels & Posit Team

Helpful resources: Github: https://github.com/simonpcouch/mutagen Follow-up Q&A Session: https://youtube.com/live/vwBVOBQfc_U If you want to book a call with our team to chat more about Posit products: pos.it/chat-with-us Don’t want to meet, but curious who else on your team is using Posit? pos.it/connect-us Blog post on tidymodels + Posit Connect: https://posit.co/blog/pharmaceutical-machine-learning-with-tidymodels-and-posit-connect/ Tidy Modeling with R book: https://www.tmwr.org/

Timestamps: 1:44 - Three steps for developing a machine learning model 3:35 - What is a machine learning model? 7:02 - Overview of machine learning with Posit Team 7:36: Step 1: Understand and clean data 11:05 - Step 2: Train and evaluate models (why you might be interested using tidymodels) 23:02 - Step 3: Deploying a machine learning model from Posit Workbench to Posit Connect 30:14 - Summary 31:21 - Helpful resources

Machine learning models are all around us, from Netflix movie recommendations to Zillow property value estimates to email spam filters.

As these models play an increasingly large role in our personal and professional lives, understanding and embracing them has never been more important; machine learning helps us make better, data-driven decisions.

The tidymodels framework is a powerful set of tools for building—and getting value out of—machine learning models with R.

Data scientists use tidymodels to:

Gain access to a wide variety of machine learning methods
Guard against common mistakes
Easily deploy models through tidymodels’ integration with vetiver

Join Simon Couch from the tidyverse team on Wednesday, October 25th at 11am ET as he walks through an end-to-end machine learning workflow with Posit Team.

No registration is required to attend - simply add it to your calendar using this link: pos.it/team-demo

Simon Couch

posit::conf(2023) Workshop: Advanced tidymodels

Register now: http://pos.it/conf Instructor: Max Kuhn, Software Engineer, Posit Workshop Duration: 1-Day Workshop

This workshop is for you if you: • have used tidymodels packages like recipes, rsample, and parsnip • are comfortable with tidyverse syntax (e.g. piping, mutates, pivoting) • have some experience with resampling and modeling (e.g., linear regression, random forests, etc.), but we don’t expect you to be an expert in these

In this workshop, you will learn more about model optimization using the tune and finetune packages, including racing and iterative methods. You’ll be able to do more sophisticated feature engineering with recipes. Time permitting, model ensembles via stacking will be introduced. This course is focused on the analysis of tabular data and does not include deep learning methods.

Participants who have completed the “Introduction to tidymodels” workshop will be well-prepared for this course. Participants who are new to tidymodels will benefit from taking the Introduction to tidymodels workshop before joining this one

Max Kuhn

finetune parsnip rsample tidymodels tidyverse Rstudio Data Science Machine Learning Python Stats Tidyverse Data Visualization Data Viz Ggplot Technology Coding Connect Server Pro Shiny RMarkdown Package Manager CRAN Interoperability Serious Data Science Dplyr Forcats Ggplot2 Tibble Readr Stringr Tidyr Purrr Github Data Wrangling Tidy Data Odbc Rayshader Plumber Blogdown Gt Lazy Evaluation Tidymodels Statistics Debugging Programming Education Rstats Open Source OSS Reticulate

posit::conf(2023) Workshop: Introduction to tidymodels

Register now: http://pos.it/conf Instructors: Hannah Frick, Simon Couch, Emil Hvitfeldt Workshop Duration: 1-Day Workshop

This workshop is for you if you: • have intermediate R knowledge, experience with tidyverse packages, and either of the R pipes • can read data into R, transform and reshape data, and make a wide variety of graphs • have had some exposure to basic statistical concepts such as linear models, random forests, etc.

Intermediate or expert familiarity with modeling or machine learning is not required.

This workshop will teach you core tidymodels packages and their uses: data splitting/resampling with rsample, model fitting with parsnip, measuring model performance with yardstick, and basic pre-processing with recipes. Time permitting, you’ll be introduced to model optimization using the tune package. You’ll learn tidymodels syntax as well as the process of predictive modeling for tabular data

Emil Hvitfeldt, Hannah Frick, Simon Couch

parsnip rsample tidymodels tidyverse yardstick Rstudio Data Science Machine Learning Python Stats Tidyverse Data Visualization Data Viz Ggplot Technology Coding Connect Server Pro Shiny RMarkdown Package Manager CRAN Interoperability Serious Data Science Dplyr Forcats Ggplot2 Tibble Readr Stringr Tidyr Purrr Github Data Wrangling Tidy Data Odbc Rayshader Plumber Blogdown Gt Lazy Evaluation Tidymodels Statistics Debugging Programming Education Rstats Open Source OSS Reticulate

Data Science Hangout | Michael Chow, Posit | Exploring Team Structure w/ Data Scientists & Engineers

We were joined by Michael Chow, Data Scientist and Software Engineer at RStudio. Michael also previously led a team at the California Integrated Travel Project.

On this week’s hangout there were a lot of thoughts shared on structuring a data science team from both Michael and the broader group:

⬢ Jacqueline Nolis also shared thoughts on this on a data science hangout that there were virtues to different ones, but ended up sold on the decentralized model where data scientists are embedded in teams: https://youtu.be/CcPE29bYGVo?t=325

⬢ Michael agreed that data scientists and analysts should be sitting with the teams that they’re pushing out reports for. Otherwise, I would be trying to send people into those teams to figure out their priorities.

⬢ A data scientist should work with a Project Manager or whoever’s leading the team to push up metrics but also help change the roadmap.

⬢ It leaves a tricky question of where data engineers should be and how they should interact with the team. Today data engineers are often doing more tooling empowerment, so it can be okay to have them a bit more centralized and connect to the data scientists to enforce best practices or enable new pieces for them.

⬢ I think a nice model is for data scientists/analysts to live in the teams and data engineers to be like spokes of a wheel where then the data scientists connect with them and work closely to enforce better best practice and enable new important things.

⬢ Tatsu shared that in thinking of the structure, it’s also important to find your translators and to use the power of feedback. Reach out to those people to start to put that feedback into action.

⬢ George shared that insurance companies have come from a really traditional landscape where they have lots of actuaries working on lots of excel spreadsheets and there can be a lack of knowledge sharing and tool sharing. This is where the data science element comes in. To me, within the organization, you need to have this team which is a mini-spoke if you will, because they are central to the actuarial team. If they are too far removed and they’re back with the IT team, you end up with the old problems because they may not get the business concept communicated back. It’s all about getting enough skills, so they can get stuff done, especially proof of concepts. Maybe after that you can take a step back and then start to look at the centralized model again.

⬢ A central team can help converge to what they see as best practice, but if you’re pushing out something new, exploring a new line of work or area it can be important to set the data engineer there to actually do whatever they need to. Make sure that the converging doesn’t stifle creativity or prevent a team from doing the right thing.

⬢ Manny jumped in to share the perspective from data science being with IT as well, data science is a new field for their company (in real estate) and there’s an identity of where does data science fall. The IT team is fantastic and they’re very structured. Data science is so fluid and creative and non structured at the moment, so you kind of have to look at where it actually should fall.

please note that some of the points above are summarized and not 100% actual quotes.

Resources shared:

⬢ Tatsu shared in the chat, a few projects that Michael is working on: vetiver: https://vetiver.tidymodels.org/articles/vetiver.html , siuba: https://github.com/machow/siuba ⬢ Libby shared a helpful tip on creating a 2 minutes YouTube video with a cover letter, to get the attention of a hiring manager ⬢ Javier shared an example Shiny app used in an interview: https://javierorraca.shinyapps.io/Bloomreach_Shiny_App/ ⬢ Michael mentioned David Robinson’s screencasts: https://www.youtube.com/channel/UCeiiqmVK07qhY-wvg3IZiZQ ⬢ Michael mentioned an article on “What data scientists really do according to 35 data scientists”: https://hbr.org/2018/08/what-data-scientists-really-do-according-to-35-data-scientists ⬢ Rachael shared a blog post link where Jacqueline Nolis talked about team structure as well: https://www.rstudio.com/blog/building-effective-data-science-team-answering-your-questions/#Structure

► Subscribe to Our Channel Here: https://bit.ly/2TzgcOu ► Add the Data Science Hangout to your calendar: rstd.io/datasciencehangout ► View the Data Science Hangout site here: rstudio.com/data-science-hangout

Follow Us Here: Website: https://www.rstudio.com LinkedIn:https://www.linkedin.com/company/rstudio-pbc Twitter: https://twitter.com/rstudio

Michael Chow

Simon Couch | tidymodels/stacks: A Grammar for Stacked Ensemble Modeling | RStudio

Full title: tidymodels/stacks, Or, In Preparation for Pesto: A Grammar for Stacked Ensemble Modeling

Through a community survey conducted over the summer, the RStudio tidymodels team learned that users felt the #1 priority for future development in the tidymodels package ecosystem should be ensembling, a statistical modeling technique involving the synthesis of multiple learning algorithms to improve predictive performance. This December, we were delighted to announce the initial release of stacks, a package for tidymodels-aligned ensembling. A particularly statistically-involved pesto recipe will help us get a sense for how the package works and how it advances the tidymodels package ecosystem as a whole.

About Simon: Simon Couch is an R developer and statistics student at Reed College, where he is entering the final semester of his undergraduate degree. He co-authors and maintains R packages including broom, infer, and stacks, leads trainings and workshops as an RStudio-certified tidyverse trainer, and researches in algorithmic data privacy. He interned on the RStudio tidymodels team in summer 2020, and is currently applying to doctoral programs in statistics

Simon Couch

infer rstudio tidymodels tidyverse Rstudio Data Science Machine Learning Python Stats Tidyverse Data Visualization Data Viz Ggplot Technology Coding Connect Server Pro Shiny RMarkdown Package Manager CRAN Interoperability Serious Data Science Dplyr Ggplot2 Tibble Readr Stringr Tidyr Purrr Github Data Wrangling Tidy Data Odbc Rayshader Plumber Blogdown Gt Lazy Evaluation Tidymodels Statistics Debugging Programming Education Forcats Rstats Open Source OSS Reticulate Simon Couch Stacks

Max Kuhn | What’s new in tidymodels? | RStudio

tidymodels is a collection of packages for modeling using a tidy interface. In the last year there have been numerous improvements and extensions. This talk gives an overview of additional tuning methods, new extension packages for models and recipes, and other features.

About Max: Max Kuhn is a software engineer at RStudio. He is currently working on improving R’s modeling capabilities. He was a Director of Nonclinical Statistics at Pfizer Global R&D in Connecticut. He was applying models in the pharmaceutical and diagnostic industries for over 18 years. Max has a Ph.D. in Biostatistics. Max is the author of numerous R packages for techniques in machine learning and reproducible research and is an Associate Editor for the Journal of Statistical Software. He, and Kjell Johnson, wrote the book Applied Predictive Modeling, which won the Ziegel award from the American Statistical Association, which recognizes the best book reviewed in Technometrics in 2015. Their latest book, Feature Engineering and Selection, was published in 2019

Max Kuhn

rstudio tidymodels Rstudio Data Science Machine Learning Python Stats Tidyverse Data Visualization Data Viz Ggplot Technology Coding Connect Server Pro Shiny RMarkdown Package Manager CRAN Interoperability Serious Data Science Dplyr Ggplot2 Tibble Readr Stringr Tidyr Purrr Github Data Wrangling Tidy Data Odbc Rayshader Plumber Blogdown Gt Lazy Evaluation Tidymodels Statistics Debugging Programming Education Forcats Rstats Open Source OSS Reticulate Max Kuhn

Andrew Tran | The Opioid Files: Turning big pharmacy data over to the public | RStudio (2021)

Through a community survey conducted over the summer, the RStudio tidymodels team learned that users felt the #1 priority for future development in the tidymodels package ecosystem should be ensembling, a statistical modeling technique involving the synthesis of multiple learning algorithms to improve predictive performance. This December, we were delighted to announce the initial release of stacks, a package for tidymodels-aligned ensembling. A particularly statistically-involved pesto recipe will help us get a sense for how the package works and how it advances the tidymodels package ecosystem as a whole.

About Andrew: Andrew is a data reporter on the rapid-response investigative team at The Washington Post who has analyzed how covid-19 has disproportionately impacted certain communities, the spread of opioids across the country, and the rise of right-wing violence. He shared in winning the Pulitzer Prize for Investigative Reporting in 2018. He’s an advocate for open data and reproducibility in journalism

rstudio tidymodels Rstudio Data Science Machine Learning Python Stats Tidyverse Data Visualization Data Viz Ggplot Technology Coding Connect Server Pro Shiny RMarkdown Package Manager CRAN Interoperability Serious Data Science Dplyr Ggplot2 Tibble Readr Stringr Tidyr Purrr Github Data Wrangling Tidy Data Odbc Rayshader Plumber Blogdown Gt Lazy Evaluation Tidymodels Statistics Debugging Programming Education Forcats Rstats Open Source OSS Reticulate Andrew Tran Wapo

Grant Fleming | Fairness and Data Science: Failures, Factors, and Futures | RStudio

In recent years, numerous highly publicized failures in data science have made evident that biases or issues of fairness in training data can sneak into, and be magnified by, our models, leading to harmful, incorrect predictions being made once the models are deployed into the real world. But what actually constitutes an unfiar or biased model, and how can we diagnose and address these issues within our own work? In this talk, I will present a framework for better understanding how issues of fairness overlap with data science as well as how we can improve our modeling pipelines to make them more interpretable, reproducible, and fair to the groups that they are intended to serve. We will explore this new framework together through an analysis of ProPublica’s COMPAS recidivism dataset using the tidymodels, drake, and iml packages.

About Grant: Grant Fleming is a Data Scientist at Elder Research, co-author of the Wiley book Responsible Data Science (2021), and contributor to the O’Reilly book 97 Things About Ethics Everyone in Data Science Should Know. His professional focus is on machine learning for social science applications, model explainability, and building tools for reproducible data science. Previously, Grant was a research contractor for USAID

rstudio tidymodels Rstudio Data Science Machine Learning Python Stats Tidyverse Data Visualization Data Viz Ggplot Technology Coding Connect Server Pro Shiny RMarkdown Package Manager CRAN Interoperability Serious Data Science Dplyr Ggplot2 Tibble Readr Stringr Tidyr Purrr Github Data Wrangling Tidy Data Odbc Rayshader Plumber Blogdown Gt Lazy Evaluation Tidymodels Statistics Debugging Programming Education Forcats Rstats Open Source OSS Reticulate Ethics Propublica Tech

Max Kuhn | Total Tidy Tuning Techniques | RStudio (2020)

Many models have structural parameters that cannot be directly estimated from the data. These tuning parameters can have a significant effect on model performance and require some mechanism for finding reasonable values. The tune and workflow packages enable tidymodels users to optimize these parameters using a variety of efficient grid search methods as well as with iterative search techniques (such as Bayesian optimization)

Max Kuhn

rstudio tidymodels Rstudio::conf(2020) Max Kuhn Rstudio Data Science Machine Learning Python Stats Tidyverse Data Visualization Data Viz Ggplot Technology Coding Connect Server Pro Shiny RMarkdown Package Manager CRAN Interoperability Serious Data Science Dplyr Forcats Ggplot2 Tibble Readr Stringr Tidyr Purrr Github Data Wrangling Tidy Data Odbc Rayshader Plumber Blogdown Gt Lazy Evaluation Tidymodels Statistics Debugging Programming Education Rstats Open Source OSS Reticulate

tidymodels

Contributors#

Max Kuhn

Julia Silge

Emil Hvitfeldt

Hannah Frick

Jeroen Ooms

Gábor Csárdi

Simon Couch

Resources featuring tidymodels#

Strategic Budget Optimization through Marketing Mix Modeling (MMM)

Simon Couch - Practical AI for data science

Sparsity support in tidymodels, faster and less memory hungry models - Emil Hvitfeldt

Precision Medicine for All: Using Tidymodels to Validate PRS in Brazil (Flávia Rius) | posit::conf

The Power of Snowflake and Posit Workbench (Jonathan Regenstein, Snowflake) | posit::conf(2025)

Data Science in the Energy Industry | Frank Hull | Data Science Hangout

Deploying Scikit-learn models for in-database scoring with Snowflake and Posit Team

Easier data and asset sharing across projects and teams with {pins} and Databricks

Standardizing a safety model with tidymodels, Posit Team & Databricks at Suffolk Construction

The Power of Snowflake and Posit Workbench: Macroeconomic Data Exploration in the Cloud

Wes McKinney & Hadley Wickham (on cross-language collaboration, Positron, career beginnings, & more)

Tidymodel prediction workflows inside databases with orbital and Snowflake

Simon Couch - From hours to minutes: Accelerating your tidymodels code

Brendan Graham - A Machine Learning Approach to Protect Patients from Blood Tube Mix-Ups

Emil Hvitfeldt - Tidypredict with recipes, turn workflow to SQL, spark, duckdb and beyond

Hannah Frick - tidymodels for time-to-event data

Max Kuhn - Evaluating Time-to-Event Models is Hard

Simon Couch - Fair machine learning

Tidymodels: Now Also for Time-to-Event Data! - Hannah Frick

Simon Couch: Fair machine learning

Making Better Error Messages with Rlang and Cli - Emil Hvitfeldt

Live Q&A following Workflow Demo - June 26th

Predicting Lending Rates with Databricks, tidymodels, and Posit Team

Conformal Inference with Tidymodels - posit::conf(2023)

Presented at Posit Conference, between Sept 19-20 2023, Learn more at posit.co/conference.#

Making a (Python) Web App is easy! - posit::conf(2023)

Presented at Posit Conference, between Sept 19-20 2023, Learn more at posit.co/conference.#

Open Source Property Assessment: Tidymodels to Allocate $16B in Property Taxes - posit::conf(2023)

Presented at Posit Conference, between Sept 19-20 2023, Learn more at posit.co/conference.#

tidymodels: Adventures in Rewriting a Modeling Pipeline - posit::conf(2023)

Presented at Posit Conference, between Sept 19-20 2023, Learn more at posit.co/conference.#

webR 0.2: R Packages and Shiny for WebAssembly | George Stagg | Posit

Bite-sized tricks for machine learning with tidymodels | Posit

How to train, evaluate, and deploy a machine learning workflow with tidymodels & Posit Team

Workflow Demo Q&A - Oct 25th

Charla Plenaria: Max Kuhn

Emil Hvitfeldt - Slidecraft: The Art of Creating Pretty Presentations

3 Reasons to Use Tidymodels with Julia Silge

posit::conf(2023) Workshop: Advanced tidymodels

posit::conf(2023) Workshop: Introduction to tidymodels

Hannah Frick - “Censored: A tidymodels package for survival models”

Aaron R. Williams | The tidysynthesis R package | RStudio (2022)

Emil Hvitfeldt | tidyclust - expanding tidymodels to clustering | RStudio (2022)

Hannah Frick | Censored - Survival Analysis in Tidymodels | Posit (2022)

Isabel Zimmerman | Demystifying MLOps | Posit (2022)

Julia Silge & Max Kuhn | Good Practices for Applied Machine Learning | Posit (2022)

Kelly Bodwin | Translating from {tidymodels} and scikit-learn: Lessons from a ‘bilingual’ course

Data Science Hangout | Michael Chow, Posit | Exploring Team Structure w/ Data Scientists & Engineers

Julia Silge | Monitoring Model Performance | RStudio

Simon Couch | tidymodels/stacks: A Grammar for Stacked Ensemble Modeling | RStudio

Max Kuhn | What’s new in tidymodels? | RStudio

Andrew Tran | The Opioid Files: Turning big pharmacy data over to the public | RStudio (2021)

Grant Fleming | Fairness and Data Science: Failures, Factors, and Futures | RStudio

Max Kuhn | Total Tidy Tuning Techniques | RStudio (2020)

Max Kuhn | parsnip A tidy model interface | RStudio (2019)

Posts about tidymodels#

orbital 0.4.0

tidymodels & xgboost

tidypredict 1.0.0

Two New tidymodels Packages

Q3 2025 tidymodels digest

tune version 2.0.0

recipes 1.3.0

rsample 1.3.0

Improved sparsity support in tidymodels

Q1 2025 tidymodels digest

orbital 0.3.0

tidymodels Internship for 2025