Loading…
This event has ended. Visit the official site or create your own event on Sched.

Sign up or log in to bookmark your favorites and sync them to your phone or calendar.

Monday, July 25
 

7:30am EDT

Breakfast
See you in Maryland for breakfast.

Buffet & boxed lunch options. Outdoor seating available.

Workshops start at 9:00 and are a five-to-ten minute walk away.

Monday July 25, 2022 7:30am - 9:00am EDT
5. Maryland

9:00am EDT

Building Production-Quality Shiny Applications
This workshop is for the Shiny developer who has entered this stage of their application development journey. View full workshop details.

Speakers
avatar for Eric Nantz

Eric Nantz

Eric Nantz is a director within the statistical innovation center at Eli Lilly and Company, creating analytical pipelines and capabilities of advanced statistical methodologies for clinical design used in multiple phases of development. Outside of his day job, Eric is passionate about... Read More →


Monday July 25, 2022 9:00am - 10:30am EDT
National Harbor 2

9:00am EDT

Building Tidy Tools
This is a two-day, hands-on workshop for those who have embraced the tidyverse and want to build their own packages. View full workshop details.

Speakers
avatar for Ian Lyttle

Ian Lyttle

Ian Lyttle works as a Data Scientist at Schneider Electric. His technical interests include visualization, interactivity, and functional programming. He is a community contributor to tidyverse and r-lib, and maintains CRAN packages including vegawidget and boxr. He has delivered tutorials... Read More →
avatar for Emma Rand

Emma Rand

Associate Prof, University of York
Howdy! I am a Senior Lecturer (Associate Prof) in the Department of Biology at the University of York where I teach data science and reproducibility, particularly to those who do not see themselves as programmers. I lead a UKRI funded project called Cloud-SPAN which trains researchers... Read More →


Monday July 25, 2022 9:00am - 10:30am EDT
Chesapeake F

9:00am EDT

Causal Inference in R
In this workshop, we’ll teach the essential elements of answering causal questions in R through causal diagrams, and causal modeling techniques such as propensity scores and inverse probability weighting. View full workshop details.

Speakers
avatar for Malcolm Barrett

Malcolm Barrett

Malcolm Barrett is a data scientist and an epidemiologist. During his Ph.D., he studied vision loss, focusing on epidemiologic methods. He's since worked in the private sector, including Teladoc Health and Apple. Malcolm is also the author of several causal inference-focused R packages... Read More →
avatar for Lucy D'Agostino McGowan

Lucy D'Agostino McGowan

Lucy D’Agostino McGowan is an assistant professor in the Mathematics and Statistics Department at Wake Forest University. She received her PhD in Biostatistics from Vanderbilt University and completed her postdoctoral training at Johns Hopkins University Bloomberg School of Public... Read More →


Monday July 25, 2022 9:00am - 10:30am EDT
National Harbor 3

9:00am EDT

Clinical Reporting in R
In this workshop you will be working with the Tidyverse and the first generation of Pharmaverse tools in order to ‘deliver’ typical clinical trial displays. Specifically, we will walk through an end-to-end focussing on a sample of demographic, safety and efficacy summaries. View full workshop details.

Speakers
avatar for Christina Fillmore

Christina Fillmore

Christina Fillmore formally trained as a statistician, over her 5 years at GSK has transitioned into a data scientist. Her focus is on open source package development in a regulated environment. She also has a keen interesting in helping other develop R skills and become confident... Read More →
avatar for Andy Nicholls

Andy Nicholls

Head of Data Science, GlaxoSmithKline, United Kingdom
Andy Nicholls is Head of Statistical Data Sciences at GSK. Andy is responsible for building Data Science capability within GSK Biostatistics and delivering Data Science solutions across R&D. Within the wider pharmaceutical industry, Andy is the lead for the R Validation Hub, a collaboration... Read More →


Monday July 25, 2022 9:00am - 10:30am EDT
Chesapeake D

9:00am EDT

Designing the Data Science Classroom
The goal of this workshop is to equip educators with concrete information on content, workflows, and infrastructure for painlessly introducing modern computation with R and RStudio within a data science curriculum. View full workshop details.

Speakers
avatar for Mine Cetinkaya-Rundel

Mine Cetinkaya-Rundel

Professor + Developer Educator, RStudio + Duke University
Mine Çetinkaya-Rundel is Professor of the Practice at Duke University and Developer Educator at RStudio. Mine’s work focuses on innovation in statistics and data science pedagogy, with an emphasis on computing, reproducible research, student-centered learning, and open-source education... Read More →
avatar for Maria Tackett

Maria Tackett

Maria Tackett is an Assistant Professor of the Practice in the Department of Statistical Science at Duke University. Prior to joining the faculty at Duke, Maria earned a Ph.D. in Statistics from the University of Virginia and worked as a statistician at Capital One. Her current work... Read More →


Monday July 25, 2022 9:00am - 10:30am EDT
Maryland 3

9:00am EDT

From R Markdown to Quarto
This workshop is designed for those who want to take their R Markdown skills and expertise and apply them in Quarto, the next generation of R Markdown. View full workshop details.

Speakers
avatar for Andrew Bray

Andrew Bray

Andrew Bray is an Associate Teaching Professor in the Department of Statistics at UC Berkeley where he develops and teaches courses in statistics and data science. His research interests include statistical computing, data privacy, and applications of statistical models to environmental... Read More →


Monday July 25, 2022 9:00am - 10:30am EDT
National Harbor 12+13

9:00am EDT

Getting Started with Quarto
This workshop is designed for those who have no or little prior experience with R Markdown and who want to learn Quarto. View full workshop details.

Speakers
avatar for Tom Mock

Tom Mock

RStudio, PBC
Thomas is the Customer Enablement Lead at RStudio, helping RStudio’s customers be as successful as possible. He is deeply involved global data science community, sharing tips on #RStats Twitter (find him at @thomas_mock), as co-founder of #TidyTuesday, a weekly Data Science learning... Read More →


Monday July 25, 2022 9:00am - 10:30am EDT
Maryland 2

9:00am EDT

Getting Started with Shiny
Shiny is an R package that makes it easy to build interactive web apps straight from R. This workshop will start at the beginning. View full workshop details.

Speakers
avatar for Colin Rundel

Colin Rundel

Associate Professor, Duke University
Colin Rundel is an Associate Professor of the Practice at Duke University where he has been teaching since 2012. His work focuses on teaching statistical computing to both undergraduate and graduate students in both R and Python. He has been teaching and using Shiny since 2015.


Monday July 25, 2022 9:00am - 10:30am EDT
National Harbor 6+7

9:00am EDT

Graphic Design with ggplot2: How to Create Engaging and Complex Visualizations in R
The workshop covers the most important steps and helpful tips to create visually appealing, engaging and complex graphics with ggplot2. View full workshop details.

Speakers
avatar for Cédric Scherer

Cédric Scherer

Data Visualization Professional, Independent
Cédric Scherer is a graduated computational ecologist with a passion for design. After his PhD, he combined his expertise in analyzing and visualizing large data sets in R with his passion to become an independent data visualization designer and specialist. Cédric has created v... Read More →


Monday July 25, 2022 9:00am - 10:30am EDT
National Harbor 4+5

9:00am EDT

How Data Science with R Works for Systems Administrators
In this workshop, you'll learn to use the capabilities of RStudio Team to enable your organization's R and Python users, including topics like package and environment management, performance and scaling, external data connections, and integrating RStudio Team with CI/CD pipelines. View full workshop details.

Speakers
avatar for Alex Gold

Alex Gold

Solutions Engineering, RStudio
Alex is a longtime data nerd who worked on economic policy research, electoral politics, and healthcare at various times. He enjoys cooking, practicing martial arts, and landscaping in his spare time. Alex leads RStudio's Solutions Engineering team -- come talk to him if you're interested... Read More →


Monday July 25, 2022 9:00am - 10:30am EDT
Maryland 6

9:00am EDT

Introduction to the tidyverse
A unique 6-week data science apprenticeship where you’ll develop the skills necessary to do data science with the R language. View full workshop details.

Speakers
avatar for Garrett  Grolemund

Garrett Grolemund

Educator, RStudio
avatar for Desirée De Leon

Desirée De Leon

Data Science Educator, RStudio Academy, RStudio


Monday July 25, 2022 9:00am - 10:30am EDT
Maryland 4+5

9:00am EDT

Machine Learning with tidymodels
This workshop provides an introduction to machine learning with R. View full workshop details.

Speakers
avatar for David Garrett Robinson

David Garrett Robinson

Heap Analytics
David Robinson is Director of Data Science at Heap Analytics, where he's helping to build the next generation of product analytics technology. He's the co-author with Julia Silge of the tidytext package and the O’Reilly book Text Mining with R. He also created the broom, fuzzyjoin... Read More →


Monday July 25, 2022 9:00am - 10:30am EDT
4. National Harbor 10+11

9:00am EDT

Making Art from Code: How the magic works, and the practical skills it teaches
This workshop provides a hands-on introduction to generative art in R. You’ll learn artistic techniques that generative artists use regularly in their work including flow fields, iterative function systems, tilings, and more. View full workshop details.

Speakers
avatar for Danielle Navarro

Danielle Navarro

Danielle Navarro is a generative artist, data scientist, professional educator, mathematical psychologist, academic professor in recovery, open source R developer, and coauthor of “ggplot2: Elegant Graphics for Data Analysis” (3rd edition). She is currently a developer advocate... Read More →


Monday July 25, 2022 9:00am - 10:30am EDT
Maryland 1

9:00am EDT

Package Development Masterclass
In this two day masterclass, you’ll have the opportunity to dig deep in to advanced package development topics, like writing testable code and producing error messages that spark joy. View full workshop details.

Speakers
avatar for Hadley Wickham

Hadley Wickham

Chief Scientist, RStudio
Hadley is Chief Scientist at RStudio, winner of the 2019 COPSS award, and a member of the R Foundation. He builds tools (both computational and cognitive) to make data science easier, faster, and more fun. His work includes packages for data science (like the tidyverse, which includes... Read More →


Monday July 25, 2022 9:00am - 10:30am EDT
Chesapeake G+H+I

9:00am EDT

R for People Analytics
The course will cover some of the most commonly used methods of analysis and inference when working with data related to people, such as survey data and organizational network data. View full workshop details.

Speakers
avatar for Alex LoPilato

Alex LoPilato

Alex LoPilato is an organizational psychologist with extensive experience in the statistical models, methods, and theories developed to draw inferences from survey data and other psychometric data.
avatar for Keith McNulty

Keith McNulty

McKinsey & Company
Keith McNulty is an applied mathematician/statistician, psychometrician and data scientist based in the UK. He started his career as a Pure Mathematician with a focus on Matrix Algebra and Group Representation Theory. He then transitioned into the private sector where he developed... Read More →
avatar for Liz Romero

Liz Romero

Liz Romero is an applied statistician with extensive experience in the engineering and modeling of people-related data.


Monday July 25, 2022 9:00am - 10:30am EDT
Chesapeake E

9:00am EDT

What They Forgot to Teach You About R
This is a two-day hands on workshop designed for experienced R and RStudio users who want to (re)design their R lifestyle. View full workshop details.

Speakers
avatar for E. David Aja

E. David Aja

RStudio
E. David Aja is a Solutions Engineer at RStudio. He helps data scientists and IT communicate about their needs. Before joining RStudio, he worked as a data scientist in the public sector.
avatar for Jenny Bryan

Jenny Bryan

software engineer, RStudio
avatar for Shannon McClintock Pileggi

Shannon McClintock Pileggi

Data Scientist, The Prostate Cancers Clinical Trials Consortium
I spend my days wrangling data, creating reproducible reports, building shiny apps, developing R packages, and mentoring junior Data Scientists at The Prostate Cancer Clinical Trials Consortium, housed under Memorial Sloan Kettering Cancer Center. I also enjoy blogging and I am a... Read More →


Monday July 25, 2022 9:00am - 10:30am EDT
Chesapeake 4+5+6

11:00am EDT

Building Production-Quality Shiny Applications
This workshop is for the Shiny developer who has entered this stage of their application development journey. View full workshop details.

Speakers
avatar for Eric Nantz

Eric Nantz

Eric Nantz is a director within the statistical innovation center at Eli Lilly and Company, creating analytical pipelines and capabilities of advanced statistical methodologies for clinical design used in multiple phases of development. Outside of his day job, Eric is passionate about... Read More →


Monday July 25, 2022 11:00am - 12:30pm EDT
National Harbor 2

11:00am EDT

Building Tidy Tools
This is a two-day, hands-on workshop for those who have embraced the tidyverse and want to build their own packages. View full workshop details.

Speakers
avatar for Ian Lyttle

Ian Lyttle

Ian Lyttle works as a Data Scientist at Schneider Electric. His technical interests include visualization, interactivity, and functional programming. He is a community contributor to tidyverse and r-lib, and maintains CRAN packages including vegawidget and boxr. He has delivered tutorials... Read More →
avatar for Emma Rand

Emma Rand

Associate Prof, University of York
Howdy! I am a Senior Lecturer (Associate Prof) in the Department of Biology at the University of York where I teach data science and reproducibility, particularly to those who do not see themselves as programmers. I lead a UKRI funded project called Cloud-SPAN which trains researchers... Read More →


Monday July 25, 2022 11:00am - 12:30pm EDT
Chesapeake F

11:00am EDT

Causal Inference in R
In this workshop, we’ll teach the essential elements of answering causal questions in R through causal diagrams, and causal modeling techniques such as propensity scores and inverse probability weighting. View full workshop details.

Speakers
avatar for Malcolm Barrett

Malcolm Barrett

Malcolm Barrett is a data scientist and an epidemiologist. During his Ph.D., he studied vision loss, focusing on epidemiologic methods. He's since worked in the private sector, including Teladoc Health and Apple. Malcolm is also the author of several causal inference-focused R packages... Read More →
avatar for Lucy D'Agostino McGowan

Lucy D'Agostino McGowan

Lucy D’Agostino McGowan is an assistant professor in the Mathematics and Statistics Department at Wake Forest University. She received her PhD in Biostatistics from Vanderbilt University and completed her postdoctoral training at Johns Hopkins University Bloomberg School of Public... Read More →


Monday July 25, 2022 11:00am - 12:30pm EDT
National Harbor 3

11:00am EDT

Clinical Reporting in R
In this workshop you will be working with the Tidyverse and the first generation of Pharmaverse tools in order to ‘deliver’ typical clinical trial displays. Specifically, we will walk through an end-to-end focussing on a sample of demographic, safety and efficacy summaries. View full workshop details.

Speakers
avatar for Christina Fillmore

Christina Fillmore

Christina Fillmore formally trained as a statistician, over her 5 years at GSK has transitioned into a data scientist. Her focus is on open source package development in a regulated environment. She also has a keen interesting in helping other develop R skills and become confident... Read More →
avatar for Andy Nicholls

Andy Nicholls

Head of Data Science, GlaxoSmithKline, United Kingdom
Andy Nicholls is Head of Statistical Data Sciences at GSK. Andy is responsible for building Data Science capability within GSK Biostatistics and delivering Data Science solutions across R&D. Within the wider pharmaceutical industry, Andy is the lead for the R Validation Hub, a collaboration... Read More →


Monday July 25, 2022 11:00am - 12:30pm EDT
Chesapeake D

11:00am EDT

Designing the Data Science Classroom
The goal of this workshop is to equip educators with concrete information on content, workflows, and infrastructure for painlessly introducing modern computation with R and RStudio within a data science curriculum. View full workshop details.

Speakers
avatar for Mine Cetinkaya-Rundel

Mine Cetinkaya-Rundel

Professor + Developer Educator, RStudio + Duke University
Mine Çetinkaya-Rundel is Professor of the Practice at Duke University and Developer Educator at RStudio. Mine’s work focuses on innovation in statistics and data science pedagogy, with an emphasis on computing, reproducible research, student-centered learning, and open-source education... Read More →
avatar for Maria Tackett

Maria Tackett

Maria Tackett is an Assistant Professor of the Practice in the Department of Statistical Science at Duke University. Prior to joining the faculty at Duke, Maria earned a Ph.D. in Statistics from the University of Virginia and worked as a statistician at Capital One. Her current work... Read More →


Monday July 25, 2022 11:00am - 12:30pm EDT
Maryland 3

11:00am EDT

From R Markdown to Quarto
This workshop is designed for those who want to take their R Markdown skills and expertise and apply them in Quarto, the next generation of R Markdown. View full workshop details.

Speakers
avatar for Andrew Bray

Andrew Bray

Andrew Bray is an Associate Teaching Professor in the Department of Statistics at UC Berkeley where he develops and teaches courses in statistics and data science. His research interests include statistical computing, data privacy, and applications of statistical models to environmental... Read More →


Monday July 25, 2022 11:00am - 12:30pm EDT
National Harbor 12+13

11:00am EDT

Getting Started with Quarto
This workshop is designed for those who have no or little prior experience with R Markdown and who want to learn Quarto. View full workshop details.

Speakers
avatar for Tom Mock

Tom Mock

RStudio, PBC
Thomas is the Customer Enablement Lead at RStudio, helping RStudio’s customers be as successful as possible. He is deeply involved global data science community, sharing tips on #RStats Twitter (find him at @thomas_mock), as co-founder of #TidyTuesday, a weekly Data Science learning... Read More →


Monday July 25, 2022 11:00am - 12:30pm EDT
Maryland 2

11:00am EDT

Getting Started with Shiny
Shiny is an R package that makes it easy to build interactive web apps straight from R. This workshop will start at the beginning. View full workshop details.

Speakers
avatar for Colin Rundel

Colin Rundel

Associate Professor, Duke University
Colin Rundel is an Associate Professor of the Practice at Duke University where he has been teaching since 2012. His work focuses on teaching statistical computing to both undergraduate and graduate students in both R and Python. He has been teaching and using Shiny since 2015.


Monday July 25, 2022 11:00am - 12:30pm EDT
National Harbor 6+7

11:00am EDT

Graphic Design with ggplot2: How to Create Engaging and Complex Visualizations in R
The workshop covers the most important steps and helpful tips to create visually appealing, engaging and complex graphics with ggplot2. View full workshop details.

Speakers
avatar for Cédric Scherer

Cédric Scherer

Data Visualization Professional, Independent
Cédric Scherer is a graduated computational ecologist with a passion for design. After his PhD, he combined his expertise in analyzing and visualizing large data sets in R with his passion to become an independent data visualization designer and specialist. Cédric has created v... Read More →


Monday July 25, 2022 11:00am - 12:30pm EDT
National Harbor 4+5

11:00am EDT

How Data Science with R Works for Systems Administrators
In this workshop, you'll learn to use the capabilities of RStudio Team to enable your organization's R and Python users, including topics like package and environment management, performance and scaling, external data connections, and integrating RStudio Team with CI/CD pipelines. View full workshop details.

Speakers
avatar for Alex Gold

Alex Gold

Solutions Engineering, RStudio
Alex is a longtime data nerd who worked on economic policy research, electoral politics, and healthcare at various times. He enjoys cooking, practicing martial arts, and landscaping in his spare time. Alex leads RStudio's Solutions Engineering team -- come talk to him if you're interested... Read More →


Monday July 25, 2022 11:00am - 12:30pm EDT
Maryland 6

11:00am EDT

Introduction to the tidyverse
A unique 6-week data science apprenticeship where you’ll develop the skills necessary to do data science with the R language. View full workshop details.

Speakers
avatar for Garrett  Grolemund

Garrett Grolemund

Educator, RStudio
avatar for Desirée De Leon

Desirée De Leon

Data Science Educator, RStudio Academy, RStudio


Monday July 25, 2022 11:00am - 12:30pm EDT
Maryland 4+5

11:00am EDT

Machine Learning with tidymodels
This workshop provides an introduction to machine learning with R. View full workshop details.

Speakers
avatar for David Garrett Robinson

David Garrett Robinson

Heap Analytics
David Robinson is Director of Data Science at Heap Analytics, where he's helping to build the next generation of product analytics technology. He's the co-author with Julia Silge of the tidytext package and the O’Reilly book Text Mining with R. He also created the broom, fuzzyjoin... Read More →


Monday July 25, 2022 11:00am - 12:30pm EDT
4. National Harbor 10+11

11:00am EDT

Making Art from Code: How the magic works, and the practical skills it teaches
This workshop provides a hands-on introduction to generative art in R. You’ll learn artistic techniques that generative artists use regularly in their work including flow fields, iterative function systems, tilings, and more. View full workshop details.

Speakers
avatar for Danielle Navarro

Danielle Navarro

Danielle Navarro is a generative artist, data scientist, professional educator, mathematical psychologist, academic professor in recovery, open source R developer, and coauthor of “ggplot2: Elegant Graphics for Data Analysis” (3rd edition). She is currently a developer advocate... Read More →


Monday July 25, 2022 11:00am - 12:30pm EDT
Maryland 1

11:00am EDT

Package Development Masterclass
In this two day masterclass, you’ll have the opportunity to dig deep in to advanced package development topics, like writing testable code and producing error messages that spark joy. View full workshop details.

Speakers
avatar for Hadley Wickham

Hadley Wickham

Chief Scientist, RStudio
Hadley is Chief Scientist at RStudio, winner of the 2019 COPSS award, and a member of the R Foundation. He builds tools (both computational and cognitive) to make data science easier, faster, and more fun. His work includes packages for data science (like the tidyverse, which includes... Read More →


Monday July 25, 2022 11:00am - 12:30pm EDT
Chesapeake G+H+I

11:00am EDT

R for People Analytics
The course will cover some of the most commonly used methods of analysis and inference when working with data related to people, such as survey data and organizational network data. View full workshop details.

Speakers
avatar for Alex LoPilato

Alex LoPilato

Alex LoPilato is an organizational psychologist with extensive experience in the statistical models, methods, and theories developed to draw inferences from survey data and other psychometric data.
avatar for Keith McNulty

Keith McNulty

McKinsey & Company
Keith McNulty is an applied mathematician/statistician, psychometrician and data scientist based in the UK. He started his career as a Pure Mathematician with a focus on Matrix Algebra and Group Representation Theory. He then transitioned into the private sector where he developed... Read More →
avatar for Liz Romero

Liz Romero

Liz Romero is an applied statistician with extensive experience in the engineering and modeling of people-related data.


Monday July 25, 2022 11:00am - 12:30pm EDT
Chesapeake E

11:00am EDT

What They Forgot to Teach You About R
This is a two-day hands on workshop designed for experienced R and RStudio users who want to (re)design their R lifestyle. View full workshop details.

Speakers
avatar for E. David Aja

E. David Aja

RStudio
E. David Aja is a Solutions Engineer at RStudio. He helps data scientists and IT communicate about their needs. Before joining RStudio, he worked as a data scientist in the public sector.
avatar for Jenny Bryan

Jenny Bryan

software engineer, RStudio
avatar for Shannon McClintock Pileggi

Shannon McClintock Pileggi

Data Scientist, The Prostate Cancers Clinical Trials Consortium
I spend my days wrangling data, creating reproducible reports, building shiny apps, developing R packages, and mentoring junior Data Scientists at The Prostate Cancer Clinical Trials Consortium, housed under Memorial Sloan Kettering Cancer Center. I also enjoy blogging and I am a... Read More →


Monday July 25, 2022 11:00am - 12:30pm EDT
Chesapeake 4+5+6

12:30pm EDT

Lunch
See you in Maryland for lunch.

Buffet & boxed lunch options. Outdoor seating available.

Monday July 25, 2022 12:30pm - 1:30pm EDT
5. Maryland

1:30pm EDT

Building Production-Quality Shiny Applications
This workshop is for the Shiny developer who has entered this stage of their application development journey. View full workshop details.

Speakers
avatar for Eric Nantz

Eric Nantz

Eric Nantz is a director within the statistical innovation center at Eli Lilly and Company, creating analytical pipelines and capabilities of advanced statistical methodologies for clinical design used in multiple phases of development. Outside of his day job, Eric is passionate about... Read More →


Monday July 25, 2022 1:30pm - 3:00pm EDT
National Harbor 2

1:30pm EDT

Building Tidy Tools
This is a two-day, hands-on workshop for those who have embraced the tidyverse and want to build their own packages. View full workshop details.

Speakers
avatar for Ian Lyttle

Ian Lyttle

Ian Lyttle works as a Data Scientist at Schneider Electric. His technical interests include visualization, interactivity, and functional programming. He is a community contributor to tidyverse and r-lib, and maintains CRAN packages including vegawidget and boxr. He has delivered tutorials... Read More →
avatar for Emma Rand

Emma Rand

Associate Prof, University of York
Howdy! I am a Senior Lecturer (Associate Prof) in the Department of Biology at the University of York where I teach data science and reproducibility, particularly to those who do not see themselves as programmers. I lead a UKRI funded project called Cloud-SPAN which trains researchers... Read More →


Monday July 25, 2022 1:30pm - 3:00pm EDT
Chesapeake F

1:30pm EDT

Causal Inference in R
In this workshop, we’ll teach the essential elements of answering causal questions in R through causal diagrams, and causal modeling techniques such as propensity scores and inverse probability weighting. View full workshop details.

Speakers
avatar for Malcolm Barrett

Malcolm Barrett

Malcolm Barrett is a data scientist and an epidemiologist. During his Ph.D., he studied vision loss, focusing on epidemiologic methods. He's since worked in the private sector, including Teladoc Health and Apple. Malcolm is also the author of several causal inference-focused R packages... Read More →
avatar for Lucy D'Agostino McGowan

Lucy D'Agostino McGowan

Lucy D’Agostino McGowan is an assistant professor in the Mathematics and Statistics Department at Wake Forest University. She received her PhD in Biostatistics from Vanderbilt University and completed her postdoctoral training at Johns Hopkins University Bloomberg School of Public... Read More →


Monday July 25, 2022 1:30pm - 3:00pm EDT
National Harbor 3

1:30pm EDT

Clinical Reporting in R
In this workshop you will be working with the Tidyverse and the first generation of Pharmaverse tools in order to ‘deliver’ typical clinical trial displays. Specifically, we will walk through an end-to-end focussing on a sample of demographic, safety and efficacy summaries. View full workshop details.

Speakers
avatar for Christina Fillmore

Christina Fillmore

Christina Fillmore formally trained as a statistician, over her 5 years at GSK has transitioned into a data scientist. Her focus is on open source package development in a regulated environment. She also has a keen interesting in helping other develop R skills and become confident... Read More →
avatar for Andy Nicholls

Andy Nicholls

Head of Data Science, GlaxoSmithKline, United Kingdom
Andy Nicholls is Head of Statistical Data Sciences at GSK. Andy is responsible for building Data Science capability within GSK Biostatistics and delivering Data Science solutions across R&D. Within the wider pharmaceutical industry, Andy is the lead for the R Validation Hub, a collaboration... Read More →


Monday July 25, 2022 1:30pm - 3:00pm EDT
Chesapeake D

1:30pm EDT

Designing the Data Science Classroom
The goal of this workshop is to equip educators with concrete information on content, workflows, and infrastructure for painlessly introducing modern computation with R and RStudio within a data science curriculum. View full workshop details.

Speakers
avatar for Mine Cetinkaya-Rundel

Mine Cetinkaya-Rundel

Professor + Developer Educator, RStudio + Duke University
Mine Çetinkaya-Rundel is Professor of the Practice at Duke University and Developer Educator at RStudio. Mine’s work focuses on innovation in statistics and data science pedagogy, with an emphasis on computing, reproducible research, student-centered learning, and open-source education... Read More →
avatar for Maria Tackett

Maria Tackett

Maria Tackett is an Assistant Professor of the Practice in the Department of Statistical Science at Duke University. Prior to joining the faculty at Duke, Maria earned a Ph.D. in Statistics from the University of Virginia and worked as a statistician at Capital One. Her current work... Read More →


Monday July 25, 2022 1:30pm - 3:00pm EDT
Maryland 3

1:30pm EDT

From R Markdown to Quarto
This workshop is designed for those who want to take their R Markdown skills and expertise and apply them in Quarto, the next generation of R Markdown. View full workshop details.

Speakers
avatar for Andrew Bray

Andrew Bray

Andrew Bray is an Associate Teaching Professor in the Department of Statistics at UC Berkeley where he develops and teaches courses in statistics and data science. His research interests include statistical computing, data privacy, and applications of statistical models to environmental... Read More →


Monday July 25, 2022 1:30pm - 3:00pm EDT
National Harbor 12+13

1:30pm EDT

Getting Started with Quarto
This workshop is designed for those who have no or little prior experience with R Markdown and who want to learn Quarto. View full workshop details.

Speakers
avatar for Tom Mock

Tom Mock

RStudio, PBC
Thomas is the Customer Enablement Lead at RStudio, helping RStudio’s customers be as successful as possible. He is deeply involved global data science community, sharing tips on #RStats Twitter (find him at @thomas_mock), as co-founder of #TidyTuesday, a weekly Data Science learning... Read More →


Monday July 25, 2022 1:30pm - 3:00pm EDT
Maryland 2

1:30pm EDT

Getting Started with Shiny
Shiny is an R package that makes it easy to build interactive web apps straight from R. This workshop will start at the beginning. View full workshop details.

Speakers
avatar for Colin Rundel

Colin Rundel

Associate Professor, Duke University
Colin Rundel is an Associate Professor of the Practice at Duke University where he has been teaching since 2012. His work focuses on teaching statistical computing to both undergraduate and graduate students in both R and Python. He has been teaching and using Shiny since 2015.


Monday July 25, 2022 1:30pm - 3:00pm EDT
National Harbor 6+7

1:30pm EDT

Graphic Design with ggplot2: How to Create Engaging and Complex Visualizations in R
The workshop covers the most important steps and helpful tips to create visually appealing, engaging and complex graphics with ggplot2. View full workshop details.

Speakers
avatar for Cédric Scherer

Cédric Scherer

Data Visualization Professional, Independent
Cédric Scherer is a graduated computational ecologist with a passion for design. After his PhD, he combined his expertise in analyzing and visualizing large data sets in R with his passion to become an independent data visualization designer and specialist. Cédric has created v... Read More →


Monday July 25, 2022 1:30pm - 3:00pm EDT
National Harbor 4+5

1:30pm EDT

How Data Science with R Works for Systems Administrators
In this workshop, you'll learn to use the capabilities of RStudio Team to enable your organization's R and Python users, including topics like package and environment management, performance and scaling, external data connections, and integrating RStudio Team with CI/CD pipelines. View full workshop details.

Speakers
avatar for Alex Gold

Alex Gold

Solutions Engineering, RStudio
Alex is a longtime data nerd who worked on economic policy research, electoral politics, and healthcare at various times. He enjoys cooking, practicing martial arts, and landscaping in his spare time. Alex leads RStudio's Solutions Engineering team -- come talk to him if you're interested... Read More →


Monday July 25, 2022 1:30pm - 3:00pm EDT
Maryland 6

1:30pm EDT

Introduction to the tidyverse
A unique 6-week data science apprenticeship where you’ll develop the skills necessary to do data science with the R language. View full workshop details.

Speakers
avatar for Garrett  Grolemund

Garrett Grolemund

Educator, RStudio
avatar for Desirée De Leon

Desirée De Leon

Data Science Educator, RStudio Academy, RStudio


Monday July 25, 2022 1:30pm - 3:00pm EDT
Maryland 4+5

1:30pm EDT

Machine Learning with tidymodels
This workshop provides an introduction to machine learning with R. View full workshop details.

Speakers
avatar for David Garrett Robinson

David Garrett Robinson

Heap Analytics
David Robinson is Director of Data Science at Heap Analytics, where he's helping to build the next generation of product analytics technology. He's the co-author with Julia Silge of the tidytext package and the O’Reilly book Text Mining with R. He also created the broom, fuzzyjoin... Read More →


Monday July 25, 2022 1:30pm - 3:00pm EDT
4. National Harbor 10+11

1:30pm EDT

Making Art from Code: How the magic works, and the practical skills it teaches
This workshop provides a hands-on introduction to generative art in R. You’ll learn artistic techniques that generative artists use regularly in their work including flow fields, iterative function systems, tilings, and more. View full workshop details.

Speakers
avatar for Danielle Navarro

Danielle Navarro

Danielle Navarro is a generative artist, data scientist, professional educator, mathematical psychologist, academic professor in recovery, open source R developer, and coauthor of “ggplot2: Elegant Graphics for Data Analysis” (3rd edition). She is currently a developer advocate... Read More →


Monday July 25, 2022 1:30pm - 3:00pm EDT
Maryland 1

1:30pm EDT

Package Development Masterclass
In this two day masterclass, you’ll have the opportunity to dig deep in to advanced package development topics, like writing testable code and producing error messages that spark joy. View full workshop details.

Speakers
avatar for Hadley Wickham

Hadley Wickham

Chief Scientist, RStudio
Hadley is Chief Scientist at RStudio, winner of the 2019 COPSS award, and a member of the R Foundation. He builds tools (both computational and cognitive) to make data science easier, faster, and more fun. His work includes packages for data science (like the tidyverse, which includes... Read More →


Monday July 25, 2022 1:30pm - 3:00pm EDT
Chesapeake G+H+I

1:30pm EDT

R for People Analytics
The course will cover some of the most commonly used methods of analysis and inference when working with data related to people, such as survey data and organizational network data. View full workshop details.

Speakers
avatar for Alex LoPilato

Alex LoPilato

Alex LoPilato is an organizational psychologist with extensive experience in the statistical models, methods, and theories developed to draw inferences from survey data and other psychometric data.
avatar for Keith McNulty

Keith McNulty

McKinsey & Company
Keith McNulty is an applied mathematician/statistician, psychometrician and data scientist based in the UK. He started his career as a Pure Mathematician with a focus on Matrix Algebra and Group Representation Theory. He then transitioned into the private sector where he developed... Read More →
avatar for Liz Romero

Liz Romero

Liz Romero is an applied statistician with extensive experience in the engineering and modeling of people-related data.


Monday July 25, 2022 1:30pm - 3:00pm EDT
Chesapeake E

1:30pm EDT

What They Forgot to Teach You About R
This is a two-day hands on workshop designed for experienced R and RStudio users who want to (re)design their R lifestyle. View full workshop details.

Speakers
avatar for E. David Aja

E. David Aja

RStudio
E. David Aja is a Solutions Engineer at RStudio. He helps data scientists and IT communicate about their needs. Before joining RStudio, he worked as a data scientist in the public sector.
avatar for Jenny Bryan

Jenny Bryan

software engineer, RStudio
avatar for Shannon McClintock Pileggi

Shannon McClintock Pileggi

Data Scientist, The Prostate Cancers Clinical Trials Consortium
I spend my days wrangling data, creating reproducible reports, building shiny apps, developing R packages, and mentoring junior Data Scientists at The Prostate Cancer Clinical Trials Consortium, housed under Memorial Sloan Kettering Cancer Center. I also enjoy blogging and I am a... Read More →


Monday July 25, 2022 1:30pm - 3:00pm EDT
Chesapeake 4+5+6

3:30pm EDT

Building Production-Quality Shiny Applications
This workshop is for the Shiny developer who has entered this stage of their application development journey. View full workshop details.

Speakers
avatar for Eric Nantz

Eric Nantz

Eric Nantz is a director within the statistical innovation center at Eli Lilly and Company, creating analytical pipelines and capabilities of advanced statistical methodologies for clinical design used in multiple phases of development. Outside of his day job, Eric is passionate about... Read More →


Monday July 25, 2022 3:30pm - 5:00pm EDT
National Harbor 2

3:30pm EDT

Building Tidy Tools
This is a two-day, hands-on workshop for those who have embraced the tidyverse and want to build their own packages. View full workshop details.

Speakers
avatar for Ian Lyttle

Ian Lyttle

Ian Lyttle works as a Data Scientist at Schneider Electric. His technical interests include visualization, interactivity, and functional programming. He is a community contributor to tidyverse and r-lib, and maintains CRAN packages including vegawidget and boxr. He has delivered tutorials... Read More →
avatar for Emma Rand

Emma Rand

Associate Prof, University of York
Howdy! I am a Senior Lecturer (Associate Prof) in the Department of Biology at the University of York where I teach data science and reproducibility, particularly to those who do not see themselves as programmers. I lead a UKRI funded project called Cloud-SPAN which trains researchers... Read More →


Monday July 25, 2022 3:30pm - 5:00pm EDT
Chesapeake F

3:30pm EDT

Causal Inference in R
In this workshop, we’ll teach the essential elements of answering causal questions in R through causal diagrams, and causal modeling techniques such as propensity scores and inverse probability weighting. View full workshop details.

Speakers
avatar for Malcolm Barrett

Malcolm Barrett

Malcolm Barrett is a data scientist and an epidemiologist. During his Ph.D., he studied vision loss, focusing on epidemiologic methods. He's since worked in the private sector, including Teladoc Health and Apple. Malcolm is also the author of several causal inference-focused R packages... Read More →
avatar for Lucy D'Agostino McGowan

Lucy D'Agostino McGowan

Lucy D’Agostino McGowan is an assistant professor in the Mathematics and Statistics Department at Wake Forest University. She received her PhD in Biostatistics from Vanderbilt University and completed her postdoctoral training at Johns Hopkins University Bloomberg School of Public... Read More →


Monday July 25, 2022 3:30pm - 5:00pm EDT
National Harbor 3

3:30pm EDT

Clinical Reporting in R
In this workshop you will be working with the Tidyverse and the first generation of Pharmaverse tools in order to ‘deliver’ typical clinical trial displays. Specifically, we will walk through an end-to-end focussing on a sample of demographic, safety and efficacy summaries. View full workshop details.

Speakers
avatar for Christina Fillmore

Christina Fillmore

Christina Fillmore formally trained as a statistician, over her 5 years at GSK has transitioned into a data scientist. Her focus is on open source package development in a regulated environment. She also has a keen interesting in helping other develop R skills and become confident... Read More →
avatar for Andy Nicholls

Andy Nicholls

Head of Data Science, GlaxoSmithKline, United Kingdom
Andy Nicholls is Head of Statistical Data Sciences at GSK. Andy is responsible for building Data Science capability within GSK Biostatistics and delivering Data Science solutions across R&D. Within the wider pharmaceutical industry, Andy is the lead for the R Validation Hub, a collaboration... Read More →


Monday July 25, 2022 3:30pm - 5:00pm EDT
Chesapeake D

3:30pm EDT

Designing the Data Science Classroom
The goal of this workshop is to equip educators with concrete information on content, workflows, and infrastructure for painlessly introducing modern computation with R and RStudio within a data science curriculum. View full workshop details.

Speakers
avatar for Mine Cetinkaya-Rundel

Mine Cetinkaya-Rundel

Professor + Developer Educator, RStudio + Duke University
Mine Çetinkaya-Rundel is Professor of the Practice at Duke University and Developer Educator at RStudio. Mine’s work focuses on innovation in statistics and data science pedagogy, with an emphasis on computing, reproducible research, student-centered learning, and open-source education... Read More →
avatar for Maria Tackett

Maria Tackett

Maria Tackett is an Assistant Professor of the Practice in the Department of Statistical Science at Duke University. Prior to joining the faculty at Duke, Maria earned a Ph.D. in Statistics from the University of Virginia and worked as a statistician at Capital One. Her current work... Read More →


Monday July 25, 2022 3:30pm - 5:00pm EDT
Maryland 3

3:30pm EDT

From R Markdown to Quarto
This workshop is designed for those who want to take their R Markdown skills and expertise and apply them in Quarto, the next generation of R Markdown. View full workshop details.

Speakers
avatar for Andrew Bray

Andrew Bray

Andrew Bray is an Associate Teaching Professor in the Department of Statistics at UC Berkeley where he develops and teaches courses in statistics and data science. His research interests include statistical computing, data privacy, and applications of statistical models to environmental... Read More →


Monday July 25, 2022 3:30pm - 5:00pm EDT
National Harbor 12+13

3:30pm EDT

Getting Started with Quarto
This workshop is designed for those who have no or little prior experience with R Markdown and who want to learn Quarto. View full workshop details.

Speakers
avatar for Tom Mock

Tom Mock

RStudio, PBC
Thomas is the Customer Enablement Lead at RStudio, helping RStudio’s customers be as successful as possible. He is deeply involved global data science community, sharing tips on #RStats Twitter (find him at @thomas_mock), as co-founder of #TidyTuesday, a weekly Data Science learning... Read More →


Monday July 25, 2022 3:30pm - 5:00pm EDT
Maryland 2

3:30pm EDT

Getting Started with Shiny
Shiny is an R package that makes it easy to build interactive web apps straight from R. This workshop will start at the beginning. View full workshop details.

Speakers
avatar for Colin Rundel

Colin Rundel

Associate Professor, Duke University
Colin Rundel is an Associate Professor of the Practice at Duke University where he has been teaching since 2012. His work focuses on teaching statistical computing to both undergraduate and graduate students in both R and Python. He has been teaching and using Shiny since 2015.


Monday July 25, 2022 3:30pm - 5:00pm EDT
National Harbor 6+7

3:30pm EDT

Graphic Design with ggplot2: How to Create Engaging and Complex Visualizations in R
The workshop covers the most important steps and helpful tips to create visually appealing, engaging and complex graphics with ggplot2. View full workshop details.

Speakers
avatar for Cédric Scherer

Cédric Scherer

Data Visualization Professional, Independent
Cédric Scherer is a graduated computational ecologist with a passion for design. After his PhD, he combined his expertise in analyzing and visualizing large data sets in R with his passion to become an independent data visualization designer and specialist. Cédric has created v... Read More →


Monday July 25, 2022 3:30pm - 5:00pm EDT
National Harbor 4+5

3:30pm EDT

How Data Science with R Works for Systems Administrators
In this workshop, you'll learn to use the capabilities of RStudio Team to enable your organization's R and Python users, including topics like package and environment management, performance and scaling, external data connections, and integrating RStudio Team with CI/CD pipelines. View full workshop details.

Speakers
avatar for Alex Gold

Alex Gold

Solutions Engineering, RStudio
Alex is a longtime data nerd who worked on economic policy research, electoral politics, and healthcare at various times. He enjoys cooking, practicing martial arts, and landscaping in his spare time. Alex leads RStudio's Solutions Engineering team -- come talk to him if you're interested... Read More →


Monday July 25, 2022 3:30pm - 5:00pm EDT
Maryland 6

3:30pm EDT

Introduction to the tidyverse
A unique 6-week data science apprenticeship where you’ll develop the skills necessary to do data science with the R language. View full workshop details.

Speakers
avatar for Garrett  Grolemund

Garrett Grolemund

Educator, RStudio
avatar for Desirée De Leon

Desirée De Leon

Data Science Educator, RStudio Academy, RStudio


Monday July 25, 2022 3:30pm - 5:00pm EDT
Maryland 4+5

3:30pm EDT

Machine Learning with tidymodels
This workshop provides an introduction to machine learning with R. View full workshop details.

Speakers
avatar for David Garrett Robinson

David Garrett Robinson

Heap Analytics
David Robinson is Director of Data Science at Heap Analytics, where he's helping to build the next generation of product analytics technology. He's the co-author with Julia Silge of the tidytext package and the O’Reilly book Text Mining with R. He also created the broom, fuzzyjoin... Read More →


Monday July 25, 2022 3:30pm - 5:00pm EDT
4. National Harbor 10+11

3:30pm EDT

Making Art from Code: How the magic works, and the practical skills it teaches
This workshop provides a hands-on introduction to generative art in R. You’ll learn artistic techniques that generative artists use regularly in their work including flow fields, iterative function systems, tilings, and more. View full workshop details.

Speakers
avatar for Danielle Navarro

Danielle Navarro

Danielle Navarro is a generative artist, data scientist, professional educator, mathematical psychologist, academic professor in recovery, open source R developer, and coauthor of “ggplot2: Elegant Graphics for Data Analysis” (3rd edition). She is currently a developer advocate... Read More →


Monday July 25, 2022 3:30pm - 5:00pm EDT
Maryland 1

3:30pm EDT

Package Development Masterclass
In this two day masterclass, you’ll have the opportunity to dig deep in to advanced package development topics, like writing testable code and producing error messages that spark joy. View full workshop details.

Speakers
avatar for Hadley Wickham

Hadley Wickham

Chief Scientist, RStudio
Hadley is Chief Scientist at RStudio, winner of the 2019 COPSS award, and a member of the R Foundation. He builds tools (both computational and cognitive) to make data science easier, faster, and more fun. His work includes packages for data science (like the tidyverse, which includes... Read More →


Monday July 25, 2022 3:30pm - 5:00pm EDT
Chesapeake G+H+I

3:30pm EDT

R for People Analytics
The course will cover some of the most commonly used methods of analysis and inference when working with data related to people, such as survey data and organizational network data. View full workshop details.

Speakers
avatar for Alex LoPilato

Alex LoPilato

Alex LoPilato is an organizational psychologist with extensive experience in the statistical models, methods, and theories developed to draw inferences from survey data and other psychometric data.
avatar for Keith McNulty

Keith McNulty

McKinsey & Company
Keith McNulty is an applied mathematician/statistician, psychometrician and data scientist based in the UK. He started his career as a Pure Mathematician with a focus on Matrix Algebra and Group Representation Theory. He then transitioned into the private sector where he developed... Read More →
avatar for Liz Romero

Liz Romero

Liz Romero is an applied statistician with extensive experience in the engineering and modeling of people-related data.


Monday July 25, 2022 3:30pm - 5:00pm EDT
Chesapeake E

3:30pm EDT

What They Forgot to Teach You About R
This is a two-day hands on workshop designed for experienced R and RStudio users who want to (re)design their R lifestyle. View full workshop details.

Speakers
avatar for E. David Aja

E. David Aja

RStudio
E. David Aja is a Solutions Engineer at RStudio. He helps data scientists and IT communicate about their needs. Before joining RStudio, he worked as a data scientist in the public sector.
avatar for Jenny Bryan

Jenny Bryan

software engineer, RStudio
avatar for Shannon McClintock Pileggi

Shannon McClintock Pileggi

Data Scientist, The Prostate Cancers Clinical Trials Consortium
I spend my days wrangling data, creating reproducible reports, building shiny apps, developing R packages, and mentoring junior Data Scientists at The Prostate Cancer Clinical Trials Consortium, housed under Memorial Sloan Kettering Cancer Center. I also enjoy blogging and I am a... Read More →


Monday July 25, 2022 3:30pm - 5:00pm EDT
Chesapeake 4+5+6

5:30pm EDT

Book Signing Reception
The incredible bibliography of our workshop instructors will be on display this evening. Pick up your free book and get it signed by the author.

Already own one of these titles? bring your collection and have it signed.

Authors signing books include Mine Çetinkaya-Rundel, Desirée De Leon, Christophe Dervieux, Max Kuhn, Edgar Ruiz, David Robinson, Julia Silge, Hadley Wickham, & more.

Light refreshments will be served.

Monday July 25, 2022 5:30pm - 7:00pm EDT
3. Cherry Blossom
 
Tuesday, July 26
 

7:30am EDT

Breakfast
See you in the Maryland ballroom for breakfast.

Buffet & boxed lunch options. Outdoor seating available.

Workshops start at 9:00 and are a five-to-ten minute walk away.

Tuesday July 26, 2022 7:30am - 9:00am EDT
5. Maryland

9:00am EDT

Building Production-Quality Shiny Applications
This workshop is for the Shiny developer who has entered this stage of their application development journey. View full workshop details.

Speakers
avatar for Eric Nantz

Eric Nantz

Eric Nantz is a director within the statistical innovation center at Eli Lilly and Company, creating analytical pipelines and capabilities of advanced statistical methodologies for clinical design used in multiple phases of development. Outside of his day job, Eric is passionate about... Read More →


Tuesday July 26, 2022 9:00am - 10:30am EDT
National Harbor 2

9:00am EDT

Building Tidy Tools
This is a two-day, hands-on workshop for those who have embraced the tidyverse and want to build their own packages. View full workshop details.

Speakers
avatar for Ian Lyttle

Ian Lyttle

Ian Lyttle works as a Data Scientist at Schneider Electric. His technical interests include visualization, interactivity, and functional programming. He is a community contributor to tidyverse and r-lib, and maintains CRAN packages including vegawidget and boxr. He has delivered tutorials... Read More →
avatar for Emma Rand

Emma Rand

Associate Prof, University of York
Howdy! I am a Senior Lecturer (Associate Prof) in the Department of Biology at the University of York where I teach data science and reproducibility, particularly to those who do not see themselves as programmers. I lead a UKRI funded project called Cloud-SPAN which trains researchers... Read More →


Tuesday July 26, 2022 9:00am - 10:30am EDT
Chesapeake F

9:00am EDT

Causal Inference in R
In this workshop, we’ll teach the essential elements of answering causal questions in R through causal diagrams, and causal modeling techniques such as propensity scores and inverse probability weighting. View full workshop details.

Speakers
avatar for Malcolm Barrett

Malcolm Barrett

Malcolm Barrett is a data scientist and an epidemiologist. During his Ph.D., he studied vision loss, focusing on epidemiologic methods. He's since worked in the private sector, including Teladoc Health and Apple. Malcolm is also the author of several causal inference-focused R packages... Read More →
avatar for Lucy D'Agostino McGowan

Lucy D'Agostino McGowan

Lucy D’Agostino McGowan is an assistant professor in the Mathematics and Statistics Department at Wake Forest University. She received her PhD in Biostatistics from Vanderbilt University and completed her postdoctoral training at Johns Hopkins University Bloomberg School of Public... Read More →


Tuesday July 26, 2022 9:00am - 10:30am EDT
National Harbor 3

9:00am EDT

Clinical Reporting in R
In this workshop you will be working with the Tidyverse and the first generation of Pharmaverse tools in order to ‘deliver’ typical clinical trial displays. Specifically, we will walk through an end-to-end focussing on a sample of demographic, safety and efficacy summaries. View full workshop details.

Speakers
avatar for Christina Fillmore

Christina Fillmore

Christina Fillmore formally trained as a statistician, over her 5 years at GSK has transitioned into a data scientist. Her focus is on open source package development in a regulated environment. She also has a keen interesting in helping other develop R skills and become confident... Read More →
avatar for Andy Nicholls

Andy Nicholls

Head of Data Science, GlaxoSmithKline, United Kingdom
Andy Nicholls is Head of Statistical Data Sciences at GSK. Andy is responsible for building Data Science capability within GSK Biostatistics and delivering Data Science solutions across R&D. Within the wider pharmaceutical industry, Andy is the lead for the R Validation Hub, a collaboration... Read More →


Tuesday July 26, 2022 9:00am - 10:30am EDT
Chesapeake D

9:00am EDT

Designing the Data Science Classroom
The goal of this workshop is to equip educators with concrete information on content, workflows, and infrastructure for painlessly introducing modern computation with R and RStudio within a data science curriculum. View full workshop details.

Speakers
avatar for Mine Cetinkaya-Rundel

Mine Cetinkaya-Rundel

Professor + Developer Educator, RStudio + Duke University
Mine Çetinkaya-Rundel is Professor of the Practice at Duke University and Developer Educator at RStudio. Mine’s work focuses on innovation in statistics and data science pedagogy, with an emphasis on computing, reproducible research, student-centered learning, and open-source education... Read More →
avatar for Maria Tackett

Maria Tackett

Maria Tackett is an Assistant Professor of the Practice in the Department of Statistical Science at Duke University. Prior to joining the faculty at Duke, Maria earned a Ph.D. in Statistics from the University of Virginia and worked as a statistician at Capital One. Her current work... Read More →


Tuesday July 26, 2022 9:00am - 10:30am EDT
Maryland 3

9:00am EDT

From R Markdown to Quarto
This workshop is designed for those who want to take their R Markdown skills and expertise and apply them in Quarto, the next generation of R Markdown. View full workshop details.

Speakers
avatar for Andrew Bray

Andrew Bray

Andrew Bray is an Associate Teaching Professor in the Department of Statistics at UC Berkeley where he develops and teaches courses in statistics and data science. His research interests include statistical computing, data privacy, and applications of statistical models to environmental... Read More →


Tuesday July 26, 2022 9:00am - 10:30am EDT
National Harbor 12+13

9:00am EDT

Getting Started with Quarto
This workshop is designed for those who have no or little prior experience with R Markdown and who want to learn Quarto. View full workshop details.

Speakers
avatar for Tom Mock

Tom Mock

RStudio, PBC
Thomas is the Customer Enablement Lead at RStudio, helping RStudio’s customers be as successful as possible. He is deeply involved global data science community, sharing tips on #RStats Twitter (find him at @thomas_mock), as co-founder of #TidyTuesday, a weekly Data Science learning... Read More →


Tuesday July 26, 2022 9:00am - 10:30am EDT
Maryland 2

9:00am EDT

Getting Started with Shiny
Shiny is an R package that makes it easy to build interactive web apps straight from R. This workshop will start at the beginning. View full workshop details.

Speakers
avatar for Colin Rundel

Colin Rundel

Associate Professor, Duke University
Colin Rundel is an Associate Professor of the Practice at Duke University where he has been teaching since 2012. His work focuses on teaching statistical computing to both undergraduate and graduate students in both R and Python. He has been teaching and using Shiny since 2015.


Tuesday July 26, 2022 9:00am - 10:30am EDT
National Harbor 6+7

9:00am EDT

Graphic Design with ggplot2: How to Create Engaging and Complex Visualizations in R
The workshop covers the most important steps and helpful tips to create visually appealing, engaging and complex graphics with ggplot2. View full workshop details.

Speakers
avatar for Cédric Scherer

Cédric Scherer

Data Visualization Professional, Independent
Cédric Scherer is a graduated computational ecologist with a passion for design. After his PhD, he combined his expertise in analyzing and visualizing large data sets in R with his passion to become an independent data visualization designer and specialist. Cédric has created v... Read More →


Tuesday July 26, 2022 9:00am - 10:30am EDT
National Harbor 4+5

9:00am EDT

How Data Science with R Works for Systems Administrators
In this workshop, you'll learn to use the capabilities of RStudio Team to enable your organization's R and Python users, including topics like package and environment management, performance and scaling, external data connections, and integrating RStudio Team with CI/CD pipelines. View full workshop details.

Speakers
avatar for Alex Gold

Alex Gold

Solutions Engineering, RStudio
Alex is a longtime data nerd who worked on economic policy research, electoral politics, and healthcare at various times. He enjoys cooking, practicing martial arts, and landscaping in his spare time. Alex leads RStudio's Solutions Engineering team -- come talk to him if you're interested... Read More →


Tuesday July 26, 2022 9:00am - 10:30am EDT
Maryland 6

9:00am EDT

Introduction to the tidyverse
A unique 6-week data science apprenticeship where you’ll develop the skills necessary to do data science with the R language. View full workshop details.

Speakers
avatar for Garrett  Grolemund

Garrett Grolemund

Educator, RStudio
avatar for Desirée De Leon

Desirée De Leon

Data Science Educator, RStudio Academy, RStudio


Tuesday July 26, 2022 9:00am - 10:30am EDT
Maryland 4+5

9:00am EDT

Machine Learning with tidymodels
This workshop provides an introduction to machine learning with R. View full workshop details.

Speakers
avatar for David Garrett Robinson

David Garrett Robinson

Heap Analytics
David Robinson is Director of Data Science at Heap Analytics, where he's helping to build the next generation of product analytics technology. He's the co-author with Julia Silge of the tidytext package and the O’Reilly book Text Mining with R. He also created the broom, fuzzyjoin... Read More →


Tuesday July 26, 2022 9:00am - 10:30am EDT
4. National Harbor 10+11

9:00am EDT

Making Art from Code: How the magic works, and the practical skills it teaches
This workshop provides a hands-on introduction to generative art in R. You’ll learn artistic techniques that generative artists use regularly in their work including flow fields, iterative function systems, tilings, and more. View full workshop details.

Speakers
avatar for Danielle Navarro

Danielle Navarro

Danielle Navarro is a generative artist, data scientist, professional educator, mathematical psychologist, academic professor in recovery, open source R developer, and coauthor of “ggplot2: Elegant Graphics for Data Analysis” (3rd edition). She is currently a developer advocate... Read More →


Tuesday July 26, 2022 9:00am - 10:30am EDT
Maryland 1

9:00am EDT

Package Development Masterclass
In this two day masterclass, you’ll have the opportunity to dig deep in to advanced package development topics, like writing testable code and producing error messages that spark joy. View full workshop details.

Speakers
avatar for Hadley Wickham

Hadley Wickham

Chief Scientist, RStudio
Hadley is Chief Scientist at RStudio, winner of the 2019 COPSS award, and a member of the R Foundation. He builds tools (both computational and cognitive) to make data science easier, faster, and more fun. His work includes packages for data science (like the tidyverse, which includes... Read More →


Tuesday July 26, 2022 9:00am - 10:30am EDT
Chesapeake G+H+I

9:00am EDT

R for People Analytics
The course will cover some of the most commonly used methods of analysis and inference when working with data related to people, such as survey data and organizational network data. View full workshop details.

Speakers
avatar for Alex LoPilato

Alex LoPilato

Alex LoPilato is an organizational psychologist with extensive experience in the statistical models, methods, and theories developed to draw inferences from survey data and other psychometric data.
avatar for Keith McNulty

Keith McNulty

McKinsey & Company
Keith McNulty is an applied mathematician/statistician, psychometrician and data scientist based in the UK. He started his career as a Pure Mathematician with a focus on Matrix Algebra and Group Representation Theory. He then transitioned into the private sector where he developed... Read More →
avatar for Liz Romero

Liz Romero

Liz Romero is an applied statistician with extensive experience in the engineering and modeling of people-related data.


Tuesday July 26, 2022 9:00am - 10:30am EDT
Chesapeake E

9:00am EDT

What They Forgot to Teach You About R
This is a two-day hands on workshop designed for experienced R and RStudio users who want to (re)design their R lifestyle. View full workshop details.

Speakers
avatar for E. David Aja

E. David Aja

RStudio
E. David Aja is a Solutions Engineer at RStudio. He helps data scientists and IT communicate about their needs. Before joining RStudio, he worked as a data scientist in the public sector.
avatar for Jenny Bryan

Jenny Bryan

software engineer, RStudio
avatar for Shannon McClintock Pileggi

Shannon McClintock Pileggi

Data Scientist, The Prostate Cancers Clinical Trials Consortium
I spend my days wrangling data, creating reproducible reports, building shiny apps, developing R packages, and mentoring junior Data Scientists at The Prostate Cancer Clinical Trials Consortium, housed under Memorial Sloan Kettering Cancer Center. I also enjoy blogging and I am a... Read More →


Tuesday July 26, 2022 9:00am - 10:30am EDT
Chesapeake 4+5+6

11:00am EDT

Building Production-Quality Shiny Applications
This workshop is for the Shiny developer who has entered this stage of their application development journey. View full workshop details.

Speakers
avatar for Eric Nantz

Eric Nantz

Eric Nantz is a director within the statistical innovation center at Eli Lilly and Company, creating analytical pipelines and capabilities of advanced statistical methodologies for clinical design used in multiple phases of development. Outside of his day job, Eric is passionate about... Read More →


Tuesday July 26, 2022 11:00am - 12:30pm EDT
National Harbor 2

11:00am EDT

Building Tidy Tools
This is a two-day, hands-on workshop for those who have embraced the tidyverse and want to build their own packages. View full workshop details.

Speakers
avatar for Ian Lyttle

Ian Lyttle

Ian Lyttle works as a Data Scientist at Schneider Electric. His technical interests include visualization, interactivity, and functional programming. He is a community contributor to tidyverse and r-lib, and maintains CRAN packages including vegawidget and boxr. He has delivered tutorials... Read More →
avatar for Emma Rand

Emma Rand

Associate Prof, University of York
Howdy! I am a Senior Lecturer (Associate Prof) in the Department of Biology at the University of York where I teach data science and reproducibility, particularly to those who do not see themselves as programmers. I lead a UKRI funded project called Cloud-SPAN which trains researchers... Read More →


Tuesday July 26, 2022 11:00am - 12:30pm EDT
Chesapeake F

11:00am EDT

Causal Inference in R
In this workshop, we’ll teach the essential elements of answering causal questions in R through causal diagrams, and causal modeling techniques such as propensity scores and inverse probability weighting. View full workshop details.

Speakers
avatar for Malcolm Barrett

Malcolm Barrett

Malcolm Barrett is a data scientist and an epidemiologist. During his Ph.D., he studied vision loss, focusing on epidemiologic methods. He's since worked in the private sector, including Teladoc Health and Apple. Malcolm is also the author of several causal inference-focused R packages... Read More →
avatar for Lucy D'Agostino McGowan

Lucy D'Agostino McGowan

Lucy D’Agostino McGowan is an assistant professor in the Mathematics and Statistics Department at Wake Forest University. She received her PhD in Biostatistics from Vanderbilt University and completed her postdoctoral training at Johns Hopkins University Bloomberg School of Public... Read More →


Tuesday July 26, 2022 11:00am - 12:30pm EDT
National Harbor 3

11:00am EDT

Clinical Reporting in R
In this workshop you will be working with the Tidyverse and the first generation of Pharmaverse tools in order to ‘deliver’ typical clinical trial displays. Specifically, we will walk through an end-to-end focussing on a sample of demographic, safety and efficacy summaries. View full workshop details.

Speakers
avatar for Christina Fillmore

Christina Fillmore

Christina Fillmore formally trained as a statistician, over her 5 years at GSK has transitioned into a data scientist. Her focus is on open source package development in a regulated environment. She also has a keen interesting in helping other develop R skills and become confident... Read More →
avatar for Andy Nicholls

Andy Nicholls

Head of Data Science, GlaxoSmithKline, United Kingdom
Andy Nicholls is Head of Statistical Data Sciences at GSK. Andy is responsible for building Data Science capability within GSK Biostatistics and delivering Data Science solutions across R&D. Within the wider pharmaceutical industry, Andy is the lead for the R Validation Hub, a collaboration... Read More →


Tuesday July 26, 2022 11:00am - 12:30pm EDT
Chesapeake D

11:00am EDT

Designing the Data Science Classroom
The goal of this workshop is to equip educators with concrete information on content, workflows, and infrastructure for painlessly introducing modern computation with R and RStudio within a data science curriculum. View full workshop details.

Speakers
avatar for Mine Cetinkaya-Rundel

Mine Cetinkaya-Rundel

Professor + Developer Educator, RStudio + Duke University
Mine Çetinkaya-Rundel is Professor of the Practice at Duke University and Developer Educator at RStudio. Mine’s work focuses on innovation in statistics and data science pedagogy, with an emphasis on computing, reproducible research, student-centered learning, and open-source education... Read More →
avatar for Maria Tackett

Maria Tackett

Maria Tackett is an Assistant Professor of the Practice in the Department of Statistical Science at Duke University. Prior to joining the faculty at Duke, Maria earned a Ph.D. in Statistics from the University of Virginia and worked as a statistician at Capital One. Her current work... Read More →


Tuesday July 26, 2022 11:00am - 12:30pm EDT
Maryland 3

11:00am EDT

From R Markdown to Quarto
This workshop is designed for those who want to take their R Markdown skills and expertise and apply them in Quarto, the next generation of R Markdown. View full workshop details.

Speakers
avatar for Andrew Bray

Andrew Bray

Andrew Bray is an Associate Teaching Professor in the Department of Statistics at UC Berkeley where he develops and teaches courses in statistics and data science. His research interests include statistical computing, data privacy, and applications of statistical models to environmental... Read More →


Tuesday July 26, 2022 11:00am - 12:30pm EDT
National Harbor 12+13

11:00am EDT

Getting Started with Quarto
This workshop is designed for those who have no or little prior experience with R Markdown and who want to learn Quarto. View full workshop details.

Speakers
avatar for Tom Mock

Tom Mock

RStudio, PBC
Thomas is the Customer Enablement Lead at RStudio, helping RStudio’s customers be as successful as possible. He is deeply involved global data science community, sharing tips on #RStats Twitter (find him at @thomas_mock), as co-founder of #TidyTuesday, a weekly Data Science learning... Read More →


Tuesday July 26, 2022 11:00am - 12:30pm EDT
Maryland 2

11:00am EDT

Getting Started with Shiny
Shiny is an R package that makes it easy to build interactive web apps straight from R. This workshop will start at the beginning. View full workshop details.

Speakers
avatar for Colin Rundel

Colin Rundel

Associate Professor, Duke University
Colin Rundel is an Associate Professor of the Practice at Duke University where he has been teaching since 2012. His work focuses on teaching statistical computing to both undergraduate and graduate students in both R and Python. He has been teaching and using Shiny since 2015.


Tuesday July 26, 2022 11:00am - 12:30pm EDT
National Harbor 6+7

11:00am EDT

Graphic Design with ggplot2: How to Create Engaging and Complex Visualizations in R
The workshop covers the most important steps and helpful tips to create visually appealing, engaging and complex graphics with ggplot2. View full workshop details.

Speakers
avatar for Cédric Scherer

Cédric Scherer

Data Visualization Professional, Independent
Cédric Scherer is a graduated computational ecologist with a passion for design. After his PhD, he combined his expertise in analyzing and visualizing large data sets in R with his passion to become an independent data visualization designer and specialist. Cédric has created v... Read More →


Tuesday July 26, 2022 11:00am - 12:30pm EDT
National Harbor 4+5

11:00am EDT

How Data Science with R Works for Systems Administrators
In this workshop, you'll learn to use the capabilities of RStudio Team to enable your organization's R and Python users, including topics like package and environment management, performance and scaling, external data connections, and integrating RStudio Team with CI/CD pipelines. View full workshop details.

Speakers
avatar for Alex Gold

Alex Gold

Solutions Engineering, RStudio
Alex is a longtime data nerd who worked on economic policy research, electoral politics, and healthcare at various times. He enjoys cooking, practicing martial arts, and landscaping in his spare time. Alex leads RStudio's Solutions Engineering team -- come talk to him if you're interested... Read More →


Tuesday July 26, 2022 11:00am - 12:30pm EDT
Maryland 6

11:00am EDT

Introduction to the tidyverse
A unique 6-week data science apprenticeship where you’ll develop the skills necessary to do data science with the R language. View full workshop details.

Speakers
avatar for Garrett  Grolemund

Garrett Grolemund

Educator, RStudio
avatar for Desirée De Leon

Desirée De Leon

Data Science Educator, RStudio Academy, RStudio


Tuesday July 26, 2022 11:00am - 12:30pm EDT
Maryland 4+5

11:00am EDT

Machine Learning with tidymodels
This workshop provides an introduction to machine learning with R. View full workshop details.

Speakers
avatar for David Garrett Robinson

David Garrett Robinson

Heap Analytics
David Robinson is Director of Data Science at Heap Analytics, where he's helping to build the next generation of product analytics technology. He's the co-author with Julia Silge of the tidytext package and the O’Reilly book Text Mining with R. He also created the broom, fuzzyjoin... Read More →


Tuesday July 26, 2022 11:00am - 12:30pm EDT
4. National Harbor 10+11

11:00am EDT

Making Art from Code: How the magic works, and the practical skills it teaches
This workshop provides a hands-on introduction to generative art in R. You’ll learn artistic techniques that generative artists use regularly in their work including flow fields, iterative function systems, tilings, and more. View full workshop details.

Speakers
avatar for Danielle Navarro

Danielle Navarro

Danielle Navarro is a generative artist, data scientist, professional educator, mathematical psychologist, academic professor in recovery, open source R developer, and coauthor of “ggplot2: Elegant Graphics for Data Analysis” (3rd edition). She is currently a developer advocate... Read More →


Tuesday July 26, 2022 11:00am - 12:30pm EDT
Maryland 1

11:00am EDT

Package Development Masterclass
In this two day masterclass, you’ll have the opportunity to dig deep in to advanced package development topics, like writing testable code and producing error messages that spark joy. View full workshop details.

Speakers
avatar for Hadley Wickham

Hadley Wickham

Chief Scientist, RStudio
Hadley is Chief Scientist at RStudio, winner of the 2019 COPSS award, and a member of the R Foundation. He builds tools (both computational and cognitive) to make data science easier, faster, and more fun. His work includes packages for data science (like the tidyverse, which includes... Read More →


Tuesday July 26, 2022 11:00am - 12:30pm EDT
Chesapeake G+H+I

11:00am EDT

R for People Analytics
The course will cover some of the most commonly used methods of analysis and inference when working with data related to people, such as survey data and organizational network data. View full workshop details.

Speakers
avatar for Alex LoPilato

Alex LoPilato

Alex LoPilato is an organizational psychologist with extensive experience in the statistical models, methods, and theories developed to draw inferences from survey data and other psychometric data.
avatar for Keith McNulty

Keith McNulty

McKinsey & Company
Keith McNulty is an applied mathematician/statistician, psychometrician and data scientist based in the UK. He started his career as a Pure Mathematician with a focus on Matrix Algebra and Group Representation Theory. He then transitioned into the private sector where he developed... Read More →
avatar for Liz Romero

Liz Romero

Liz Romero is an applied statistician with extensive experience in the engineering and modeling of people-related data.


Tuesday July 26, 2022 11:00am - 12:30pm EDT
Chesapeake E

11:00am EDT

What They Forgot to Teach You About R
This is a two-day hands on workshop designed for experienced R and RStudio users who want to (re)design their R lifestyle. View full workshop details.

Speakers
avatar for E. David Aja

E. David Aja

RStudio
E. David Aja is a Solutions Engineer at RStudio. He helps data scientists and IT communicate about their needs. Before joining RStudio, he worked as a data scientist in the public sector.
avatar for Jenny Bryan

Jenny Bryan

software engineer, RStudio
avatar for Shannon McClintock Pileggi

Shannon McClintock Pileggi

Data Scientist, The Prostate Cancers Clinical Trials Consortium
I spend my days wrangling data, creating reproducible reports, building shiny apps, developing R packages, and mentoring junior Data Scientists at The Prostate Cancer Clinical Trials Consortium, housed under Memorial Sloan Kettering Cancer Center. I also enjoy blogging and I am a... Read More →


Tuesday July 26, 2022 11:00am - 12:30pm EDT
Chesapeake 4+5+6

12:30pm EDT

Lunch
See you in the Maryland ballroom for lunch.

Buffet & boxed lunch options. Outdoor seating available.

Tuesday July 26, 2022 12:30pm - 1:30pm EDT
5. Maryland

1:30pm EDT

Building Production-Quality Shiny Applications
This workshop is for the Shiny developer who has entered this stage of their application development journey. View full workshop details.

Speakers
avatar for Eric Nantz

Eric Nantz

Eric Nantz is a director within the statistical innovation center at Eli Lilly and Company, creating analytical pipelines and capabilities of advanced statistical methodologies for clinical design used in multiple phases of development. Outside of his day job, Eric is passionate about... Read More →


Tuesday July 26, 2022 1:30pm - 3:00pm EDT
National Harbor 2

1:30pm EDT

Building Tidy Tools
This is a two-day, hands-on workshop for those who have embraced the tidyverse and want to build their own packages. View full workshop details.

Speakers
avatar for Ian Lyttle

Ian Lyttle

Ian Lyttle works as a Data Scientist at Schneider Electric. His technical interests include visualization, interactivity, and functional programming. He is a community contributor to tidyverse and r-lib, and maintains CRAN packages including vegawidget and boxr. He has delivered tutorials... Read More →
avatar for Emma Rand

Emma Rand

Associate Prof, University of York
Howdy! I am a Senior Lecturer (Associate Prof) in the Department of Biology at the University of York where I teach data science and reproducibility, particularly to those who do not see themselves as programmers. I lead a UKRI funded project called Cloud-SPAN which trains researchers... Read More →


Tuesday July 26, 2022 1:30pm - 3:00pm EDT
Chesapeake F

1:30pm EDT

Causal Inference in R
In this workshop, we’ll teach the essential elements of answering causal questions in R through causal diagrams, and causal modeling techniques such as propensity scores and inverse probability weighting. View full workshop details.

Speakers
avatar for Malcolm Barrett

Malcolm Barrett

Malcolm Barrett is a data scientist and an epidemiologist. During his Ph.D., he studied vision loss, focusing on epidemiologic methods. He's since worked in the private sector, including Teladoc Health and Apple. Malcolm is also the author of several causal inference-focused R packages... Read More →
avatar for Lucy D'Agostino McGowan

Lucy D'Agostino McGowan

Lucy D’Agostino McGowan is an assistant professor in the Mathematics and Statistics Department at Wake Forest University. She received her PhD in Biostatistics from Vanderbilt University and completed her postdoctoral training at Johns Hopkins University Bloomberg School of Public... Read More →


Tuesday July 26, 2022 1:30pm - 3:00pm EDT
National Harbor 3

1:30pm EDT

Clinical Reporting in R
In this workshop you will be working with the Tidyverse and the first generation of Pharmaverse tools in order to ‘deliver’ typical clinical trial displays. Specifically, we will walk through an end-to-end focussing on a sample of demographic, safety and efficacy summaries. View full workshop details.

Speakers
avatar for Christina Fillmore

Christina Fillmore

Christina Fillmore formally trained as a statistician, over her 5 years at GSK has transitioned into a data scientist. Her focus is on open source package development in a regulated environment. She also has a keen interesting in helping other develop R skills and become confident... Read More →
avatar for Andy Nicholls

Andy Nicholls

Head of Data Science, GlaxoSmithKline, United Kingdom
Andy Nicholls is Head of Statistical Data Sciences at GSK. Andy is responsible for building Data Science capability within GSK Biostatistics and delivering Data Science solutions across R&D. Within the wider pharmaceutical industry, Andy is the lead for the R Validation Hub, a collaboration... Read More →


Tuesday July 26, 2022 1:30pm - 3:00pm EDT
Chesapeake D

1:30pm EDT

Designing the Data Science Classroom
The goal of this workshop is to equip educators with concrete information on content, workflows, and infrastructure for painlessly introducing modern computation with R and RStudio within a data science curriculum. View full workshop details.

Speakers
avatar for Mine Cetinkaya-Rundel

Mine Cetinkaya-Rundel

Professor + Developer Educator, RStudio + Duke University
Mine Çetinkaya-Rundel is Professor of the Practice at Duke University and Developer Educator at RStudio. Mine’s work focuses on innovation in statistics and data science pedagogy, with an emphasis on computing, reproducible research, student-centered learning, and open-source education... Read More →
avatar for Maria Tackett

Maria Tackett

Maria Tackett is an Assistant Professor of the Practice in the Department of Statistical Science at Duke University. Prior to joining the faculty at Duke, Maria earned a Ph.D. in Statistics from the University of Virginia and worked as a statistician at Capital One. Her current work... Read More →


Tuesday July 26, 2022 1:30pm - 3:00pm EDT
Maryland 3

1:30pm EDT

From R Markdown to Quarto
This workshop is designed for those who want to take their R Markdown skills and expertise and apply them in Quarto, the next generation of R Markdown. View full workshop details.

Speakers
avatar for Andrew Bray

Andrew Bray

Andrew Bray is an Associate Teaching Professor in the Department of Statistics at UC Berkeley where he develops and teaches courses in statistics and data science. His research interests include statistical computing, data privacy, and applications of statistical models to environmental... Read More →


Tuesday July 26, 2022 1:30pm - 3:00pm EDT
National Harbor 12+13

1:30pm EDT

Getting Started with Quarto
This workshop is designed for those who have no or little prior experience with R Markdown and who want to learn Quarto. View full workshop details.

Speakers
avatar for Tom Mock

Tom Mock

RStudio, PBC
Thomas is the Customer Enablement Lead at RStudio, helping RStudio’s customers be as successful as possible. He is deeply involved global data science community, sharing tips on #RStats Twitter (find him at @thomas_mock), as co-founder of #TidyTuesday, a weekly Data Science learning... Read More →


Tuesday July 26, 2022 1:30pm - 3:00pm EDT
Maryland 2

1:30pm EDT

Getting Started with Shiny
Shiny is an R package that makes it easy to build interactive web apps straight from R. This workshop will start at the beginning. View full workshop details.

Speakers
avatar for Colin Rundel

Colin Rundel

Associate Professor, Duke University
Colin Rundel is an Associate Professor of the Practice at Duke University where he has been teaching since 2012. His work focuses on teaching statistical computing to both undergraduate and graduate students in both R and Python. He has been teaching and using Shiny since 2015.


Tuesday July 26, 2022 1:30pm - 3:00pm EDT
National Harbor 6+7

1:30pm EDT

Graphic Design with ggplot2: How to Create Engaging and Complex Visualizations in R
The workshop covers the most important steps and helpful tips to create visually appealing, engaging and complex graphics with ggplot2. View full workshop details.

Speakers
avatar for Cédric Scherer

Cédric Scherer

Data Visualization Professional, Independent
Cédric Scherer is a graduated computational ecologist with a passion for design. After his PhD, he combined his expertise in analyzing and visualizing large data sets in R with his passion to become an independent data visualization designer and specialist. Cédric has created v... Read More →


Tuesday July 26, 2022 1:30pm - 3:00pm EDT
National Harbor 4+5

1:30pm EDT

How Data Science with R Works for Systems Administrators
In this workshop, you'll learn to use the capabilities of RStudio Team to enable your organization's R and Python users, including topics like package and environment management, performance and scaling, external data connections, and integrating RStudio Team with CI/CD pipelines. View full workshop details.

Speakers
avatar for Alex Gold

Alex Gold

Solutions Engineering, RStudio
Alex is a longtime data nerd who worked on economic policy research, electoral politics, and healthcare at various times. He enjoys cooking, practicing martial arts, and landscaping in his spare time. Alex leads RStudio's Solutions Engineering team -- come talk to him if you're interested... Read More →


Tuesday July 26, 2022 1:30pm - 3:00pm EDT
Maryland 6

1:30pm EDT

Introduction to the tidyverse
A unique 6-week data science apprenticeship where you’ll develop the skills necessary to do data science with the R language. View full workshop details.

Speakers
avatar for Garrett  Grolemund

Garrett Grolemund

Educator, RStudio
avatar for Desirée De Leon

Desirée De Leon

Data Science Educator, RStudio Academy, RStudio


Tuesday July 26, 2022 1:30pm - 3:00pm EDT
Maryland 4+5

1:30pm EDT

Machine Learning with tidymodels
This workshop provides an introduction to machine learning with R. View full workshop details.

Speakers
avatar for David Garrett Robinson

David Garrett Robinson

Heap Analytics
David Robinson is Director of Data Science at Heap Analytics, where he's helping to build the next generation of product analytics technology. He's the co-author with Julia Silge of the tidytext package and the O’Reilly book Text Mining with R. He also created the broom, fuzzyjoin... Read More →


Tuesday July 26, 2022 1:30pm - 3:00pm EDT
4. National Harbor 10+11

1:30pm EDT

Making Art from Code: How the magic works, and the practical skills it teaches
This workshop provides a hands-on introduction to generative art in R. You’ll learn artistic techniques that generative artists use regularly in their work including flow fields, iterative function systems, tilings, and more. View full workshop details.

Speakers
avatar for Danielle Navarro

Danielle Navarro

Danielle Navarro is a generative artist, data scientist, professional educator, mathematical psychologist, academic professor in recovery, open source R developer, and coauthor of “ggplot2: Elegant Graphics for Data Analysis” (3rd edition). She is currently a developer advocate... Read More →


Tuesday July 26, 2022 1:30pm - 3:00pm EDT
Maryland 1

1:30pm EDT

Package Development Masterclass
In this two day masterclass, you’ll have the opportunity to dig deep in to advanced package development topics, like writing testable code and producing error messages that spark joy. View full workshop details.

Speakers
avatar for Hadley Wickham

Hadley Wickham

Chief Scientist, RStudio
Hadley is Chief Scientist at RStudio, winner of the 2019 COPSS award, and a member of the R Foundation. He builds tools (both computational and cognitive) to make data science easier, faster, and more fun. His work includes packages for data science (like the tidyverse, which includes... Read More →


Tuesday July 26, 2022 1:30pm - 3:00pm EDT
Chesapeake G+H+I

1:30pm EDT

R for People Analytics
The course will cover some of the most commonly used methods of analysis and inference when working with data related to people, such as survey data and organizational network data. View full workshop details.

Speakers
avatar for Alex LoPilato

Alex LoPilato

Alex LoPilato is an organizational psychologist with extensive experience in the statistical models, methods, and theories developed to draw inferences from survey data and other psychometric data.
avatar for Keith McNulty

Keith McNulty

McKinsey & Company
Keith McNulty is an applied mathematician/statistician, psychometrician and data scientist based in the UK. He started his career as a Pure Mathematician with a focus on Matrix Algebra and Group Representation Theory. He then transitioned into the private sector where he developed... Read More →
avatar for Liz Romero

Liz Romero

Liz Romero is an applied statistician with extensive experience in the engineering and modeling of people-related data.


Tuesday July 26, 2022 1:30pm - 3:00pm EDT
Chesapeake E

1:30pm EDT

What They Forgot to Teach You About R
This is a two-day hands on workshop designed for experienced R and RStudio users who want to (re)design their R lifestyle. View full workshop details.

Speakers
avatar for E. David Aja

E. David Aja

RStudio
E. David Aja is a Solutions Engineer at RStudio. He helps data scientists and IT communicate about their needs. Before joining RStudio, he worked as a data scientist in the public sector.
avatar for Jenny Bryan

Jenny Bryan

software engineer, RStudio
avatar for Shannon McClintock Pileggi

Shannon McClintock Pileggi

Data Scientist, The Prostate Cancers Clinical Trials Consortium
I spend my days wrangling data, creating reproducible reports, building shiny apps, developing R packages, and mentoring junior Data Scientists at The Prostate Cancer Clinical Trials Consortium, housed under Memorial Sloan Kettering Cancer Center. I also enjoy blogging and I am a... Read More →


Tuesday July 26, 2022 1:30pm - 3:00pm EDT
Chesapeake 4+5+6

3:30pm EDT

Building Production-Quality Shiny Applications
This workshop is for the Shiny developer who has entered this stage of their application development journey. View full workshop details.

Speakers
avatar for Eric Nantz

Eric Nantz

Eric Nantz is a director within the statistical innovation center at Eli Lilly and Company, creating analytical pipelines and capabilities of advanced statistical methodologies for clinical design used in multiple phases of development. Outside of his day job, Eric is passionate about... Read More →


Tuesday July 26, 2022 3:30pm - 5:00pm EDT
National Harbor 2

3:30pm EDT

Building Tidy Tools
This is a two-day, hands-on workshop for those who have embraced the tidyverse and want to build their own packages. View full workshop details.

Speakers
avatar for Ian Lyttle

Ian Lyttle

Ian Lyttle works as a Data Scientist at Schneider Electric. His technical interests include visualization, interactivity, and functional programming. He is a community contributor to tidyverse and r-lib, and maintains CRAN packages including vegawidget and boxr. He has delivered tutorials... Read More →
avatar for Emma Rand

Emma Rand

Associate Prof, University of York
Howdy! I am a Senior Lecturer (Associate Prof) in the Department of Biology at the University of York where I teach data science and reproducibility, particularly to those who do not see themselves as programmers. I lead a UKRI funded project called Cloud-SPAN which trains researchers... Read More →


Tuesday July 26, 2022 3:30pm - 5:00pm EDT
Chesapeake F

3:30pm EDT

Causal Inference in R
In this workshop, we’ll teach the essential elements of answering causal questions in R through causal diagrams, and causal modeling techniques such as propensity scores and inverse probability weighting. View full workshop details.

Speakers
avatar for Malcolm Barrett

Malcolm Barrett

Malcolm Barrett is a data scientist and an epidemiologist. During his Ph.D., he studied vision loss, focusing on epidemiologic methods. He's since worked in the private sector, including Teladoc Health and Apple. Malcolm is also the author of several causal inference-focused R packages... Read More →
avatar for Lucy D'Agostino McGowan

Lucy D'Agostino McGowan

Lucy D’Agostino McGowan is an assistant professor in the Mathematics and Statistics Department at Wake Forest University. She received her PhD in Biostatistics from Vanderbilt University and completed her postdoctoral training at Johns Hopkins University Bloomberg School of Public... Read More →


Tuesday July 26, 2022 3:30pm - 5:00pm EDT
National Harbor 3

3:30pm EDT

Clinical Reporting in R
In this workshop you will be working with the Tidyverse and the first generation of Pharmaverse tools in order to ‘deliver’ typical clinical trial displays. Specifically, we will walk through an end-to-end focussing on a sample of demographic, safety and efficacy summaries. View full workshop details.

Speakers
avatar for Christina Fillmore

Christina Fillmore

Christina Fillmore formally trained as a statistician, over her 5 years at GSK has transitioned into a data scientist. Her focus is on open source package development in a regulated environment. She also has a keen interesting in helping other develop R skills and become confident... Read More →
avatar for Andy Nicholls

Andy Nicholls

Head of Data Science, GlaxoSmithKline, United Kingdom
Andy Nicholls is Head of Statistical Data Sciences at GSK. Andy is responsible for building Data Science capability within GSK Biostatistics and delivering Data Science solutions across R&D. Within the wider pharmaceutical industry, Andy is the lead for the R Validation Hub, a collaboration... Read More →


Tuesday July 26, 2022 3:30pm - 5:00pm EDT
Chesapeake D

3:30pm EDT

Designing the Data Science Classroom
The goal of this workshop is to equip educators with concrete information on content, workflows, and infrastructure for painlessly introducing modern computation with R and RStudio within a data science curriculum. View full workshop details.

Speakers
avatar for Mine Cetinkaya-Rundel

Mine Cetinkaya-Rundel

Professor + Developer Educator, RStudio + Duke University
Mine Çetinkaya-Rundel is Professor of the Practice at Duke University and Developer Educator at RStudio. Mine’s work focuses on innovation in statistics and data science pedagogy, with an emphasis on computing, reproducible research, student-centered learning, and open-source education... Read More →
avatar for Maria Tackett

Maria Tackett

Maria Tackett is an Assistant Professor of the Practice in the Department of Statistical Science at Duke University. Prior to joining the faculty at Duke, Maria earned a Ph.D. in Statistics from the University of Virginia and worked as a statistician at Capital One. Her current work... Read More →


Tuesday July 26, 2022 3:30pm - 5:00pm EDT
Maryland 3

3:30pm EDT

From R Markdown to Quarto
This workshop is designed for those who want to take their R Markdown skills and expertise and apply them in Quarto, the next generation of R Markdown. View full workshop details.

Speakers
avatar for Andrew Bray

Andrew Bray

Andrew Bray is an Associate Teaching Professor in the Department of Statistics at UC Berkeley where he develops and teaches courses in statistics and data science. His research interests include statistical computing, data privacy, and applications of statistical models to environmental... Read More →


Tuesday July 26, 2022 3:30pm - 5:00pm EDT
National Harbor 12+13

3:30pm EDT

Getting Started with Quarto
This workshop is designed for those who have no or little prior experience with R Markdown and who want to learn Quarto. View full workshop details.

Speakers
avatar for Tom Mock

Tom Mock

RStudio, PBC
Thomas is the Customer Enablement Lead at RStudio, helping RStudio’s customers be as successful as possible. He is deeply involved global data science community, sharing tips on #RStats Twitter (find him at @thomas_mock), as co-founder of #TidyTuesday, a weekly Data Science learning... Read More →


Tuesday July 26, 2022 3:30pm - 5:00pm EDT
Maryland 2

3:30pm EDT

Getting Started with Shiny
Shiny is an R package that makes it easy to build interactive web apps straight from R. This workshop will start at the beginning. View full workshop details.

Speakers
avatar for Colin Rundel

Colin Rundel

Associate Professor, Duke University
Colin Rundel is an Associate Professor of the Practice at Duke University where he has been teaching since 2012. His work focuses on teaching statistical computing to both undergraduate and graduate students in both R and Python. He has been teaching and using Shiny since 2015.


Tuesday July 26, 2022 3:30pm - 5:00pm EDT
National Harbor 6+7

3:30pm EDT

Graphic Design with ggplot2: How to Create Engaging and Complex Visualizations in R
The workshop covers the most important steps and helpful tips to create visually appealing, engaging and complex graphics with ggplot2. View full workshop details.

Speakers
avatar for Cédric Scherer

Cédric Scherer

Data Visualization Professional, Independent
Cédric Scherer is a graduated computational ecologist with a passion for design. After his PhD, he combined his expertise in analyzing and visualizing large data sets in R with his passion to become an independent data visualization designer and specialist. Cédric has created v... Read More →


Tuesday July 26, 2022 3:30pm - 5:00pm EDT
National Harbor 4+5

3:30pm EDT

How Data Science with R Works for Systems Administrators
In this workshop, you'll learn to use the capabilities of RStudio Team to enable your organization's R and Python users, including topics like package and environment management, performance and scaling, external data connections, and integrating RStudio Team with CI/CD pipelines. View full workshop details.

Speakers
avatar for Alex Gold

Alex Gold

Solutions Engineering, RStudio
Alex is a longtime data nerd who worked on economic policy research, electoral politics, and healthcare at various times. He enjoys cooking, practicing martial arts, and landscaping in his spare time. Alex leads RStudio's Solutions Engineering team -- come talk to him if you're interested... Read More →


Tuesday July 26, 2022 3:30pm - 5:00pm EDT
Maryland 6

3:30pm EDT

Introduction to the tidyverse
A unique 6-week data science apprenticeship where you’ll develop the skills necessary to do data science with the R language. View full workshop details.

Speakers
avatar for Garrett  Grolemund

Garrett Grolemund

Educator, RStudio
avatar for Desirée De Leon

Desirée De Leon

Data Science Educator, RStudio Academy, RStudio


Tuesday July 26, 2022 3:30pm - 5:00pm EDT
Maryland 4+5

3:30pm EDT

Machine Learning with tidymodels
This workshop provides an introduction to machine learning with R. View full workshop details.

Speakers
avatar for David Garrett Robinson

David Garrett Robinson

Heap Analytics
David Robinson is Director of Data Science at Heap Analytics, where he's helping to build the next generation of product analytics technology. He's the co-author with Julia Silge of the tidytext package and the O’Reilly book Text Mining with R. He also created the broom, fuzzyjoin... Read More →


Tuesday July 26, 2022 3:30pm - 5:00pm EDT
4. National Harbor 10+11

3:30pm EDT

Making Art from Code: How the magic works, and the practical skills it teaches
This workshop provides a hands-on introduction to generative art in R. You’ll learn artistic techniques that generative artists use regularly in their work including flow fields, iterative function systems, tilings, and more. View full workshop details.

Speakers
avatar for Danielle Navarro

Danielle Navarro

Danielle Navarro is a generative artist, data scientist, professional educator, mathematical psychologist, academic professor in recovery, open source R developer, and coauthor of “ggplot2: Elegant Graphics for Data Analysis” (3rd edition). She is currently a developer advocate... Read More →


Tuesday July 26, 2022 3:30pm - 5:00pm EDT
Maryland 1

3:30pm EDT

Package Development Masterclass
In this two day masterclass, you’ll have the opportunity to dig deep in to advanced package development topics, like writing testable code and producing error messages that spark joy. View full workshop details.

Speakers
avatar for Hadley Wickham

Hadley Wickham

Chief Scientist, RStudio
Hadley is Chief Scientist at RStudio, winner of the 2019 COPSS award, and a member of the R Foundation. He builds tools (both computational and cognitive) to make data science easier, faster, and more fun. His work includes packages for data science (like the tidyverse, which includes... Read More →


Tuesday July 26, 2022 3:30pm - 5:00pm EDT
Chesapeake G+H+I

3:30pm EDT

R for People Analytics
The course will cover some of the most commonly used methods of analysis and inference when working with data related to people, such as survey data and organizational network data. View full workshop details.

Speakers
avatar for Alex LoPilato

Alex LoPilato

Alex LoPilato is an organizational psychologist with extensive experience in the statistical models, methods, and theories developed to draw inferences from survey data and other psychometric data.
avatar for Keith McNulty

Keith McNulty

McKinsey & Company
Keith McNulty is an applied mathematician/statistician, psychometrician and data scientist based in the UK. He started his career as a Pure Mathematician with a focus on Matrix Algebra and Group Representation Theory. He then transitioned into the private sector where he developed... Read More →
avatar for Liz Romero

Liz Romero

Liz Romero is an applied statistician with extensive experience in the engineering and modeling of people-related data.


Tuesday July 26, 2022 3:30pm - 5:00pm EDT
Chesapeake E

3:30pm EDT

What They Forgot to Teach You About R
This is a two-day hands on workshop designed for experienced R and RStudio users who want to (re)design their R lifestyle. View full workshop details.

Speakers
avatar for E. David Aja

E. David Aja

RStudio
E. David Aja is a Solutions Engineer at RStudio. He helps data scientists and IT communicate about their needs. Before joining RStudio, he worked as a data scientist in the public sector.
avatar for Jenny Bryan

Jenny Bryan

software engineer, RStudio
avatar for Shannon McClintock Pileggi

Shannon McClintock Pileggi

Data Scientist, The Prostate Cancers Clinical Trials Consortium
I spend my days wrangling data, creating reproducible reports, building shiny apps, developing R packages, and mentoring junior Data Scientists at The Prostate Cancer Clinical Trials Consortium, housed under Memorial Sloan Kettering Cancer Center. I also enjoy blogging and I am a... Read More →


Tuesday July 26, 2022 3:30pm - 5:00pm EDT
Chesapeake 4+5+6

5:30pm EDT

Welcome Reception
Join other conference attendees for a meet and greet.

Refreshments and light snacks will be served.

Tuesday July 26, 2022 5:30pm - 7:30pm EDT
6. Riverview Ballroom
 
Wednesday, July 27
 

7:30am EDT

Breakfast
See you in the Maryland ballroom for breakfast.

Buffet & boxed lunch options. Outdoor seating available.

The morning keynote starts at 9:00 AM. It may take 15 minutes to walk to and find a seat there.

Wednesday July 27, 2022 7:30am - 9:00am EDT
5. Maryland

7:30am EDT

Academic / Research - Birds of a Feather
Join other data scientist and RStudio users working in academia for breakfast before heading over the keynote.

🕘 The morning keynote starts at 9:00 AM. It may take 15 minutes to walk to and find a seat there.

🤞We've striven to provide sufficient space for everyone interested, but please be aware seatting is limited.

Wednesday July 27, 2022 7:30am - 9:00am EDT
Maryland 1, 2, & 3

7:30am EDT

Birds of a Feather - Variegated Groups
Kick off rstudio::conf(2022) with breakfast and your first Birds of a Feather group.

See table signage for group locations in Maryland, and join the group you're most keen to mingle in.

  • R-Ladies Buddy Up. Stop by the R-Ladies section of the main dinning room before joining a Birds of a Feather and/or heading over to the Keynote.
  • Kaggle Data Science Community. Whether you're a competitor and active in the community, or keen to participate and to learn, join us at 7:30 AM.
  • Cyber Security & Data Privacy. Join other data scientists and analysts focused on security & data privacy concerns. Meet at 8:00 AM.
  • French Speakers. Meet at 8:00 AM.
  • Bayesian methods.

The morning keynote starts at 9:00 AM. It may take 15 minutes to walk to and find a seat there.

Wednesday July 27, 2022 7:30am - 9:00am EDT
5. Maryland

9:00am EDT

Good practices for applied machine learning - from model development to model deployment
The tidymodels framework is a collection of R packages for modeling and machine learning using tidyverse principles. Whether you are just starting out today or have years of experience with ML, tidymodels offers a consistent, flexible framework for your work. In this talk, learn how tidymodels has been designed to promote ergonomic, effective, and safe modeling practice. We will discuss how to think about the steps of building a model from beginning to end, how to fluently use different modeling and feature engineering approaches, how to avoid common pitfalls of modeling like overfitting and data leakage, and how to version and deploy reliable models trained in R.

Speakers
avatar for Max Kuhn

Max Kuhn

RStudio, PBC
Max Kuhn is a software engineer at RStudio. He is currently working on improving R’s modeling capabilities. He was a Director of Nonclinical Statistics at Pfizer Global R&D in Connecticut. He was applying models in the pharmaceutical and diagnostic industries for over 18 years... Read More →
avatar for Julia Silge

Julia Silge

RStudio, PBC
Julia Silge is a data scientist and software engineer at RStudio PBC where she works on open source tools for machine learning and MLOps. She holds a PhD in astrophysics and has worked as a data scientist in tech and the nonprofit sector, as well as a technical advisory committee... Read More →


Wednesday July 27, 2022 9:00am - 10:30am EDT
0. Potomac A+B

11:00am EDT

R Package Assessment: Lessons from Pharma
As R users, we make choices daily about what packages to use in our work. After discovering a package that may suit our needs, we consider its qualities and make a judgement about whether to invest in its use. Here at GSK, we are currently forging a path to using R in regulatory settings, which involves careful curation and assessment of R packages for use on clinical trials data. While this exercise is formalized for those of us in pharma, attendees will learn about how related principles can better inform decisions they make their own workflows, regardless of industry or experience. They will take away specific qualities that indicate trustworthiness, as well as the importance of considering the ecosystem as a whole.

Speakers
avatar for Becca Krouse

Becca Krouse

GSK
Becca Krouse is a Data Science Leader in the Statistics and Data Science Innovation Hub at GSK. She is a biostatistician by training with nearly 12 years of experience working on clinical research.


Wednesday July 27, 2022 11:00am - 11:20am EDT
3. Cherry Blossom

11:00am EDT

Translating from {tidymodels} and scikit-learn: Lessons from a 'bilingual' course.
The friendly competition between R and python has gifted us with two stellar packages for workflow-style predictive modeling: tidymodels in R, and scikit- learn in python. When I had to choose between them for a Machine Learning Course, I said: ¿Porque no los dos? (Why not both?)

In this talk, I will share how the differences in structure and syntax between tidymodels and scikit-learn impacted student understanding. Can a helper function hide an important decision about tuning parameters? Can a slight change in argument input influence the way we describe a model? The answer is a resounding, "¡Sí!"

Don't despair, though, because I will also provide advice for avoiding pitfalls when switching between languages or implementations. Together, let's think about the power that programming choices has to shape the mental model of the user, and the ways that we can responsibly document our modeling decisions to increase cross-language reproducibility.

Talk materials are available at https://www.kelly-bodwin.com/talks/rsconf22/.

Speakers
avatar for Kelly Bodwin

Kelly Bodwin

California Polytechnic State University
Kelly Bodwin is an Assistant Professor of Statistics and Data Science and an RStudio Certified Instructor. She is passionate about developing R tools (especially choosing names for packages) and about creating innovative ways to teach computing to data scientists. Although her love... Read More →


Wednesday July 27, 2022 11:00am - 11:20am EDT
4. National Harbor 10+11

11:00am EDT

The R integration in Power BI: Why R developers should be Interested
Power BI is a multi-faceted business intelligence tool with built-in data ingestion, transformation, and visualization features. The learning curve for Power BI can be steep for many, but fortunately, R users can shorten the learning curve by leveraging the Tidyverse. In this talk, attendees will get introduced to how the Tidyverse can shorten the Power BI learning curve and enable them to perform tasks in Power BI that are not possible using its native functionality.

Speakers
avatar for Ryan E Wade

Ryan E Wade

Ryan is a data and analytics solutions architect. He can present complex data intuitively using proper analytical and visualization methods. He has an advanced understanding of R, Python, DAX, SQL, VBA, and M. He is the author of the book, Advanced Analytics in Power BI with R and... Read More →


Wednesday July 27, 2022 11:00am - 11:20am EDT
1. Potomac C

11:00am EDT

censored - Survival Analysis in tidymodels
tidymodels is extending support for survival analysis and censored is a new parsnip extension package for survival models. It offers various types of models: parametric models, semi-parametric models like the Cox model, and tree- based models like decision trees, boosted trees, and random forests. They all come with the consistent parsnip interface so that you can focus on the modelling instead of details of the syntax. Happy modelling!

Talk materials are available at https://hfrick.github.io/rstudio-conf-2022.

Speakers
avatar for Hannah Frick

Hannah Frick

RStudio
Hannah Frick is a software engineer on the tidymodels team at RStudio. She holds a PhD in statistics and has worked in data science consultancy as well as interdisciplinary research at University College London in cooperation with Team GB Hockey.


Wednesday July 27, 2022 11:00am - 11:20am EDT
2. Potomac D

11:20am EDT

Dive Deep into Metadata with Tplyr
Back in 2020, Atorus first released our package Tplyr. The aim of Tplyr was to build a reusable framework that makes all the data preparation for clinical reporting simple – and takes the hassle out of tedious things like decimal and parentheses alignment. But from the beginning, Tplyr was always meant to be more. Tplyr captures metadata about the table being built, and in Tplyr 1.0.0 we can leverage that metadata to its full advantage. Tplyr now let’s you dive in, and for any result provides the traceability of how that result was derived, and the data used to do-so. In our talk, we will explore this new functionality built into Tplyr, the mechanics that make it work, and how these new features are useful in the clinical world and beyond.

Speakers
avatar for Mike Stackhouse

Mike Stackhouse

Atorus Research
Mike Stackhouse is the Chief Innovation Officer of Atorus Research and leads Atorus’ Analytics Engineering department. Mike has a long history of experience in statistical programming with a heavy focus on process improvement, automation, and streamlining. This ultimately drove... Read More →


Wednesday July 27, 2022 11:20am - 11:40am EDT
3. Cherry Blossom

11:20am EDT

Mobile, Low-Bandwidth and Low-Tech. The Story of Chi-Square Mobile.
This talk discusses how we used R to solve some of the challenges we faced when all classes were emergency onlined as a result of COVID. Instructors improvised typically defaulting to zoom, and survey data showed that more than half of our student population did not have access to a laptop or computer, but all did have a smartphone. We set out to mitigate this by creating a mobile app which facilitates a feedback-loop between students and instructors and provides analytics of this interaction.Starting as a shiny-app with {shinymobile}, we applied linguistic and social network analysis to provide the instructor with analyses. As the pandemic dragged on the system evolved and became more integrated with google cloud and the Andoid ecosystem.

Speakers
avatar for Aleksander Dietrichson

Aleksander Dietrichson

Universidad de San Martin & Chi Square Laboratories
Aleksander (Sasha) Dietrichson, Phd is a professor of Data Science and AI, and the founder of Chi Square Laboratories, a company dedicated to the research and development of learning analytics solutions. Sasha has been working in learning analytics and educational AI for over a decade... Read More →


Wednesday July 27, 2022 11:20am - 11:40am EDT
4. National Harbor 10+11

11:20am EDT

R, Python, and Tableau: A Love Triangle
Users often want access to more advanced analytic capabilities in their Tableau dashboards. Together we will walk through an example that highlights how new open-source packages enable R, Python, Tableau, and RStudio Connect to work together to enhance business intelligence and deliver real-time insight.

Speakers
avatar for James Blair

James Blair

RStudio
James is a Product Manager for Cloud Integrations at RStudio, where he focuses on helping RStudio commercial products seemlessly integrate into cloud environments. He has a background in statistics and data science and finds any excuse he can to write R code and ride his bike, although... Read More →


Wednesday July 27, 2022 11:20am - 11:40am EDT
1. Potomac C

11:20am EDT

tidyclust - expanding tidymodels to clustering
This talk marks the grand introduction of tidyclust, a new package that provides a tidy unified interface to clustering model within the tidymodels framework.

While tidymodels has been a leap forward in making machine learning methods accessible to a general audience in R, it is currently limited to the realm of supervised learning. tidyclust, by Emil Hvitfeldt and Kelly Bodwin, builds upon the interfaces familiar to tidymodels users to make unsupervised clustering models equally approachable.

Speakers
avatar for Emil Hvitfeldt

Emil Hvitfeldt

RStudio
Emil Hvitfeldt is a software engineer at RStudio. Part of the tidymodels team's effort to improve R's modeling capabilities. He maintains several packages within the realms of modeling, text analysis and, color palettes. Taught statistical machine learning as an adjunct professor... Read More →


Wednesday July 27, 2022 11:20am - 11:40am EDT
2. Potomac D

11:40am EDT

Packages and Process
Data Scientists have a unique position to drive change and efficiency within organizations workflows. By simplifying workflows to its core expectations, we can design with purpose. Process and software are harmonized to reduce risk and streamline the overall system. At GSK, we are developing capabilities and challenging traditional processes. We must do this while still meeting regulatory compliance requirements that can be complicated by historical inertia. I will talk about how my team is using this approach to develop R packages in tandem with process changes that are leading to wins. Attendees of my talk with leave with an understanding of how to do the same.

Speakers
avatar for Ellis Hughes

Ellis Hughes

GSK
Ellis Hughes is a Data Science Leader and has worked in the pharmaceutical industry for 8 years. He focuses on how organizations can use R and take open source approaches to improve processes and solve problems. During his free time he organizes the Seattle UseR group, the Cascadia... Read More →


Wednesday July 27, 2022 11:40am - 12:00pm EDT
3. Cherry Blossom

11:40am EDT

Designing a Socially-Critical Data Science Course
I'll discuss designing a socially-conscious and socially-critical data science course. This talk will be interesting to anyone who designs or delivers educational opportunities for young data scientists. The topic is important because data science is not value neutral. Learners should understand how data and data analyses can encode biases and assumptions. Particularly when we collect data from or about people, we should ask who is privileged or disadvantaged by how we gather and analyze that data? Attendees will learn how I’ve come to design and teach data science courses that emphasize morality and ethics. They’ll also learn how I design for inclusion and work to create reflective learning environments.

Speakers
avatar for Brian Danielak

Brian Danielak

University of Maryland – College Park
Brian's career has spanned teaching, researching, and being a practicing software engineer. His Ph.D. in Computing Education Research focused on how engineering students learn software design. In subsequent post-doctoral research positions, his work expanded to educational games... Read More →


Wednesday July 27, 2022 11:40am - 12:00pm EDT
4. National Harbor 10+11

11:40am EDT

Building a client portal app for a mortgage management group with embedded Power BI reports
With the Dutch Mortgage Portfolio Management Group we built a client portal for their customers wanting insights into their mortgage investment portfolio. This was done with a Shiny app that runs in Azure App Service, with plumber APIs serving portfolio reports, and embedded power bi reports with clear and easy to maintain dashboards.

Last year we saw a lot of great content on how to combine Tableau with Rstudio Connect, in this talk we have answers for the Power BI users. An interesting example of combining self-service BI with serious data science.

When Microsoft itself mentions "we did not know how to do this, you know you have an interesting topic for rstudio::conf.

Speakers
avatar for Joran De Wilde

Joran De Wilde

AXI Full Service RStudio Partner
Joran is a 24 year old man who recently graduated as Master of ICT Engineering. He currently works as a consultant data scientist. In this role he developped multiple shiny applications for a broad scala of different industries, ranging from the public sector to the retail or finance... Read More →
avatar for Thomas Wouters

Thomas Wouters

AXI Full Service RStudio Partner
Thomas Wouters is a young man who is recently graduated in Applied Computer Science. First started as a Business Analyst but quickly moved to a more sales-oriented position. Because of his enormous affinity with data & analytics, this fits perfectly in the search for customers to... Read More →


Wednesday July 27, 2022 11:40am - 12:00pm EDT
1. Potomac C

11:40am EDT

Demystifying MLOps
Data scientists have an intuition of what goes into training a machine learning model, but building an MLOps strategy to deploy that model can sound daunting for data science teams. Model services are not one-size-fits-all, so it is imperative to know a range of tools available. One option, Vetiver, is a framework for R and Python created to make model deployment feel like a natural extension of a data scientist’s skill set.

This talk offers a high-level overview of what MLOps options are available for model operationalization, but also shows a practical example of an end-to-end MLOps deployment of a model-aware REST API using Vetiver.

Speakers
avatar for Isabel Zimmerman

Isabel Zimmerman

RStudio
Isabel Zimmerman is a software engineer on the open source team at RStudio, where she works on building MLOps frameworks. When she's not geeking out over new data science techniques, she can be found hanging out with her dog or watching Marvel movies.


Wednesday July 27, 2022 11:40am - 12:00pm EDT
2. Potomac D

12:00pm EDT

Lunch
See you in the Maryland ballroom for buffet style lunch.

Buffet & boxed lunch options. Outdoor seating available.

Wednesday July 27, 2022 12:00pm - 1:30pm EDT
5. Maryland

12:00pm EDT

Birds of a Feather Lunch - Variegated Groups
Numerous birds of a feather groups are meeting at this time.

Look for the bird-themed signs and table tents.

Groups include,

  • Energy
  • Government/Public Sector
  • Marketing/Digital Analytics
  • Media & Journalism
  • Professional Services/Consulting
  • Telecommunications
  • Transportation

Wednesday July 27, 2022 12:00pm - 1:30pm EDT
5. Maryland

12:00pm EDT

Business Operations - Birds of a Feather
Join fellow Business Operations analysts for lunch.

Learn about helpful tools for workflow optimization in multi-lingual environments with two short presentations by ProCogia and RStudio.

Welcoming remarks start at 12:30 PM

This event is sponsored by ProCogia.

🍽 Buffet & boxed lunch options.

We've striven to provide sufficient space for everyone interested, but please be aware seatting is limited.

Wednesday July 27, 2022 12:00pm - 1:30pm EDT
Maryland 4, 5, & 6

12:00pm EDT

Pharma / Life Sciences - Birds of a Feather
Meet up with fellow pharma data scientists for lunch.

Learn more about industry trends with a special panel discussion and presentation with Janssen, Novartis, and Atorus Research.

Welcoming remarks start at 12:30 PM

⬡ This includes a discussion on how Janssen and Atorus have collaborated on embedding R into Janssen’s statistical programming team and enabling its usage within a highly regulated environment.

⬡ Bo Wang from Novartis will share how they piloted a solution based on Shiny apps to provide Data Monitoring Committees members a more flexible way to review patient safety data.

This event is brought to you by Atorus Research.

🍽 Buffet & boxed lunch options.

👍 We've striven to provide sufficient space for everyone interested, but please be aware seatting is limited.

Wednesday July 27, 2022 12:00pm - 1:30pm EDT
Maryland 1, 2, & 3

1:30pm EDT

How Anchorage Built Alaska’s Vaccine Finder with R
In January 2021, Alaska residents seeking a COVID-19 vaccine appointment faced a convoluted maze of websites. The software was made for providers—not for residents.

The Anchorage Innovation Team built a fast, and mobile vaccine finder website for Alaska using R. What started as a web scraping prototype launched statewide one week later and ultimately connected tens of thousands of Alaskans to a vaccine.

This talk will cover how we used R to build Alaska's vaccine finder. Including:

  • Scraping and http packages (rvest & httr)
  • Using Heroku and S3 to run R jobs 24/7
  • Creating a flexible data service with R

Speakers
avatar for Ben Matheson

Ben Matheson

Municipality of Anchorage Innovation Team
Ben Matheson is the Data Analyst for the Anchorage Innovation Team— a startup inside city government—where he helps people solve problems using data and technology. Ben works extensively with R and JavaScript to analyze data, visualize ideas, and build tools. He has worked as... Read More →


Wednesday July 27, 2022 1:30pm - 1:50pm EDT
3. Cherry Blossom

1:30pm EDT

Making Data Pipelines in R: A Story From A “Self-Taught” Perspective
When people first learn about R’s capabilities to create fully integrated systems, automated visuals, and seamless data pipelines, the reaction can span from disbelief to amazement. R’s expansive capabilities can leave some feeling overwhelmed when tasked with larger projects like data pipelines. This talk invites the participant to hear the perspective of a self-taught R user who used curiosity and patience to create a functional data pipeline in R for a local health department. Specifically, this talk will touch on the following concepts:

  • Surveying Data Landscapes
  • File Structures
  • Saving Yourself with Data Validation
  • Modularizing Code and Connecting R Scripts
  • Thinking about Pipeline Sustainability
  • Remaining Calm in Unfamiliar R Territories

Talk materials are available at https://github.com/Meghansaha/pipelines_in_R.

Speakers
avatar for Meghan S Harris

Meghan S Harris

PCCTC @ Memorial Sloan Kettering
Meghan Harris is a self-taught R user that is currently a Data Scientist at the PCCTC at the Memorial Sloan Kettering Cancer Center. Meghan’s work allows her to work with data, create custom reports, dashboards, and various solutions using the R programming language daily. Meghan... Read More →


Wednesday July 27, 2022 1:30pm - 1:50pm EDT
4. National Harbor 10+11

1:30pm EDT

Do It For Yourself: Creating a data input platform using R
The Belmaker lab fieldwork involves underwater surveys where we document the observed marine species. Historically, Microsoft’s Access was used for data input. But as the number of surveyors increased, a new user-friendly platform was needed to avoid bottlenecks. In this talk, I share my experience of developing such a platform for the lab using tidyverse packages. This experience demonstrates the concept of Minimum Viable Products as the use of Google Sheets evolved from manually creating the sheets to a single line of code. We encourage other researchers to expand their use of R to create a tailor made data processing workflow - from the data input stage.

Speakers
avatar for Hezi Buba

Hezi Buba

Tel Aviv University
Hezi is currently finishing his PhD in marine ecology at Tel Aviv University, where he also taught R to other graduate students. His research focuses on the impacts of invasive species on the Mediterranean Sea. He is in a constant pursuit of honing his data science skills, and always... Read More →


Wednesday July 27, 2022 1:30pm - 1:50pm EDT
1. Potomac C

1:30pm EDT

Remote Content Execution with RStudio Connect and Kubernetes
This summer the RStudio Connect team will announce a feature which has been over two years in the making: “Remote” off-host content execution with launcher in Kubernetes.

We have been quietly beta testing the Launcher feature with select partners and customers for several months while we prepare for the public announcement.

This talk will highlight why someone might want to use this new execution mode with Connect, show just how seamless it is to get everything configured in a fresh environment on EKS, and finally set some critical context for what publishers and administrators should expect by addressing the anticipated FAQs.

Talk materials are available at https://kelly.quarto.pub/rstudioconf-talk-2022/.

Speakers
avatar for Kelly O'Briant

Kelly O'Briant

RStudio
Kelly O'Briant is the Product Manager for RStudio Connect.


Wednesday July 27, 2022 1:30pm - 1:50pm EDT
2. Potomac D

1:50pm EDT

Scaling and automating R workflows with Kubernetes and Airflow
During the pandemic, epidemiologists have been forced to adapt to the unprecedented scale of the data and high cadence of reporting.

At the UK Health Security Agency, we have created a platform for teams to easily deploy R and/or Python tasks onto our High-Performance Computing resources, scheduling their execution, and allowing previously unthinkable workloads to be executed with ease. Thanks to Kubernetes, git, Docker, and Airflow, our epidemiologists can stop worrying about their laptop's memory and bandwidth, and focus on answering the crucial questions of the pandemic. We'd like to tell you how we did it.

Speakers
avatar for Isaac Florence

Isaac Florence

UK Health Security Agency
I am an epidemiologist and public health policy professional working for the UK Health Security Agency (UKHSA). I have been using R and RStudio products since university and both have formed a major part of my career so far. At UKHSA I write in - and teach - R, SQL, Python, and other... Read More →


Wednesday July 27, 2022 1:50pm - 2:10pm EDT
3. Cherry Blossom

1:50pm EDT

Garbage Data, And What To Do About Them
No matter the requirements of the project, data are rarely ready for analysis without some intervention up front, often described as cleaning or tidying up your data. Researchers and data professionals employ many tools to make their data usable for their needs; but, there exist data that are so far beneath the threshold for usefulness that they cannot be used responsibly for analysis or decision-making, i.e. “bad data.” This talk proposes a framework for identifying bad data, with examples from both academic and industry; identifies challenges you might face from stakeholders when you identify bad data; and suggests concrete steps you can take to overcome those challenges now and in the future.

Speakers
avatar for Jim Kloet

Jim Kloet

Jim Kloet (Kloet rhymes with flute) is a data professional in Chicago. He encourages you to stay hydrated.


Wednesday July 27, 2022 1:50pm - 2:10pm EDT
4. National Harbor 10+11

1:50pm EDT

Oddly Satisfying - Find delight in the mundane
It happens to us all - a request to "just re-run the code" turns into a project nightmare. The materials left to you are poorly documented and scattered across Word, Excel, ArcGIS, and PDF reports. In this talk, I show you how to turn any project into a point of pride. Using a worked example, I provide guidance on how to complete a project intake, find your opportunity to shine, and how to work efficiently and reproducibly through thoughtful documentation. Finally, I cover how to set up the project for future success. Learn how to take the messy project you dread and make it inexplicably satisfying.

Talk materials are available at https://lizroten.com/oddly.

Speakers
avatar for Liz Roten

Liz Roten

Metropolitan Council
Liz Roten is a data scientist and cartographer working at the intersection of regional government, transportation, and data science. She specializes in interactive and static data visualization, package development, and collaborative workflows. She is passionate about open-source... Read More →


Wednesday July 27, 2022 1:50pm - 2:10pm EDT
1. Potomac C

1:50pm EDT

Data science in your customers hands on a budget and a deadline: Publishing Customer Facing Products with RStudio Connect
Benjy Braun, Chief Architect for 202 Group, shows why he and the 202 Group team decided to use RStudio Connect to build customer facing applications and secure websites while focusing on data science and not having to worry about hiring for—or learning—a bunch of web tools like JavaScript and php. With well organized git repos and Rmarkdown, you can build websites for customers with embedded shiny apps and dynamic visualizations and publish and deploy them using RStudio Connect. Using this method, you won’t confine your analysis to static documents like PowerPoint and pdfs and your customers will always see the latest and most up do date information. The tools are out there to do this and it’s easy to start and iterate with more features overtime.

Speakers
avatar for Benjamin Braun

Benjamin Braun

202 Group
I am a mission-driven problem solver with expertise in bringing data science applications from inception to profitability and acquisition. I build data and analytic tools that help users understand challenges, make informed decisions, and act. My experience across the data science... Read More →


Wednesday July 27, 2022 1:50pm - 2:10pm EDT
2. Potomac D

2:10pm EDT

An Integrated Workflow: Microsoft Azure DevOps, RStudio Workbench, RStudio Connect
The California Department of Public Health’s (CDPH) COVID-19 response has required processing and communicating large amounts of data with quick turnaround times to inform strategic resource deployment in the state. To achieve this goal, the COVID-19 Data Processing and Informatics Section developed an integrated workflow that includes a combination of tools: Microsoft Azure DevOps, RStudio Workbench, and RStudio Connect. Attendees will walk away with an understanding of how these three tools helped our team prioritize, collaborate, decrease turnaround time, and scale our efforts to deploy and automate mission critical procedures and products.

Speakers
avatar for Lawrence Y. Tello

Lawrence Y. Tello

Informatics Specialist, California Department of Public Health
Lawrence is an informatics specialist at the California Department of Public Health within the Data Processing and Informatics Sectionworking on products that contribute to California’s COVID-19 response. He spent some time working on psychology and neuroscience projects prior to... Read More →


Wednesday July 27, 2022 2:10pm - 2:30pm EDT
3. Cherry Blossom

2:10pm EDT

Project Immortality: Using GitHub To Make Your Work Live Forever
If you've invested a lot of time and energy on a data science project, you might be ready to move on to new and exciting things. Don't let your old projects wither away and die! There are some powerful and free resources from GitHub that you can leverage to help pay it forward to the next person looking to use your work.

In this talk, I'll showcase how you can transform ordinary R scripts into self-sufficient, robust projects by converting your code into a package, adding some GitHub Actions, and storing data into GitHub Releases. This will help your projects more useful - now and long after you've stopped working on the project!

Talk materials are available at https://github.com/tanho63/project_immortality_with_github/.

Speakers
avatar for Tan Ho

Tan Ho

@_TanHo
I'm a data enthusiast from Ottawa, Canada who loves R 🚀, Shiny ✨, fantasy football 🏈 and carving pumpkins 🎃I started DynastyProcess.com and ffverse, maintain nflverse data and packages, stream on Twitch, and mentor at R4DS Slack Community.Away from my keyboard, I enjoy rowing 🚣️, skiing ⛷, lifting weights 🏋️, and hanging out with my dog Jasper... Read More →


Wednesday July 27, 2022 2:10pm - 2:30pm EDT
4. National Harbor 10+11

2:10pm EDT

Advocating for Automation: Adapting Current Tools in Environmental Science through R
Increasing quantity and types of available data and the need for more advance analyses are outpacing current tools in environmental science. R allows us to efficiently manipulate and analyze complex data; however, taking the first leap from a current workflow to automation can be challenging. This talk will focus on how to advocate for automation in a team with diverse skillsets. We’ll start with examples of small changes that can be made using {officer} and {openxlsx} to meet your team members where they are by presenting results in a manner familiar to them; and finish by presenting a case study of how we used R to automate software from a regulatory agency.

Talk materials are available at https://github.com/hannahpodzorski/advocating-for-automation.

Speakers
avatar for Hannah Podzorski

Hannah Podzorski

GSI Environmental
Hannah Podzorski is a hydrologist currently working as an environmental consultant. She specializes in quantitative analysis and visualization of complex environmental data for a variety of projects and clients. She develops both static and interactive visuals, including {Shiny} apps... Read More →


Wednesday July 27, 2022 2:10pm - 2:30pm EDT
1. Potomac C

2:10pm EDT

R Shiny - From Conception to the Cloud
I will share how we published an R Shiny application to AWS, the decisions we made, and what we learned in the process.

One challenge we faced was figuring out how we could develop collaboratively. We needed to define our development workflow, including version control, dependency management, and quality assurance.

Then, we needed to define the deployment method. R Studio is great for development, but it may hide many of the aspects that break the application. We used CI/CD workflows as much as possible to make sure our code was robust before pushing the changes to production.

Lastly, our infrastructure team designed a framework that is replicable, so we are ready to deploy new R Shiny applications quickly and focus on data analysis.

Speakers
avatar for Ivonne Carrillo Dominguez

Ivonne Carrillo Dominguez

Data Engineering Manager, Bixal
Ivonne is a Data Engineering Manager on the Data team at Bixal where she has worked for 6 years now. She works on data visualization, data processing, and data analysis. She received her B.S. in computer system engineering in Mexico. Before joining Bixal, she worked as a Software... Read More →


Wednesday July 27, 2022 2:10pm - 2:30pm EDT
2. Potomac D

2:30pm EDT

Optimal allocation of COVID-19 vaccines in west Africa - A Shiny success story
In 2022, African countries need to vaccinate most of their population against COVID-19. With an influx of millions of doses, countries need to plan in near real-time, considering constraints at every step of the supply chain. Our Shiny app provides a low-cost decision-making tool for the Ministry of Health in Côté d’Ivoire. Using lpSolve library, we optimize for factors like eligible population in catchment area of vaccination sites, types and number of vaccine doses, expiration date of vaccine does, availability of trained healthcare staff, and presence of cold storage facility. The app maximizes number of people vaccinated while minimizing vaccine wastage across thousands of vaccination sites. (App development will end on May 31)

Speakers
avatar for Anubhuti Mishra

Anubhuti Mishra

Senior Data Scientist, Palladium International
Anubhuti Mishra is a Senior Data Scientist who works at the intersection of public health and data science. In recent past she has developed indices to estimate subnational vulnerability to COVID-19 in Africa, built models to identify high-risk HIV patients, and created allocation... Read More →


Wednesday July 27, 2022 2:30pm - 2:50pm EDT
3. Cherry Blossom

2:30pm EDT

R Markdown + RStudio Connect + R Shiny: A Recipe for Automated Data Processing, Error Logging, and Process Monitoring
R is more than just a tool for data analysis– it can help streamline and automate processes, including managing and monitoring data pipelines. This presentation highlights how R Markdown, RStudio Connect, and R Shiny can be utilized to automate data processing, error logging, and process monitoring. By the end of the presentation, attendees will better understand: (1) how RStudio Connect paired with R Markdown can be used to automate data processing, (2) that packages such as blastula and loggit can be used within R Markdown documents scheduled on RStudio Connect to email users when an error is encountered during data processing and log those errors, and (3) that the resulting logs can be fed to a Shiny app to enhance process monitoring.

Speakers
avatar for Kolbi Parrish

Kolbi Parrish

California Dept. Public Health + UCSF
Kolbi Parrish is an Informatics Specialist working at the California Department of Public Health (CDPH) through the University of California, San Francisco. She has worked in public health for over 10 years, has a Master’s degree in Industrial and Organizational Psychology, and... Read More →
avatar for Andy Pham

Andy Pham

UCSF + California Dept. Public Health
Andy is a clinical informatics specialist working at the California Department of Public Health Data Processing and Informatics section, where he works on building disease surveillance data pipelines for gathering, processing and reporting. Andy has a Master's in Health Informatics... Read More →


Wednesday July 27, 2022 2:30pm - 2:50pm EDT
4. National Harbor 10+11

2:30pm EDT

Saving 1,000 hours with RStudio: selling R in your workplace
There are many benefits to using R and no lack of packages that help you solve technical difficulties, but you may still get stuck at selling it to decision-makers or implementing it at work. Tiger's recommendation is to start a project that focuses on automating work with R and gets everyone involved. Once the value of R has been established, selecting RStudio Workbench and RStudio Connect for streamlining tasks would not be a difficult choice.

Several years ago, Tiger's organization moved away from SAS in favor of R for modeling projects, but there wasn’t much initiative taken company-wide to move everything to a new tool. To help change that, he started a work automation project using R that has saved 12K+ hours of manual work.

In this talk, he will share the key parts of the project, lessons learned, and a structure you can follow if you would like to do something similar in your organization.

Talk materials are available at https://tigertang.org/rst_conf_2022_talk/.

Speakers
avatar for Tiger Tang

Tiger Tang

CARFAX, Inc.
Tiger(Chongtai) Tang is a Data Science Manager at CARFAX, dedicated to building the Data Science team specializing in NLP and forecasting. With a passion for the Data Science community, he has been creating R training, handbook, and R songs for technical users and business users... Read More →


Wednesday July 27, 2022 2:30pm - 2:50pm EDT
1. Potomac C

2:30pm EDT

Robust R Deployments: Building a Pipeline from RStudio to Production
R is often maligned as a poor fit for production deployment systems. At dv01 we deploy Plumber API that serves machine learning models to Tape Cracker, a client facing web application. With R in production we use the same codebase throughout the data science pipeline, saving time and resources while reducing the chance for bugs in the hand-off. To accomplish this we use industry standard continuous integration and deployment tools to deploy our API to compute clusters in the cloud. Our pipeline progresses stepwise through staging, release and production environments. Automated integration testing at each step enables a robust and reliable deployment. In this talk R users will learn strategies to deploy R code in production environments.

Speakers
avatar for David Maguire

David Maguire

dv01
A physical scientist by training, David leverages the scientific method along with the statistical capabilities of R to solve business issues. He has applied this skillset to a variety of industries, including pharmaceutical manufacturing and, currently, structured finance at dv01... Read More →


Wednesday July 27, 2022 2:30pm - 2:50pm EDT
2. Potomac D

3:20pm EDT

What they forgot to teach you about starting a business with R
Lots of people I meet want to start their own business. "I know how to use R," they figure, "so I should be able to go out on my own, find clients, and work for myself."

The reality for many people is very different. They spend weeks on a business plan, website, and social media strategy. Then they sit down at their desk, waiting for the flood of clients. But the clients never come.

Since starting R for the Rest of Us in 2019, I've learned a lot of lessons along the way about (and how not) to run a business using R. In this talk, I'll share some of these lessons.

Whether you have dreams to start a huge business or want to freelance using R, this talk will help you get started with the next chapter of your career.

Talk materials are available at https://dgkeyes.com/rbusiness.

Speakers
avatar for David Keyes

David Keyes

R for the Rest of Us
David Keyes is the CEO and founder of R for the Rest of Us. Since 2019, R for the Rest of Us has helped hundreds of people and dozens of organizations learn to use R. And in our consulting work, R for the Rest of Us has created visually appealing reports that effectively communicate... Read More →


Wednesday July 27, 2022 3:20pm - 3:40pm EDT
3. Cherry Blossom

3:20pm EDT

Exploring Query Optimization: How a few lines of code can save hours of time
If you find yourself waiting hours for your queries to run, this talk is for you. Learn from my query mistakes and avoid crashing your database.
In this talk, you'll learn about minor code changes that can dramatically improve query run time.

Talk materials are available at ~https://github.com/bhadi26/rstudio-conf-2022-slides/blob/main/rebecca-hadi-r-studio-presentation-2022.pptx.

Speakers
avatar for Rebecca Hadi

Rebecca Hadi

Lyn Health
Rebecca holds a Master's degree in Applied Mathematics from Johns Hopkins University and a Bachelor's degree in Mathematics from the University of Washington. She loves automating and optimizing processes, building Shiny apps, and helping others learn to code.


Wednesday July 27, 2022 3:20pm - 3:40pm EDT
4. National Harbor 10+11
  Track National Harbor 10+11, Databases

3:20pm EDT

Quarto for the Curious
Are you curious about Quarto? Maybe you saw it on Twitter or the RStudio::conf agenda. Perhaps this raised questions like: What exactly is Quarto? What about RMarkdown? (don’t worry it’s not going away!) What features does Quarto add? What should I do with my existing Rmd/ipynb files?

This talk will answer all of those questions and more! I’ll present Quarto as a next-gen version of RMarkdown, compare the similarities, and then discuss the new features in Quarto for publishing documents, presentations, blog posts, lab notebooks and more! Lastly, I’ll cover what this means for our customers using RStudio Team, and the exciting new world for Python users.

Talk materials are available at https://thomasmock.quarto.pub/quarto-curious/.

Speakers
avatar for Tom Mock

Tom Mock

RStudio, PBC
Thomas is the Customer Enablement Lead at RStudio, helping RStudio’s customers be as successful as possible. He is deeply involved global data science community, sharing tips on #RStats Twitter (find him at @thomas_mock), as co-founder of #TidyTuesday, a weekly Data Science learning... Read More →


Wednesday July 27, 2022 3:20pm - 3:40pm EDT
1. Potomac C

3:20pm EDT

Dissecting the quick fix: Analysing tech-solutionist solutions
Machine learning models, applied in the real world, can have unanticipated, harmful side effects. Recommended counter-measures include structured documentation of models ("Model Cards for Model Reporting") and training data used ("Data Sheets for Datasets").

In this talk, I'd like to propose a similar, multi-dimensional approach to analyzing the "solution" as a whole - "solution" as in "tech solutionism", the common term for technical "fixes" that have unintended, harmful consequences.

The idea here is that, by asking WHAT a solution is doing, WHO is providing it and WHY, as well as WHERE and HOW it will be used, we should be able to systematically assess whether we are, in fact, confronted with an instance of tech solutionism.

Speakers
avatar for Sigrid Keydana

Sigrid Keydana

RStudio
Sigrid works at RStudio, where she writes about open-source deep learning, machine learning, and scientific-computation frameworks. Seeing how related technologies are increasingly becoming a part of our everyday lives - often, without us even knowing - she is deeply worried about... Read More →


Wednesday July 27, 2022 3:20pm - 3:40pm EDT
2. Potomac D

3:40pm EDT

What they forgot to teach you about becoming an open source contributor
Getting involved in open source is an amazing learning experience and helps you grow your skills as a developer, but to a new contributor there are so many unknown factors about open source projects. In this talk, I'm going to discuss my journey from occasional open source contributor to full time project maintainer, and answer questions such as: what does it look like from the inside of an open-source project? What's a good way to get involved, and really learn the internals of an R package? How can I navigate the social dynamics of an open source project? How do contributions look entirely different from the point of view of a contributor versus a maintainer?

Speakers
avatar for Nic Crane

Nic Crane

Voltron Data
Nic Crane is an R developer, educator, and general enthusiast. They are a committer on Apache Arrow, and are passionate about developer experience.


Wednesday July 27, 2022 3:40pm - 4:00pm EDT
3. Cherry Blossom

3:40pm EDT

dm: Analyze, build and deploy relational data models
dm bridges the gap in the data pipeline between standalone data frames and relational databases. Implementing a "grammar of joined tables", it provides a consistent set of verbs for consuming, creating, and deploying relational data models. In this talk I present a short overview of how dm can help your data analysis and ETL processes, and highlight recent developments.

Speakers
avatar for Kirill Müller

Kirill Müller

cynkra GmbH
Kirill has been working on the boundary between data and computer science for more than 20 years. He has been awarded five R consortium projects to improve database connectivity and performance optimization in R. Kirill is a core contributor to several tidyverse packages, including... Read More →


Wednesday July 27, 2022 3:40pm - 4:00pm EDT
4. National Harbor 10+11

3:40pm EDT

Sometimes you just need words
This talk will trace the evolution of a report from a mostly text free dashboard into a text heavy R markdown report with dynamic text blocks. The report in question is provided to the largest financial institutions in the U.S., but the audience for the data largely is composed of compliance experts and lawyers.

The interface between data products, and people who make decisions is often the most difficult piece in a project. Frequently, what your audience really needs is words! This talk will help you recognize when you need more narrative and will provide some helpful technical advice to get you there, including how to use existing word templates and how to use whisker:: and glue:: to help you dynamically generate text.

Talk materials are available at https://github.com/lmkirvan/presentation.

Speakers
avatar for Lewis Kirvan

Lewis Kirvan

Consumer Financial Protection Bureau
I'm a researcher at the Consumer Financial Protection Bureau, where I've spent the last (nearly) 10 years recovering from getting a law degree. I mostly work to understand consumers experiences by listening to them at scale.


Wednesday July 27, 2022 3:40pm - 4:00pm EDT
1. Potomac C

3:40pm EDT

Introducing workboots: Generate prediction intervals from tidymodel workflows
Sometimes, we want a model that generates a range of possible outcomes around each prediction. Other times, we just care about point predictions and may opt to use a fancy model like XGBoost. But what if we want the best of both worlds: getting a range of predictions while still using a fancy model? That’s where bootstrapping comes to the rescue! By using bootstrap resampling, we can create many models that produce a prediction distribution – regardless of the model type! In this talk, I’ll give an overview of bootstrap resampling for prediction, the pros/cons of this method, and how to implement it as a part of a tidymodel workflow with the workboots package.

Talk materials are available at https://github.com/markjrieke/rstudio-conf-2022.

Speakers
avatar for Mark Rieke

Mark Rieke

Memorial Hermann Health System
I am a senior consumer experience (CX) analyst at Memorial Hermann Health System where I use R and tidymodels to provide actionable insights from patient satisfaction survey data. I love making beautiful charts, working on home improvement projects, and playing jazzy piano. I live... Read More →


Wednesday July 27, 2022 3:40pm - 4:00pm EDT
2. Potomac D

4:00pm EDT

What they forgot to teach you about industry transitions from academia (WTF AITA)
This talk presents a gathering of resources from the RStudio community for industry job-seekers who are transitioning from academia. Examples include packages for resume-building and cover letter writing, quick starts for personal websites and GitHub profiles, and helpful R networks. A new website named after the talk (wtf-aita.com) will showcase and link to the discussed resources, and is open to further community contributions.

Speakers
avatar for Travis Gerke

Travis Gerke

The Prostate Cancer Clinical Trials Consortium
As academic faculty, I led cancer-focused research teams in the application and development of tools for applied machine learning, causal inference, and biostatistics. In the post-academic setting, I direct data science efforts in the oncology clinical trials space. I am a 13+ year... Read More →


Wednesday July 27, 2022 4:00pm - 4:20pm EDT
3. Cherry Blossom

4:00pm EDT

dbcooper: Turn any database into an R or Python package
The first step of any data analysis is importing data, but for tables in a database this can be a surprisingly challenging step that takes analysts out of their workflow. The dbcooper package turns a database connection into a collection of accessor functions, letting users take advantage of autocomplete as they browse a database in the same ways they would engage with local tables. I'll walk through how dbcooper can be used to build a database-specific package, whether for an organization's internal use or to wrap a public data source. The dbcooper package supports both R (built on top of dbplyr) and Python (built on top of siuba), and in either language makes it easy and intuitive to extract insights from a database.

Speakers
avatar for David Garrett Robinson

David Garrett Robinson

Heap Analytics
David Robinson is Director of Data Science at Heap Analytics, where he's helping to build the next generation of product analytics technology. He's the co-author with Julia Silge of the tidytext package and the O’Reilly book Text Mining with R. He also created the broom, fuzzyjoin... Read More →


Wednesday July 27, 2022 4:00pm - 4:20pm EDT
4. National Harbor 10+11

4:00pm EDT

Highlights of the knitr package from the past two years
In this talk, I will highlight a few selected new features of the knitr package introduced in the past two years including: 1) the new way to write chunk options as "#|" comments in the chunk body; 2) new ways to reuse both chunk body and chunk options from another code chunk; 3) a few useful new engines, namely comment (comment out arbitrary content), verbatim (write arbitrary verbatim content), embed (embed external files and display their content verbatim), and exec (execute an arbitrary command); and 4) a few new chunk options such as file (to include external files as the chunk body) and lang. Hopefully these new features can help both knitr users and developers make better use of knitr.

Speakers
avatar for Yihui  Xie

Yihui Xie

Software Engineer, RStudio, PBC
Yihui Xie is a software engineer at RStudio. He earned his PhD from the Department of Statistics, Iowa State University. He has authored and co-authored several R packages, such as knitr, rmarkdown, bookdown, blogdown, and xaringan. He has published a number of books, including “Dynamic... Read More →


Wednesday July 27, 2022 4:00pm - 4:20pm EDT
1. Potomac C

4:00pm EDT

The tidysynthesis R package
Society benefits when leaders make more evidence-based decisions, but growing privacy concerns hamper researchers’ ability to understand and improve the world. Fully synthetic data, pseudo data generated by models, can protect confidentiality and produce statistically valid analysis. This talk shares how the Urban Institute collaborates with the IRS to create fully synthetic tax data for tax policy research. We built an R package called tidysynthesis to create machine learning models for each variable in the data. tidysynthesis leverages the power of tidymodels and allows users to run a sequences of machine learning models with different recipes, engines, and samplers while adding additional noise and enforcing logical constraints.

Speakers
avatar for Aaron R. Williams

Aaron R. Williams

Urban Institute
Aaron R. Williams is a senior data scientist at the Urban Institute where he works on microsimulation models, data imputation methods, and expanding access to administrative data with formal privacy and synthetic data. Williams leads Urban’s R Users Group and teaches Intro to Data... Read More →


Wednesday July 27, 2022 4:00pm - 4:20pm EDT
2. Potomac D

4:30pm EDT

The Past and Future of Shiny
Shiny is a package that makes it easy to create interactive web apps using R. It's hard to believe, but it's been ten whole years since the launch of Shiny! I'll mark the occasion by sharing some stories about the creation of Shiny, the conditions that have made it a success over the years, and our plans for the near future.

Speakers
avatar for Joe Cheng

Joe Cheng

CTO, RStudio, PBC
Joe Cheng is the Chief Technology Officer at RStudio PBC, where he helped create the RStudio IDE and Shiny web framework.


Wednesday July 27, 2022 4:30pm - 5:30pm EDT
0. Potomac A+B

5:30pm EDT

Birds of a Feather Happy Hour - Healthcare
Mingle with other healthcare practitioners and enjoy light refreshments before joining our evening event.

Wednesday July 27, 2022 5:30pm - 7:00pm EDT
Maryland 4, 5, & 6

5:30pm EDT

Birds of a Feather Happy Hour - Industry Groups
Birds of a Feather evening reception. Mingle with data scientist working the similar fields.

Look for signs to indicate locations.

Groups include,

  • Finance. Located in Chesapeake.
  • Insurance. Located in Chesapeake.
  • Supply Chain / Manufacturing / CPG / Retail. Located in Chesapeake.

Light refreshments will be served.

Dinner and our evening event starts at 6:00 PM in the Orchard Terrace, just downstairs.

Wednesday July 27, 2022 5:30pm - 7:00pm EDT
Chesapeake

5:30pm EDT

Birds of a Feather Happy Hour - Pharma / Life Sciences
Mingle with other life scientists and enjoy light refreshments before joining our evening event.

This group is brought to you by TCB Analytics

Wednesday July 27, 2022 5:30pm - 7:00pm EDT
Maryland 1, 2, & 3

5:30pm EDT

Spanish Speakers / LATAM - Birds of a Feather Happy Hour
The LATAM R Birds of a Feather is a get together of all Spanish-speaking R users and Data Innovators attending the conference.

After a brief overview of the state of data in LATAM by ixpantia, everyone will have the opportunity to introduce themselves.

If you are intrigued by the growth and potential in the region and you want to connect with like-minded people, come join us.

The meeting will be conducted “en Español”, but los hablantes de todos los idiomas son bienvenidos.

Wednesday July 27, 2022 5:30pm - 7:00pm EDT
Chesapeake 9

6:00pm EDT

Evening Dinner & Activities
Join us for food and drink, lawn games, dancing, & other fun activities.

Buffet dinner and drinks will be available. There will be indoor and outdoor setting

Wednesday July 27, 2022 6:00pm - 9:00pm EDT
7. Orchard Terrace
 
Thursday, July 28
 

7:30am EDT

Breakfast
See you in the Maryland ballroom for buffet breakfast.

Buffet & boxed lunch options. Outdoor seating available.

The morning keynote starts at 9:00 AM. It may take 15 minutes to walk to and find a seat there.

Thursday July 28, 2022 7:30am - 9:00am EDT
5. Maryland

7:30am EDT

Birds of a Feather Breakfast - Variegated Groups
Meet & mingle for breakfast Birds of a Feather.

See table signage for group locations in Maryland C+D, and join the group you're most keen to mingle in.

Groups include,

  • Survival analysis
  • Time series
  • Dash / Flask / Streamlit / Bokeh
  • Not for Profit / NGOs
  • Spark
  • Religious and Faith Based Organizations and Communities. Join analysts at faith based organizations and communities — and folks keen to explore related data. At at this table at this table starting at 7:30 AM

Thursday July 28, 2022 7:30am - 9:00am EDT
5. Maryland

7:30am EDT

Education & Teaching - Birds of a Feather
Meet other educators for breakfast.

Join us at 8:30 AM for welcoming remarks.

See table signs for smaller, focused groups. Groups include,

  • Teachers of R and statistics
  • K-12
  • Education Researchers
  • RStudio Cloud Instructors
  • Professional R Instructors
  • and many more

The morning keynote starts at 9:00 AM. It may take 15 minutes to walk to and find a seat there.

🪑 We've striven to provide sufficient space for everyone interested, but please be aware seatting is limited.

Thursday July 28, 2022 7:30am - 9:00am EDT
Maryland 1, 2, & 3

9:00am EDT

Hello Quarto: Share • Collaborate • Teach • Reimagine
What is Quarto? In this talk, in four acts, we will discuss how Quarto unifies and extends the R Markdown ecosystem. Through our work with NASA Openscapes, Duke University, and RStudio, we will share our experiences sharing, teaching, collaborating, and reimagining with Quarto. Full of demos with Quarto in RStudio and Jupyter and weaving together narrative and code, this talk will help you get the most out of Quarto regardless of your background with R Markdown, the computing language(s) you may use, and your tool of choice for authoring.

Speakers
avatar for Mine Cetinkaya-Rundel

Mine Cetinkaya-Rundel

Professor + Developer Educator, RStudio + Duke University
Mine Çetinkaya-Rundel is Professor of the Practice at Duke University and Developer Educator at RStudio. Mine’s work focuses on innovation in statistics and data science pedagogy, with an emphasis on computing, reproducible research, student-centered learning, and open-source education... Read More →
avatar for Julia Stewart Lowndes

Julia Stewart Lowndes

Openscapes
Julia Stewart Lowndes, PhD is founding director of Openscapes. She is a marine ecologist and champion for making science more open, efficient, inclusive, and kind. Working at the intersection of actionable environmental science, data science, and open science, she is a Mozilla Fellow, National Science Foundation Better Scientific Software Fellow... Read More →


Thursday July 28, 2022 9:00am - 10:00am EDT
0. Potomac A+B

10:30am EDT

Exploratory Spatial Data Analysis in the tidyverse
R has come quite a long way to enable spatial analysis over the past few years. Packages such as sf have made spatial analysis and mapping easier for many. However, adoption of R for spatial statistics and econometrics has been limited. Many spatial analysts, researchers, and practitioners lean on Python libraries such as pysal.

In this talk I briefly discuss my journey through spatial analysis and introduce a new package sfdep which provides a tidy interface to spatial statistics and noteably exploratory spatial data analysis. sfdep is an interface to the spdep package as well as implements other common exploratory spatial statistics.

Talk materials are available at https://github.com/rstudio/rstudio-conf/blob/master/2022/josiahparry/rstudio__conf(2022L)%20-%20Josiah%20Parry.pdf.

Speakers
avatar for Josiah Parry

Josiah Parry

The NPD Group
Josiah Parry is a Research Analyst in the Research Science division at The NPD Group focusing on modernization and methodology. Formerly he worked at RStudio, PBC on the customer success team enabling public sector adoption of data science tools. Josiah received his master's degree... Read More →


Thursday July 28, 2022 10:30am - 10:35am EDT
2. Potomac D

10:30am EDT

The Benefit of Talking to the "Non-Datas"
Data literacy is a tool to build understanding- of the world and ourselves. Data, AI and tech are sometimes portrayed as scary and unknowable; however, data can be for everyone. Data, and decisions based off data, have enormous implications in our daily lives. We (data practitioners) likely have some baseline understanding of numbers and how to read a chart. But others, whether our friends, family members or coworkers, might not have the same level of understanding.

This talk will address how to talk to these seemingly "non-data" people, the benefits of talking data with them, and (hopefully) encourage more curiosity and wonder at the creativity of data. We will also briefly cover what data literacy is and why we ought to care about it.

Talk materials are available at https://rconf-2022-caro-buck.netlify.app/#/section.

Speakers
avatar for Caro Buck

Caro Buck

Wunderman Thompson
I’m Caro, short for Caroline! I’m a curious and creative human, with a passion for data of all sorts. I love wrangling unconventional data, or using a visualization to build understanding and promote data literacy. I have worked on a wide variety of client projects, from AI- generated... Read More →


Thursday July 28, 2022 10:30am - 10:50am EDT
3. Cherry Blossom

10:30am EDT

I made an entire e-commerce platform on Shiny
E-commerce requires passing data between many components like managing a shopping cart, taking payment, fulfilling orders, and sending emails. I've successfully created a full e-commerce platform entirely in R for a quirky side project. The R package ggirl lets users order ggplot2 plots as postcards and more via R functions. Those R functions pass data to a separate Shiny app, which then passes data other services like Stripe payment APIs and printing APIs. In this talk I will walk through how to use packages like httr, callr, and brochure to have your Shiny apps call external services and do many tasks in parallel. You’ll leave the talk with more ways to use Shiny than dashboards plus the knowledge to monetize your existing dashboards!

Talk materials are available at https://link.jnolis.com/rstudio22-slides.

Speakers
avatar for Jacqueline Nolis

Jacqueline Nolis

Saturn Cloud
Dr. Jacqueline Nolis is a data science leader with 15 years of experience in running data science teams and projects at companies ranging from Airbnb to Boeing. She is the Chief Product Officer at Saturn Cloud where she helps design products for data scientists. Jacqueline R expertise... Read More →


Thursday July 28, 2022 10:30am - 10:50am EDT
4. National Harbor 10+11

10:30am EDT

Developing internal tools for multi-lingual teams
Internal packages are great for boosting productivity and promoting good practice, but what kinds of challenges do we face when designing solutions for multi-lingual teams? Here I will advocate for a design approach we are using at Bumble to build Python and R packages with the same foundations. I will discuss the benefits of this approach for the developer and the wider organisation.

Talk materials are available at https://github.com/jamie-ralph/rstudio-conf-2022.

Speakers
avatar for Jamie Ralph

Jamie Ralph

Bumble
Jamie is a developer at Bumble where he builds internal analytics tools in R and Python. Before this he worked in various data roles in the public sector. He enjoys building tools that make people's lives easier. Outside work he likes going for walks with his dog Margie and baking... Read More →


Thursday July 28, 2022 10:30am - 10:50am EDT
1. Potomac C

10:35am EDT

WebR: R compiled for WebAssembly and running in the browser
In this talk I introduce webR, a port of R to WebAssembly using Emscripten. WebR brings a full R environment to the browser, enabling R code execution, numerical analysis, loading packages and more. No local or cloud-based R servers are required as all computation is performed within the browser. I give a brief overview of our build process for webR, describing the toolchain and some of the issues we encountered. A publicly available web-based R session is demonstrated, with package and plotting support.

Talk materials are available at https://github.com/rstudio/rstudio-conf/blob/master/2022/georgestagg/webr%20-%20George%20Stagg.pdf.

Speakers
avatar for George Stagg

George Stagg

RStudio
George Stagg is a Software Engineer with experience in research computing and mathematical digital learning software. He holds a PhD in Applied Mathematics from Newcastle University, researching the physics of quantum fluids through high performance computer simulation. Until recently... Read More →


Thursday July 28, 2022 10:35am - 10:40am EDT
2. Potomac D

10:40am EDT

It's about time
Dealing with date-times is hard. Dealing with date-times without the proper tooling is even harder! clock is an R package that aims to provide comprehensive and safe handling of date-times. It goes beyond the date and date-time types that base R provides, implementing new types for year-month, year-quarter, ISO year-week, and many other date-like formats, all with up to nanosecond precision. In this talk, you'll see how clock emphasizes "safety first" when manipulating date-times, and how these new date-time types can be used in your own work.

Talk materials are available at https://speakerdeck.com/davisvaughan/2022-rstudio-conf-its-about-time.

Speakers
avatar for Davis Vaughan

Davis Vaughan

RStudio
Davis is a software engineer at RStudio working on improving the tidyverse and tidymodels ecosystems. He works on packages such as tidyr, vctrs, furrr, slider, clock, hardhat, and yardstick.


Thursday July 28, 2022 10:40am - 10:45am EDT
2. Potomac D

10:45am EDT

Let your mobile shine - Leveraging CSS concepts to make shiny apps mobile responsive
As shiny developers, we spend a lot of time working on the server side, and less time on the UI/UX part. According to the Engineering Production-Grade Shiny Apps book by the ThinkR team, no matter how complex and innovative your back- end is, your application is bad if your user experience (UX) is bad. People spend more time on the internet from their mobile phones, so it is important to ensure that our shiny apps are mobile responsive. At the beginning of this year, I was tasked with creating a mobile version of an already made shiny app in the shortest time possible. In this talk, as much as this work is tied to an NDA, I will highlight a few css tricks that I used to achieve this.

Talk materials are available at https://docs.google.com/presentation/d/1HKcMBQiFo6oC5Cj3OFf6wlGuSGQxADyxJMG_74cNwaU/edit#slide=id.g116406a1a2a_2_135.

Speakers
avatar for Shelmith Nyagathiri Kariuki

Shelmith Nyagathiri Kariuki

Data Analytics Consultant
Shel Kariuki is a Data Analytics consultant based in Nairobi, Kenya. She currently works as a short term consultant at The World Bank . She is a co-organizer of #NairobiR and R-Ladies Nairobi which are groups in Nairobi whose aim is to bring people together to learn and develop their... Read More →


Thursday July 28, 2022 10:45am - 10:50am EDT
2. Potomac D

10:50am EDT

Accelerating geospatial computing using Apache Arrow
The ‘arrow’ R package and wider Apache Arrow ecosystem provide an end-to- end solution for querying and computing on in-memory and bigger-than-memory data sets using the Apache Arrow C++ library. In this talk we introduce the ‘geoarrow’ package, which extends Arrow to provide efficient columnar storage for spatial types and functions to support spatial queries in the Arrow compute engine. We focus on a workflow where (1) data are stored in multiple files that can be hosted remotely (e.g., on S3-compatible storage), (2) queries are processed batchwise and in parallel allowing for efficient processing of bigger- than-memory geospatial data and (3) results can be passed without copying to Rust, Python, or other R packages for further analysis.

Talk materials are available at https://github.com/rstudio/rstudio-conf/blob/master/2022/deweydunnington/Accelerating%20geospatial%20computing%20using%20Apache%20Arrow%20-%20Dewey%20Dunnington.pdf.

Speakers
avatar for Dewey Dunnington

Dewey Dunnington

Voltron Data
Dewey Dunnington (Ph.D., P.Geo.) is an environmental researcher, programmer, and educator based in Nova Scotia, Canada. He recently completed his Ph.D. in lake sediment geochemistry and is currently an R Developer at Voltron Data working on all things Apache Arrow + R... Read More →


Thursday July 28, 2022 10:50am - 10:55am EDT
2. Potomac D

10:50am EDT

How to be a pollinatoR
R users are part of data ecosystems comprising both statistical and non- statistical applications. We may work with SAS or Stata datafiles; non-R users may help run R scripts; or we may need to generate outputs in Word or Excel. Just as pollinators support biodiversity, we believe R users can be constructive members of diverse data ecosystems. Our talk will: (1) outline what it means to be constructive, (2) highlight tools that can help R users contribute to their ecosystems, and (3) describe practices that can improve workflows involving diverse groups of staff and software. We hope our talk will inspire R users to think creatively and empathetically about how R can be a force for good in diverse data ecosystems.

Talk materials are available at https://rsconnect.norc.org/rstudioconf-pollinator.

Speakers
avatar for Kiegan Rice

Kiegan Rice

NORC at the University of Chicago
Kiegan is a Statistician at NORC at the University of Chicago specializing in static and interactive data visualization, reproducible data analysis pipelines, and scientific communication. She applies her skills in multiple research areas at NORC including education, healthcare, and... Read More →
avatar for Weihuang Wong

Weihuang Wong

NORC at the University of Chicago
Weihuang (Wei) Wong is a Senior Research Methodologist at NORC, where he develops tools that help researchers and analysts produce high quality and timely research. At NORC, Wei has worked on projects across the education and health sectors. He earned his PhD in political science... Read More →


Thursday July 28, 2022 10:50am - 11:10am EDT
3. Cherry Blossom

10:50am EDT

The worlds smallest R environment? Running R on a $15 computer
Is it possible to do meaningful work in R on a $35 computer? How about a $15 computer? And what does that mean for education, data science, and computing on small devices? In this talk, Mark introduces the R4Pi.org project, its background and why on earth you'd even want to run R at the opposite end of "scale" from everyone else.

Speakers
avatar for Mark Sellors

Mark Sellors

R4Pi.org
A life-long technologist, Mark has always been fascinated by the intersection of people and technology. He is the founder of the R4Pi project, an open-source project aimed at making it easier to use R on the Raspberry Pi family of computers. Mark is also a Solutions Engineering Manager... Read More →


Thursday July 28, 2022 10:50am - 11:10am EDT
4. National Harbor 10+11

10:50am EDT

Achieving a seamless workflow between R, Python and SAS from within RStudio
Some of my best friends use Python...and all of my coworkers use SAS.

Statistics Canada is the official statistical agency of Canada and employs over 6,000 employees. Statistics Canada has a legal obligation to ensure that personal information collected for statistical purposes is kept strictly confidential. An internal system which prevents the release of confidential information is only implemented in SAS. As such, many Analysts and Data Scientists at Statistics Canada must use the SAS programming language as part of their workflow. It is therefore imperative to find ways to work with open source programming languages and SAS seamlessly. I will present a method for achieving a harmonious workflow between R, Python, and SAS, all within RStudio.

Talk materials are available at https://github.com/melissavanbussel/rstudio-conf-2022.

Speakers
avatar for Melissa Van Bussel

Melissa Van Bussel

Analyst, Statistics Canada
Melissa Van Bussel is an accredited Associate Statistician (A. Stat.) and an Analyst at Statistics Canada, Canada's official statistical agency. She completed her M. Sc. in Statistics at Carleton University, and her B. Sc. H. in Mathematics and Computing Systems with a Specialization in Statistics at Trent University. Melissa is passionate about all things Data Science and Teaching (and... Read More →


Thursday July 28, 2022 10:50am - 11:10am EDT
1. Potomac C

10:55am EDT

Zero-setup R workshops with GitHub Codespaces
If you've ever tried to run a workshop using R, you'll be aware of the challenges of getting everyone's laptop set up to able to run your R scripts, Rmarkdown documents, or Jupyter Notebooks without errors.

What if you could host a workshop using R that required no setup from the participants at all? With GitHub Codespaces, a GitHub repository becomes a cloud-based engine for running R in a container with a single click. Every participant, regardless of the power, configuration or operating system of their laptop will have the same experience, all with NO setup in advance.

In this talk, I'll describe the process and share tips for setting up a GitHub repository for an R-based workshop to take advantage of GitHub Codespaces.

Talk materials are available at https://github.com/rstudio/rstudio-conf/blob/master/2022/davidsmith/Zero%20Setup%20Workshops%20RStudioConf%202022%20-%20David%20Smith.pdf.

Speakers
avatar for David Smith

David Smith

Cloud Advocate, Microsoft
Ask me about R at Microsoft, the R Consortium, or the Revolutions blog.


Thursday July 28, 2022 10:55am - 11:00am EDT
2. Potomac D

11:00am EDT

Making awesome automations with GitHub Actions
This talk is an introduction to GitHub Actions (GHA), which is a feature from GitHub that allows us to automate several tasks in R. In this presentation, I aim to answer these questions: "What is GitHub Actions? How can I run R Scripts with it?". I will list supplementary materials that are helpful to learn how to start automating tasks in R projects and packages.

Talk materials are available at https://beamilz.com/talks/en/2022-rstudio-conf/.

Speakers
avatar for Beatriz Milz

Beatriz Milz

Curso-R, R-Ladies São Paulo and University of Sao Paulo
PhD Candidante in Environmental Science in the University of Sao Paulo, teacher at Curso-R, Tidyverse certified instructor by RStudio and co-organizer in local R communities, such as [R-Ladies Sao Paulo](https://twitter.com/RLadiesSaoPaulo), [satRday Sao Paulo](https:// twitter.c... Read More →


Thursday July 28, 2022 11:00am - 11:05am EDT
2. Potomac D

11:05am EDT

{shinyslack}: Connecting Slack Teams to Shiny Apps
Slack is a useful communication tool for communities and businesses. Integrating it with Shiny can make it even more useful. I'll show how my {shinyslack} package allows for connections between Shiny apps and Slack teams. I’ll show how users can wrap their existing Shiny UIs to quickly and easily add Slack authentication to their Shiny apps. I'll also demonstrate how we’ve used {shinyslack} to manage and streamline the R4DS Online Learning Community, and how other R users might make the most of their Slack workspaces.

Speakers
avatar for Jon Harmon

Jon Harmon

Admin/Principal Data Scientist, R4DS Online Learning Community/Macmillan Learning
I created the Content Science team at Macmillan Learning. We're interested in applying modeling techniques (including various NLP techniques and performance analysis such as IRT) to learn about and improve our educational content.I am also the volunteer administrator of the R4DS Online... Read More →



Thursday July 28, 2022 11:05am - 11:10am EDT
2. Potomac D

11:10am EDT

leafdown: Interactive multi-layer maps in Shiny apps
Interactive maps are indispensable tools for exploring spatial datasets because of their real-world context and intuitiveness. For a comprehensive understanding of the data, it is often necessary to switch between several map layers (such as states and counties) and to analyze multiple variables simultaneously - both of which are challenging. In this talk, I will show how we can overcome these challenges using the leafdown package, which allows us to create multi-layer maps embedded in Shiny apps.

Talk materials are available at https://github.com/rstudio/rstudio-conf/blob/master/2022/andreashofheinz/leafdown_presentation%20-%20Andreas%20H.pdf.

Speakers
avatar for Andreas Hofheinz

Andreas Hofheinz

d-fine
Andreas is a consultant at d-fine and maintainer of the leafdown R package. He is passionate about interactive data visualizations and interpretable machine learning. He holds a B.Sc. in Economics and an M.Sc. in Statistics from LMU Munich.


Thursday July 28, 2022 11:10am - 11:15am EDT
2. Potomac D

11:10am EDT

Enterprise-Level Data Science Success
Enterprise-Level Data Science Success includes many factors beyond the nuts and bolts of core data science work. It is not just about data, databases, data wrangling and model outputs anymore! The ‘soft’(art) aspect of establishing a successful data science practice in a large organization includes building the right partnerships with various stakeholders within the enterprise, hiring and growing teams and people management, being a tireless advocate for value/ strategy creation with the use of data and analytics, fighting for the right level attention with executive leadership and keeping the focus of the data science teams business-oriented goals. This talk will touch on these above topics in the data science journey at Flex.

Speakers
avatar for Prabha Thanikasalam

Prabha Thanikasalam

Sr. Director, Data Science and Advanced Analytics, Flex
I lead a data science, analytics, and solutions team at Flex.My main role, passion is creating value with data and analytics within the organization.Please talk to me about use cases on supply chain, manufacturing, and finance topics/. I am excited about the upcoming RStudio conference... Read More →



Thursday July 28, 2022 11:10am - 11:30am EDT
3. Cherry Blossom

11:10am EDT

A touch of R in Robotics
Over the years, the R community has experienced an increase in the number, diversity and domain background of users. However, incorporation of R in the field of Robotics is still not fully exploited even though the actuation of manipulators such as robotic arms is based on streams of input data gathered from sensors. With R constantly evolving and redefining how to turn data into actionable insights, any domain that produces data now has the potential to use R. What if this actionable insight means actuating a robotic arm? The aim of this talk is to connect two previously separated worlds: robotic application development and the data manipulation capabilities of R by demonstrating how to actuate a Robotic arm using R and Arduino.

Speakers
avatar for Ian Muchiri

Ian Muchiri

Dedan Kimathi University of Technology
I am a co-founder and co-organizer of DekutR data science club at Dedan Kimathi University of Technology. Also, I am an early career researcher in the field of Internet of Things, Data Science and Electronics (Embedded System Design).
avatar for Eric Wanjau

Eric Wanjau

Leeds Institute for Data Analytics
Eric is an Early Career Researcher at the Leeds Institute for Data Analytics (LIDA) in the University of Leeds. His research interests span domains such as robotics, computer vision, and signal processing. Eric is a co-founder and co-organizer of DekutR data science club at Dedan... Read More →


Thursday July 28, 2022 11:10am - 11:30am EDT
4. National Harbor 10+11

11:10am EDT

Yes, you can use Python with RStudio Team!
Are you a bilingual data scientist who wants to use Python in RStudio products? Or do you have colleagues who use Python for data science? In this talk, we will present an end to end data science use case including automated jobs, model training and monitoring, APIs, and interactive dashboards, entirely built and deployed in Python with RStudio Pro products.

Speakers
avatar for Xu Fei

Xu Fei

RStudio
Xu Fei is a solutions engineer at RStudio where he helps bilingual data science teams work better with RStudio commercial products. He has experience in Python data science and reproducibility.
avatar for Gagandeep Singh

Gagandeep Singh

RStudio
Gagandeep works as a Solutions Engineer with RStudio. He is a former software engineer and data scientist who has worked in a variety of cross-technology teams. Before joining RStudio, he was consulting with fortune 500 companies to build their analytical capacities. Outside of work... Read More →


Thursday July 28, 2022 11:10am - 11:30am EDT
1. Potomac C

11:15am EDT

Say Hello! to Multilingual Shiny Apps
Multilingual shiny apps are not straightforward to build. Translation affects almost every single aspect of an app. Although there are a few packages designed to automate the translation process, they tend to only work for the most widely spoken languages.

Using a bilingual English-Welsh shiny app we developed to present public health data as a case study, this talk will discuss:

  • how we built a multilingual shiny app;
  • how translation affected design decisions;
  • how we overcame the main issues faced;
  • and most importantly, what we'd do differently next time.

By the end of this talk, you will have a better understanding of how to translate your Shiny app to help you to share your app with a much wider audience.

Talk materials are available at https://nrennie.rbind.io/talks/2022-july-rstudio-conf/.

Speakers
avatar for Nicola Rennie

Nicola Rennie

Jumping Rivers
Nicola Rennie is a statistician and data scientist, passionate about using R to solve problems. She currently works as a data scientist at Jumping Rivers having previously completed her PhD in statistics and operational research at Lancaster University. She enjoys sharing her knowledge... Read More →


Thursday July 28, 2022 11:15am - 11:20am EDT
2. Potomac D

11:20am EDT

Let's start at the beginning - bits to character encoding in R
Attendees will recieve a broad overview of the encoding and decoding process in the human-to-computer loop, how bits are used, and the math that gets us to common bit values. A brief history of ASCII, Latin-1, and UTF-8 will be provided as well.

Attendees will also be exposed to how character encoding works in R and in the tidyverse.

Talk materials are available at https://github.com/rstudio/rstudio-conf/blob/master/2022/alexfarach/bits_to_character_in_R_RSTUDIO%20-%20Alex%20F.pdf.

Speakers
avatar for Alex Farach

Alex Farach

Accenture Federal Services
Alex Farach is an Analytics Manager and Data Scientist in the Applied Intelligence Discovery Lab at Accenture Federal Services (AFS) where he helps clients build productive relationships with their data. Prior to joining AFS as a data scientist in 2019 Alex worked as an economist... Read More →


Thursday July 28, 2022 11:20am - 11:25am EDT
2. Potomac D

11:25am EDT

Implications of R syntax in intro stats
This talk reports on a head-to-head comparison of the formula and tidyverse syntaxes in a full semester introductory statistics course, providing data to help guide other instructors in their pedagogical decision-making. The formula version of the class used the mosaic package for summary statistics, ggformula for graphics, and base functions such as t.test for inference. The tidyverse section used base functions inside summarize() calls for summary statistics, ggplot2 for graphics, and functions such as infer::t_test for inference. Analysis of materials allows us to determine the number of functions students were exposed to in each section, which functions they actually used, and how much time they spent on their assignments in each class.

Speakers
avatar for Amelia McNamara

Amelia McNamara

University of St Thomas
Amelia McNamara is an assistant professor of statistics at the University of St Thomas, in St Paul, MN. Her research interests include statistics education, statistical computing, and the intersection of the two. You may know her as "the woman with the tidyverse dress."


Thursday July 28, 2022 11:25am - 11:30am EDT
2. Potomac D

11:30am EDT

Comparing package versions with Diffify
Even when we run the simplest of R scripts, we are using dozens of R packages. We use packages for data cleaning, writing reports, graphics and modelling. One of the strengths of R, is the depth of packages.

Unfortunately, packages change and break our code. Not all R packages have NEWS file, and even those that do, it might not be complete.

The diffify service aims to make comparing between package versions easier. For example, is there a new Import? Or perhaps a package has been removed from Suggests? Maybe the arguments of a function have changed? Or a function is no longer exported. Diffify can help.

NB: We have completed the back-end infrastructure, and are currently working on the front-end. Expected launch: ~May 1st

Talk materials are available at https://github.com/rstudio/rstudio-conf/blob/master/2022/colingillespie/2022-07-27_rstudio-conf%20-%20Colin%20Gillespie.pdf.

Speakers
avatar for Colin Gillespie

Colin Gillespie

CTO, Jumping Rivers
Colin has been using R since 1999. He's the author of a number of R packages, the co-founder of Jumping Rivers and has published the book Efficient R Programming with O'Reilly.


Thursday July 28, 2022 11:30am - 11:35am EDT
2. Potomac D

11:30am EDT

Cross-Industry Anomaly Detection Solutions with R and Shiny
This session highlights two anomaly detection use cases in production: identification of problematic life sciences manufacturing units and identification of significant newsworthy events. With both solutions, Shiny is integrated with live data to provide early detection for proactive intervention. Shiny’s intuitive user interface also allows for interaction with the data behind anomalies to uncover potential causes and paths to action or resolution.

The session also briefly highlights a rapid prototyping development approach with Shiny. This technique allows for collaborative refinement of the underlying anomaly detection model in R, quickly incorporating user feedback, where end users may not have in-depth machine learning knowledge.

Talk materials are available at https://docs.google.com/presentation/d/e/2PACX-1vTE7Ee2QIUGDUmfEKmF8l_WTQPVgnGaLJLGuuMquio57bXojeeb5YYSjuzO-xzYxMHxuX2cm_QNC2y-/pub?start=false&loop=false&delayms=60000&slide=id.gbb68c6dbe2_1_44.

Speakers
avatar for Tanya Cashorali

Tanya Cashorali

Founder, TCB Analytics
Tanya Cashorali is the founder of TCB Analytics, a data and analytics consultancy. She leads a world-wide community network of 400 data enthusiasts, has helped universities launch data science programs, and is a frequent speaker at tech conferences. 


Thursday July 28, 2022 11:30am - 11:50am EDT
3. Cherry Blossom

11:30am EDT

An Introduction to the Apple Health Export
After a visit to the ER, I discovered an ocean of personal data: more than 3 million rows of data about one of my favorite subjects: me. My watch averages almost 5,000 measurements a day. That experience led me to explore the Apple Health Export in more detail. It’s the largest dataset I’ve ever worked with. The variety of sources and the quantity data can be overwhelming. I will describe the basic measurements that are available and some of the things one needs to be on the lookout for. A ggplot of a full day of heart rate data provided me (and my doctor) important context on two occasions when I went to the ER. Fortunately there’s no great medical drama involved, but the plots are pretty neat and actually turned out to be helpful.

Talk materials are available at https://www.johngoldin.com/blog/2022-07-notes-apple-health-export/.

Speakers
avatar for John Goldin

John Goldin

Yale University
John Goldin spent his career as a data analyst for the Yale University administration. He was the Director of Institutional Research for over 20 years until retirement in 2011. Since then he has maintained some involvement with his old office, although each year he becomes more thoroughly... Read More →


Thursday July 28, 2022 11:30am - 11:50am EDT
4. National Harbor 10+11

11:30am EDT

Running Shiny without a server
Shiny for Python has the ability to be deployed without a server that runs Python. These applications can be served by any web hosting service. They are easily deployed and can scale to handle large amounts of traffic.

This mode of deployment is called Shinylive. It works by running Python in the user’s web browser, using Pyodide, which is a version of Python that is compiled to WebAssembly. Shinylive opens up new possibilities for how Shiny applications can be used. In this talk I discuss the advantages and the limitations of Shinylive applications.

Speakers
avatar for Winston  Chang

Winston Chang

RStudio
Winston Chang is a software engineer at RStudio and currently works on Shiny and related projects. He has also worked on several other R packages, including devtools, R6, and ggplot2. Winston is the author of the R Graphics Cookbook, published by O'Reilly Media, and has a PhD in psychology... Read More →


Thursday July 28, 2022 11:30am - 11:50am EDT
1. Potomac C

11:35am EDT

Visualizing distributions and uncertainty using ggdist
I propose a talk on visualizing distributions and uncertainty using {ggdist}. I will describe how to think systematically about distributional visualization as mappings of PDFs, CDFs, and quantile functions onto aesthetics, and how support for this enables creative and easy exploration of the space of possible uncertainty visualizations. I will highlight features like true gradient support in R 4.1, support for distribution vector datatypes, and the automatic binwidth- selecting geom_dots(). I expect to leave the audience with: (1) a systemic way to think about visualizing distributions and uncertainty in the grammar of graphics and (2) an understanding of how to actually do it using ggdist.

Talk materials are available at https://www.mjskay.com/presentations/rstudio-conf-2022-talk.pdf.

Speakers
avatar for Matthew Kay

Matthew Kay

Assistant Professor, Northwestern University
Matthew Kay is an Assistant Professor jointly appointed in Computer Science and Communications Studies at Northwestern University. He works in human-computer interaction and information visualization, including uncertainty visualization and the design of human- centered tools for... Read More →


Thursday July 28, 2022 11:35am - 11:40am EDT
2. Potomac D

11:40am EDT

The Future of missing data
If you do data analysis, you encounter missing data. Missing data upsets data analysis workflow because you have to make decisions on how to deal with it - do you impute the values? Remove them? These each have consequences! The data we often encounter does not always arrive with a research question in mind, so how do you understand why you have missing values? When I first encountered missing data I was incredibly frustrated at how hard it was to understand and explore it. This frustration led me to create two R packages to explore missing data, {naniar} and {visdat}. In this talk I will showcase how to use these tools to explore missing data, as well as new features that have not been presented, and planned advances.

Talk materials are available at https://github.com/rstudio/rstudio-conf/blob/master/2022/nicholastierney/The%20Future%20of%20NA%20Data.pdf.

Speakers
avatar for Nicholas Tierney

Nicholas Tierney

Telethon Kids Institute, Perth, Australia
Nick Tierney has an honours degree in Psychology, and a PhD in Statistics and now work as a research software engineer with [Nick Golding](https://www.telethonkids.org.au/contact-us/our- people/g/nick-golding/) at the [Telethon Kids Institute](https:// www.telethonkids.org.au/), in... Read More →


Thursday July 28, 2022 11:40am - 11:45am EDT
2. Potomac D

12:00pm EDT

Lunch
See you in the Maryland ballroom for lunch.

Buffet & boxed lunch options. Outdoor seating available.

Thursday July 28, 2022 12:00pm - 1:30pm EDT
5. Maryland

12:00pm EDT

Birds of a Feather - ML/AI with Amazon SageMaker
Accelerate Your Team with Fully Managed RStudio on Amazon SageMaker In this joint session with Amazon Web Services (AWS) and RStudio PBC, we demonstrate the first fully managed RStudio integrated development environment in the cloud. R developers can dial up compute from within the same interface reducing interruptions to work and improving productivity. We cover the complete ML lifecycle including model training, workflow orchestration, and model hosting.

Brought to you by AWS

🍽 Buffet & boxed lunch options.

👍 We've striven to provide sufficient space for everyone interested, but please be aware seatting is limited.

Thursday July 28, 2022 12:00pm - 1:30pm EDT
Maryland 1, 2, & 3

12:00pm EDT

Birds of a Feather - UseR Group Organizers
Mingle for lunch with fellow UseR, R-Ladies, and Community Group Organizers.

Whether you help organize a group already or a keen to get a community started, join like-minded folks for lunch. Look for the some special packs to take home to your groups, while supplies last.

🍽 Buffet & boxed lunch options.

🪑 We've striven to provide sufficient space for everyone interested, but please be aware seatting is limited.

Thursday July 28, 2022 12:00pm - 1:30pm EDT
Maryland 4, 5, & 6

12:00pm EDT

Birds of a Feather Lunch - Variegated Groups
Numerous birds of a feather groups are meeting at this time.

Look for the bird-themed signs and table tents.

Groups include,

  • Spatial/Maps
  • Sports Analytics
  • R-Administrators / DevOps
  • Graph / Network analysis
  • Natural language processing
  • Environmental

Thursday July 28, 2022 12:00pm - 1:30pm EDT
5. Maryland

1:30pm EDT

A Journey to Data Science: Tools for Equity and Diversity in STEM
Data suggests that less than 3% of data scientists are women of color. My journey and that of many other women who fall at the intersection of being underrepresented in this field is one that is often filled with pitfalls and curves that can result in continued exclusion in these spaces. My journey as a historically excluded double minority in STEM is one of triumph, but not without the community educational tools, online platforms and genuinely inclusive members of the R Studio and open science communities. My work explores how we can build on these tools and further propel the use of open science as a tool for equity and diversity in STEM, now more than ever.

Speakers
avatar for Ileana Fenwick

Ileana Fenwick

Openscapes & UNC Chapel Hill
Ileana Fenwick is an open science advocate and Marine Sciences Ph.D. Student at UNC Chapel Hill . Ileana's research focuses on evaluating how marine communities respond to climate change and human impacts. Her work uses innovative quantitative approaches to improve our ocean management... Read More →


Thursday July 28, 2022 1:30pm - 1:50pm EDT
3. Cherry Blossom

1:30pm EDT

Cultivating Your Own R Ecosystem as a Solo Contributor
It can be daunting to start using R when no one else in your office is! Using a case study from an administrative higher education office, learn how you can begin to build your own R ecosystem, step by step, to increase the efficiency and impact of your work, even as a solo contributor. Start from scratch and get small wins by replacing common Excel tasks with reproducible code, and then continue to develop iteratively, incorporating more of R’s capabilities into your workflow until you’re humming along with internal packages and parameterized reporting. We’ll discuss how R can ease the burden of documentation and how to handle common challenges like when you can’t control how you get your data or in which tool it is ultimately presented.

Talk materials are available at https://meghan.rbind.io/talk/rstudioconf/.

Speakers
avatar for Meghan Hall

Meghan Hall

Brown University
Meghan Hall has spent over seven years as a data professional in higher education. She also has experience in sports analytics, as an ongoing contributor to the public community and previously as a data scientist at Zelus Analytics. She is dedicated to R education, particularly for... Read More →


Thursday July 28, 2022 1:30pm - 1:50pm EDT
4. National Harbor 10+11

1:30pm EDT

Model Migration: from Excel to R
Many companies within the insurance and financial sector maintain a large number of legacy models in the platform Microsoft Excel. With the advent of data analytics, many of these same companies are now looking to migrate their existing models from an Excel environment into an R environment. This process, however, is not foolproof. I should know as I have had to do it several times! Having made many mistakes myself, I would like to provide some tips to help future analysts and companies outside of the technology industry overcome some common pitfalls that they may encounter during the model migration process.

Speakers
avatar for Johnny Breen

Johnny Breen

Tokio Marine Kiln
Currently working as Portfolio Optimisation Lead at Tokio Marine Kiln, I am passionate about enhancing the computational aspects of actuarial work in the insurance industry and the London market. My interest in R began after completing a data science project during a master's programme... Read More →


Thursday July 28, 2022 1:30pm - 1:50pm EDT
1. Potomac C

1:30pm EDT

A new way to build your Shiny app's UI
Shiny helps data scientists create web applications without requiring web development experience. However, there's still a steep learning curve for writing the Ui portion of an app. Unlike an app's server code, which typically only uses a few different Shiny-specific functions, the app's UI code is almost entirely made up of functions with strange syntaxes and often surprising interactions with each other. This paradigm requires a very different mindset than your typical R Script. This talk introduces a new tool to help both new and experienced Shiny developers spend less time writing their app's UI and more time doing data science.

Speakers
avatar for Nick Strayer

Nick Strayer

Software Engineer, RStudio
Nick is an engineer on the Shiny team working to make web applications with R easy to build, beautiful, and powerful. Nick has a Ph.D. in Biostatistics from Vanderbilt University and previously worked as a data journalist at the New York Times.


Thursday July 28, 2022 1:30pm - 1:50pm EDT
2. Potomac D

1:50pm EDT

Building Accessible Lessons with R and Friends
The Carpentries is a global community of volunteers who collaboratively develop and deliver lessons to build capacity in data and coding skills to researchers worldwide. In the recent redesign of our lesson infrastructure (serving >100 lessons, used daily by >5K learners), we replaced embedded Jekyll templates with a workbench of modular and accessible packages using R and Pandoc. By leveraging renv and knitr for R-based lessons, we provide a seamless and collaborative lesson development experience that maximizes reproducibility and minimizes frustration so authors can focus on the contents, not the tooling. We demonstrate how anyone can use our infrastructure to build customised and accessible sites for their own lessons or tutorials.

Talk materials are available at https://github.com/zkamvar/rstudio-conf-2022.

Speakers
avatar for Zhian N. Kamvar

Zhian N. Kamvar

Lesson Infrastructure Technology Developer, The Carpentries
Zhian is the Lesson Infrastructure Technology Developer for The Carpentries, brining data science skills to researchers worldwide. He has been publishing R packages since 2013 and is a recovering bioinformatician located in Beaverton, Oregon, USA. Outside of R, he enjoys bicycle... Read More →


Thursday July 28, 2022 1:50pm - 2:10pm EDT
3. Cherry Blossom

1:50pm EDT

Demystifying the art of creating custom libraries for your organization
Demystifying the Art of creating custom Libraries for your organization. Imagine a world where a company has its own R library, this stores the most common functions an analytical team uses. From connecting to the Database to reading in SQL from a SQL file to creating custom graphs that only take in a few inputs. In today's world, we find that code is siloed on many Analyst machines. Even though the same code is being used over and over again by the same person or multiple people carry on with this architecture. We scramble to figure out where we save that nice code chunk so that we can use it on another package. Instead, we can all leverage creating our own libraries that store and document the cool features we've created.

Speakers
avatar for Dan Caley

Dan Caley

Custom Ink
Dan Caley is a Senior Data Analyst at Custom Ink covering descriptive, inferential, and predictive analytics. He has over 10 years experience in analytics, finding value in data for traditional finance companies like Freddie Mac and in the crypto space for Ethos and Huddl. Once at... Read More →


Thursday July 28, 2022 1:50pm - 2:10pm EDT
4. National Harbor 10+11

1:50pm EDT

Tidy Transit: Real Life Data Modeling for Public Transportation
California Integrated Travel Project’s mission is to make transit across California simpler and more affordable. As part of this, we created an open source data warehouse to allow easy analysis of the travel data people often interact with every day.

In this talk we’ll discuss two big challenges we faced:

Creating tidy representations of daily schedules and payments data across 200 transit agencies. Enabling people with a range of backgrounds (R, SQL, and python) and experience to quickly analyze the data.

Tidy data allowed us to turn equal focus on agencies running a single bus, and those serving entire metropolitan areas.

Speakers
avatar for Hunter Owens

Hunter Owens

California Department of Transportation
Hunter Owens does data science in the public interest. Current at Caltrans, he has worked at the City of Los Angeles, the Center for Data Science and Public Policy, and Obama for America. I spend a lot of time thinking about cities, data pipelines and tacos.


Thursday July 28, 2022 1:50pm - 2:10pm EDT
1. Potomac C

1:50pm EDT

Designing for people is hard
Shiny users can prototype an app in minutes, but... What if it also looked better almost immediately? In this talk, we'll walk through a new look for Shiny's UI, as well as some new functionality.

Speakers
avatar for Greg Swinehart

Greg Swinehart

RStudio, PBC
Greg has contributed to RStudio's brand, print and web designs since 2012. His credits include rstudio.com, many hex stickers, and countless shirts and trade show booths. While he often claims to have ghostwritten Hadley Wickham's books, that is probably not true. Greg is currently... Read More →


Thursday July 28, 2022 1:50pm - 2:10pm EDT
2. Potomac D

2:10pm EDT

Everything I learned about community building, I learned from growing up in a restaurant
In community conversations at the Data Science Hangout, we’ve talked about misalignment between what recruiters are looking for and who is actually a great fit for a role. At one point, people started sharing their own degrees in the chat: sociology, political science, finance, psychology, no degree, etc.

What makes us good at our jobs is so much more than the degree that we have.

I’m not a data scientist. When I think about my own role though, much of my intuition comes from growing up in a restaurant and watching my mom create a community among customers.

This talk will share a few community building tips to get you started and perhaps a little inspiration to consider the other skills that qualify you for what you want to do.

Speakers
avatar for Rachael Dempsey

Rachael Dempsey

Customer Success, RStudio
Rachael is a community leader at RStudio with a passion for customer experience and making people happy. She enjoys connecting people, spending time with family and friends, being outdoors, and any opportunity to find live music.


Thursday July 28, 2022 2:10pm - 2:30pm EDT
3. Cherry Blossom

2:10pm EDT

You should be using renv
Dependencies don't have to be hell. In this talk we'll discuss how renv makes it easier to diagnose problems, move projects between environments, and collaborate with other R users.

Speakers
avatar for E. David Aja

E. David Aja

RStudio
E. David Aja is a Solutions Engineer at RStudio. He helps data scientists and IT communicate about their needs. Before joining RStudio, he worked as a data scientist in the public sector.


Thursday July 28, 2022 2:10pm - 2:30pm EDT
4. National Harbor 10+11

2:10pm EDT

Digging a Pit of Success for Your Organization: Embracing a R-based ecosystem in the US federal government
How does a US federal agency analyze tens of millions of records across 30,000 sites in over 50 countries analyze these data efficiently and effectively? Five years ago, our team ventured beyond the confines of (largely) Excel and towards leveraging R to be more efficient in our analysis and workflows since data is instrumental to the mission of ending the global HIV epidemic. We have created our own 'pit of success', providing analysts the infrastructure and support needed to ease the learning of and working with R in our specific context. We will be sharing our experiences in digging this 'pit of success' that organizations would benefit from.

Talk materials are available at https://speakerdeck.com/achafetz/digging-a-pit-of-success-for-your-organization.

Speakers
avatar for Aaron Chafetz

Aaron Chafetz

Senior Economist, US Agency for International Development
Aaron Chafetz is a Senior Economist at the U.S. Agency for International Development (USAID) in the Office of HIV/AIDS, where he specializes in quantitative analysis of global health data. Most of his work is concentrated in analyzing large quantities of data from health facilities... Read More →


Thursday July 28, 2022 2:10pm - 2:30pm EDT
1. Potomac C

2:10pm EDT

Creating a Design System for Shiny and RMarkdown
A design system is a set of standards to manage design at scale by reducing redundancy while creating a shared language and visual consistency across different pages and channels. This talk will go into detail about the history of design systems, what components to include when creating a design system, and lastly various examples of implementation. By the end of the talk, you should have the tools to create your own design system within Shiny/RMarkdown to be used by yourself, your brand, or your company.

Speakers
avatar for Maya Gans

Maya Gans

Atorus Research
Maya Gans is an experienced R and JavaScript developer, who recently started applying those skills to the clinical trial space. After graduating with her Masters in Botany from University of Wyoming, Maya interned at RStudio where she created a blocks based programming language called... Read More →


Thursday July 28, 2022 2:10pm - 2:30pm EDT
2. Potomac D

2:30pm EDT

We R KaggleRs - At the Intersection of Data Science Communities
In this presentation, I will share my experiences at the intersection of the R and Kaggle communities. As Kaggle's first Notebooks Grandmaster, I will talk about how R users can contribute and thrive in an increasingly Python dominated community. A strong focus will be on the art of building insightful and engaging Kaggle notebooks and exploratory data analyses in general. I will share my lessons from running a weekly series aimed at promoting underrated Kaggle notebooks and their authors. And I will talk about organising a Kaggle insights competition aimed at exploring this dataset of 100 episodes, and what it taught me about community dynamics, different approaches to exploratory challenges, and the creativity of the participants.

Speakers
avatar for Martin Henze

Martin Henze

Data Scientist, YipitData
Data Scientist @ YipitData | Kaggle Grandmaster | PhD Astrophysicist


Thursday July 28, 2022 2:30pm - 2:50pm EDT
3. Cherry Blossom

2:30pm EDT

Small Team Large Organization: Building Impactful Shiny Dashboards at NIH
It can be difficult for small teams to make an impact in large organizations. In this talk I will discuss how my small team, at the National Institute of Allergy and Infectious Diseases (NIAID) has used Shiny to increase efficiency, enhance collaboration, and track biomedical research funding. In light of the recent pandemic, and the ongoing threat of emerging infectious diseases, our ability to understand a complex research portfolio and adapt quickly to changing research priorities has become increasingly important. I will highlight how automated workflows and Shiny dashboards have allowed my team to become more efficient and enabled us to have an outsized impact on our organization.

Speakers
avatar for Jon Nye

Jon Nye

National Institutes of Health
Jon Nye is a Lead Health Science Policy Analyst at the National Institute of Allergy and Infectious Diseases (NIAID) at the National Institutes of Health. He is currently part of the Data Analytics and Research Branch which focuses on data integration, visualization, and building... Read More →


Thursday July 28, 2022 2:30pm - 2:50pm EDT
4. National Harbor 10+11

2:30pm EDT

Save an ocean of time: streamline data wrangling with R
My organization currently has over 250 oceanographic sensors deployed around the coast of Nova Scotia, Canada. Together, these generate around 4 million rows of data every year. I was shocked when I discovered my colleagues manually compiled, formatted, and analyzed these data using hundreds of Excel spreadsheets. This was highly time consuming, error prone, and lacked traceability. To improve this workflow, I developed an R package that reduced processing time by 95%. The package has since become integral to our data pipeline, including quality control, analysis, visualization, and report generation in RMarkdown. The resulting datasets have already proven invaluable to industry leaders looking to invest in Nova Scotia’s coastal resources.

Talk materials are available at https://github.com/dempsey-CMAR/2022_rstudio_conf.

Speakers
avatar for Danielle Dempsey

Danielle Dempsey

Centre for Marine Applied Research
Danielle Dempsey is a Research Scientist and resident “R nerd” at the Centre for Marine Applied Research in Nova Scotia, Canada. Danielle enjoys developing code to automate repetitive tasks and improve workflows. She has written several R packages to facilitate ocean data wrangling... Read More →


Thursday July 28, 2022 2:30pm - 2:50pm EDT
1. Potomac C

2:30pm EDT

Dashboard-Builder: Building Shiny Apps without writing any code
I would like to create (more) Shiny Dashboards, but...

  • I don't know how
  • I can't write R code
  • it's too complex
  • I don't have enough time (even though I know how to build them)

If this sounds familiar, this talk is for you. We present our latest project, the dashboard-builder that allows users to create full Shiny dashboards without writing a single line of code. You can find a demonstration video here

This drag&drop dashboard-builder allows you to interactively create native Shiny dashboards. Lowering the entry barrier for new users starting their data-science journey. They can begin to visualize their datasets without prior knowledge of R. More experienced users can use the dashboard-builder to quickly sketch out their ideas and export them to act as a foundation for more complex dashboards.

Speakers
avatar for Peter Gandenberger

Peter Gandenberger

Peter is about to finish his Masters in Data Engineering & Analytics from the TU-Munich. Besides his studies, he has a passion for R- development, co-authoring the leafdown package. His latest project is the no-code/low-code dashboard-builder which allows users to create shiny dashboards... Read More →


Thursday July 28, 2022 2:30pm - 2:50pm EDT
2. Potomac D

3:20pm EDT

Becoming Creative: How I Designed a Quilt with R
When someone asks about essential skills for data careers, I often hear responses like R, Python, and machine learning. However, I argue that creativity is an underrated skill that you can and should practice. In this talk, I want to tell you a story about a project I did to stretch my creative brain and use my favorite tool, R. I designed a quilt in R using generative art ideas. Then I created individual blocks that make up the larger design. I used foundation paper piecing, a method that allows for intricate designs but has geometrical constraints. I hope my talk will entertain and inspire folks to exercise their creative muscles to improve their performance and enjoyment of their day jobs.

Talk materials are available at https://github.com/awalsh17/quiltR.

Speakers
avatar for Alice Walsh

Alice Walsh

Pathos, R-Ladies Philly
Alice is a computational biologist who applies data science tools to solve drug development challenges. She is currently VP of Translational Research at Pathos. Alice's prior professional experience includes roles at Janssen and Bristol Myers Squibb, including leading teams that worked... Read More →


Thursday July 28, 2022 3:20pm - 3:40pm EDT
3. Cherry Blossom

3:20pm EDT

Cracking open ggplot internals with {ggtrace}
The inner workings of {ggplot2} are difficult to grasp even for experienced users because its internal object-oriented (ggproto) system is hidden from user- facing functions, by design. This is exacerbated by the foreignness of ggproto itself, which remains the largest hurdle in the user-to-developer transition. However this needs not to be the case: ggplot internals have clear parallels to data wrangling, where data is passed between methods that take inputs and return outputs. Capitalizing on this connection, package {ggtrace} exposes the familiar functional programming logic of ggplot with functions that inspect, capture, or modify steps in a ggplot object's execution pipeline, enabling users to learn the internals through trial-and-error.

Talk materials are available at https://github.com/yjunechoe/ggtrace-rstudioconf2022.

Speakers
avatar for June Choe

June Choe

PhD Student, University of Pennsylvania
June is a second year Ph.D. student in linguistics at the University of Pennsylvania, studying various topics in language acquisition and human sentence processing. He uses R extensively for experimental and corpus research, and spends too much time making the perfect figures for... Read More →


Thursday July 28, 2022 3:20pm - 3:40pm EDT
4. National Harbor 10+11

3:20pm EDT

These are a few of my favorite things (about Quarto presentations)
Quarto is the next generation of RMarkdown, and comes with a new presentation format, revealjs. In this talk, I’ll show a few of my favorite things about making interactive HTML presentations with Quarto. Along the way you’ll learn about the visual editor, multiple columns, delivering an effective talk with speaker notes and mode, making your presentations pop with transitions, effective ways to incorporate code into your presentation, beautiful themes (and how to match your corporate style guide), and sharing it as pdf and HTML. I’ll finish off with a brief look at generating a powerpoint presentation from a template.

Speakers
avatar for Tracy Teal

Tracy Teal

RStudio
Tracy Teal has been working with open source communities as a developer, instructor and project leader throughout her career. As a PhD student at Caltech and then as an Assistant Professor in bioinformatics at Michigan State University, she saw that the bottleneck to discovery was... Read More →


Thursday July 28, 2022 3:20pm - 3:40pm EDT
1. Potomac C

3:20pm EDT

Introducing Rhino: Shiny application framework for enterprise
R and Shiny are ready for production use in Enterprise. As Appsilon, we have worked with tens of top global companies on their Shiny projects. Emphasis on UI, monitoring users’ behavior, and solid engineering principles made the difference. These are some of the practices that made projects impactful. We want to help ourselves and the community to leverage best practices that have worked for us so far. That’s why we built Rhino - the new framework for Shiny in Enterprise. Kamil Żyła is a senior software engineer at Appsilon and Rhino’s creator. He will share our experiences in ensuring the success of Shiny projects. You will see why you may want to try Rhino in your next (or current) project!

Speakers
avatar for Kamil Zyla

Kamil Zyla

Appsilon
Kamil is a Full Stack Engineer at Appsilon and a core developer of Rhino. He learned to code at the age of 15 and used to participate in programming competitions. He earned degrees in computer science and mathematics. His interests include programming language theory, software development... Read More →


Thursday July 28, 2022 3:20pm - 3:40pm EDT
2. Potomac D

3:40pm EDT

Building a ggplot2 rollercoaster: Creating amazing 3D data visualizations in R
Have you ever had so much fun building a data visualization that it felt like a thrill ride? What if your dataviz actually WAS a thrill ride—a 3D virtual reality rollercoaster through your dataset? In this talk, I’ll show you how to create incredibly engaging and informative 3D visualizations in R, and demonstrate what makes a good 3D dataviz. Using the rayrender package, you’ll see how you can interactively fly through 3D scenes in real-time, without ever leaving RStudio. Learn how you can use code to create 3D structures, plot data, and even make generative art: visualizing everything from the smallest molecule to the vast reaches of space, in only a few lines of code.

Speakers
avatar for Tyler Morgan-Wall

Tyler Morgan-Wall

Institute for Defense Analyses
Dr. Tyler Morgan-Wall is the developer of the rayverse: a collection of packages that includes the mapping and data visualization package rayshader, the high-quality 3D rendering package rayrender, and several other packages. He is also the developer of the experimental design and... Read More →


Thursday July 28, 2022 3:40pm - 4:00pm EDT
3. Cherry Blossom

3:40pm EDT

From summarizing projects to setting tags, uses of parsing R files
I'll walk through a few potential uses of parsing out the functions and packages in projects.

  • Creating a reference table: With so many #rstats learning materials out there, it's often helpful to parse-out the functions from a project and create a lookup table that complements your notes.
  • Analyzing files: A network visualization of the packages may provide insights as to which files or projects are most related to one another as well as which packages are most central to a body of work.
  • Setting tags: Picking good consistent tags for your blogdown website is hard. It's easier to just parse out the packages in each post and use those to organize your website.

Examples will use helpers from the new {funspotr} package: https://github.com/brshallo/funspotr/

Talk materials are available at https://github.com/brshallo/funspotr-rstudioconf2022.

Speakers
avatar for Bryan Shalloway

Bryan Shalloway

NetApp
Bryan is a Data Scientist at NetApp where he has worked on a wide range of internal operations problems in Customer Support, Forecasting, and Pricing Optimization. He also leads internal study groups on books like Feature Engineering and Selection and Interpretable Machine Learning... Read More →


Thursday July 28, 2022 3:40pm - 4:00pm EDT
4. National Harbor 10+11

3:40pm EDT

Literate Programming With Jupyter Notebooks and Quarto
Jupyter Notebooks play a critical role in in the workflow of many users. Notebooks are used to document existing code, to quickly prototype and iterate on ideas, and as a medium of technical communication. However, package developers typically use an entirely separate set of more traditional development tools, and the context switching between these tools and notebooks can be frustrating. Not only do you lose the ability to iterate fast, but you lose the ability to document and test your code in-situ, requiring you to create documentation and tests separately from source code.

Nbdev is a literate programming framework that allows you to develop Python libraries within Jupyter Notebooks. In this talk, Hamel will describe the integration between Nbdev amd Quarto, which enables library developers to author their documentation right alongside their code, and automatically produce a Quarto website for their package. The result is a seamless workflow for developing, documenting, and testing software packages all within Jupyter Notebooks, with no context-switching required.

Talk materials are available at https://github.com/fastai/nbdev-demo.

Speakers
avatar for Hamel Husain

Hamel Husain

fastai
Hamel Husain is head of Data Science and ML at Outerbounds (the developers of Metaflow). Previous to Outerbounds, Hamel has built ML infrastructure and deployed data products at Airbnb, GitHub, and DataRobot. Hamel is also very active in open source communities and is currently a... Read More →


Thursday July 28, 2022 3:40pm - 4:00pm EDT
1. Potomac C

3:40pm EDT

A Robust Framework for Automated Shiny App Testing
For production-grade Shiny applications, regression testing ensures that the application maintains its core functionality as new features are added to the app. With the help of various R and Python tools that programmatically interact with the UI and examine UI outputs, regression test logic can be represented programmatically and can run as often as needed. This gives the development team an opportunity to catch and fix bugs before they are pushed to production.

In this talk, I will introduce a framework for automated testing of Shiny applications both (1) during the development phase and (2) after the app is deployed. I will share a demo Shiny app along with relevant shinytest2 and Selenium code.

Speakers
avatar for Sydeaka Watson

Sydeaka Watson

Eli Lilly and Co
Dr. Sydeaka Watson earned a Ph.D. in Statistics from Baylor University and has several years of teaching experience. As a Research Assistant Professor in The University of Chicago Biostatistics Laboratory, she consulted with over 110 biomedical research teams, specializing in statistical... Read More →


Thursday July 28, 2022 3:40pm - 4:00pm EDT
2. Potomac D

4:00pm EDT

The Polygons of Another World - realtime interactive rendering in R
In this talk I want to explore R's capabilities for fast, interactive graphical applications. This exploration is driven by my ongoing port of the 1991 action adventure game "Another World", but these capabilities also open up possibilities for new visualisations and applications in R.

The porting of this game is a 'moonshot' project as I try to discover the techniques and tools needed for fast (>20fps) interactive (keyboard + mouse) rendering to R graphics devices. A further constraint is that I want all this to be done in plain R - avoiding any C or javascript as much as possible.

I will discuss three of the key challenges faced: graphics device speed, fast double-buffered rendering and event-driven programming for interactivity.

I will showcase the capability of R to render 5000 moving sprites using the nara package, an interactive drum machine with the eventloop package, and my progress with the 'Another World' game engine with animation, keyboard control and synchronised sound.

Talk materials are available at https://github.com/coolbutuseless/RStudioConf-2022.

Speakers
avatar for Mike Cheng

Mike Cheng

TBD. Will write this when I get the chance


Thursday July 28, 2022 4:00pm - 4:20pm EDT
3. Cherry Blossom

4:00pm EDT

An introduction to R7
The R7 package is a new OOP system designed to be a successor to S3 and S4. It has been designed and implemented collaboratively by the RConsortium Object- Oriented Programming Working Group, which includes representatives from R-Core, BioConductor, RStudio/tidyverse, and the wider R community.

In this talk, I'll introduce R7 to the wider world. Attendees will learn why we created R7 and how they can use it to create new classes and packages. I hope to inspire folks to download, try it out, and give us the feedback we need to make it better.

Speakers
avatar for Hadley Wickham

Hadley Wickham

Chief Scientist, RStudio
Hadley is Chief Scientist at RStudio, winner of the 2019 COPSS award, and a member of the R Foundation. He builds tools (both computational and cognitive) to make data science easier, faster, and more fun. His work includes packages for data science (like the tidyverse, which includes... Read More →


Thursday July 28, 2022 4:00pm - 4:20pm EDT
4. National Harbor 10+11

4:00pm EDT

Websites & Books & Blogs, oh my! Creating Rich Content with Quarto
A number of packages have emerged in the R ecosystem to help create websites (hugodown), books (bookdown), and blogs (blogdown). In this talk, I'll show you how Quarto handles creating these types of content and more. Whether you're a grad student considering your first blog post about R, or a python expert about to write a technical book, I'll show you how Quarto enables you to focus on content while it takes care of the rest. In addition, I'll show the RStudio team is making it as easy as possible to publish and share content within your organization and to the whole community.

Speakers
avatar for Devin Pastoor

Devin Pastoor

RStudio
Devin is a solutions engineer at RStudio. He has a PhD in Pharmaceutical Sciences and has focused his career on helping companies leverage technology to enable faster, better decisions.


Thursday July 28, 2022 4:00pm - 4:20pm EDT
1. Potomac C

4:00pm EDT

{shinytest2}: Regression testing for Shiny applications
Manually testing Shiny applications is often laborious, inconsistent, and doesn’t scale well. Whether you are developing new features, fixing bug(s), or simply upgrading dependencies, it is critical to know when regressions are introduced. The new {shinytest2} R package provides a toolkit for unit testing Shiny apps and seamlessly integrates with {testthat}. Under the hood, it uses the new {chromote} R package to render apps in a headless Chrome browser with features such as live preview and built in debugging tools. In this talk, you’ll learn how to test Shiny apps by simply recording your actions as code and extending it to test more particular aspects of your app, resulting in fewer bugs and more confidence in future development.

Talk materials are available at https://bit.ly/shinytest2-conf22.

Speakers
avatar for Barret Schloerke

Barret Schloerke

RStudio
hello()! Dr. Barret Schloerke is a Shiny Software Engineer at RStudio. He currently maintains and creates many R packages surrounding the Shiny ecosystem, including {shiny}, {shinytest2}, {reactlog}, and {plumber}.


Thursday July 28, 2022 4:00pm - 4:20pm EDT
2. Potomac D

4:30pm EDT

Data science training in communities with limited technology resources and opportunities
I'm so excited to be back with the data science community at rstudio::conf! Data science is a field that touches nearly every part of our modern lives - from the shows we watch, to our jobs, to the economy, to sports and entertainment. Moreover, modern technologies like cloud computing and access to increasing volumes of data have opened the door to incredible new opportunities for those with data skill. RLadies, Bioconductor, Data Carpentry, and RStudio have also done an incredible job of creating a supportive community for these new data scientists to flourish. But not everyone gets equal access to the training that can pull them into this field.

In this talk I will describe a collaborative effort to build DataTrail - a data science program created in partnership with local non-profits in Baltimore that seeks to spread awareness, knowledge, and opportunities into historically under-resourced and under-served communities. I will highlight the incredible work of the partners who made this training program possible, show examples of the achievements of DataTrail participants, discuss successes and challenges with the program, and talk about the rewards of being a part of community-based data science education. Finally, I will highlight opportunities to be a part of our DataTrail community or build a new DataTrail in yours.

Speakers
avatar for Jeff Leek

Jeff Leek

Fred Hutchinson Cancer Center
Jeff is Chief Data Officer, Vice President, and J Orin Edson Foundation Chair of Biostatistics at the Fred Hutchinson Cancer Center. Previously, he was a professor of Biostatistics and Oncology at the Johns Hopkins Bloomberg School of Public Health and co-director of the Johns Ho... Read More →


Thursday July 28, 2022 4:30pm - 5:30pm EDT
0. Potomac A+B

5:30pm EDT

R-Ladies Reception
Come mix and mingle with R-Ladies from DC and all over the world! There will be light refreshments. (And hex stickers!)

This event is open to R-Ladies who did not register for rstudio::conf.

Thursday July 28, 2022 5:30pm - 7:00pm EDT
Maryland 1, 2, & 3
 


Filter sessions
Apply filters to sessions.