Coding and Cookies

Automating data cleaning and analysis using R.

Welcome to Coding and Cookies

Facts: 

  1. Researchers are producing more data than ever before.
  2. It's literally impossible to analyze all of these data by hand.
  3. Research is really repetitive
  4. Automating data processing and analysis will streamline your research

The Coding and Cookies series will teach you the basics of how to use R programming and version control using git to make your research more efficient and reproducible. 

Coding & Cookies is offered in collaboration with the Department of Statistics. After adapting our approach to support online learning in 2020, we will continue to use a flipped classroom format in Spring 2022. To get the most out of these workshops, attendees are encouraged to watch a recorded video and follow along with the exercises presented before attending the live workshops, which will be hybrid sessions offered both online via Zoom and in person in Morgan Library. At the live sessions, we will review key concepts and work through additional examples and questions, with individual help available. Learning materials will continue to be made publicly available on this guide (see links to the left). If sessions are full, interested students are encouraged to review these materials and get in touch with the instructors for follow-up questions. 

Sessions will be led by experienced statistics graduate students and facilitated by Mara Sedlins, PhD, Data Management Specialist at the CSU Libraries, and Julia Sharp, Associate Professor of Statistics and Director of the Graybill Statistics and Data Science Laboratory.  

Spring 2022 Workshops

New to R or RStudio? We encourage you to attend the first session, R Basics. A basic working knowledge of R and RStudio is helpful to get the most out of the rest of the sessions.

R basics

Learning how to code involves an investment of time and effort up front, but will save you time and effort in the long run. In the R basics Coding and Cookies session, the basics of using tabular data in RStudio will be discussed. By the end of this session, you will be able to load data into R, calculate summary statistics, and create exploratory graphs using R’s basic graphics package. This session is geared toward beginners, so if you have experience using R, this may not be the class for you.

February 1st, 10:00-11:30am
Registration link (in person): https://colostate.libcal.com/calendar/12856/r-basics-2022-02-01-in-person
Registration link (online): https://colostate.libcal.com/calendar/12856/r-basics-2022-02-01-online

February 22nd, 10:00-11:30am -- This workshop will be online only.
Registration link (online): https://colostate.libcal.com/calendar/12856/r-basics-2022-02-22-online

Tidy Data in R

The process of generating data can be messy, and what you can do with your data depends strongly on how it is formatted. This month's coding and cookies will cover the definition of “tidy data”, a standardized way of formatting your data that makes it easier to work with. You will learn how to clean and reformat your data using a collection of R packages called the tidyverse. A basic working knowledge of R and R studio would be helpful for you to get the most out of this session.

March 22nd, 10:00-11:30am
Registration link (in person): https://colostate.libcal.com/calendar/12856/tidy-data-r-2023-03-22-in-person
Registration link (online): https://colostate.libcal.com/calendar/12856/tidy-data-r-2022-03-22-online

Data Visualization using ggplot2

So you’re familiar with R, but want to do more with your plots than the base graphics package.  In this month’s Coding and Cookies, the ggplot2 package in R will be discussed. After this session, you will be able to create a variety of plot types, alter their aesthetics, and create custom themes. A working knowledge of R and R studio and dplyr would be helpful for you to get the most out of this session.

April 5th, 10:00-11:30am
Registration link (in person): https://colostate.libcal.com/calendar/12856/data-viz-2022-04-05-in-person
Registration link (online): https://colostate.libcal.com/calendar/12856/data-viz-2022-04-05-online

Version Control using Git

We’ve all intuitively used some type of version control in our work such as saving multiple versions of a document. While easy, it can cause file bloat and ultimately become more complicated. Luckily, formal version control systems have been developed to streamline this process. In this month’s Coding and Cookies session, we will be covering version control using git. After this session, you’ll be able to create a git repository, make and add changes to the repository, and use GitHub to remotely store your repository.

April 19th, 10:00-11:30am
Registration link (in person): https://colostate.libcal.com/calendar/12856/version-control-2022-04-19-in-person
Registration link (online): https://colostate.libcal.com/calendar/12856/version-control-2022-04-19-online

 

URL: https://libguides.colostate.edu/coding-cookies | Print Page