31 General Resources
This is a long list of helpful general resources related to EDAV. If you have come across a good resource you don’t see here, consider adding it with a pull request (see the contribute page for more info).
A lot of these are available for students through Columbia Libraries, in both physical and e-book formats.
Graphical Data Analysis with R: This book systematically goes through the different types of data, including categorical variables, continuous variables, and time series. The author demonstrates plotting techniques using ggplot and promotes the “grammar of graphics” model. Code snippets are included and available at the book’s website.
R for Data Science: The classic. Everything from data types, programming, modeling, communicating, and those keyboard shortcuts you keep forgetting. To quote the book, “this book will teach you how to do data science with R.” Nuff said.
Cheatsheet of cheatsheets: Paul van der Laken has put together a large collection of R resource links, including cheat sheets, style guides, package info, blogs, and other helpful resources.
RStudio Cheatsheet Collection: Collection of downloadable cheatsheets from RStudio. Includes ones on R Markdown, Data Transformation (dplyr), and Data Visualization (ggplot2). They also have an R Markdown Reference Guide, which is great for remembering that one chunk option that’s on the tip of your tongue.
R Base Graphics Cheatsheet: Oddly enough, despite the length of time it’s been around, it’s hard to find a base graphics cheatsheet. Joyce put this one together to help you out if you’re using base graphics.
Ten Simple Rules for Better Figures: A helpful article discussing how to make the best figures possible by following ten basic rules such as “Avoid ‘chartjunk’” and “Know Your Audience”. Good to keep these rules in mind.
The Simpsons by the Data: Nice example of telling a story with data (histograms, scatterplots, etc.). Also, its subject is everybody’s favorite TV family.
New York Open Statistical Programming Meetup: Meetups hosted by Jared Lander and Wes McKinney on a variety of topics in statistical programming, but with a focus on the R language. Past speakers have included J.J. Allaire (founder of RStudio) and Hadley Wickham (core tidyverse developer). Attendees are generally eager to welcome newcomers, and all of the talks are available on the Lander Analytics YouTube channel.
R likes Twitter. Here are some cool people doing work with #rstats:
Members of the United States Congress (1789-Present) with lots of biographical information https://github.com/unitedstates/congress-legislators