8 Bookmarks with Thought

8.1 R Reference

Keyboard Shortcuts – RStudio Support

Cookbook for R

http–www.rpad.org-Rpad-R-refcard.pdf

Jonathan Baron's R help page

  • lots of links for r functions, blogs, and other references

R commands

  • List of the most useful R commands

Package ‘fuzzyjoin’

Fast csv writing for R

  • Check out fwrite for csv in and out How to Perform a Logistic Regression in R | DataScience+

Generate Dynamic R Markdown Blocks

  • Top Answer: Try knit_expand() (Dickson, 2014)

Broom: Let’s Tidy Up a Bit

8.6 R Workflow / Version Control

A perfect RStudio layout (Ilya Kashnitsky, 2018)

  • My current preferred layout
  • Adjust in “Tools” –> “Global options” –> “Pane layout”:
    • Top Left: Source
    • Bottom Left: History (then collapse)
    • Top Right: Console
    • Bottom Right: Everything else

Project-Oriented Workflow (Bryan 2017)

  • Organize each logical project into a folder on your computer.
    • Make sure the top-level folder advertises itself as such (with a .here, .git, or .Rproj)
  • Directories:
    • Create all paths relative to the top-level directory.
    • Whenever you work on this project, launch the R process from the project’s top-level directory. If you launch R from the shell, cd to the correct folder first.
  • Write every script assuming it will be run in a fresh R process:
    • User-level setup: Do not save .RData when you quit R and don’t load .RData when you fire up R.
    • Don’t do things in your .Rprofile that affect how R code runs, such as loading a package like dplyr or ggplot or setting an option such as stringsAsFactors = FALSE.
    • Daily work habit: Restart R very often and re-run your under-development script from the top.

Git and GitHub · R packages

GitHub - MaximeWack-tidyflow Tidyflow a workflow that fits the tidyverse

  • This is the workflow process that seems to work best for me right now

Good enough practices in scientific computing

ProjectTemplate

ProjectTemplate package

  • Recommended by the Not so standard deviations podcast
  • Doesn’t look like a format that I would find useful. Too informal once the project is set up.

R workflow fun

  • An article about various R workflow tools that the Author found interesting

Tools for Reproducible Research Spring (Broman 2016)

Professional data validation for the R environment

  • Validate package to include data frame of testing from not so standard deviations episode 18 asser from r open Sci

Reproducible Development With Rmarkdown and Github

Structuring R Projects: a Pragmatic Perspective

Beyond Basic R – Introduction and Best Practices

A Framework for Reproducible and Collaborative Data Science • workflowr

8.10 R links/blog posts to read

Tidy text package used for Frump text analysis

Check out 538 R analyses on GitHub

Check out show notes from not so standard deviations episode 23 Walt Hickey

8.11 R links/blog posts to read without annotation

“Errors in Statistics for Medical Research”. Any Suggestions?

[Instructional] How to Make Awesome Sankey Diagrams in RStudio in Minutes!

[OC] US Population Projections by Age Through 2060

“Are My Scores Good?” Well, Compared to What?

“Demystifying Data Science” Remote Notes

2.5 Other Considerations

20 Free Online Courses to Learn Data Science

4 Ways to Be More Efficient Using RStudio’s Code Snippets, With 11 Ready to Use Examples

5 Alternatives to the Default R Outputs for GLMs and Linear Models

A Comparative Review of the Deducer GUI for R

A Completely Subjective Ranking of Data Science Podcasts

A Lazy Function

A List of Over 1,000 Datasets Available in R Packages

A little function to help generate ICCs in simple clustered data

A Primer on Web Scraping in R

A Question and an Answer About Recoding Several Factors Simultaneously in R

A Recipe for Recipes

A Small Logical Change With Big Impact

A very basic tutorial for performing linear mixed effects analyses (Tutorial 2)

AI Data Sets to Use For Your Next Creative Project

AI, Machine Learning and Data Science Roundup: June 2018

An Overview of R With a Curated Learning Path

An R Tutorial: Visual Representation of Complex Multivariate Relationships Using the R Qgraph Package, Part Two Repost

Analyzing NetHack Data, Part 1: What Kills the Players

Announcing CGPfunctions 0.3 – April 20, 2018 R-bloggers

Anomaly Detection for Business Metrics with R

April 2018: “Top 40” New Packages

As machine learning evolves, we need to update the definition of ‘data scientist’ | VentureBeat

Automated Email Reports With R

Automated Feature Selection Using bounceR

Automl Package: Part 2/2 First Steps How To

Avoid these 4 common data science pitfalls

Awesome R

Awesome Twitter Word Clouds in R

Basics of EDA Using Iris Dataset

Beautiful and Powerful Correlation Tables in R

Beautiful and Powerful Correlation Tables in R

Beautiful Visualizations in R

Becoming a Data Scientist – Curriculum via Metromap

Bias in Pre-Post Designs – an Example From the Turnbaugh Et Al (2006) Mouse Fecal Transplant Study

Big Data-2: Move Into the Big League:Graduate From R to SparkR

Big News: Vtreat 1.2.0 Is Available on CRAN, and It Is Now Big Data Capable

Blog Archives

Book suggestions for a student pursuing a degree in statistics?

Breaking Into Data Science in Seattle

Can I get some advice on learning to use R?

Can Someone Help Me Get a Better Understanding of the ROC Curve and Its Applications?

CausalImpact

Celebrate Halloween With Creepy Computer Games in R

Celebrate Halloween With the R Language - Translation of a 1982 Game From the Book Creepy Computer Games

CHAID and Caret – a Good Combo – June 6, 2018

CHAID and R – When You Need Explanation – May 15, 2018

Cleaning Up Tables

cocor comparing correlations

Collateral

Communicating Results With R Markdown

Computer Science and Programming

Confessions of a Data Scientist: Why I Quit Social Media and Still Cut My Own Grass

Continuous and Automatic Key Driver Analysis of Customer Feedback

Continuous Deployment of Package Documentation With Pkgdown and Travis CI

Converting a R Matrix to LaTeX

Conway’s Game of Life in R: or on the Importance of Vectorizing Your R Code

Create a Glossary in R Markdown

Create Outstanding Dashboards With the New semantic.dashboard Package

Create Waterfall Charts Using R & Ggplot2

Create Your Machine Learning Library From Scratch With R ! (2/5) – PCA

Create Your Machine Learning Library From Scratch With R ! (3/5) – KNN

Creating Slopegraphs with R

Customizing Styler – the Quick Way

Data + Art STEAM Project: Final Results

Data + Art STEAM Project: Initial Results

Data Analyst to Data Scientist

Data fallacies to avoid

Data Lunch 2Feb: the Use of Bookdown to Write Documents and Reports

Data on 1,340 Coffee Bean Reviews (aroma, acidity, flavor, altitude, + more)

Data Rectangling

Data Science for Business Tutorial: Using Machine Learning With LIME to Understand Employee Churn

Data Science for Business: 3 Reasons You Need to Learn the Expected Value Framework

Data Science for Business: Course Launch in 5 Days!!!

Data Science for Startups: Business Intelligence

Data Science With R Course Series – Week 6

Data Scientist Resume Projects

Data Summary in One Go

Data Visualisation COMPETITION

Dealing With Heteroskedasticity; Regression With Robust Standard Errors Using R

Deep Learning From First Principles in Python, R and Octave – Part 8

Detecting Unconscious Bias in Models, With R

Detecting Unconscious Bias in Models, With R

Digging Deeper: Online Resources for Intermediate to Advanced R Users

Do Clustering by “Dimensional Collapse”

Don’t Worry: Google Only Checks Your Location Every 10 Minutes

drake

Draw nicer Classification and Regression Trees with the rpart.plot package

Drawing Beautiful Maps Programmatically With R, Sf and Ggplot2 — Part 1: Basics

Easy APA Formatted Bayesian Correlation

Enterprise Dashboards With R Markdown

Episode 7: Best Practices for Workflow Management

eRum 2018 - May 15 - Edwin Thoen - YouTube

Estimating Control Chart Constants With R

Estimating Pi

Every. Damn. Time.

Exploratory Factor Analysis in R

Exploring College Major and Income: a Live Data Analysis in R

Extract FRED Data for OLS Regression Analysis: a Complete R Tutorial

Extracting Tables From PDFs in R Using the Tabulizer Package

Extreme Makeover: R Graphics Edition

Fancy Plot (With Posterior Samples) for Bayesian Regressions

Feature Engineering and Selection: a Practical Approach for Predictive Models

Finalfit, Knitr and R Markdown for Quick Results

First20Hours/Google-10000-English

Flow Charts in R

Forecasting My Weight With R

Formatted Correlation With Effect Size

From Data to Viz | Find the Graphic You Need

From Webscraping Data to Releasing It as an R Package to Share With the World: a Full Tutorial With Data From NetHack

Fun with lists (in R) • Apoorv Anand

Get Your Tracks From the Strava API and Plot Them on Leaflet Maps

Getting Data From Pdfs Using the Pdftools Package

Getting Hired as a Data Scientist

Ggplotgui

GitHub - andbe/keysLocksSimulations: Simulate keys/lock question scenarios

GitHub - jldbc/coffee-quality-database: Building the Coffee Quality Institute Database

github.com/topepo/recipes

Global Migration, Animated With R

Greybox Package for R

Guide to Becoming an Average Programmer and Infocybersecurity

Hotfix for Microsoft R Open 3.5.0 on Linux

How Do I Use Latent Growth Curve Analysis in R?

How Do You Combine “Revision Control” With “Workflow” for R?

How Not to Sort by Average Rating

How the Odds Ratio Confounds: a Brief Study in a Few Colorful Figures

How to Access Data From Twitter API Using R and Python (Let Me Know What You Think)

How to Build a Data Science Portfolio

How to Build Your Own Neural Network From Scratch in R

How to Create a Flexdashboard: Exercises

How to learn Bayesian statistics à la DataCamp or swirl()?

How to Plot With Dygraphs: Exercises

How to Quickly Make Reusable World Maps Based on Excel Worksheet Data Using Ggplot and RStudio!

How to Run R From the Task Scheduler

How to setup RStudio Server Open Source Edition like the Pro version

How to Update Your Wordpress Website From R

https://dm13450.github.io/2018/06/12/Isle-of-Man-TT.html

I created a tool to automatically extract the most important sentences from an article of text; it also has a physics-based network visualization of the underlying algorithm [OC]

I Fell Out With Tapply and in Love With Dplyr

Icon: Web Icons for Rmarkdown

Importance of Conveying Facts Through Storytelling in Data Science - for Data Scientists

In Case You Missed It: June 2018 Roundup

in Honor of a Two Week Break, I Have Graphed the Dice Rolls From the Start of Season 2

In RStudio, How Do You Decide Which Code Goes in Your RMarkdown Document and Which Code Goes in Your RScript?

Interpreting Machine Learning Models With the Lime Package for R

Interview Challenge with One of the Biggest Tech Companies

Intro to FFTree Exercise

Intro to Time Series Analysis -Part 1

Intro to Time Series Analysis Part 2 :Exercises

Introducing Datazar Paper

Introduction to Compositional Data Analysis in R - Preferably With a Geoscience Background

Introduction to DataExplorer

Is There Any Serious Computer Science Course Available for Free Online?

jldbc/coffee-quality-database: Building the Coffee Quality Institute Database

Kaplan-Meier Survival Plot – With at Risk Table

Leaderboard

Learn About Pi-Hole Ad Blocking for Your Home Network

Life (Expectancy), Animated

Life Data and Design Decisions

Lobe

Lobe’s Ridiculously Simple Machine Learning Platform Aims to Empower Non-Technical Creators

LongCatEDA: an R Package to Visualize Longitudinal Change in Categorical Variables

Machine Learning and Its Applications

Machine Learning Basics – Random Forest

Machine Learning in R With H2O and LIME: a Free Workshop!

Machine Learning Logistic Regression for Credit Modelling in R

Machine learning may be most useful in tiny, embedded, offline processors / Boing Boing

Machine Learning Results in R: One Plot to Rule Them All! (Part 2 – Regression Models)

Making Art in R

Marketing Analytics and Data Science

Markov Chain Delinquency Problem in R

Master R Shiny: One Trick to Build Maintainable and Scalable Event Chains

May 2018: “Top 40” New Packages

Melt and cast the shape of your data.frame – Exercises

Michèle Nuijten and John Sakaluk — The Bayes Factor — Overcast

Microsoft R Open 3.5.0 Now Available

Missing Data Imputation and Instrumental Variables Regression: the Tidy Approach

Mlr or Caret

Models Are About What Changes, and What Doesn’t

Modularize Your Shiny Apps: Exercises

Monotonic Binning With Equal-Sized Bads for Scorecard Development

Monte Carlo Part Two

Monte Carlo Simulation of E [OC]

Monte Carlo Techniques to Create Counterfactuals

More Tidy Evaluation With Ggplot2

Most Liked R-Bloggers’ Posts From Last Week (2018-10-07 Till 2018-10-13 – Based on Twitter)

Multilevel Modeling Solves the Multiple Comparison Problem: an Example With R

Multiple Linear Regression R Code With Optimal Adjusted R^2

My “Frite” Package Was Just Published by Cran. It Provides Basic Tools for Code Generation and Diagnostic Functions That Are Useful in Everyday Scenarios.

My Book “Deep Learning From First Principles” Now on Amazon

My First R Package Building Experience: Reflections From Creating Bulletchartr!

Neural Networks Are Essentially Polynomial Regression

New Course: Interactive Maps With Leaflet in R

New Course: Marketing Analytics in R

New Course: Python for R Users

New R Cheatsheet: Data Science Workflow With R

New R Package Xplain: Providing Interactive Interpretations and Explanations of Statistical Results

Non-Linear Model in R Exercises

Non-Standard Evaluations Example

Now “Fread” From data.table Can Read “Gz” and “Bz2” Files Directly

nps_tests.R

Odds Ratio and Its Relevance in Case Control / Cohort Studies

One-arm Bayesian Adaptive Trial Simulation Code

Optimize Your R Code Using Memoization

Ordinal Data

Organization Best Practices- Is It Better to Have Separate Files for Data Importing+Cleaning, Function Creation, Analysis Etc. That Link to Each Other, or to Put Everything in One R File?

Outliers Detection and Intervention Analysis

Package €˜Fuzzyjoin’ March 2, 2018

Parsing Metadata With R – a Package Story

PH525x series - Biomedical Data Science

Predict Customer Churn With Gradient Boosting

Prediction Interval, the wider sister of Confidence Interval

Prettify Your Shiny Tables With DT: Exercises

Pro Git book, written by Scott Chacon and Ben Straub -Fast-Version-Control

Programmatically Creating Text Output in R – Exercises

Programming With Dplyr

Publish R Markdown to DataScience+ via Your Editor (i.e., RStudio) and Promote You Work

Purrr Like a Kitten Till the Lake Pipes RoaR

PYPL Language Rankings: Python Ranks #1, R at #7 in Popularity

Qualitative Data Science: Using RQDA to Analyse Interviews

Questions About Questions

R-Bloggers Weekly – Most Loved R Posts From Last Week (2018-10-21 Till 2018-10-27)

r-podcast

R and Version Control for the Solo Data Analyst

R Data Science Tutorials

R for Psych Students? Beginner

R Functions for Bayesian Model Statistics and Summaries #Rstats #Stan #Brms

R Plotting Systems

R Resources (Free Courses, Books, Tutorials, & Cheat Sheets)

R Studio

R Tip: Use Qc() for Fast Legible Quoting

R Weekly 2018-26 Chat, Highlight

R Weekly 2018-28 Tidyverse, R Markdown, Docker

Ranking the Most Influential Computer Security Papers Ever Published

Rapply, Vapply, and Eapply

Rcommander

RConsortium — Building an R Certification

Reading financial statements (PDF) for data analysis

Recommend the Book to Learn R in 14 Days for Medical Research to a Person That Already Knows Programming and Basic Statistics.

Recommended Courses

Rectangling Onboarding

Reliability Chapter Added to “Empirical Software Engineering Using R”

remindR: in Code Text Reminders to Aid Code Development

Remove Password Protection From Excel Sheets Using R

Replace data frame names in formula en masse

Reproducible Finance, the Book! and a Discount for Our Readers

ReproducibleResearch

Residential Property Investment Visualization and Analysis Shiny App

Resources for Data Imputation?

REST APIs and Plumber

Ricochet Id: Ricochet:2Bax37Pseb6Mewj4

Rmd Files Will No Longer Be Rendered as Markdown on Github - Yihui Xie | 谢益辉

roxygen

RpadTpDataScientist1

RStudio 1.2 Preview: Plumber Integration

Rstudio::Conf 2018 Summary

RStudio:Addins Part 1. – Code Reproducibility Testing

RStudio:Addins Part 2 – Roxygen Documentation Formatting Made Easy

RStudio:Addins Part 3 – View Objects, Files, Functions and More With 1 Keypress

Running the Same Task in Python and R

SatRday Talks Recordings

Seanchrismurphy/a-Psychologists-Guide-to-R

Sentiment Analysis of Slack Reviews Using R

Sentiment Use Across the Course of Pitchfork Music Reviews: a Tidy Text Analysis With R

Seplyr 0.5.8 Now Available on CRAN

Share Your Method of Analysis Without Sharing Your Data - Quantified Self

Shinytest – Automated Testing for Shiny Apps

Should I use an ordered multinomial logit model or just use ANOVA?

Should You Learn R or Python for Data Science?

Simple Guide to Learn How to Analyse Multilevel (Diary) Data?

Skip to Content

Slides From My JAX 2018 Talk: Deep Learning – a Primer

Slopegraphs and R – a Pleasant Diversion – May 26, 2018

Soliciting Reactions to My Portfolio Project

Some Exercises Related to Logistic Regression, Principal Component Analysis, Linear Discriminant Analysis, Support Vector Machines, the Kernel Trick, Word Embedding and Constrained Optimization. Feedback and Stars Appreciated :P

Stagraph | Powered By R

Statistical Sins: Is Your Classification Model Any Good?

Statistics “Sunday”: More Sentiment Analysis Resources

Statistics Sunday: Conducting Meta-Analysis in R

Statistics Sunday: Creating Wordclouds

Statistics Sunday: Tokenizing Text

Statistics Sunday: Two R Packages to Check Out

Statistics Sunday: Welcome to Sentiment Analysis With “Hotel California”

Survey Your Audience and Visualise the Results With R and Google Forms

Switching to Blogdown, Netlify and Travis

taskviewr

Teaching an Advanced R Course

Testing · R packages

testthat

The drake R package

The Financial Times and BBC Use R for Publication Graphics

The Fix Is in: Finding Infix Functions Inside Contributed R Package “Utilities” Files

The Number of Job Applications It Took to Become a Viz Practitioner [OC]

The Open Source Data Science Masters

The Significance of Poisson Distribution in Statistics | Hashtag Statistics

The True Meaning of Confidence Interval

Tidily Evaluated Ggplot2

Tidying Messy Excel Data (Introduction)

Tidyverse Cookbook

Time Series Analysis in R - for All Data Science Lovers

Time Series Analysis With Documentation and Steps I Follow for Analytics Projects.

To Purrr or Not to Purrr

Top 20 R Libraries for Data Science in 2018 [Infographic]

Top R Language Resources to Improve Your Data Skills

Top Takeaways From Studying iPhone X Owners

Tutorials for Learning R

UC Business Analytics R Programming Guide

Understanding PCA Using Stack Overflow Data

Using DataCamp’s Autograder to Teach R

Variance Explained

Visualising cumulative binomial probabilities with Shiny

Visualize the World Cup With R! Part 2: Visualizing Changes in the Group Table During the Final Matchday!

We Use Cookies on Kaggle to Deliver Our Services, Analyze Web Traffic, and Improve Your Experience on the Site. by Using Kaggle, You Agree to Our Use of Cookies.

Welcome to DataCamp

What’s New in Azure for Machine Learning and AI

What Is the R Assignment Operator := For?

What They Forgot to Teach You About R

What You Can Expect From R

Why I Rarely Use Apply

Why You Should Regret Not Going to eRum 2018?

Working With Your Facebook Data in R

Wrapr 1.4.1 Now Up on CRAN

wrapr 1.5.0 available on CRAN

Write Your Own R Package

Writing Better R Functions Part Four – April 17, 2018

Writing Pipe-Friendly Functions

X Is for By

xplain package

Z Is for Z-Scores and Standardizing