8 Bookmarks with Thought
8.1 R Reference
Keyboard Shortcuts – RStudio Support
http–www.rpad.org-Rpad-R-refcard.pdf
- lots of links for r functions, blogs, and other references
- List of the most useful R commands
Package ‘fuzzyjoin’
- Check out fwrite for csv in and out How to Perform a Logistic Regression in R | DataScience+
Generate Dynamic R Markdown Blocks
- Top Answer: Try knit_expand() (Dickson, 2014)
8.2 R Graphs
Writing functions for dplyr and ggplot2 – April 2, 2018 R-bloggers
ggplot2 - Essentials - Easy Guides - Wiki - STHDA
8.3 R NLP - Natural Language Processing
8.4 R Stats
Evaluating Logistic Regression Models R-bloggers
PCA in a tidy(verse) framework - goonR blog
wehrley.github.io-SOUPTONUTS.md at master · wehrley-wehrley.github.io · GitHub
Use Domain Knowledge to Review Prior Distributions
- interesting thinking about Bayesian priors and how to set them
15 Types of Regression You Should Know
An Interactive Stats Flowchart / Decision Tree to Help You Choose an Appropriate Statistical Test.
8.5 R Markdown
Branding and Automating with R Markdown Daniel Hadley
Document Code with R Markdown – NEON Data Skills
Writing your thesis with R Markdown (3) – Figures, R code and tables Rosanna's Research
- blogdown: Creating Websites with R Markdown (Yihui Xie, Amber Thomas, Alison Presmanes Hill, 2017)
- lots of other book links
Bookdown_Resources, Phil Bowsher, 2017
- Appears to be an excellent resource
Announcing the R Markdown Book
Pimp My RMD: a Few Tips for R Markdown
R Markdown: How to Format Tables and Figures in .Docx Files
8.6 R Workflow / Version Control
A perfect RStudio layout (Ilya Kashnitsky, 2018)
- My current preferred layout
- Adjust in “Tools” –> “Global options” –> “Pane layout”:
- Top Left: Source
- Bottom Left: History (then collapse)
- Top Right: Console
- Bottom Right: Everything else
Project-Oriented Workflow (Bryan 2017)
- Organize each logical project into a folder on your computer.
- Make sure the top-level folder advertises itself as such (with a .here, .git, or .Rproj)
- Directories:
- Create all paths relative to the top-level directory.
- Whenever you work on this project, launch the R process from the project’s top-level directory. If you launch R from the shell, cd to the correct folder first.
- Write every script assuming it will be run in a fresh R process:
- User-level setup: Do not save .RData when you quit R and don’t load .RData when you fire up R.
- Don’t do things in your .Rprofile that affect how R code runs, such as loading a package like dplyr or ggplot or setting an option such as stringsAsFactors = FALSE.
- Daily work habit: Restart R very often and re-run your under-development script from the top.
GitHub - MaximeWack-tidyflow Tidyflow a workflow that fits the tidyverse
- This is the workflow process that seems to work best for me right now
Good enough practices in scientific computing
ProjectTemplate package
- Recommended by the Not so standard deviations podcast
- Doesn’t look like a format that I would find useful. Too informal once the project is set up.
- An article about various R workflow tools that the Author found interesting
Tools for Reproducible Research Spring (Broman 2016)
- Writing reproducible reports: knitr with R Markdown (Broman 2016)
- Karl Broman (Broman 2016)
- Excellent site with lots of discussion of Reproducable research, knitr, and R overall
- http://kbroman.org/steps2rr/
Professional data validation for the R environment
- Validate package to include data frame of testing from not so standard deviations episode 18 asser from r open Sci
Reproducible Development With Rmarkdown and Github
Structuring R Projects: a Pragmatic Perspective
Beyond Basic R – Introduction and Best Practices
A Framework for Reproducible and Collaborative Data Science • workflowr
8.7 R Shiny
8.8 R Projects
Identifying “Dirty” Twitter Bots with R and Python – paulvanderlaken.com
Mapping earthquakes off the Irish Coast using the leaflet package in R R-bloggers
Simulating 10 000 Tic Tac Toe Games - Analytic Freaks
- Statistics and R
- Working with Your Facebook Data in R
Global, Arctic and Antarctic Sea-Ice Area Barrel - June 2018 [OC]
Analyzing Customer Data From Square
Animating Changes in Football Kits Using R
Antipredator Behavior With R (or Why Wildebeest Should Stay Together)
Any Ideas for Some Projects I Can Do to Show Off My Knowledge of R?
Anybody Interested in Analyzing Last Summer’s /R/Statistics User Survey?
Anyone Tried Some Analysis on Your Own Facebook Data? How to Get HTM Files Into RStudio?
8.9 R Blogs and general reference
Yihui's Blog - R · statistics · life · random thoughts - Yihui Xie
Research Tools Rosanna's Research
The Litigation Articles - George D. Gopen
- Good reference on general business writing - bridging the gap between code and audience
Our Top 10 Blogs for Data Scientists
8.10 R links/blog posts to read
Tidy text package used for Frump text analysis
Check out 538 R analyses on GitHub
Check out show notes from not so standard deviations episode 23 Walt Hickey
8.11 R links/blog posts to read without annotation
“Errors in Statistics for Medical Research”. Any Suggestions?
[Instructional] How to Make Awesome Sankey Diagrams in RStudio in Minutes!
[OC] US Population Projections by Age Through 2060
“Are My Scores Good?” Well, Compared to What?
“Demystifying Data Science” Remote Notes
20 Free Online Courses to Learn Data Science
4 Ways to Be More Efficient Using RStudio’s Code Snippets, With 11 Ready to Use Examples
5 Alternatives to the Default R Outputs for GLMs and Linear Models
A Comparative Review of the Deducer GUI for R
A Completely Subjective Ranking of Data Science Podcasts
A List of Over 1,000 Datasets Available in R Packages
A little function to help generate ICCs in simple clustered data
A Question and an Answer About Recoding Several Factors Simultaneously in R
A Small Logical Change With Big Impact
A very basic tutorial for performing linear mixed effects analyses (Tutorial 2)
AI Data Sets to Use For Your Next Creative Project
AI, Machine Learning and Data Science Roundup: June 2018
An Overview of R With a Curated Learning Path
Analyzing NetHack Data, Part 1: What Kills the Players
Announcing CGPfunctions 0.3 – April 20, 2018 R-bloggers
Anomaly Detection for Business Metrics with R
April 2018: “Top 40” New Packages
As machine learning evolves, we need to update the definition of ‘data scientist’ | VentureBeat
Automated Email Reports With R
Automated Feature Selection Using bounceR
Automl Package: Part 2/2 First Steps How To
Avoid these 4 common data science pitfalls
Awesome Twitter Word Clouds in R
Basics of EDA Using Iris Dataset
Beautiful and Powerful Correlation Tables in R
Beautiful and Powerful Correlation Tables in R
Becoming a Data Scientist – Curriculum via Metromap
Bias in Pre-Post Designs – an Example From the Turnbaugh Et Al (2006) Mouse Fecal Transplant Study
Big Data-2: Move Into the Big League:Graduate From R to SparkR
Big News: Vtreat 1.2.0 Is Available on CRAN, and It Is Now Big Data Capable
Book suggestions for a student pursuing a degree in statistics?
Breaking Into Data Science in Seattle
Can I get some advice on learning to use R?
Can Someone Help Me Get a Better Understanding of the ROC Curve and Its Applications?
Celebrate Halloween With Creepy Computer Games in R
CHAID and Caret – a Good Combo – June 6, 2018
CHAID and R – When You Need Explanation – May 15, 2018
Communicating Results With R Markdown
Computer Science and Programming
Confessions of a Data Scientist: Why I Quit Social Media and Still Cut My Own Grass
Continuous and Automatic Key Driver Analysis of Customer Feedback
Continuous Deployment of Package Documentation With Pkgdown and Travis CI
Converting a R Matrix to LaTeX
Conway’s Game of Life in R: or on the Importance of Vectorizing Your R Code
Create a Glossary in R Markdown
Create Outstanding Dashboards With the New semantic.dashboard Package
Create Waterfall Charts Using R & Ggplot2
Create Your Machine Learning Library From Scratch With R ! (2/5) – PCA
Create Your Machine Learning Library From Scratch With R ! (3/5) – KNN
Customizing Styler – the Quick Way
Data + Art STEAM Project: Final Results
Data + Art STEAM Project: Initial Results
Data Analyst to Data Scientist
Data Lunch 2Feb: the Use of Bookdown to Write Documents and Reports
Data on 1,340 Coffee Bean Reviews (aroma, acidity, flavor, altitude, + more)
Data Science for Business Tutorial: Using Machine Learning With LIME to Understand Employee Churn
Data Science for Business: 3 Reasons You Need to Learn the Expected Value Framework
Data Science for Business: Course Launch in 5 Days!!!
Data Science for Startups: Business Intelligence
Data Science With R Course Series – Week 6
Data Scientist Resume Projects
Data Visualisation COMPETITION
Dealing With Heteroskedasticity; Regression With Robust Standard Errors Using R
Deep Learning From First Principles in Python, R and Octave – Part 8
Detecting Unconscious Bias in Models, With R
Detecting Unconscious Bias in Models, With R
Digging Deeper: Online Resources for Intermediate to Advanced R Users
Do Clustering by “Dimensional Collapse”
Don’t Worry: Google Only Checks Your Location Every 10 Minutes
Draw nicer Classification and Regression Trees with the rpart.plot package
Drawing Beautiful Maps Programmatically With R, Sf and Ggplot2 — Part 1: Basics
Easy APA Formatted Bayesian Correlation
Enterprise Dashboards With R Markdown
Episode 7: Best Practices for Workflow Management
eRum 2018 - May 15 - Edwin Thoen - YouTube
Estimating Control Chart Constants With R
Exploratory Factor Analysis in R
Exploring College Major and Income: a Live Data Analysis in R
Extract FRED Data for OLS Regression Analysis: a Complete R Tutorial
Extracting Tables From PDFs in R Using the Tabulizer Package
Extreme Makeover: R Graphics Edition
Fancy Plot (With Posterior Samples) for Bayesian Regressions
Feature Engineering and Selection: a Practical Approach for Predictive Models
Finalfit, Knitr and R Markdown for Quick Results
First20Hours/Google-10000-English
Formatted Correlation With Effect Size
From Data to Viz | Find the Graphic You Need
Fun with lists (in R) • Apoorv Anand
Get Your Tracks From the Strava API and Plot Them on Leaflet Maps
Getting Data From Pdfs Using the Pdftools Package
Getting Hired as a Data Scientist
GitHub - andbe/keysLocksSimulations: Simulate keys/lock question scenarios
GitHub - jldbc/coffee-quality-database: Building the Coffee Quality Institute Database
Global Migration, Animated With R
Guide to Becoming an Average Programmer and Infocybersecurity
Hotfix for Microsoft R Open 3.5.0 on Linux
How Do I Use Latent Growth Curve Analysis in R?
How Do You Combine “Revision Control” With “Workflow” for R?
How Not to Sort by Average Rating
How the Odds Ratio Confounds: a Brief Study in a Few Colorful Figures
How to Access Data From Twitter API Using R and Python (Let Me Know What You Think)
How to Build a Data Science Portfolio
How to Build Your Own Neural Network From Scratch in R
How to Create a Flexdashboard: Exercises
How to learn Bayesian statistics à la DataCamp or swirl()?
How to Plot With Dygraphs: Exercises
How to Quickly Make Reusable World Maps Based on Excel Worksheet Data Using Ggplot and RStudio!
How to Run R From the Task Scheduler
How to setup RStudio Server Open Source Edition like the Pro version
How to Update Your Wordpress Website From R
https://dm13450.github.io/2018/06/12/Isle-of-Man-TT.html
I Fell Out With Tapply and in Love With Dplyr
Importance of Conveying Facts Through Storytelling in Data Science - for Data Scientists
In Case You Missed It: June 2018 Roundup
in Honor of a Two Week Break, I Have Graphed the Dice Rolls From the Start of Season 2
Interpreting Machine Learning Models With the Lime Package for R
Interview Challenge with One of the Biggest Tech Companies
Intro to Time Series Analysis -Part 1
Intro to Time Series Analysis Part 2 :Exercises
Introduction to Compositional Data Analysis in R - Preferably With a Geoscience Background
Is There Any Serious Computer Science Course Available for Free Online?
jldbc/coffee-quality-database: Building the Coffee Quality Institute Database
Kaplan-Meier Survival Plot – With at Risk Table
Learn About Pi-Hole Ad Blocking for Your Home Network
Life Data and Design Decisions
Lobe’s Ridiculously Simple Machine Learning Platform Aims to Empower Non-Technical Creators
LongCatEDA: an R Package to Visualize Longitudinal Change in Categorical Variables
Machine Learning and Its Applications
Machine Learning Basics – Random Forest
Machine Learning in R With H2O and LIME: a Free Workshop!
Machine Learning Logistic Regression for Credit Modelling in R
Machine learning may be most useful in tiny, embedded, offline processors / Boing Boing
Machine Learning Results in R: One Plot to Rule Them All! (Part 2 – Regression Models)
Marketing Analytics and Data Science
Markov Chain Delinquency Problem in R
Master R Shiny: One Trick to Build Maintainable and Scalable Event Chains
May 2018: “Top 40” New Packages
Melt and cast the shape of your data.frame – Exercises
Michèle Nuijten and John Sakaluk — The Bayes Factor — Overcast
Microsoft R Open 3.5.0 Now Available
Missing Data Imputation and Instrumental Variables Regression: the Tidy Approach
Models Are About What Changes, and What Doesn’t
Modularize Your Shiny Apps: Exercises
Monotonic Binning With Equal-Sized Bads for Scorecard Development
Monte Carlo Simulation of E [OC]
Monte Carlo Techniques to Create Counterfactuals
More Tidy Evaluation With Ggplot2
Most Liked R-Bloggers’ Posts From Last Week (2018-10-07 Till 2018-10-13 – Based on Twitter)
Multilevel Modeling Solves the Multiple Comparison Problem: an Example With R
Multiple Linear Regression R Code With Optimal Adjusted R^2
My Book “Deep Learning From First Principles” Now on Amazon
My First R Package Building Experience: Reflections From Creating Bulletchartr!
Neural Networks Are Essentially Polynomial Regression
New Course: Interactive Maps With Leaflet in R
New Course: Marketing Analytics in R
New Course: Python for R Users
New R Cheatsheet: Data Science Workflow With R
New R Package Xplain: Providing Interactive Interpretations and Explanations of Statistical Results
Non-Linear Model in R Exercises
Non-Standard Evaluations Example
Now “Fread” From data.table Can Read “Gz” and “Bz2” Files Directly
Odds Ratio and Its Relevance in Case Control / Cohort Studies
One-arm Bayesian Adaptive Trial Simulation Code
Optimize Your R Code Using Memoization
Outliers Detection and Intervention Analysis
Package €˜Fuzzyjoin’ March 2, 2018
Parsing Metadata With R – a Package Story
PH525x series - Biomedical Data Science
Predict Customer Churn With Gradient Boosting
Prediction Interval, the wider sister of Confidence Interval
Prettify Your Shiny Tables With DT: Exercises
Pro Git book, written by Scott Chacon and Ben Straub -Fast-Version-Control
Programmatically Creating Text Output in R – Exercises
Publish R Markdown to DataScience+ via Your Editor (i.e., RStudio) and Promote You Work
Purrr Like a Kitten Till the Lake Pipes RoaR
PYPL Language Rankings: Python Ranks #1, R at #7 in Popularity
Qualitative Data Science: Using RQDA to Analyse Interviews
R-Bloggers Weekly – Most Loved R Posts From Last Week (2018-10-21 Till 2018-10-27)
R and Version Control for the Solo Data Analyst
R for Psych Students? Beginner
R Functions for Bayesian Model Statistics and Summaries #Rstats #Stan #Brms
R Resources (Free Courses, Books, Tutorials, & Cheat Sheets)
R Tip: Use Qc() for Fast Legible Quoting
R Weekly 2018-26 Chat, Highlight
R Weekly 2018-28 Tidyverse, R Markdown, Docker
Ranking the Most Influential Computer Security Papers Ever Published
RConsortium — Building an R Certification
Reading financial statements (PDF) for data analysis
Reliability Chapter Added to “Empirical Software Engineering Using R”
remindR: in Code Text Reminders to Aid Code Development
Remove Password Protection From Excel Sheets Using R
Replace data frame names in formula en masse
Reproducible Finance, the Book! and a Discount for Our Readers
Residential Property Investment Visualization and Analysis Shiny App
Resources for Data Imputation?
Ricochet Id: Ricochet:2Bax37Pseb6Mewj4
Rmd Files Will No Longer Be Rendered as Markdown on Github - Yihui Xie | 谢益辉
RStudio 1.2 Preview: Plumber Integration
RStudio:Addins Part 1. – Code Reproducibility Testing
RStudio:Addins Part 2 – Roxygen Documentation Formatting Made Easy
RStudio:Addins Part 3 – View Objects, Files, Functions and More With 1 Keypress
Running the Same Task in Python and R
Seanchrismurphy/a-Psychologists-Guide-to-R
Sentiment Analysis of Slack Reviews Using R
Sentiment Use Across the Course of Pitchfork Music Reviews: a Tidy Text Analysis With R
Seplyr 0.5.8 Now Available on CRAN
Share Your Method of Analysis Without Sharing Your Data - Quantified Self
Shinytest – Automated Testing for Shiny Apps
Should I use an ordered multinomial logit model or just use ANOVA?
Should You Learn R or Python for Data Science?
Simple Guide to Learn How to Analyse Multilevel (Diary) Data?
Slides From My JAX 2018 Talk: Deep Learning – a Primer
Slopegraphs and R – a Pleasant Diversion – May 26, 2018
Soliciting Reactions to My Portfolio Project
Statistical Sins: Is Your Classification Model Any Good?
Statistics “Sunday”: More Sentiment Analysis Resources
Statistics Sunday: Conducting Meta-Analysis in R
Statistics Sunday: Creating Wordclouds
Statistics Sunday: Tokenizing Text
Statistics Sunday: Two R Packages to Check Out
Statistics Sunday: Welcome to Sentiment Analysis With “Hotel California”
Survey Your Audience and Visualise the Results With R and Google Forms
Switching to Blogdown, Netlify and Travis
The Financial Times and BBC Use R for Publication Graphics
The Fix Is in: Finding Infix Functions Inside Contributed R Package “Utilities” Files
The Number of Job Applications It Took to Become a Viz Practitioner [OC]
The Open Source Data Science Masters
The Significance of Poisson Distribution in Statistics | Hashtag Statistics
The True Meaning of Confidence Interval
Tidying Messy Excel Data (Introduction)
Time Series Analysis in R - for All Data Science Lovers
Time Series Analysis With Documentation and Steps I Follow for Analytics Projects.
Top 20 R Libraries for Data Science in 2018 [Infographic]
Top R Language Resources to Improve Your Data Skills
Top Takeaways From Studying iPhone X Owners
UC Business Analytics R Programming Guide
Understanding PCA Using Stack Overflow Data
Using DataCamp’s Autograder to Teach R
Visualising cumulative binomial probabilities with Shiny
What’s New in Azure for Machine Learning and AI
What Is the R Assignment Operator := For?
What They Forgot to Teach You About R
Why You Should Regret Not Going to eRum 2018?
Working With Your Facebook Data in R
Writing Better R Functions Part Four – April 17, 2018