R

What I've learnt about making an R package

What I've learnt about making an R package

R
The last few weeks have been all about R package development for me. First I was exploring GitHub actions with the lovely people at the rOpenSci OzUnconf, and then I was off to San Francisco to learn about Building Tidy Tools with the Wickham siblings. I’ve picked up a lot about package development, so I’m documenting some of trickier things that I’ve learnt. A great resource for package development is Hadley’s book.
Model as a package

Model as a package

R
There’s a concept in R of an analysis as a package, in which everything you need for your data analysis is contained within a custom package. When you install the package and build the vignettes, the data analysis is performed and results saved as a pretty HTML or PDF file, generated with R Markdown. I wanted to extend this concept to a machine learning model as a package. The idea here is that, using vignettes, we can make equivalent installing a package with training a model.
My data science job hunt

My data science job hunt

R
If you listen to university advertisements for data science masters degrees, you’d believe that data scientists are so in-demand that they can walk into any company, state their salary, and start work straight away. Not quite. Interviewing for data science positions is tough, and job-seekers face some bad behaviour from recruiters and hiring managers. Many companies understand that they need to do something with data, but they don’t know what. They’ll say they want machine learning when they really want a few dashboards.
Markdown monsters

Markdown monsters

R
Whenever I take an interest in something I think to myself, “How can I combine this with R?” This post is the result of applying that attitude to Dungeons and Dragons. So how would I combine D&D with R? A good start would be to have a nice data set of Dungeons and Dragons monsters, with all of their statistics, abilities and attributes. One of the core D&D rule books is the Monster Manual.
How I'd like to send an email from R

How I'd like to send an email from R

R
When I found myself using R in a corporate environment, my workflow went like this: Connect to databases Do stuff to data Email results Yes, there exist options for presenting results that are a bit more modern than the old-fashioned email—R Markdown, Shiny, or even Slack, for example. But email is embedded in corporate culture and will be around for a long time to come. I want to set down how I think a send_email function should work in R.
useR: The Week in Review

useR: The Week in Review

R
That’s it for #useR2018. After 6 keynotes, 132 parallel sessions, many more lightning talks and posters, and an all-important conference dinner, we’ve reached the end of the week. This was my first proper conference since 2015. I had almost forgotten how it felt to be surrounded by hundreds of people who are just as passionate (if not more) about your tiny area of specialised knowledge than you are. I took notes for the three tutorials I went to, but I wanted to take a moment to review the week as a whole, including the talks that stood out to me.
useR: Missing values imputation

useR: Missing values imputation

F
These are my notes for the third and final tutorial of useR2018, and the tutorial I was looking forward to the most. I struggle with missing value imputation. It’s one of those things which I kind of get the theory of, but fall over when trying to do. So I was keen to hear Julie Joss and Nick Tierney talk about their techniques and associated R packages. Your dataset with missing values after mean imputation.
useR: Getting started with R and Docker

useR: Getting started with R and Docker

R
These are my notes for the super helpful tutorial given by Elizabeth Stark on the first day of the UseR 2018 conference. This was an introduction to Docker for R users who have no prior experience with Docker (which was me!). Elizabeth’s slides Elizabeth’s exercises and examples This tutorial took me through setting up an RStudio Server container. I’m on a Linux machine, but I’m particularly interested by the idea that you could run these traditionally Linux-only servers on a Windows machine through Docker.
useR: Recipes for data processing

useR: Recipes for data processing

R
These are my notes for the tutorial given by Max Kuhn on the afternoon of the first day of the UseR 2018 conference. Full confession here: I was having trouble deciding between this tutorial and another one, and eventually decided on the other one. But then I accidentally came to the wrong room and I took it as a sign that it was time to learn more about preprocessing. Also, the recipes package is adorable.
Scraping wine colours with R

Scraping wine colours with R

R
My knowledge of wine covers three facts: I like red wine. I do not like white wine. I love wine data. I came across a great collection of around 130,000 wine reviews, each a paragraph long, on Kaggle. This is juicy stuff, and I can’t wait to dig into it with some text analysis, or maybe build some sort of markov chain or neural network that generates new wine reviews.