Blog posts and articles
Big data analytics with R
2 February 2023
Big data analytics is a term that gets thrown about a lot, but what is often missing is how you practically go about things when your data gets big. In this post I break down how I think about big data and the tools and packages I reach for in R when I want to read in and manipulate data, from small to large. If you want to get started with tools like vroom, data.table, Arrow, DuckDB, SQLite and Spark, and see how you can use these alongside the tidyverse, then read on.
Why use Git and GitHub?
23 January 2023
If you write code for your research or analysis, maybe in R or Python, there’s a good chance you will have heard of Git and GitHub. But many haven’t and, in my opinion, using Git and GitHub (or a competitor like GitLab) is essential for coding.
In this post I’ll explain exactly what Git and GitHub are, and why they are so important for high-quality analysis.