Data Manipulation in R
1 Introduction
Data manipulation is the process of cleaning, organising and preparing data so that it is suitable for analysis. Most real-world datasets require some manipulation before downstream analysis, and this process is often repeated several times during the data analysis cycle. In this workshop you will learn how to apply a consistent grammar of data manipulation to raw data and prepare it for analysis. The following topics are covered in the workshop:
- Learning to use the grammar of data manipulation
- Merging multiple datasets and creating subsets using filters
- Reshaping data between long and wide formats
- Summarising data with group-wise operations
- Setting up data pipelines for efficient data manipulation
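As a rough illustration of how these topics fit together, the sketch below chains a merge, a filter, a reshape and a group-wise summary in one pipeline. It assumes the dplyr and tidyr packages (the workshop description does not name the packages it uses), and the `samples` and `sites` tables are made-up toy data.

```r
# Assumption: dplyr (>= 1.0.0) and tidyr (>= 1.0.0) are installed;
# the workshop text itself does not say which packages are used.
library(dplyr)
library(tidyr)

# Hypothetical toy data, invented for this illustration
samples <- tibble(
  id     = c(1, 2, 3, 4),
  site   = c("A", "A", "B", "B"),
  height = c(10, 12, 9, 15),
  weight = c(100, 120, 90, 150)
)
sites <- tibble(
  site   = c("A", "B"),
  region = c("North", "South")
)

# Merge the two tables on the shared key, then keep a subset of rows
merged <- samples %>%
  left_join(sites, by = "site") %>%
  filter(height > 9)

# Reshape from wide (one column per measure) to long (one row per measure)
long <- merged %>%
  pivot_longer(
    cols      = c(height, weight),
    names_to  = "measure",
    values_to = "value"
  )

# Group-wise summary: mean value per region and measure
summary_tbl <- long %>%
  group_by(region, measure) %>%
  summarise(mean_value = mean(value), .groups = "drop")

print(summary_tbl)
```

Each step takes a data frame and returns a data frame, which is what allows the steps to be chained with the `%>%` pipe.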
This workshop is designed for individuals who are already familiar with R but wish to learn efficient techniques for data manipulation. It is recommended that you bring your own laptop with the latest versions of R and RStudio installed.
Last Updated: Nov 23, 2017 12:36 AM