Data Manipulation with dplyr

dplyr is a a great tool to perform data manipulation. It makes your data analysis process a lot more efficient. Even better, it’s fairly simple to learn and start applying immediately to your work! Oftentimes, with just a few elegant lines of code, your data becomes that much easier to dissect and analyze. For these reasons, it is an essential and foundational skill to master for any aspiring data scientist. Often one may be surprised how some easy-to-learn functions can make the data analysis process that much more efficient. That is certainly the case with dplyr. We will teach you how to use this incredibly useful package to mung data, while demonstrating with a Kaggle dataset on wine ratings. Remember to bring your laptops to follow along if you like.

Get the accompanying material here

Next community talk:
Data Visualization with ggplot2

Previous community talk:
Building a Business Case for Your ML Idea

More Data Science Material:
[Video] Feature Engineering with dplyr
[Video] dplyr Setup and Data Preparation
[Blog] Data Manipulation and Exploration with dplyr

(431)

Arham Akheel
About The Author
- Arham holds a Masters degree in Technology Management from Texas A&M University and has a background of managing information systems.

Avatar

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <s> <strike> <strong>