Data Manipulation with dplyr
dplyr is a a great tool to perform data manipulation. It makes your data analysis process a lot more efficient. Even better, it’s fairly simple to learn and start applying immediately to your work! Oftentimes, with just a few elegant lines of code, your data becomes that much easier to dissect and analyze. For these reasons, it is an essential and foundational skill to master for any aspiring data scientist. Often one may be surprised how some easy-to-learn functions can make the data analysis process that much more efficient. That is certainly the case with dplyr. We will teach you how to use this incredibly useful package to mung data, while demonstrating with a Kaggle dataset on wine ratings. Remember to bring your laptops to follow along if you like.
Next community talk:
Data Visualization with ggplot2
Previous community talk:
Building a Business Case for Your ML Idea