Automated Web Scraping in R: Writing your Script

In this video tutorial you will learn how to write standard web scraping commands in R, filter timely data based on time diffs, analyze or summarize key information in the text, and send an email alert of the results of your analysis.

Packages used:
rvest – for downloading website data
lubridate – for cleaning, converting date-time data
stringr – for cleaning text in r
LSAfun – for ranking/summarizing the text

Recommended for medium level R users. See our Introduction to R to get up-to-speed with basic R commands:

Introduction to R – Part 1.1 Getting Started

The R full script for this video tutorial can be accessed here

To see an example of web scraping timely political news events and commentary from Reddit, check out Data Science Dojo’s blog tutorial on KDnuggets:



Category: Canonical Pages
About The Author
- Data Science Dojo is a paradigm shift in data science learning. We enable all professionals (and students) to extract actionable insights from data.

Start the discussion at