Web Scraping in R: Creating your automated script

R tutorial: In this video tutorial you will learn how to write standard web scraping commands in R, filter timely data based on time diffs, analyze or summarize key information in the text, and send an email alert of the results of your analysis.

Packages used:
rvest – for downloading website data
lubridate – for cleaning, converting date-time data
stringr – for cleaning text in r
LSAfun – for ranking/summarizing the text

Recommended for medium level R users. See our Introduction to R to get up-to-speed with basic R commands:

The R full script for this video tutorial can be accessed here

To see an example of web scraping timely political news events and commentary from Reddit, check out Data Science Dojo’s blog tutorial on KDnuggets: https://www.kdnuggets.com/2018/12/automated-web-scraping-r.html

More Data Science Material:
[Video] Auto Scheduling your Script (Part 2)
[Video] Web Scrape in python using BeautifulSoup
[Blog] Web Scraping in 30-minutes


Rebecca Merrett
About The Author
- Rebecca holds a bachelor’s degree of information and media from the University of Technology Sydney and a post graduate diploma in mathematics and statistics from the University of Southern Queensland. She has a background in technical writing for games dev and has written for tech publications.


You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <s> <strike> <strong>