WEB SCRAPING ASSIGNMENTS IN PYTHON
Sean is a Software Engineer based in the UK who specializes in web scrapping and algorithm optimization.
From 2012 to 2015, he was a teaching Assistant within BSc Mathematics course at the University of Aston.
He holds a Bachelors and Masters degree in Mathematics.
Web Scraping is a technique to extract data from web sites through an automated process. It is a form of copying data from the web and store it in a database or spreadsheet.
This project is made up of three different web scraping problems. The assignments consist of extracting data, files or information from different websites.
Each problem involves building a different web scraper for the following sites:
– Wikipedia Easy
– Youtube. Advanced
– Filebin. Easy
WHAT YOU WILL PRACTICE
You will need to import the following packages: BeautifulSoup, requests, zipfile, csv and re.
You will practice some basic Regex for pattern matching.
curl command with Python requests. See here for more information.
You will work with functions, loops, lists and conditionals. You will have to unzip files and write data in a csv file.
DOWNLOAD / CONTENT
You will receive an email with a protected ZIP and a password to access the content. If you are a registered user, the download is always available on your account.
-1 PDF with the problems description along with steps or guidelines to help you finish the exercises in case you get lost.
– 3 py files with the source code including explanations and comments about how the code works.
– Python 3
If you need additional information, do not hesitate to contact us.
Specification: SCRAP POPULAR SITES AND EXTRACT REQUESTED DATA
pdf, text, zip
1 review for SCRAP POPULAR SITES AND EXTRACT REQUESTED DATA
Only logged in customers who have purchased this product may leave a review.