Description
WEB SCRAPING ASSIGNMENTS IN PYTHON
INSTRUCTOR
Sean-Michael Tulloch
Sean is a Software Engineer based in the UK who specializes in web scrapping and algorithm optimization.
From 2012 to 2015, he was a teaching Assistant within BSc Mathematics course at the University of Aston.
He holds a Bachelors and Masters degree in Mathematics.
DESCRIPTION
Web Scraping is a technique to extract data from web sites through an automated process. It is a form of copying data from the web and store it in a database or spreadsheet.
This project is made up of three different web scraping problems. The assignments consist of extracting data, files or information from different websites.
Each problem involves building a different web scraper for the following sites:
– Wikipedia Easy
– Youtube. Advanced
– Filebin. Easy
WHAT YOU WILL PRACTICE
Libraries:
You will need to import the following packages: BeautifulSoup, requests, zipfile, csv and re.
Regex:
You will practice some basic Regex for pattern matching.
cURL
curl command with Python requests. See here for more information.
Python:
You will work with functions, loops, lists and conditionals. You will have to unzip files and write data in a csv file.
DOWNLOAD / CONTENT
You will receive an email with a protected ZIP and a password to access the content. If you are a registered user, the download is always available on your account.
-1 PDF with the problems description along with steps or guidelines to help you finish the exercises in case you get lost.
– 3 py files with the source code including explanations and comments about how the code works.
REQUIREMENTS
– Python 3
CONTACT
If you need additional information, do not hesitate to contact us.
Additional information
Specification: SCRAP POPULAR SITES AND EXTRACT REQUESTED DATA
|
Reviews (1)
1 review for SCRAP POPULAR SITES AND EXTRACT REQUESTED DATA
Only logged in customers who have purchased this product may leave a review.
Benjamin (verified owner) –
I got introduced to webscraping in my Master of data science. Beyond all doubt, I would recommend these assignments to anyone willing to learn webscraping. I think this is a great start for anyone interested in the field.