Scrapy Masterclass: Learn Web Scraping With Scrapy Framework

Master Web Scraping using Scrapy Python Framework and this Step-By-Step & In-Depth Web Scraping Guide For Beginners

4.40 (145 reviews)
Udemy
platform
English
language
Programming Languages
category
instructor
Scrapy Masterclass: Learn Web Scraping With Scrapy Framework
7,991
students
7.5 hours
content
May 2023
last update
$79.99
regular price

What you will learn

Define the Steps Involved in Web Scraping and Creating Web Crawlers

Install and Setup Scrapy in Windows, Mac OS, Ubuntu (Linux) & Anaconda Environments

Send Request to a URL to Scrape Websites Using Scrapy Spider

Get the HTML Response From URL and Parse it for Web Scraping

Select Desired Data From Websites Using Scrapy Selector, CSS Selectors & XPath

Scrapy Crawl Spiders to Get Data From Websites And Extract it to JSON, CSV, XLSX ( Excel ) and XML Files

Use Scrapy Shell Commands to Test & Verify CSS Selectors or XPath

Export and Save Scraped Data to Online Databases Like MonogoDB Using Scrapy Item Pipelines

Define Scrapy Items to Organize Scraped Data And Load Items Using Scrapy Itemloaders with Input & Output Processors

Scrape Data From Multiple Web Pages Using Scrapy Pagination And Extract Data From HTML Tables

Login Into Websites Using Scrapy FormRequest With CSRF Tokens

Scrape Dynamic/JavaScript Rendered Websites Using Scrapy-Playwright And Interact With Web Elements, Take Screenshot of Websites or Save as PDF

Identify API Calls From a Website and Scrape Data From API Using Scrapy Request

Why take this course?

Web scraping is the process of scraping websites and extracting desired data from the same, and in this course, you'll learn and master web scraping using python and scrapy framework with a step-by-step and in-depth guide.


A Step-By-Step Guide

Assuming that you know nothing about web scraping, web crawling, scrapy framework, web scraping, or even web scrapping, we will start from the complete basics. In the first section, you'll learn about the web scraping process step-by-step (with infographics - no code), how to scrape data from websites and how to use scrapy for the same (i.e. scrapy meaning).


After getting the basics clear and having an idea of how web scraping works, we will start web scraping using python & scrapy framework! Again, we'll move step-by-step and perform each step learned in the basics with bite-sized lessons. We'll take it slow so that it's easier for you to understand every step involved in scraping and extracting data from websites.


Web Scraping & Scrapy Essentials

Having built an actual web scraper, you'll get an idea of how web scraping works first-hand. Now it's crucial to cover the essential concepts of web scraping and scrapy, which we will do next.

  • CSS Selectors to select web elements

  • XPath to select web elements.

  • Scrapy Shell to test & verify selectors.

  • Items to organise extracted data

  • Load Items with ItemLoaders with input & output Processors

  • Export data to JSON, CSV, XLSX (Excel) & XML file formats

  • Save extracted data to online databases like MongoDB using ItemPipelines.


Master Web Scraping In-Depth

Learning how to scrape websites and the essentials already makes you a complete web scraper, but we'll take this even further and learn the advanced web scraping techniques to become an expert!

  • Follow links in a webpage to another page.

  • Crawl multiple pages and extract data, i.e. Pagination.

  • Scrape data using Regular Expressions (RegEx)

  • Extract Data From HTML Tables

  • Login Into Websites Using Scrapy FormRequest

  • Bypass CSRF-protected Login forms.

  • Scrape Dynamic or JavaScript Rendered Websites using Scrapy Playwright.

    • Interact with web elements like fill forms, click buttons, etc.

    • Handle Infinite Scroll websites.

    • Wait For Elements when contents/data take time to load

    • Take Screenshots of websites.

    • Save websites as PDFs.

  • Identify API calls from websites and scrape data from APIs

  • Use middleware in a scrapy project.

  • Configure settings in a scrapy project

  • Use and Rotate User-Agents & Proxies

  • Web scraping Best Practices


Real-World Projects

After mastering web scraping and web crawling, we need projects to start! That's why you'll perform three projects as well:

  • Champions League Table [ ESPN ]

  • Product Tracker [ Amazon ]

  • Scraper Application [ GUI ]


Join us in this in-depth course, where you'll learn about web scraping from scratch and master the process of extracting data from websites step-by-step. Check out the preview lessons to learn how web scraping works! See you there~

Screenshots

Scrapy Masterclass: Learn Web Scraping With Scrapy Framework - Screenshot_01Scrapy Masterclass: Learn Web Scraping With Scrapy Framework - Screenshot_02Scrapy Masterclass: Learn Web Scraping With Scrapy Framework - Screenshot_03Scrapy Masterclass: Learn Web Scraping With Scrapy Framework - Screenshot_04

Reviews

Avram
September 11, 2023
I am really enjoying this course, but I still have some hesitations about whether I will be able to use them in a web-scraping project or not.
Pratipalsinh
July 22, 2023
I have been scraping websites since 2019 using Python and I always preferred the "Requests" library over the Scrapy framework as it appeared to be more complex. Recently I thought of giving Scrapy a try and bought this course to deep dive and understand the concepts like items, pipelines, and middlewares. Although the instructor explains these concepts using practical examples but the websites he used for this are very basic and generic (quotes to scrape dot com, books to scrape dot com), provided by the developer of the Scrapy framework itself i.e., Zyte. I expected the scraping of some real-world websites with lots of data since Scrapy is mostly used when there's a huge amount of data to be scraped. For that reason, I am leaving 3 stars. So if you already have some experience scraping with Scrapy then this course is not for you but if you are just starting out then I believe this course is best suited for you. Thanks
A.
June 6, 2023
A brilliant introduction to Python Scrapy and how web scraping works. Instructor Rahul is very knowledgeable in his field, clear in teaching and easy to follow along with this tutorial. Thank you for the course and crisp presentation.
Thomas
April 17, 2023
Of course, it's almost impossible to create a course that is comprehensive enough to cover all web scraping topics, but this course will give you a good start. You'll get more than just the basics, you'll dive deeper into web scraping right from the start. Everything is shown with examples used on real websites. Even for beginners, this course is easy to understand because of its step-by-step approach. If you want to learn how to use "scrapy", take this course.
Anthony
January 29, 2023
Easy to follow. Insightful and the animations/lecture style is remarkable. This clearer had a lot of work put into which I am grateful for. I find a lot of courses on Udemy come off as if the instructor turned on a low-end webcam and decided to ramble instructions hoping for a positive result. nopt Rahul, this took work and years of work in the field to deliver. 5/5 | 10/10
Joshua
January 29, 2023
Fantastic so far, very easy to follow. The one recommendation I would add is setting up iPython in Pycharm, as well as using Pycharm's built in Python c=Console. I understand VS Code is popular, bu tso is Pycharm and the Python Console in PyCharm has a built in object explorer which is useful for exploring objects
Norma
November 5, 2022
Good step-by-step instructions. I was able to follow up the exercises. The setup for the environment needs to be updated since Scrapy now requires a newer version of python. Once the environment was setup, I was able to follow up the exercises without a problem.

Charts

Price

Scrapy Masterclass: Learn Web Scraping With Scrapy Framework - Price chart

Rating

Scrapy Masterclass: Learn Web Scraping With Scrapy Framework - Ratings chart

Enrollment distribution

Scrapy Masterclass: Learn Web Scraping With Scrapy Framework - Distribution chart
4583862
udemy ID
3/7/2022
course created date
4/5/2022
course indexed date
Bot
course submited by