By Andy Brown
Download files and exercises for this tutorial from https://www.wiseowl.co.uk/python/videos/python-programming/scraping-websites/
This tutorial covers all aspects of extracting data from websites using Python: from the ethics and legality of web scraping, via the structure of a web page's document object model through to using the Python requests and BeautifulSoup modules to extract meaning from HTML.
Chapters
00:00 Topic list
02:03 Problems you may encounter
06:52 Our example HTML
08:20 The Document Object Model (DOM)
11:08 HTML tags and attributes
12:52 Element ids and class names (CSS)
18:37 Getting HTML from a website (requests module)
22:26 Getting HTML from a file
23:45 Getting started with BeautifulSoup
27:18 Chaining elements
30:43 Navigable strings
33:54 Navigating the DOM
37:52 Finding elements
46:11 Using CSS selectors
Python playlist https://youtube.com/playlist?list=PLNIs-AWhQzckGrdnwITFlXGqiXTCAhWWi
If you'd like to help fund Wise Owl's conversion of tea and biscuits into quality training videos you can join this channel to get access to perks:
https://www.youtube.com/channel/UCbi5G5PjWBaQUFy7XU_O7yw/join
Or you can click this link https://www.wiseowl.co.uk/donate?t=1 to make a donation. Thanks for watching!
Visit https://www.wiseowl.co.uk for more online training resources in Microsoft Excel, Microsoft Power BI, DAX, VBA, Python, Visual C#, Microsoft SQL Server, SQL Server Reporting Services SSRS, SQL Server Integration Services SSIS, Microsoft Access and more!