Web Scraping

Erstellt von Hannes Burrichter, zuletzt geändert von Karahan Yilmazer am 27. November 2023

In this workshop you will learn even more about how you can extract information from websites! Cool right?

BeautifulSoup is a Python module that helps you parse and navigate HTML or XML documents, making it easier to extract and manipulate data from web pages. (Documentation: https://beautiful-soup-4.readthedocs.io/en/latest/)

The following topics will be covered:

BeautifulSoup:

Scraping data from an example website listing various artists
Storing movie information from the IMDb Top 250 website into Pandas DataFrames

Requirements

Your own laptop
Internet connection
Google account
Google Colab installed in Google Drive

Date and Location

Wed, 05.07.23, 17:00 - 19:00

Materials

Slides: https://docs.google.com/presentation/d/1BkxgoUQyF8vhHHBFS9nMB5hCgMla9DiI/edit?usp=sharing&ouid=116220617472791343301&rtpof=true&sd=true
Notebook: https://colab.research.google.com/drive/1LMoxTpxL5vVdKb52Vj_uHnzjiTVJ3uSv
Feedback form: TBA

Keine Stichwörter