Welcome to Week 3: Mastering Data with Pandas & NumPy
Ready to take your data skills up a notch? This week, we’re diving into Pandas and NumPy, the ultimate dynamic duo for data manipulation! You’ll learn to clean, analyze, and transform data with ease, and discover the power of arrays and dataframes. By the end of the week, you’ll be crunching numbers and making data dance—turning raw data into real insights like a pro!
Session 5: NumPy for Numerical Computing
Date and Time: 12.12.2024, 14:00-16:15
Duration: 2 hours, 15 minutes break
Content:
- Introduction to NumPy:
- Understanding the importance of NumPy in data science
- Installing and importing the NumPy library.
- NumPy Arrays:
- Creating arrays from lists.
- Array indexing and slicing.
- Understanding array shapes and dimensions.
- Array Data Types and Operations:
- Data types in NumPy arrays.
- Performing mathematical operations on arrays.
- Broadcasting rules for array operations.
Session 6: Pandas for Data Analysis
Date and Time: 13.12.2024, 14:00-16:15
Duration: 2 hours, 15 minutes break
Content:
- Introduction to Pandas:
- The role of Pandas in data manipulation.
- Key data structures: Series and DataFrames.
- Data Input and Output: Reading data from CSV and Excel files.
- Writing data to files.
- Data Manipulation:
- Selecting and indexing data.
- Handling missing data.
- Filtering and sorting data.
- Data Aggregation:
- Using groupby for aggregation.
- Merging, joining, and concatenating DataFrames.
Requirements
- Your own laptop
- Internet connection
- Google account
- Google Colab installed in Google Drive
Materials
- Slides: https://docs.google.com/presentation/d/1l-RJPACkYyOmGn6hDPKBMSy6kRgurqZS/edit?usp=sharing&ouid=116220617472791343301&rtpof=true&sd=true
- Notebook: https://drive.google.com/file/d/1l-xoF-ky7r9vpjjBEK7H1Q3Dvl8G_zZn/view?usp=sharing
- Solution: https://colab.research.google.com/drive/1USCoLaHEHMGzuohpzlTh31LAO1JsguhZ?usp=sharing
- Feedback form: TBA