Not logged in. Login

CMPT 733: Big Data Programming II

Objectives

This course is designed for students who have completed CMPT 726 and CMPS 732, and want to further their knowledge and skills in data science and big data. It aims to bridge the gap between theoretical concepts and practical applications of machine learning and data engineering, by exposing them to current trends and challenges in data science and big data.

The course will cover essential topics such as data wrangling, data visualization, data storytelling, and machine learning workflows, and introduce students to cutting-edge techniques and tools for dealing with large-scale and complex data. By the end of this course, students should be able to tackle real-world data problems, ask meaningful questions about data, design effective data-processing pipelines, and communicate their findings.

Topics

  • Introduction to Data Science
  • Data Preparation
  • Visualization
  • Statistics
  • Deep Learning
  • Practical Machine Learning (AutoML, Explainable AI, Feature Engineering)
  • Anomaly Detection
  • Cloud Computing
  • Responsible Data Science
  • Communication

Logistics

Instructor

TAs

  • Gaurav Bhagchandani
  • Jialiang Guo
  • Nidhi Kantekar

Lectures

  • Time: Tue 10:30 PM - 12:20 PM
  • Location: BLU9660

Labs

Lab G101:

  • Time: Wed 11:30 AM to 1:20 PM (Instructor + TA) and Fri 1:30 PM to 3:20 PM (TAs)
  • Location: SECB1010

Lab G102:

  • Time: Wed 1:30 PM to 3:20 PM (Instructor + TA) and Fri 11:30 AM to 1:20 PM (TA)
  • Location: SECB1010

Grading

  1. Assignments: 11 × 4% = 44%
  2. In-lab team exercise: 9%
  3. Final Project: 47% (2% proposal + 15% milestone + 15% final presentation + 15% code & report & video)

Schedule

WeekDateEvent TypeDescriptionCourse Materials
wk 1Tue Jan 9Lecture 1Course Introductionslides
Wednesday January 17 2024A1-1 DueAssignment #1-1 DueA1-1
Wednesday February 07 2024A1-2 DueAssignment #1-2 DueA1-2
wk 2Tue Jan 16Lecture 2Data Preparationslides
Friday January 26 2024A2 DueAssignment #2 DueA2
wk 3Tue Jan 23Lecture 3Statistics (Part I)slides
Monday February 05 2024A3 DueAssignment #3 DueA3
wk 4Tue Jan 30Lecture 4Data Visualization (Part I)slides
Monday February 12 2024A4 DueAssignment #4 DueA4
wk 5Tue Feb 6Lecture 5Practical Machine Learning (Part I)slides
Friday February 16 2024A5 DueAssignment #5 DueA5
wk 6Tue Feb 13Lecture 6Deep Learning (Part I)slides
Tuesday February 20 2024Proposal DueCourse Project Proposal Due
Tue Feb 20Reading BreakNo Classes
wk 7Tue Feb 27Lecture 7Data Visualization (Part II)slides
Monday March 18 2024A7 DueAssignment #7 DueA7
wk 8Tue Mar 5Lecture 8Practical Machine Learning (Part II)slides
Fri Mar 8Milestone Presentation
Tue Mar 12Friday March 22 2024Assignment #8 DueA8-1, A8-2
wk 9Tue Mar 12Lecture 9Statistics (Part II)slides
Friday March 29 2024A9 DueAssignment #9 DueA9-1, A9-2
wk 10Tue Mar 19Lecture 10Deep learning (Part II), Natural Language Processingslides
Wednesday April 10 2024A10 DueAssignment #10 DueA10 lec10-nlp-ds
wk 11Tue Mar 26Lecture 11Responsible Data Scienceslides
Monday April 15 2024A11 DueAssignment #11 DueA11-1, A11-2
wk 12Tue Apr 2NO class - Final project prep
Fri Apr 5
wk 13Tue Apr 9Final Project Presentation & CodeCourse Project Presentation
Fri Apr 12Report & Video DueCourse Project Report Due

Final Project Showcase

References

Updated Tue April 02 2024, 08:43 by sbergner.