Home
  • Blog
  • Class Code
  • ML Notebook
  • Project

DANL 320-01: Big Data Analytics, Spring 2025

  • Instructor: Byeong-Hak Choe   Email | Brightspace

Welcome! ๐Ÿ‘‹

\(-\) Explore, Learn, and Grow with Data Analytics! ๐ŸŒŸ

\(\bullet\,\) Lecture Slides ๐Ÿš€

Title Subtitle Date
Lecture 1 Syllabus, Course Outline, and DANL Career January 22, 2025
Lecture 2 Getting Started with Jupyter Notebook and Quarto January 27, 2025
Lecture 3 Python Basics January 29, 2025
Lecture 4 Big Data February 3, 2025
Lecture 5 Distributed Computing Framework; Apache Hadoop and Spark; PySpark February 5, 2025
Lecture 6 PySpark Basics February 10, 2025
Lecture 7 Linear Regression February 17, 2025
Lecture 8 Logistic Regression March 10, 2025
Lecture 9 K-fold Cross-Validation; Regularized Regression March 26, 2025
Lecture 10 Tree-based Models April 7, 2025
Lecture 11 Unsupervised Learning April 30, 2025
No matching items

\(\bullet\,\) ML Notebooks ๐Ÿ”ข

Title Subtitle Date
Linear Regression Bikeshare in DC February 19, 2025
Linear Regression Orange Juice February 26, 2025
Homework 2 Beer Markets March 5, 2025
Logistic Regression New Born Baby at Risk March 24, 2025
Homework 3 American Housing Survey 2004 March 26, 2025
Quasi-Separation and Regularized Logistic Regression Car Safety Rating March 31, 2025
Lasso Linear Regression Online Shopping April 2, 2025
Lasso Logistic Regression NHL Player Evaluation April 7, 2025
Omitted Variable Bias Orange Juice April 9, 2025
Tree-based Models NBC Shows; Boston Housing Markets April 14, 2025
From Linear Regression to Tree-based Models Claifornia Housing Markets April 16, 2025
Homework 4 - Part 1: Lasso Linear Regerssion - Model 1 Beer Markets with Big Demographic Design April 19, 2025
Homework 4 - Part 1: Lasso Linear Regerssion - Model 2 Beer Markets with Big Demographic Design April 19, 2025
Homework 4 - Part 1: Lasso Linear Regerssion - Model 3 with Discussions Beer Markets with Big Demographic Design April 19, 2025
Homework 4 - Part 2: Tree-based Models MLB Batting April 19, 2025
Exam Supervised Learning April 21, 2025
No matching items

\(\bullet\,\) Classwork โŒจ๏ธ

Title Subtitle Date
Classwork 1 Building a Personal Website using Git, GitHub, and RStudio with Quarto January 22, 2025
Classwork 2 Markdown Basics January 27, 2025
Classwork 3 Quarto Website Basics January 27, 2025
Classwork 4 Python Basics January 29, 2025
Classwork 5 PySpark Basics - Loading, Summarizing, Selecting, Counting, and Sorting Data February 10, 2025
Classwork 6 PySpark Basics - Convering Data Types; Filtering Data; Dealing with Missing Values/Duplicates February 12, 2025
Classwork 7 PySpark Basics - Group Operations February 17, 2025
Classwork 8 Linear Regression I April 14, 2025
Classwork 9 Linear Regression II April 14, 2025
Classwork 10 Logistic Regression April 14, 2025
Classwork 11 Addressing Quasi-Separation in Logistic Regression with Regularization April 14, 2025
Classwork 12 Predicting Housing Price in California April 14, 2025
No matching items

\(\bullet\,\) Homework ๐Ÿ’ป

Title Subtitle Date
Homework 1 Survey, Personal Website, and Python Basics March 3, 2025
Homework 2 Linear Regression; Jupyter Notebook Blogging May 4, 2025
Homework 3 Regression; Jupyter Notebook Blogging May 4, 2025
Homework 4 Lasso Linear Regression; Tree-based Models May 4, 2025
No matching items
Back to top
 

powered with github, quarto, and rstudio
byeong-hak choe, 2025