DANL 320-01: Big Data Analytics, Spring 2025
- Instructor: Byeong-Hak Choe Email | Brightspace
Welcome! 👋
\(-\) Explore, Learn, and Grow with Data Analytics! 🌟
\(\bullet\,\) Lecture Slides 🚀
Title | Subtitle | Date |
---|---|---|
Lecture 1 | Syllabus, Course Outline, and DANL Career | January 22, 2025 |
Lecture 2 | Getting Started with Jupyter Notebook and Quarto | January 27, 2025 |
Lecture 3 | Python Basics | January 29, 2025 |
Lecture 4 | Big Data | February 3, 2025 |
Lecture 5 | Distributed Computing Framework; Apache Hadoop and Spark; PySpark | February 5, 2025 |
Lecture 6 | PySpark Basics | February 10, 2025 |
Lecture 7 | Linear Regression | February 17, 2025 |
Lecture 8 | Logistic Regression | March 10, 2025 |
Lecture 9 | K-fold Cross-Validation; Regularized Regression | March 26, 2025 |
Lecture 10 | Tree-based Models | April 7, 2025 |
No matching items
\(\bullet\,\) ML Notebooks 🔢
Title | Subtitle | Date |
---|---|---|
Linear Regression | Bikeshare in DC | February 19, 2025 |
Linear Regression | Orange Juice | February 26, 2025 |
Homework 2 | Beer Markets | March 5, 2025 |
Logistic Regression | New Born Baby at Risk | March 24, 2025 |
Homework 3 | American Housing Survey 2004 | March 26, 2025 |
Quasi-Separation and Regularized Logistic Regression | Car Safety Rating | March 31, 2025 |
Lasso Linear Regression | Online Shopping | April 2, 2025 |
Lasso Logistic Regression | NHL Player Evaluation | April 7, 2025 |
Omitted Variable Bias | Orange Juice | April 9, 2025 |
Tree-based Models | NBC Shows; Boston Housing Markets | April 14, 2025 |
From Linear Regression to Tree-based Models | Claifornia Housing Markets | April 16, 2025 |
Homework 4 - Part 1: Lasso Linear Regerssion - Model 1 | Beer Markets with Big Demographic Design | April 19, 2025 |
Homework 4 - Part 1: Lasso Linear Regerssion - Model 2 | Beer Markets with Big Demographic Design | April 19, 2025 |
Homework 4 - Part 1: Lasso Linear Regerssion - Model 3 with Discussions | Beer Markets with Big Demographic Design | April 19, 2025 |
Homework 4 - Part 2: Tree-based Models | MLB Batting | April 19, 2025 |
No matching items
\(\bullet\,\) Classwork ⌨️
Title | Subtitle | Date |
---|---|---|
Classwork 1 | Building a Personal Website using Git, GitHub, and RStudio with Quarto | January 22, 2025 |
Classwork 2 | Markdown Basics | January 27, 2025 |
Classwork 3 | Quarto Website Basics | January 27, 2025 |
Classwork 4 | Python Basics | January 29, 2025 |
Classwork 5 | PySpark Basics - Loading, Summarizing, Selecting, Counting, and Sorting Data | February 10, 2025 |
Classwork 6 | PySpark Basics - Convering Data Types; Filtering Data; Dealing with Missing Values/Duplicates | February 12, 2025 |
Classwork 7 | PySpark Basics - Group Operations | February 17, 2025 |
Classwork 8 | Linear Regression I | April 14, 2025 |
Classwork 9 | Linear Regression II | April 14, 2025 |
Classwork 10 | Logistic Regression | April 14, 2025 |
Classwork 11 | Addressing Quasi-Separation in Logistic Regression with Regularization | April 14, 2025 |
Classwork 12 | Predicting Housing Price in California | April 14, 2025 |
No matching items
\(\bullet\,\) Homework 💻
Title | Subtitle | Date |
---|---|---|
Homework 1 | Survey, Personal Website, and Python Basics | March 3, 2025 |
Homework 2 | Linear Regression; Jupyter Notebook Blogging | March 13, 2025 |
Homework 3 | Regression; Jupyter Notebook Blogging | March 25, 2025 |
Homework 4 | Lasso Linear Regression; Tree-based Models | April 16, 2025 |
No matching items