Lecture 1

Syllabus and Course Outline

Byeong-Hak Choe

SUNY Geneseo

August 25, 2025

Instructor

Instructor

Current Appointment & Education

  • Name: Byeong-Hak Choe.

  • Assistant Professor of Data Analytics and Economics, School of Business at SUNY Geneseo.

  • Ph.D. in Economics from University of Wyoming.

  • M.S. in Economics from Arizona State University.

  • M.A. in Economics from SUNY Stony Brook.

  • B.A. in Economics & B.S. in Applied Mathematics from Hanyang University at Ansan, South Korea.

    • Minor in Business Administration.
    • Concentration in Finance.

Instructor

Economics and Data Science

  • Choe, B.H., 2021. “Social Media Campaigns, Lobbying and Legislation: Evidence from #climatechange and Energy Lobbies.

  • Question: To what extent do social media campaigns compete with fossil fuel lobbying on climate change legislation?

  • Data include:

    • 5.0 million tweets with #climatechange/#globalwarming around the globe;
    • 12.0 million retweets/likes to those tweets;
    • 0.8 million Twitter users who wrote those tweets;
    • 1.4 million Twitter users who retweeted or liked those tweets;
    • 0.3 million US Twitter users with their location at a city level;
    • Firm-level lobbying data (expenses, targeted bills, etc.).

Instructor

Economics and Data Science

  • Choe, B.H. and Ore-Monago, T., 2024. “Governance and Climate Finance in the Developing World

  • Climate finance refers to the financial resources allocated for mitigating and adapting to climate change, including support for initiatives that reduce greenhouse gas emissions and enhance resilience to climate impacts.

    • We focus on transnational financing that rich countries provide poor countries with financial resources, in order to help them adapt to climate change and mitigate greenhouse gas (GHG) emissions.
    • Since the GHG emissions in developing countries are rapidly growing, it is crucial to assess the effectiveness of climate finance.
    • Poor governance (e.g., legal system, rule of law, and accountability) can be significant barriers to emissions reductions.

Instructor

Economics and Data Science

  • Choe, B.H. and Newbold, Steve, “Estimating the Value of Statistical Life (VSL) through Big Data

  • VSL is the monetary value associated with reducing the risk of death.

    • How much value would that be? How can we measure it?
    • How do government agencies use the VSL to decide which policies are worth the cost when they reduce the risk of death?

Syllabus

Syllabus

Email, Class & Office Hours

Syllabus

Course Description

  • This course is designed to provide a comprehensive overview of data handling techniques, focusing on practical application through case studies.

  • Key topics include:

    1. data loading, cleaning, transformation, merging, and reshaping;
    2. techniques for slicing, dicing, and summarizing datasets;
    3. data collection via web scraping and APIs.
  • These areas will be explored through detailed, real-world examples to address common data analysis challenges.

  • Throughout the course, students will gain hands-on experience with Python and its data analysis libraries, along with practical applications of git and GitHub.

Syllabus

Reference Materials

Syllabus

Course Requirements

  • Laptop: You should bring your own laptop (Mac or Windows) to the classroom.

    • The minimum specification for your laptop in this course is 2+ core CPU, 4+ GB RAM, and 500+ GB disk storage.
  • Homework: There will be six homework assignments.

  • Project: There will be one project on a personal website.

  • Exams: There will be two Midterm Exams and one Final Exam.

    • The Midterm Exam 2 is comprehensive.
    • The Final Exam is comprehensive.
  • Discussions: You are encouraged to participate in GitHub-based online discussions and class discussion, and office hours.

    • Checkout the netiquette policy in the syllabus.

Syllabus

Personal Website

  • You will create your own website using Quarto, R Studio, and Git.

  • You will publish your homework assignments and a project on your website.

  • Your website will be hosted in GitHub.

  • The basics in Markdown will be discussed.

  • References:

Syllabus

Why Personal Website?

  • Here is the example website:
  • Professional Showcase: Display skills and projects
  • Visibility and Networking: Increase online presence
  • Content Sharing and Engagement: Publish articles, insights
  • Job Opportunities: Attract potential employers and clients
  • Long-term Asset: A growing repository of your career journey

Syllabus

Project

  • The project report should include data collection and exploratory data analysis using summary statistics, visual representations, and data wrangling.

  • The document for the project must be published in each member’s website.

Syllabus

Class Schedule and Exams

  • There will be tentatively 28 class sessions.

  • The Midterm Exam I is scheduled on October 8, 2025, Wednesday, during the class time.

  • The Midterm Exam II is scheduled on November 12, 2025, Wednesday, during the class time.

  • The Final Exam is to be determined.

  • The due for the project is December 16, 2025, Tuesday, 11:59 P.M., Eastern Time

Syllabus

Class Schedule and Exams

  • No class on
    • September 1 (Labor Day)
    • October 13 (Fall Break)
    • November 26 (Thanksgiving Break)

Syllabus

Course Contents

  • The first part of the course covers Python basics and pandas basics.

Syllabus

Course Contents

  • The second part of the course covers data collection.

Syllabus

Course Contents

  • The third part of the course covers advanced pandas.

Syllabus

Grading

\[ \begin{align} (\text{Total Percentage Grade}) =&\quad\;\, 0.05\times(\text{Attendance Score})\notag\\ &\,+\, 0.20\times(\text{Total Homework Score})\notag\\ &\,+\, 0.25\times(\text{Project and Website Score})\notag\\ &\,+\, 0.50\times(\text{Total Exam Score}).\notag \end{align} \]

Syllabus

Grading

  • You are allowed up to 4 absences without penalty.

    • Send me an email if you have standard excused reasons (illness, family emergency, transportation problems, etc.).
  • For each absence beyond the initial five, there will be a deduction of 1% from the Total Percentage Grade.

  • Participation will be evaluated by quantity and quality of GitHub-based online discussions and in-person discussion.

  • The single lowest homework score will be dropped when calculating the total homework score.

Syllabus

Grading

\[ \begin{align} &(\text{Midterm Exam Score}) \\ =\, &\text{max}\,\left\{0.50\times(\text{Midterm Exam I Score}) \,+\, 0.50\times(\text{Midterm Exam II Score})\right.,\notag\\ &\qquad\;\,\left.0.33\times(\text{Midterm Exam I Score}) \,+\, 0.67\times(\text{Midterm Exam II Score})\right\}.\notag \end{align} \]

  • The Midterm Exam Score is the maximum between
    1. the simple average of the Midterm Exam I score and the Midterm Exam II Score and
    2. the weighted average of them with one-third weight on the Midterm Exam I Score and two-third weight on the Midterm Exam II Score.

Syllabus

Grading

\[ \begin{align} &(\text{Total Exam Score}) \\ =\, &\text{max}\,\left\{0.50\times(\text{Midterm Exam Score}) \,+\, 0.50\times(\text{Final Exam Score})\right.,\notag\\ &\qquad\;\,\left.0.25\times(\text{Midterm Exam Score}) \,+\, 0.75\times(\text{Final Exam Score})\right\}.\notag \end{align} \]

  • The Total Exam Score is the maximum between
    1. the simple average of the Midterm Exam Score and the Final Exam Score and
    2. the weighted average of them with one-fourth weight on the Midterm Exam Score and three-third weight on the Final Exam Score.

Syllabus

Make-up Policy

  • Make-up exams will not be given unless you have either a medically verified excuse or an absence excused by the University.

  • If you cannot take exams because of religious obligations, notify me by email at least two weeks in advance so that an alternative exam time may be set.

  • A missed exam without an excused absence earns a grade of zero.

  • Late submissions for homework assignment will be accepted with a penalty.

  • A zero will be recorded for a missed assignment.

Syllabus

Academic Integrity and Plagiarism

  • All homework assignments and exams must be the original work by you.

  • Examples of academic dishonesty include:

    • representing the work, thoughts, and ideas of another person as your own
    • allowing others to represent your work, thoughts, or ideas as theirs, and
    • being complicit in academic dishonesty by suspecting or knowing of it and not taking action.
  • Geneseo’s Library offers frequent workshops to help you understand how to paraphrase, quote, and cite outside sources properly.

Syllabus

Accessibility

  • The Office of Accessibility will coordinate reasonable accommodations for persons with physical, emotional, or cognitive disabilities to ensure equal access to academic programs, activities, and services at Geneseo.

  • Please contact me and the Office of Accessibility Services for questions related to access and accommodations.

Syllabus

Well-being

  • You are strongly encouraged to communicate your needs to faculty and staff and seek support if you are experiencing unmanageable stress or are having difficulties with daily functioning.

  • Liz Felski, the School of Business Student Advocate (felski@geneseo.edu, South Hall 303), or the Dean of Students (585-245-5706) can assist and provide direction to appropriate campus resources.

  • For more information, see https://www.geneseo.edu/dean_students.

Syllabus

Career Design

  • To get information about career development, you can visit the Career Development Events Calendar (https://www.geneseo.edu/career_development/events/calendar).

  • You can stop by South 112 to get assistance in completing your Handshake Profile https://app.joinhandshake.com/login.

    • Handshake is ranked #1 by students as the best place to find full-time jobs.
    • 50% of the 2018-2020 graduates received a job or internship offer on Handshake.
    • Handshake is trusted by all 500 of the Fortune 500.