<- read_csv("https://bcdanl.github.io/data/beer_markets_all.csv") beer_markets
Data Storytelling Team Project - Guideline
What You Should Do for the Team Project
Presentation Schedule and Format
- Each team will deliver a 10-minute presentation, followed by a 1–2 minute Q&A session, during one of the following class sessions:
- December 4, Wednesday: Four teams
- December 6, Friday: Four teams
- December 9, Monday: Two teams
- The order of team presentations will be determined by a random draw during class.
- If multiple teams choose the same topic, I will try to schedule these teams in different sessions to minimize repetition within a single class.
- To ensure fairness and equal participation, each student must contribute evenly to the presentation.
Suggested Topics
- You are welcome to use your own dataset for the project; however, here are some suggested topics with corresponding data for your convenience.
- Dataset Approval: If your team decides to use a dataset that is not suggested by Byeong-Hak, you must obtain approval in advance.
- Dataset Submission:
- Once your dataset is approved, make sure to send the dataset files to Byeong-Hak.
- If your team adds any additional datasets to the suggested topic you have chosen, please also submit those additional dataset files to Byeong-Hak.
- The suggested data frames are quite large.
- Feel free to transform the data as needed to focus on the story your team wants to tell.
1. Beer Market
The beer_markets
data frame contains detailed information about household beer purchases across different brands and markets in the United States. It includes purchase details, product attributes, promotional information, and demographic data of the households.
Variable Description
hh
: an identifier of the household;X_purchase_desc
: details on the purchased item;quantity
: the number of items purchased;brand
: Bud Light, Busch Light, Coors Light, Miller Lite, or Natural Light;dollar_spent
: total dollar value of purchase;beer_floz
: total volume of beer, in fluid ounces;price_floz
: price per fl.oz. (i.e.,dollar_spent
/beer_floz
);container
: the type of container;promo
: Whether the item was promoted (coupon or otherwise);region
: US regionstate
: US statemarket
: Scan-track market (or state if rural);- demographic data, including gender, marital status, household income, class of work, race, education, age, the size of household, and whether or not the household has a microwave or a dishwasher.
2. NYC Housing Market
The nyc_housing_sales
data frame includes property sale transactions in New York City from 2003 to 2024. It provides detailed information about each sale, including property characteristics, sales prices, and building classifications.
<- read_csv("https://bcdanl.github.io/data/nyc_housing_sales_2003-2024.csv") nyc_housing_sales
Variable Description
- For the description of variables in the
nyc_housing_sales
data.frame, please refer to the following webpage:- https://www.nyc.gov/site/finance/property/glossary-property-sales.page
- For the variables of building class code, please refer to the following webpage:
- https://www.nyc.gov/site/finance/property/glossary-property-sales.page