Week 10
Career Session; Big Data and the Modern Data Infrastructure
In Week 10, weβll wrap up the two Alumni Career Sessions. Weβll then continue to explore new topics in big data and the modern data infrastructure.
π« Lecture Slides
- Lecture 7 β Big Data and the Modern Data Infrastructure
View Slides
π₯ Looking for lecture recordings? You can only find those on Brightspace.
βοΈ Classwork
Classwork 9 - Joining Two Related Tables in R
View ClassworkClasswork 10 - Career Sessions
View Classwork
π Reference
Free Sources of Useful (Big) Data
Economics/Finance
| Data Source | Description | URL |
|---|---|---|
| Bureau of Labor Statistics (BLS) | Provides access to data on inflation and prices, wages and benefits, employment, spending and time use, productivity, and workplace injuries | BLS |
| FRED (Federal Reserve Economic Data) | Provides access to a vast collection of U.S. economic data, including interest rates, GDP, inflation, employment, and more | FRED |
| Yahoo Finance | Provides comprehensive financial news, data, and analysis, including stock quotes, market data, and financial reports | Yahoo Finance |
| IMF (International Monetary Fund) | Provides access to a range of economic data and reports on countriesβ economies | IMF Data |
| World Bank Open Data | Free and open access to global development data, including world development indicators | World Bank Open Data |
| OECD Data | Provides access to economic, environmental, and social data and indicators from OECD member countries | OECD Data |
Government/Public Data
| Data Source | Description | URL |
|---|---|---|
| Data.gov | Portal providing access to over 186,000 government data sets, related to topics such as agriculture, education, health, and public safety | Data.gov |
| CIA World Factbook | Portal to information on the economy, government, history, infrastructure, military, and population of 267 countries | CIA World Factbook |
| U.S. Census Bureau | Portal to a huge variety of government statistics and data relating to the U.S. economy and its population | U.S. Census Bureau |
| European Union Open Data Portal | Provides access to public data from EU institutions | EU Open Data Portal |
| New York City Open Data | Provides access to datasets from New York City, covering a wide range of topics such as public safety, transportation, and health | NYC Open Data |
| Los Angeles Open Data | Portal for accessing public data from the City of Los Angeles, including transportation, public safety, and city services | LA Open Data |
| Chicago Data Portal | Offers access to datasets from the City of Chicago, including crime data, transportation, and health statistics | Chicago Data Portal |
General Data Repositories
| Data Source | Description | URL |
|---|---|---|
| Amazon Web Services (AWS) public data sets | Portal to a huge repository of public data, including climate data, the million song dataset, and data from the 1000 Genomes project | AWS Datasets |
| Gapminder | Portal to data from the World Health Organization and World Bank on economic, medical, and social issues | Gapminder |
| Google Dataset Search | Helps find datasets stored across the web | Google Dataset Search |
| Kaggle Datasets | A community-driven platform with datasets from various fields, useful for machine learning and data science projects | Kaggle Datasets |
| UCI Machine Learning Repository | A collection of databases, domain theories, and datasets used for machine learning research | UCI ML Repository |
| United Nations Data | Provides access to global statistical data compiled by the United Nations | UN Data |
| Humanitarian Data Exchange (HDX) | Provides humanitarian data from the United Nations, NGOs, and other organizations | HDX |
| Democratizing Data from data.org | A platform providing access to high-impact datasets, tools, and resources aimed at solving critical global challenges | Democratizing Data |
| Justia Federal District Court Opinions and Orders database | A free searchable database of full-text opinions and orders from civil cases heard in U.S. Federal District Courts | Justia |
π¬ Discussion
Welcome to our Week 10 Discussion Board! π
This space is designed for you to engage with your classmates about the material covered in Week 10.
Whether you are looking to delve deeper into the content, share insights, or have questions about the content, this is the perfect place for you.
If you have any specific questions for Byeong-Hak (@bcdanl) or peer classmate (@GitHub-Username) regarding the Week 10 materials or need clarification on any points, donβt hesitate to ask here.
Letβs collaborate and learn from each other!