Lecture 7

pandas Basics - Getting a Summary of Data; Selecting Variables; Counting Methods; Sorting Methods

Byeong-Hak Choe

SUNY Geneseo

February 12, 2025

`nba` DataFrame

Let’s read the nba.csv file as nba:

# Below is to import the pandas library as pd
import pandas as pd 

# Below is for an interactive display of DataFrame in Colab
from google.colab import data_table  
data_table.enable_dataframe_formatter()

# Below is to read nba.csv as nba DataFrame
nba = pd.read_csv("https://bcdanl.github.io/data/nba.csv",
                  parse_dates = ["Birthday"])

Getting a Summary of Data

1 / 16

Lecture 7 pandas Basics - Getting a Summary of Data; Selecting Variables; Counting Methods; Sorting Methods Byeong-Hak Choe bchoe@geneseo.edu SUNY Geneseo February 12, 2025

Lecture 7

`nba` DataFrame

Getting a Summary of Data

DataFrame Terminologies: Variables, Observations, and Values

DataFrame Terminologies

Dot Operators, Methods, and Attributes

Dot operator

Method

Attribute

Getting a Summary of a `DataFrame` with `.info()`

Getting a Summary of a `DataFrame` with `.describe()`

Selecting Variables

Selecting a Variable by its Name

Selecting Multiple Variables by their Names

Selecting Multiple Variables with `select_dtypes()`

Counting Methods

Counting with `.count()`

Counting with `.value_counts()`

Counting with `.nunique()`

Pandas Basics