Classwork 6
Pandas Basics - Converting Data Types; Filtering Data; Dealing with Missing Values/Duplicates
Directions
The netflix.csv file (with its pathname https://bcdanl.github.io/data/netflix.csv) contains a list of 6,000 titles that were available to watch in November 2019 on the video streaming service Netflix. It includes four variables: the video’s title, director, the date Netflix added it (date_added), and its type (category).
Question 1
Optimize the DataFrame for limited memory use and maximum utility by using the astype() method.
Answer:
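One possible approach, sketched on a few made-up rows that only mimic the four columns of netflix.csv (the titles and director names below are placeholders, not real data). It assumes the date strings parse cleanly; in the classwork you would load the real file with pd.read_csv() instead of building the DataFrame by hand.

```python
import pandas as pd

# Made-up stand-in rows; in the classwork, load the real file instead:
# netflix = pd.read_csv("https://bcdanl.github.io/data/netflix.csv")
netflix = pd.DataFrame({
    "title": ["Title A", "Title B", "Title C", "Title D"],
    "director": ["Director X", None, "Director Y", "Director X"],
    "date_added": ["2019-01-05", "2018-06-15", "2019-01-05", "2019-02-20"],
    "type": ["Movie", "TV Show", "Movie", "Movie"],
})

# type has only a handful of distinct values, so a category dtype
# usually takes less memory than plain strings on a large file
netflix["type"] = netflix["type"].astype("category")

# Storing dates as datetime64 makes comparisons and sorting reliable
netflix["date_added"] = netflix["date_added"].astype("datetime64[ns]")

print(netflix.dtypes)
print(netflix.memory_usage(deep=True))
```

The title and director columns are left as strings here because most of their values are likely unique, so a category dtype would probably not help much.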
Question 2
Find all observations with a director of “Martin Scorsese”.
Answer:
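A minimal sketch of filtering by a single condition, using made-up rows in place of the real file (the titles are placeholders). The comparison builds a Boolean Series, which is then used to select rows.

```python
import pandas as pd

# Made-up rows that only mimic the columns of netflix.csv
netflix = pd.DataFrame({
    "title": ["Title A", "Title B", "Title C"],
    "director": ["Martin Scorsese", "Director Y", None],
    "date_added": ["2019-01-05", "2018-06-15", "2019-02-20"],
    "type": ["Movie", "Movie", "TV Show"],
})

# True for rows whose director matches exactly; missing values give False
is_scorsese = netflix["director"] == "Martin Scorsese"
scorsese = netflix[is_scorsese]

print(scorsese)
```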
Question 3
Find all observations with a title of “Limitless” and a type of “Movie”.
Answer:
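A sketch of combining two conditions with the & operator, again on made-up rows. Each condition needs its own parentheses (or its own variable, as here) because & binds more tightly than ==.

```python
import pandas as pd

# Made-up rows that only mimic the columns of netflix.csv
netflix = pd.DataFrame({
    "title": ["Limitless", "Limitless", "Title C"],
    "director": ["Director X", None, "Director Y"],
    "date_added": ["2019-01-05", "2018-06-15", "2019-02-20"],
    "type": ["Movie", "TV Show", "Movie"],
})

is_limitless = netflix["title"] == "Limitless"
is_movie = netflix["type"] == "Movie"

# & keeps only the rows where both conditions are True
limitless_movie = netflix[is_limitless & is_movie]

print(limitless_movie)
```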
Question 4
Find all observations with either a date_added of “2018-06-15” or a director of “Bong Joon Ho”.
Answer:
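A sketch of an "either/or" filter with the | operator, on made-up rows. It assumes date_added is converted to datetime first (here with pd.to_datetime()), so the date is compared as a date rather than as text.

```python
import pandas as pd

# Made-up rows that only mimic the columns of netflix.csv
netflix = pd.DataFrame({
    "title": ["Title A", "Title B", "Title C", "Title D"],
    "director": ["Bong Joon Ho", "Director Y", None, "Director X"],
    "date_added": ["2019-01-05", "2018-06-15", "2018-06-15", "2019-02-20"],
    "type": ["Movie", "Movie", "TV Show", "Movie"],
})
netflix["date_added"] = pd.to_datetime(netflix["date_added"])

on_date = netflix["date_added"] == pd.Timestamp("2018-06-15")
by_bong = netflix["director"] == "Bong Joon Ho"

# | keeps the rows where at least one condition is True
result = netflix[on_date | by_bong]

print(result)
```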
Question 5
Find all observations with a director of “Ethan Coen”, “Joel Coen”, or “Quentin Tarantino”.
Answer:
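When one column is checked against several values, isin() is usually tidier than chaining several == comparisons with |. A sketch on made-up rows:

```python
import pandas as pd

# Made-up rows that only mimic the columns of netflix.csv
netflix = pd.DataFrame({
    "title": ["Title A", "Title B", "Title C", "Title D", "Title E"],
    "director": ["Ethan Coen", "Joel Coen", "Quentin Tarantino",
                 "Director Y", None],
    "date_added": ["2019-01-05", "2018-06-15", "2018-06-15",
                   "2019-02-20", "2019-03-01"],
    "type": ["Movie", "Movie", "Movie", "Movie", "TV Show"],
})

directors = ["Ethan Coen", "Joel Coen", "Quentin Tarantino"]

# True for rows whose director exactly equals any name in the list
selected = netflix[netflix["director"].isin(directors)]

print(selected)
```

Note that isin() matches whole values only, so a row whose director field lists several names in one string would not match under this sketch.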
Question 6
Find all observations with a date_added value between January 1, 2019 and February 1, 2019.
Answer:
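A sketch using between() on a datetime column, with made-up rows. By default between() includes both endpoints; whether February 1 itself should count is an assumption here, since the question does not say.

```python
import pandas as pd

# Made-up rows that only mimic the columns of netflix.csv
netflix = pd.DataFrame({
    "title": ["Title A", "Title B", "Title C", "Title D", "Title E"],
    "director": ["Director X", "Director Y", None, "Director X", "Director Z"],
    "date_added": ["2018-12-31", "2019-01-01", "2019-01-15",
                   "2019-02-01", "2019-02-02"],
    "type": ["Movie", "Movie", "TV Show", "Movie", "Movie"],
})
netflix["date_added"] = pd.to_datetime(netflix["date_added"])

start = pd.Timestamp("2019-01-01")
end = pd.Timestamp("2019-02-01")

# Inclusive on both ends by default
in_range = netflix[netflix["date_added"].between(start, end)]

print(in_range)
```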
Question 7
Drop all observations with a NaN value in the director variable.
Answer:
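A sketch with dropna() and its subset parameter, on made-up rows. Passing subset limits the check to the director column, so rows with missing values elsewhere are kept.

```python
import pandas as pd

# Made-up rows that only mimic the columns of netflix.csv
netflix = pd.DataFrame({
    "title": ["Title A", "Title B", "Title C"],
    "director": ["Director X", None, "Director Y"],
    "date_added": ["2019-01-05", "2018-06-15", None],
    "type": ["Movie", "TV Show", "Movie"],
})

# Drop only the rows where director is missing
with_director = netflix.dropna(subset=["director"])

print(with_director)
```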
Question 8
Identify the days when Netflix added only one movie to its catalog.
Answer:
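One way to read this question is "among rows whose type is Movie, which date_added values appear exactly once?" Under that assumption, a sketch on made-up rows: filter to movies, then use drop_duplicates() with keep=False, which discards every row whose date appears more than once.

```python
import pandas as pd

# Made-up rows that only mimic the columns of netflix.csv
netflix = pd.DataFrame({
    "title": ["Title A", "Title B", "Title C", "Title D", "Title E"],
    "director": ["Director X", "Director Y", None, "Director X", "Director Z"],
    "date_added": ["2019-01-05", "2019-01-05", "2019-02-20",
                   "2018-06-15", "2018-06-15"],
    "type": ["Movie", "Movie", "Movie", "TV Show", "Movie"],
})
netflix["date_added"] = pd.to_datetime(netflix["date_added"])

movies = netflix[netflix["type"] == "Movie"]

# keep=False removes all rows that share a date with another movie,
# leaving only the days on which a single movie was added
single_movie_days = movies.drop_duplicates(subset=["date_added"], keep=False)

print(single_movie_days)
```

In this sample, the day with one movie and one TV show still counts, because the TV show is filtered out before duplicates are checked.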
Discussion
Welcome to our Classwork 6 Discussion Board! 👋
This space is designed for you to engage with your classmates about the material covered in Classwork 6.
Whether you want to delve deeper into the material, share insights, or ask questions, this is the perfect place for you.
If you have any specific questions for Byeong-Hak (@bcdanl) regarding the Classwork 6 materials or need clarification on any points, don’t hesitate to ask here.
All comments will be stored here.
Let’s collaborate and learn from each other!