17.1. Data Manipulation¶
The readings for this chapter will be found in the lesson’s Jupyter Notebooks.
The notebooks also contain Check Your Understanding questions. Each question from the notebooks has been copied below so you can practice before taking the Canvas quiz.
17.1.1. Check Your Understanding¶
Question
What is the syntax for multiple aggregation functions across a single column, such as 'age'?
Question
What data type is the syntax using?
Question
What is this line of code doing with the data?
data_group = data.groupby("embark_town")
Question
What is this line of code doing with the data?
data_group = data.groupby("embark_town").agg("mean")
Question
According to pandas documentation, using a for loop is the only way to update values in a column.
True
False
Question
What is the syntax to rename a column?
Question
Converting numeric data to categorical data is an example of?
Question
The pivot() method is the only way to aggregate values in a table.
True
False
Question
Which method do you use to create a Wide-to-Long table?
Question
When creating a small table, you should store it in its own variable to keep your original table safe.
True
False
Question
When appending a new row, if it contains a column that doesn’t previously exist in the original table then an error will be thrown.
True
False
Question
Concatenation can act on which axes?
1 and 0
1, only
0 only
for as many columns as the table contains.
Question
Using our flowers and garden_supply tables, write the syntax to merge a subset of columns,
where flowers is the right table, and garden_supply on the left. This subset should only look
at "Flower" and "Sold_As" only in the garden_supply table, and "Name" in the flowers table.
a. garden_supply[["Flower","Sold_As"]].merge(flowers[["Name"]],left_on="Flower", right_on="Name")
b. flowers[["Flower", "Sold_As"]].merge(garden_supply[["Name"]], left_on="Flower", right_on="Name")
c. garden_supply[["Flower", "Sold_As"]].merge(flowers[["Name"]], left_on="Name", right_on="Flowers")
d. garden_supply[["Name"]].merge(flowers[["Flower", "Sold_As"]], left_on="Flower", right_on="Name")
Question
The default merge in the pandas merge() function is a left merge.
True
False
Question
Which merge() combines ALL of the rows of the merged DataFrames, filling in NaN if values are missing?
Question
The merge() function contains the following parameters: on, left_on, and right_on.
When would you use each of these parameters?
Question
What is the difference between on and left_on in the merge() function?
Question
When working with join, the right table will always be joined based on its index and not a designated column.
True
False
Question
The default join() type is:
