17.1. Data Manipulation¶
The readings for this chapter will be found in the lesson’s Jupyter Notebooks.
The notebooks also contain Check Your Understanding questions. Each question from the notebooks has been copied below so you can practice before taking the Canvas quiz.
17.1.1. Check Your Understanding¶
Question
What is the syntax for multiple aggregation functions across a single column, such as 'age'
?
Question
What data type is the syntax using?
Question
What is this line of code doing with the data?
data_group = data.groupby("embark_town")
Question
What is this line of code doing with the data?
data_group = data.groupby("embark_town").agg("mean")
Question
According to pandas
documentation, using a for loop is the only way to update values in a column.
True
False
Question
What is the syntax to rename a column?
Question
Converting numeric data to categorical data is an example of?
Question
The pivot()
method is the only way to aggregate values in a table.
True
False
Question
Which method do you use to create a Wide-to-Long
table?
Question
When creating a small table, you should store it in its own variable to keep your original table safe.
True
False
Question
When appending a new row, if it contains a column that doesn’t previously exist in the original table then an error will be thrown.
True
False
Question
Concatenation can act on which axes?
1 and 0
1, only
0 only
for as many columns as the table contains.
Question
Using our flowers
and garden_supply
tables, write the syntax to merge a subset of columns,
where flowers
is the right table, and garden_supply
on the left. This subset should only look
at "Flower"
and "Sold_As"
only in the garden_supply
table, and "Name"
in the flowers
table.
a. garden_supply[["Flower","Sold_As"]].merge(flowers[["Name"]],left_on="Flower", right_on="Name")
b. flowers[["Flower", "Sold_As"]].merge(garden_supply[["Name"]], left_on="Flower", right_on="Name")
c. garden_supply[["Flower", "Sold_As"]].merge(flowers[["Name"]], left_on="Name", right_on="Flowers")
d. garden_supply[["Name"]].merge(flowers[["Flower", "Sold_As"]], left_on="Flower", right_on="Name")
Question
The default merge in the pandas
merge()
function is a left merge.
True
False
Question
Which merge()
combines ALL of the rows of the merged DataFrames, filling in NaN
if values are missing?
Question
The merge()
function contains the following parameters: on
, left_on
, and right_on
.
When would you use each of these parameters?
Question
What is the difference between on
and left_on
in the merge()
function?
Question
When working with join, the right table will always be joined based on its index and not a designated column.
True
False
Question
The default join()
type is: