14.1. Exploratory Data Analysis¶
Read the following articles, follow along where instructed.
Note
You do not need to install pandas
, it comes with Anaconda
.
Tip
For Medium articles: if you run out of free articles, open the page in an incognito window.
14.1.1. Exploring Data with Python¶
Code along with this article.
Stop at Step #8 “Detecting Outliers”.
14.1.2. Get to Know Your Data¶
14.1.3. Python pandas
¶
Code along with this article.
Python Pandas Tutorial: A Complete Introduction for Beginners
Stop at “Handling Duplicates”.
14.1.4. Statistics in pandas
¶
14.1.5. What is a DataFrame
?¶
A pandas DataFrame
is similar to a Python dictionary. The column names are like keys and the values are the data for that column.
pandas Series
. Here is how pandas Series
are used to build a dataframe.14.1.6. Check Your Understanding¶
Question
What is the pandas
function used to return the number of rows and columns in a DataFrame
?
Question
Column names cannot be changed in a DataFrame
?
True
False
Question
What can knowing the data types present in a data set tell us about the data being presented?
Question
What is the pandas
method for reading a CSV file type?
Question
Visualized below is the “purchases” DataFrame
. What is the pandas
syntax to select for Robert’s data?
Question
How do we view only the first 13 rows of a DataFrame
?
Question
A DataFrame
column is a Series
?
True
False
Question
Which pandas
function will print the number of records, three quartiles, mean, standard deviation, minimum and maximum values of a DataFrame
?
.describe()
.index()
.statistics()
.head()