Spreadsheet Mastery: Exercises

Your responsibilities as a Data Analyst at Amazon encompass both reporting and analysis across various departments. Recently, your team received a request for a comprehensive analysis of shipping speed and status for orders shipped to India over the past three months. Upon reviewing the request, your department director concluded that more information is necessary before deciding to proceed with a full analysis.

As a result, your director has asked you to compile a concise report summarizing sales orders and associated shipping information from March 31, 2022 to June 29, 2022.

Additionally, your director would like specific information for a random sample of orders, for which the order IDs have been provided.

The initial reporting request specifies the following metrics, KPIs, and sample order information:

  1. Reporting Period Begin Date.
  2. Reporting Period End Date.
  3. Count of days in Reporting Period.
  4. Date of report preparation.
  5. Total Order Count.
  6. Total Unit Count.
  7. Total Sales Dollars.
  8. Average Order Value (AOV).
  9. Average Unit Retail (AUR).
  10. Units Per Transaction/Order (UPT).
  11. Sales Dollars and % of Total Sales Dollars by Shipping Status Category.
  12. Unit Counts and % of Total Units by Cancellation Status.
  13. Date, Status, Ship Service Level, Units, Dollars, City, State, and Zip Code for these 20 Order IDs:
Order IDOrder ID
404-2822073-6488362402-3840492-4873917
407-1425428-0087544408-8226426-6214707
406-9907647-1673953404-8659996-0325913
408-0047275-3547510403-1323278-1035515
408-6217530-3383522406-0203534-2738746
402-5622441-3675529403-9063524-5621118
407-0452640-1414754406-9685702-0163562
403-7617865-0366735407-5692596-2928364
408-5897470-5138724404-1500003-2290719
408-7795853-5203551405-6656663-7334729

Ship Service Level and Zip Code for these 20 Order IDs:

Order IDOrder ID
408-0897859-0872345406-5667488-4807518
404-5192536-7610731404-0385252-9830703
402-1319583-3594710402-4130969-7199506
405-0843296-7457920408-7947698-5720302
402-7607638-8078742402-4065058-6283510
404-3793288-3303541404-0625941-9181127
403-6876910-7017123171-7595817-4329100
405-6575169-1710717408-0894500-2480315
407-8320886-2189149404-4022285-8367511
403-8548938-0817920171-5184553-5321935

Upon reviewing the report requirements, you realize that you do not have an existing report template for these specifications, so you will need to create one. A cleaned dataset has already been generated by another member of your team and is stored in a CSV file on the department network drive.

You know that your director likes to receive one workbook per project rather than multiple files, so you decide to use several worksheets within the same workbook.

You’re ready to get started and have a plan for how you’ll proceed, which you outlined as follows:

  1. Download the dataset from Canvas. You will find it under Files > Spreadsheet Mastery Resources > Order Data.csv.
  2. Open the file, save it as an Excel Workbook (*.xlsx), and name it “Order Summary Report_Your Name”.
  3. Rename the worksheet to Data.
  4. Review the dataset and make note of anything you think will be helpful as you proceed.
  5. Note to self: EDA and cleaning already performed by a colleague.
  6. Add a new worksheet and name it Summary Report.
  7. Add another new worksheet and name it Sample Order Data.
  8. Reorder the worksheets so that the Summary Report is first, and the Data worksheet is last.
  9. Using simple formatting options, create a Summary Report template that looks like this mock-up.

Summary report template mock-up Summary report template mock-up

  1. Create a template for the Sample Order Data that looks like this

Sample order data template Sample order data template

Note

Be sure to save any changes as you work and apply number formatting for each metric as you populate the report. Follow the department formatting conventions using the short date format, rounding to whole numbers for total dollar amounts and percentages, and using two decimal places for all KPIs

Begin with populating the Summary Report

  1. Populate the Reporting Period Begin Date using the MIN function.
  2. Populate the Reporting Period End Date using the MAX function.
  3. Populate the Count of Days in Period by calculating the range between the begin and end dates.
  4. Populate the Report Preparation Date using the TODAY function.
  5. Use the COUNT function to populate the Total Orders cell.
Note

Count either the units or dollars column to get this total since the COUNT function only works on cells populated with numeric data. This is a correct approach because there is one row per order and there aren’t nulls in these columns.

  1. Use the SUM function to populate the Total Units cell.
  2. Use the SUM function to populate the Total Sales $ cell.
  3. Populate the Average Order Value (AOV) KPI using the AGGREGATE function.
Note

AOV = Sales $/Orders

  1. Populate the Average Unit Retail (AUR) KPI using cell references and a division formula.
Note

AUR = Sales $/Units

  1. Use the AVERAGE function to populate the Units Per Transaction (UPT) KPI.
Note

UPT = Units/Orders

  1. Populate the Sales Dollar amount for each Shipping Status Category by using the SUMIF formula.
Note

Start by creating the function for the Cancelled status. Don’t hard-code the shipping status, use cell references. Use a mix of absolute and relative references where appropriate so that you can use the fill option to populate the values for the remaining categories.

  1. Populate the % of Total for each Shipping Status Category using cell references and a division formula.
Note

% of Total = Category Sales $/Total Sales $. Use a mix of absolute and relative references where appropriate so that you can use the fill option to populate the values for the remaining categories.

  1. Populate the Quantity of Units for each Cancellation Status Category by using the SUMIF formula.
Note

You will need to hard-code the conditions for these using logical operators equal to Cancelled (“=Cancelled”) and not equal to Cancelled (“<>Cancelled”)

  1. Populate the % of Total for each Cancellation Status category using the same method as above.
  2. Validate that your basic functions and IF functions result in accurate data. Make corrections where needed. Validate that the percentage of total values for Shipping Status and Cancellation Status sum to 100%. If they do not, add more categories and formulas as appropriate and sort section to retain original sort order. Double check all date and number formats.
Note

Use the UNIQUE formula on the Shipping Status and Cancellation Status columns in the dataset to return a list of all distinct category options.

  1. Next, populate the Sample Order Data Report.
  2. Find the Top Destination Postal Code using the MODE.SINGL function.
  3. For the first set of twenty provided Order IDs, populate the Date, Status, Ship Service Level, Units, Dollars, City, State, and Zip Code using the VLOOKUP function.
  4. For the second set of twenty provided Order IDs, populate the Ship Service Level and Zip Code by combining the INDEX and MATCH functions.
  5. Validate that your VLOOKUP and INDEX/MATCH functions are returning the correct data. Make any necessary corrections to your formulas. Double-check all formats.

Submitting Your Work

Your director has asked that you submit the report for review as soon as you have it prepared and gave you a heads-up that added requirements will be provided as soon as the initial review is completed. You’re ready to submit this initial report, so you review the submission requirements just to be sure that you follow the specifications.

Upload the Microsoft Excel Workbook under Class 4 Exercises: Order Summary Report and click Submit.