MechaCar_Statistical_Analysis

We analyzed data for AutosRUs’ newest prototype, the MechaCar, which is suffering from production issues that are blocking the manufacturing team’s progress. We helped the data analytics team to review the production data for insights that may help the manufacturing team.

Technologies Used:

Overview

Results

Linear Regression to Predict MPG

The MechaCar prototypes were produced using multiple design specifications to identify ideal vehicle performance. Multiple metrics, such as vehicle length, vehicle weight, spoiler angle, drivetrain, and ground clearance, were collected for each vehicle.

Figure 1. Multiple linear regression data for mpg vs. all independent variables

mpg_mult_line_regr.png

Figure 2. Linear regresson plots for mpg vs. each independent variable.

mpg_avplots.png

1. Which variables/coefficients provided a non-random amount of variance to the mpg values in the dataset?
2. Is the slope of the linear model considered to be zero? Why or why not?
3. Does this linear model predict mpg of MechaCar prototypes effectively? Why or why not?

Summary Statistics on Suspension Coils

The manufacturing data for all lots (Figure 1 below) shows that the suspension coil variance is 62.29356, which is below the limit of 100 pounds per square inch.

Figure 3 - Lot Summary of suspension coils

total_summary.png

However, the per-lot breakdown of the data shows Lot3 has a suspension coil variance of 170.2861224, which is above the limit of 100 pounds per square inch. Lot 1 (0.9897433) and Lot 2 (7.4693878) are within the variance limit.

Figure 4 - Total Summary of suspension coils

lot_summary.png


T-Tests on Suspension Coils

Using our knowledge of R, we performed t-tests to determine if all manufacturing lots and each lot individually are statistically different from the population mean of 1,500 pounds per square inch. The results are shown in Figures 5 through 8 below. The only lot with a statistical difference is Lot 3 (probably because of the high variance as shown in Figure 4 above).

Figure 5 - T-test for all lots vs. pop. mean of 1500 PSI - no statistical difference.

t_test_all_v_1500.png

Figure 6 - T-test for lot 1 vs. pop. mean of 1500 PSI - no statistical difference.

t_test_lot1_v_1500.png

Figure 7 - T-test for lot 2 vs. pop. mean of 1500 PSI - no statistical difference.

t_test_lot2_v_1500.png

Figure 8 - T-test for lot 3 vs. pop. mean of 1500 PSI - yes, a statistical difference.

t_test_lot3_v_1500.png


Study Design: MechaCar vs Competition

Using our knowledge of R, we designed a statistical study to compare performance of the MechaCar vehicles against performance of vehicles from other manufacturers.

Description

Summary

As has been the case for every Module in this bootcamp, this Challenge took much longer than the time estimate given in the online material. However, it was a very interesting assignment. I have only been exposed very lightly to statistics in college and/or grad school. Even though it was extremely short, this introduction taught me a lot.

In addition, my first impression of the R Programming Language was not very positive. However, by playing around with RStudio, the R and Rscript user-level commands, R Markdown files, Jupyter Notebook for R; and having seen the other online material about how programs can generate visually appealing documents via the Knit tool, etc.; I’m more impressed. If I have an opportunity to use R professionally in the future, I will look forward to learning more about it!