Statistics for business decision making and analysis robert stine and foster chapter 06

30 175 0
Statistics for business decision making and analysis robert stine and foster chapter 06

Đang tải... (xem toàn văn)

Tài liệu hạn chế xem trước, để xem đầy đủ mời bạn chọn Tải xuống

Thông tin tài liệu

Chapter Association between Quantitative Variables Copyright © 2011 Pearson Education, Inc 6.1 Scatterplots Is household natural gas consumption associated with climate?  Annual household natural gas consumption measured in thousands of cubic feet (MCF)  Climate as measured by the National Weather Service using heating degree days (HDD) of 30 Copyright © 2011 Pearson Education, Inc 6.1 Scatterplots Association between Numerical Variables  A graph displaying pairs of values as points on a two-dimensional grid  The explanatory variable is placed on the xaxis  The response variable is placed on the y-axis of 30 Copyright © 2011 Pearson Education, Inc 6.1 Scatterplots Scatterplot of Natural Gas Consumption (y) versus Heating Degree-Days (x) of 30 Copyright © 2011 Pearson Education, Inc 6.2 Association in Scatterplots Visual Test for Association  Compare the original scatterplot to others that randomly match the coordinates  If you can pick the original out as having a pattern, then there is an association of 30 Copyright © 2011 Pearson Education, Inc 6.2 Association in Scatterplots Describing Association Direction Does it trend up or down? Curvature Is the pattern linear or curved? Variation Are the points tightly clustered around the trend? Outliers Is there something unexpected? of 30 Copyright © 2011 Pearson Education, Inc 6.2 Association in Scatterplots Gas Consumption vs Heating Degree Days Direction: Positive Curvature: Linear Variation: Considerable scatter Outliers: None apparent of 30 Copyright © 2011 Pearson Education, Inc 6.3 Measuring Association Covariance x1 − x ) ( y1 − y ) + ( x2 − x ) ( y − y ) + L ( x n − x ) ( y n − y ) ( cov( x, y ) = n −1   A measure that quantifies the linear association Depends on units of measurement and is therefore difficult to interpret of 30 Copyright © 2011 Pearson Education, Inc 6.3 Measuring Association Correlation (r) cov( x, y ) corr( x, y ) = S x Sy    Standardized measure of the strength of the linear association (has no units) Always between -1 and +1 Easy to interpret 10 of 30 Copyright © 2011 Pearson Education, Inc 6.3 Measuring Association Scatterplot for r = 16 of 30 Copyright © 2011 Pearson Education, Inc 6.3 Measuring Association Correlation Matrix A table showing all of the correlations among a set of numerical variables 17 of 30 Copyright © 2011 Pearson Education, Inc 6.4 Summarizing Association with a Line Expressed using z-scores zˆ = rzx Slope-Intercept Form yˆ = a + bx a = y − bx and b = rsy / s x 18 of 30 Copyright © 2011 Pearson Education, Inc 6.4 Summarizing Association with a Line Line Relating Gas Consumption (y) to Heating Degree Days (x) yˆ = 42.6 + 0.0126 x 19 of 30 Copyright © 2011 Pearson Education, Inc 6.4 Summarizing Association with a Line Lines and Prediction  Use the correlation line to customize an ad for estimated savings from insulation based on climate  For a home in a cold climate (HDD = 8,800), the predicted gas consumption is 154 MCF  At $10 / MCF, the predicted cost is $1,540  Assuming that insulation saves 30% on gas bill, estimated savings is $462 20 of 30 Copyright © 2011 Pearson Education, Inc 6.5 Spurious Correlation Lurking Variables  Scatterplots and correlation reveal association, not causation  Spurious correlations result from underlying lurking variables 21 of 30 Copyright © 2011 Pearson Education, Inc 6.5 Spurious Correlation Checklist: Covariance and Correlation     Numerical variables No obvious lurking variables Linear Outliers 22 of 30 Copyright © 2011 Pearson Education, Inc 4M Example 6.1: LOCATING A NEW STORE Motivation Is it better to locate a new retail outlet far from competing stores? 23 of 30 Copyright © 2011 Pearson Education, Inc 4M Example 6.1: LOCATING A NEW STORE Method Is there an association between sales at the retail outlets and distance to nearest competitor? For 55 stores in the chain, data are gathered for total sales in the prior year and distance in miles from the nearest competitor 24 of 30 Copyright © 2011 Pearson Education, Inc 4M Example 6.1: LOCATING A NEW STORE Mechanics 25 of 30 Copyright © 2011 Pearson Education, Inc 4M Example 6.1: LOCATING A NEW STORE Mechanics Compute the correlation between sales and distance to be r = 0.741 26 of 30 Copyright © 2011 Pearson Education, Inc 4M Example 6.1: LOCATING A NEW STORE Message The data show a strong, positive linear association between distance to the nearest competitor and sales It is better to locate a new store far from its competitors 27 of 30 Copyright © 2011 Pearson Education, Inc Best Practices  To understand the relationship between two numerical variables, start with a scatterplot  Look at the plot, look at the plot, look at the plot  Use clear labels for the scatterplot 28 of 30 Copyright © 2011 Pearson Education, Inc Best Practices (Continued)  Describe a relationship completely  Consider the possibility of lurking variables  Use a correlation to quantify the association between two numerical variables that are linearly related 29 of 30 Copyright © 2011 Pearson Education, Inc Pitfalls  Don’t use the correlation if data are categorical  Don’t treat association and correlation as causation  Don’t assume that a correlation of zero means that the variables are not associated  Don’t assume that a correlation near -1 or +1 means near perfect association 30 of 30 Copyright © 2011 Pearson Education, Inc ... association between sales at the retail outlets and distance to nearest competitor? For 55 stores in the chain, data are gathered for total sales in the prior year and distance in miles from the nearest... Summarizing Association with a Line Lines and Prediction  Use the correlation line to customize an ad for estimated savings from insulation based on climate  For a home in a cold climate (HDD = 8,800),... measurement and is therefore difficult to interpret of 30 Copyright © 2011 Pearson Education, Inc 6.3 Measuring Association Correlation (r) cov( x, y ) corr( x, y ) = S x Sy    Standardized

Ngày đăng: 10/01/2018, 16:00

Từ khóa liên quan

Mục lục

  • PowerPoint Presentation

  • Slide 2

  • 6.1 Scatterplots

  • Slide 4

  • Slide 5

  • 6.2 Association in Scatterplots

  • Slide 7

  • Slide 8

  • 6.3 Measuring Association

  • Slide 10

  • Slide 11

  • Slide 12

  • Slide 13

  • Slide 14

  • Slide 15

  • Slide 16

  • Slide 17

  • 6.4 Summarizing Association with a Line

  • Slide 19

  • Slide 20

Tài liệu cùng người dùng

  • Đang cập nhật ...

Tài liệu liên quan