The histogram is not very effective to display location and spread. The description of the data (summary statistics) in the text or table may be complemented by a graphical representation of the data: either a histogram, cumulative distribution or box-and-whisker plot. If the value is an outlier, you can exclude the value or the entire case from further statistical analysis by clicking Exclude on the Tools menu.įor multiple box-and-whisker plots see Multiple comparison graphs and Multiple variables graphs. If you double-click an observation, the spreadsheet window will open with the corresponding case highlighted. ![]() When you click an individual observation in the graph, the corresponding case is identified in a pop-up window (see also Select variable for case identification command). This enables you to obtain a diagram representing a statistical summary of the data without the disadvantage of concealing the real data. These values are plotted using a different marker in the warning color (see Format graph).Īs an option, you may select to plot all individual data points. A far out value is defined as a value that is smaller than the lower quartile minus 3 times the interquartile range, or larger than the upper quartile plus 3 times the interquartile range ( outer fences).An outside value is defined as a value that is smaller than the lower quartile minus 1.5 times the interquartile range, or larger than the upper quartile plus 1.5 times the interquartile range ( inner fences).The horizontal line extends from the minimum to the maximum value, excluding outside and far out values which are displayed as separate points. In the Box-and-whisker plot, the central box represents the values from the lower to upper quartile (25 to 75 percentile). This is the Box-and-Whisker plot for the variable Weight: You can choose between vertical and horizontal orientation of the Box-and-Whisker plot. If the data require a logarithmic transformation, then select the Logarithmic transformation option. The dialog box for Box-and-Whisker plot is similar to the one for Summary statistics: For n < 5 we recommend showing the individual data points.The Box-and-Whisker plot (Tukey, 1977), or boxplot, displays a statistical summary of a variable: median, quartiles, range and possibly extreme values.įor a detailed description of a Box-and-Whisker plot, see Construction of a Box-and-Whisker plot. Box plot construction requires a sample of at least n = 5 (preferably larger), although some software does not check for this. Outliers beyond the whiskers may be individually plotted. The 1.5 multiplier corresponds to approximately ☒.7σ (where σ is s.d.) and 99.3% coverage of the data for a normal distribution. As with the division of the box by the median, the whiskers are not necessarily symmetrical ( Fig. ![]() The use of quartiles for box plots is a well-established convention: boxes or whiskers should never be used to show the mean, s.d. Whiskers are conventionally extended to the most extreme data point that is no more than 1.5 × IQR from the edge of the box (Tukey style) or all the way to minimum and maximum of the data values (Spear style). The plot may be oriented vertically or horizontally-we use here (with one exception) horizontal boxes to maintain consistent orientation with corresponding sample distributions. A line inside the box shows the median, which is not necessarily central. The core element that gives the box plot its name is a box whose length is the IQR and whose width is arbitrary ( Fig.
0 Comments
Leave a Reply. |