Chapter 37: The GPLOT Procedure


Overview

The GPLOT procedure plots the values of two or more variables on a set of coordinate axes (X and Y). The coordinates of each point on the plot correspond to two variable values in an observation of the input data set. The procedure can also generate a separate plot for each value of a third (classification) variable. It can also generate bubble plots in which circles of varying proportions representing the values of a third variable are drawn at the data points.

The procedure produces a variety of two-dimensional graphs including

  • simple scatter plots

  • overlay plots in which multiple sets of data points display on one set of axes

  • plots against a second vertical axis

  • bubble plots

  • logarithmic plots (controlled by the AXIS statement).

In conjunction with the SYMBOL statement the GPLOT procedure can produce join plots, high-low plots, needle plots, and plots with simple or spline-interpolated lines. The SYMBOL statement can also display regression lines on scatter plots.

The GPLOT procedure is useful for

  • displaying long series of data, showing trends and patterns

  • interpolating between data points

  • extrapolating beyond existing data with the display of regression lines and confidence limits.

About Plots of Two Variables

Plots of two variables display the values of two variables as data points on one horizontal axis (X) and one vertical axis (Y). Each pair of X and Y values forms a data point.

The following figure shows a simple scatter plot that plots the values of the variable HEIGHT on the vertical axis and the variable WEIGHT on the horizontal axis. By default, the PLOT statement scales the axes to include the maximum and minimum data values and displays a plus sign (+) at each data point. It labels each axis with the name of its variable or an associated label and displays the value of each major tick mark.

click to expand
Figure 37.1: Scatter Plot of Two Variables (GPLVRBL1(a))

The program for this plot is in Example 4 on page 1126. For more information on producing scatter plots, see PLOT Statement on page 1101.

You can also overlay two or more plots (multiple sets of data points) on a single set of axes and you can apply a variety of interpolation techniques to these plots. See About Interpolation Methods on page 1085.

About Plots with a Classification Variable

Plots that use a classification variable produce a separate set of data points for each unique value of the classification variable and display all sets of data points on one set of axes.

The following figure shows multiple line plots that compare yearly temperature trends for three cities. The legend explains the values of the classification variable, CITY.

click to expand
Figure 37.2: Plot of Three Variables with Legend (GPLVRBL2(a))

By default, plots with a classification variable generate a legend. In the code that generates the plot for Example 8 on page 1135, a SYMBOL statement connects the data points and specifies the plot symbol that is used for each value of the classification variable (CITY). The program for this plot is in Example 8 on page 1135. For more information on how to produce plots with a classification variable, see PLOT Statement on page 1101.

About Bubble Plots

Bubble plots represent the values of three variables by drawing circles of varying sizes at points that are plotted on the vertical and horizontal axes. Two of the variables determine the location of the data points, while the values of the third variable control the size of the circles.

Figure 37.3 on page 1084 shows a bubble plot in which each bubble represents a category of engineer that is shown on the horizontal axis. The location of each bubble in relation to the vertical axis is determined by the average salary for the category. The size of each bubble represents the number of engineers in the category relative to the total number of engineers in the data.

By default, the BUBBLE statement scales the axes to include the maximum and minimum data values and draws an unlabeled circle at each data point. It labels each axis with the name of its variable or an associated label and displays the value of each major tick mark.

click to expand
Figure 37.3: Bubble Plot (GPLBUBL1)

The program for this plot is in Example 1 on page 1120. For more information on producing bubble plots, see BUBBLE Statement on page 1090.

About Plots with Two Vertical Axes

Plots with two vertical axes have a right vertical axis that can

  • display the same variable values as the left axis

  • display left axis values in a different scale

  • plot a second response (Y) variable, thereby producing one or more overlay plots.

In the following figure, the right axis displays the values of the vertical coordinates in a different scale from the scale that is used for the left axis.

click to expand
Figure 37.4: Plot with a Right Vertical Axis (GPLSCVL1)

The program for this plot is in Example 9 on page 1138. For more information on how to produce plots with a right vertical axis, see PLOT2 Statement on page 1115 and BUBBLE2 Statement on page 1098.

About Interpolation Methods

In addition to these graphs, you can produce other types of plots such as box plots or high-low-close plots by specifying various interpolation methods with the SYMBOL statement. Use the SYMBOL statement to

  • connect the data points with straight lines

  • specify regression analysis to fit a line to the points and, optionally , display lines for confidence limits

  • connect the data points to the zero line on the vertical axis

  • display the minimum and maximum values of Y at each X value and mark the mean value, display standard deviations that connect the data points with lines or bars, generate box plots, or plot high-low-close stock market data

  • specify that a pattern fill the polygon that is defined by data points

  • smooth plot lines with spline interpolation

  • use a step function to connect the data points

SYMBOL Statement on page 183 describes all interpolation methods.




SAS.GRAPH 9.1 Reference, Volumes I and II
SAS.GRAPH 9.1 Reference, Volumes I and II
ISBN: N/A
EAN: N/A
Year: 2004
Pages: 342

flylib.com © 2008-2017.
If you may any questions please contact us: flylib@qtcs.net