Stata Scatter Plot of Means Across Continuous Variable
Introduction
- Stata is a powerful and intuitive data analysis program.
- Learning how to graph in Stata is an important part of learning how to use Stata.
- Yet, the default graphs in Stata can sometimes be less than optimal.
- This document is an introduction to (a) basic graphing ideas in Stata; and (b) some simple ways to make your Stata graphs look more professional.
What are Variables?
- By variables, I simply mean the columns of data that you have.
- For our purposes, you may think of variables as synonymous with questionnaire items, or columns of data.
Variable Types
- categorical variables represent unordered categories like neighborhood, or religious affiliation, or place of residence.
- continuous variables represent a continuous scale like a mental health scale, or a measure of life expectancy.
A Data Visualization Strategy
Once we have discerned the type of variable that have, there are two followup questions we may ask before deciding upon a chart strategy:
- Is our graph about one thing at a time?
- How much of x is there?
- What is the distribution of x?
- Is our graph about two things at a time?
- What is the relationship of x and y?
- How are x and y associated?
Data
We are going to use the famous "iris" data collected by Edgar Anderson in the early 20th Century.
. use "iris.dta", clear . . summarize Variable │ Obs Mean Std. dev. Min Max ─────────────┼───────────────────────────────────────────────────────── Sepal_Length │ 150 5.843333 .8280661 4.3 7.9 Sepal_Width │ 150 3.057333 .4358663 2 4.4 Petal_Length │ 150 3.758 1.765298 1 6.9 Petal_Width │ 150 1.199333 .7622377 .1 2.5 Species │ 150 2 .8192319 1 3
The
iris
data set has 5 variables.
Species of Iris
Iris species images courtesy Wikipedia.
data:image/s3,"s3://crabby-images/fb819/fb8190df07c5541a60229619ecf3ab0fe47eaf0d" alt="Iris Species"
Petals and Sepals
data:image/s3,"s3://crabby-images/4e0f3/4e0f32320a6e2ba5709981e2e6f89ec7e31b7d68" alt="Petals and Sepals"
Basic Graphs
Continuous Variable histogram
. histogram Petal_Length (bin=12, start=1, width=.49166667)
data:image/s3,"s3://crabby-images/bf3fc/bf3fcfecc6f79ce90e4b0c12cb8e1efe53576cd3" alt="Histogram of Petal Width"
Categorical Variable graph bar
. graph bar, over(Species)
data:image/s3,"s3://crabby-images/c7d44/c7d44746bf54ba6de9e88962da6af728ddecde22" alt="Bar Graph of Species"
Continuous by Continuous twoway
. twoway scatter Petal_Length Petal_Width
data:image/s3,"s3://crabby-images/06504/0650479f984a505189e2ac04fc989d86ec89fdea" alt="Scatterplot"
Categorical by Categorical graph bar
. recode Petal_Length /// > (min/3.758 = 0 "below mean") /// > (3.758/max = 1 "above mean"), /// > generate(Petal_Group) // dichotomize Petal_Length (150 differences between Petal_Length and Petal_Group) . . graph bar, over(Species) over(Petal_Group)
data:image/s3,"s3://crabby-images/ac713/ac713a4f40466f5373668f3857757149013ce572" alt="Bar Graph of Species by Category of Petal Length"
Continuous by Categorical graph bar
. graph bar Petal_Length, over(Species)
data:image/s3,"s3://crabby-images/050f6/050f613da3e8acebeaa66935373dcc410a099acf" alt="Bar Graph of Petal Length by Species"
Titles and Labels , title(...) xtitle(...) ytitle(...)
. twoway scatter Petal_Length Petal_Width, scheme(s1rcolor) /// > title("Petal Length by Petal Width") /// > xtitle("Petal Width") ytitle("Petal Width") /// > caption("Iris Data")
data:image/s3,"s3://crabby-images/1d34f/1d34f64753ad99af177eb8bebb40e08a7bb06443" alt="Graph With Titles and Labels"
Better Graphing With Schemes ,scheme(...)
The easiest method to make better Stata graphs is through the use of predefined Stata graphing schemes.
Pre-Defined Schemes
Some schemes, e.g.economist
, sj
, s1color
, and s1rcolor
are pre-installed with Stata.
Economist Scheme
. twoway scatter Petal_Length Petal_Width, scheme(economist)
data:image/s3,"s3://crabby-images/4a258/4a258fd6a4d9e24893d009ddca765639a279f8b8" alt="Scatterplot with Economist Scheme"
Stata Journal Scheme
. twoway scatter Petal_Length Petal_Width, scheme(sj)
data:image/s3,"s3://crabby-images/b9aea/b9aea72b018c3524731cf9cc6be2daacbc889ddf" alt="Scatterplot with Stata Journal Scheme"
s1color
Scheme
. twoway scatter Petal_Length Petal_Width, scheme(s1color)
data:image/s3,"s3://crabby-images/5d1ec/5d1ec512ac573bf190e0b40b6d7fad656bd4ed74" alt="Scatterplot with s1color Scheme"
s1rcolor
Scheme
. twoway scatter Petal_Length Petal_Width, scheme(s1rcolor)
data:image/s3,"s3://crabby-images/c6509/c650939af390193e0ae170af12da1dfdb36ddada" alt="Scatterplot with s1rcolor Scheme"
User Written Schemes
Two of the best user written schemes are plottig
and lean2
.
Use the findit
command e.g.findit lean2
to find these schemes.
lean2
Scheme
. twoway scatter Petal_Length Petal_Width, scheme(lean2)
data:image/s3,"s3://crabby-images/c79af/c79af231e7fbc5684450d61ebd4ebe4286f0a99b" alt="Scatterplot with lean2 Scheme"
Michigan graph scheme
I have written a michigan
graph scheme described here.
. twoway (scatter Petal_Length Petal_Width) /// > (lfit Petal_Length Petal_Width), scheme(michigan)
data:image/s3,"s3://crabby-images/78e59/78e59e25aff92d7386827fb2030d505972a623ff" alt="Scatterplot with michigan Scheme"
Schemes as a Base for Further Tweaking
Schemes can be used as a base that can then be further modified.
. twoway (scatter Petal_Length Petal_Width, msymbol(0) mcolor(red)) /// > (lfit Petal_Length Petal_Width), /// > scheme(lean2) (note: named style 0 not found in class symbol, default attributes used)
data:image/s3,"s3://crabby-images/00cec/00cec9b882f320dbcbf7d5e2d234ee206fd3cf54" alt="Modified Scatterplot with lean2 Scheme as a Base"
Even More Tweaks
Based upon an example at https://blog.stata.com/2018/10/02/scheming-your-way-to-your-favorite-graph-style/
. twoway scatter Sepal_Length Sepal_Width Petal_Width Petal_Length, /// > color(%50 %50 %50) /// transparency > title("Multiple Iris Characteristics") /// title > scheme(s1rcolor) // scheme
data:image/s3,"s3://crabby-images/5f958/5f958ad0c972710a5af3a6d42b6f9b6460a1c911" alt="Modified Scatterplot with s1rcolor Scheme as a Base"
More Information
See also Two Page Stata
Created by agrogan@umich.edu
Source: https://agrogan1.github.io/Stata/stata-graphing/
0 Response to "Stata Scatter Plot of Means Across Continuous Variable"
Post a Comment