Univariate Graphing - Wesleyan University
QAC 201: Introduction to Graphing in R
Prof. Nazzaro
Graphing in R with gpplot2
? The data set used to illustrate the ggplot2 commands is the HELP study (data name is HELPrct), which was a clinical trial for adult inpatients recruited from a detoxification unit. The variables that we use throughout this tutorial include depression (cesd), homelessness status (homeless), primary abuse substance (substance), patient's age (age), and patient's gender (sex).
Univariate Graphing
? Suppose we would like a plot of a single categorical variable.
ggplot(data=HELPrct)+ geom_bar(aes(x=substance))+ ggtitle("Primary abuse substance of subjects")
Primary abuse substance of subjects
150
count
100
50
0 alcohol
cocaine
substance
? Now for a plot of a single quantitative variable
ggplot(data=HELPrct)+ geom_histogram(aes(x=cesd))+ ggtitle("Depression Scores of Subjects")
Depression Scores of Subjects
30
count
20
10
0 0
20
40
cesd
heroin 60
1
QAC 201: Introduction to Graphing in R
ggplot(data=HELPrct)+ geom_density(aes(x=cesd))+ ggtitle("Depression Scores of Subjects")
Depression Scores of Subjects
0.03
density
0.02
0.01
0.00 0
20
40
cesd
Prof. Nazzaro
60
2
QAC 201: Introduction to Graphing in R
Prof. Nazzaro
Bivariate Graphing
C Q
? OPTION 1: Construct a bar plot with mean of response variable on y-axis.
ggplot(data=HELPrct)+ stat_summary(aes(x=substance, y=cesd), fun.y=mean, geom="bar")+ ylab("Depression")+ ggtitle("Mean Depression Scores at each Primary Abuse Substance")
Mean Depression Scores at each Primary Abuse Substance
30
Depression
20
10
0 alcohol
cocaine
substance
heroin
? OPTION 2: Boxplots
ggplot(data=HELPrct)+ geom_boxplot(aes(x=substance, y=cesd, fill=substance))+ ylab("Depression")+ ggtitle("Mean Depression Scores at each Primary Abuse Substance")
Mean Depression Scores at each Primary Abuse Substance
60
Depression
40
substance
alcohol
cocaine
20
heroin
0 alcohol
cocaine
substance
heroin
? OPTION 3: Density Plots
ggplot(data=HELPrct)+ geom_density(aes(x=cesd, color=substance))+ ylab("Depression")+ ggtitle("Mean Depression Scores at each Primary Abuse Substance")
3
QAC 201: Introduction to Graphing in R
Prof. Nazzaro
Mean Depression Scores at each Primary Abuse Substance
Depression
0.03
substance
0.02
alcohol
cocaine
heroin 0.01
0.00 0
20
40
60
cesd
? OPTION 4: Mean of Response with Error Bars
ggplot(data=HELPrct)+ stat_summary(aes(x=substance, y=cesd, color=substance), fun.data="mean_se", geom="errorbar", width=0.2)+ stat_summary(aes(x=substance, y=cesd, color=substance), fun.y="mean", geom="point")+ ylab("Depression")+ ggtitle("Mean Depression Scores at each Primary Abuse Substance with Standard Error")
Mean Depression Scores at each Primary Abuse Substance with Standard Error
36
Depression
34
substance
alcohol
32
cocaine
heroin 30
28 alcohol
cocaine
substance
heroin
4
QAC 201: Introduction to Graphing in R
Prof. Nazzaro
C C
? If you have a binary response variable (that is, a response variable that takes on two possible values) - you can display the proportion of participants at an indicated response level for each level of a categorical variable.
HELPrct$homeless_status[HELPrct$homeless=="homeless"] ................
................
In order to avoid copyright disputes, this page is only a partial summary.
To fulfill the demand for quickly locating and searching documents.
It is intelligent file search solution for home and business.
Related download
- data display in r for repeated measurements
- data visualization and graphics in r
- data visualization with ggplot2 cheat sheet
- a ggplot2 primer data action lab
- introduction to ggplot2
- chapter 2 r ggplot2 examples university of wisconsin
- lecture 3 bar graphs
- univariate graphing wesleyan university
- 6 r for graphs 2020
- data visualization stats and r
Related searches
- wesleyan financial group
- univariate and multivariate logistic regression
- univariate vs multivariate logistic regression
- univariate regression model
- univariate bivariate and multivariate data
- univariate and bivariate statistics
- univariate and multivariate
- univariate vs multivariate data
- univariate versus bivariate analysis
- univariate and multivariate statistics
- univariate analysis example
- univariate statistics definition