summarize multiple variables by group in r

In order to summarize multiple variables by group in R, we can use the dplyr package.

Assuming we have a data.frame called my_data with variables group, var1, var2, var3, and var4, we can group the data by the group variable and then summarize the other variables using the summarize function.

main.r
library(dplyr)

my_data %>%
  group_by(group) %>%
  summarize(mean_var1 = mean(var1),
            sd_var1 = sd(var1),
            mean_var2 = mean(var2),
            sd_var2 = sd(var2),
            mean_var3 = mean(var3),
            sd_var3 = sd(var3),
            mean_var4 = mean(var4),
            sd_var4 = sd(var4))
322 chars
13 lines

In this code block, we first load the dplyr package. Then, we use the pipe operator %>% to chain a series of functions together. We first group the data by the group variable using group_by. We then summarize the data using summarize and calculate the mean and standard deviation of each variable for each group.

The resulting output will be a data.frame with one row for each group and one column for each summarized variable.

gistlibby LogSnag