dplyr filter multiple columns for unique value in r

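To keep one row for each unique combination of values across several columns using dplyr, we can use the distinct() function with the .keep_all = TRUE argument. distinct() retains the first row it encounters for each combination, and .keep_all = TRUE keeps the remaining columns as well.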

Let's assume we have a data frame df with columns "A", "B", "C", and "D" and we want to filter for unique combinations of values in columns "A" and "B". We can do the following:


main.r
library(dplyr)

df_unique <- df %>% distinct(A, B, .keep_all = TRUE)

This will create a new data frame df_unique with one row for each unique combination of values in columns "A" and "B". Because .keep_all = TRUE is set, columns "C" and "D" are kept too, with their values taken from the first row in which each combination occurs.
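To make the behavior concrete, here is a minimal sketch on a small made-up data frame. The values in df are hypothetical and chosen only for illustration; df_pairs is an extra name introduced here to show what happens when .keep_all is left at its default.

```r
library(dplyr)

# Hypothetical example data; the values are invented for illustration.
df <- data.frame(
  A = c(1, 1, 2, 2, 3),
  B = c("x", "x", "y", "z", "z"),
  C = c(10, 20, 30, 40, 50),
  D = c("a", "b", "c", "d", "e")
)

# Keep the first row for each distinct (A, B) pair, retaining C and D.
df_unique <- df %>% distinct(A, B, .keep_all = TRUE)

# Without .keep_all = TRUE, only the columns named in distinct() are returned.
df_pairs <- df %>% distinct(A, B)

print(df_unique)
print(df_pairs)
```

Here df_unique has four rows: the second row (A = 1, B = "x", C = 20) is dropped because its (A, B) pair already appeared in the first row. df_pairs has the same four pairs but only columns "A" and "B".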

If we want to filter for unique combinations of values in more than two columns, we can list the additional column names in the distinct() call:


main.r
df_unique <- df %>% distinct(A, B, C, .keep_all = TRUE)

This will keep the first row for each unique combination of values in columns "A", "B", and "C".
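Adding a column to distinct() changes which rows count as duplicates. The sketch below compares the two-column and three-column calls on a small made-up data frame; the data and the names by_ab and by_abc are hypothetical and used only for illustration.

```r
library(dplyr)

# Hypothetical example data; the values are invented for illustration.
df <- data.frame(
  A = c(1, 1, 1, 2),
  B = c("x", "x", "x", "y"),
  C = c(10, 10, 20, 30),
  D = c("a", "b", "c", "d")
)

# Rows 1-3 share the same (A, B) pair, so only the first of them is kept.
by_ab <- df %>% distinct(A, B, .keep_all = TRUE)

# Row 3 differs in C, so it now counts as its own combination and is kept.
by_abc <- df %>% distinct(A, B, C, .keep_all = TRUE)

print(nrow(by_ab))
print(nrow(by_abc))
```

With this data, by_ab has two rows and by_abc has three, since the row with C = 20 is distinct once "C" is part of the combination.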
