In the Titanic Dataset, engineering a new feature representing number of passengers in each class

anon27567014 · December 26, 2017, 9:41pm

How can I engineer a new feature representing the count of each passenger class and add it to the original dataframe?

toobamukhtar · April 15, 2019, 6:22pm

Rabeez · April 16, 2019, 1:47am

You can obtain the class-wise counts using a groupby+aggregate operation with the dplyr package:

c <- Titanic %>% group_by(Class) %>% summarise(count=n())

You can merge this new dataframe to the Titanic one like this:

merge(x = Titanic, y = c, by = "Class", all.x = TRUE)

This will ensure that the group counts are replicated for all appropriate records in the Titanic dataframe.