Why doesn’t a model matrix necessarily have the same number of rows as the data frame?
mergem = model.matrix(as.formula(paste(response, '~ .')), data=mergef)
dim(mergef)
# [1] 115562 71
dim(mergem)
# [1] 66786 973
I tried looking for hints in the documentation but couldn’t find anything. Thanks in advance.
Well, if a row has NAs in it, that row is (by default) removed:
This behavior is controlled by the option “na.action”: