You are given a data set consisting of variables having more than 30% missing values? Let’s say, out of 50 variables, 8 variables have missing values higher than 30%. How will you deal with them?

Name: AnyTimeChat.in
Brand: AnyTimeChat.in
SKU: AnyTimeChat.in
Rating: 4.68 (233734 reviews)

February 3, 2024September 13, 2020 by priya

Assign a unique category to the missing values, who knows the missing values might uncover some trend.
We can remove them blatantly.
Or, we can sensibly check their distribution with the target variable, and if found any pattern we’ll keep those missing values and assign them a new category while removing others.