The dataset had numerous irrelevant and redundant columns, as well as missing values. Irrelevant columns were removed, and missing values were imputed using the mean, median, or mode for numerical variables and mode imputation for categorical variables