Customer Satisfaction

Santander Bank is asking Kagglers to help them identify dissatisfied customers early in their relationship. Doing so would allow Santander to take proactive steps to improve a customer's happiness before it's too late.

In this competition, you'll work with hundreds of anonymized features to predict if a customer is satisfied or dissatisfied with their banking experience.

Dataset Description

Dataset Source

The dataset comes from the Santander Customer Satisfaction competition on Kaggle.

Dataset Description

The dataset consists of 76,020 rows and 371 anonymized features. The "TARGET" column is the variable to predict. It equals 1 for dissatisfied customers and 0 for satisfied customers.
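As a minimal sketch of how the label can be separated from the features, the snippet below splits a frame on the "TARGET" column and measures the share of dissatisfied customers. The helper name `split_features_target`, the toy column names, and the `train.csv` file name mentioned in the comment are assumptions for illustration, not taken from the notebook.

```python
import pandas as pd


def split_features_target(df: pd.DataFrame, target: str = "TARGET"):
    """Separate the feature matrix from the label column."""
    X = df.drop(columns=[target])
    y = df[target]
    return X, y


# Tiny stand-in frame. The real data would be loaded with something like
# pd.read_csv("train.csv"); that file name is an assumption.
toy = pd.DataFrame(
    {
        "var_a": [0.0, 1.5, 3.0, 0.0],
        "var_b": [2, 2, 2, 2],
        "TARGET": [0, 0, 1, 0],
    }
)

X, y = split_features_target(toy)

# Because TARGET is 0/1, its mean is the fraction of dissatisfied customers.
dissatisfied_share = y.mean()
print(X.shape, dissatisfied_share)
```

Checking this share on the real data before modelling shows how balanced the two classes are, which helps when reading an accuracy figure later.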

Feature Engineering

Features with less than 2% variance were removed.
Features that were highly correlated with one another (above 90%) were removed.
Features with minimal correlation (below 0.1%) with the target were also removed.
The 59 most important features were then selected using ExtraTreeClassifier.
The final 16 features were selected using Recursive Feature Elimination, a chi-squared test, and model-based feature selection with Random Forest, Logistic Regression, and LightGBM.
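The filtering steps above can be sketched roughly as follows, using scikit-learn on synthetic data. This is an illustration under several assumptions rather than the notebook's code: the helper names are hypothetical, the thresholds are read literally as 0.02, 0.90, and 0.001, the tree-based ranking uses the ensemble `ExtraTreesClassifier` (the README names ExtraTreeClassifier), and only Recursive Feature Elimination with Logistic Regression stands in for the final stage. The chi-squared and LightGBM selectors are left out, and the feature counts are scaled down to fit the toy data.

```python
import numpy as np
import pandas as pd
from sklearn.datasets import make_classification
from sklearn.ensemble import ExtraTreesClassifier
from sklearn.feature_selection import RFE, VarianceThreshold
from sklearn.linear_model import LogisticRegression


def drop_low_variance(X, threshold=0.02):
    """Keep only columns whose variance exceeds the threshold."""
    selector = VarianceThreshold(threshold=threshold)
    selector.fit(X)
    return X.loc[:, selector.get_support()]


def drop_correlated(X, limit=0.90):
    """Drop the later column of every pair with |correlation| above the limit."""
    corr = X.corr().abs()
    # Keep only the strict upper triangle so each pair is inspected once.
    upper = corr.where(np.triu(np.ones(corr.shape, dtype=bool), k=1))
    to_drop = [col for col in upper.columns if (upper[col] > limit).any()]
    return X.drop(columns=to_drop)


def drop_weak_target_corr(X, y, minimum=0.001):
    """Drop columns whose |correlation| with the target is below the minimum."""
    strength = X.corrwith(y).abs()
    return X.loc[:, strength >= minimum]


def top_k_by_importance(X, y, k, seed=0):
    """Rank columns by extra-trees importance and keep the k best."""
    model = ExtraTreesClassifier(n_estimators=100, random_state=seed)
    model.fit(X, y)
    order = np.argsort(model.feature_importances_)[::-1][:k]
    return X.iloc[:, order]


def rfe_select(X, y, k):
    """Recursive Feature Elimination with a logistic regression estimator."""
    rfe = RFE(LogisticRegression(max_iter=1000), n_features_to_select=k)
    rfe.fit(X, y)
    return X.loc[:, rfe.support_]


# Synthetic stand-in for the real data, plus two columns the filters should catch.
data, labels = make_classification(
    n_samples=300, n_features=12, n_informative=4, n_redundant=2, random_state=0
)
X = pd.DataFrame(data, columns=[f"f{i}" for i in range(12)])
X["const"] = 1.0            # zero variance
X["dup"] = X["f0"] * 2.0    # perfectly correlated with f0
y = pd.Series(labels)

X1 = drop_low_variance(X)
X2 = drop_correlated(X1)
X3 = drop_weak_target_corr(X2, y)
X4 = top_k_by_importance(X3, y, k=6)   # the README keeps 59 at this stage
X5 = rfe_select(X4, y, k=3)            # the README keeps 16 at this stage
print(list(X5.columns))
```

Running the cheap statistical filters first keeps the model-based steps fast, since the tree ensemble and RFE only see the columns that survive.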

Classifier

Three classifiers, k-nearest neighbors (KNN), decision tree (DT), and random forest (RF), were fitted with hyperparameter tuning using GridSearchCV. Each reached an accuracy of approximately 86%.
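A minimal sketch of that tuning loop is shown below on synthetic data with 16 features. The parameter grids, the 3-fold cross-validation, and the 75/25 split are assumptions chosen for illustration. The README does not state which grids or splits were used, and the scores printed here say nothing about the real dataset.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in for the 16 selected features.
X, y = make_classification(
    n_samples=400, n_features=16, n_informative=6, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# Hypothetical grids: one small search space per classifier.
candidates = {
    "KNN": (KNeighborsClassifier(), {"n_neighbors": [3, 5, 11]}),
    "DT": (DecisionTreeClassifier(random_state=0), {"max_depth": [3, 5, None]}),
    "RF": (
        RandomForestClassifier(random_state=0),
        {"n_estimators": [50, 100], "max_depth": [5, None]},
    ),
}

results = {}
for name, (model, grid) in candidates.items():
    # Cross-validated grid search on the training split only.
    search = GridSearchCV(model, grid, scoring="accuracy", cv=3)
    search.fit(X_train, y_train)
    results[name] = {
        "best_params": search.best_params_,
        # score() evaluates the refitted best estimator with the same scorer.
        "test_accuracy": search.score(X_test, y_test),
    }
    print(name, results[name])
```

Holding out a test split that the grid search never sees keeps the reported accuracy separate from the data used to pick the hyperparameters.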

Manisha-Karim/Customer-Satisfaction
