You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

2109 lines
3.2 MiB

2 years ago
{
"cells": [
{
"cell_type": "markdown",
"metadata": {},
"source": [
"___\n",
"\n",
"<a href='http://www.pieriandata.com'><img src='../Pierian_Data_Logo.png'/></a>\n",
"___\n",
"<center><em>Copyright by Pierian Data Inc.</em></center>\n",
"<center><em>For more information, visit us at <a href='http://www.pieriandata.com'>www.pieriandata.com</a></em></center>"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"# Supervised Learning Capstone Project - Tree Methods Focus - SOLUTIONS\n",
"\n",
"\n",
"\n",
"## Make sure to review the introduction video to understand the 3 ways of approaching this project exercise!\n",
"\n",
"----\n",
"\n",
"**Ways to approach the project:**\n",
" 1. Open a new notebook, read in the data, and then analyze and visualize whatever you want, then create a predictive model.\n",
" 2. Use this notebook as a general guide, completing the tasks in bold shown below.\n",
" 3. Skip to the solutions notebook and video, and treat project at a more relaxing code along walkthrough lecture series.\n",
"\n",
"------\n",
"------\n",
"\n",
"## GOAL: Create a model to predict whether or not a customer will Churn .\n",
"\n",
"----\n",
"----\n",
"\n",
"\n",
"## Complete the Tasks in Bold Below!\n",
"\n",
"## Part 0: Imports and Read in the Data\n",
"\n",
"**TASK: Run the filled out cells below to import libraries and read in your data. The data file is \"Telco-Customer-Churn.csv\"**"
]
},
{
"cell_type": "code",
"execution_count": 132,
"metadata": {},
"outputs": [],
"source": [
"# RUN THESE CELLS TO START THE PROJECT!\n",
"import numpy as np\n",
"import pandas as pd\n",
"import matplotlib.pyplot as plt\n",
"import seaborn as sns"
]
},
{
"cell_type": "code",
"execution_count": 133,
"metadata": {},
"outputs": [],
"source": [
"df = pd.read_csv('../DATA/Telco-Customer-Churn.csv')"
]
},
{
"cell_type": "code",
"execution_count": 134,
"metadata": {},
"outputs": [
{
"data": {
"text/html": [
"<div>\n",
"<style scoped>\n",
" .dataframe tbody tr th:only-of-type {\n",
" vertical-align: middle;\n",
" }\n",
"\n",
" .dataframe tbody tr th {\n",
" vertical-align: top;\n",
" }\n",
"\n",
" .dataframe thead th {\n",
" text-align: right;\n",
" }\n",
"</style>\n",
"<table border=\"1\" class=\"dataframe\">\n",
" <thead>\n",
" <tr style=\"text-align: right;\">\n",
" <th></th>\n",
" <th>customerID</th>\n",
" <th>gender</th>\n",
" <th>SeniorCitizen</th>\n",
" <th>Partner</th>\n",
" <th>Dependents</th>\n",
" <th>tenure</th>\n",
" <th>PhoneService</th>\n",
" <th>MultipleLines</th>\n",
" <th>InternetService</th>\n",
" <th>OnlineSecurity</th>\n",
" <th>...</th>\n",
" <th>DeviceProtection</th>\n",
" <th>TechSupport</th>\n",
" <th>StreamingTV</th>\n",
" <th>StreamingMovies</th>\n",
" <th>Contract</th>\n",
" <th>PaperlessBilling</th>\n",
" <th>PaymentMethod</th>\n",
" <th>MonthlyCharges</th>\n",
" <th>TotalCharges</th>\n",
" <th>Churn</th>\n",
" </tr>\n",
" </thead>\n",
" <tbody>\n",
" <tr>\n",
" <th>0</th>\n",
" <td>7590-VHVEG</td>\n",
" <td>Female</td>\n",
" <td>0</td>\n",
" <td>Yes</td>\n",
" <td>No</td>\n",
" <td>1</td>\n",
" <td>No</td>\n",
" <td>No phone service</td>\n",
" <td>DSL</td>\n",
" <td>No</td>\n",
" <td>...</td>\n",
" <td>No</td>\n",
" <td>No</td>\n",
" <td>No</td>\n",
" <td>No</td>\n",
" <td>Month-to-month</td>\n",
" <td>Yes</td>\n",
" <td>Electronic check</td>\n",
" <td>29.85</td>\n",
" <td>29.85</td>\n",
" <td>No</td>\n",
" </tr>\n",
" <tr>\n",
" <th>1</th>\n",
" <td>5575-GNVDE</td>\n",
" <td>Male</td>\n",
" <td>0</td>\n",
" <td>No</td>\n",
" <td>No</td>\n",
" <td>34</td>\n",
" <td>Yes</td>\n",
" <td>No</td>\n",
" <td>DSL</td>\n",
" <td>Yes</td>\n",
" <td>...</td>\n",
" <td>Yes</td>\n",
" <td>No</td>\n",
" <td>No</td>\n",
" <td>No</td>\n",
" <td>One year</td>\n",
" <td>No</td>\n",
" <td>Mailed check</td>\n",
" <td>56.95</td>\n",
" <td>1889.50</td>\n",
" <td>No</td>\n",
" </tr>\n",
" <tr>\n",
" <th>2</th>\n",
" <td>3668-QPYBK</td>\n",
" <td>Male</td>\n",
" <td>0</td>\n",
" <td>No</td>\n",
" <td>No</td>\n",
" <td>2</td>\n",
" <td>Yes</td>\n",
" <td>No</td>\n",
" <td>DSL</td>\n",
" <td>Yes</td>\n",
" <td>...</td>\n",
" <td>No</td>\n",
" <td>No</td>\n",
" <td>No</td>\n",
" <td>No</td>\n",
" <td>Month-to-month</td>\n",
" <td>Yes</td>\n",
" <td>Mailed check</td>\n",
" <td>53.85</td>\n",
" <td>108.15</td>\n",
" <td>Yes</td>\n",
" </tr>\n",
" <tr>\n",
" <th>3</th>\n",
" <td>7795-CFOCW</td>\n",
" <td>Male</td>\n",
" <td>0</td>\n",
" <td>No</td>\n",
" <td>No</td>\n",
" <td>45</td>\n",
" <td>No</td>\n",
" <td>No phone service</td>\n",
" <td>DSL</td>\n",
" <td>Yes</td>\n",
" <td>...</td>\n",
" <td>Yes</td>\n",
" <td>Yes</td>\n",
" <td>No</td>\n",
" <td>No</td>\n",
" <td>One year</td>\n",
" <td>No</td>\n",
" <td>Bank transfer (automatic)</td>\n",
" <td>42.30</td>\n",
" <td>1840.75</td>\n",
" <td>No</td>\n",
" </tr>\n",
" <tr>\n",
" <th>4</th>\n",
" <td>9237-HQITU</td>\n",
" <td>Female</td>\n",
" <td>0</td>\n",
" <td>No</td>\n",
" <td>No</td>\n",
" <td>2</td>\n",
" <td>Yes</td>\n",
" <td>No</td>\n",
" <td>Fiber optic</td>\n",
" <td>No</td>\n",
" <td>...</td>\n",
" <td>No</td>\n",
" <td>No</td>\n",
" <td>No</td>\n",
" <td>No</td>\n",
" <td>Month-to-month</td>\n",
" <td>Yes</td>\n",
" <td>Electronic check</td>\n",
" <td>70.70</td>\n",
" <td>151.65</td>\n",
" <td>Yes</td>\n",
" </tr>\n",
" </tbody>\n",
"</table>\n",
"<p>5 rows × 21 columns</p>\n",
"</div>"
],
"text/plain": [
" customerID gender SeniorCitizen Partner Dependents tenure PhoneService \\\n",
"0 7590-VHVEG Female 0 Yes No 1 No \n",
"1 5575-GNVDE Male 0 No No 34 Yes \n",
"2 3668-QPYBK Male 0 No No 2 Yes \n",
"3 7795-CFOCW Male 0 No No 45 No \n",
"4 9237-HQITU Female 0 No No 2 Yes \n",
"\n",
" MultipleLines InternetService OnlineSecurity ... DeviceProtection \\\n",
"0 No phone service DSL No ... No \n",
"1 No DSL Yes ... Yes \n",
"2 No DSL Yes ... No \n",
"3 No phone service DSL Yes ... Yes \n",
"4 No Fiber optic No ... No \n",
"\n",
" TechSupport StreamingTV StreamingMovies Contract PaperlessBilling \\\n",
"0 No No No Month-to-month Yes \n",
"1 No No No One year No \n",
"2 No No No Month-to-month Yes \n",
"3 Yes No No One year No \n",
"4 No No No Month-to-month Yes \n",
"\n",
" PaymentMethod MonthlyCharges TotalCharges Churn \n",
"0 Electronic check 29.85 29.85 No \n",
"1 Mailed check 56.95 1889.50 No \n",
"2 Mailed check 53.85 108.15 Yes \n",
"3 Bank transfer (automatic) 42.30 1840.75 No \n",
"4 Electronic check 70.70 151.65 Yes \n",
"\n",
"[5 rows x 21 columns]"
]
},
"execution_count": 134,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"df.head()"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## Part 1: Quick Data Check\n",
"\n",
"**TASK: Confirm quickly with .info() methods the datatypes and non-null values in your dataframe.**"
]
},
{
"cell_type": "code",
"execution_count": 135,
"metadata": {},
"outputs": [],
"source": [
"# CODE HERE"
]
},
{
"cell_type": "code",
"execution_count": 136,
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"<class 'pandas.core.frame.DataFrame'>\n",
"RangeIndex: 7032 entries, 0 to 7031\n",
"Data columns (total 21 columns):\n",
" # Column Non-Null Count Dtype \n",
"--- ------ -------------- ----- \n",
" 0 customerID 7032 non-null object \n",
" 1 gender 7032 non-null object \n",
" 2 SeniorCitizen 7032 non-null int64 \n",
" 3 Partner 7032 non-null object \n",
" 4 Dependents 7032 non-null object \n",
" 5 tenure 7032 non-null int64 \n",
" 6 PhoneService 7032 non-null object \n",
" 7 MultipleLines 7032 non-null object \n",
" 8 InternetService 7032 non-null object \n",
" 9 OnlineSecurity 7032 non-null object \n",
" 10 OnlineBackup 7032 non-null object \n",
" 11 DeviceProtection 7032 non-null object \n",
" 12 TechSupport 7032 non-null object \n",
" 13 StreamingTV 7032 non-null object \n",
" 14 StreamingMovies 7032 non-null object \n",
" 15 Contract 7032 non-null object \n",
" 16 PaperlessBilling 7032 non-null object \n",
" 17 PaymentMethod 7032 non-null object \n",
" 18 MonthlyCharges 7032 non-null float64\n",
" 19 TotalCharges 7032 non-null float64\n",
" 20 Churn 7032 non-null object \n",
"dtypes: float64(2), int64(2), object(17)\n",
"memory usage: 1.1+ MB\n"
]
}
],
"source": [
"df.info()"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"**TASK: Get a quick statistical summary of the numeric columns with .describe() , you should notice that many columns are categorical, meaning you will eventually need to convert them to dummy variables.**"
]
},
{
"cell_type": "code",
"execution_count": 137,
"metadata": {},
"outputs": [],
"source": [
"# CODE HERE"
]
},
{
"cell_type": "code",
"execution_count": 138,
"metadata": {},
"outputs": [
{
"data": {
"text/html": [
"<div>\n",
"<style scoped>\n",
" .dataframe tbody tr th:only-of-type {\n",
" vertical-align: middle;\n",
" }\n",
"\n",
" .dataframe tbody tr th {\n",
" vertical-align: top;\n",
" }\n",
"\n",
" .dataframe thead th {\n",
" text-align: right;\n",
" }\n",
"</style>\n",
"<table border=\"1\" class=\"dataframe\">\n",
" <thead>\n",
" <tr style=\"text-align: right;\">\n",
" <th></th>\n",
" <th>SeniorCitizen</th>\n",
" <th>tenure</th>\n",
" <th>MonthlyCharges</th>\n",
" <th>TotalCharges</th>\n",
" </tr>\n",
" </thead>\n",
" <tbody>\n",
" <tr>\n",
" <th>count</th>\n",
" <td>7032.000000</td>\n",
" <td>7032.000000</td>\n",
" <td>7032.000000</td>\n",
" <td>7032.000000</td>\n",
" </tr>\n",
" <tr>\n",
" <th>mean</th>\n",
" <td>0.162400</td>\n",
" <td>32.421786</td>\n",
" <td>64.798208</td>\n",
" <td>2283.300441</td>\n",
" </tr>\n",
" <tr>\n",
" <th>std</th>\n",
" <td>0.368844</td>\n",
" <td>24.545260</td>\n",
" <td>30.085974</td>\n",
" <td>2266.771362</td>\n",
" </tr>\n",
" <tr>\n",
" <th>min</th>\n",
" <td>0.000000</td>\n",
" <td>1.000000</td>\n",
" <td>18.250000</td>\n",
" <td>18.800000</td>\n",
" </tr>\n",
" <tr>\n",
" <th>25%</th>\n",
" <td>0.000000</td>\n",
" <td>9.000000</td>\n",
" <td>35.587500</td>\n",
" <td>401.450000</td>\n",
" </tr>\n",
" <tr>\n",
" <th>50%</th>\n",
" <td>0.000000</td>\n",
" <td>29.000000</td>\n",
" <td>70.350000</td>\n",
" <td>1397.475000</td>\n",
" </tr>\n",
" <tr>\n",
" <th>75%</th>\n",
" <td>0.000000</td>\n",
" <td>55.000000</td>\n",
" <td>89.862500</td>\n",
" <td>3794.737500</td>\n",
" </tr>\n",
" <tr>\n",
" <th>max</th>\n",
" <td>1.000000</td>\n",
" <td>72.000000</td>\n",
" <td>118.750000</td>\n",
" <td>8684.800000</td>\n",
" </tr>\n",
" </tbody>\n",
"</table>\n",
"</div>"
],
"text/plain": [
" SeniorCitizen tenure MonthlyCharges TotalCharges\n",
"count 7032.000000 7032.000000 7032.000000 7032.000000\n",
"mean 0.162400 32.421786 64.798208 2283.300441\n",
"std 0.368844 24.545260 30.085974 2266.771362\n",
"min 0.000000 1.000000 18.250000 18.800000\n",
"25% 0.000000 9.000000 35.587500 401.450000\n",
"50% 0.000000 29.000000 70.350000 1397.475000\n",
"75% 0.000000 55.000000 89.862500 3794.737500\n",
"max 1.000000 72.000000 118.750000 8684.800000"
]
},
"execution_count": 138,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"df.describe()"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"# Part 2: Exploratory Data Analysis\n",
"\n",
"## General Feature Exploration\n",
"\n",
"**TASK: Confirm that there are no NaN cells by displaying NaN values per feature column.**"
]
},
{
"cell_type": "code",
"execution_count": 139,
"metadata": {},
"outputs": [],
"source": [
"# CODE HERE"
]
},
{
"cell_type": "code",
"execution_count": 140,
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"customerID 0\n",
"gender 0\n",
"SeniorCitizen 0\n",
"Partner 0\n",
"Dependents 0\n",
"tenure 0\n",
"PhoneService 0\n",
"MultipleLines 0\n",
"InternetService 0\n",
"OnlineSecurity 0\n",
"OnlineBackup 0\n",
"DeviceProtection 0\n",
"TechSupport 0\n",
"StreamingTV 0\n",
"StreamingMovies 0\n",
"Contract 0\n",
"PaperlessBilling 0\n",
"PaymentMethod 0\n",
"MonthlyCharges 0\n",
"TotalCharges 0\n",
"Churn 0\n",
"dtype: int64"
]
},
"execution_count": 140,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"df.isna().sum()"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"**TASK:Display the balance of the class labels (Churn) with a Count Plot.**"
]
},
{
"cell_type": "code",
"execution_count": 141,
"metadata": {},
"outputs": [],
"source": [
"# CODE HERE"
]
},
{
"cell_type": "code",
"execution_count": 142,
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"<AxesSubplot:xlabel='Churn', ylabel='count'>"
]
},
"execution_count": 142,
"metadata": {},
"output_type": "execute_result"
},
{
"data": {
"image/png": "iVBORw0KGgoAAAANSUhEUgAAAYsAAAEGCAYAAACUzrmNAAAAOXRFWHRTb2Z0d2FyZQBNYXRwbG90bGliIHZlcnNpb24zLjMuMiwgaHR0cHM6Ly9tYXRwbG90bGliLm9yZy8vihELAAAACXBIWXMAAAsTAAALEwEAmpwYAAARQElEQVR4nO3debBedX3H8fdHotK6ESRFTKChmhkntq4p0trOtGBZrDXUhcFxSZFp2hnqaKeLWmfEorR2quLS1g5TaINLkbqB1pFmQNs6rcKNIrKUIeJCMmCCQSou1Oi3fzy/iw8hN78HvOc+N7nv18wzzznfszzfO3OTz/2dc55zUlVIkrQvD5p2A5Kkxc+wkCR1GRaSpC7DQpLUZVhIkrqWTbuBIRx22GG1evXqabchSfuVLVu23F5VK/a27IAMi9WrVzMzMzPtNiRpv5Lka3Mt8zCUJKnLsJAkdRkWkqQuw0KS1GVYSJK6DAtJUpdhIUnqMiwkSV2GhSSp64D8Bvd8ePqfXDjtFrQIbfnrl027BWkqHFlIkroMC0lS16BhkeSrSb6U5OokM612aJLNSW5q78tbPUnemWRrkmuSPG1sPxva+jcl2TBkz5Kk+1qIkcWvV9VTqmpdm38NcHlVrQEub/MAJwNr2msj8G4YhQtwFvAM4BjgrNmAkSQtjGkchloPbGrTm4BTxuoX1shngUOSHAGcCGyuql1VdQewGThpgXuWpCVt6LAo4N+SbEmysdUOr6pb2/RtwOFteiVwy9i221ptrvq9JNmYZCbJzM6dO+fzZ5CkJW/oS2d/paq2J/kZYHOS/xlfWFWVpObjg6rqPOA8gHXr1s3LPiVJI4OOLKpqe3vfAXyE0TmHb7TDS7T3HW317cCRY5uvarW56pKkBTJYWCR5WJJHzE4DJwDXApcCs1c0bQAuadOXAi9rV0UdC9zZDlddBpyQZHk7sX1Cq0mSFsiQh6EOBz6SZPZz3l9Vn0xyFXBxkjOArwGntvU/ATwb2Ap8FzgdoKp2JXkjcFVb7+yq2jVg35KkPQwWFlV1M/DkvdS/CRy/l3oBZ86xrwuAC+a7R0nSZPwGtySpy7CQJHUZFpKkLsNCktRlWEiSugwLSVKXYSFJ6jIsJEldhoUkqcuwkCR1GRaSpC7DQpLUZVhIkroMC0lSl2EhSeoyLCRJXYaFJKnLsJAkdRkWkqQuw0KS1GVYSJK6DAtJUpdhIUnqMiwkSV2GhSSpy7CQJHUZFpKkLsNCktRlWEiSugwLSVKXYSFJ6ho8LJIclOQLST7e5o9O8rkkW5N8IMlDWv2hbX5rW756bB+vbfUbk5w4dM+SpHtbiJHFK4Ebxub/Cji3qh4P3AGc0epnAHe0+rltPZKsBU4DngicBPxdkoMWoG9JUjNoWCRZBfwm8A9tPsBxwAfbKpuAU9r0+jZPW358W389cFFV3V1VXwG2AscM2bck6d6GHlm8HfhT4Edt/tHAt6pqd5vfBqxs0yuBWwDa8jvb+vfU97LNPZJsTDKTZGbnzp3z/GNI0tI2WFgkeQ6wo6q2DPUZ46rqvKpaV1XrVqxYsRAfKUlLxrIB9/1M4LlJng0cDDwSeAdwSJJlbfSwCtje1t8OHAlsS7IMeBTwzbH6rPFtJEkLYLCRRVW9tqpWVdVqRieor6iqFwOfAl7QVtsAXNKmL23ztOVXVFW1+mntaqmjgTXAlUP1LUm6ryFHFnN5NXBRkjcBXwDOb/Xzgfck2QrsYhQwVNV1SS4Grgd2A2dW1Q8Xvm1JWroWJCyq6tPAp9v0zezlaqaq+j7wwjm2Pwc4Z7gOJUn74je4JUldhoUkqcuwkCR1GRaSpC7DQpLUZVhIkroMC0lSl2EhSeoyLCRJXYaFJKnLsJAkdRkWkqQuw0KS1GVYSJK6DAtJUpdhIUnqMiwkSV2GhSSpy7CQJHUZFpKkLsNCktRlWEiSugwLSVKXYSFJ6jIsJEldhoUkqcuwkCR1GRaSpC7DQpLUZVhIkroMC0lS12BhkeTgJFcm+WKS65L8easfneRzSbYm+UCSh7T6Q9v81rZ89di+XtvqNyY5caieJUl7N+TI4m7guKp6MvAU4KQkxwJ/BZxbVY8H7gDOaOufAdzR6ue29UiyFjgNeCJwEvB3SQ4asG9J0h4GC4sauavNPri9CjgO+GCrbwJOadPr2zxt+fFJ0uoXVdXdVfUVYCtwzFB9S5Lua9BzFkkOSnI1sAPYDHwZ+FZV7W6rbANWtumVwC0AbfmdwKPH63vZZvyzNiaZSTKzc+fOAX4aSVq6JgqLJJdPUttTVf2wqp4CrGI0GnjC/W1wUlV1XlWtq6p1K1asGOpjJGlJWravhUkOBn4aOCzJciBt0SPZy1/3c6mqbyX5FPBLwCFJlrXRwypge1ttO3AksC3JMuBRwDfH6rPGt5EkLYDeyOL3gC2MRgRbxl6XAH+zrw2TrEhySJv+KeA3gBuATwEvaKttaPsCuLTN05ZfUVXV6qe1q6WOBtYAV07480mS5sE+RxZV9Q7gHUleUVXvup/7PgLY1K5cehBwcVV9PMn1wEVJ3gR8ATi/rX8+8J4kW4FdjK6AoqquS3IxcD2wGzizqn54P3uRJP0E9hkWs6rqXUl+GVg9vk1VXbiPba4BnrqX+s3s5Wqmqvo+8MI59nUOcM4kvUqS5t9EYZHkPcDjgKuB2b/qC5gzLCRJB46JwgJYB6xt5xAkSUvMpN+zuBZ4zJCNSJIWr0lHFocB1ye5ktFtPACoqucO0pUkaVGZNCzeMGQTkqTFbdKrof596EYkSYvXpFdDfZvR1U8AD2F0U8DvVNUjh2pMkrR4TDqyeMTs9NidYI8dqilJ0uJyv+862249/lHAhxBJ0hIx6WGo543NPojR9y6+P0hHkqRFZ9KroX5rbHo38FVGh6IkSUvApOcsTh+6EUnS4jXpw49WJflIkh3t9aEkq4ZuTpK0OEx6gvsfGT1X4rHt9bFWkyQtAZOGxYqq+seq2t1e/wT47FJJWiImDYtvJnlJkoPa6yWMHnkqSVoCJg2LlwOnArcBtzJ67OnvDNSTJGmRmfTS2bOBDVV1B0CSQ4G3MAoRSdIBbtKRxZNmgwKgqnaxl0emSpIOTJOGxYOSLJ+daSOLSUclkqT93KT/4b8V+O8k/9LmXwicM0xLkqTFZtJvcF+YZAY4rpWeV1XXD9eWJGkxmfhQUgsHA0KSlqD7fYtySdLSY1hIkroMC0lSl2EhSeoyLCRJXYaFJKnLsJAkdRkWkqSuwcIiyZFJPpXk+iTXJXllqx+aZHOSm9r78lZPkncm2ZrkmiRPG9vXhrb+TUk2DNWzJGnvhhxZ7Ab+qKrWAscCZyZZC7wGuLyq1gCXt3mAk4E17bUReDfcc9PCs4BnAMcAZ43f1FCSNLzBwqKqbq2qz7fpbwM3ACuB9cCmttom4JQ2vR64sEY+CxyS5AjgRGBzVe1qt0nfDJw0VN+SpPtakHMWSVYzev7F54DDq+rWtug24PA2vRK4ZWyzba02V12StEAGD4skDwc+BLyqqv53fFlVFVDz9Dkbk8wkmdm5c+d87FKS1AwaFkkezCgo3ldVH27lb7TDS7T3Ha2+HThybPNVrTZX/V6q6ryqWldV61asWDG/P4gkLXFDXg0V4Hzghqp629iiS4HZK5o2AJeM1V/Wroo6FrizHa66DDghyfJ2YvuEVpMkLZAhH436TOClwJeSXN1qfwa8Gbg4yRnA14BT27JPAM8GtgLfBU6H0fO+k7wRuKqtd3Z7BrgkaYEMFhZV9Rkgcyw+fi/rF3DmHPu6ALhg/rqTJN0fQ44sJA3g62f/wrRb0CJ01Ou/NOj+vd2HJKnLsJAkdRkWkqQuw0KS1GVYSJK6DAtJUpdhIUnqMiwkSV2GhSSpy7CQJHUZFpKkLsNCktRlWEiSugwLSVKXYSFJ6jIsJEldhoUkqcuwkCR1GRaSpC7DQpLUZVhIkroMC0lSl2EhSeoyLCRJXYaFJKnLsJAkdRkWkqQuw0KS1GVYSJK6DAtJUpdhIUnqGiwsklyQZEeSa8dqhybZnOSm9r681ZPknUm2JrkmydP
"text/plain": [
"<Figure size 432x288 with 1 Axes>"
]
},
"metadata": {
"needs_background": "light"
},
"output_type": "display_data"
}
],
"source": [
"sns.countplot(data=df,x='Churn')"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"<img src='fig1.png' >"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"**TASK: Explore the distrbution of TotalCharges between Churn categories with a Box Plot or Violin Plot.**"
]
},
{
"cell_type": "code",
"execution_count": 143,
"metadata": {},
"outputs": [],
"source": [
"# CODE HERE"
]
},
{
"cell_type": "code",
"execution_count": 144,
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"<AxesSubplot:xlabel='Churn', ylabel='TotalCharges'>"
]
},
"execution_count": 144,
"metadata": {},
"output_type": "execute_result"
},
{
"data": {
"image/png": "iVBORw0KGgoAAAANSUhEUgAAAZEAAAEKCAYAAADTgGjXAAAAOXRFWHRTb2Z0d2FyZQBNYXRwbG90bGliIHZlcnNpb24zLjMuMiwgaHR0cHM6Ly9tYXRwbG90bGliLm9yZy8vihELAAAACXBIWXMAAAsTAAALEwEAmpwYAAA9BElEQVR4nO3dd3hc9Zn3//c9M+rdttxtbFwgNhgMxrQQWigGEiAQlv0lG8Jm4XnSnt1nk2eB0AIBkhBIgGw2CSRsKCGBDd2AjYMNwRBj3HA3Nm6SLdmyujTS1Pv3x5yRZSzJKnPmqNyv69LF6MyZObcvJH3mW873K6qKMcYY0xs+rwswxhgzcFmIGGOM6TULEWOMMb1mIWKMMabXLESMMcb0moWIMcaYXnMtRETkcRHZLyLr2x0bJiKLRGSr898S57iIyCMisk1E1orISe1ec51z/lYRua7d8ZNFZJ3zmkdERNz6txhjjOmYmy2RPwAXf+rYzcBbqjoNeMv5HmAeMM35uhH4NSRCB7gTOBWYC9yZDB7nnBvave7T1zLGGOMy10JEVf8G1Hzq8OXAE87jJ4Ar2h1/UhOWAcUiMga4CFikqjWqWgssAi52nitU1WWauFvyyXbvZYwxJk0Cab7eKFWtcB5XAqOcx+OAsnbnlTvHujpe3sHxIxoxYoROmjSpx4UbY8xQtXLlygOqWtrRc+kOkTaqqiKSljVXRORGEt1kTJw4kRUrVqTjssYYMyiIyK7Onkv37Kx9TlcUzn/3O8f3ABPanTfeOdbV8fEdHO+Qqj6qqnNUdU5paYdhaowxphfSHSKvAMkZVtcBL7c7/jVnltZpQL3T7bUQuFBESpwB9QuBhc5zDSJymjMr62vt3ssYY0yauNadJSJ/As4BRohIOYlZVj8BnhORbwC7gGuc018HLgG2AUHgegBVrRGRHwEfOufdrarJwfpvkZgBlgO84XwZY4xJIxlqS8HPmTNHbUzEGGO6T0RWquqcjp6zO9aNMcb0moWIMcaYXrMQMcYY02sWIsaYQWmojfd6xULE9IqqctNNN7F48WKvSzHmMM888ww33HCD12UMCZ7dsW4GtnA4zAcffMDKlSs577zzvC7HmEM8+uijXpcwZFhLxPRKNBo95L/GmKHJQsT0SiQS8boEY0w/YCFieiUcDntdgjGmH7AQMb3S2trqdQnGmH7AQsT0SjAY9LoEY0w/YCFieqW5udnrEowx/YCFiOmVxsZGr0swxvQDFiKmV2pqatoe253BxgxdFiKmV9qHSFNTk4eVGNO5eDzudQmDnoWI6ZXKysoOHxvTn1iIuM9CxPTK3ooK1JcBWIiY/stWVHCfhYjpld27y4gWjQegrKzM42qM6VgsFvO6hEHPQsT0WF1dHU2NDcTyS5GsPHbt2uV1ScZ0yJbncZ+FiOmx7du3A+BvrCSSVcy2bZ94XJExHbMQcZ+FiOmxrVu3AuCLtBLLHcbOnTvsl9X0S/Zz6T4LEdNjmzdvBvGjPh/xvBHEYjE++cRaI6b/sYVC3WchYnps/foNxP2J/cxieaUAbNy40cuSjOmQLRTqPgsR0yNVVVVUVe0HfyYAmpmHZOWxYcMGjysz5nAtLS1elzDoWYiYHlm/fj0A6oQIIoRzR7Lmo7UeVmVMxyxE3GchYnpk7dq1iD8D9QXajsUKRlF9oMpuOjT9QvsbDG1JHvdZiJgeWfPRR0TzSkGk7VisYBSQCBhjvNY+OGy1afdZiJhua2xsZOeOHUQLRh9yPJ5TggSyLERMv1BfX9/2uKGhwcNKhgYLEdNt69atQ1WJ5Y869AnxEckbyeo1azypy5j26urqOnxs3OFJiIjI/xWRDSKyXkT+JCLZIjJZRD4QkW0i8qyIZDrnZjnfb3Oen9TufW5xjm8RkYu8+LcMJevXrwfxtU3rbS9WMJI95eX2S2s8V1tb2/a4/ZYFxh1pDxERGQf8H2COqh4H+IFrgZ8Cv1DVqUAt8A3nJd8Aap3jv3DOQ0RmOK+bCVwM/JeI+NP5bxlq1q1bRzxvOPgDhz2XbJ3YVF/jterqagAm5kc5UFXlcTWDn1fdWQEgR0QCQC5QAZwH/MV5/gngCufx5c73OM+fLyLiHP+zqoZUdQewDZibnvKHnmg0yuYtWxKD6h2I5Y0A8bFp06Y0V2bMoaqqqsjwwYT8KAeq9ntdzqCX9hBR1T3AA8BuEuFRD6wE6lQ1OTevHBjnPB4HlDmvjTrnD29/vIPXmBTbsWMHkXC4w64sAHwBNHcYG+zOdeOx/fv3U5INw7PjVNfU2p4iLvOiO6uERCtiMjAWyCPRHeXmNW8UkRUisqLKmre9smXLFoDOQwSI5A5ny5Yttue68dS+ykqGZ0YYnhUnrtrWvWXc4UV31ueBHapapaoR4AXgTKDY6d4CGA/scR7vASYAOM8XAdXtj3fwmkOo6qOqOkdV55SWdv5H0HRu69atSCATzSro9Jx47nCCzc1UVFSksTJjDlVZWcGI7BilOYmtce3n0V1ehMhu4DQRyXXGNs4HNgJLgKudc64DXnYev+J8j/P8Yk181H0FuNaZvTUZmAYsT9O/YcjZum0b0ZySQ24y/LRY7nAAW9HXeCYUClFdU0tpTpwR2YldDW0lBXd5MSbyAYkB8lXAOqeGR4GbgH8XkW0kxjx+77zk98Bw5/i/Azc777MBeI5EAC0Avq2qthemC1SV7dt3EMsp6fK8eE4xcHDTKmPSbd++fQCU5sQYkR1HsJaI2w6fq5kGqnoncOenDm+ng9lVqtoKfLmT97kXuDflBZpDVFVV0doSJD6y6xDBn4FkF7Jz58601GXMpyUDozQ7TsAHw3IsRNxmd6ybI0ruoR7PLj7iuZGsInbs2OluQcZ0oi1EchKdEiOyIlTs3etlSYOehYg5orKyxEzqeE7REc+NZxeyZ88e4vG422UZc5iKigoyfFCUmZghWJodo6Kiw/k2JkUsRMwRlZWVJWZmBXKOeG48u4hIJNzWN21MOlVWVjI8R/E58z9Kc+JU19TZNrkushAxR1ReXk4sq7DLmVlJ8exEa2XPHvv0Z9KvsmIvI7Iibd+PyI6jquzfb3euu8VCxBzR7rJyYl3cH9JePLsQsBAx3ti3bx8jsg92pQ7PirUdN+6wEDFdikajVO3fRzyrsFvna0Yu4g9YiJi0C4fD1NU3MDyrXYg4gWIrVbjHQsR0qbKykng83tbCOCIR4lmFlJeXu1uYMZ9y4MABAErahUjysXVnucdCxHQp2aLQbrZEAKKZ+ZRZiJg06yhEMv2Qnym2r4iLLERMl5Itinh298ZEADS7kIqKCpvma9IqudBicdahP3fFmfG2gDGpZyFiulReXo74M7o1vTcpnlVINBKxfmiTVsnWRnHmoSFSmBGh1loirrEQMV3as2dPYjykG9N7k5LTfJM3KRqTDrW1tfgE8jIO3YqgKFOprraWiFssREyXdu3eTbQH4yFg03yNN+rq6ijMkrYbDZMKM+PU19d7U9QQYCFiOhUOh9m/b19by6K7ktN8rSVi0qmuro78wOELeRdkKC2tIUKhkAdVDX4WIqZTFRUVqGqPQwQR4tlFFiImrRrq6ynoIETyMxJjJA0NDekuaUiwEDGdalt4sYfdWQDRzEJ27tqd6pKM6VRdXW1bYLSX74yRWIi4w0LEdKotRHraEiExLlK1fx+RSOTIJxuTAo2NDYcNqoO1RNxmIWI6VVZWhmTmQiCzx6+NZxcRj8fZa3s5mDRQVZqamskLdBAiAWuJuMlCxHRq166ez8xKsmm+Jp1aW1uJRGPkddCdleu0ThobG9Nd1pBgIWI6tWv37sQS8L1gIWLSKRkQHbVE8gKJYGlqakprTUOFhYjpUENDA02NDb0aDwEgkIlk5lqImLRIBkRHYyLZfvCJtUTcYiFiOnRwzaxehggQzSpg926boWXclwyI3A5aIiKQlykWIi6xEDEdSkWIxLKK2G0tEZMGyUHzjqb4QqKby0LEHRYipkO7d+8GETQrv9fvEc8uoqG+3n55jevaurM6aIkkjkft59AlFiKmQ+Xl5ZBdCD5/r99DbQ0tkyZtA+s
"text/plain": [
"<Figure size 432x288 with 1 Axes>"
]
},
"metadata": {
"needs_background": "light"
},
"output_type": "display_data"
}
],
"source": [
"sns.violinplot(data=df,x='Churn',y='TotalCharges')"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"<img src='fig2.png' >"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"**TASK: Create a boxplot showing the distribution of TotalCharges per Contract type, also add in a hue coloring based on the Churn class.**"
]
},
{
"cell_type": "code",
"execution_count": 145,
"metadata": {},
"outputs": [],
"source": [
"#CODE HERE"
]
},
{
"cell_type": "code",
"execution_count": 146,
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"<matplotlib.legend.Legend at 0x2d1eb25c100>"
]
},
"execution_count": 146,
"metadata": {},
"output_type": "execute_result"
},
{
"data": {
"image/png": "iVBORw0KGgoAAAANSUhEUgAAB+MAAALaCAYAAADqX/GDAAAAOXRFWHRTb2Z0d2FyZQBNYXRwbG90bGliIHZlcnNpb24zLjMuMiwgaHR0cHM6Ly9tYXRwbG90bGliLm9yZy8vihELAAAACXBIWXMAAB7CAAAewgFu0HU+AACimElEQVR4nOz9e5zdVX0v/r/WJNlkyOQqJIAiKhdF8NdajLZWUKzfqtXWmIDagkIr9YZy7PF2rOKprdZTtRdBbLV4oSYKSmJQ0WpbraBVjFRboYCAWKJIEm5JBgZ2JrN+f8xOGcZJJjPZs2cm83w+HvuxP5f1Xvs9xBHNa6/1KbXWAAAAAAAAAADt0zXZDQAAAAAAAADA/kYYDwAAAAAAAABtJowHAAAAAAAAgDYTxgMAAAAAAABAmwnjAQAAAAAAAKDNhPEAAAAAAAAA0GbCeAAAAAAAAABoM2E8AAAAAAAAALSZMB4AAAAAAAAA2kwYDwAAAAAAAABtJowHAAAAAAAAgDYTxgMAAAAAAABAmwnjAQAAAAAAAKDNhPEAAAAAAAAA0GbCeAAAAAAAAABoM2E8AAAAAAAAALTZ7MlugP1HKeWAJE9onW5JsnMS2wEAAAAAACbXrCQHt45/WGt9YDKbAeg0YTzt9IQkGya7CQAAAAAAYMpZnuR7k90EQCfZph4AAAAAAAAA2szKeNppy66D7373uzn00EMnsxcAAAAAAGAS/fznP8+Tn/zkXadb9jQWYH8kjKed/ucZ8Yceemge8YhHTGYvAAAAAADA1LFz9CEA+xfb1AMAAAAAAABAmwnjAQAAAAAAAKDNhPEAAAAAAAAA0GbCeAAAAAAAAABoM2E8AAAAAAAAALSZMB4AAAAAAAAA2kwYDwAAAAAAAABtJowHAAAAAAAAgDYTxgMAAAAAAABAmwnjAQAAAAAAAKDNhPEAAAAAAAAA0GbCeAAAAAAAAABoM2E8AAAAAAAAALSZMB4AAAAAAAAA2kwYDwAAAAAAAABtJowHAAAAAAAAgDYTxgMAAAAAAABAmwnjAQAAAAAAAKDNhPEAAAAAAAAA0GbCeAAAAAAAAABoM2E8AAAAAAAAALSZMB4AAAAAAAAA2mz2ZDcAAAAAU9nAwEC2bds22W20xcDAQLZv3/6Qa/Pnz09X1/T/rv6CBQv2i58DAACA/YcwHgAAAPZg27ZtWbFixWS3wSjWr1+fRYsWTXYbMC34ktH04EtGAADTnzAeAAAAAGYQXzKaHnzJCABg+vPVSgAAAAAAAABoM2E8AAAAAAAAALSZbeoBAABgDxYsWJD169dPdhttsXXr1pxxxhkPuXbRRRdl4cKFk9RR+yxYsGCyWwAAAICHEMYDAADAHnR1de3Xz+xduHDhfv3zAb/Il4ymB18yAgCY/oTxAAAAADCD+JIRAAB0hmfGAwAAAAAAAECbCeMBAAAAAAAAoM2E8QAAAAAAAADQZsJ4AAAAAAAAAGgzYTwAAAAAAAAAtJkwHgAAAAAAAADaTBgPAAAAAAAAAG0mjAcAAAAAAACANhPGAwAAAAAAAECbCeMBAAAAAAAAoM2E8QAAAAAAAADQZsJ4AAAAAAAAAGgzYTwAAAAAAAAAtJkwHgAAAAAAAADaTBgPAAAAAAAAAG0mjAcAAAAAAACANps92Q0AAACdMTAwkG3btk12G20xMDCQ7du3P+Ta/Pnz09W1f3zfeMGCBfvNzwIAAAAwUwnjAQBghti2bVtWrFgx2W2wF9avX59FixZNdhsAAAAA7ANLLQAAAAAAAACgzYTxAAAAAAAAANBmwngAAAAAAAAAaDPPjAemrWazmUajMdltAMC0sWDBgqxfv36y22iLrVu35owzznjItYsuuigLFy6cpI7aa8GCBZPdAgAAAAD7SBgPTEt9fX1585vfnPe+973p7u6e7HYAYFro6urKokWLJruNCbNw4cL9+ucDAAAAYHqxTT0wLa1evTo//OEPs2bNmsluBQAAAAAAAH6BMB6YdjZu3JhLLrkkSXLJJZdk48aNk9wRAAAAAAAAPJQwHphWaq0577zz0t/fnyTZsWNHzjvvvNRaJ7kzAAAAAAAAeJAwHphWrrjiimzYsOEh1zZs2JArr7xykjoCAAAAAACAXySMB6aNvr6+XHDBBSPeu+CCC9LX19fhjgAAAAAAAGBkwnhg2li9enU2b9484r1NmzZlzZo1He4IAAAAAAAARjbjw/hSSqOUclYp5SullJ+XUh4opfSWUm4opXy8lPLUvZznuaWUz5VSftqa46et8+eOoZfZpZRXlVKuLKVsKaX0lVJuLqV8uJRy3BjmOaiU8qellP8spWxrvf6zde1hezsPTCW33nprLrnkkj2Oufjii7Nx48YOdQQAAAAAAAC7N6PD+FLKEUn+PcnfJ/nNJIckaSSZl+SYJGcm+VYp5bxSStnNHF2llAuTfCnJiiQPb83x8Nb5l0opf19K2eM/61LKQUn+LcnfJnlakoOSzE3ymCSvSHJ1KeWsvfiZnpLkh0nOTfKEJPNbrye0rv2wlPLk0eaBqaTWmvPPPz/9/f17HNff35/zzjsvtdYOdQYAAAAAAAAjm7FhfCllTpLLk+xacf6fGQzffy2DwfyfJrm3de91Sd6ym6neneTlrePvJ/ndJE9uvX+/df2sJO/aQy+zknwuyfLWpXVJnpvkKUnOSbI5yQFJPrynlfallMOTfCGDXyroT/LeJCe1Xu9tXTs0yRdKKY/Y3Tww1VxxxRXZsGHDXo3dsGFDrrzyygnuCAAAAAAAAPZs9mQ3MIlekAeD+G8nObHWunPI/X8qpXy+dW9OkreUUt5fa/2fpbmllGOSvLF1+r0kJ9Va+1rnG1r130jypCRvKqV8rNZ60wi9nJHB1fBJ8qFa69lD7n23lPLlJFcnWZDkvFLKsUP7GOLdSQ5uHf9erfWzQ+5dWUq5OsklSZZm8MsBZ44wB0wpfX19ueCCC8ZUc8EFF2T58uXp7u6eoK4AAAAAAABgz2bsyvgkQ58F/55hQXySpNZ6dZIvtk4XJTl22JDX58EvNLxuSBC/q/6+DK6qT2vcH+2ml12B/l1J3jRCHzcleU/r9KgkLxw+ppRySJLTWqdfGRbE75rnM0m+0jp9aasGprTVq1dn8+bNY6rZtGlT1qxZM0EdAQAAAAAAwOhmchjfGHL84z2Mu3mkmtYz5F/QOr2+1vqdkYpb129onb5g+LPnW6vrd4X8n2kF+CP5xJDjXwjjk/xOHvzz/Phu5hg6T1erBqasZrOZdevWjat27dq1aTabbe4IAAAAAAAA9s5MDuNvGHL8mD2MO7L1XpPcOOT6o5Mc1jr+xiiftev+w5M8ati9p40w7hfUWm9P8qPW6a+PMGSv5hl2b6R5YMpoNBpZuXLluGpXrVqVRqMx+kAAAAAAAACYADM5jP90km2t47eUUmYNH1BKeWKS57VOP1Vr3Tbk9uOHHF8/ymcNvT98q/vxzHN4KWXebubZ2gruR1Rr/Xke/LmH97JHpZRH7OmVxLb3tN3pp5+epUuXjqlm2bJlOe2000YfCAAAAAAAABNkxobxtdY7krw0yX0ZXCG+oZTyslLKr5ZSnlVK+b8ZXEXeSPLvSd4wbIpHDDn+6Sgft3HI8eFtmKcMqxs6z2hzDJ1neC97U7en14Yxzgej6u7uztlnnz2mmrPPPjvd3d0T1BEAAAAAAACMbsaG8UlSa/18khOSXJjkl5NclOTbSf4pyZ9kMKh/fZITa62bhpXPH3LcO8pH3TvkuGeC5xltjqHzDJ8DpqSTTjopy5cv36uxy5cvz4knnjjBHQEAAAAAAMCezegwvpTSSPKyJC/I4Grz4ZYlOT3Js0a4N3fIcXOUj3pgyPHw5brtnme0OYbOM9alw4eP8tq7tBTGqJSSc845J7Nnz97juNmzZ+ecc85JKSP9OgMAAAAAAEDnzNgwvvXM9X9O8tYkS5K8N4PPUD8gycIkv5nkm0melGR9KeV/D5vi/iHHjVE+7oAhx30TPM9ocwydZ/gce1Rr/emeXkl2+6x62FeHH354XvziF+9xzEte8pIcfvhYn74AAAA
"text/plain": [
"<Figure size 2000x800 with 1 Axes>"
]
},
"metadata": {
"needs_background": "light"
},
"output_type": "display_data"
}
],
"source": [
"plt.figure(figsize=(10,4),dpi=200)\n",
"sns.boxplot(data=df,y='TotalCharges',x='Contract',hue='Churn')\n",
"plt.legend(loc=(1.1,0.5))"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"<img src='fig3.png' >"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"**TASK: Create a bar plot showing the correlation of the following features to the class label. Keep in mind, for the categorical features, you will need to convert them into dummy variables first, as you can only calculate correlation for numeric features.**\n",
"\n",
" ['gender', 'SeniorCitizen', 'Partner', 'Dependents','PhoneService', 'MultipleLines', \n",
" 'OnlineSecurity', 'OnlineBackup', 'DeviceProtection', 'TechSupport', 'InternetService',\n",
" 'StreamingTV', 'StreamingMovies', 'Contract', 'PaperlessBilling', 'PaymentMethod']\n",
"\n",
"***Note, we specifically listed only the features above, you should not check the correlation for every feature, as some features have too many unique instances for such an analysis, such as customerID***"
]
},
{
"cell_type": "code",
"execution_count": 147,
"metadata": {},
"outputs": [],
"source": [
"#CODE HERE"
]
},
{
"cell_type": "code",
"execution_count": 148,
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"Index(['customerID', 'gender', 'SeniorCitizen', 'Partner', 'Dependents',\n",
" 'tenure', 'PhoneService', 'MultipleLines', 'InternetService',\n",
" 'OnlineSecurity', 'OnlineBackup', 'DeviceProtection', 'TechSupport',\n",
" 'StreamingTV', 'StreamingMovies', 'Contract', 'PaperlessBilling',\n",
" 'PaymentMethod', 'MonthlyCharges', 'TotalCharges', 'Churn'],\n",
" dtype='object')"
]
},
"execution_count": 148,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"df.columns"
]
},
{
"cell_type": "code",
"execution_count": 149,
"metadata": {},
"outputs": [],
"source": [
"corr_df = pd.get_dummies(df[['gender', 'SeniorCitizen', 'Partner', 'Dependents','PhoneService', 'MultipleLines', 'InternetService',\n",
" 'OnlineSecurity', 'OnlineBackup', 'DeviceProtection', 'TechSupport','StreamingTV', 'StreamingMovies', 'Contract', 'PaperlessBilling',\n",
" 'PaymentMethod','Churn']]).corr()"
]
},
{
"cell_type": "code",
"execution_count": 150,
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"Contract_Two year -0.301552\n",
"StreamingMovies_No internet service -0.227578\n",
"StreamingTV_No internet service -0.227578\n",
"TechSupport_No internet service -0.227578\n",
"DeviceProtection_No internet service -0.227578\n",
"OnlineBackup_No internet service -0.227578\n",
"OnlineSecurity_No internet service -0.227578\n",
"InternetService_No -0.227578\n",
"PaperlessBilling_No -0.191454\n",
"Contract_One year -0.178225\n",
"OnlineSecurity_Yes -0.171270\n",
"TechSupport_Yes -0.164716\n",
"Dependents_Yes -0.163128\n",
"Partner_Yes -0.149982\n",
"PaymentMethod_Credit card (automatic) -0.134687\n",
"InternetService_DSL -0.124141\n",
"PaymentMethod_Bank transfer (automatic) -0.118136\n",
"PaymentMethod_Mailed check -0.090773\n",
"OnlineBackup_Yes -0.082307\n",
"DeviceProtection_Yes -0.066193\n",
"MultipleLines_No -0.032654\n",
"MultipleLines_No phone service -0.011691\n",
"PhoneService_No -0.011691\n",
"gender_Male -0.008545\n",
"gender_Female 0.008545\n",
"PhoneService_Yes 0.011691\n",
"MultipleLines_Yes 0.040033\n",
"StreamingMovies_Yes 0.060860\n",
"StreamingTV_Yes 0.063254\n",
"StreamingTV_No 0.128435\n",
"StreamingMovies_No 0.130920\n",
"Partner_No 0.149982\n",
"SeniorCitizen 0.150541\n",
"Dependents_No 0.163128\n",
"PaperlessBilling_Yes 0.191454\n",
"DeviceProtection_No 0.252056\n",
"OnlineBackup_No 0.267595\n",
"PaymentMethod_Electronic check 0.301455\n",
"InternetService_Fiber optic 0.307463\n",
"TechSupport_No 0.336877\n",
"OnlineSecurity_No 0.342235\n",
"Contract_Month-to-month 0.404565\n",
"Name: Churn_Yes, dtype: float64"
]
},
"execution_count": 150,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"corr_df['Churn_Yes'].sort_values().iloc[1:-1]"
]
},
{
"cell_type": "code",
"execution_count": 151,
"metadata": {},
"outputs": [
{
"data": {
"image/png": "iVBORw0KGgoAAAANSUhEUgAABo0AAAUYCAYAAAB9eJuxAAAAOXRFWHRTb2Z0d2FyZQBNYXRwbG90bGliIHZlcnNpb24zLjMuMiwgaHR0cHM6Ly9tYXRwbG90bGliLm9yZy8vihELAAAACXBIWXMAAB7CAAAewgFu0HU+AAEAAElEQVR4nOzdebgsVXk37N8Dh1kQIygqBI2KYpSI4jxhMBpFI0aNGvMpCGKMRozzkLwxiZJEX17FOCSOYIxTNEoEHOIAzgJO0QgiKgQcEQXhyMz6/qjanmK7h95D7959uO/rqqtXda2u9ezu3n32qV+vqmqtBQAAAAAAgOu2LSZdAAAAAAAAAJMnNAIAAAAAAEBoBAAAAAAAgNAIAAAAAACACI0AAAAAAACI0AgAAAAAAIAIjQAAAAAAAIjQCAAAAAAAgAiNAAAAAAAAiNAIAAAAAACACI0AAAAAAACI0AgAAAAAAIAIjQAAAAAAAIjQCAAAAAAAgAiNAAAAAAAAiNAIAAAAAACACI0AAAAAAACI0AgAAAAAAIAIjQAAAAAAAIjQCAAA4NdUVZtZJl3LQqrq4EGtx0y6HlhrVbX/4HfgpEnXAwAw7YRGAADrSFWdNDxYvYRl/0nXzuRU1Z2q6kVV9bGq+nZV/byqrqiq86vq61V1bFU9uapuMOla4bqiqt41+Iw+s6q2W+LjDx08/oqqusO4ap0Un10AAOuP0AgAgDVXVccMDoYePOl6plVV3a2qPpbkS0leluSAJLdKsnOSrZLskuT2SZ6Q5A1JflhVb6mq3SdTMQsxa2h0VXX24Lm6+aTrmcefJ/lp3751kr8Z9YFVdZMk/3dw18taa19fxdomymcXAMD6tWHSBQAAMK9Tk5wyYt/vj7MQ1p+qelaSV+TaXwS7It175vtJLkp34HWPJHdKsmWSbZIckuSxSbZfy3rhuqa1dn5V/XmSd/Z3Pauq3tNaO22Eh782XYCSJP+d5MgxlDgRPrsAANY3oREAwPp1YmvtJZMugvWnqv4hyfMHd52b5K+TvKe1tnGO/jdM8ogkL0pyiyRLOk0W61dr7Zgkx0y4DObRWntXVT0myUHpwo+3VNWdW2tXzveYqnpUut/XJLkqySEL9Z8mPrsAANY/p6cDAIApUlV/mGsfdP1Yktu11t4610HXJGmtXdBae1OS2yT5qyTXjL9SoPfUJD/v23dI8uL5OvbX7nnN4K5XtNa+PMba1ozPLgCA6SA0AgCAKVFVOyZ50+CuryY5sLV2ySiPb61d2Vp7aZIHj6E8YA6ttR8l+YvBXS+qqjvM0/2VSW7ct0/PEq6DtJ757AIAmB5CIwCAzVh1HlFVx1bVmVV1UVVdVlXnVtUHquqJVTXSKYurau+q+ouq+o+q+lZVXVxVV1bV+VV1WlW9sqput8g+zq6qluSJg7vfOrig/XB5yVyPHfXC91V1zKD/waP2qaqdq+qIqvpUVX2/qq7qt+88x+NX7fkd0VOS3KBvtyQHt9auWOpOWmsfXWh7/3M9uqreWVXfqapL+uU7VfWOqnpUVdVi41TVSYPnd//+vptU1Yuq6pSq+lFVXV1VFw4eM1WvSVXdqKoO6cf7SlX9rP+9uLCqzqiqt1bVgxbZxzH978VbB3c/cZ7fi5NmPfbgwbZjRqh3nK/tb1TV86vq1Kr6aVVdWlXfrao3V9XtF9vnCGPefGbMJHsONn1vnudq/wX2dfeqek1V/U9V/bx/j5xXVR+uqqdX1Q4rrXeotXZskg/1q1ulO03dlrNqemA2fTZek+RJrbXL56j9hlX17Kr6r/59fVn/fvtmVb22qvYbpab+vXBQ/7p/q6p+0f8+bqzu8/YTVfWPVXX/qlrpsYM1+ewaqqrfrap39e/By6rqgv4z5OlVtdUij91/vt+5BR7zq/feUvpU1e9U1dFV9Y3+86NV1QcG23/t376q2r2q/q6qvta/9hv7z5t/qqo9f31kAIDRuaYRAMBmqqr2SXJskjvOsXn3fnl4khdW1R+21r65wL7ek+TR82zepV/unOSIqjo6yXNaa1evoPyJqKp7pbto/R4j9F2153cJnjpof7y19rVV2Oe1VNWtk7w7yb5zbP6tfnlcki9V1R+11r67hH0/PF0wcoPF+g4es25fk6p6RpL/l+5aNbNdv19uk+TgqvpEkj9qrV2wkjFXYsyv7b36fd9s1qZb9MsTq+qprbU3Lqv4VdKHQW9O8pg5Nt+sXx6UbjbQoa21D83Rb7kOT/I/SXZKsl+SZyd5+aCufxn0fVVr7Qtz1P+0JC9L994a2qa/b+8kT62qtyZ56nzBTFXdOMn7k9xjjs3bpwvl9kxy/yTPS/J76U4nt1xj/+yaUVVbpzvF35NnbdomyX365ZCqelBr7afjqmMU1X054i8z92fIfI85KN11zGa/B27TL4dW1aNbayesTpUAwHWN0AgAYDNUVfdN8sF0ByeT5Mokpyb5dt++eZJ7J9k23UGmz1XVPVprp8+zy9/sb69K8s1+PxcmuTrJjZLcJd3B1kryzHQH5/5sjv0cm+SGSQ5Ictv+vo8nOWOOvqcs9nOuslsleVW6A3EXJ/lUkh+kCzjuO+w4hud3Uf23x39rcNc7lruvBcbYO8nJSXYd3P31dKeSaunChpnTat053c9139bamSPs/p5JXpJulsUF6Z7fn6Z7/8wVYiTr/DVJctNsOtj73XSnEzs/yWVJdk73XP12v/13k3ysqu4+x+yRjyW5JN3vxAH9fWek+92Y7dvLKXTMr+3tk/x9kusl+UmST6d7jW+W7ufeLt3z9M9V9fW5wpAR/SLJa/v2E5Ls2Lfflu79Mdv3hytVtX2STyS56+DuH/T1XpLu/XbvvtabJPnPqnpca+29y6z3Wlpr51XVc7MpHPqbqvpA/xwfme49miRnpQsSrqWqXpXkiMFdP03y+SQ/Sve+3jfda1FJnpTkplV1YGvtmln72TLJCele5xnf6JcL+33tluR30j0PK7IWn12zvCHdjK1rknwx3e/SFknunu53P0nulO5985Ax1zKv/r3w1/3qd9L9m/fLdO+DK+d52AOS/HO69+j/pnv9f5EumN0/3TGe7ZK8p6pu31r73pjKBwA2Z601i8VisVgsFss6WZKclO4AbkvykmXuY7ckPx7s59gkN5mj342T/Meg338n2XKeff59uplGO82zvZI8LN0B45n93XuBGo8Z9Dt4xJ/r7MFjbj5C/0XHmNXnyv72NUmuN6vfVkm2GNfzO+LP/yeDfbV0F5Bfzffe1tkUILT+Z3zAHP0emC4Ymen3pSRbjfB+vjLdQdy/nN0/yTbr7TUZPtcLPGdPSvL0JDdboM8+6cKrmf395QJ9Dx70O2bE123Rx6zBa3tZukD5WUk2zOq3R7pwaqbvJ1bp/Xr2YJ+Lfh70j3nd4DFXpQtgtpjV59ZJThv0u2jU/S+h9o8N9v+pdIHq1f36NUnuO897bVjTYXO9NulmBp036Pu8Ofo8fLD9B0nutkCtv53kH5LcdQU/77g/u/af9V5s6QKY287qV/1rPqzl157rOfZ50oh1jPKZMRz7ynQh3UFz9Bt+Jg7f65elCzj/JEnN8VoNX/u3rObzbLFYLBaL5bqzuKYRAMD69ZD+mhuLLbNPB/WydLM3kuTVrbUnttZ+OHvnrbUfpwuCPtHfdYckj5qrkNbaC1tr/95a+8U821tr7YPpgqMZf76En3U92JDkTa21p7dZF2dv3UXYZ76tv+rP74huPmhfk7lnZ63E49PNLEi6g5m/31r7tdNRte6aIg9Jd9A96b6x/7gR9r8hyV+11l7aWrvWt+jbHNdtGTxm3b4mrbW3tNZe01r7/gJ9/jvd7IAf9Xf9Wc26ls0aGPdru02SP2ut/b/W2lXDDa21c/t9tP6u/atqxbNXlqqqbpnuujozjmitHd1mzcJprX073anYzu7v2inJ/1nlcg5LsrFv3yfJh7PpesOva619ati5qnZMclS/ekWSB7bW3jT79yhJWmufTFf/Zf1dz+tnWA3dZ9D+P621L85XaGvtf1prL2itrWTm580H7XF8dg1tk2423u+21q4
"text/plain": [
"<Figure size 2000x800 with 1 Axes>"
]
},
"metadata": {
"needs_background": "light"
},
"output_type": "display_data"
}
],
"source": [
"plt.figure(figsize=(10,4),dpi=200)\n",
"sns.barplot(x=corr_df['Churn_Yes'].sort_values().iloc[1:-1].index,y=corr_df['Churn_Yes'].sort_values().iloc[1:-1].values)\n",
"plt.title(\"Feature Correlation to Yes Churn\")\n",
"plt.xticks(rotation=90);"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"<img src='figbar.png'>"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"---\n",
"---\n",
"\n",
"# Part 3: Churn Analysis\n",
"\n",
"**This section focuses on segementing customers based on their tenure, creating \"cohorts\", allowing us to examine differences between customer cohort segments.**"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"**TASK: What are the 3 contract types available?**"
]
},
{
"cell_type": "code",
"execution_count": 152,
"metadata": {},
"outputs": [],
"source": [
"# CODE HERE"
]
},
{
"cell_type": "code",
"execution_count": 153,
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"array(['Month-to-month', 'One year', 'Two year'], dtype=object)"
]
},
"execution_count": 153,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"df['Contract'].unique()"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"**TASK: Create a histogram displaying the distribution of 'tenure' column, which is the amount of months a customer was or has been on a customer.**"
]
},
{
"cell_type": "code",
"execution_count": 154,
"metadata": {},
"outputs": [],
"source": [
"#CODE HERE"
]
},
{
"cell_type": "code",
"execution_count": 155,
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"<AxesSubplot:xlabel='tenure', ylabel='Count'>"
]
},
"execution_count": 155,
"metadata": {},
"output_type": "execute_result"
},
{
"data": {
"image/png": "iVBORw0KGgoAAAANSUhEUgAABqUAAALaCAYAAAC4dVmkAAAAOXRFWHRTb2Z0d2FyZQBNYXRwbG90bGliIHZlcnNpb24zLjMuMiwgaHR0cHM6Ly9tYXRwbG90bGliLm9yZy8vihELAAAACXBIWXMAAB7CAAAewgFu0HU+AABmvUlEQVR4nOz9e5hlVX0n/r8/gNCIEC8oRBAv2AyYNAlfgShe0OiYMcYRI9FoEiVixEQwOtGoQRNNwjheMqMy0S+KRp18o2NUxDHwi5dEVNCIjpEkCN3gDS8geAsCDTas3x9nF705VNWp6q5dt/N6Pc95ztpnr/3Z6zS9qerzPmvtaq0FAAAAAAAAhrTLSg8AAAAAAACA9U8oBQAAAAAAwOCEUgAAAAAAAAxOKAUAAAAAAMDghFIAAAAAAAAMTigFAAAAAADA4IRSAAAAAAAADE4oBQAAAAAAwOCEUgAAAAAAAAxOKAUAAAAAAMDghFIAAAAAAAAMTigFAAAAAADA4IRSAAAAAAAADE4oBQAAAAAAwOCEUgAAAAAAAAxOKAUAAAAAAMDgdlvpAbA6VNUeSTZ1m1cnuXkFhwMAAAAAAKysXZPcvWv/S2vtxp0tKJRixqYkF670IAAAAAAAgFXnqCSf39kilu8DAAAAAABgcGZKMePqmcbnPve5/PRP//RKjgUAAAAAAFhB3/nOd3L00UfPbF49X9+FEkox49Z7SP30T/90DjzwwJUcCwAAAAAAsHrcPLnLZJbvAwAAAAAAYHBCKQAAAAAAAAYnlAIAAAAAAGBwQikAAAAAAAAGJ5QCAAAAAABgcEIpAAAAAAAABieUAgAAAAAAYHBCKQAAAAAAAAYnlAIAAAAAAGBwQikAAAAAAAAGJ5QCAAAAAABgcEIpAAAAAAAABieUAgAAAAAAYHBCKQAAAAAAAAYnlAIAAAAAAGBwQikAAAAAAAAGJ5QCAAAAAABgcEIpAAAAAAAABieUAgAAAAAAYHBCKQAAAAAAAAYnlAIAAAAAAGBwQikAAAAAAAAGJ5QCAAAAAABgcEIpAAAAAAAABrfbSg8AVrOtW7dmy5Yty3KujRs3ZsOGDctyLgAAAAAAWG5CKZjHli1bcvjhhy/LuS666KJs2rRpWc4FAAAAAADLzfJ9AAAAAAAADM5MKVigY046LXvvd9CS1rz2qm/kgjNOXdKaAAAAAACwGgmlYIH23u+g3PmAg1d6GAAAAAAAsCZZvg8AAAAAAIDBCaUAAAAAAAAYnFAKAAAAAACAwQmlAAAAAAAAGJxQCgAAAAAAgMEJpQAAAAAAABicUAoAAAAAAIDBCaUAAAAAAAAYnFAKAAAAAACAwQmlAAAAAAAAGJxQCgAAAAAAgMEJpQAAAAAAABicUAoAAAAAAIDBCaUAAAAAAAAYnFAKAAAAAACAwQmlAAAAAAAAGJxQCgAAAAAAgMEJpQAAAAAAABicUAoAAAAAAIDBCaUAAAAAAAAYnFAKAAAAAACAwQmlAAAAAAAAGNzUh1JVtXtVPauq/r6qvlNVN1bVj6vq0qr6q6o6ZoF1HltVZ1XVN7sa3+y2H7uIsexWVc+pqk9V1dVVdUNVXV5VZ1TVz+z4uwQAAAAAAFhZu630AFZSVd07yd8lGQ98dk9ySPc4oapOT/L7rbU2S41dkrwlyYljuw7oHsdV1ZlJTmqt3TLPWPZNck6So8Z23S/Js5M8o6pObq2dudD3BwAAAAAAsFpM7UypqrpDbhtIXZTkhCQPTvKYJH+a5Lpu3ylJXjxHqdOyPZD6YpKnJjm6e/5i9/qzkvz5PGPZNclZ2R5IfSDJY5P8QpLnJflukj2SnLGYmVcAAAAAAACrxTTPlHpCtgdSn0nysNbazb39H62qD3X77pDkxVX1utbatpkOVXVIkhd2m59P8vDW2g3d9oXd8eclOTLJi6rq7a21y2YZyzOSPLRrv6m19tzevs9V1blJvpBknyRvrKrD+uMAAAAAAABY7aZ2plSS/r2iXjUWSCVJWmtfSPLhbvPOSQ4b6/L8bA/2TukFUjPHX5/RLKt0/V4wx1hmgq3vJ3nRLOO4LMmrus37J3niHHUAAAAAAABWpWkOpXbvtb8yT7/LZzumqiqj2VZJcklr7bOzHdy9fmm3+YTuuFt1s61mwq73dkHWbN7RawulAAAAAACANWWaQ6lLe+37zdPv4O65JdnSe/2+Se7Ztc+bcK6Z/Qckuc/YvofO0u92WmtXJtncbT5kwvkAAAAAAABWlWkOpd6d5N+79ouratfxDlV1RJLHdZt/01r7997uB/Tal0w4V3//+BKAO1LnXlW114S+AAAAAAAAq8Zuk7usT621a6rqtzIKpx6S5MKqen1Gs5Hu1L32Bxkt2fd/u3bfgb32Nyec7ope+15LUKe64y6dp+9tVNWBE7rsv9BaAAAAAAAAizW1oVSStNY+VFUPzChwOjHJO8e6XJXk5UneOsu9nvbutX884VTX9dp3GqjOJFdM7gIAAAAAADCMaV6+L1W1e5KnJ3lCRrOPxu2X5DeTPHqWfRt67ZsmnOrGXnvPgeoAAAAAAACsWlMbSnX3ZPpYkpcmuWuS12R0v6c9kvxUksck+XSSI5N8sKr+y1iJrb327hNOt0evfcNAdSa514THUYusBwAAAAAAsGDTvHzfK5I8rGuf2FrrL913U5KPVtU/JvlIkkcmeW1Vfby19qWuz7W9/pOW0tur1x5fom+8ztbMbb4682qtzXu/qqrZJooBAAAAAAAsjamcKVWjBOaZ3ebmsUDqVq21bRndUyoZ/Vmd0NvdD3kOnHDKe/Xa4/d22pE6bew4AAAAAACAVW0qQ6mM7hV11679xQl9v9BrH9prXzzH67Pp7//y2L4dqXNFa+26CX0BAAAAAABWjWkNpbb12pOWMLzDHMd9Ncm3u/axE2o8vHv+VpKvje37dK89Z52q2j/JId3m+RPOBwAAAAAAsKpMayj1/ST/3rUfXFXzBVP9oOirM43WWktydrd5aFU9aLaDu9dnZjid3R13q9ba5myfPfXkqrrjHOM4odc+a57xAgAAAAAArDpTGUq11m5J8nfd5j2TnDpbv6q6S5JX91768FiX1ye5uWufXlV7jh2/Z5LTu81tXf/ZvK57vmuS18wyjoOTvLTbvCxCKQAAAAAAYI2ZylCq86dJru/ar6iqD1XVk6rqiKp6cFW9IMk/J3lA1+fjrbWP9At0s5xe220emeT8qnpKVR1ZVU/JaJm9I7v9r22tbZljLO/M9iX5nltV76uqX6qqo6vq5CQXJNknyS1Jntda2zZHHQAAAAAAgFVp0v2U1q3W2iVV9YQk706yb5LHd4/Z/EOSX5tj36lJ7pHkmUmOSPKeWfq8LcnL5hnLzVV1XJJzkhyV5Endo+/GJCe31s6dqw4AAAAAAMBqNc0zpdJa+1hG93t6cZJPJLk6yU+S3JDR/aPem+S4JI9urf1gjhq3tNZOTPK4jO4x9e0kN3XPZyf55dbas7olA+cbyzVJjknye0k+neR7SbYm+UqStyZ5YGvtzJ14uwAAAAAAACtmamdKzWitfS+j+zjd7l5Oi6xzTkYznXamxrYkb+4eAAAAAAAA68ZUz5QCAAAAAABgeQilAAAAAAAAGJxQCgAAAAAAgMEJpQAAAAAAABicUAoAAAAAAIDBCaUAAAAAAAAYnFAKAAAAAACAwQmlAAAAAAAAGJxQCgAAAAAAgMEJpQAAAAAAABicUAoAAAAAAIDBCaUAAAAAAAAYnFAKAAAAAACAwQmlAAAAAAAAGJxQCgAAAAAAgMEJpQAAAAAAABicUAoAAAAAAIDBCaUAAAAAAAAYnFAKAAAAAACAwQmlAAAAAAAAGJxQCgAAAAAAgMEJpQAAAAAAABicUAoAAAAAAIDBCaUAAAAAAAAYnFAKAAAAAACAwQmlAAAAAAAAGJxQCgAAAAAAgMEJpQAAAAAAABicUAoAAAAAAIDBCaUAAAAAAAAYnFAKAAAAAACAwQmlAAAAAAAAGJxQCgAAAAAAgMEJpQAAAAAAABicUAoAAAAAAIDBCaUAAAAAAAAYnFAKAAAAAACAwQm
"text/plain": [
"<Figure size 2000x800 with 1 Axes>"
]
},
"metadata": {
"needs_background": "light"
},
"output_type": "display_data"
}
],
"source": [
"plt.figure(figsize=(10,4),dpi=200)\n",
"sns.histplot(data=df,x='tenure',bins=60)"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"<img src=\"fig5.png\">"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"**TASK: Now use the seaborn documentation as a guide to create histograms separated by two additional features, Churn and Contract.**"
]
},
{
"cell_type": "code",
"execution_count": 156,
"metadata": {},
"outputs": [],
"source": [
"#CODE HERE"
]
},
{
"cell_type": "code",
"execution_count": 157,
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"<Figure size 2000x600 with 0 Axes>"
]
},
"metadata": {},
"output_type": "display_data"
},
{
"data": {
"image/png": "iVBORw0KGgoAAAANSUhEUgAABDAAAALICAYAAACJhQBYAAAAOXRFWHRTb2Z0d2FyZQBNYXRwbG90bGliIHZlcnNpb24zLjMuMiwgaHR0cHM6Ly9tYXRwbG90bGliLm9yZy8vihELAAAACXBIWXMAAAsTAAALEwEAmpwYAABUn0lEQVR4nO3debhsZ10n+u+PJIfQJJwMnD6dCZNAhA4qB4iHQdoOoHag1URFhseWaMdO3zY4XEdQr41t9xWcggpipwEJXmRsMBG5wZiESYGQkEOSQwRiSMhEJiGg3JDpvX/U2kmdSu1dtfep2nvVOZ/P89SzV721atWvVu397qpvvetd1VoLAAAAQJ89bKMLAAAAAJhEgAEAAAD0ngADAAAA6D0BBgAAANB7AgwAAACg9wQYAAAAQO8JMAAAAIDeE2AsqKr6V1X19qr6h6q6tKreX1XfXFUnVtX7Nrq+UVX1Y1V1f1V921DblVV19Cq28cqq+rFlbtteVR+uqs9W1WVV9Yaq+hdrqPOgqvrJ1d5vhe2dWFXPnNG2rq2qj4y07aiqK3djm78ytHz0NNua9T7aXaP7uKreXFUv2MiaINFPj7ltb+inN1fVW6rq6u51f0tVbZ7FtoFd6WMfctse3cdW1Se6971frKrbuuUdq9l/7BkEGAuoqirJe5N8sLX22NbaU5O8IsnWGWx7393dxgpuSPKrs95oVW1N8q4kv9xae3xr7clJzkty4Bo2d1CSsZ32GvfNiUlm8sa4c2BVHdXV869nsL1fmbzKQxyUZfbRBjkxs93HsNv007vai/rpNya5prX2uNbaY5N8IckbZrTtmZvz7xLMjT52V3tDH9tae1prbVuSX0/yjtbatu5y7e5ue9b0rfMlwFhMz05yT2vtT5YaWmufbq0tfTt/QFW9u6r+vqre2nXyS9/gP7pbPqGqPtgtv7Kq/qyq/jbJn3XX31RVH6yqa6rqp2dU9/uSPLGqHj96Q1W9pKqu6JLoV69yu2ckObu19rGlhtbau1trt1TVIVX1F1V1eVV9fCn1XuE5virJY7tE93e61PgjVXVuks909/2LLunfWVWnDz2Hk6rqU1X16aq6oEuE/48k/2e3vX+zyuc1zjuTvKhbfkmStw09/v5V9afdfrysqp7dtf9YVb2nqs6rqs9X1W937a9K8oiutrd2m9mnqv5X99z+uqoeMaaG0X1U3c8ru8d+0Zj7LO3zs7v9eV1V/WBV/XZ3n/Oqar9uved29V/RvUYP79qvrarf6PbxFVX1hBX28XdW1d91r63RGGwE/fSu9vh+uqoel+SpSX5zqPm/JTmhqh7b1fnBZV73p1bVh7qaP1BVh41s+8Cq+sJQP/mopevdts/r7vuRqnpCt8731eAby8uq6m9q8AHnIb9La32+sMH0sbva4/vYUVW1T9cPVg1GjdxXVd/Z3fbhqjpuuec+sp0PV9W2oesfraonVdUju/1zcdePntzdfnS3Pz7VXZ7ZtT9kPzEnrTWXBbsk+ekkZy5z24lJ7kxyZAYB1ceSPKu77dokj+6WT8ggtU6SVya5NMkjhq7/XZKHJ3l0kjuS7Dfmsd6RZMeYy0vHrPtjSV6b5KUZdLBJcmWSo5McnuSLSbYk2TfJhUlOGbONVyb5sTHt70ly8jL744+S/Ndu+TlJdqz0HLt6rhzZn/+c5JihtkO6n4/onsOhXe3XL603tM4rk/zCMrU9e5n993fLrH9tkscv3Z7ksiTHL9Wb5OeTvKlbfkK3T/fv9v01STZ3169LclS33j8Nbf/oJPcm2dZdf2eS/zCmjtF99ENJzk+yTwbffHwxyWHLvH4f7fbzk5J8Pcnzutvem+SUrr7rk3xz1/6WJD879Px/qlv+ySRvGLePk7w5g28hHtbtn6s3+m/WZe+7RD892r7H99NJvj/Je8e0v7e7bezr3j2nv0uypVv/Ren68pHt/OnSPk9yepLf65YvSHJct/y0JBd2ywcnqW75J4bWf2WGfpdcXBbxEn3saPse38eO7sdu+bwkT0zyvUk+mcHolocn+cJKz31ke6cmeU23/M1JLumW/+9074MzGJXyuSSPTPIvkuzftR83tP5D9pPLfC6Gt+yZLm6t3ZAM5kjIoCP66IT7nNta+/+Grv9Va+0bSb5RVbdm8MH0huE7tNbGftM+wZ8n+dWqOmao7dsz+AdyW1fzW5N8Z5K/WMP2Rz0rgw/Yaa1dWFWHVtWjutvGPcdxLm6tfWHo+k9X1Q90y0dl0HltSfLhpfVaa/84qbDW2kVJtq3y+dyR5MtV9eIkV2UQAix5VgYddVprf19V12XQESfJBa21O5Okqj6T5Jsy+Ccz6guttR3d8qUZ/O5M8qwkb2ut3Zfklqr6UAav6blj1v1/W2v3VNUVGQQe53XtV3SP9fiuhs917Wdn8K3Ca7rr7xmq7QdXqOkvWmv3J/nM0reO0DP66Qftaf30Ssa97l9J8i1Jzu++JN4nyc1j7vuGJL+UwT7/8ST/qaoOyGBo9ru6+yaDN+/J4MPbO2owmmNTBoezLBn9XYI9jT72QXtyH/uRDPbTMUl+K8l/SvKhDMKMZJnn3lr76tA23pXk/6qqX0zyHzP4IixJvifJ91fVL3TX90/ymCQ3JXltN2rjvjz4Xjt56H5iDgQYi2lnkpWGxX9jaPm+PPg635sHDxvaf+Q+/zzlNh5QVe/I4APnqN9vrb1lXGGttXur6veS/PL40tdkZwbDds9Z5f0mPsfOA/umqk5M8l1JntFa+3o39HB0X06lBod4nDnmpq+31lY6VvAdSV6XQQI9rWmf6+h6j6jBnBt/2bX9SR4MHVZUVWdk8I8kSZ4/vP3W2v1VdU9rg8g6yf0r1DSuvpWew/B6SVLLrgXzo5/e1d7QT38mybaqelgXoKaqHpbBm/PPZBAojHs+lWRna+0ZK9XSWvvbbujyiUn2aa1d2X0I+UobHBc+6o8yeJ3P7e7zyqHbRn+XYNHoY3e1N/Sx43w4yX/JYATLryf5xQxGQnxkhfvsonsO5yc5OckLM9iPyaBv/qHW2mdHan5lklsyGE38sCR3Dd2sb10H5sBYTBcmefjIMWffNsVxZdfmwT/KH9rdIlprL2oPTqAzfBnbYQ95cwYd35bu+sVJ/m1VPbqq9slgbocPraKU1yY5taqettRQg/kVtmbQgf1I13ZikttHUtdRX8vKEx5tTvLlrrN7QpKnd+0fz2DehWO6xzpk0vZaaxcts/8mddjvTfLbST4w0j78XL85g5T4s1nZPdUdU72c1tr1Q7X9yZjn9JEkL+qORdySQRJ+cWvtdUP3u2lCHUs+m+ToGhxLniQ/msm/C5NeM9gI+uld7fH9dGvt6gwO7fu1oeZfS/Kp7rblfDbJlqp6RlfXflX1xGXWfUsG397+afeYX03yhar64e6+VVVPGtoPN3bLp67w+LCI9LG72uP72GVcnMEotPtba3dlcPjJf84g2Eimf+5vSPKHST7ZWvty1/aBJD9V9cD8KU8eev43d0H1j2Ywao51JMBYQN231j+Q5LtqcOqonRkMm/rShLv+RpI/qKpLMkhZN0Rr7e4MOol/2V2/OcnLk1yU5NNJLm2tTZ0gt9ZuSfLiJL9bg1NHXZXk32XQYb4yyVOr6vIMJiVa8U1ca+2OJH9bgwmUfmfMKucl2bd7jFdl0FmnG/J3epL3VNWnMxglkQxGLvxAzXDiotba11prr+7247A/TvKw7vCMd2RwjOQ3HrqFXZyV5PJ6cBLPaR5/dB+9N8nlGbx2Fyb5pdbapN/F5bZ9VwZDo9/VPY/7Mxj1sZKZ72PYXfrph2xvb+mnT0vyzd1r/g8ZDC0+bcLzuTuDb5Jf3dW1I8vP2P/WDOa2eNtQ248kOa27784MvkVMBvv1XVV1aZLb1/RsoKf0sQ/Z3t7Sx47W+o0MDon+eNf0kQzCkiu666/MFM+9tXZpkq+mC4c7v5nBnCCXd79fSxM0/3EGYdGnM5hzzqiLdbY0uRP0Xjdk69rW2ps3uBQAxtBPz1cNzqp0cmvtRze6FmD96WPno6oOT/LBJE9YOgSQ/jIHBgBAz1X
"text/plain": [
"<Figure size 1080x720 with 6 Axes>"
]
},
"metadata": {
"needs_background": "light"
},
"output_type": "display_data"
}
],
"source": [
"plt.figure(figsize=(10,3),dpi=200)\n",
"sns.displot(data=df,x='tenure',bins=70,col='Contract',row='Churn');"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"<img src=\"fig6.png\">"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"**TASK: Display a scatter plot of Total Charges versus Monthly Charges, and color hue by Churn.**"
]
},
{
"cell_type": "code",
"execution_count": 158,
"metadata": {},
"outputs": [],
"source": [
"#CODE HERE"
]
},
{
"cell_type": "code",
"execution_count": 159,
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"Index(['customerID', 'gender', 'SeniorCitizen', 'Partner', 'Dependents',\n",
" 'tenure', 'PhoneService', 'MultipleLines', 'InternetService',\n",
" 'OnlineSecurity', 'OnlineBackup', 'DeviceProtection', 'TechSupport',\n",
" 'StreamingTV', 'StreamingMovies', 'Contract', 'PaperlessBilling',\n",
" 'PaymentMethod', 'MonthlyCharges', 'TotalCharges', 'Churn'],\n",
" dtype='object')"
]
},
"execution_count": 159,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"df.columns"
]
},
{
"cell_type": "code",
"execution_count": 160,
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"<AxesSubplot:xlabel='MonthlyCharges', ylabel='TotalCharges'>"
]
},
"execution_count": 160,
"metadata": {},
"output_type": "execute_result"
},
{
"data": {
"image/png": "iVBORw0KGgoAAAANSUhEUgAABrcAAALaCAYAAACbLghxAAAAOXRFWHRTb2Z0d2FyZQBNYXRwbG90bGliIHZlcnNpb24zLjMuMiwgaHR0cHM6Ly9tYXRwbG90bGliLm9yZy8vihELAAAACXBIWXMAAB7CAAAewgFu0HU+AAEAAElEQVR4nOz9d5Rl113nf7/3CTfHqlu5qnNUqyW1smzJwgHbOGDAmJmfMeCBMTwebH5ODMMw2PMMDwsGMAM/DwwOP7AJA5ixDbbliKOiFVqh1TlWDvfWzeHck/bzxy2Vq1udJLfckvr7WquWz73nnH32OXVVXut++vvdSmuNEEIIIYQQQgghhBBCCCGEEC8ExuWegBBCCCGEEEIIIYQQQgghhBAXS8ItIYQQQgghhBBCCCGEEEII8YIh4ZYQQgghhBBCCCGEEEIIIYR4wZBwSwghhBBCCCGEEEIIIYQQQrxgSLglhBBCCCGEEEIIIYQQQgghXjAk3BJCCCGEEEIIIYQQQgghhBAvGBJuCSGEEEIIIYQQQgghhBBCiBcMCbeEEEIIIYQQQgghhBBCCCHEC4aEW0IIIYQQQgghhBBCCCGEEOIFQ8ItIYQQQgghhBBCCCGEEEII8YIh4ZYQQgghhBBCCCGEEEIIIYR4wZBwSwghhBBCCCGEEEIIIYQQQrxgSLglhBBCCCGEEEIIIYQQQgghXjAk3BJCCCGEEEIIIYQQQgghhBAvGBJuCSGEEEIIIYQQQgghhBBCiBcMCbeEEEIIIYQQQgghhBBCCCHEC4Z1uScgXjyUUlFg98rLIhBcxukIIYQQQgghhBBCCCGEEOLyMoGBle19WuvupRhUwi1xKe0GHrrckxBCCCGEEEIIIYQQQgghxPPOTcDDl2IgaUsohBBCCCGEEEIIIYQQQgghXjCkcktcSsWnNh588EFGRkYu51yEEEIIIYQQQgghhBBCCHEZzc/Pc/PNNz/1sni+Y58JCbfEpbS6xtbIyAjj4+OXcy5CCCGEEEIIIYQQQgghhHj+CC58yMWRtoRCCCGEEEIIIYQQQgghhBDiBUPCLSGEEEIIIYQQQgghhBBCCPGCIeGWEEIIIYQQQgghhBBCCCGEeMGQcEsIIYQQQgghhBBCCCGEEEK8YEi4JYQQQgghhBBCCCGEEEIIIV4wJNwSQgghhBBCCCGEEEIIIYQQLxgSbgkhhBBCCCGEEEIIIYQQQogXDAm3hBBCCCGEEEIIIYQQQgghxAuGhFtCCCGEEEIIIYQQQgghhBDiBUPCLSGEEEIIIYQQQgghhBBCCPGCYV3uCQhxpjAMaTab1Ot1XNclCILLPSXxPGWaJolEglwuRywWu9zTEUIIIYQQQgghhBBCCPFDIOGWeF5pNBrMzs6itb7cUxEvAL7v0+12qVQqZLNZRkZGUEpd7mkJIYQQQgghhBBCCCGEeA5JuCWeN84WbCmlME3zMs5KPJ/5vr+6XavViEQiFAqFyzgjIYQQQgghhBBCCCGEEM81CbfE80IYhqcFW6lUir6+PhKJhFTiiHMKgoBqtcrS0hIAxWKRTCZDJBK5zDMTQgghhBBCCCGEEEII8VwxLvcEhABoNpunBVvj4+Mkk0kJtsR5maZJf38//f39q+81m83LOCMhhBBCCCGEEEIIIYQQzzUJt8TzQr1eX93u6+uTUEs8I5lMZnW71WpdxpkIIYQQQgghhBBCCCGEeK5JW0LxvOC6LtBbYyuRSFzm2YgXmmg0ilIKrfXqZ0kIIYQQQgghhBBCCCGeqbDbJnQaEPqgDJQdx0zmLve0xBkk3BLPC0EQAL02c1K1JZ4ppRSmaeL7PmEYXu7pCCGEEEIIIYQQQgghXgC01lTdDh3fxXcdlNsh1qkSq8ziV+cg8DCiKSLDW7EL6zEzQyjz6bFKw3UoOy28sPc9d8S06IslSNmxH/YtXTEk3BJCCCGEEEIIIYQQQgghxBXDCwOKnQbTjQqn6iWq1Xm65VmU55BLZNnSP8ro4FayzRLe/CG80iRmukB03bXE1l+PEY0DUOw0mGlWOV4rUuw0cAMfgKhpM5hIszkzwHgqR388dTlv90VJwi0hhBBCCCGEEEIIIYQQQlwRHN/jyeU59i3PUnZaBI0lvOIpQrcDQLWxzOTSKQazg+wZ287m9dcRTj1O0CjSOXw3BB6xzbcw2Wnx4NIp5ppVQvRp1/D9LifrXSYby4yn8tw8tIH16f7LcbsvWhJuCSGEEEIIIYQQQgghhBDiRc8NfB4vzfDQ0im6gU/oNPCWZ1aDradoHbJYXeBup0m4+Xq2j+3Cn34C7XfpnHiQjh3niJ1iplk57/VCrZlqlPHDAEMZTKTyz+XtXVGMyz0BIYQQQgghhBBCCCGEEEKI59rJ+jJ7i1N0V9oHBu0qodM45/Etp8mDp/axGM9hpnqVV91Og8WTD5MI/Yu+7lyrxmNL07S97g92A2KVhFtCCCGEEEIIIYQQQgghhHjRCcOQpXaDY9UlphplHitNMd+q4YUB2usStM5feQVQbVU42ayicqMANL0u83NHyDh1+qLJi57LbKvCYufcQZp4ZiTcEuI5curUKZRSKKX45Cc/ebmnI4QQQgghhBBCCCGEEFcEPwyYapS5Z/4YX53az3fnjnKgPM93Z45yrLbEiVqJotPEj2cwookLjje1PEstkUfbUardNu12BV2ZJReNX/ScWr7Lyfoyfhj+ILcmVsiaW0JcgOu6fOYzn+HLX/4yDz74IMVikXq9TjabZf369dx88828+c1v5hWveAWGIXmxEEIIIYQQQgghhBBCPJf8MGDZaVFymji+h4EiYUcoxFKkIlH2l+d5dGmK6spaWuOpPDW3s1o5VQ5alN0OOaUZTQ0QpUTYbZ3zeku1JWrmdaT6N6A8j1QsjRG4WM/w++Cldp221yXzDEIxcXYSbglxHp/97Gd5//vfz6lTp562b3l5meXlZfbu3ctf/MVfsG3bNv74j/+Y17/+9T/8iQohhBBCCCGEEEIIIcSLXBCGTDcrHK8VmW1W0MBAPEXEtCg5TdqeC0ox36xiKROFQqOxlEHjjPWuQjTFWpFQayZS/diBh/bd0y9oGBixNGE0RSOSpDy4hU40jRGGWLlhhhNZljpNqt32Rc3f1yG+lsqtS0HCLSHO4Xd+53f44Ac/uPr6R3/0R/nxH/9xrrrqKnK5HOVymcOHD/OFL3yBr3/96xw5coTf+q3fknBLCCGEEEIIIYQQQgghLjEvDDiwPM/DS6dAKdal8nhhwGSjzIlaES8MSNsxip0m69J97OgbZjSVZaZVRaMxlMILA4KVcMnSmlAplutF4laE0UQWXS+uXs+IJvBTBWpaU3aaLDUrHJw7wmOzB0AZbN1yC4PDmtFkluF4hqO1pdWxz8VAYSj1XD6mK4aEW0KcxV/91V+tBluDg4N8+tOf5s4773zaca961av41V/9VZ588kne+973UiwWn3aMEEIIIYQQQgghhBBCiGcv1CGHK4s8sHiCmGkzmsxyoLLAE8UZaiutB/PRBOVumyOVRR4tTXPf/DHuGNvKnsIEbd/FVIp6t4MT+gCkTRtDGWjfZblVpT83SMQw0WGAEU3STRWYc5os10tELRtDa1y/i6kMAtNmxnX43sl95KMJXja2lZ35YQ5WFs4bcKXsKFHT/qE8sxc7WSBIiDPMzs7yrne9C4BkMsl3vvOdswZba1199dV89atf5QMf+MAPY4pCCCGEEEIIIYQQQgjxguMGPovtOjPNCjPNCovtOm7gX/C8stPmidIMpjIYTWbZW5zintmjq8GWoRQR02K500SvnFNyWnxz+jDfmTuKF4YkrAjrMv2EWhNqjaNDiMRxw4BSq0rV9yGWRFkR3FQ/M50GxdoSoQ7Z0jdGRAdUm2Xipk0s1UfHsFfv56uT+6m7DltyA+e8B0sZbM4OELck3LoUJNwS4gz/43/8D9rtXo/U//bf/hs7duy4qPMMw+Btb3vbeY/5+te/zhvf+EaGh4eJRqNs3LiRd77znczMzJzznLe//e0opdiwYcN5x/7kJz+JUgql1FnXCNuwYQNKKd7+9rcD8Mgjj/D
"text/plain": [
"<Figure size 2000x800 with 1 Axes>"
]
},
"metadata": {
"needs_background": "light"
},
"output_type": "display_data"
}
],
"source": [
"plt.figure(figsize=(10,4),dpi=200)\n",
"sns.scatterplot(data=df,x='MonthlyCharges',y='TotalCharges',hue='Churn', linewidth=0.5,alpha=0.5,palette='Dark2')"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"<img src='fig7.png'>"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"### Creating Cohorts based on Tenure\n",
"\n",
"**Let's begin by treating each unique tenure length, 1 month, 2 month, 3 month...N months as its own cohort.**\n",
"\n",
"**TASK: Treating each unique tenure group as a cohort, calculate the Churn rate (percentage that had Yes Churn) per cohort. For example, the cohort that has had a tenure of 1 month should have a Churn rate of 61.99%. You should have cohorts 1-72 months with a general trend of the longer the tenure of the cohort, the less of a churn rate. This makes sense as you are less likely to stop service the longer you've had it.**"
]
},
{
"cell_type": "code",
"execution_count": 161,
"metadata": {},
"outputs": [],
"source": [
"#CODE HERE"
]
},
{
"cell_type": "code",
"execution_count": 162,
"metadata": {},
"outputs": [],
"source": [
"no_churn = df.groupby(['Churn','tenure']).count().transpose()['No']\n",
"yes_churn = df.groupby(['Churn','tenure']).count().transpose()['Yes']"
]
},
{
"cell_type": "code",
"execution_count": 163,
"metadata": {},
"outputs": [],
"source": [
"churn_rate = 100 * yes_churn / (no_churn+yes_churn)"
]
},
{
"cell_type": "code",
"execution_count": 164,
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"tenure\n",
"1 61.990212\n",
"2 51.680672\n",
"3 47.000000\n",
"4 47.159091\n",
"5 48.120301\n",
" ... \n",
"68 9.000000\n",
"69 8.421053\n",
"70 9.243697\n",
"71 3.529412\n",
"72 1.657459\n",
"Name: customerID, Length: 72, dtype: float64"
]
},
"execution_count": 164,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"churn_rate.transpose()['customerID']"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"**TASK: Now that you have Churn Rate per tenure group 1-72 months, create a plot showing churn rate per months of tenure.**"
]
},
{
"cell_type": "code",
"execution_count": 165,
"metadata": {},
"outputs": [],
"source": [
"#CODE HERE"
]
},
{
"cell_type": "code",
"execution_count": 166,
"metadata": {},
"outputs": [
{
"data": {
"image/png": "iVBORw0KGgoAAAANSUhEUgAABpQAAALaCAYAAAA/RLCiAAAAOXRFWHRTb2Z0d2FyZQBNYXRwbG90bGliIHZlcnNpb24zLjMuMiwgaHR0cHM6Ly9tYXRwbG90bGliLm9yZy8vihELAAAACXBIWXMAAB7CAAAewgFu0HU+AAEAAElEQVR4nOzdd3hcxdn+8XvUre5uyZIsF1wx7gabYkwgQOgESIBQQgmkkF8KKaT3kPCGlzcJodcQegkJNRBcwAX33i3LlmzZlmyr9935/aG1vGfVpd2zKt/Pde21O8/OOeexY0zw7Zkx1loBAAAAAAAAAAAALYkIdwMAAAAAAAAAAADo3giUAAAAAAAAAAAA0CoCJQAAAAAAAAAAALSKQAkAAAAAAAAAAACtIlACAAAAAAAAAABAqwiUAAAAAAAAAAAA0CoCJQAAAAAAAAAAALSKQAkAAAAAAAAAAACtIlACAAAAAAAAAABAqwiUAAAAAAAAAAAA0CoCJQAAAAAAAAAAALSKQAkAAAAAAAAAAACtIlACAAAAAAAAAABAqwiUAAAAAAAAAAAA0CoCJQAAAAAAAAAAALSKQAkAAAAAAAAAAACtigp3A+gaY0yspMm+YaEkTxjbAQAAAAAAAAAA4RUpabDv80ZrbU0wbkqg1PNNlrQy3E0AAAAAAAAAAIBuZ5akVcG4EVveAQAAAAAAAAAAoFWsUOr5Co9/WLFihdLS0sLZCwAAAAAAAAAACKOCggLNnj37+LCwtbkdQaDU8zWemZSWlqaMjIxw9gIAAAAAAAAAALoPT9tT2oct7wAAAAAAAAAAANAqAiUAAAAAAAAAAAC0ikAJAAAAAAAAAAAArSJQAgAAAAAAAAAAQKsIlAAAAAAAAAAAANAqAiUAAAAAAAAAAAC0ikAJAAAAAAAAAAAArSJQAgAAAAAAAAAAQKsIlAAAAAAAAAAAANAqAiUAAAAAAAAAAAC0ikAJAAAAAAAAAAAArSJQAgAAAAAAAAAAQKsIlAAAAAAAAAAAANAqAiUAAAAAAAAAAAC0ikAJAAAAAAAAAAAArSJQAgAAAAAAAAAAQKsIlAAAAAAAAAAAANAqAiUAAAAAAAAAAAC0ikAJAAAAAAAAAAAArSJQAgAAAAAAAAAAQKsIlAAAAAAAAAAAANAqAiUAAAAAAAAAAAC0ikAJvVZVrUerco/q8Y9zdKi0OtztAAAAAAAAAADQY0WFuwEgFG544lMt3X1EHq+VJA1JjtOlU9LD3BUAAAAAAAAAAD0TK5TQK0VFmMYwSZLW5xWHrxkAAAAAAAAAAHo4AiX0SlMyUx1jAiUAAAAAAAAAADqPQAm9UmCgtOlAieo83vA0AwAAAAAAAABAD0eghF5pSkaqY1xd59WOQ2XhaQYAAAAAAAAAgB6OQAm90oCEGGUNiHfU1ueVhKkbAAAAAAAAAAB6NgIl9FpTOUcJAAAAAAAAAICgIFBCrxV4jtI6AiUAAAAAAAAAADqFQAm91tTMFMd4x+EyldfUh6kbAAAAAAAAAAB6LgIl9FqT0lMUGWEax9ZKm/ZzjhIAAAAAAAAAAB1FoIReKy46UuOHJTlqnKMEAAAAAAAAAEDHESj5GGOyjDG/NMasMsYUGmOqjTF5xpiPjTG/Msac3Mb1Fxpj3jDG5BtjanzvbxhjLnTrx4CmAs9RWp9fHJY+AAAAAAAAAADoyaLC3UB3YIy5S9LvJSUEfJXhe50hKVnSt5q5NkLSo5JuDfhquO91uTHmcUl3WGu9we0cbZmakarnP93XOF6fx5Z3AAAAAAAAAAB0VJ8PlIwxP5H0a99wh6THJK2UVCJpoKRpkq6Q1FIY9FudCJPWSvqjpN2SRkv6vu/62yQVSvpR8H8EaM3UrFTHeH9xlQ6XVWtIUlx4GgIAAAAAAAAAoAfq04GSMeYzOhEmPSvpNmttXcC0/0r6H2NMTDPXj5V0t2+4StJZ1toq33ilMeZfkhZJminpe8aYJ621u4L940DLRg9OVEJMpCpqPY219XklOm8igRIAAAAAAAAAAO3VZ89Q8m1V95BvuF7Src2ESY2stbXNlL+lE6HcXX5h0vFrKiXd5RtGSfp2V3pGx0VGGE3OSHHU1ucVh6cZAAAAAAAAAAB6qD4bKEn6rKSTfJ//YK2t78jFxhgj6TLfcJu1dnlz83z17b7hZb7r4KIpmamO8fr84rD0AQAAAAAAAABAT9WXA6Wrfe9W0lvHi8aYAcaYk4wxA9q4fqSkdN/nRW3MPf79cEnZHewTXTQ1I9UxXp9XLK/XhqcZAAAAAAAAAAB6oL4cKJ3me8+11pYZY64zxmyUdETSDklHjDHbjTF3G2Nim7l+ot/nbW08y//7CR1p0hiT0dpL0rCO3K8vClyhVFpdr9wjFeFpBgAAAAAAAACAHiiq7Sm9j+/8pPG+YZEx5v8kfbOZqWMl3SfpCmPMRdbaYr/vMvw+57fxyDy/z5kdbDev7SloTVpKnAYnxaqwrKaxtj6/WKMGJ4axKwAAAAAAAAAAeo6+ukIpRSd+7JPVECYVSPqSpAGS4iXNk3T8XKS5kp4MuEeS3+fyNp7nvxyGFMNlxhhNDTxHKa8kPM0AAAAAAAAAANAD9ckVSpIS/D7HSaqUNN9au92vvtgYc46kZZKmqGGV0qnW2k/9rjuuto3n1fh97tfBXtta0TRM0soO3rPPmZqZqg+2HGocr80rDl8zAAAAAAAAAAD0MH01UKoOGD8eECZJkqy1VcaYH0t6y1f6gqTjgZL/PWLaeJ7/GUxVHWnUWtvqdnrGmI7crs+akpHqGG89UKqaeo9ioyLD0xAAAAAAAAAAAD1IX93yrixg/J9W5v5XUr3v86wW7tHWNnb+K6La2h4PITA5I8UxrvV4ta0g8JcBAAAAAAAAAABoTp8MlKy1NZIK/Up5rcytllTkGw72+8p/5VBGG4/037auxWchdFL6RWvU4ARHbX1+cXiaAQAAAAAAAACgh+mTgZLPZr/Pbe17dvz7er/aFr/P49u43v/7rW3MRYhMDdj2bh3nKAEAAAAAAAAA0C59OVBa7Pd5VEuTjDHJkgb5hvv9vtoj6YDv87w2nnWW3/W57W8RwTQ1K9UxXk+gBAAAAAAAAABAu/TlQOk1v89XtDLvCknG9/nj40VrrZX0pm843hhzWnMX++rHVyi96bsOYTAlYIXS7sIKlVbXhacZAAAAAAAAAAB6kD4bKFlrN0h61ze81hjzmcA5xphhkn7jG9ZKeipgygOSPL7PfzHG9Au4vp+kv/iG9b75CJPxaUmKiXT+kt+QVxKmbgAAAAAAAAAA6Dn6bKDk8y1JxWr4eXjLGPN7Y8yZxpiZxpivSVopKcM396fWWv8t72St3SHpPt9wpqQlxpgv+K7/gqQlvrok3Wet3RnaHw5aExsVqQnpyY7a+vzi8DQDAAAAAAAAAEAPEhXuBsLJWrvDGHOJpFclDZX0Q9/LMU3Sb621f2zhNj+WNETSLZKmSXqxmTlPSPpJUJpGl0zNSHGcnbSOc5QAAAAAAAAAAGhTX1+hJGvtJ5ImSfqlpPWSSiVVS9qjhi3uZlhrf9rK9V5r7a2SLlLDmUoH1LA93gHf+HPW2tustd6Q/kDQLlMyUx3jdXnF4lgrAAAAAAAAAABa16dXKB1nrT0i6Re+V2fv8Y6kd4LUEkIkMFAqLKvRwdJqpaX0a/4CAAAAAAAAAADACiX0LSMHJig5zpmjrmfbOwAAAAAAAAAAWkWghD4lIsI0s+1dSXiaAQAAAAAAAACghyBQQp8zJSPVMV6Xdyw8jQAAAAAAAAAA0EMQKKHPCVyhtDG/RB6vDU8zAAAAAAAAAAD0AARK6HOmZKQ4xhW1Hu0uLA9TNwAAAAAAAAAAdH8ESuhzhiTHKT0lzlFbl1ccnmYAAAAAAAAAAOgBCJTQJwVue7eeQAkAAAAAAAAAgBYRKKFPmhoYKOUXh6UPAAAAAAAAAAB6AgIl9EmBK5S2FZSpus4TnmYAAAAAAAAAAOjmCJTQJ00enqIIc2Jc77XafKAkfA0BAAAAAAAAANCNESihT0qIjdJJQ5IctXV5BEoAAAAAAAA
"text/plain": [
"<Figure size 2000x800 with 1 Axes>"
]
},
"metadata": {
"needs_background": "light"
},
"output_type": "display_data"
}
],
"source": [
"plt.figure(figsize=(10,4),dpi=200)\n",
"churn_rate.iloc[0].plot()\n",
"plt.ylabel('Churn Percentage');"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"<img src='fig9.png'>"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"### Broader Cohort Groups\n",
"**TASK: Based on the tenure column values, create a new column called Tenure Cohort that creates 4 separate categories:**\n",
" * '0-12 Months'\n",
" * '24-48 Months'\n",
" * '12-24 Months'\n",
" * 'Over 48 Months' "
]
},
{
"cell_type": "code",
"execution_count": 167,
"metadata": {},
"outputs": [],
"source": [
"# CODE HERE"
]
},
{
"cell_type": "code",
"execution_count": 168,
"metadata": {},
"outputs": [],
"source": [
"def cohort(tenure):\n",
" if tenure < 13:\n",
" return '0-12 Months'\n",
" elif tenure < 25:\n",
" return '12-24 Months'\n",
" elif tenure < 49:\n",
" return '24-48 Months'\n",
" else:\n",
" return \"Over 48 Months\""
]
},
{
"cell_type": "code",
"execution_count": 169,
"metadata": {},
"outputs": [],
"source": [
"df['Tenure Cohort'] = df['tenure'].apply(cohort)"
]
},
{
"cell_type": "code",
"execution_count": 170,
"metadata": {},
"outputs": [
{
"data": {
"text/html": [
"<div>\n",
"<style scoped>\n",
" .dataframe tbody tr th:only-of-type {\n",
" vertical-align: middle;\n",
" }\n",
"\n",
" .dataframe tbody tr th {\n",
" vertical-align: top;\n",
" }\n",
"\n",
" .dataframe thead th {\n",
" text-align: right;\n",
" }\n",
"</style>\n",
"<table border=\"1\" class=\"dataframe\">\n",
" <thead>\n",
" <tr style=\"text-align: right;\">\n",
" <th></th>\n",
" <th>tenure</th>\n",
" <th>Tenure Cohort</th>\n",
" </tr>\n",
" </thead>\n",
" <tbody>\n",
" <tr>\n",
" <th>0</th>\n",
" <td>1</td>\n",
" <td>0-12 Months</td>\n",
" </tr>\n",
" <tr>\n",
" <th>1</th>\n",
" <td>34</td>\n",
" <td>24-48 Months</td>\n",
" </tr>\n",
" <tr>\n",
" <th>2</th>\n",
" <td>2</td>\n",
" <td>0-12 Months</td>\n",
" </tr>\n",
" <tr>\n",
" <th>3</th>\n",
" <td>45</td>\n",
" <td>24-48 Months</td>\n",
" </tr>\n",
" <tr>\n",
" <th>4</th>\n",
" <td>2</td>\n",
" <td>0-12 Months</td>\n",
" </tr>\n",
" <tr>\n",
" <th>5</th>\n",
" <td>8</td>\n",
" <td>0-12 Months</td>\n",
" </tr>\n",
" <tr>\n",
" <th>6</th>\n",
" <td>22</td>\n",
" <td>12-24 Months</td>\n",
" </tr>\n",
" <tr>\n",
" <th>7</th>\n",
" <td>10</td>\n",
" <td>0-12 Months</td>\n",
" </tr>\n",
" <tr>\n",
" <th>8</th>\n",
" <td>28</td>\n",
" <td>24-48 Months</td>\n",
" </tr>\n",
" <tr>\n",
" <th>9</th>\n",
" <td>62</td>\n",
" <td>Over 48 Months</td>\n",
" </tr>\n",
" </tbody>\n",
"</table>\n",
"</div>"
],
"text/plain": [
" tenure Tenure Cohort\n",
"0 1 0-12 Months\n",
"1 34 24-48 Months\n",
"2 2 0-12 Months\n",
"3 45 24-48 Months\n",
"4 2 0-12 Months\n",
"5 8 0-12 Months\n",
"6 22 12-24 Months\n",
"7 10 0-12 Months\n",
"8 28 24-48 Months\n",
"9 62 Over 48 Months"
]
},
"execution_count": 170,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"df.head(10)[['tenure','Tenure Cohort']]"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"**TASK: Create a scatterplot of Total Charges versus Monthly Charts,colored by Tenure Cohort defined in the previous task.**"
]
},
{
"cell_type": "code",
"execution_count": 171,
"metadata": {},
"outputs": [],
"source": [
"#CODE HERE"
]
},
{
"cell_type": "code",
"execution_count": 172,
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"<AxesSubplot:xlabel='MonthlyCharges', ylabel='TotalCharges'>"
]
},
"execution_count": 172,
"metadata": {},
"output_type": "execute_result"
},
{
"data": {
"image/png": "iVBORw0KGgoAAAANSUhEUgAABrcAAALaCAYAAACbLghxAAAAOXRFWHRTb2Z0d2FyZQBNYXRwbG90bGliIHZlcnNpb24zLjMuMiwgaHR0cHM6Ly9tYXRwbG90bGliLm9yZy8vihELAAAACXBIWXMAAB7CAAAewgFu0HU+AAEAAElEQVR4nOzdd5ydZZ3//9d1l1On90nvBZIABqRJUWygKCq2lW5l1VVXVxd1Rdf1u/4W17WwugorKLqurmJFBKS3GDoJ6ZlJMr3PmdPvcl2/P+7JZJJMCiQYYj7Px2PMmbnv+zrXKQk+zns+n48yxiCEEEIIIYQQQgghhBBCCCHE0cA60hsQQgghhBBCCCGEEEIIIYQQ4mBJuCWEEEIIIYQQQgghhBBCCCGOGhJuCSGEEEIIIYQQQgghhBBCiKOGhFtCCCGEEEIIIYQQQgghhBDiqCHhlhBCCCGEEEIIIYQQQgghhDhqSLglhBBCCCGEEEIIIYQQQgghjhoSbgkhhBBCCCGEEEIIIYQQQoijhoRbQgghhBBCCCGEEEIIIYQQ4qgh4ZYQQgghhBBCCCGEEEIIIYQ4aki4JYQQQgghhBBCCCGEEEIIIY4aEm4JIYQQQgghhBBCCCGEEEKIo4aEW0IIIYQQQgghhBBCCCGEEOKoIeGWEEIIIYQQQgghhBBCCCGEOGpIuCWEEEIIIYQQQgghhBBCCCGOGhJuCSGEEEIIIYQQQgghhBBCiKOGhFtCCCGEEEIIIYQQQgghhBDiqOEc6Q2Ivx5KqTiwfPzbASA8gtsRQgghhBBCCCGEEEIIIcSRZQON47fXGGPKh2NRCbfE4bQceOxIb0IIIYQQQgghhBBCCCGEEC85pwCPH46FpC2hEEIIIYQQQgghhBBCCCGEOGpI5ZY4nAZ23li9ejWtra1Hci9CCCGEEEIIIYQQQgghhDiCenp6ePnLX77z24H9nft8SLglDqeJGVutra3MmDHjSO5FCCGEEEIIIYQQQgghhBAvHeGBTzk40pZQCCGEEEIIIYQQQgghhBBCHDUk3BJCCCGEEEIIIYQQQgghhBBHDQm3hBBCCCGEEEIIIYQQQgghxFFDwi0hhBBCCCGEEEIIIYQQQghx1JBwSwghhBBCCCGEEEIIIYQQQhw1JNwSQgghhBBCCCGEEEIIIYQQRw0Jt4QQQgghhBBCCCGEEEIIIcRRQ8ItIYQQQgghhBBCCCGEEEIIcdSQcEsIIYQQQgghhBBCCCGEEEIcNSTcEkIIIYQQQgghhBBCCCGEEEcN50hvQIg9aa3J5XKMjY3heR5hGB7pLQkhxGFn2zapVIqamhoSicSR3o4QQgghhBBCCCGEEEcNCbfES0o2m6WrqwtjzJHeihBCvKiCIKBcLjMyMkJ1dTWtra0opY70toQQQgghhBBCCCGEeMmTcEu8ZEwVbCmlsG37CO5KCCFeHEEQTNzOZDLEYjEaGhqO4I6EEEIIIYQQQgghhDg6SLglXhK01rsFWxUVFdTV1ZFKpaSSQQjxVykMQ0ZHR+nv7wdgYGCAqqoqYrHYEd6ZEEIIIYQQQgghhBAvbdaR3oAQALlcbrdga8aMGaTTaQm2hBB/tWzbpr6+nvr6+omf5XK5I7gjIYQQQgghhBBCCCGODhJuiZeEsbGxidt1dXUSagkhjhlVVVUTt/P5/BHciRBCCCGEEEIIIYQQRwdpSyheEjzPA6IZW6lU6gjvRggh/nLi8ThKKYwxE/8WCiGEEEIIIYQQQogjQ5d8TM6DUIOlUHEHqypxpLcl9iDhlnhJCMMQiNp0SdWWEOJYopTCtm2CIEBrfaS3I4QQQgghhBBCCHHMMMagMyUo+ehiAF6AKfrokRLBQBbla1QqhjOnFntaFVZ9GuXs3RBP58rokSL40efcuDZWXQorLbPVXywSbgkhhBBCCCGEEEIIIYQQ4phhghA9mCfoGiPcNkzQnSHszYIXYtWncRc24jRWoEsBYfswYXcGqzaFs7gR9/hmrIQLQDiUJ+zKEGwbQQ/mMV4Ubqm4g9WQxplbhz2tCrtOupUdbhJuCSGEEEIIIYQQQgghhBDimKBLAcH6Pvx1feiRAnq4SNiZwZT86HhfjmBDP/aMamKnzMRZ1EiweQA9XMB7ohNCjXtCK7o3h/dEZxSKabPbfZjAI8x7hB2j2NOqiK2cgTOz5gg82r9eEm4JIYQQQgghhBBCCCGEEOKvnvFC/Od68Z7shHKIyXuEPWMTwdYEbQh3jFLKlEictxBnXj3B5kHwQrxne8BW6EyJsHts/3eoDWFnhnKgUZbCnl794j24Y8zezSGFEEIIIYQQQgghhBBCCCH+ygQ7RvCf7oZy1D5Qj5UxeW+f55tMifKDbWAprJpk9LOxEt4TXSilDvp+dW8W79kedNE/8MnioEi4JYQQQgghhBBCCCGEEEKIvzpGa8LBHEHbEEHnKP6zPYR9WUwQYrwAPVY64Bp6IE/YmcFqTEdr5j38dX0YX2PVJg96L2HPGLo/94Ifi9idhFtCCHGU2bZtG0oplFLcfPPNR3o7QgghhBBCCCGEEEK8pJhAE3SM4j26g9LdWyg9so1gwwDlh7cRtA0Rtg+jhwqopItKugdcL9gyiIrZELPQo0XMcAE9kENVJw5+TwWfYPsIJtCH8tDEOJm5JcRLxLZt25g7d+4hr2OMOfBJ4pBorfn973/PbbfdxiOPPEJfXx8jIyNUVlYyffp0TjnlFN70pjfxhje8Adc98H8chRBCCCGEEEIIIYQQB88EGj1cQA8XonlZlkIlY1j1SVQ6TrCxH++ZHkwmqsyyp1WhsyX0QFQ5pb0ipi+LijtYjRWgovBpX8KuDCbQ2K1VUSgWszGBxrKfX/2QHshjCh6q6uBDMTE1CbeEEOJ5uP/++/noRz/KmjVr9jo2MjLCyMgIa9eu5aabbmLatGl85Stf4YorrvjLb/Qlamcv4muvvZYvfvGLR3YzQgghhBBCCCGEEOKoYkJN2JUh2DZC2J0BY7Dq06i4A0MFTMEDpQh7sihHYRRgAMfC5PaYraUNYdcYRhvs5sqoosoLdz/HVqhULKrwshVWfRozrx5Cgz2zGpWOoQbzEyHawezfhFKccDhIuCXES8T06dOnDEx2Wr58OQAnn3wyN910019qW2KSm266iQ9+8IP4fvRbHKeddhpvfetbOfHEE6mvryeTybB161b+8Ic/cNttt9Hd3c3HP/5xCbeEEEIIIYQQQgghhDhEJgjxNwzgPdUFgDOjGuNrwo5Rgm3DGD/EqowTDuZxZtTgLmrEbqki7B4DY8CKKr4Ix9sCagCD7sliJV1URRwzXJi4P5V0UVVxTDkg7M0S9uXwN/bj/XkHKEX8FXNxFjXitFRimisItg7BgYIrS0Vf4pBJuCXES4TruixbtuyA56XT6YM6Txxed999N+973/vQWpNOp7n55pu5+OKL9zrvla98Je973/vYtm0b//iP/8gf//jHI7BbIYQQQgghhBBCCCH+ehhtCDYP4j3WgUo42C2V+JsG8Nf0YsaiqilVmyQcLRJsGSR4tofy6h3Ez5iNu7wVU/BRlkJnS1AOwRhUKgYGTNknHMjhzGvA2BaEGpWKwi7dlyPszaLidnRuKQDLQtkKPZSn9MchrJok8TPn4CxqJNg0sN+Ay0q50VrikEm4JYQQB1AoFLjkkkvQWmNZFrfddhvnnHPOfq+ZM2cO//u//8uPfvSjv9AuhRBCCCGEEEIIIYR4adNegBktQTDe/s+1saqTqNj+Ax89UsB7rg9shd1ahfd0N/5TnVHLQYhmbrk2Yc/YxM/McIHy/W2YsTLO8hZU0sWZURMFUIDxQ0g4mFyZsDeH3VyFSroYL0BVxAn7sujuMQCcRS2gQPfnUAkHqzIe3YkXovtzlP60mcRrFuHMrSPYMjT1g3AsnHn1WAn3UJ5CMe75TTsTQrykPfnkk3zoQx9i8eLFVFRUkE6nWbx4MVdffTWbNm3a53U333wzSimUUmzbtg2tNd///vc544wzqK2tJZ1Os2LFCr7yla9QKBT2uc6cOXNQSh2wDd8VV1yBUoo5c+bsdWzbtm0Te7n
"text/plain": [
"<Figure size 2000x800 with 1 Axes>"
]
},
"metadata": {
"needs_background": "light"
},
"output_type": "display_data"
}
],
"source": [
"plt.figure(figsize=(10,4),dpi=200)\n",
"sns.scatterplot(data=df,x='MonthlyCharges',y='TotalCharges',hue='Tenure Cohort', linewidth=0.5,alpha=0.5,palette='Dark2')"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"<img src='fig10.png'>"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"**TASK: Create a count plot showing the churn count per cohort.**"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": [
"#CODE HERE"
]
},
{
"cell_type": "code",
"execution_count": 295,
"metadata": {},
"outputs": [
{
"data": {
"image/png": "iVBORw0KGgoAAAANSUhEUgAABrcAAALaCAYAAACbLghxAAAAOXRFWHRTb2Z0d2FyZQBNYXRwbG90bGliIHZlcnNpb24zLjMuMiwgaHR0cHM6Ly9tYXRwbG90bGliLm9yZy8vihELAAAACXBIWXMAAB7CAAAewgFu0HU+AACL9klEQVR4nOzdedReVXk34N8dQgIEkFmQoKiIjG2RwVpUsFqHihOlDnUAZbDWWaQWx4q1rVCt1mpBQIFqnUUF8XNElMGCKBZBQFQUUGRSZohJ9vfHexIeXt8pyTud5LrWOuvZ5zn73Od+QlxZq7/ufaq1FgAAAAAAAOiDOTPdAAAAAAAAAEyUcAsAAAAAAIDeEG4BAAAAAADQG8ItAAAAAAAAekO4BQAAAAAAQG8ItwAAAAAAAOgN4RYAAAAAAAC9IdwCAAAAAACgN4RbAAAAAAAA9IZwCwAAAAAAgN4QbgEAAAAAANAbwi0AAAAAAAB6Q7gFAAAAAABAbwi3AAAAAAAA6A3hFgAAAAAAAL0h3AIAAAAAAKA35s50A6w+qmp+kl270xuSLJnBdgAAAAAAgJm1VpLNu/HFrbV7JqOocIvJtGuSC2a6CQAAAAAAYNbZM8n3JqOQbQkBAAAAAADoDSu3mEw3LBucf/752WqrrWayFwAAAAAAYAb9+te/zl577bXs9Iax5q4I4RaTafk7trbaaqssXLhwJnsBAAAAAABmjyXjT5kY2xICAAAAAADQG8ItAAAAAAAAekO4BQAAAAAAQG8ItwAAAAAAAOgN4RYAAAAAAAC9IdwCAAAAAACgN4RbAAAAAAAA9IZwCwAAAAAAgN4QbgEAAAAAANAbwi0AAAAAAAB6Y+5MNwAAAAAAAEyvu+++O7/73e9y5513ZsmSJTPdDrPUWmutlXnz5mXDDTfM+uuvnzlzZseaKeEWAAAAAACsIVpr+fWvf51bbrllpluhBxYvXpx77rknt912W6oqW2+9dTbYYIOZbku4BQAAAAAAa4qbbrrpD4KtuXNFBYxsyZIlaa0lGQpGr7322lkRcPkbCwAAAAAAa4BFixblhhtuWH6+xRZbZKONNspaa601g10xm7XWcuedd+bmm2/O7bffvjzg2n777Wd0i8LZsTkiAAAAAAAwpW6//fbl40033TSbbrqpYIsxVVUWLFiQhQsXZv31108yFHgN/l2aCcItAAAAAABYA9xxxx3LxxtuuOEMdkLfVFU22WST5ee33nrrDHYj3AIAAAAAgDXCokWLkgwFFfPnz5/hbuib9dZbL1WV5N6/SzNFuAUAAAAAAGuApUuXJknWWmut5SEFTFRVLd/GcsmSJTPai3ALAAAAAACA3hBuAQAAAAAA0BvCLQAAAAAAAHpDuAUAAAAAAEBvCLcAAAAAAADoDeEWAAAAAADQO1dddVWqKlWVk046aabbYRoJtwAAAAAAgBmzaNGifPzjH8+LXvSi7LDDDtl0002z9tprZ7PNNsvuu++el73sZfn617+epUuXznSrzBJzZ7oBAAAAAABgzfS5z30uhx9+eK666qo/uHbTTTflpptuyve///0ce+yx2X777fOe97wnT33qU6e/UWYV4RYAAAAAsEba/YhTZroF1hAXHvOimW5hVnrHO96Rt771rcvP/+Iv/iJPf/rTs9NOO2WjjTbKzTffnMsvvzynnXZavva1r+WKK67Im970JuEWwi0AAAAAAGB6feQjH1kebG2xxRb51Kc+lX322ecP5j3hCU/Iy1/+8vzoRz/Ka1/72txwww3T3SqzkHALAAAAAACYNtdee21e8YpXJEkWLFiQs846KzvssMOY9+yyyy75yle+kv/5n/+ZjhaZ5ebMdAMAAAAAAMCa49///d9z5513JkmOOuqocYOtZebMmZMXvOAFY8752te+lqc97WnZcsstM3/+/Dz4wQ/Oy172slxzzTWj3nPQQQelqrLtttuOWfukk05KVaWqRnxH2LbbbpuqykEHHZQkufDCC3PQQQflwQ9+cObPn5+qGnXu5ZdfnkMPPTTbbrtt5s+fn/vf//551rOele9+97tj9rSmsnILAAAAAACYFq21nHzyyUmGVm0deuihk1b7yCOPzL/+67/e57urrroqxx57bD772c/mrLPOyo477jhpzxvLsccem1e+8pVZvHjxuHNPPfXUvOAFL1ge+CXJ9ddfn89//vM57bTT8rGPfSzPec5zprLd3hFuAQAAAAAA0+KSSy7JjTfemCR5zGMekw022GBS6h5//PE599xzs88+++SlL31ptt9++/zud7/LKaecklNOOSU33HBDXvKSl+S8886blOeN5YILLshHP/rRbLPNNnn961+fPfbYI4sXL853vvOdP5h78cUX55Of/GS22mqrHH744dljjz3SWstXvvKV/Ou//mvuvvvuHHbYYfnzP//zbL755lPee18ItwAAAAAAgGnxwx/+cPl49913n7S65557bg499NAcd9xx99n+7/GPf3zmzZuXE044Id/97nfzgx/8ILvtttukPXckl156aXbdddd8+9vfzkYbbbT8+7333vsP5n7/+9/P7rvvnm9+85vZcMMNl3//p3/6p9luu+3yghe8ILfeems++tGP5rWvfe2U9t0n3rkFAAAAAABMi5tuumn5eIsttpi0ultttVXe//733yfYWub1r3/98vFIq6emwgc+8IH7BFtj+fCHP3yfYGuZv/mbv8kDHvCAJNPXd18ItwAAAAAAgGlx2223LR8vWLBg0uoecMABmT9//ojXHv7wh2f99ddPkvzsZz+btGeOZptttsljHvOYCc3ddddd80d/9EcjXquq5avMpqPvPhFuAQAAAAAA02LwHVt33HHHpNXdYYcdxry+8cYbJ7lvuDZVRgurRjJe35tsskmS6em7T4RbAAAAAADAtNh0002Xj3/zm99MWt311ltvzOtz5gzFIUuWLJm0Z45mWZA2EbOp7z4RbgEAAAAAANPij//4j5ePv//9789gJ1NnrbXWmukWVnvCLQAAAAAAYFrsvPPO2WyzzZIk3/nOd3LrrbfOcEf3ro5aunTpmPMmcxtFVs0aG25V1R5V9daq+mpVXVNV91TV7VV1RVV9pKoevYL1nlJVpw7UuqY7f8oK1JhbVX9bVd+pqhuq6q6q+mlVHVdVO69Anc2q6qiq+r+qurU7/q/7btPxKwAAAAAAwOSrqhx44IFJhsKiE044YYY7uvc9YL/73e/GnHfFFVdMQzdMxBoZblXVt5NckOTtSf4iydZJ5iVZkORhSQ5K8p2qOrmq5o1Ta05VnZDkjCTPHKi1dXd+RlUdX1Vj/llX1WZJzk3yX0kenWSzJOskeUiSw5JcWFWHTOC3PTLJxUnekmTXJBt0x67ddxdX1V7j1QEAAAAAgKnw2te+dvm7pt761rfmsssum9B9S5cuzcc+9rFJ7+fBD35wkuS2227L5ZdfPuKcRYsW5bOf/eykP5uVs0aGW0ke0H3+Ksn7khyQZK8kj0ryuiTXdtdflOSkcWq9M8nB3fgHSZ7X1Xped54khyT5p9EKVNVaSU5Nsmf31eeSPCXJI5O8Ksn1SeYnOW6slWBVtU2S05JsmWRxkqOTPLY7ju6+2yrJaVW1cJzfBQAAAAAAk27rrbfOf/7nfyYZWr21zz775KyzzhrznksvvTRPfvKTc8wxx0x6P/vss8/y8bvf/e4R57zuda/LtddeO+I1pt/cmW5ghlyW5I1JPttaWzLs2ner6r+TnJNk+yTPq6pjW2vfHl6kqrZP8vru9HtJHttau6s7v6CqvpjkrCR7JDmiqj7cWrtyhH4OzNBqrST5YGvt5QPXzq+qLye5MMmGSf6jqnZsrS0eoc47k2zejf+mtfbpgWvfqaoLk3wyyRYZCtsOGqEGAAAAAABMqRe/+MW55ppr8ta3vjXXX3999t133zzxiU/MM57xjOy4447ZaKONcvPNN+eKK67Il770pfy///f/smTJkvzxH//xpPey22675VGPelTOO++8HH/88Vm0aFEOPPDA3O9+98tPfvKTfOhDH8o3v/nN/Nmf/VnOPffcSX8+K26
"text/plain": [
"<Figure size 2000x800 with 1 Axes>"
]
},
"metadata": {
"needs_background": "light"
},
"output_type": "display_data"
}
],
"source": [
"plt.figure(figsize=(10,4),dpi=200)\n",
"sns.countplot(data=df,x='Tenure Cohort',hue='Churn')"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"<img src='cplot.png'>"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"**TASK: Create a grid of Count Plots showing counts per Tenure Cohort, separated out by contract type and colored by the Churn hue.**"
]
},
{
"cell_type": "code",
"execution_count": 174,
"metadata": {},
"outputs": [],
"source": [
"#CODE HERE"
]
},
{
"cell_type": "code",
"execution_count": 175,
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"<seaborn.axisgrid.FacetGrid at 0x2d1e3caafd0>"
]
},
"execution_count": 175,
"metadata": {},
"output_type": "execute_result"
},
{
"data": {
"text/plain": [
"<Figure size 2000x800 with 0 Axes>"
]
},
"metadata": {},
"output_type": "display_data"
},
{
"data": {
"image/png": "iVBORw0KGgoAAAANSUhEUgAABHYAAAFuCAYAAADkjJ/RAAAAOXRFWHRTb2Z0d2FyZQBNYXRwbG90bGliIHZlcnNpb24zLjMuMiwgaHR0cHM6Ly9tYXRwbG90bGliLm9yZy8vihELAAAACXBIWXMAAAsTAAALEwEAmpwYAAAyJElEQVR4nO3debglVXn3/e9PWkVERaWDjGmiGB9EBek4oygOaBJBRQVRBk2Iz6s4oxgTQHxMNMSoOAaVSYmAOICGgIRBMcrUCDRD0BYRISiIOKARAe/3j1oHNocz9el9zu7q8/1c1766atWqVavW2efep+9aVTtVhSRJkiRJkvrnXqPugCRJkiRJkmbHxI4kSZIkSVJPmdiRJEmSJEnqKRM7kiRJkiRJPWViR5IkSZIkqadM7EiSJEmSJPWUiR3NmSQPS3Jskh8kWZbk5CSPnGVbb0qyzpD6tSTJK4bU1pFJfpvkAQNlH0pSSdafZZt7JdloYP3qmbQ1zDFaVePHuJ3TR0fZJ0mTWyDxOkn+Lsn3k3wvyZlJHj2MtiVpJtb0WJvkY0kuSnJ5kv9tyxcl2WUY/ZQ0ORM7mhNJAnwZOKuqHl5V2wLvBDaYZZNvAib88Eqy1kq2tQQYyn8UmhXATq0v9wKeBVy3Cu3tBWw0XaUJvIlJxmgEljDcMZY0RxZQvH4d8BTgcVX1SOAfgZOSrD2k9ocqyaJR90HS8CyEWFtVr6uqrYEXAD+oqq3b64RVbXsuzGKcpNWWiR3NlWcCt1XVJ8cKquriqjq7XTU9JMmlSZYneTlAku2TnJXkhCT/neSYVvcNdImOM5Oc2erekuQDSS4GnpzkgCTntzYPax+eJHlEkv9McnGSC5M8HHgfsF27gvDmIZzrscDL2/L2wH8Bt49tTPKW1q9Lk7yplS1JckWSTyW5LMnXk9yvXdFYChzT+ne/1sy+rf/LkzxqfAcmGaPdWv1Lk7x/oo63Mf9GkhOTXJXkfUl2T3Je2/fhA/09I8klSU5PslkrPzLJoUm+3fYfuyIz0RhvlOSUdrX8n2Y31JLmwEKJ1+8AXl9Vv23n+HXg28DuA/18bzv+OUk2aOWLk3yx9fn8JE8d33CSbybZemD9W0kel+T+SQ5vMfW7ScYuAixJcnY7zwuTPGVgXM9OchJw+Sqer6TVy0KJtXeT5N+TPLYtfzfJAW354CR/Pdm5j2vj4LS/odv6e5O8sS3v187zkiTvHqjzlXSzoi5Lss9A+d3GaZjnKo1UVfnyNfQX8Abgg5NsewlwGrAW3VWKa4AN6ZIivwQ2oUs6fgd4WtvnamD9gTYKeNnA+kMGlj8L/GVbPhd4UVtem+7KxvbA1ybp2wOAiyZ5bTlB/SOBXYBzgAcDnwKeMdZfYFtgOXB/YF3gMmAbuisjtwNbt3aOB17Zls8Clg4c42pg37b8/wGfnqTvd44R3Yf9NcBiYBFwBrDzBPtsD/yijf996WYavbtteyPwobb8VWDPtvxq4CsD5/+F9vPaElgx0O7XBo6zF3AV8KD2c/gRsOmo36e+fPlaGPEaeCDw8wnaeCPwLwP9HOvLPwF/15b/beDcNgOumKCdPQfi5SOBC9ryP3BXbF8P+B7d58E6wNqtfIuB+tsDvwE2H/X7wpcvX8N9LYRYO7DPEuDStrw/3YzJBwHnA6e28jOBP53s3Cdo78K2fC/gB8BDgecChwFp5V8Dnj54/sD9gEuBh040Tr58rSkvp/lqFJ4GfL6q7gB+muQbwJ8BvwLOq6prAZJcRBfIvzVBG3cAXxxYf2aSt9N9OD0EuCzJWcDGVfVlgKr6XWt30o5V1a+BrWdxTl8CdgWeCPzNQPnTgC9X1W/asb8EbAecBPywqi5q9ZbRnetU7Y/Ve/EM+vNndFN9b2zHPQZ4OvCVCeqeX1XXt3o/AL7eypfTXV2C7orG2HE/S/efnjFfqao/AJePXeGexOlV9ct2nMuBPwZ+PINzkTQ6a2K8nszv6f5TAF2sfU5bfjaw5UBfHphk3aq6ZWDfLwB/n2Q/uuT3ka38ucALk7ytra9Nlxz6H+CjbZbPHXTJoDHnVdUPh3VSknphTY61Z9MltX4I/DvwnHTPBtq8qq5M8lomPveTBvpwdZKbkmxDl/z5blXdlOS5dHH2u63qunTJ8m8Cb0jyola+aSu/iXuOk7RGMLGjuXIZ3UyWlXXrwPIdTP4e/V37ACDd8xE+TjfL5cdJDqL743mlpXsI8tmTbH5FVU02Nf44uv8IHFVVf5jqA3LA+HO932QVB+reOSZJTqX7cLugqv5qJgdM8kTgX9vqAXR/MAz24w8D639gZjFicP+pTnymP1tJ82uNj9dV9askv0nyJ1V11UC9bYFvtOXbqqomOJ97AU8a+w/QRKrqt0lOo3ve2stau9DFxJdU1ZXj+n4Q8FPgca39wbZ/M9lxJPXaGh9rJ3E+3WMGrqKbmbM+8Nd0fzevjE/TzQB/GHD4WPeAf6yqfx2smGR7uqT8k1t8Pou7zv/OcZLWJD5jR3PlDOC+4+5pfWyS7eg+HF6eZK0ki+lmkpw3TXu/ppsKOpGxQP2zJOvSPjTbFYZrk+zcjn/fdoVg0raq6td114Pexr8m/eCqqh8B76L7EB10NrBzknWS3B94EZN/OM7kXAeP+bzWr7GkzuB+5wHPSLJ+ugfD7QZ8o6rOHTifkyZqdxLfppuRBN3zKIZyDpJWCwslXh8CHJr27LIkz6a7Sv5v05zP14F9x1Yy8CydcT4NHEo3C/LmVnYq3TPSxp5tsU0rfxBwfZvt+Cq6WxAkrdkWSqwdv//v6WZov5TuVrKzgbfRzaphJc79y8COdLN5Tm1lpwKvbudIko2T/BFdjL25JXUeBTxpun5KfWdiR3OiXfV8EfDsdF/peBndN5D8hC4wXwJcTPch9/aq+sk0TR4GnJL2gLhxx/oF3bNtLqUL8OcPbH4V3VTMS+iSEw9rx74j3UPjhvaAuKr616r6wbiyC+mm5J9Hd0/zp6vquxPsPuhI4JO5+8OTZ+LOMWq3Vu1Pd//yxcCyqjpxJdoab19g7zaOr6J7LsVU5mSMJQ3fAorXH2nHW57kSuDvgZ2q6n+n2e8NwNJ0D+a8HHjtRJWqahndLMgjBorfA9wbuKSN63ta+ceBPdvDOx+Fs3SkNd4CirUTORu4ocXbs+meGTR2kXBG594SRGcCx4/NuKnuIfj/BnwnyXLgBLoE1SnAoiRX0D0Y+pw5OCdptZK7Zh1LkiRpNpJsRPfw+0e1mTiSpCFJci/gQuClVfX9UfdHWt04Y0eSJGkVJNmDblbmu0zqSNJwJdkSWEH3RRwmdaQJOGNHkiRJkiSpp5yxI0mSJEmS1FMmdiRJkiRJknpq0ag7MBd23HHHOuWUU0bdDUnqg8xmJ+OsJM2YcVaS5tas4uyaZI2csfOzn/1s1F2QpDWacVaS5pZxVpI0U2tkYkeSJEmSJGkhMLEjSZIkSZLUUyZ2JEmSJEmSesrEjiRJkiRJUk+Z2JEkSZIkSeopEzuSJEmSJEk9ZWJHkiRJkiSpp0zsSJIkSZIk9ZSJHUmSJEmSpJ4ysSNJkiRJktRTJnYkSZIkSZJ6ysSOJEmSJElST5nYkSRJkiRJ6qlFo+6AJEmSJEm6y7b7HT3qLgCw7JA9Rt0FzYAzdiRJkiRJknrKxI4kSZIkSVJPmdiRJEmSJEnqKRM7kiRJkiRJPWViR5IkSZIkqadM7EiSJEmSJPXUnCV2khye5IYklw6UHZLkv5NckuTLSdYb2PbOJCuSXJnkeQPlO7ayFUn2n6v+SpIkSZIk9c1cztg5EthxXNlpwFZV9Vjge8A7AZJsCewKPLrt8/EkayVZC/gY8HxgS2C3VleSJEmSJGnBm7PETlV9E/j5uLKvV9XtbfUcYJO2vBNwbFXdWlU/BFYAT2ivFVV1VVX9Hji21ZUkSZIkSVrwRvmMnVcD/9GWNwZ+PLDt2lY2Wfk9JNknyQVJLrjxxhvnoLuStLAZZyVpbhlnJUmzMZLETpJ3AbcDxwyrzao6rKqWVtXSxYsXD6tZSVJjnJWkuWWclSTNxqL5PmCSvYC/AHa
"text/plain": [
"<Figure size 1132x360 with 3 Axes>"
]
},
"metadata": {
"needs_background": "light"
},
"output_type": "display_data"
}
],
"source": [
"plt.figure(figsize=(10,4),dpi=200)\n",
"sns.catplot(data=df,x='Tenure Cohort',hue='Churn',col='Contract',kind='count')"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"<img src='fig11.png'>"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"-----\n",
"\n",
"# Part 4: Predictive Modeling\n",
"\n",
"**Let's explore 4 different tree based methods: A Single Decision Tree, Random Forest, AdaBoost, Gradient Boosting. Feel free to add any other supervised learning models to your comparisons!**\n",
"\n",
"\n",
"## Single Decision Tree"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"**TASK : Separate out the data into X features and Y label. Create dummy variables where necessary and note which features are not useful and should be dropped.**"
]
},
{
"cell_type": "code",
"execution_count": 178,
"metadata": {},
"outputs": [],
"source": [
"#CODE HERE"
]
},
{
"cell_type": "code",
"execution_count": 181,
"metadata": {},
"outputs": [],
"source": [
"X = df.drop(['Churn','customerID'],axis=1)\n",
"X = pd.get_dummies(X,drop_first=True)"
]
},
{
"cell_type": "code",
"execution_count": 182,
"metadata": {},
"outputs": [],
"source": [
"y = df['Churn']"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"**TASK: Perform a train test split, holding out 10% of the data for testing. We'll use a random_state of 101 in the solutions notebook/video.**"
]
},
{
"cell_type": "code",
"execution_count": 183,
"metadata": {},
"outputs": [],
"source": [
"#CODE HERE"
]
},
{
"cell_type": "code",
"execution_count": 184,
"metadata": {},
"outputs": [],
"source": [
"from sklearn.model_selection import train_test_split"
]
},
{
"cell_type": "code",
"execution_count": 185,
"metadata": {},
"outputs": [],
"source": [
"X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.1, random_state=101)"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"**TASK: Decision Tree Perfomance. Complete the following tasks:**\n",
" 1. Train a single decision tree model (feel free to grid search for optimal hyperparameters).\n",
" 2. Evaluate performance metrics from decision tree, including classification report and plotting a confusion matrix.\n",
" 2. Calculate feature importances from the decision tree.\n",
" 4. OPTIONAL: Plot your tree, note, the tree could be huge depending on your pruning, so it may crash your notebook if you display it with plot_tree."
]
},
{
"cell_type": "code",
"execution_count": 222,
"metadata": {},
"outputs": [],
"source": [
"from sklearn.tree import DecisionTreeClassifier"
]
},
{
"cell_type": "code",
"execution_count": 223,
"metadata": {},
"outputs": [],
"source": [
"dt = DecisionTreeClassifier(max_depth=6)"
]
},
{
"cell_type": "code",
"execution_count": 224,
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"DecisionTreeClassifier(max_depth=6)"
]
},
"execution_count": 224,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"dt.fit(X_train,y_train)"
]
},
{
"cell_type": "code",
"execution_count": 225,
"metadata": {},
"outputs": [],
"source": [
"preds = dt.predict(X_test)"
]
},
{
"cell_type": "code",
"execution_count": 226,
"metadata": {},
"outputs": [],
"source": [
"from sklearn.metrics import accuracy_score,plot_confusion_matrix,classification_report"
]
},
{
"cell_type": "code",
"execution_count": 227,
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
" precision recall f1-score support\n",
"\n",
" No 0.87 0.89 0.88 557\n",
" Yes 0.55 0.49 0.52 147\n",
"\n",
" accuracy 0.81 704\n",
" macro avg 0.71 0.69 0.70 704\n",
"weighted avg 0.80 0.81 0.81 704\n",
"\n"
]
}
],
"source": [
"print(classification_report(y_test,preds))"
]
},
{
"cell_type": "code",
"execution_count": 228,
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"<sklearn.metrics._plot.confusion_matrix.ConfusionMatrixDisplay at 0x2d1e9601d90>"
]
},
"execution_count": 228,
"metadata": {},
"output_type": "execute_result"
},
{
"data": {
"image/png": "iVBORw0KGgoAAAANSUhEUgAAAUIAAAEGCAYAAAAQZJzmAAAAOXRFWHRTb2Z0d2FyZQBNYXRwbG90bGliIHZlcnNpb24zLjMuMiwgaHR0cHM6Ly9tYXRwbG90bGliLm9yZy8vihELAAAACXBIWXMAAAsTAAALEwEAmpwYAAAd4UlEQVR4nO3de5he473/8fcn5xARkbCDBKngh02oOraq0W6CitqU0tYme6NFtX5tUftCdXNVi2i24peWStCDQ1uHxvmw0ZZISLOJIhXHhByFSERm5vv7Y90TT8Yc1kyeZ555Zn1e17WuWetep3syl6/7Xvda91cRgZlZkfWodgXMzKrNgdDMCs+B0MwKz4HQzArPgdDMCq9XtSvQXkMG94ythveudjWsHV6ctV61q2Dt9B5LF0XE0I6ef+Dn1o/FS+pzHTtj1qp7I+Kgjt6rHGouEG41vDfT7h1e7WpYOxy42ehqV8Ha6YG49dV1OX/xknqm3Tsi17E9h700ZF3uVQ41FwjNrOsLoIGGalcjNwdCMyu7IFgd+brGXYEDoZlVhFuEZlZoQVBfQ5/vOhCaWUU04EBoZgUWQL0DoZkVnVuEZlZoAaz2M0IzK7Ig3DU2s4ILqK+dOOhAaGbll31ZUjscCM2sAkQ9qnYlcnMgNLOyywZLHAjNrMCy9wgdCM2s4BrcIjSzInOL0MwKLxD1NZQJxIHQzCrCXWMzK7RAfBg9q12N3Gqn7WpmNSN7obpHriUvST0lPSPprrR9vaS5kmamZXQql6SJkuZImiVpt7au7RahmVVEBQZLzgCeBwaWlH0vIm5tctxYYFRa9gSuTj9b5BahmZVdhKiPHrmWPCRtARwC/DLH4eOAKZF5AhgkaVhrJzgQmllFNKBcCzBE0vSS5aRmLncF8H0+/gnzRan7O0FS31S2OfB6yTFvpLIWuWtsZmWXDZbkDi+LImL3lnZKOhRYEBEzJO1fsusc4C2gDzAJOAu4sCP1dSA0s7JrHCwpk32BwyQdDPQDBkq6MSK+mvavkvQr4Ltp+01geMn5W6SyFrlrbGYVUR/KtbQlIs6JiC0iYivgGOChiPhq43M/SQIOB55Np9wBfD2NHu8FLIuI+a3dwy1CMyu7Tvqy5CZJQwEBM4FTUvlU4GBgDrACOKGtCzkQmllFNOQcEW6PiHgEeCStj2nhmABObc91HQjNrOyySRdq58mbA6GZlV0gVtfQJ3YOhGZWdhHkflm6K3AgNLMKWPOydE1wIDSzsgvcIjQz82CJmRVbIE/MambFlqXzrJ3wUjs1NbMa4gTvZlZwQWW+LKkUB0Izqwi3CM2s0CLkFqGZFVs2WOJP7Mys0OQXqs2s2LLBEj8jNLOCq6UvS2qnpmZWMxq/LMmz5NVMgvetJT2ZErn/TlKfVN43bc9J+7dq69oOhGZWEQ30yLW0Q2OC90aXABMiYhtgKTA+lY8HlqbyCem4VjkQmlnZRcDqhh65ljyaJnhPCZvGALemQyaTJXCCLMH75LR+K3BAOr5FfkZoZmWXdY1zt7OGSJpesj0pIiY1OeYKsgTvG6TtjYF3IqIubZcmcV+T4D0i6iQtS8cvaqkCDoRmVhHt+LKkowney8aBsJPV18PpB23LxsNW86Mpc5n5+AB+ceFmrF4tRu28kjMve42eveD9d3twyWlbsmBeH+rr4MhTFnLgMUuqXf1Cm/zkbFYu70lDA9TXidPHbsvIHVZy+o/foP/6Dbz9Rh8uOXUEK5bXzovElVLm12c+luAd+BkwSFKv1CosTeLemOD9DUm9gA2Bxa3doFOeEUoKSZeVbH9X0gWdce+u5o+/HMrwUasAaGiAn54xgnOufpVJD7/AJpt/yP03DwbgjuuHMGLbD7jmgRf46W1zmHThZqz+sHbey+quvn/UJ/jmF7bj9LHbAvDtS1/nuouHccoB2/Hnuwdy5DcWVLmGXUXWNc6ztKWFBO/HAQ8DR6bDjgduT+t3pG3S/odSis8WddZgySrgCElDOul+XdLCeb2Z9uBAxh6b/c/p3aU96d0n2OITWWDc7bPv8fjUQQBIsPL9nkTAB+/3ZINB9fTs1erf0qpgi5Gr+N8n1gfgmUc34NOHLKtyjbqOhpS3pK1lHZwFnClpDtkzwGtT+bXAxqn8TODsti7UWYGwDpgEfKfpDklbSXpI0ixJD0oa0Ul16nTXnL85//6f81D6V99wcD31deLFv/UH4PG7BrFwXm8ADjthEa+91Jdjd92Rk8dsxzcufJMeHuOvrhAX/+ZlrrznRcYel/3P7NUX+7H3Qe8C8JlDlzF0s9XVrGGXkY0a98y1tO+68UhEHJrWX46IPSJim4g4KiJWpfIP0vY2af/LbV23M//T+jlwnKQNm5T/NzA5InYGbgImNj1R0kmSpkuavnBxfSdUtfyeuH8gg4bUMWrnlWvKJDjn6le45vzNOf3gUfQfUL8m2M14ZAM+seNKfv3Mc1x1/wv8/NzNef89R8JqOvPwbTjtwG0597itOezfFrHTnsu5/MzhfPH4RVx5z4v0H1BPnR9fAJV5obqSOm2wJCLelTQF+BawsmTX3sARaf0G4CfNnDuJrEXJ7rv0q8n+4eyn1ueJ+wby1IM78OEqseK9nlxy2gjOuvI1Lv/jHCALfm+83BeA+343mC+ftgAJNt/6Q/5pxIe8Pqcf2++6opq/RqEtfitrrS9b3Js/37Mh2++6gluv2YQffOUTAGw+chV7HvBuNavYpdRSOs/ObmJcQfbW9/qdfN+qO/EH87lpxmymTJvNOVe/yi6ffo+zrnyNdxZl/y/6cJW4+apNOPRrWZdr6OarmflY9srU0oW9eOMffRk2YlXV6l90ffvX03/9+jXrn/zse7zy935suHHWFZaCY894m7tu2Lia1ewyGkeN3SJsRkQskXQzWTC8LhX/hWwk6AbgOOCxzqxTtd1y1SY8+cBAogEOOX4xoz+9HIDjvv0Wl357BCeP2Y4IGH/ufDbcuDYfC3QHGw2t4/xrXwGgZ6/g4T9sxPRHBnL4+IV88d+y93T/fPeG3PfbwVWsZddSSxOzqo1R5fLcRFoeEQPS+qbAXOAnEXGBpC2BXwFDgIXACRHxWkvX2n2XfjHt3uEVr7OVz4Gbja52FaydHohbZ7T2knNbNtp+kxhz3ZFtHwj8ft+r1+le5dApLcLGIJjW3wbWK9l+leybQTPrRrpKtzcPf1liZmXniVnNzHAgNLOCa3yPsFY4EJpZRdTSe4QOhGZWdhFQl3PS1a7AgdDMKsJdYzMrND8jNDMDwoHQzIrOgyVmVmgRtfWMsHaGdcyshoj6hh65ljavJPWTNE3S3yQ9J+mHqfx6SXMlzUzL6FQuSRNTgvdZknZr6x5uEZpZRZTxGeEqYExELJfUG3hc0t1p3/ci4tYmx48FRqVlT+Dq9LNFDoRmVnbl/NY4JV5anjZ7p6W1abPGAVPSeU9IGiRpWETMb+kEd43NrPwie06YZyEleC9ZTmp6OUk9Jc0EFgD3R8STaddFqfs7QVLfVLYmwXtSmvy9WW4RmllFtGPUuNUE7wARUQ+MljQI+IOknYBzgLeAPmSpPM4CLuxIXd0iNLOyizIOlqx13Yh3yPIZHxQR8yOzimxy5z3SYY0J3huVJn9vlgOhmVVEO7rGrZI0NLUEkdQf+ALwd0nDUpmAw4Fn0yl3AF9Po8d7Actaez4I7hqbWYWUcdR4GDBZUk+yxtvNEXFXyoc+FBAwEzglHT8VOBiYA6wATmjrBg6EZlZ2WWuvbKPGs4BdmylvNsVHGi0+tT33cCA0s4qopS9LHAjNrCI6IUFm2TgQmlnZBaLBE7OaWdHVUIPQgdDMKqCMgyWdwYHQzCqjhpqEDoRmVhHdokUo6b9pJaZHxLcqUiMzq3kBNDR0g0AITO+0WphZ9xJAd2gRRsTk0m1J60XEispXycy6g1p6j7DNF30k7S1pNvD3tL2LpKsqXjMzq22Rc+kC8rzxeAVwILAYICL+BuxXwTqZWc0TEfmWriDXqHFEvJ7NdLNGfWWqY2bdRhdp7eWRJxC+LmkfIFLilDOA5ytbLTOraQFRQ6PGebr
"text/plain": [
"<Figure size 432x288 with 2 Axes>"
]
},
"metadata": {
"needs_background": "light"
},
"output_type": "display_data"
}
],
"source": [
"plot_confusion_matrix(dt,X_test,y_test)"
]
},
{
"cell_type": "code",
"execution_count": 229,
"metadata": {},
"outputs": [],
"source": [
"imp_feats = pd.DataFrame(data=dt.feature_importances_,index=X.columns,columns=['Feature Importance']).sort_values(\"Feature Importance\")"
]
},
{
"cell_type": "code",
"execution_count": 230,
"metadata": {},
"outputs": [
{
"data": {
"image/png": "iVBORw0KGgoAAAANSUhEUgAACRoAAAYjCAYAAACSohEUAAAAOXRFWHRTb2Z0d2FyZQBNYXRwbG90bGliIHZlcnNpb24zLjMuMiwgaHR0cHM6Ly9tYXRwbG90bGliLm9yZy8vihELAAAACXBIWXMAAB7CAAAewgFu0HU+AAEAAElEQVR4nOzdebh153w38O/vyZOZmIKEIIYiRUwRgkiCUkJNL1J9a6ig5paaqu9LJ1qqFby0VZWooWhrnqfEFEkQYyIRQ8xkJAmZ7/ePtY5n5Tj7rH2mfZ7kfD7Xta619lr3uu/f2Wft/aTOt/ddrbUAAAAAAAAAAAAsZtN6FwAAAAAAAAAAAGz9BI0AAAAAAAAAAIBRgkYAAAAAAAAAAMAoQSMAAAAAAAAAAGCUoBEAAAAAAAAAADBK0AgAAAAAAAAAABglaAQAAAAAAAAAAIwSNAIAAAAAAAAAAEYJGgEAAAAAAAAAAKMEjQAAAAAAAAAAgFGCRgAAAAAAAAAAwChBIwAAAAAAAAAAYJSgEQAAAAAAAAAAMErQCAAAAAAAAAAAGCVoBAAAAAAAAAAAjBI0AgAAAAAAAAAARgkaAQAAAAAAAAAAowSNAAAAAAAAAACAUYJGAAAAAAAAAADAKEEjAAAAANjKVNVDq+o9VfXDqrqgqlq/HbnetbF8VXXgrH6XVXX4YKxHr+VYAAAAbByCRgAAAKyKqjpy8AfNpWwHrnftrL15f1xv610PbK2q86Ykb0tyvyTXSbLd+la1ekb+rTi/qn5aVd+sqk9X1Sur6tFVdf31rputV/+MLOe/P/y3CQAAwDIIGgEAAMASmCGCy7OqeuHg+X3hetfDgh7Rb3OOTXJ4kv/Xb+9Yh5pmZfsk10pykyR3SfKUJK9P8p2qel9V3Xs9iwMAAACSzetdAAAAAFdIx6X74/g0friWhQBczvzh4PgFrbW/WrdK1t78fys2JblKkqsmuUWSGwzO3zfJfavq8CRPa62dM7sy2cqdmC6Et5gHpZsdLJnuv1H8twkAAMAEgkYAAACshfe31l643kUAXA7dbnD8unWrYjYW/beiqnZLF7x6WpI9+tOPTnKLqjqgtfarNa9wlbXWjkxSMxrr0eneryu01toxSY5ZrE1V3TJbgkb+GwUAAGAFLJ0GAAAAAFuPqw2Of7xuVWwFWms/aa29NMleSd4+uHSHdMvJAQAAADMmaAQAAAAAW49fz0DeWrt0PQvZWrTWzk3y8CTvG5x+WFXdbZ1KAgAAgA1L0AgAAICtUnUeVFVHVNXJVfXzqjq/qr5fVe+sqkdV1VRLglfVXlX1p1X1P1V1UlWdU1UXVdVpVfX5qvqnqvrtkT6+W1UtyaMGp19fVW2B7YUL3dtve05R7+GD9o+etk1VXbWqnl5Vn6yqH1bVxf31qy5w/6q9v6upqo4c/FwH9ud2r6oXVNXxVXVmX+c3qurvqurqC/SxR1W9qG9/Vv/7/lJV/XlV7Tgy/p6D8b87OH+PqnpLVX2rqn7VPzufqqqnVNX2S/wZb1FVL+3rO72qLqiqH/U/+3Oq6hpT9PHoQZ2H9+e2qapDqupdVfXtvs5WVQ+ce1+TvGDQzQsmPL+HLzDejn0/r6iqT1fVT6vqwqo6t3++31FVj62q7aao/cDBWEcOzt+9qv6zr/38qjqjf5afUlXbTvHWDse4dlU9u6o+UlXf69+LX/XHH+iv7TlFP9tW1R9W1dv6us6pqvOq6jv98/CgqlrxMlg1+I6Yd/43fj+L9DHz52qlP/dStNZakkcmOWdw+vnT3FvdvwEvqqpjB8/uaVV1TFX9VVVdZ7yXy/S3TVU9rKreUN2/KWdV92/KGX2fh1X3nfEbz8ak53/CODevqpdU1ef63+mF/WfjZ1X1hap6fXXf1VebcP/ovyXz2l+pqp5WVR+qqh/0Y51VVV+rqldV1R2nfH9+43mtqptV1cur6sT+e+MXVfXlqnpxVe06Tb+zUJP/Dfrz/vn5SVVdUlVnT7h/1b8zVvv5BQAAWKmZ/w+GAAAAMKaq9k5yRJLbLHB5j357QJLnVdWDW2snLNLX25I8dMLlXfvt9kmeXlWHJfmz1tolKyh/XVTVXZK8Jcn1pmi7au/vWquqeyV5c5L5IYmbJXlOkkOq6oDW2ql9+z9K8uok88M/t+63R1TVQa2106Ycf9skr0ry+HmXdkhy1357UlU9sLV28khfm5P8Y5InJdlm3uXd++2AJM+tqj9prR0xTY1939dJ8ta+nlXVhws+muRKC1zeNsnOSW6Q5IFJ/qJ/Zo5fQv/bpXuPHzfv0vZJ9u+3x1TVvVtrp4/0tSnJX6R7NnZaoMn1+u13k7y4qm416fnuQwb/luTGC1zes98OSfK5qvpfrbUfLlbbWrm8PlfL0Vo7s7oA1FP7U79TVVdvrZ25UPvqQoCHJTk0v/nezH3/75vkz6rq2a21V43VUFX7p3subrrA5av3/e2b5GlJ/j7Jc8f6nDDOC9M9y/PrTpJr9tvtkjw6yZuS/O/ljDMY735JXptkt3mXtk9y1SS3SPLkqnpzkse11n65hL7/OMnL85vfy3v32+Oq6ndba59fXvVrp6oekOT1ueyShpPaHphV/M5Yi+cXAABgNQgaAQAAsFWpbimc9yTZpT91UZLjknyzP94z3R+9d0gXNvlsVe3XWjtxQpfX7/cXJzmh7+fsJJckuVaSOyS5bpJK8ifp/hD6pAX6OSJd2OUeSW7en/tYkm8s0PbYsZ9zld0k3R9xr5Juto9PJvlRuj+MXmZpoTV4f9fSbZK8KMmOSX6Q5DPpfr6bpgufVLqAyweq6lbpllZ6XX/vN9P9Hs5Pcqt0f4xNuj+W/0e6oMk0/j5bQkZfSfKlftzbJ5mbBWuvJB/v36fvL9RJH4D57yS/Nzh9ZpIj+/31khyUZLt0f9Q/vKqu2lo7bIoat0/y7r6mi5N8Nsm3+vO369u8I8nX0r0Pd+jPHZeFn9XPzXt9tWwJGf0sydfT/T7OSxfmuUnf7+Z0z89RVXW71topU9SeJP+abqawS5Mck+4ztSnJndI9g+l/jjckue+kTqpqmyRvT/KgwekLkxyd5Lvpnu/d0r1Pu/djLDgDU1U9NF14Y24mpV+le1++29d50yT79T/znZIcXVV3aK39dMqfeb6575ckefLg/P9b7Kat4LlaD2/PlqBRpfu+evf8RlW1c5IPJbnL4PS3knwhyVnpQkF3SXKddN8xr6yqXVprL5o0cFUdku45HM6wdXKS45P8PN336i36bVO679Elq6qn57Kzj52e7vn7cZLW137zdN89CwWRljrew9M973N9XZLk00lOSffZ3z/d+5Qkj0hyw6q6e2vt/Cn6fnSS1/QvT0ry+XSfp5une/8r3bP/7qraq7X285X+PKvozklemO73fUa6f1tPT/ffDrcdNlzt74y1eH4BAABWTWvNZrPZbDabzWaz2Wy2FW/p/rDd+u2Fy+xjtyQ/HfRzRJLdF2h37ST/M2j3lSTbTOjzxelmNNplwvVKcv90AYq5/u66SI2HD9o9esqf67uDe/acov3oGPPaXNTvX5XkSvPabZtk01q9v0v43R446K9N+Rydny4o8qS5n2HQ7oAk5w7aPi9dCOnnSR6yQL8PSxeWmGt/twnj7zloc2G/Pz3JvRZoe/9+vLn2H1zk53r28Ofvn8vtFnj+PzTv93rHCf09eoHf/5ELPV9Jth8cv3Bw31Sf0yR3TPK3SW65SJtrpQtgzPX90SmfhfP7/bFJbr7AZ/Pp8963BX9vffu/m9f2lUmuMaHtvv3zf4sFrt0iyS/7Pi5N8tIkV12g3Y2SfGow3vtX8hkZ9Dv6Odnanqsl/nxHLvUZnHf/TrnsZ/lFE9odMWhzUpIDF2izTZInDp7Di5PsN6G/26YLj8z1+cVF3sfdkvxZkmePPP9HLnB9c5LTBm2em2TbCeNcPcljFhqnv374oJ9HT2hz43TfnXPtjklyk3ltNiV5RroA0ly7V0zzDPfv7c+S/O4C7e6Wy36H/t/V+Ayt5Lmb1+6idN8BfzH/d5DLfq+u+nfGaj+/NpvNZrPZbDabzba
"text/plain": [
"<Figure size 2800x1200 with 1 Axes>"
]
},
"metadata": {
"needs_background": "light"
},
"output_type": "display_data"
}
],
"source": [
"plt.figure(figsize=(14,6),dpi=200)\n",
"sns.barplot(data=imp_feats.sort_values('Feature Importance'),x=imp_feats.sort_values('Feature Importance').index,y='Feature Importance')\n",
"plt.xticks(rotation=90)\n",
"plt.title(\"Feature Importance for Decision Tree\");"
]
},
{
"cell_type": "code",
"execution_count": 231,
"metadata": {},
"outputs": [],
"source": [
"from sklearn.tree import plot_tree"
]
},
{
"cell_type": "code",
"execution_count": 233,
"metadata": {},
"outputs": [
{
"data": {
"image/png": "iVBORw0KGgoAAAANSUhEUgAABZEAAAOnCAYAAABRRAhMAAAAOXRFWHRTb2Z0d2FyZQBNYXRwbG90bGliIHZlcnNpb24zLjMuMiwgaHR0cHM6Ly9tYXRwbG90bGliLm9yZy8vihELAAAACXBIWXMAABcSAAAXEgFnn9JSAAEAAElEQVR4nOzdd7hcVdmG8fuFNJJA6EWaSlHEQkdABBEFFJAq0otixYa9+1mwIhZEBJTeBSyAFAsdO4hUFUEQpCdAOkne74+145wAJ6Scc9aU+3ddXrpnJ2ceMJnZ887az4rMRJIkSZIkSZKkZ7NI7QCSJEmSJEmSpPblEFmSJEmSJEmS1C+HyJIkSZIkSZKkfjlEliRJkiRJkiT1yyGyJEmSJEmSJKlfDpElSZIkSZIkSf1yiCxJkiRJkiRJ6pdDZEmSJEmSJElSvxwiS5IkSZIkSZL65RBZkiRJkiRJktQvh8iSJEmSJEmSpH45RJYkSZIkSZIk9cshsiRJkiRJkiSpXw6RJUmSJEmSJEn9cogsSZIkSZIkSeqXQ2RJkiRJkiRJUr8cIkuSJEmSJEmS+uUQWZIkSZIkSZLUL4fIkiRJkiRJkqR+OUSWJEmSJEmSJPXLIbIkSZIkSZIkqV8OkSVJkiRJkiRJ/XKILEmSJEmSJEnql0NkSZIkSZIkSVK/HCJLkiRJkiRJkvrlEFmSJEmSJEmS1C+HyJIkSZIkSZKkfjlEliRJkiRJkiT1yyGyJEmSJEmSJKlfDpElSZIkSZIkSf1yiCxJkiRJkiRJ6pdDZEmSJEmSJElSvxwiS5IkSZIkSZL65RBZkiRJkiRJktQvh8iSJEmSJEmSpH45RJYkSZIkSZIk9cshsiRJkiRJkiSpXw6RJUmSJEmSJEn9cogsSZIkSZIkSeqXQ2RJkiRJkiRJUr8cIkuSJEmSJEmS+uUQWZIkSZIkSZLUL4fIkiRJkiRJkqR+OUSWJEmSJEmSJPXLIbIkSZIkSZIkqV8OkSVJkiRJkiRJ/XKILEmSJEmSJEnql0NkSZIkSZIkSVK/HCJLkiRJkiRJkvrlEFmSJEmSJEmS1C+HyJIkSZIkSZKkfjlEliRJkiRJkiT1yyGyJEmSJEmSJKlfDpElSZIkSZIkSf1yiCxJkiRJkiRJ6pdDZEmSJEmSJElSvxwiS5IkSZIkSZL65RBZkiRJkiRJktQvh8iSJEmSJEmSpH45RJYkSZIkSZIk9cshsiRJkiRJkiSpXw6RJUmSJEmSJEn9cogsSZIkSZIkSeqXQ2RJkiRJkiRJUr8cIkuSJEmSJEmS+uUQWZIkSZIkSZLUL4fIkiRJkiRJkqR+OUSWJEmSJEmSJPXLIbIkSZIkSZIkqV8OkSVJkiRJkiRJ/XKILEmSJEmSJEnql0NkSZIkSZIkSVK/HCJLkiRJkiRJkvo1rHYASZIkCSAiAhg9RE83OTNziJ5LkiRJ6mgOkSVJktQuRgMTh+i5xgKThui5JEmSpI5mnYUkSZIkSZIkqV+uRJYkSVLb+ftJH2P0qBEA/PqGf3DHvQ/zktVXYPklxzBh4lQmTZ3Ovx8cz/6v25CrbvoXr1xndcZPnMxqyy/Fz667mectM46Vlx3HH26/h01etCrPW3Yck6dOZ+2Dvlb5n0ySJEnqPA6RJUmS1HZGjxrBmGaIvPNm6/LdB65huXFjeGj8RJYZN4atX7EG9z48gbGjRpCZzJg5i5v+9V+mTZ/BSksvwe33PMTWr1iDtVZetvI/iSRJktT5wv1EJEmS1A4iYgxNJ/J/zvrM/4bIA2XS1Oms8pYvzj4cm5l2IkuSJEnzwJXIkiRJajuTp04H4OwrbmTZJcYwdrGRTJwyjSXGjOKhCRNZZonRPD5pKjNmzmKb9dbkgfFPsvryS/GjX/6eDddelZmzZvHg+Cd5aMJEdnzlS1h+ybH/+5mSJEmS5o8rkSVJktQW+q5EHgKuRJYkSZLm0SK1A0iSJEkRsQRwwBA+5RsiYvgQPp8kSZLUsVyJLEmSpCoiYhHgNcBBwO7AYs2pWcBvgNOAC4GpC/lUywB7AvsB6/V5/OHmOU7KzJsW8jkkSZKkruUQWZIkSUMqItYEDmz+s2qfU7cDJwGnZeZ9g/TcL2ued39g+T6nbmie+4zMfGQwnluSJEnqVA6RJUmSNOiauoo3U1Ydb9Hn1OPAmZQB7h9yiC5OmyqL7Zo8OwOzqy2eAi5q8lycmU8NRR5JkiSpnTlEliRJ0qCIiEVp1VXsxpx1FZdRBrU/y8yFratYKBGxDLA3JeeGfU49DJxOqbv4a4VokiRJUltwiCxJkqQBFRFrUSojDuCZdRUnUuoq7q+R7blExEtp1V2s0OfUjbTqLh4e+mSSJElSPQ6RJUmStNAiYhytuorN+5yaQKuu4o9DVVexsCJiGHPWXYxoTs1gzrqL6TXySZIkSUPJIbIkSZIWSFNXsQ2tuopRzalZwKWUQevPa9dVLKym7uItlH/OjfqceoRW3cWNQ59MkiRJGhoOkSVJkjRfImJtWnUVq/Q5dRulruL0dq2rWFgRsS6tuosV+5z6K626i4cqRJMkSZIGjUNkSZIkPaemrmIvygD16XUVZ1AGqH/qlLqKhdXUXbyesjr5TTyz7uJk4CLrLiRJktQNHCJLkiTpWTV1Fa+lDEp3Zc66iksog+NfdHpdxcKKiKVp1V1s3OfUozR1F8CNvTJglyRJUvdxiCxJkqQ5RMSLaNVVrNzn1K206ir+WyNbu4uIl9D6d9e37uImyjD5dOsuJEmS1GkcIkuSJImIWBJ4M2U17WZ9To2nVVfxZ1fTzpum7uJ1lH+fuzBn3cXFlH+f1l1IkiSpIzhEliRJ6lFNXcW2tOoqRjanZjJnXcW0Gvm6RVN3sRfl3/MmfU49SmtAf4MDekmSJLUrh8iSJEk9JiJeTKty4Xl9Tt1Cq67igRrZul1ErEPr3/1KfU79jVbdxYMVokmSJEn9cogsSZLUAyJiKcpq2AOBV/Y59Rit1bB/cTXs0GjqLmavAt+FOVeBXwycDFzoKnBJkiS1A4fIkiRJXaqpq+jby9t3UPlLyuDYQWVlfQb8BwGb9jnlgF+SJEltwSGyJElSl+lTmbA/c9ZV3EypqzjDuor2NJeqkZtp1V34/50kSZKGlENkSZKkLvAcq1lPx83bOspzbHroKnJJkiQNKYfIkiRJHarp1Z1dV/EmntmrexJwkYPGzhYRS9L6guDpfdZnUv5//rNfEEiSJGmwOESWJEnqMBHxElp1FSv1OfU3WnUVD9bIpsEVES+iVXexcp9Tt1CGyadZdyFJkqSB5hBZkiSpAzR1FW+hrEbdpM+pR2nVVdzoatTe0NRdvJZW3cWo5tRM4BLKn4dfuApdkiRJA8EhsiRJUptq6ipeT6uuYkRzagZz1lVMr5FP7aGpu3gz5c/JZn1OjadVd/Env2CQJEnSgnKILEmS1GYiYl1adRUr9jl1E626iodqZFN7a+ouDmj+s0qfU7fSqrv4b4VokiRJ6mAOkSVJktpARCxNq65i4z6nHqGpq8jMG4c+mTpRU3exDeXP02606i5mMWfdxdQa+SRJktRZHCJLkiRV0tRVbEcZ9O3MnHUVF1EGfRdbV6GFERHjaNVdbN7n1HjgLMqfsz9adyFJkqT+OESWJEkaYk1dxUHAfsxZV3EjcDLWVWiQRMTalKqLA5mz7uI2WnUX91eIJkmSpDbmEFmSJGkIRMQytOoqNupz6hHgNOBk6yo0VJq6i9dQ/jzuzpx1F5dSBso/t+5CkiRJ4BBZkiRp0DxHXcWFlEHdL62rUE1N3cWelD+nW/Q5NQE4E+suJEmSep5DZEmSpAEWES+lVVexQp9TN1AGcmdm5sNDn0yau4hYi1bdxap9Tt1Oq+7ivgrRJEmSVJFDZEmSpAHQ1FXsTRkeb9jn1MO06ir+WiGaNN8iYhHmrLtYrDk1C7iMMlD+mXUXkiRJvcEhsiRJ0gKKiOHA9pRVmzsDw5tTM4Bf0KqreKpKQGkARMQStOouXtXn1ATgLMpmkL+37kKSJKl7OUSWJEmaTxHxMlp1Fcv3OfUXykDNugp1pYhYk/K
"text/plain": [
"<Figure size 1800x1200 with 1 Axes>"
]
},
"metadata": {
"needs_background": "light"
},
"output_type": "display_data"
}
],
"source": [
"plt.figure(figsize=(12,8),dpi=150)\n",
"plot_tree(dt,filled=True,feature_names=X.columns);"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"<img src='hugetree.png'>"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## Random Forest\n",
"\n",
"**TASK: Create a Random Forest model and create a classification report and confusion matrix from its predicted results on the test set.**"
]
},
{
"cell_type": "code",
"execution_count": 259,
"metadata": {},
"outputs": [],
"source": [
"#CODE HERE"
]
},
{
"cell_type": "code",
"execution_count": 260,
"metadata": {},
"outputs": [],
"source": [
"from sklearn.ensemble import RandomForestClassifier"
]
},
{
"cell_type": "code",
"execution_count": 266,
"metadata": {},
"outputs": [],
"source": [
"rf = RandomForestClassifier(n_estimators=100)"
]
},
{
"cell_type": "code",
"execution_count": 267,
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"RandomForestClassifier()"
]
},
"execution_count": 267,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"rf.fit(X_train,y_train)"
]
},
{
"cell_type": "code",
"execution_count": 268,
"metadata": {},
"outputs": [],
"source": [
"preds = rf.predict(X_test)"
]
},
{
"cell_type": "code",
"execution_count": 269,
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
" precision recall f1-score support\n",
"\n",
" No 0.86 0.89 0.87 557\n",
" Yes 0.52 0.44 0.48 147\n",
"\n",
" accuracy 0.80 704\n",
" macro avg 0.69 0.67 0.68 704\n",
"weighted avg 0.79 0.80 0.79 704\n",
"\n"
]
}
],
"source": [
"print(classification_report(y_test,preds))"
]
},
{
"cell_type": "code",
"execution_count": 270,
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"<sklearn.metrics._plot.confusion_matrix.ConfusionMatrixDisplay at 0x2d1e6a54040>"
]
},
"execution_count": 270,
"metadata": {},
"output_type": "execute_result"
},
{
"data": {
"image/png": "iVBORw0KGgoAAAANSUhEUgAAAUIAAAEGCAYAAAAQZJzmAAAAOXRFWHRTb2Z0d2FyZQBNYXRwbG90bGliIHZlcnNpb24zLjMuMiwgaHR0cHM6Ly9tYXRwbG90bGliLm9yZy8vihELAAAACXBIWXMAAAsTAAALEwEAmpwYAAAd4UlEQVR4nO3de5he473/8fcn5xARkbCDBKngh02oOraq0W6CitqU0tYme6NFtX5tUftCdXNVi2i24peWStCDQ1uHxvmw0ZZISLOJIhXHhByFSERm5vv7Y90TT8Yc1kyeZ555Zn1e17WuWetep3syl6/7Xvda91cRgZlZkfWodgXMzKrNgdDMCs+B0MwKz4HQzArPgdDMCq9XtSvQXkMG94ythveudjWsHV6ctV61q2Dt9B5LF0XE0I6ef+Dn1o/FS+pzHTtj1qp7I+Kgjt6rHGouEG41vDfT7h1e7WpYOxy42ehqV8Ha6YG49dV1OX/xknqm3Tsi17E9h700ZF3uVQ41FwjNrOsLoIGGalcjNwdCMyu7IFgd+brGXYEDoZlVhFuEZlZoQVBfQ5/vOhCaWUU04EBoZgUWQL0DoZkVnVuEZlZoAaz2M0IzK7Ig3DU2s4ILqK+dOOhAaGbll31ZUjscCM2sAkQ9qnYlcnMgNLOyywZLHAjNrMCy9wgdCM2s4BrcIjSzInOL0MwKLxD1NZQJxIHQzCrCXWMzK7RAfBg9q12N3Gqn7WpmNSN7obpHriUvST0lPSPprrR9vaS5kmamZXQql6SJkuZImiVpt7au7RahmVVEBQZLzgCeBwaWlH0vIm5tctxYYFRa9gSuTj9b5BahmZVdhKiPHrmWPCRtARwC/DLH4eOAKZF5AhgkaVhrJzgQmllFNKBcCzBE0vSS5aRmLncF8H0+/gnzRan7O0FS31S2OfB6yTFvpLIWuWtsZmWXDZbkDi+LImL3lnZKOhRYEBEzJO1fsusc4C2gDzAJOAu4sCP1dSA0s7JrHCwpk32BwyQdDPQDBkq6MSK+mvavkvQr4Ltp+01geMn5W6SyFrlrbGYVUR/KtbQlIs6JiC0iYivgGOChiPhq43M/SQIOB55Np9wBfD2NHu8FLIuI+a3dwy1CMyu7Tvqy5CZJQwEBM4FTUvlU4GBgDrACOKGtCzkQmllFNOQcEW6PiHgEeCStj2nhmABObc91HQjNrOyySRdq58mbA6GZlV0gVtfQJ3YOhGZWdhHkflm6K3AgNLMKWPOydE1wIDSzsgvcIjQz82CJmRVbIE/MambFlqXzrJ3wUjs1NbMa4gTvZlZwQWW+LKkUB0Izqwi3CM2s0CLkFqGZFVs2WOJP7Mys0OQXqs2s2LLBEj8jNLOCq6UvS2qnpmZWMxq/LMmz5NVMgvetJT2ZErn/TlKfVN43bc9J+7dq69oOhGZWEQ30yLW0Q2OC90aXABMiYhtgKTA+lY8HlqbyCem4VjkQmlnZRcDqhh65ljyaJnhPCZvGALemQyaTJXCCLMH75LR+K3BAOr5FfkZoZmWXdY1zt7OGSJpesj0pIiY1OeYKsgTvG6TtjYF3IqIubZcmcV+T4D0i6iQtS8cvaqkCDoRmVhHt+LKkowney8aBsJPV18PpB23LxsNW86Mpc5n5+AB+ceFmrF4tRu28kjMve42eveD9d3twyWlbsmBeH+rr4MhTFnLgMUuqXf1Cm/zkbFYu70lDA9TXidPHbsvIHVZy+o/foP/6Dbz9Rh8uOXUEK5bXzovElVLm12c+luAd+BkwSFKv1CosTeLemOD9DUm9gA2Bxa3doFOeEUoKSZeVbH9X0gWdce+u5o+/HMrwUasAaGiAn54xgnOufpVJD7/AJpt/yP03DwbgjuuHMGLbD7jmgRf46W1zmHThZqz+sHbey+quvn/UJ/jmF7bj9LHbAvDtS1/nuouHccoB2/Hnuwdy5DcWVLmGXUXWNc6ztKWFBO/HAQ8DR6bDjgduT+t3pG3S/odSis8WddZgySrgCElDOul+XdLCeb2Z9uBAxh6b/c/p3aU96d0n2OITWWDc7bPv8fjUQQBIsPL9nkTAB+/3ZINB9fTs1erf0qpgi5Gr+N8n1gfgmUc34NOHLKtyjbqOhpS3pK1lHZwFnClpDtkzwGtT+bXAxqn8TODsti7UWYGwDpgEfKfpDklbSXpI0ixJD0oa0Ul16nTXnL85//6f81D6V99wcD31deLFv/UH4PG7BrFwXm8ADjthEa+91Jdjd92Rk8dsxzcufJMeHuOvrhAX/+ZlrrznRcYel/3P7NUX+7H3Qe8C8JlDlzF0s9XVrGGXkY0a98y1tO+68UhEHJrWX46IPSJim4g4KiJWpfIP0vY2af/LbV23M//T+jlwnKQNm5T/NzA5InYGbgImNj1R0kmSpkuavnBxfSdUtfyeuH8gg4bUMWrnlWvKJDjn6le45vzNOf3gUfQfUL8m2M14ZAM+seNKfv3Mc1x1/wv8/NzNef89R8JqOvPwbTjtwG0597itOezfFrHTnsu5/MzhfPH4RVx5z4v0H1BPnR9fAJV5obqSOm2wJCLelTQF+BawsmTX3sARaf0G4CfNnDuJrEXJ7rv0q8n+4eyn1ueJ+wby1IM78OEqseK9nlxy2gjOuvI1Lv/jHCALfm+83BeA+343mC+ftgAJNt/6Q/5pxIe8Pqcf2++6opq/RqEtfitrrS9b3Js/37Mh2++6gluv2YQffOUTAGw+chV7HvBuNavYpdRSOs/ObmJcQfbW9/qdfN+qO/EH87lpxmymTJvNOVe/yi6ffo+zrnyNdxZl/y/6cJW4+apNOPRrWZdr6OarmflY9srU0oW9eOMffRk2YlXV6l90ffvX03/9+jXrn/zse7zy935suHHWFZaCY894m7tu2Lia1ewyGkeN3SJsRkQskXQzWTC8LhX/hWwk6AbgOOCxzqxTtd1y1SY8+cBAogEOOX4xoz+9HIDjvv0Wl357BCeP2Y4IGH/ufDbcuDYfC3QHGw2t4/xrXwGgZ6/g4T9sxPRHBnL4+IV88d+y93T/fPeG3PfbwVWsZddSSxOzqo1R5fLcRFoeEQPS+qbAXOAnEXGBpC2BXwFDgIXACRHxWkvX2n2XfjHt3uEVr7OVz4Gbja52FaydHohbZ7T2knNbNtp+kxhz3ZFtHwj8ft+r1+le5dApLcLGIJjW3wbWK9l+leybQTPrRrpKtzcPf1liZmXniVnNzHAgNLOCa3yPsFY4EJpZRdTSe4QOhGZWdhFQl3PS1a7AgdDMKsJdYzMrND8jNDMDwoHQzIrOgyVmVmgRtfWMsHaGdcyshoj6hh65ljavJPWTNE3S3yQ9J+mHqfx6SXMlzUzL6FQuSRNTgvdZknZr6x5uEZpZRZTxGeEqYExELJfUG3hc0t1p3/ci4tYmx48FRqVlT+Dq9LNFDoRmVnbl/NY4JV5anjZ7p6W1abPGAVPSeU9IGiRpWETMb+kEd43NrPwie06YZyEleC9ZTmp6OUk9Jc0EFgD3R8STaddFqfs7QVLfVLYmwXtSmvy9WW4RmllFtGPUuNUE7wARUQ+MljQI+IOknYBzgLeAPmSpPM4CLuxIXd0iNLOyizIOlqx13Yh3yPIZHxQR8yOzimxy5z3SYY0J3huVJn9vlgOhmVVEO7rGrZI0NLUEkdQf+ALwd0nDUpmAw4Fn0yl3AF9Po8d7Actaez4I7hqbWYWUcdR4GDBZUk+yxtvNEXFXyoc+FBAwEzglHT8VOBiYA6wATmjrBg6EZlZ2WWuvbKPGs4BdmylvNsVHGi0+tT33cCA0s4qopS9LHAjNrCI6IUFm2TgQmlnZBaLBE7OaWdHVUIPQgdDMKqCMgyWdwYHQzCqjhpqEDoRmVhHdokUo6b9pJaZHxLcqUiMzq3kBNDR0g0AITO+0WphZ9xJAd2gRRsTk0m1J60XEispXycy6g1p6j7DNF30k7S1pNvD3tL2LpKsqXjMzq22Rc+kC8rzxeAVwILAYICL+BuxXwTqZWc0TEfmWriDXqHFEvJ7NdLNGfWWqY2bdRhdp7eWRJxC+LmkfIFLilDOA5ytbLTOraQFRQ6PGebr
"text/plain": [
"<Figure size 432x288 with 2 Axes>"
]
},
"metadata": {
"needs_background": "light"
},
"output_type": "display_data"
}
],
"source": [
"plot_confusion_matrix(dt,X_test,y_test)"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## Boosted Trees\n",
"\n",
"**TASK: Use AdaBoost or Gradient Boosting to create a model and report back the classification report and plot a confusion matrix for its predicted results**"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": [
"#CODE HERE"
]
},
{
"cell_type": "code",
"execution_count": 288,
"metadata": {},
"outputs": [],
"source": [
"from sklearn.ensemble import GradientBoostingClassifier,AdaBoostClassifier"
]
},
{
"cell_type": "code",
"execution_count": 289,
"metadata": {},
"outputs": [],
"source": [
"ada_model = AdaBoostClassifier()"
]
},
{
"cell_type": "code",
"execution_count": 290,
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"AdaBoostClassifier()"
]
},
"execution_count": 290,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"ada_model.fit(X_train,y_train)"
]
},
{
"cell_type": "code",
"execution_count": 291,
"metadata": {},
"outputs": [],
"source": [
"preds = ada_model.predict(X_test)"
]
},
{
"cell_type": "code",
"execution_count": 292,
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
" precision recall f1-score support\n",
"\n",
" No 0.88 0.90 0.89 557\n",
" Yes 0.60 0.54 0.57 147\n",
"\n",
" accuracy 0.83 704\n",
" macro avg 0.74 0.72 0.73 704\n",
"weighted avg 0.82 0.83 0.83 704\n",
"\n"
]
}
],
"source": [
"print(classification_report(y_test,preds))"
]
},
{
"cell_type": "code",
"execution_count": 293,
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"<sklearn.metrics._plot.confusion_matrix.ConfusionMatrixDisplay at 0x2d1e9373a30>"
]
},
"execution_count": 293,
"metadata": {},
"output_type": "execute_result"
},
{
"data": {
"image/png": "iVBORw0KGgoAAAANSUhEUgAAAUIAAAEGCAYAAAAQZJzmAAAAOXRFWHRTb2Z0d2FyZQBNYXRwbG90bGliIHZlcnNpb24zLjMuMiwgaHR0cHM6Ly9tYXRwbG90bGliLm9yZy8vihELAAAACXBIWXMAAAsTAAALEwEAmpwYAAAd4UlEQVR4nO3de5he473/8fcn5xARkbCDBKngh02oOraq0W6CitqU0tYme6NFtX5tUftCdXNVi2i24peWStCDQ1uHxvmw0ZZISLOJIhXHhByFSERm5vv7Y90TT8Yc1kyeZ555Zn1e17WuWetep3syl6/7Xvda91cRgZlZkfWodgXMzKrNgdDMCs+B0MwKz4HQzArPgdDMCq9XtSvQXkMG94ythveudjWsHV6ctV61q2Dt9B5LF0XE0I6ef+Dn1o/FS+pzHTtj1qp7I+Kgjt6rHGouEG41vDfT7h1e7WpYOxy42ehqV8Ha6YG49dV1OX/xknqm3Tsi17E9h700ZF3uVQ41FwjNrOsLoIGGalcjNwdCMyu7IFgd+brGXYEDoZlVhFuEZlZoQVBfQ5/vOhCaWUU04EBoZgUWQL0DoZkVnVuEZlZoAaz2M0IzK7Ig3DU2s4ILqK+dOOhAaGbll31ZUjscCM2sAkQ9qnYlcnMgNLOyywZLHAjNrMCy9wgdCM2s4BrcIjSzInOL0MwKLxD1NZQJxIHQzCrCXWMzK7RAfBg9q12N3Gqn7WpmNSN7obpHriUvST0lPSPprrR9vaS5kmamZXQql6SJkuZImiVpt7au7RahmVVEBQZLzgCeBwaWlH0vIm5tctxYYFRa9gSuTj9b5BahmZVdhKiPHrmWPCRtARwC/DLH4eOAKZF5AhgkaVhrJzgQmllFNKBcCzBE0vSS5aRmLncF8H0+/gnzRan7O0FS31S2OfB6yTFvpLIWuWtsZmWXDZbkDi+LImL3lnZKOhRYEBEzJO1fsusc4C2gDzAJOAu4sCP1dSA0s7JrHCwpk32BwyQdDPQDBkq6MSK+mvavkvQr4Ltp+01geMn5W6SyFrlrbGYVUR/KtbQlIs6JiC0iYivgGOChiPhq43M/SQIOB55Np9wBfD2NHu8FLIuI+a3dwy1CMyu7Tvqy5CZJQwEBM4FTUvlU4GBgDrACOKGtCzkQmllFNOQcEW6PiHgEeCStj2nhmABObc91HQjNrOyySRdq58mbA6GZlV0gVtfQJ3YOhGZWdhHkflm6K3AgNLMKWPOydE1wIDSzsgvcIjQz82CJmRVbIE/MambFlqXzrJ3wUjs1NbMa4gTvZlZwQWW+LKkUB0Izqwi3CM2s0CLkFqGZFVs2WOJP7Mys0OQXqs2s2LLBEj8jNLOCq6UvS2qnpmZWMxq/LMmz5NVMgvetJT2ZErn/TlKfVN43bc9J+7dq69oOhGZWEQ30yLW0Q2OC90aXABMiYhtgKTA+lY8HlqbyCem4VjkQmlnZRcDqhh65ljyaJnhPCZvGALemQyaTJXCCLMH75LR+K3BAOr5FfkZoZmWXdY1zt7OGSJpesj0pIiY1OeYKsgTvG6TtjYF3IqIubZcmcV+T4D0i6iQtS8cvaqkCDoRmVhHt+LKkowney8aBsJPV18PpB23LxsNW86Mpc5n5+AB+ceFmrF4tRu28kjMve42eveD9d3twyWlbsmBeH+rr4MhTFnLgMUuqXf1Cm/zkbFYu70lDA9TXidPHbsvIHVZy+o/foP/6Dbz9Rh8uOXUEK5bXzovElVLm12c+luAd+BkwSFKv1CosTeLemOD9DUm9gA2Bxa3doFOeEUoKSZeVbH9X0gWdce+u5o+/HMrwUasAaGiAn54xgnOufpVJD7/AJpt/yP03DwbgjuuHMGLbD7jmgRf46W1zmHThZqz+sHbey+quvn/UJ/jmF7bj9LHbAvDtS1/nuouHccoB2/Hnuwdy5DcWVLmGXUXWNc6ztKWFBO/HAQ8DR6bDjgduT+t3pG3S/odSis8WddZgySrgCElDOul+XdLCeb2Z9uBAxh6b/c/p3aU96d0n2OITWWDc7bPv8fjUQQBIsPL9nkTAB+/3ZINB9fTs1erf0qpgi5Gr+N8n1gfgmUc34NOHLKtyjbqOhpS3pK1lHZwFnClpDtkzwGtT+bXAxqn8TODsti7UWYGwDpgEfKfpDklbSXpI0ixJD0oa0Ul16nTXnL85//6f81D6V99wcD31deLFv/UH4PG7BrFwXm8ADjthEa+91Jdjd92Rk8dsxzcufJMeHuOvrhAX/+ZlrrznRcYel/3P7NUX+7H3Qe8C8JlDlzF0s9XVrGGXkY0a98y1tO+68UhEHJrWX46IPSJim4g4KiJWpfIP0vY2af/LbV23M//T+jlwnKQNm5T/NzA5InYGbgImNj1R0kmSpkuavnBxfSdUtfyeuH8gg4bUMWrnlWvKJDjn6le45vzNOf3gUfQfUL8m2M14ZAM+seNKfv3Mc1x1/wv8/NzNef89R8JqOvPwbTjtwG0597itOezfFrHTnsu5/MzhfPH4RVx5z4v0H1BPnR9fAJV5obqSOm2wJCLelTQF+BawsmTX3sARaf0G4CfNnDuJrEXJ7rv0q8n+4eyn1ueJ+wby1IM78OEqseK9nlxy2gjOuvI1Lv/jHCALfm+83BeA+343mC+ftgAJNt/6Q/5pxIe8Pqcf2++6opq/RqEtfitrrS9b3Js/37Mh2++6gluv2YQffOUTAGw+chV7HvBuNavYpdRSOs/ObmJcQfbW9/qdfN+qO/EH87lpxmymTJvNOVe/yi6ffo+zrnyNdxZl/y/6cJW4+apNOPRrWZdr6OarmflY9srU0oW9eOMffRk2YlXV6l90ffvX03/9+jXrn/zse7zy935suHHWFZaCY894m7tu2Lia1ewyGkeN3SJsRkQskXQzWTC8LhX/hWwk6AbgOOCxzqxTtd1y1SY8+cBAogEOOX4xoz+9HIDjvv0Wl357BCeP2Y4IGH/ufDbcuDYfC3QHGw2t4/xrXwGgZ6/g4T9sxPRHBnL4+IV88d+y93T/fPeG3PfbwVWsZddSSxOzqo1R5fLcRFoeEQPS+qbAXOAnEXGBpC2BXwFDgIXACRHxWkvX2n2XfjHt3uEVr7OVz4Gbja52FaydHohbZ7T2knNbNtp+kxhz3ZFtHwj8ft+r1+le5dApLcLGIJjW3wbWK9l+leybQTPrRrpKtzcPf1liZmXniVnNzHAgNLOCa3yPsFY4EJpZRdTSe4QOhGZWdhFQl3PS1a7AgdDMKsJdYzMrND8jNDMDwoHQzIrOgyVmVmgRtfWMsHaGdcyshoj6hh65ljavJPWTNE3S3yQ9J+mHqfx6SXMlzUzL6FQuSRNTgvdZknZr6x5uEZpZRZTxGeEqYExELJfUG3hc0t1p3/ci4tYmx48FRqVlT+Dq9LNFDoRmVnbl/NY4JV5anjZ7p6W1abPGAVPSeU9IGiRpWETMb+kEd43NrPwie06YZyEleC9ZTmp6OUk9Jc0EFgD3R8STaddFqfs7QVLfVLYmwXtSmvy9WW4RmllFtGPUuNUE7wARUQ+MljQI+IOknYBzgLeAPmSpPM4CLuxIXd0iNLOyizIOlqx13Yh3yPIZHxQR8yOzimxy5z3SYY0J3huVJn9vlgOhmVVEO7rGrZI0NLUEkdQf+ALwd0nDUpmAw4Fn0yl3AF9Po8d7Actaez4I7hqbWYWUcdR4GDBZUk+yxtvNEXFXyoc+FBAwEzglHT8VOBiYA6wATmjrBg6EZlZ2WWuvbKPGs4BdmylvNsVHGi0+tT33cCA0s4qopS9LHAjNrCI6IUFm2TgQmlnZBaLBE7OaWdHVUIPQgdDMKqCMgyWdwYHQzCqjhpqEDoRmVhHdokUo6b9pJaZHxLcqUiMzq3kBNDR0g0AITO+0WphZ9xJAd2gRRsTk0m1J60XEispXycy6g1p6j7DNF30k7S1pNvD3tL2LpKsqXjMzq22Rc+kC8rzxeAVwILAYICL+BuxXwTqZWc0TEfmWriDXqHFEvJ7NdLNGfWWqY2bdRhdp7eWRJxC+LmkfIFLilDOA5ytbLTOraQFRQ6PGebr
"text/plain": [
"<Figure size 432x288 with 2 Axes>"
]
},
"metadata": {
"needs_background": "light"
},
"output_type": "display_data"
}
],
"source": [
"plot_confusion_matrix(dt,X_test,y_test)"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"**TASK: Analyze your results, which model performed best for you?**"
]
},
{
"cell_type": "code",
"execution_count": 294,
"metadata": {},
"outputs": [],
"source": [
"# With base models, we got best performance from an AdaBoostClassifier, but note, we didn't do any gridsearching AND most models performed about the same on the data set."
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"### Great job!"
]
}
],
"metadata": {
"anaconda-cloud": {},
"kernelspec": {
"display_name": "Python 3",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.8.5"
}
},
"nbformat": 4,
"nbformat_minor": 1
}