| 1 | +{ |
| 2 | + "cells": [ |
| 3 | + { |
| 4 | + "cell_type": "markdown", |
| 5 | + "metadata": {}, |
| 6 | + "source": [ |
| 7 | + "## Multilayer Perceptron: Fit and evaluate a model\n", |
| 8 | + "\n", |
| 9 | + "Using the Titanic dataset from [this](https://www.kaggle.com/c/titanic/overview) Kaggle competition.\n", |
| 10 | + "\n", |
| 11 | + "In this section, we will fit and evaluate a simple Multilayer Perceptron model." |
| 12 | + ] |
| 13 | + }, |
| 14 | + { |
| 15 | + "cell_type": "markdown", |
| 16 | + "metadata": {}, |
| 17 | + "source": [ |
| 18 | + "### Read in Data" |
| 19 | + ] |
| 20 | + }, |
| 21 | + { |
| 22 | + "cell_type": "code", |
| 23 | + "execution_count": 4, |
| 24 | + "metadata": {}, |
| 25 | + "outputs": [], |
| 26 | + "source": [ |
| 27 | + "import joblib\n", |
| 28 | + "import pandas as pd\n", |
| 29 | + "from sklearn.neural_network import MLPClassifier\n", |
| 30 | + "from sklearn.model_selection import GridSearchCV\n", |
| 31 | + "\n", |
| 32 | + "import warnings\n", |
| 33 | + "warnings.filterwarnings('ignore', category=FutureWarning)\n", |
| 34 | + "warnings.filterwarnings('ignore', category=DeprecationWarning)\n", |
| 35 | + "\n", |
| 36 | + "train_features = pd.read_csv('../Data/train_features.csv')\n", |
| 37 | + "train_labels = pd.read_csv('../Data/train_labels.csv', header=None)" |
| 38 | + ] |
| 39 | + }, |
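|  | + {
|  | + "cell_type": "markdown",
|  | + "metadata": {},
|  | + "source": [
|  | + "A quick sanity check (an added sketch, not part of the original run): confirm the features and labels line up row for row before fitting anything."
|  | + ]
|  | + },
|  | + {
|  | + "cell_type": "code",
|  | + "execution_count": null,
|  | + "metadata": {},
|  | + "outputs": [],
|  | + "source": [
|  | + "# the features and labels should have the same number of rows\n",
|  | + "print(train_features.shape, train_labels.shape)\n",
|  | + "train_features.head()"
|  | + ]
|  | + },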
| 40 | + { |
| 41 | + "cell_type": "markdown", |
| 42 | + "metadata": {}, |
| 43 | + "source": [ |
| 44 | + "### Hyperparameter tuning\n", |
| 45 | + "\n", |
| 46 | + "" |
| 47 | + ] |
| 48 | + }, |
| 49 | + { |
| 50 | + "cell_type": "code", |
| 51 | + "execution_count": 5, |
| 52 | + "metadata": {}, |
| 53 | + "outputs": [], |
| 54 | + "source": [ |
| 55 | + "def print_results(results):\n", |
| 56 | + " print('BEST PARAMS: {}'.format(results.best_params_))\n", |
| 57 | + " \n", |
| 58 | + " means = results.cv_results_['mean_test_score']\n", |
| 59 | + " stds = results.cv_results_['std_test_score']\n", |
| 60 | + " for mean, std, params in zip(means, stds, results.cv_results_['params']):\n", |
| 61 | + " print('{} (+- {}) for {}'.format(round(mean,3), round(std *2, 3), params))" |
| 62 | + ] |
| 63 | + }, |
| 64 | + { |
| 65 | + "cell_type": "markdown", |
| 66 | + "metadata": {}, |
| 67 | + "source": [ |
| 68 | + "#### Hyper parameters tuning Notes\n", |
| 69 | + "- #### hidden_layer_sizes\n", |
| 70 | + " - as the problem is relatively simple, we will use one layer only => passing value in the tuple with one value represents 1 layer\n", |
| 71 | + " - here 1 hidden layer with 10 nodes, 50 nodes and 100 nodes.\n", |
| 72 | + "- #### activation\n", |
| 73 | + " - `relu`, `tanh`, `logistic`\n", |
| 74 | + "- #### learning_rate\n", |
| 75 | + " - `constant`: it will just take the initial learning rate and keep it the same throughout the entire optimization process.\n", |
| 76 | + " - `invscaling`: (inverse scaling) it gradually decreases the learning rate at each step. So this will allow it to take large jump at first. and then it slowly decreases as it gets closer and closer to optimal model.\n", |
| 77 | + " - `adaptive`: this keeps the learning constant as long as training loss keeps decreasing. If the learning rate stops going down, then it will decrease the learning rate, so that it takes smaller steps. " |
| 78 | + ] |
| 79 | + }, |
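|  | + {
|  | + "cell_type": "markdown",
|  | + "metadata": {},
|  | + "source": [
|  | + "A minimal sketch (added for illustration, not executed as part of the original run): the learning-rate schedules above only take effect with the SGD solver, because `MLPClassifier` defaults to `solver='adam'`, which ignores the `learning_rate` setting."
|  | + ]
|  | + },
|  | + {
|  | + "cell_type": "code",
|  | + "execution_count": null,
|  | + "metadata": {},
|  | + "outputs": [],
|  | + "source": [
|  | + "# hypothetical example: an MLP where the 'adaptive' schedule actually applies,\n",
|  | + "# because the solver is explicitly set to SGD\n",
|  | + "sgd_mlp = MLPClassifier(solver='sgd', learning_rate='adaptive',\n",
|  | + "                        hidden_layer_sizes=(10,), max_iter=1000)\n",
|  | + "sgd_mlp"
|  | + ]
|  | + },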
| 80 | + { |
| 81 | + "cell_type": "code", |
| 82 | + "execution_count": 9, |
| 83 | + "metadata": {}, |
| 84 | + "outputs": [ |
| 85 | + { |
| 86 | + "name": "stderr", |
| 87 | + "output_type": "stream", |
| 88 | + "text": [ |
| 89 | + "C:\\Users\\Phone Thiri Yadana\\.conda\\envs\\venv-datascience\\lib\\site-packages\\sklearn\\neural_network\\_multilayer_perceptron.py:582: ConvergenceWarning: Stochastic Optimizer: Maximum iterations (1000) reached and the optimization hasn't converged yet.\n", |
| 90 | + " warnings.warn(\n" |
| 91 | + ] |
| 92 | + }, |
| 93 | + { |
| 94 | + "name": "stdout", |
| 95 | + "output_type": "stream", |
| 96 | + "text": [ |
| 97 | + "BEST PARAMS: {'activation': 'tanh', 'hidden_layer_sizes': (10,), 'learning_rate': 'constant'}\n", |
| 98 | + "0.787 (+- 0.114) for {'activation': 'relu', 'hidden_layer_sizes': (10,), 'learning_rate': 'constant'}\n", |
| 99 | + "0.792 (+- 0.099) for {'activation': 'relu', 'hidden_layer_sizes': (10,), 'learning_rate': 'invscaling'}\n", |
| 100 | + "0.79 (+- 0.1) for {'activation': 'relu', 'hidden_layer_sizes': (10,), 'learning_rate': 'adaptive'}\n", |
| 101 | + "0.774 (+- 0.136) for {'activation': 'relu', 'hidden_layer_sizes': (50,), 'learning_rate': 'constant'}\n", |
| 102 | + "0.787 (+- 0.082) for {'activation': 'relu', 'hidden_layer_sizes': (50,), 'learning_rate': 'invscaling'}\n", |
| 103 | + "0.8 (+- 0.102) for {'activation': 'relu', 'hidden_layer_sizes': (50,), 'learning_rate': 'adaptive'}\n", |
| 104 | + "0.789 (+- 0.123) for {'activation': 'relu', 'hidden_layer_sizes': (100,), 'learning_rate': 'constant'}\n", |
| 105 | + "0.783 (+- 0.105) for {'activation': 'relu', 'hidden_layer_sizes': (100,), 'learning_rate': 'invscaling'}\n", |
| 106 | + "0.805 (+- 0.109) for {'activation': 'relu', 'hidden_layer_sizes': (100,), 'learning_rate': 'adaptive'}\n", |
| 107 | + "0.824 (+- 0.092) for {'activation': 'tanh', 'hidden_layer_sizes': (10,), 'learning_rate': 'constant'}\n", |
| 108 | + "0.792 (+- 0.115) for {'activation': 'tanh', 'hidden_layer_sizes': (10,), 'learning_rate': 'invscaling'}\n", |
| 109 | + "0.779 (+- 0.139) for {'activation': 'tanh', 'hidden_layer_sizes': (10,), 'learning_rate': 'adaptive'}\n", |
| 110 | + "0.805 (+- 0.082) for {'activation': 'tanh', 'hidden_layer_sizes': (50,), 'learning_rate': 'constant'}\n", |
| 111 | + "0.807 (+- 0.083) for {'activation': 'tanh', 'hidden_layer_sizes': (50,), 'learning_rate': 'invscaling'}\n", |
| 112 | + "0.809 (+- 0.108) for {'activation': 'tanh', 'hidden_layer_sizes': (50,), 'learning_rate': 'adaptive'}\n", |
| 113 | + "0.803 (+- 0.086) for {'activation': 'tanh', 'hidden_layer_sizes': (100,), 'learning_rate': 'constant'}\n", |
| 114 | + "0.792 (+- 0.09) for {'activation': 'tanh', 'hidden_layer_sizes': (100,), 'learning_rate': 'invscaling'}\n", |
| 115 | + "0.788 (+- 0.091) for {'activation': 'tanh', 'hidden_layer_sizes': (100,), 'learning_rate': 'adaptive'}\n", |
| 116 | + "0.798 (+- 0.106) for {'activation': 'logistic', 'hidden_layer_sizes': (10,), 'learning_rate': 'constant'}\n", |
| 117 | + "0.79 (+- 0.127) for {'activation': 'logistic', 'hidden_layer_sizes': (10,), 'learning_rate': 'invscaling'}\n", |
| 118 | + "0.787 (+- 0.142) for {'activation': 'logistic', 'hidden_layer_sizes': (10,), 'learning_rate': 'adaptive'}\n", |
| 119 | + "0.805 (+- 0.12) for {'activation': 'logistic', 'hidden_layer_sizes': (50,), 'learning_rate': 'constant'}\n", |
| 120 | + "0.789 (+- 0.124) for {'activation': 'logistic', 'hidden_layer_sizes': (50,), 'learning_rate': 'invscaling'}\n", |
| 121 | + "0.8 (+- 0.111) for {'activation': 'logistic', 'hidden_layer_sizes': (50,), 'learning_rate': 'adaptive'}\n", |
| 122 | + "0.794 (+- 0.108) for {'activation': 'logistic', 'hidden_layer_sizes': (100,), 'learning_rate': 'constant'}\n", |
| 123 | + "0.794 (+- 0.121) for {'activation': 'logistic', 'hidden_layer_sizes': (100,), 'learning_rate': 'invscaling'}\n", |
| 124 | + "0.789 (+- 0.1) for {'activation': 'logistic', 'hidden_layer_sizes': (100,), 'learning_rate': 'adaptive'}\n" |
| 125 | + ] |
| 126 | + } |
| 127 | + ], |
| 128 | + "source": [ |
| 129 | + "mlp = MLPClassifier(max_iter = 1000)\n", |
| 130 | + "\n", |
| 131 | + "parameters = {\n", |
| 132 | + " 'hidden_layer_sizes': [(10,), (50,), (100,)], \n", |
| 133 | + " 'activation': ['relu', 'tanh', 'logistic'],\n", |
| 134 | + " 'learning_rate': ['constant', 'invscaling', 'adaptive'],\n", |
| 135 | + "}\n", |
| 136 | + "\n", |
| 137 | + "cv = GridSearchCV(mlp, parameters, cv=5)\n", |
| 138 | + "cv.fit(train_features, train_labels.values.ravel())\n", |
| 139 | + "\n", |
| 140 | + "print_results(cv)" |
| 141 | + ] |
| 142 | + }, |
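|  | + {
|  | + "cell_type": "markdown",
|  | + "metadata": {},
|  | + "source": [
|  | + "Because `GridSearchCV` refits the best parameter combination on the full training set by default (`refit=True`), the fitted search object exposes `best_score_` and `best_estimator_` directly (a quick added check):"
|  | + ]
|  | + },
|  | + {
|  | + "cell_type": "code",
|  | + "execution_count": null,
|  | + "metadata": {},
|  | + "outputs": [],
|  | + "source": [
|  | + "# mean cross-validated accuracy of the best parameter combination\n",
|  | + "cv.best_score_"
|  | + ]
|  | + },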
| 143 | + { |
| 144 | + "cell_type": "markdown", |
| 145 | + "metadata": {}, |
| 146 | + "source": [ |
| 147 | + "### Write out pickled model" |
| 148 | + ] |
| 149 | + }, |
| 150 | + { |
| 151 | + "cell_type": "code", |
| 152 | + "execution_count": 10, |
| 153 | + "metadata": {}, |
| 154 | + "outputs": [ |
| 155 | + { |
| 156 | + "data": { |
| 157 | + "text/plain": [ |
| 158 | + "MLPClassifier(activation='tanh', hidden_layer_sizes=(10,), max_iter=1000)" |
| 159 | + ] |
| 160 | + }, |
| 161 | + "execution_count": 10, |
| 162 | + "metadata": {}, |
| 163 | + "output_type": "execute_result" |
| 164 | + } |
| 165 | + ], |
| 166 | + "source": [ |
| 167 | + "cv.best_estimator_" |
| 168 | + ] |
| 169 | + }, |
| 170 | + { |
| 171 | + "cell_type": "code", |
| 172 | + "execution_count": 12, |
| 173 | + "metadata": {}, |
| 174 | + "outputs": [ |
| 175 | + { |
| 176 | + "data": { |
| 177 | + "text/plain": [ |
| 178 | + "['../Pickled_Models/MLP_model.pkl']" |
| 179 | + ] |
| 180 | + }, |
| 181 | + "execution_count": 12, |
| 182 | + "metadata": {}, |
| 183 | + "output_type": "execute_result" |
| 184 | + } |
| 185 | + ], |
| 186 | + "source": [ |
| 187 | + "joblib.dump(cv.best_estimator_, '../Pickled_Models/MLP_model.pkl')" |
| 188 | + ] |
| 189 | + }, |
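|  | + {
|  | + "cell_type": "markdown",
|  | + "metadata": {},
|  | + "source": [
|  | + "A round-trip smoke test (added sketch): reload the pickled model with `joblib.load` and predict on a few training rows just to confirm it deserializes; proper evaluation belongs on a held-out validation set."
|  | + ]
|  | + },
|  | + {
|  | + "cell_type": "code",
|  | + "execution_count": null,
|  | + "metadata": {},
|  | + "outputs": [],
|  | + "source": [
|  | + "# reload the saved model and run a quick prediction as a smoke test\n",
|  | + "loaded_model = joblib.load('../Pickled_Models/MLP_model.pkl')\n",
|  | + "loaded_model.predict(train_features.head())"
|  | + ]
|  | + }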
| 197 | + ], |
| 198 | + "metadata": { |
| 199 | + "kernelspec": { |
| 200 | + "display_name": "Python 3", |
| 201 | + "language": "python", |
| 202 | + "name": "python3" |
| 203 | + }, |
| 204 | + "language_info": { |
| 205 | + "codemirror_mode": { |
| 206 | + "name": "ipython", |
| 207 | + "version": 3 |
| 208 | + }, |
| 209 | + "file_extension": ".py", |
| 210 | + "mimetype": "text/x-python", |
| 211 | + "name": "python", |
| 212 | + "nbconvert_exporter": "python", |
| 213 | + "pygments_lexer": "ipython3", |
| 214 | + "version": "3.8.3" |
| 215 | + } |
| 216 | + }, |
| 217 | + "nbformat": 4, |
| 218 | + "nbformat_minor": 2 |
| 219 | +} |