Step 5: Tune Hyperparameters (original) (raw)

Step 5: Tune Hyperparameters

Stay organized with collections Save and categorize content based on your preferences.

We had to choose a number of hyperparameters for defining and training the model. We relied on intuition, examples and best practice recommendations. Our first choice of hyperparameter values, however, may not yield the best results. It only gives us a good starting point for training. Every problem is different and tuning these hyperparameters will help refine our model to better represent the particularities of the problem at hand. Let’s take a look at some of the hyperparameters we used and what it means to tune them:

There are couple of additional hyperparameters we tuned that are specific to our sepCNN model:

  1. Kernel size: The size of the convolution window. Recommended values: 3 or 5.
  2. Embedding dimensions: The number of dimensions we want to use to represent word embeddings—i.e., the size of each word vector. Recommended values: 50–300. In our experiments, we used GloVe embeddings with 200 dimensions with a pre- trained embedding layer.

Play around with these hyperparameters and see what works best. Once you have chosen the best-performing hyperparameters for your use case, your model is ready to be deployed.