As an AI researcher and engineer, I’m excited to share a practical guide to help you dive into deep learning. Whether you’re new to the field or looking to solidify your skills, this tutorial will walk you through setting up a deep learning environment and training your first model on an engaging dataset. Let’s get started!
Introduction
Deep learning has transformed artificial intelligence, empowering machines to learn from massive datasets and tackle tasks once thought exclusive to humans—think image recognition, natural language processing, and beyond. Its power lies in neural networks, layered models loosely inspired by the human brain that extract patterns from data and make predictions.
In this tutorial, we’ll set up a deep learning environment from scratch and train a model on the CIFAR-10 dataset—a collection of 60,000 small color images across 10 classes like airplanes, cars, and birds. This dataset is perfect for beginners: it’s challenging enough to showcase deep learning’s capabilities but simple enough to grasp. We’ll cover environment setup, code implementation with explanations, and wrap up with key takeaways. Let’s dive in!
Setting Up the Environment
Before we build and train our model, we need a functional deep learning environment. While GPUs accelerate computations, you can follow along with a CPU if needed.
Note: Prefer a hassle-free setup? Use Google Colab—it offers a free GPU and pre-installed TensorFlow. Just upload your notebook and run the cells there.
Steps:
1. Install Python
Deep learning relies heavily on Python. Ensure it’s installed on your system by downloading it from python.org.
2. Install Deep Learning Libraries
We’ll use TensorFlow and its Keras API (bundled with TensorFlow) for this tutorial. Install via pip:
pip install tensorflow
Got a GPU? With TensorFlow 2.x there is no separate GPU package: the standard tensorflow install includes GPU support, as long as compatible NVIDIA drivers and CUDA libraries are present. (The old tensorflow-gpu package is deprecated.)
3. Install Additional Libraries
We’ll need tools for data handling and visualization:
pip install numpy pandas matplotlib
4. Verify Installation
Test your setup by importing the libraries in a Python shell:
import tensorflow as tf
import keras
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
No errors? You’re ready!
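You can also confirm the TensorFlow version and check whether a GPU is visible. A quick, optional sanity check:
print(tf.__version__)                            # expect a 2.x version
print(tf.config.list_physical_devices('GPU'))    # an empty list means CPU-only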
5. Choosing an Interesting Dataset
We’ll use the CIFAR-10 dataset, which includes 50,000 training images and 10,000 test images, each 32x32 pixels, labeled across 10 categories. It’s a classic benchmark that balances complexity and accessibility. Load it directly with Keras:
from keras.datasets import cifar10
(x_train, y_train), (x_test, y_test) = cifar10.load_data()
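It’s worth a quick look at what you just loaded; checking the array shapes and one sample image confirms the data arrived as expected:
print(x_train.shape)   # (50000, 32, 32, 3): 50,000 RGB images, 32x32 pixels each
print(y_train.shape)   # (50000, 1): one integer label (0-9) per image
plt.imshow(x_train[0])                 # display the first training image
plt.title(f'Label: {y_train[0][0]}')   # its raw integer label
plt.show()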
Code and Explanations
Let’s build and train a convolutional neural network (CNN) step by step.
6. Data Preprocessing
Raw data needs preparation before training.
Normalize the images. Pixel values range from 0 to 255; scale them to [0, 1] for better model performance:
x_train = x_train.astype('float32') / 255
x_test = x_test.astype('float32') / 255
One-hot encode the labels. Convert the integer labels (0-9) into binary vectors for multi-class classification:
from keras.utils import to_categorical
y_train = to_categorical(y_train, 10)
y_test = to_categorical(y_test, 10)
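To make the encoding concrete: label 3 (cat) becomes a 10-element vector with a 1 in position 3 and 0s elsewhere:
print(to_categorical([3], 10))
# [[0. 0. 0. 1. 0. 0. 0. 0. 0. 0.]]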
7. Building the Model
We’ll create a simple CNN, ideal for image tasks:
from keras.models import Sequential
from keras.layers import Conv2D, MaxPooling2D, Flatten, Dense
model = Sequential()
model.add(Conv2D(32, (3, 3), activation='relu', input_shape=(32, 32, 3)))
model.add(MaxPooling2D((2, 2)))
model.add(Conv2D(64, (3, 3), activation='relu'))
model.add(MaxPooling2D((2, 2)))
model.add(Conv2D(64, (3, 3), activation='relu'))
model.add(Flatten())
model.add(Dense(64, activation='relu'))
model.add(Dense(10, activation='softmax'))
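Before training, it’s worth printing a layer-by-layer overview to check output shapes and parameter counts; for this architecture the total should be roughly 122,000 trainable parameters:
model.summary()   # prints each layer's output shape and parameter count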
Explanation:
Conv2D: Extracts features (edges, textures) using 3x3 filters. The first layer has 32 filters, later ones 64. ReLU activation introduces nonlinearity.
MaxPooling2D: Downsamples feature maps (e.g., 2x2 reduces size by half), retaining key info while cutting computation.
Flatten: Converts 2D feature maps into a 1D vector for dense layers.
Dense: Fully connected layers; the final one uses softmax for 10-class probability outputs.
input_shape=(32, 32, 3): Matches CIFAR-10’s 32x32 RGB images.
8. Compiling the Model
Configure the model for training:
model.compile(optimizer='adam',
loss='categorical_crossentropy',
metrics=['accuracy'])
Explanation:
optimizer='adam': An adaptive learning-rate optimizer that works well out of the box for most problems.
loss='categorical_crossentropy': The standard loss for multi-class classification with one-hot labels; it measures the divergence between predicted probabilities and the true labels.
metrics=['accuracy']: Tracks classification accuracy during training.
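Side note: if you’d rather skip the one-hot encoding step, Keras also provides sparse_categorical_crossentropy, which accepts integer labels directly. The equivalent compile call (assuming the labels were left as integers) would be:
model.compile(optimizer='adam',
              loss='sparse_categorical_crossentropy',   # works with integer labels
              metrics=['accuracy'])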
9. Training the Model
Fit the model to the training data:
history = model.fit(x_train, y_train, epochs=10, batch_size=64, validation_split=0.2)
Explanation:
epochs=10: Runs through the full training set 10 times. More epochs can improve accuracy but risk overfitting.
batch_size=64: Updates weights after every 64 samples, balancing speed and stability.
validation_split=0.2: Reserves 20% of the training data to monitor generalization, so you aren’t judging the model purely on its fit to the training set.
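If you experiment with longer training runs, Keras’s EarlyStopping callback is a common safeguard: it halts training once validation performance stops improving. A minimal sketch (the patience value is illustrative, not tuned):
from keras.callbacks import EarlyStopping

# stop when val_loss hasn't improved for 3 consecutive epochs,
# restoring the weights from the best epoch seen
early_stop = EarlyStopping(monitor='val_loss', patience=3, restore_best_weights=True)
history = model.fit(x_train, y_train, epochs=50, batch_size=64,
                    validation_split=0.2, callbacks=[early_stop])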
10. Evaluating the Model
Test the trained model:
test_loss, test_acc = model.evaluate(x_test, y_test)
print(f'Test accuracy: {test_acc}')
Explanation:
evaluate: Computes loss and accuracy on unseen test data, giving a real-world performance snapshot.
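Beyond the aggregate accuracy, you can inspect individual predictions: model.predict returns a probability vector per image, and np.argmax picks the most likely class. The class_names list below is the standard CIFAR-10 label order:
class_names = ['airplane', 'automobile', 'bird', 'cat', 'deer',
               'dog', 'frog', 'horse', 'ship', 'truck']

probs = model.predict(x_test[:5])        # probabilities for the first 5 test images
preds = np.argmax(probs, axis=1)         # most likely class per image
truth = np.argmax(y_test[:5], axis=1)    # recover integer labels from one-hot vectors
for p, t in zip(preds, truth):
    print(f'predicted: {class_names[p]:12s} actual: {class_names[t]}')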
11. Visualizing the Results
Plot training progress:
plt.plot(history.history['accuracy'], label='accuracy')
plt.plot(history.history['val_accuracy'], label='val_accuracy')
plt.xlabel('Epoch')
plt.ylabel('Accuracy')
plt.ylim([0, 1])
plt.legend(loc='lower right')
plt.show()
Explanation:
Graphs training and validation accuracy per epoch. Divergence (e.g., high training accuracy, low validation) signals overfitting.
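The same history object also records loss, and loss curves often reveal overfitting earlier than accuracy curves do:
plt.plot(history.history['loss'], label='loss')
plt.plot(history.history['val_loss'], label='val_loss')
plt.xlabel('Epoch')
plt.ylabel('Loss')
plt.legend(loc='upper right')
plt.show()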
Summary
In this tutorial, we’ve set up a deep learning environment and trained a CNN on the CIFAR-10 dataset. We installed essential libraries, preprocessed data, built a model, and evaluated its performance. Our simple architecture gets decent results, but there’s plenty of room to grow.
To boost accuracy, try adding layers, using dropout for regularization, or applying data augmentation to enhance generalization. Deep learning is a vast playground—experiment with architectures, hyperparameters, or explore new datasets and domains like NLP or reinforcement learning. Practice makes perfect, so keep tinkering. Happy learning!
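As a starting point for those experiments, here is a sketch of two common upgrades: a Dropout layer for regularization and on-the-fly data augmentation. The dropout rate and augmentation ranges below are illustrative defaults, not tuned values, and ImageDataGenerator is the classic TF 2.x API (newer Keras versions favor augmentation layers):
from keras.layers import Dropout
from keras.preprocessing.image import ImageDataGenerator

# same architecture as before, plus dropout before the final classifier
model = Sequential()
model.add(Conv2D(32, (3, 3), activation='relu', input_shape=(32, 32, 3)))
model.add(MaxPooling2D((2, 2)))
model.add(Conv2D(64, (3, 3), activation='relu'))
model.add(MaxPooling2D((2, 2)))
model.add(Conv2D(64, (3, 3), activation='relu'))
model.add(Flatten())
model.add(Dense(64, activation='relu'))
model.add(Dropout(0.5))   # randomly zero half the activations during training
model.add(Dense(10, activation='softmax'))
model.compile(optimizer='adam', loss='categorical_crossentropy', metrics=['accuracy'])

# randomly shift and flip training images each epoch
datagen = ImageDataGenerator(width_shift_range=0.1,
                             height_shift_range=0.1,
                             horizontal_flip=True)
history = model.fit(datagen.flow(x_train, y_train, batch_size=64),
                    epochs=10, validation_data=(x_test, y_test))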