Person Detection - Transfer learning

Harvard University / Person Detection Public

Target: Cortex-M4F 80MHz Clone this project

Neural Network settings

Training settings

Number of training cycles

Please provide a valid number of training cycles (numeric only)

Use learned optimizer

Learning rate

Please provide a valid number for the learning rate (between 0 and 1)

Training processor

Please provide a valid training processor option

Data augmentation

Advanced training settings

Validation set size

Please provide a valid number for the train/validate split (between 0 and 1)

Split train/validation set on metadata key

Batch size

Auto-weight classes

Profile int8 model

Neural network architecture

Save

import math, requests from pathlib import Path import tensorflow as tf from tensorflow.keras import Model from tensorflow.keras.models import Sequential from tensorflow.keras.layers import ( Dense, InputLayer, Dropout, Conv1D, Flatten, Reshape, MaxPooling1D, BatchNormalization, Conv2D, GlobalMaxPooling2D, Lambda, GlobalAveragePooling2D) from tensorflow.keras.optimizers.legacy import Adam, Adadelta from tensorflow.keras.losses import categorical_crossentropy sys.path.append('./resources/libraries') import ei_tensorflow.training WEIGHTS_PATH = './transfer-learning-weights/keras/mobilenet_2_5_128_tf.h5' # Download the model weights root_url = 'https://cdn.edgeimpulse.com/' p = Path(WEIGHTS_PATH) if not p.exists(): print(f"Pretrained weights {WEIGHTS_PATH} unavailable; downloading...") if not p.parent.exists(): p.parent.mkdir(parents=True) weights_data = requests.get(root_url + WEIGHTS_PATH[2:]).content with open(WEIGHTS_PATH, 'wb') as f: f.write(weights_data) print(f"Pretrained weights {WEIGHTS_PATH} unavailable; downloading OK") print("") INPUT_SHAPE = (96, 96, 3) base_model = tf.keras.applications.MobileNet( input_shape = INPUT_SHAPE, weights = WEIGHTS_PATH, alpha = 0.25 ) base_model.trainable = False model = Sequential() model.add(InputLayer(input_shape=INPUT_SHAPE, name='x_input')) # Don't include the base model's top layers last_layer_index = -5 model.add(Model(inputs=base_model.inputs, outputs=base_model.layers[last_layer_index].output)) model.add(Reshape((-1, model.layers[-1].output.shape[3]))) model.add(Dropout(0.1)) model.add(Flatten()) model.add(Dense(classes, activation='softmax')) BATCH_SIZE = args.batch_size or 32 EPOCHS = args.epochs or 20 LEARNING_RATE = args.learning_rate or 0.0005 # If True, non-deterministic functions (e.g. shuffling batches) are not used. # This is False by default. ENSURE_DETERMINISM = args.ensure_determinism if not ENSURE_DETERMINISM: train_dataset = train_dataset.shuffle(buffer_size=BATCH_SIZE*4) prefetch_policy = 1 if ENSURE_DETERMINISM else tf.data.AUTOTUNE train_dataset = train_dataset.batch(BATCH_SIZE, drop_remainder=False).prefetch(prefetch_policy) validation_dataset = validation_dataset.batch(BATCH_SIZE, drop_remainder=False).prefetch(prefetch_policy) callbacks.append(BatchLoggerCallback(BATCH_SIZE, train_sample_count, epochs=EPOCHS, ensure_determinism=ENSURE_DETERMINISM)) model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=LEARNING_RATE), loss='categorical_crossentropy', metrics=['accuracy']) model.fit(train_dataset, validation_data=validation_dataset, epochs=EPOCHS, verbose=2, callbacks=callbacks) print('') print('Initial training done.', flush=True) # How many epochs we will fine tune the model FINE_TUNE_EPOCHS = 10 # What percentage of the base model's layers we will fine tune FINE_TUNE_PERCENTAGE = 65 print('Fine-tuning best model for {} epochs...'.format(FINE_TUNE_EPOCHS), flush=True) # Load best model from initial training model = ei_tensorflow.training.load_best_model(BEST_MODEL_PATH) # Determine which layer to begin fine tuning at model_layer_count = len(model.layers) fine_tune_from = math.ceil(model_layer_count * ((100 - FINE_TUNE_PERCENTAGE) / 100)) # Allow the entire base model to be trained model.trainable = True # Freeze all the layers before the 'fine_tune_from' layer for layer in model.layers[:fine_tune_from]: layer.trainable = False model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=0.000045), loss='categorical_crossentropy', metrics=['accuracy']) model.fit(train_dataset, epochs=FINE_TUNE_EPOCHS, verbose=2, validation_data=validation_dataset, callbacks=callbacks, class_weight=None )

Input layer (27,648 features)

MobileNetV1 96x96 0.25 (no final dense layer, 0.1 dropout)

Output layer (2 classes)

Model

Model version:

Last training performance (validation set)

Accuracy

80.7%

Loss

0.48

Confusion matrix (validation set)

Data explorer (full training set)

On-device performance

Engine:

Inferencing time

2161 ms.

Peak RAM usage

106.9K

Flash usage

305.3K

Did you know? You can customize your model through the Expert view (click on to switch),
or can even bring your own model (in PyTorch, Keras or scikit-learn).

Model	Author
MobileNetV1 96x96 0.25 Officially supported A pre-trained multi-layer convolutional network designed to efficiently classify images. Uses around 105.9K RAM and 301.6K ROM with default settings and optimizations.	Edge Impulse
MobileNetV1 96x96 0.2 Officially supported Uses around 83.1K RAM and 218.3K ROM with default settings and optimizations. Works best with 96x96 input size. Supports both RGB and grayscale.	Edge Impulse
MobileNetV1 96x96 0.1 Officially supported Uses around 53.2K RAM and 101K ROM with default settings and optimizations. Works best with 96x96 input size. Supports both RGB and grayscale.	Edge Impulse
MobileNetV2 96x96 0.35 Officially supported Uses around 296.8K RAM and 575.2K ROM with default settings and optimizations. Works best with 96x96 input size. Supports both RGB and grayscale.	Edge Impulse
MobileNetV2 96x96 0.1 Officially supported Uses around 270.2K RAM and 212.3K ROM with default settings and optimizations. Works best with 96x96 input size. Supports both RGB and grayscale.	Edge Impulse
MobileNetV2 96x96 0.05 Officially supported Uses around 265.3K RAM and 162.4K ROM with default settings and optimizations. Works best with 96x96 input size. Supports both RGB and grayscale.	Edge Impulse
MobileNetV2 160x160 1.0 Officially supported Uses around 1.3M RAM and 2.6M ROM with default settings and optimizations. Works best with 160x160 input size. Supports RGB only.	Edge Impulse
MobileNetV2 160x160 0.75 Officially supported Uses around 1.3M RAM and 1.7M ROM with default settings and optimizations. Works best with 160x160 input size. Supports RGB only.	Edge Impulse
MobileNetV2 160x160 0.5 Officially supported Uses around 700.7K RAM and 982.4K ROM with default settings and optimizations. Works best with 160x160 input size. Supports RGB only.	Edge Impulse
MobileNetV2 160x160 0.35 Officially supported Uses around 683.3K RAM and 658.4K ROM with default settings and optimizations. Works best with 160x160 input size. Supports RGB only.	Edge Impulse
EfficientNet B0 Community Transfer learning model based on efficientnetb0_notop.h5 weights. This is a much larger model than MobileNet for Linux devices and accelerators.	Community blocks
NVIDIA TAO Image Classification Professional Enterprise Image classification model for general purpose use. Configurable backbones optimized for targets from MCU to GPU. Only supports RGB images. Pre-trained weights only support 224x224 resolution. Image width and height must be greater than 32. Training requires GPU.	Edge Impulse Inc.

The backbone is the foundational element of a neural network model, responsible for extracting meaningful features from input data, enabling subsequent parts to perform anomaly scoring or classification.

Description	Author
EfficientNet V2B0 Uses around 339K RAM based on your input size, and between 240-1675K ROM depending on the number of layers, with default compiler settings. Supports both RGB and grayscale.	Edge Impulse
MobileNetV2 0.35 Uses around 249K RAM based on your input size, and between 69-134K ROM depending on the number of layers, with default compiler settings. Supports both RGB and grayscale.	Edge Impulse
MobileNetV2 0.50 Uses around 256K RAM based on your input size, and between 78-168K ROM depending on the number of layers, with default compiler settings. Supports both RGB and grayscale.	Edge Impulse
MobileNetV2 0.75 Uses around 489K RAM based on your input size, and between 109-266K ROM depending on the number of layers, with default compiler settings. Supports both RGB and grayscale.	Edge Impulse
MobileNetV2 1.0 Uses around 499K RAM based on your input size, and between 126-372K ROM depending on the number of layers, with default compiler settings. Supports both RGB and grayscale.	Edge Impulse
MobileNetV2 0.1 Uses around 238K RAM based on your input size, and 58K ROM, with default compiler settings. Supports both RGB and grayscale.	Edge Impulse

The scoring function operates on the features extracted by the backbone, and returns anomaly scores for each segment of the image.

Description	Author	Recommended
MobileNetV1 96x96 0.25 A pre-trained multi-layer convolutional network designed to efficiently classify images. Uses around 105.9K RAM and 301.6K ROM with default settings and optimizations.	Edge Impulse
MobileNetV1 96x96 0.2 Uses around 83.1K RAM and 218.3K ROM with default settings and optimizations. Works best with 96x96 input size. Supports both RGB and grayscale.	Edge Impulse
MobileNetV1 96x96 0.1 Uses around 53.2K RAM and 101K ROM with default settings and optimizations. Works best with 96x96 input size. Supports both RGB and grayscale.	Edge Impulse
MobileNetV2 96x96 0.35 Uses around 296.8K RAM and 575.2K ROM with default settings and optimizations. Works best with 96x96 input size. Supports both RGB and grayscale.	Edge Impulse
MobileNetV2 96x96 0.1 Uses around 270.2K RAM and 212.3K ROM with default settings and optimizations. Works best with 96x96 input size. Supports both RGB and grayscale.	Edge Impulse
MobileNetV2 96x96 0.05 Uses around 265.3K RAM and 162.4K ROM with default settings and optimizations. Works best with 96x96 input size. Supports both RGB and grayscale.	Edge Impulse
MobileNetV2 160x160 1.0 Uses around 1.3M RAM and 2.6M ROM with default settings and optimizations. Works best with 160x160 input size. Supports RGB only.	Edge Impulse
MobileNetV2 160x160 0.75 Uses around 1.3M RAM and 1.7M ROM with default settings and optimizations. Works best with 160x160 input size. Supports RGB only.	Edge Impulse
MobileNetV2 160x160 0.5 Uses around 700.7K RAM and 982.4K ROM with default settings and optimizations. Works best with 160x160 input size. Supports RGB only.	Edge Impulse
MobileNetV2 160x160 0.35 Uses around 683.3K RAM and 658.4K ROM with default settings and optimizations. Works best with 160x160 input size. Supports RGB only.	Edge Impulse
EfficientNet B0 Transfer learning model based on efficientnetb0_notop.h5 weights. This is a much larger model than MobileNet for Linux devices and accelerators.	Edge Impulse
NVIDIA TAO Image Classification Image classification model for general purpose use. Configurable backbones optimized for targets from MCU to GPU. Only supports RGB images. Pre-trained weights only support 224x224 resolution. Image width and height must be greater than 32. Training requires GPU.	Edge Impulse

These users will be notified over email when this job finishes.

	Name
	Jenny Plunkett Edge Impulse staff

Neural Network settings

Training settings

Augmentation settings

Advanced training settings

Neural network architecture

Model

Last training performance (validation set)

Accuracy

Loss

Confusion matrix (validation set)

Data explorer (full training set)

Settings

On-device performance

Inferencing time

Peak RAM usage

Flash usage

Last training performance (validation set)

Accuracy

Loss

Confusion matrix (validation set)

Data explorer (full training set)

Settings

On-device performance

Inferencing time

Peak RAM usage

Flash usage