Conveyor Belt Cubes Left Right - Object detection

Edge Impulse Inc. / Conveyor Belt Cubes Left Right Public

Clone this project

Target: Cortex-M7 216MHz

Neural Network settings

Training settings

Number of training cycles

Please provide a valid number of training cycles (numeric only)

Use learned optimizer

Learning rate

Please provide a valid number for the learning rate (between 0 and 1)

Training processor

Please provide a valid training processor option

Data augmentation

Advanced training settings

Validation set size

Please provide a valid number for the train/validate split (between 0 and 1)

Split train/validation set on metadata key

Batch size

Profile int8 model

Neural network architecture

Save

sys.path.append('./resources/libraries') import os import tensorflow as tf from tensorflow.keras.optimizers import Adam from tensorflow.keras.applications import MobileNetV2 from tensorflow.keras.layers import BatchNormalization, Conv2D, Softmax, Reshape from tensorflow.keras.models import Model from ei_tensorflow.constrained_object_detection import models, dataset, metrics, util import ei_tensorflow.training from pathlib import Path import requests def build_model(input_shape: tuple, weights: str, alpha: float, num_classes: int) -> tf.keras.Model: """ Construct a constrained object detection model. Args: input_shape: Passed to MobileNet construction. weights: Weights for initialization of MobileNet where None implies random initialization. alpha: MobileNet alpha value. num_classes: Number of classes, i.e. final dimension size, in output. Returns: Uncompiled keras model. Model takes (B, H, W, C) input and returns (B, H//8, W//8, num_classes) logits. """ #! First create full mobile_net_V2 from (HW, HW, C) input #! to (HW/8, HW/8, C) output mobile_net_v2 = MobileNetV2(input_shape=input_shape, weights=weights, alpha=alpha, include_top=True) #! Default batch norm is configured for huge networks, let's speed it up for layer in mobile_net_v2.layers: if type(layer) == BatchNormalization: layer.momentum = 0.9 #! Cut MobileNet where it hits 1/8th input resolution; i.e. (HW/8, HW/8, C) cut_point = mobile_net_v2.get_layer('block_6_expand_relu') model = cut_point.output #! Now attach a small additional head on the MobileNet model = Conv2D( filters=32, kernel_size=1, strides=1, activation='relu', name='head')(model) # Branch, attend (with positional encodings), # and recombine as residual connection from ei_tensorflow.self_attention import WithSpatialPositionalEncodings, PatchAttention from tensorflow.keras.layers import Dropout, Add branch = Dropout(rate=0.5)(model) branch = WithSpatialPositionalEncodings()(branch) branch = PatchAttention(key_dim=8)(branch) model = Add()([model, branch]) # and finally a classifier layer logits = Conv2D( filters=num_classes, kernel_size=1, strides=1, activation=None, name='logits')(model) model = Model(inputs=mobile_net_v2.input, outputs=logits) print(model.summary()) return model def train(num_classes: int, learning_rate: float, num_epochs: int, alpha: float, object_weight: float, train_dataset: tf.data.Dataset, validation_dataset: tf.data.Dataset, best_model_path: str, input_shape: tuple, lr_finder: bool = False) -> tf.keras.Model: """ Construct and train a constrained object detection model. Args: num_classes: Number of classes in datasets. This does not include implied background class introduced by segmentation map dataset conversion. learning_rate: Learning rate for Adam. num_epochs: Number of epochs passed to model.fit alpha: Alpha used to construct MobileNet. Pretrained weights will be used if there is a matching set. object_weight: The weighting to give the object in the loss function where background has an implied weight of 1.0. train_dataset: Training dataset of (x, (bbox, one_hot_y)) validation_dataset: Validation dataset of (x, (bbox, one_hot_y)) best_model_path: location to save best model path. note: weights will be restored from this path based on best val_f1 score. input_shape: The shape of the model's input max_training_time_s: Max training time (will exit if est. training time is over the limit) is_enterprise_project: Determines what message we print if training time exceeds Returns: Trained keras model. Constructs a new constrained object detection model with num_classes+1 outputs (denoting the classes with an implied background class of 0). Both training and validation datasets are adapted from (x, (bbox, one_hot_y)) to (x, segmentation_map). Model is trained with a custom weighted cross entropy function. """ nonlocal callbacks num_classes_with_background = num_classes + 1 input_width_height = None width, height, input_num_channels = input_shape if width != height: raise Exception(f"Only square inputs are supported; not {input_shape}") input_width_height = width #! Use pretrained weights, if we have them for configured weights = None if input_num_channels == 1: if alpha == 0.1: weights = "./transfer-learning-weights/edgeimpulse/MobileNetV2.0_1.96x96.grayscale.bsize_64.lr_0_05.epoch_441.val_loss_4.13.val_accuracy_0.2.hdf5" elif alpha == 0.35: weights = "./transfer-learning-weights/edgeimpulse/MobileNetV2.0_35.96x96.grayscale.bsize_64.lr_0_005.epoch_260.val_loss_3.10.val_accuracy_0.35.hdf5" elif input_num_channels == 3: if alpha == 0.1: weights = "./transfer-learning-weights/edgeimpulse/MobileNetV2.0_1.96x96.color.bsize_64.lr_0_05.epoch_498.val_loss_3.85.hdf5" elif alpha == 0.35: weights = "./transfer-learning-weights/keras/mobilenet_v2_weights_tf_dim_ordering_tf_kernels_0.35_96.h5" if (weights and not os.path.exists(weights)): print(f"Pretrained weights {weights} unavailable; downloading...") p = Path(weights) if not p.exists(): if not p.parent.exists(): p.parent.mkdir(parents=True) root_url = 'https://cdn.edgeimpulse.com/' weights_data = requests.get(root_url + weights[2:]).content with open(weights, 'wb') as f: f.write(weights_data) print(f"Pretrained weights {weights} unavailable; downloading OK\n") model = build_model( input_shape=input_shape, weights=weights, alpha=alpha, num_classes=num_classes_with_background ) #! Derive output size from model model_output_shape = model.layers[-1].output.shape _batch, width, height, num_classes = model_output_shape if width != height: raise Exception(f"Only square outputs are supported; not {model_output_shape}") output_width_height = width #! Build weighted cross entropy loss specific to this model size weighted_xent = models.construct_weighted_xent_fn(model.output.shape, object_weight) #! Transform bounding box labels into segmentation maps def as_segmentation(ds): return ds.map(dataset.bbox_to_segmentation(output_width_height, num_classes_with_background) ).batch(32, drop_remainder=False).prefetch(1) train_segmentation_dataset = as_segmentation(train_dataset) validation_segmentation_dataset = as_segmentation(validation_dataset) validation_dataset_for_callback = validation_dataset.batch(32, drop_remainder=False).prefetch(1) #! Initialise bias of final classifier based on training data prior. util.set_classifier_biases_from_dataset( model, train_segmentation_dataset) if lr_finder: learning_rate = ei_tensorflow.lr_finder.find_lr(model, train_segmentation_dataset, weighted_xent) model.compile(loss=weighted_xent, optimizer=Adam(learning_rate=learning_rate)) #! Create callback that will do centroid scoring on end of epoch against #! validation data. Include a callback to show % progress in slow cases. callbacks = callbacks if callbacks else [] callbacks.append(metrics.CentroidScoring(validation_dataset_for_callback, output_width_height, num_classes_with_background)) callbacks.append(metrics.PrintPercentageTrained(num_epochs)) #! Include a callback for model checkpointing based on the best validation f1. callbacks.append( tf.keras.callbacks.ModelCheckpoint(best_model_path, monitor='val_f1', save_best_only=True, mode='max', save_weights_only=True, verbose=0)) model.fit(train_segmentation_dataset, validation_data=validation_segmentation_dataset, epochs=num_epochs, callbacks=callbacks, verbose=0) #! Restore best weights. model.load_weights(best_model_path) #! Add explicit softmax layer before export. softmax_layer = Softmax()(model.layers[-1].output) model = Model(model.input, softmax_layer) return model EPOCHS = args.epochs or 50 LEARNING_RATE = args.learning_rate or 0.0005 model = train(num_classes=classes, learning_rate=LEARNING_RATE, num_epochs=EPOCHS, alpha=0.35, object_weight=100, train_dataset=train_dataset, validation_dataset=validation_dataset, best_model_path=BEST_MODEL_PATH, input_shape=MODEL_INPUT_SHAPE, lr_finder=False) override_mode = 'segmentation' disable_per_channel_quantization = False

Input layer (76,800 features)

FOMO (Faster Objects, More Objects) MobileNetV2 0.35

Output layer (2 classes)

Model

Model version:

Did you know? You can customize your model through the Expert view (click on to switch),
or can even bring your own model (in PyTorch, Keras or scikit-learn).

Model	Author
MobileNetV2 SSD FPN-Lite (320x320 only) Officially supported A pre-trained object detection model designed to locate up to 10 objects within an image, outputting a bounding box for each object detected. The model is around 3.7MB in size. It supports an RGB input at 320x320px. For other resolutions, use FOMO or an NVIDIA TAO model.	Edge Impulse
FOMO (Faster Objects, More Objects) MobileNetV2 0.1 Officially supported An object detection model based on MobileNetV2 (alpha 0.1) designed to coarsely segment an image into a grid of background vs objects of interest. These models are designed to be <100KB in size and support a grayscale or RGB input at any resolution.	Edge Impulse
FOMO (Faster Objects, More Objects) MobileNetV2 0.35 Officially supported An object detection model based on MobileNetV2 (alpha 0.35) designed to coarsely segment an image into a grid of background vs objects of interest. These models are designed to be <100KB in size and support a grayscale or RGB input at any resolution.	Edge Impulse
efficientdet-lite Enterprise efficientdet-lite	Edge Impulse Inc.
YOLOv5 (yolov5n.pt) Enterprise YOLOv5 model with yolov5n.pt pretrained weights	Edge Impulse Inc.
YOLOv3 Enterprise Fine-tune a pretrained YOLOv3 model based on yolov3-tiny.pt	Edge Impulse Inc.
TI YOLOX Enterprise Transfer learning model based on yolox_nano_ti_lite_26p1_41p8_checkpoint.pth	Edge Impulse Inc.
mat_akida_nan_testing Enterprise testing of nans coming from akida training	Edge Impulse Inc.
coco-cats Enterprise coco-cats	Edge Impulse Inc.
raul-edgeai-regnetx-800mf-fpn-bgr-lite Enterprise "TI's EDGEAI optimization of the mmdetection regnetx-800mf-fpn-bgr-lite architecture. https://github.com/TexasInstruments/edgeai-mmdetection"	Edge Impulse Inc.
YOLOv5 for Renesas DRP-AI Community Transfer learning model using YOLOv5 v5 branch with yolov5s.pt weights. This block is only compatible with Renesas DRP-AI.	Renesas
YOLOv5 Community Transfer learning model based on Ultralytics YOLOv5 using yolov5n.pt weights, supports RGB input at any resolution (square images only).	Community blocks
YOLOX for TI TDA4VM Community TI's EDGEAI YOLOX. https://github.com/TexasInstruments/edgeai-yolox. Outputs ONNX v7 model format both with and without final detect layers using PyTorch 1.7.1. See the implementation https://github.com/edgeimpulse/example-custom-ml-block-ti-yolox/tree/onnx-v7	Texas Instruments
NVIDIA TAO RetinaNet Enterprise Object detection model with superior performance on smaller objects. Configurable backbones optimized for targets from MCU to GPU. Supports rectangular input. Image width and height must be multiples of 32. Training requires GPU.	Edge Impulse Inc.
NVIDIA TAO YOLOV3 Enterprise Object detection model that is fast and accurate. Configurable backbones optimized for targets from MCU to GPU. Supports rectangular input. Image width and height must be multiples of 32. Training requires GPU.	Edge Impulse Inc.
NVIDIA TAO YOLOV4 Enterprise Object detection model that is fast and accurate. Configurable backbones optimized for targets from MCU to GPU. Supports rectangular input. Image width and height must be multiples of 32. Training requires GPU.	Edge Impulse Inc.
NVIDIA TAO SSD Enterprise Object detection model for general purpose use. Configurable backbones optimized for targets from MCU to GPU. Supports rectangular input. Image width and height must be multiples of 32. Training requires GPU.	Edge Impulse Inc.

These users will be notified over email when this job finishes.

	Name
	Mat Kelcey Edge Impulse staff

Neural Network settings

Training settings

Augmentation settings

Advanced training settings

Neural network architecture

Model

MobileNetV2 SSD FPN-Lite (320x320 only)
Officially supported

FOMO (Faster Objects, More Objects) MobileNetV2 0.1
Officially supported

FOMO (Faster Objects, More Objects) MobileNetV2 0.35
Officially supported

efficientdet-lite
Enterprise

YOLOv5 (yolov5n.pt)
Enterprise

YOLOv3
Enterprise

TI YOLOX
Enterprise

mat_akida_nan_testing
Enterprise

coco-cats
Enterprise

raul-edgeai-regnetx-800mf-fpn-bgr-lite
Enterprise

YOLOv5 for Renesas DRP-AI
Community

YOLOv5
Community

YOLOX for TI TDA4VM
Community

NVIDIA TAO RetinaNet
Enterprise

NVIDIA TAO YOLOV3
Enterprise

NVIDIA TAO YOLOV4
Enterprise

NVIDIA TAO SSD
Enterprise

Description

Last training performance (validation set)

F1 score

Loss

Confusion matrix (validation set)

Feature explorer (full training set)

Personal

Enterprise

Public

Private

( of remaining)

Select a version

Neural Network settings

Training settings

Augmentation settings

Advanced training settings

Neural network architecture

Model

Description

Last training performance (validation set)

F1 score

Loss

Confusion matrix (validation set)

Feature explorer (full training set)

Settings