Multi-scale Context Aggregation Net
Trained on
CamVid Data
Released in 2016, this is the first model featuring a systematic use of dilated convolutions for pixel-wise classification. A context aggregation module featuring convolutions with exponentially increasing dilations is appended to a VGG-style front end.
Number of layers: 53 |
Parameter count: 134,313,443 |
Trained size: 537 MB |
Examples
Resource retrieval
Get the pre-trained net:
Evaluation function
Write an evaluation function to handle padding and tiling of the input image:
Label list
Define the label list for this model. Integers in the model’s output correspond to elements in the label list:
Basic usage
Obtain a segmentation mask for a given image:
Inspect which classes are detected:
Visualize the mask:
Advanced visualization
Associate classes to colors:
Write a function to overlap the image and the mask with a legend:
Inspect the results:
Net information
Inspect the number of parameters of all arrays in the net:
Obtain the total number of parameters:
Obtain the layer type counts:
Display the summary graphic:
Export to MXNet
Export the net into a format that can be opened in MXNet:
Export also creates a net.params file containing parameters:
Get the size of the parameter file:
The size is similar to the byte count of the resource object:
Represent the MXNet net as a graph:
Requirements
Wolfram Language
11.3
(March 2018)
or above
Resource History
Reference