Pix2pix Photo-to-Street-Map Translation

Generate a street map from a satellite photo

Released in 2016, this model is an application of a powerful method for general-purpose image-to-image translation using conditional adversarial networks. The automatic learning of the loss function with the adversarial networks technique allows the same paradigm to generalize across a wide range of image translation tasks. The architecture enables an efficient aggregation of features of multiple scales through skip connections with concatenations. This particular model was trained to generate a street map from a satellite photo.

Number of layers: 56 | Parameter count: 54,419,459 | Trained size: 218 MB |

Training Set Information

A set of 1,096 training images from pix2pix.

Examples

Download Example Notebook

Open in Wolfram Cloud

Resource retrieval

Get the pre-trained net:

In[1]:=

Out[1]=

Basic usage

Obtain a satellite photo:

In[2]:=

Out[2]=

Use the net to draw the street map:

In[3]:=

Out[3]=

Evaluate accuracy

Overlap photo and prediction:

In[4]:=

Out[4]=

Obtain the actual street map:

In[5]:=

Out[5]=

Compare the generated street map with the actual street map:

In[6]:=

Out[6]=

Issues

More complex patterns are harder to render. Obtain a new photo and street map pair:

In[7]:=

{img, actualMap} = ImageResize[
GeoImage[GeoPosition[{41.940360, 12.533762}], #, GeoRange -> 200], {256, 256}] & /@ {"Satellite", "StreetMap"}

Out[7]=

Compare the prediction with the actual street map:

In[8]:=

Out[8]=

Net information

Inspect the number of parameters of all arrays in the net:

In[9]:=

$NetInformation[ NetModel["Pix2pix Photo-to-Street-Map Translation"], \ "ArraysElementCounts"]$

Out[9]=

Obtain the total number of parameters:

In[10]:=

$NetInformation[ NetModel["Pix2pix Photo-to-Street-Map Translation"], \ "ArraysTotalElementCount"]$

Out[10]=

Obtain the layer type counts:

In[11]:=

$NetInformation[ NetModel["Pix2pix Photo-to-Street-Map Translation"], \ "LayerTypeCounts"]$

Out[11]=

Display the summary graphic:

In[12]:=

$NetInformation[ NetModel["Pix2pix Photo-to-Street-Map Translation"], \ "SummaryGraphic"]$

Out[12]=

Export to MXNet

Export the net into a format that can be opened in MXNet:

In[13]:=

Out[13]=

Export also creates a net.params file containing parameters:

In[14]:=

Out[14]=

Get the size of the parameter file:

In[15]:=

Out[15]=

The size is similar to the byte count of the resource object:

In[16]:=

Out[16]=

Represent the MXNet net as a graph:

In[17]:=

Out[17]=

Construction Notebook

Download Construction Notebook

Open in Wolfram Cloud

Requirements

Wolfram Language 11.2 (September 2017) or above

Resource History

Date Created: 12 October 2017
Latest Update: 21 June 2018

Reference

P. Isola, J. Zhu, T. Zhou, A. A. Efros, "Image-to-Image Translation with Conditional Adversarial Networks," arXiv:1611.07004 (2016)
Available from: https://github.com/phillipi/pix2pix
Rights: BSD 2-Clause "Simplified" License