ResNet-101 Trained on YFCC100m Geotagged Data

Determine the geolocation of a photograph

Released in 2017, this geolocation model classifies the location in which a photo was taken among more than 15,000 predefined locations around the world. The classes correspond to cells extracted from Google's S2 Geometry library.

Number of layers: 344 | Parameter count: 74,405,235 | Trained size: 299 MB |

Training Set Information

Yahoo Flickr Creative Commons 100 Million Dataset (YFCC100M), containing a total of one hundred million media objects, of which approximately 99.2 million are photos and 0.8 million are videos.

Performance

This model correctly localized 82.2% of the IM2GPS test set within 2,500 kilometers.

Examples

Download Example Notebook

Open in Wolfram Cloud

Resource retrieval

Get the pre-trained net:

In[1]:=

Out[1]=

Basic Usage

Obtain an estimate of the latitude and longitude of where a photo was taken:

In[2]:=

(* Evaluate this cell to get the example input *) CloudGet["https://www.wolframcloud.com/obj/329374fa-fed7-4929-a905-ef70e26821fd"]

Out[2]=

Show a map of the area corresponding to the position:

In[3]:=

Out[3]=

Mark the position on a world map:

In[4]:=

Out[4]=

Multiple Predictions

The net returns a probability distribution over all available locations. Obtain the 50 most probable locations for a given image and plot these locations on the world map, with the size of the location marker proportional to the probability:

In[5]:=

(* Evaluate this cell to get the example input *) CloudGet["https://www.wolframcloud.com/obj/ac19a0ac-ac9e-416d-8aef-d01548a14105"]

Out[5]=

Fine Scale Predictions

In places with high population density, very fine-grained predictions are possible. Consider the following four landmarks in Paris:

In[6]:=

landmarks = EntityValue[{Entity["Building", "EiffelTower::5h9w8"], Entity["Building", "TheLouvre::vqy3g"], Entity["Building", "NotreDameCathedral::95fcw"], Entity["Building", "ArcDeTriomphe::92x88"]}, "Image", "EntityAssociation"]

Out[6]=

Predict the locations of the four landmarks and mark the locations on the map:

In[7]:=

GeoListPlot[
MapThread[
GeoMarker[#1, #2, "Scale" -> 0.01] &, {NetModel[
"ResNet-101 Trained on YFCC100m Geotagged Data"][
Values[landmarks]], Values[landmarks]}], GeoRange -> Quantity[1.5, "Miles"]]

Out[7]=

Compare with the actual locations:

In[8]:=

GeoListPlot[
Map[GeoMarker[EntityValue[#, "Position"], EntityValue[#, "Image"], "Scale" -> 0.01] &, Keys[landmarks]], GeoRange -> Quantity[1.5, "Miles"]]

Out[8]=

Region Density

Inspect the distribution of the available positions. Display a heat map of the location density on the map:

In[9]:=

GeoHistogram[
NetExtract[NetModel["ResNet-101 Trained on YFCC100m Geotagged Data"],
"Output"][["Labels"]], 50, PlotStyle -> Opacity[0.4], ImageSize -> Large]

Out[9]=

Net information

Inspect the number of parameters of all arrays in the net:

In[10]:=

$NetInformation[ NetModel["ResNet-101 Trained on YFCC100m Geotagged Data"], \ "ArraysElementCounts"]$

Out[10]=

Obtain the total number of parameters:

In[11]:=

$NetInformation[ NetModel["ResNet-101 Trained on YFCC100m Geotagged Data"], \ "ArraysTotalElementCount"]$

Out[11]=

Obtain the layer type counts:

In[12]:=

$NetInformation[ NetModel["ResNet-101 Trained on YFCC100m Geotagged Data"], \ "LayerTypeCounts"]$

Out[12]=

Display the summary graphic:

In[13]:=

$NetInformation[ NetModel["ResNet-101 Trained on YFCC100m Geotagged Data"], \ "SummaryGraphic"]$

Out[13]=

Export to MXNet

Export the net into a format that can be opened in MXNet:

In[14]:=

Out[14]=

Export also creates a net.params file containing parameters:

In[15]:=

Out[15]=

Get the size of the parameter file:

In[16]:=

Out[16]=

The size is similar to the byte count of the resource object:

In[17]:=

Out[17]=

Construction Notebook

Download Construction Notebook

Open in Wolfram Cloud

Requirements

Wolfram Language 11.2 (September 2017) or above

External Links

https://aws.amazon.com/blogs/machine-learning/estimating-the-location-of-images-using-mxnet-and-multimedia-commons-dataset-on-aws-ec2

Resource History

Date Created: 3 July 2017
Latest Update: 11 April 2019

Reference

Available from: https://github.com/multimedia-berkeley/tutorials
Rights: Creative Commons Attribution-ShareAlike 3.0 Unported (CC BY-SA 3.0)