TensorFlow->TensorRT Image Classification
===

This repository contains examples, scripts, and code for image classification using TensorFlow models (from [here](https://github.com/tensorflow/models/tree/master/research/slim#Pretrained)) converted to TensorRT. Converting TensorFlow models to TensorRT offers significant performance gains on the Jetson TX2, as shown [below](#default-models).

## Quick Start

1. Follow the [installation guide](INSTALL.md).

2. Download the pretrained TensorFlow models and example images.

   ```
   source scripts/download_models.sh
   source scripts/download_images.sh
   ```

3. Convert the pretrained models to frozen graphs.

   ```
   python scripts/models_to_frozen_graphs.py
   ```

4. Convert the frozen graphs to optimized TensorRT engines.

   ```
   python scripts/frozen_graphs_to_plans.py
   ```

5. Execute the Inception V1 model on a single image.

   ```
   ./build/examples/classify_image/classify_image data/images/gordon_setter.jpg data/plans/inception_v1.plan data/imagenet_labels_1001.txt input InceptionV1/Logits/SpatialSqueeze inception
   ```

For more details, see the [examples](examples/README.md). Rough sketches of the conversion and benchmarking steps also appear at the end of this README.

## Default Models

The table below lists benchmark timings and conversion parameters for the default models ported from the TensorFlow slim model zoo.

| Model | Input Size | TensorRT FP16 (TX2) | TensorRT FP32 (TX2) | TensorFlow FP32 (TX2) | Input Name | Output Name | Preprocessing Fn. |
|--- |:---:|:---:|:---:|:---:|---|---|---|
| inception_v1 | 224x224 | 7.98ms | 12.8ms | 27.6ms | input | InceptionV1/Logits/SpatialSqueeze | inception |
| inception_v3 | 299x299 | 26.3ms | 46.1ms | 98.4ms | input | InceptionV3/Logits/SpatialSqueeze | inception |
| inception_v4 | 299x299 | 52.1ms | 88.2ms | 176ms | input | InceptionV4/Logits/Logits/BiasAdd | inception |
| inception_resnet_v2 | 299x299 | 53.0ms | 98.7ms | 168ms | input | InceptionResnetV2/Logits/Logits/BiasAdd | inception |
| resnet_v1_50 | 224x224 | 15.7ms | 27.1ms | 63.9ms | input | resnet_v1_50/SpatialSqueeze | vgg |
| resnet_v1_101 | 224x224 | 29.9ms | 51.8ms | 107ms | input | resnet_v1_101/SpatialSqueeze | vgg |
| resnet_v1_152 | 224x224 | 42.6ms | 78.2ms | 157ms | input | resnet_v1_152/SpatialSqueeze | vgg |
| resnet_v2_50 | 299x299 | 27.5ms | 44.4ms | 92.2ms | input | resnet_v2_50/SpatialSqueeze | inception |
| resnet_v2_101 | 299x299 | 49.2ms | 83.1ms | 160ms | input | resnet_v2_101/SpatialSqueeze | inception |
| resnet_v2_152 | 299x299 | 74.6ms | 124ms | 230ms | input | resnet_v2_152/SpatialSqueeze | inception |
| mobilenet_v1_0p25_128 | 128x128 | 2.67ms | 2.65ms | 15.7ms | input | MobilenetV1/Logits/SpatialSqueeze | inception |
| mobilenet_v1_0p5_160 | 160x160 | 3.95ms | 4.00ms | 16.9ms | input | MobilenetV1/Logits/SpatialSqueeze | inception |
| mobilenet_v1_1p0_224 | 224x224 | 12.9ms | 12.9ms | 24.4ms | input | MobilenetV1/Logits/SpatialSqueeze | inception |
| vgg_16 | 224x224 | 38.2ms | 79.2ms | 171ms | input | vgg_16/fc8/BiasAdd | vgg |

The recorded times include data transfer to the GPU, network execution, and data transfer back from the GPU; they do not include preprocessing. See **scripts/test_tf.py**, **scripts/test_trt.py**, and **src/test/test_trt.cu** for implementation details. To reproduce the timings, run

```
python scripts/test_tf.py
python scripts/test_trt.py
```

The timing results will be located in **data/test_output_tf.txt** and **data/test_output_trt.txt**. Note that you must download and convert the models (as in the quick start) prior to running the benchmark scripts.
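
## Sketches

To give a concrete sense of what step 3 of the quick start (`scripts/models_to_frozen_graphs.py`) does for a single network, below is a minimal sketch that freezes the TF-Slim Inception V1 graph into a standalone `.pb`. It assumes TensorFlow 1.x with the slim `nets` package on the `PYTHONPATH`; the checkpoint and output paths are illustrative, and the repository's script additionally iterates over every model in the table.

```python
# Minimal freezing sketch (assumed paths; TF 1.x + TF-Slim "nets" package).
import tensorflow as tf
from tensorflow.python.framework import graph_util
from nets.inception import inception_v1, inception_v1_arg_scope

slim = tf.contrib.slim

with tf.Graph().as_default() as graph:
    # Placeholder named "input" to match the table's Input Name column.
    images = tf.placeholder(tf.float32, [None, 224, 224, 3], name='input')
    with slim.arg_scope(inception_v1_arg_scope()):
        logits, _ = inception_v1(images, num_classes=1001, is_training=False)

    with tf.Session(graph=graph) as sess:
        # Restore pretrained weights (checkpoint path is an assumption).
        tf.train.Saver().restore(sess, 'data/checkpoints/inception_v1.ckpt')
        # Bake variables into constants so the graph is self-contained.
        frozen = graph_util.convert_variables_to_constants(
            sess, graph.as_graph_def(),
            ['InceptionV1/Logits/SpatialSqueeze'])  # Output Name from the table

with tf.gfile.GFile('data/frozen_graphs/inception_v1.pb', 'wb') as f:
    f.write(frozen.SerializeToString())
```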
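
The TensorFlow column of the table is produced by `scripts/test_tf.py`; a condensed version of that style of measurement might look like the following. TF 1.x is again assumed, the run count is arbitrary, and a random tensor stands in for a preprocessed image (the benchmarks exclude preprocessing anyway).

```python
# Latency sketch for a frozen graph (TF 1.x; node names from the table's
# inception_v1 row; 50 runs is an arbitrary choice).
import time
import numpy as np
import tensorflow as tf

with tf.gfile.GFile('data/frozen_graphs/inception_v1.pb', 'rb') as f:
    graph_def = tf.GraphDef()
    graph_def.ParseFromString(f.read())

with tf.Graph().as_default() as graph:
    tf.import_graph_def(graph_def, name='')
    x = graph.get_tensor_by_name('input:0')
    y = graph.get_tensor_by_name('InceptionV1/Logits/SpatialSqueeze:0')

# Random stand-in for a preprocessed 224x224 RGB image.
image = np.random.rand(1, 224, 224, 3).astype(np.float32)

with tf.Session(graph=graph) as sess:
    sess.run(y, feed_dict={x: image})  # warm-up, excluded from timing
    start = time.time()
    for _ in range(50):
        sess.run(y, feed_dict={x: image})
    print('average latency: %.1f ms' % ((time.time() - start) / 50 * 1e3))
```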
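
The repository runs TensorRT plans from C++ (see `examples/classify_image`), but for orientation, here is a comparable flow sketched with the TensorRT Python API. This is an assumption-heavy illustration: it requires a TensorRT release that ships Python bindings plus `pycuda`, and a `.plan` file can only be deserialized by the same TensorRT version that serialized it.

```python
# Hedged sketch: deserialize a TensorRT plan and run one inference.
# Assumes a TensorRT version with Python bindings (execute_v2 API) and pycuda;
# binding order (input first, output second) and shapes follow the
# inception_v1 row of the table.
import numpy as np
import pycuda.autoinit  # noqa: F401  (creates a CUDA context)
import pycuda.driver as cuda
import tensorrt as trt

logger = trt.Logger(trt.Logger.WARNING)
with open('data/plans/inception_v1.plan', 'rb') as f, trt.Runtime(logger) as runtime:
    engine = runtime.deserialize_cuda_engine(f.read())

context = engine.create_execution_context()

# Host buffers: a random stand-in for a preprocessed CHW image, and the
# 1001-class logits output.
h_input = np.random.rand(1, 3, 224, 224).astype(np.float32)
h_output = np.empty((1, 1001), dtype=np.float32)
d_input = cuda.mem_alloc(h_input.nbytes)
d_output = cuda.mem_alloc(h_output.nbytes)

cuda.memcpy_htod(d_input, h_input)
context.execute_v2([int(d_input), int(d_output)])
cuda.memcpy_dtoh(h_output, d_output)
print('top-1 class index:', int(np.argmax(h_output)))
```

Note that the preprocessing function listed in the table (`inception` or `vgg`) still has to be applied to a real image before the copy to the GPU.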