No Description

jaybdub-nv c07ceb28fb Update README.md		8 years ago
data	0aced03dc4 initial commit	8 years ago
examples	0aced03dc4 initial commit	8 years ago
scripts	0aced03dc4 initial commit	8 years ago
src	0aced03dc4 initial commit	8 years ago
third_party	4a7c3565e7 added tf models submodule	8 years ago
.gitmodules	4a7c3565e7 added tf models submodule	8 years ago
CMakeLists.txt	0aced03dc4 initial commit	8 years ago
INSTALL.md	7d79bc6aa2 Update INSTALL.md	8 years ago
LICENSE.md	0aced03dc4 initial commit	8 years ago
README.md	c07ceb28fb Update README.md	8 years ago

TensorFlow->TensorRT Image Classification

This contains examples, scripts and code related to image classification using TensorFlow models (from here) converted to TensorRT. Converting TensorFlow models to TensorRT offers significant performance gains on the Jetson TX2 as seen below.

Model overview
Download models and create frozen graphs
Convert frozen graph to TensorRT engine
Execute TensorRT engine
Benchmark all models

Models

The table below shows various details related to pretrained models ported from the TensorFlow slim model zoo.

_Model	_{Input Size}	_{TensorRT (TX2 / Half)}	_{TensorRT (TX2 / Float)}	_{TensorFlow (TX2 / Float)}	_{Input Name}	_{Output Name}	_{Preprocessing Fn.}
_{inception_v1}	_224x224	_7.98ms	_12.8ms	_27.6ms	_input	_{InceptionV1/Logits/SpatialSqueeze}	_inception
_{inception_v3}	_299x299	_26.3ms	_46.1ms	_98.4ms	_input	_{InceptionV3/Logits/SpatialSqueeze}	_inception
_{inception_v4}	_299x299	_52.1ms	_88.2ms	_176ms	_input	_{InceptionV4/Logits/Logits/BiasAdd}	_inception
_{inception_resnet_v2}	_299x299	_53.0ms	_98.7ms	_168ms	_input	_{InceptionResnetV2/Logits/Logits/BiasAdd}	_inception
_{resnet_v1_50}	_224x224	_15.7ms	_27.1ms	_63.9ms	_input	_{resnet_v1_50/SpatialSqueeze}	_vgg
_{resnet_v1_101}	_224x224	_29.9ms	_51.8ms	_107ms	_input	_{resnet_v1_101/SpatialSqueeze}	_vgg
_{resnet_v1_152}	_224x224	_42.6ms	_78.2ms	_157ms	_input	_{resnet_v1_152/SpatialSqueeze}	_vgg
_{resnet_v2_50}	_299x299	_27.5ms	_44.4ms	_92.2ms	_input	_{resnet_v2_50/SpatialSqueeze}	_inception
_{resnet_v2_101}	_299x299	_49.2ms	_83.1ms	_160ms	_input	_{resnet_v2_101/SpatialSqueeze}	_inception
_{resnet_v2_152}	_299x299	_74.6ms	_124ms	_230ms	_input	_{resnet_v2_152/SpatialSqueeze}	_inception
_{mobilenet_v1_0p25_128}	_128x128	_2.67ms	_2.65ms	_15.7ms	_input	_{MobilenetV1/Logits/SpatialSqueeze}	_inception
_{mobilenet_v1_0p5_160}	_160x160	_3.95ms	_4.00ms	_16.9ms	_input	_{MobilenetV1/Logits/SpatialSqueeze}	_inception
_{mobilenet_v1_1p0_224}	_224x224	_12.9ms	_12.9ms	_24.4ms	_input	_{MobilenetV1/Logits/SpatialSqueeze}	_inception
_{vgg_16}	_224x224	_38.2ms	_79.2ms	_171ms	_input	_{vgg_16/fc8/BiasAdd}	_vgg

The times recorded include data transfer to GPU, network execution, and data transfer back from GPU. Time does not include preprocessing. See scripts/test_tf.py, scripts/test_trt.py, and src/test/test_trt.cu for implementation details. To reproduce the timings run

python scripts/test_tf.py
python scripts/test_trt.py

The timing results will be located in data/test_output_tf.txt and data/test_output_trt.txt. Note that you must download and convert the models (as in the quick start) prior to running the benchmark scripts.

Download models and create frozen graphs

Run the following bash script to download all of the pretrained models.

source scripts/download_models.sh

If there are any models you don't want to use, simply remove their URL and name from the model lists in scripts/download_models.sh.

Next, because the TensorFlow models are provided in checkpoint format, we must convert them to frozen graphs for optimization with TensorRT. Run the scripts/models_to_frozen_graphs.py script.

python scripts/models_to_frozen_graphs.py

If you removed any models in the previous step, you must add 'exclude': true to the corresponding item in the NETS dictionary located in scripts/model_meta.py.

Convert frozen graph to TensorRT engine

Execute TensorRT engine

./build/examples/classify_image/classify_image data/images/gordon_setter.jpg data/plans/inception_v1.plan data/imagenet_labels_1001.txt input InceptionV1/Logits/SpatialSqueeze inception

Benchmark all models

python scripts/frozen_graphs_to_plans.py

README.md