9 years ago · c539b46db8
--- a/inception/README.md
+++ b/inception/README.md
@@ -111,15 +111,12 @@ ready to train or evaluate with the ImageNet data set.
 
				 intensive task and depending on your compute setup may take several days or even
			
 
				 weeks.
			
 
				 
			
 
				-*Before proceeding* please read the [Convolutional Neural Networks]
			
 
				-(https://www.tensorflow.org/tutorials/deep_cnn/index.html) tutorial in
			
 
				-particular focus on [Training a Model Using Multiple GPU Cards]
			
 
				-(https://www.tensorflow.org/tutorials/deep_cnn/index.html#training-a-model-using-multiple-gpu-cards)
			
 
				-. The model training method is nearly identical to that described in the
			
 
				+*Before proceeding* please read the [Convolutional Neural Networks](https://www.tensorflow.org/tutorials/deep_cnn/index.html) tutorial; in
			
 
				+particular, focus on [Training a Model Using Multiple GPU Cards](https://www.tensorflow.org/tutorials/deep_cnn/index.html#launching_and_training_the_model_on_multiple_gpu_cards). The model training method is nearly identical to that described in the
			
 
				 CIFAR-10 multi-GPU model training. Briefly, the model training
			
 
				 
			
 
				-*   Places an individual model replica on each GPU. Split the batch across the
			
 
				-    GPUs.
			
 
				+*   Places an individual model replica on each GPU.
			
 
				+*   Splits the batch across the GPUs.
			
 
				 *   Updates model parameters synchronously by waiting for all GPUs to finish
			
 
				     processing a batch of data.
			
 
				 
			
@@ -245,11 +242,9 @@ We term each machine that maintains model parameters a `ps`, short for
 
				 `ps` as the model parameters may be sharded across multiple machines.
			
 
				 
			
 
				 Variables may be updated with synchronous or asynchronous gradient updates. One
			
 
				-may construct a an [`Optimizer`]
			
 
				-(https://www.tensorflow.org/api_docs/python/train.html#optimizers) in TensorFlow
			
 
				-that constructs the necessary graph for either case diagrammed below from
			
 
				-TensorFlow [Whitepaper]
			
 
				-(http://download.tensorflow.org/paper/whitepaper2015.pdf):
			
 
				+may construct a an [`Optimizer`](https://www.tensorflow.org/api_docs/python/train.html#optimizers) in TensorFlow
			
 
				+that constructs the necessary graph for either case diagrammed below from the
			
 
				+TensorFlow [Whitepaper](http://download.tensorflow.org/paper/whitepaper2015.pdf):
			
 
				 
			
 
				 <div style="width:40%; margin:auto; margin-bottom:10px; margin-top:20px;">
			
 
				   <img style="width:100%"
			
@@ -380,10 +375,8 @@ training Inception in a distributed manner.
 
				 Evaluating an Inception v3 model on the ImageNet 2012 validation data set
			
 
				 requires running a separate binary.
			
 
				 
			
 
				-The evaluation procedure is nearly identical to [Evaluating a Model]
			
 
				-(https://www.tensorflow.org/tutorials/deep_cnn/index.html#evaluating-a-model)
			
 
				-described in the [Convolutional Neural Network]
			
 
				-(https://www.tensorflow.org/tutorials/deep_cnn/index.html) tutorial.
			
 
				+The evaluation procedure is nearly identical to [Evaluating a Model](https://www.tensorflow.org/tutorials/deep_cnn/index.html#evaluating_a_model)
			
 
				+described in the [Convolutional Neural Network](https://www.tensorflow.org/tutorials/deep_cnn/index.html) tutorial.
			
 
				 
			
 
				 **WARNING** Be careful not to run the evaluation and training binary on the same
			
 
				 GPU or else you might run out of memory. Consider running the evaluation on a
			
@@ -438,8 +431,7 @@ daisy, dandelion, roses, sunflowers, tulips
 
				 There is a single automated script that downloads the data set and converts it
			
 
				 to the TFRecord format. Much like the ImageNet data set, each record in the
			
 
				 TFRecord format is a serialized `tf.Example` proto whose entries include a
			
 
				-JPEG-encoded string and an integer label. Please see [`parse_example_proto`]
			
 
				-(inception/image_processing.py) for details.
			
 
				+JPEG-encoded string and an integer label. Please see [`parse_example_proto`](inception/image_processing.py) for details.
			
 
				 
			
 
				 The script just takes a few minutes to run depending your network connection
			
 
				 speed for downloading and processing the images. Your hard disk requires 200MB
			
@@ -471,14 +463,12 @@ and `validation-?????-of-00002`, respectively.
 
				 **NOTE** If you wish to prepare a custom image data set for transfer learning,
			
 
				 you will need to invoke [`build_image_data.py`](inception/data/build_image_data.py) on
			
 
				 your custom data set. Please see the associated options and assumptions behind
			
 
				-this script by reading the comments section of [`build_image_data.py`]
			
 
				-(inception/data/build_image_data.py). Also, if your custom data has a different
			
 
				+this script by reading the comments section of [`build_image_data.py`](inception/data/build_image_data.py). Also, if your custom data has a different
			
 
				 number of examples or classes, you need to change the appropriate values in
			
 
				 [`imagenet_data.py`](inception/imagenet_data.py).
			
 
				 
			
 
				 The second piece you will need is a trained Inception v3 image model. You have
			
 
				-the option of either training one yourself (See [How to Train from Scratch]
			
 
				-(#how-to-train-from-scratch) for details) or you can download a pre-trained
			
 
				+the option of either training one yourself (See [How to Train from Scratch](#how-to-train-from-scratch) for details) or you can download a pre-trained
			
 
				 model like so:
			
 
				 
			
 
				 ```shell
			
@@ -806,8 +796,7 @@ comments in [`image_processing.py`](inception/image_processing.py) for more deta
 
				 #### The model runs out of CPU memory.
			
 
				 
			
 
				 In lieu of buying more CPU memory, an easy fix is to decrease
			
 
				-`--input_queue_memory_factor`. See [Adjusting Memory Demands]
			
 
				-(#adjusting-memory-demands).
			
 
				+`--input_queue_memory_factor`. See [Adjusting Memory Demands](#adjusting-memory-demands).
			
 
				 
			
 
				 #### The model runs out of GPU memory.
			
 
				 
			
--- a/inception/inception/data/build_image_data.py
+++ b/inception/inception/data/build_image_data.py
@@ -32,7 +32,7 @@ a sharded data set consisting of TFRecord files
 
				   train_directory/train-00000-of-01024
			
 
				   train_directory/train-00001-of-01024
			
 
				   ...
			
 
				-  train_directory/train-00127-of-01024
			
 
				+  train_directory/train-01023-of-01024
			
 
				 
			
 
				 and
			
 
				 
			
@@ -50,7 +50,7 @@ contains the following fields:
 
				   image/width: integer, image width in pixels
			
 
				   image/colorspace: string, specifying the colorspace, always 'RGB'
			
 
				   image/channels: integer, specifying the number of channels, always 3
			
 
				-  image/format: string, specifying the format, always'JPEG'
			
 
				+  image/format: string, specifying the format, always 'JPEG'
			
 
				 
			
 
				   image/filename: string containing the basename of the image file
			
 
				             e.g. 'n01440764_10026.JPEG' or 'ILSVRC2012_val_00000293.JPEG'
			
@@ -60,7 +60,7 @@ contains the following fields:
 
				   image/class/text: string specifying the human-readable version of the label
			
 
				     e.g. 'dog'
			
 
				 
			
 
				-If you data set involves bounding boxes, please look at build_imagenet_data.py.
			
 
				+If your data set involves bounding boxes, please look at build_imagenet_data.py.
			
 
				 """
			
 
				 from __future__ import absolute_import
			
 
				 from __future__ import division
			
@@ -72,7 +72,6 @@ import random
 
				 import sys
			
 
				 import threading
			
 
				 
			
 
				-
			
 
				 import numpy as np
			
 
				 import tensorflow as tf
			
 
				 
			
@@ -306,7 +305,7 @@ def _process_image_files(name, filenames, texts, labels, num_shards):
 
				   spacing = np.linspace(0, len(filenames), FLAGS.num_threads + 1).astype(np.int)
			
 
				   ranges = []
			
 
				   for i in range(len(spacing) - 1):
			
 
				-    ranges.append([spacing[i], spacing[i+1]])
			
 
				+    ranges.append([spacing[i], spacing[i + 1]])
			
 
				 
			
 
				   # Launch a thread for each batch.
			
 
				   print('Launching %d threads for spacings: %s' % (FLAGS.num_threads, ranges))
			
--- a/inception/inception/data/build_imagenet_data.py
+++ b/inception/inception/data/build_imagenet_data.py
@@ -36,7 +36,7 @@ a sharded data set consisting of 1024 and 128 TFRecord files, respectively.
 
				   train_directory/train-00000-of-01024
			
 
				   train_directory/train-00001-of-01024
			
 
				   ...
			
 
				-  train_directory/train-00127-of-01024
			
 
				+  train_directory/train-01023-of-01024
			
 
				 
			
 
				 and
			
 
				 
			
@@ -54,7 +54,7 @@ serialized Example proto. The Example proto contains the following fields:
 
				   image/width: integer, image width in pixels
			
 
				   image/colorspace: string, specifying the colorspace, always 'RGB'
			
 
				   image/channels: integer, specifying the number of channels, always 3
			
 
				-  image/format: string, specifying the format, always'JPEG'
			
 
				+  image/format: string, specifying the format, always 'JPEG'
			
 
				 
			
 
				   image/filename: string containing the basename of the image file
			
 
				             e.g. 'n01440764_10026.JPEG' or 'ILSVRC2012_val_00000293.JPEG'
			
@@ -80,7 +80,7 @@ serialized Example proto. The Example proto contains the following fields:
 
				 Note that the length of xmin is identical to the length of xmax, ymin and ymax
			
 
				 for each example.
			
 
				 
			
 
				-Running this script using 16 threads may take around ~2.5 hours on a HP Z420.
			
 
				+Running this script using 16 threads may take around ~2.5 hours on an HP Z420.
			
 
				 """
			
 
				 from __future__ import absolute_import
			
 
				 from __future__ import division
			
@@ -92,7 +92,6 @@ import random
 
				 import sys
			
 
				 import threading
			
 
				 
			
 
				-
			
 
				 import numpy as np
			
 
				 import tensorflow as tf
			
 
				 
			
@@ -435,7 +434,7 @@ def _process_image_files(name, filenames, synsets, labels, humans,
 
				   ranges = []
			
 
				   threads = []
			
 
				   for i in range(len(spacing) - 1):
			
 
				-    ranges.append([spacing[i], spacing[i+1]])
			
 
				+    ranges.append([spacing[i], spacing[i + 1]])
			
 
				 
			
 
				   # Launch a thread for each batch.
			
 
				   print('Launching %d threads for spacings: %s' % (FLAGS.num_threads, ranges))
			
--- a/inception/inception/data/download_and_preprocess_flowers.sh
+++ b/inception/inception/data/download_and_preprocess_flowers.sh
@@ -35,7 +35,7 @@
 
				 set -e
			
 
				 
			
 
				 if [ -z "$1" ]; then
			
 
				-  echo "usage download_and_preprocess_flowers.sh [data dir]"
			
 
				+  echo "Usage: download_and_preprocess_flowers.sh [data dir]"
			
 
				   exit
			
 
				 fi
			
 
				 
			
--- a/inception/inception/data/download_and_preprocess_flowers_mac.sh
+++ b/inception/inception/data/download_and_preprocess_flowers_mac.sh
@@ -35,7 +35,7 @@
 
				 set -e
			
 
				 
			
 
				 if [ -z "$1" ]; then
			
 
				-  echo "usage download_and_preprocess_flowers.sh [data dir]"
			
 
				+  echo "Usage: download_and_preprocess_flowers.sh [data dir]"
			
 
				   exit
			
 
				 fi
			
 
				 
			
--- a/inception/inception/data/download_and_preprocess_imagenet.sh
+++ b/inception/inception/data/download_and_preprocess_imagenet.sh
@@ -49,7 +49,7 @@
 
				 set -e
			
 
				 
			
 
				 if [ -z "$1" ]; then
			
 
				-  echo "usage download_and_preprocess_imagenet.sh [data dir]"
			
 
				+  echo "Usage: download_and_preprocess_imagenet.sh [data dir]"
			
 
				   exit
			
 
				 fi
			
 
				 
			
@@ -84,7 +84,7 @@ BOUNDING_BOX_FILE="${SCRATCH_DIR}/imagenet_2012_bounding_boxes.csv"
 
				 BOUNDING_BOX_DIR="${SCRATCH_DIR}bounding_boxes/"
			
 
				 
			
 
				 "${BOUNDING_BOX_SCRIPT}" "${BOUNDING_BOX_DIR}" "${LABELS_FILE}" \
			
 
				- | sort >"${BOUNDING_BOX_FILE}"
			
 
				+ | sort > "${BOUNDING_BOX_FILE}"
			
 
				 echo "Finished downloading and preprocessing the ImageNet data."
			
 
				 
			
 
				 # Build the TFRecords version of the ImageNet data.
			
--- a/inception/inception/data/download_imagenet.sh
+++ b/inception/inception/data/download_imagenet.sh
@@ -24,7 +24,7 @@
 
				 # downloading the raw images.
			
 
				 #
			
 
				 # usage:
			
 
				-#  ./download_imagenet.sh [dirname]
			
 
				+#  ./download_imagenet.sh [dir name] [synsets file]
			
 
				 set -e
			
 
				 
			
 
				 if [ "x$IMAGENET_ACCESS_KEY" == x -o "x$IMAGENET_USERNAME" == x ]; then