Inference Decoding
Understand the difference between training and inference decoding.
Chapter Goals:
- Learn how to perform decoding for inference
- Learn about variable scopes for declaring and retrieving variables
A. Decoding without ground truth
During training, we have access to ground truth tokens, which are used as input for the decoder. However, for inference (i.e. generating output in real-time), we only have access to an input sequence. In this case, the input for the decoder at each time is just the decoder's output token from the previous time step.
In TensorFlow, we can switch from training to inference mode just by changing the TrainingSampler
object to an inference helper. The most commonly used inference helper object is GreedyEmbeddingSampler
.
B. Greedy decoding
The name of GreedyEmbeddingSampler
comes from the fact that it utilizes a greedy decoding method and incorporates an embedding matrix.
The greedy decoding method means that the output at each decoder time step (also the input into the next time step) is just the highest probability vocabulary word at the time step. The output is converted into a word embedding prior to being used as input for the next time step.
Below we create a GreedyEmbeddingSampler
with a vocabulary size of 598 as first argument and second argument hidden size which is half of the vocabulary size.
import tensorflow as tfvocab_size = 598sos = vocab_sizeeos = vocab_size + 1extended_vocab_size = vocab_size + 2# Placeholder representing the embedding matrix for the vocab# Embedding dim is 10embedding_matrix = tf.compat.v1.placeholder(tf.float32, shape=(600, 10))batch_size = 8start_tokens = tf.tile([sos], [batch_size])#Greedy Embedding Samplerembedding_layer = tf.keras.layers.Embedding(vocab_size, int(vocab_size/2))sampler = tfa.seq2seq.sampler.GreedyEmbeddingSampler(embedding_layer)print(sampler)
The GreedyEmbeddingSampler
is initialized with an embedding matrix.
C. Embedding variable scope
When using feature columns to create embeddings (e.g. the get_embeddings
function on line 35 in the code), we don't have direct access to the embedding matrix variable. This means we need to create an embedding matrix variable with the exact same name as the one used during training. To do this, we need to create the variable in the appropriate variable scope.
A variable scope is similar to directory paths in an operating system. We may know the name of the file (in this case the variable), but to retrieve it we have to be in the proper directory (the variable scope). We can also create a variable under a certain variable scope similar to how we would create files in certain directories.
We specify variable scopes in TensorFlow using the tf.compat.v1.variable_scope
function. You'll notice that tf.compat.v1.variable_scope
is used in the get_embeddings
function.
import tensorflow as tfwith tf.compat.v1.variable_scope('scope1'):v1 = tf.compat.v1.get_variable('var', shape=(2, 2))with tf.compat.v1.variable_scope('scope2'):v2 = tf.compat.v1.get_variable('var', shape=(2, 2))print(v2)
The above code block creates variables named 'var'
inside scopes 'scope1'
and scope2'
. Despite the shared name, these variables are different, since the scopes differ.
We can also use nested scopes to create a variable. A nested scope is similar to putting a file in subdirectories of outer directories.
import tensorflow as tfwith tf.compat.v1.variable_scope('s'):with tf.compat.v1.variable_scope('sub1'):v1 = tf.compat.v1.get_variable('v', shape=(2, 2))with tf.compat.v1.variable_scope('sub2'):v2 = tf.compat.v1.get_variable('v', shape=(2, 2))print(v1)print(v2)
Mimicking a nested scope using a single scope is also possible. We just need to use the appropriate "path" when specifying the scope name.
import tensorflow as tfwith tf.compat.v1.variable_scope('s/sub1'):v1 = tf.compat.v1.get_variable('v', shape=(2, 2))print(v1)
The tf.compat.v1.variable_scope
function also has a useful reuse
keyword argument. While it's not used in the code for this course, it sets the argument with value tf.compat.v1.AUTO_REUSE
, in order to re-use previously created variables.
We don't use it in the encoder-decoder model because the training and inference portions of the code are separate (i.e. we don't create the same embedding matrix twice in one model run).
D. Inference Decoding Translation Example
The code widget below uses inference decoding to translate English sentences into Urdu. Type a sentence into the text box and it will print out the Urdu translation.translate()
Enter the input below to be saved in file __ed_input.txt
Time to Code!
In this chapter you'll be completing the create_decoder_sampler
function from the Training Sampler chapter. This time, you'll be creating the Sampler object for inference.
We’ve already filled in part of the code to obtain the decoder’s embedding matrix and initialize the start and end tokens. Your job now is to create the GreedyEmbeddingSampler
object for inference decoding.
Inside the else
block of the create_decoder_sampler
function, First create embedding_matrix
by using tf.keras.layers.Embedding
applied with vocab_size
and hidden_size
(which is half of vocab_size) as input the arguments.
set sampler
equal to tfa.seq2seq.sampler.GreedyEmbeddingSampler
applied with embedding_matrix
as the input arguments.
The create_decoder_sampler
function returns a tuple containing the sampler object and ground truth sequence lengths. However, in inference mode there are no ground truth sequences. In this case, we just set dec_seq_lens
equal to None
.
Inside the else
block, set dec_seq_lens
equal to None
.
import tensorflow as tfimport tensorflow_addons as tfa# Seq2seq modelclass Seq2SeqModel(object):def __init__(self, vocab_size, num_lstm_layers, num_lstm_units):self.vocab_size = vocab_size# Extended vocabulary includes start, stop tokenself.extended_vocab_size = vocab_size + 2self.num_lstm_layers = num_lstm_layersself.num_lstm_units = num_lstm_unitsself.tokenizer = tf.keras.preprocessing.text.Tokenizer(num_words=vocab_size)# create Greedy Samplerdef create_decoder_sampler(self,decoder_inputs, is_training, batch_size):if is_training:#Already done this in Previous lessonspasselse:#CODE HEREpassreturn sampler, dec_seq_lens
Get hands-on with 1300+ tech skills courses.