Model Layers

Chapter Goals:

  • Create the generalizable function for the MLP model’s layers

A. MLP architecture

Our project’s model follows the standard MLP architecture. This means that it is made up of multiple fully-connected layers, where each hidden layer uses ReLU activation and the final layer uses no activation. The input layer for the MLP consists of a batch of data observations from the input pipeline (more on this later).

Figure: The architecture of the MLP model used for sales predictions.

Larger models (i.e. more hidden layers and more nodes per layer) have the potential to make more accurate predictions, but they also take longer to train and are more prone to overfitting. It’s good to experiment with different model sizes so we can ultimately choose the best one. This is why we use an evaluation set in addition to the training set: to compare different model configurations and see which performs best on new data.

For our MLP model, we’ll start off with 2 hidden layers. The first hidden layer contains 200 nodes, while the second contains 100. This corresponds to passing the list [200, 100] when initializing a SalesModel object.
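For instance, creating a model with this configuration would look like the snippet below (the SalesModel class itself appears in the scaffold at the end of this chapter):

# Two hidden layers: 200 nodes in the first, 100 in the second
model = SalesModel([200, 100])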

Time to Code!

All code for this chapter goes in the model_layers function.

The first layer for the MLP is the input layer. This corresponds to the inputs argument of the function.

Set layer equal to inputs.

The SalesModel class is initialized with a hidden_layers argument. This is a list of integers, where the integer at index i represents the number of nodes in hidden layer i of the MLP.

Create a for loop that iterates through self.hidden_layers using a variable called num_nodes.

Each hidden layer of the MLP is a fully-connected layer with ReLU activation, using the previous layer’s output as the input.

Inside the for loop, set layer equal to a tf.keras.layers.Dense layer, constructed with num_nodes as its required argument and tf.nn.relu as the activation keyword argument, then applied to layer.
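Put together, these first steps could look like the following sketch (one possible implementation, not necessarily the exact solution):

layer = inputs
for num_nodes in self.hidden_layers:
    # Fully-connected layer with ReLU activation, using the
    # previous layer's output as its input
    layer = tf.keras.layers.Dense(num_nodes, activation=tf.nn.relu)(layer)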

The model’s predictions for the input batch of data observations are the output from the MLP’s final layer. The final layer has one node, since each sales prediction is a single number, and doesn’t use any activation function.

Outside the for loop, set batch_predictions equal to a tf.keras.layers.Dense layer, constructed with 1 as its required argument (and no activation), then applied to layer.

Return batch_predictions.
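Assuming the loop sketched above, the final steps might look like this:

# One output node and no activation: a single sales prediction
# per observation in the batch
batch_predictions = tf.keras.layers.Dense(1)(layer)
return batch_predictions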

class SalesModel(object):
    def __init__(self, hidden_layers):
        self.hidden_layers = hidden_layers

    def model_layers(self, inputs):
        # CODE HERE
        pass
