SageMaker Capabilities

Do’s and Don’ts

A. Don’t optimizes too many hyperparameters

B. Limit ranges to as small as possible

C. Use logarithmic scales when possible

How to

Connect Sagemaker notebook to remote EMR cluster running spark -> get generated DataFrames -> Call SageMakerEstimator Fit to generate a model-> call SageMaker transform to make inferences

Create and share Jupyter notebook with SageMaker Studio

Switch between hardware configuration

SageMaker Experiments —Organize, capture, compare, and search ML jobs

Saves gradients, tensors over time during training

Define rule to capture unwanted conditions. each rule create a debugger job

Sends logs to CloudWatch, where events can be created for further actions

SageMaker Debugger Insight dashboard,

auto generated training reports and built-in rules like monitoring system bottlenecks(CPU, GPU Memory), profile model framework metrics ( Max Initialization Time, Overall Framework metrics, Step outlier, system usage), debug model parameter

Built-in actions like StopTraining(), Email(), or SMS(). Integration with SNS

Supported framework — TensorFlow, PyTorch, MXNet, XGBoost, SageMaker generic estimator

Debugger APIs for further integration. ( construct hooks and rules). SMDebug client library

Load Data to S3 -> select target column for prediction-> Autopilot creates model leaderboard with model recommendation-> pick a model-> autopilot creates the model in notebook where it can be tweaked

Problem types —binary or multiclass classification and Regression

Algorith types — Linear regression, XGBoost, Deep Learning (MLP)

Data files must be tabular CSV

Integrate with SageMaker Clarify to identify biases and provide transparency how model arrive at a result (assign each feature an importance value for a prediction)

Get alerts on quality deviations on deployed models via CloudWatch

Visualize data quality drift based on Normalized Discounter Cumulative Gain (NDCG), Bias drift

Detect anomalies and outliers

Detect new feature arriving in new data

Create monitoring job via Monitoring Schedule

Integrate with TensorBoard, QuickSight, Tableau

Integrate with Ground Truth



Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store