Amazon SageMaker Data Wrangler

Learn to simplify data preparation with Amazon SageMaker Data Wrangler, using visual workflows to clean, transform, and analyze data for machine learning.

Amazon SageMaker Data Wrangler simplifies the data preparation phase of machine learning projects, turning a process that traditionally takes weeks into one that can take minutes. The service provides a visual and natural language interface for working with tabular, image, and text data. It streamlines feature engineering and data preparation, which makes it a good fit for both novices and experienced data scientists who want to speed up their workflow.

SageMaker Data Wrangler supports preparing petabyte-scale data without manual coding. We can launch processing jobs directly from the user interface, or integrate data preparation into machine learning workflows by exporting data to the SageMaker Feature Store or by connecting it to SageMaker Pipelines.
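To make "without manual coding" concrete, here is a minimal sketch of the kind of hand-written preparation code that a visual tool can replace. It is not the Data Wrangler API: the sample rows, the column names, and the helper functions impute_median, one_hot, and run_flow are all hypothetical and use only the Python standard library. It models a data preparation flow as an ordered list of steps applied to tabular rows.

```python
from statistics import median

# Hypothetical sample rows; the column names are illustrative only.
rows = [
    {"age": 34, "plan": "basic"},
    {"age": None, "plan": "premium"},
    {"age": 52, "plan": "basic"},
]


def impute_median(rows, column):
    """Fill missing values in a numeric column with the column median."""
    known = [r[column] for r in rows if r[column] is not None]
    fill = median(known)
    return [{**r, column: fill if r[column] is None else r[column]} for r in rows]


def one_hot(rows, column):
    """Replace a categorical column with one 0/1 indicator column per category."""
    categories = sorted({r[column] for r in rows})
    encoded = []
    for r in rows:
        new_row = {k: v for k, v in r.items() if k != column}
        for c in categories:
            new_row[f"{column}_{c}"] = int(r[column] == c)
        encoded.append(new_row)
    return encoded


def run_flow(rows, flow):
    """Apply each step in order, feeding the output of one into the next."""
    for step in flow:
        rows = step(rows)
    return rows


# Assumption: a "flow" is simply an ordered list of transformation steps.
flow = [
    lambda data: impute_median(data, "age"),
    lambda data: one_hot(data, "plan"),
]

prepared = run_flow(rows, flow)
```

Every new cleaning or encoding rule in this style means another function to write and test, which is the overhead a visual, step-based interface is meant to reduce.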

To illustrate the workflow from data ingestion to model training, consider the following diagram:
