This repository contains examples and related resources that show how to preprocess data, then train and serve models, using Amazon SageMaker with data fetched from Delta Lake.

Repository Structure

The repository contains the following resources:

The notebooks were tested on SageMaker Studio with the Python 3 (Data Science) kernel.

Disclaimer: The examples in this repository are for demo purposes only and are not meant to be used in production. In particular:

  • The solution is missing appropriate authorization/authentication tokens.
  • Transferring large datasets across the cloud can be a challenge, mainly from a cost perspective.
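
To make the cost point concrete, the following is a rough back-of-the-envelope sketch, not part of the repository's notebooks. The function name `estimate_transfer_cost` and the per-GB rate are hypothetical placeholders chosen for illustration; they are not actual cloud prices, so substitute the rates from your own provider's pricing page.

```python
def estimate_transfer_cost(dataset_gb: float, price_per_gb: float, reads: int = 1) -> float:
    """Estimate the cost of transferring a dataset `reads` times.

    `price_per_gb` is an assumed, caller-supplied rate (hypothetical here).
    """
    if dataset_gb < 0 or price_per_gb < 0 or reads < 0:
        raise ValueError("inputs must be non-negative")
    return dataset_gb * price_per_gb * reads


# Hypothetical figures for illustration only; not actual cloud pricing.
small = estimate_transfer_cost(dataset_gb=5, price_per_gb=0.05)
large = estimate_transfer_cost(dataset_gb=5000, price_per_gb=0.05, reads=10)
print(f"small dataset, one read: {small:.2f}")
print(f"large dataset, ten reads: {large:.2f}")
```

Because the cost scales linearly with both dataset size and the number of times the data is pulled, repeated training or preprocessing runs against a large remote table can add up quickly.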


For questions or feedback, please contact the maintainers or raise an issue on this repo.


This library is licensed under the MIT-0 License. See the LICENSE file.


