What is ML Insights

ML Insights is a Python library for data scientists, ML engineers, and developers. Insights can be used to ingest data in different formats, apply row-based transformations, and monitor data and ML models from validation to production.

The ML Insights library also provides several ways to process and evaluate data and ML models. The options include a low-code alternative for customisation, a pre-built application, and further extensibility through custom applications and custom components.

Quick Start

For a quick introduction to the ML Insights library, follow the links below:

Installation and Setup

ML Insights API in 10 minutes

Low Code Setup with Config Reader

How it works

ML Insights helps evaluate and monitor data and ML models across the entire ML observability lifecycle.

Insights is component-based: each component has a specific responsibility, and a workflow manages the individual components.

Insights provides components to carry out tasks like data ingestion, row-level data transformation, metric calculation, and post-processing of metric output. More details on these are covered in the Getting Started section.

In very simple terms, the user provides the location of the input data set to be processed, selects any additional simple transformations needed on the input data (for example, converting an unstructured column to a structured one), and decides which metrics should be calculated for which features (columns of data). The user can also define post-actions to be performed once all the metrics have been calculated.
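The four steps above (ingest, transform, compute metrics, post-action) can be sketched in plain Python. This is only a conceptual illustration built on the standard library; it does not use the ML Insights API, and every name in it (`RAW_CSV`, `ingest`, `flatten_payload`, `compute_metrics`, `post_action`) is a hypothetical helper invented for this example.

```python
import csv
import io
import json
from statistics import mean

# Hypothetical sample input. In practice this would be a file location.
RAW_CSV = '''age,payload
34,"{""city"": ""Pune""}"
51,"{""city"": ""Oslo""}"
,"{""city"": ""Lima""}"
'''


def ingest(source):
    """Step 1: read rows from the input data set."""
    return list(csv.DictReader(source))


def flatten_payload(row):
    """Step 2: row-level transformation that turns an unstructured
    JSON string column into a structured 'city' column."""
    out = dict(row)
    out["city"] = json.loads(out.pop("payload"))["city"]
    return out


def compute_metrics(rows):
    """Step 3: calculate the metrics chosen for each feature."""
    ages = [float(r["age"]) for r in rows if r["age"] != ""]
    return {
        "age": {
            "count": len(ages),
            "missing": len(rows) - len(ages),
            "mean": mean(ages),
        },
        "city": {"distinct": len({r["city"] for r in rows})},
    }


def post_action(metrics):
    """Step 4: post-action run once all metrics are available;
    here the result is simply serialised to JSON."""
    return json.dumps(metrics, sort_keys=True)


rows = [flatten_payload(r) for r in ingest(io.StringIO(RAW_CSV))]
metrics = compute_metrics(rows)
report = post_action(metrics)
print(report)
```

In Insights itself, each of these steps is handled by a dedicated component and coordinated by a workflow rather than by hand-written functions.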

Insights provides a simple, declarative API and out-of-the-box components that cover the majority of common use cases. Insights also lets users author JSON-based configurations that define and customise all of its core features.
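To give a feel for what a declarative, JSON-based setup can look like, here is a small sketch that loads and sanity-checks a configuration. The layout and key names (`input`, `engine`, `transformations`, `metrics`, `post_actions`) are assumptions made up for this illustration and are not the actual ML Insights configuration schema; refer to the Low Code Setup with Config Reader guide for the real format.

```python
import json

# Hypothetical configuration for illustration only. The key names are
# NOT taken from the ML Insights schema.
CONFIG_TEXT = '''
{
  "input": {"location": "data/input.csv", "format": "csv"},
  "engine": "pandas",
  "transformations": [{"column": "payload", "action": "flatten_json"}],
  "metrics": {"age": ["count", "mean"], "city": ["distinct_count"]},
  "post_actions": ["write_report"]
}
'''

REQUIRED_KEYS = {"input", "metrics"}
SUPPORTED_FORMATS = {"csv", "json", "jsonl"}


def load_config(text):
    """Parse a JSON configuration and check its basic shape."""
    config = json.loads(text)
    missing = REQUIRED_KEYS - config.keys()
    if missing:
        raise ValueError(f"missing required keys: {sorted(missing)}")
    fmt = config["input"].get("format")
    if fmt not in SUPPORTED_FORMATS:
        raise ValueError(f"unsupported input format: {fmt!r}")
    return config


config = load_config(CONFIG_TEXT)
for feature, metric_names in config["metrics"].items():
    print(feature, "->", ", ".join(metric_names))
```

The point of the declarative style is that changing which metrics run on which features means editing data, not code.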

  • Insights currently supports the CSV, JSON, and JSONL data formats.

  • It also supports major execution engines like Native Pandas, Dask, and Spark.

  • Insights provides metrics in different groups, such as:

    • Data Integrity

    • Data Quality/Summary

    • Feature and Prediction Drift Detection

    • Model Performance for both classification and regression models

  • Insights supports integrations for writing metric data and for connecting to the OCI Monitoring service.

  • The Insights Tests/Test Suite feature enables comprehensive validation of a customer’s machine learning models and data.
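As a rough illustration of what a feature drift metric measures, the sketch below computes the Population Stability Index (PSI), one commonly used drift statistic, between a baseline sample and a current sample. This is an assumption chosen for the example: the text above does not say which drift metrics Insights implements, and the function `psi` and its parameters are hypothetical, not part of the library.

```python
import math


def psi(baseline, current, bins=4, eps=1e-6):
    """Population Stability Index between two numeric samples.

    Bin edges are equal-width and derived from the baseline sample.
    Values outside the baseline range fall into the first or last bin.
    """
    lo, hi = min(baseline), max(baseline)
    width = (hi - lo) / bins or 1.0  # avoid zero width for constant data

    def proportions(values):
        counts = [0] * bins
        for v in values:
            idx = int((v - lo) / width)
            counts[min(max(idx, 0), bins - 1)] += 1
        # eps keeps the logarithm defined when a bin is empty
        return [max(c / len(values), eps) for c in counts]

    p = proportions(baseline)
    q = proportions(current)
    return sum((qi - pi) * math.log(qi / pi) for pi, qi in zip(p, q))


baseline = list(range(100))
shifted = [v + 30 for v in baseline]

print("no drift:", psi(baseline, baseline))
print("shifted: ", psi(baseline, shifted))
```

A value of zero means the two binned distributions are identical, and the value grows as the current data moves away from the baseline.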

Contact

The ML Insights SDK is offered by the OCI Data Science team. You can reach us through Oracle Support: https://www.oracle.com/support/.