What is ML Insights¶
ML Insights is a python library for data scientists, ML engineers and developers. Insights can be used to ingest data in different formats, apply row based transformations and monitor data and ML Models from validation to production.
ML Insights library also provides many ways to process and evaluate data and ML models. The options include low code alternative for customisation, a pre-built application and further extensibility through custom applications and custom components.
Quick Start¶
For quick introduction to ML Insights Library please follow on the links below -
Low Code Setup with Config Reader
How it works¶
ML Insights helps evaluate and monitor data and ML model for entirety of ML Observability lifecycle.
Insights is component based where each component has a specific responsibility with a workflow managing the individual components.
Insights provides components to carry out tasks like data ingestion, row level data transformation, metric calculation and post processing of metric output. More details on these are covered in the Getting Started section.
In very simple terms, one has to provide location to the input data set that needs to be processed, select any additional simple transformation needed on the input data (for example, converting an un-structured column to structured one), and decide which metrics should be calculated for different features (columns of data). The user can also decide to define some post-action to be performed once all the metrics have been calculated.
Insights provides a simple, declarative API, out of box components covering majority of common use cases to choose from. Also, Insights enables users to author json-based configurations that can be used to define and customise all of its core features.
Insights currently supports CSV, JSON, and JSONL data types.
It also supports major execution engines like Native Pandas, Dask, and Spark.
Insights provides metrics in different groups like
Data Integrity
Data Quality/ Summary
Feature and Prediction Drift Detection
Feature Correlation
Model Performance for both classification and Regression Models
Bias and Fairness
Insights supports integration for writing metric data, or connecting to OCI monitoring service.
Insights Tests/Test Suite feature enables comprehensive validation of customer’s machine learning models and data.
Contact¶
ML Insights SDK is offered by the OCI Data Science team. You can reach us through Oracle Support - https://www.oracle.com/support/.