How it works

A look under the hood of Gradient:
The world’s first AI optimization
engine for your data infrastructure

High-performance computing
optimization engine

Gradient is an AI compute optimization engine that improves the costs and
runtimes of data pipelines in cloud environments

Data-driven modeling

Gradient pairs each job with a machine-learning model that is fine-tuned based on historical Spark event logs. The model uses that data to predict resource configurations, Spark metrics, and compute costs, taking into account factors like instance types, resource demands, and dependencies.

Each job is paired with an advanced model that is trained using that job's historical metrics

Statistical trends

Gradient’s core model is trained using statistical data about your jobs. This info is used to identify patterns and determine the optimal configuration to meet your desired cost, runtime, or SLAs hit rate. This is done even in the midst of noise such as data size variation, new code updates, or spot market.

Stability-minded
iterative optimization

Gradient takes an iterative approach to optimization to ensure successful runs. Recommending small changes at first, it gradually increases the scope of its recommendations based on the impact the previous optimizations have had on job cost and performance.

Identify root causes in minutes with Gradient

Job monitoring
and observability

Gradient continuously monitors your job’s costs and Spark metrics to identify and adapt to anomalies. This data is available in the app to help data engineers identify root causes and jobs that require attention.

Gradient continuously monitors your jobs and alerts your on anomalies instantly

Self-improving models

Gradient uses a self-learning closed loop feedback system to ensure its performance improves over time. This system also allows it to adapt seamlessly to changes in workload patterns and cloud environments, becoming more accurate and efficient with each run.

Closed-loop feedback ensures our models improve over time

Co-pilot or full automation

Easily switch between co-pilot mode for guided optimization and full automation with a single click. Tailor the level of control to your team’s needs and comfort level.

Gradient supports both co-pilot and autopilot modes

No pipeline too complex

Gradient works with any data pipeline, even complex ones with varying data sizes, cyclic data patterns, DAG dependencies, etc. Gradient’s API-first approach allows for full customization to your needs.

Optimize cyclic data pipelines and variable sized data pipelines with Gradient

Goal-based optimization

Input your runtime SLA for Gradient to optimize your compute clusters to meet that SLA at the cheapest cost. Or if you just want the lowest compute costs possible, Gradient will optimize your clusters for the lowest costs.