Blog Archives

Realities of company and cloud complexities require new levels of control and autonomy to meet business goals at scale

Jeffrey Chou
10 Sep 2023

Blog

Configuring Databricks clusters can seem more like art than science. We’ve reported in the past about ways to optimize worker and driver nodes, and how the proper selection of instances impacts a job’s cost and performance. We’ve also discussed how autoscaling performs, and how it’s not always the most efficient choice for static jobs. In

Jeffrey Chou
17 Aug 2023

Blog

Introduction: The Gradient Command Line Interface (CLI) is a powerful yet easy utility to automate the optimization of your Spark jobs from your terminal, command prompt, or automation scripts. Whether you are a Data Engineer, SysDevOps administrator, or just an Apache Spark enthusiast, knowing how to use the Gradient CLI can be incredibly beneficial as

Pete Tamisin
11 Jul 2023

Blog, Case Study

Jeffrey Chou
30 Jun 2023

Blog

Summary In this blog post, we’ll explore how you can integrate Sync’s Gradient with Airflow. We’ll walk through the steps to create a DAG that will submit a run to Databricks, and then make a call through Sync’s library to generate a recommendation for an optimized cluster for that task. This DAG example can be

Brandon Kaplan
27 Jun 2023

Blog

Introduction: Using Gradient in a Workflow Gradient, the latest product release from Sync Computing, helps customers manage the infrastructure behind their recurring Apache Spark applications. Gradient gives infrastructure recommendations for each job to lower the cost of their Production jobs while hitting their target SLA’s. We’ve been hard at work on this project for a

Sean Gorsky
19 Jun 2023

Blog

Category:
Blog

Why Your Data Pipelines Need Closed-Loop Feedback Control

Are Databricks clusters with Photon and Graviton instances worth it?

How to Use the Gradient CLI Tool to Optimize Databricks / EMR Programmatically

Databricks Summit ’23 Talk: “How we built Gradient to optimize Databricks clusters at scale”

Integrating Gradient into Apache Airflow

Developing Gradient Part II