Tag: driver

As many previous blog posts have reported, tuning the cluster configuration of Apache Spark is a notoriously difficult problem. When a data engineer needs to lower costs or accelerate runtimes on platforms such as EMR or Databricks on AWS, tuning these parameters becomes a high priority. Here at Sync, we will experimentally…

Jeffrey Chou
07 Feb 2023

Blog, Case Study

Databricks driver sizing impact on cost and performance
