From the course: Microsoft Azure Solutions Architect Expert (AZ-305) Cert Prep: 2 Design Data Storage Solutions by Microsoft Press

Unlock this course with a free trial

Join today to access over 23,200 courses taught by industry experts.

Introduction to Azure Databricks

Introduction to Azure Databricks

- [Instructor] When you need to integrate data across applications or when you want to be able to manipulate data and report on it, one of the tools we can use is Azure Databricks. So Databricks is a tool primarily for data exploration and cleanup. Now it uses Apache Spark clusters under the hood to perform the work, and they're designed to distribute workloads across nodes within a cluster. Of course, being Azure, you can spin clusters up and down as needed and only pay for what you need. And you also get to choose between the size of clusters, the number of nodes in a cluster, and how much CPU, to have how much RAM, and even the type of CPU they can use. Before we get into that, when we're first creating a Databricks cluster, we do have a couple of configuration options we need to be aware of from an architectural point of view. The first is the pricing tier. So the standard is a Standard pricing tier, but we also…

Contents