5  What is Databricks?

For Data Science, Databricks Streamlines the end-to-end data science workflow — from data prep to modeling to sharing insights — with a collaborative and unified data science environment built on an open lakehouse foundation. Get quick access to clean and reliable data, preconfigured compute resources, IDE integration, multi-language support, and built-in advanced visualization tools for maximum flexibility for data analytics teams.
Databricks website

Databricks comparison1

With origins in academia and the open-source community, Databricks was founded in 2013 by the original creators of Apache Spark™, Delta Lake, and MLflow. As the world’s first and only lakehouse platform in the cloud, Databricks combines the best of data warehouses and data lakes to offer an open and unified platform for data and AI.
Databricks website

5.1 The elevator pitches

5.2 The details

5.2.1 Clean and reliable data

5.2.2 Preconfigured compute resources

5.2.3 IDE integration

Databricks has taken the Jupyter Notebook (.ipynb) and the classic notebook interface and built a tool that is highly responsive and usable for data scientists and engineers.

5.2.4 multi-language support

5.2.5 built-in advanced visualization tools


  1. https://www.databricks.com/spark/comparing-databricks-to-apache-spark↩︎