This is a quick introduction to getting started with Azure Databricks and CrateDB.
Set up Azure Databricks
- Add a new Databricks service to your Azure Subscription
- Once this is done, use “Launch Workspace”
- After you are signed in to Azure Databricks, use the common task “New Cluster” to start a cluster for executing your Spark jobs
- Install the PostgreSQL JDBC driver from Maven on your cluster (at the time of publishing: org.postgresql:postgresql:42.2.23)
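CrateDB speaks the PostgreSQL wire protocol, which is why the pgjdbc driver and a standard PostgreSQL JDBC URL are used throughout. As an illustration of how the URL in the examples below is composed (the helper function is hypothetical, not part of any API):

```python
# Hypothetical helper, for illustration only: composes the JDBC URL used
# in the notebooks below. CrateDB accepts the standard pgjdbc URL format,
# with the host, the port (5432 by default), and an sslmode parameter.
def make_jdbc_url(host: str, port: int = 5432, sslmode: str = "require") -> str:
    return f"jdbc:postgresql://{host}:{port}/?sslmode={sslmode}"

print(make_jdbc_url("my-cratedb.example.com"))
# jdbc:postgresql://my-cratedb.example.com:5432/?sslmode=require
```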
Connect to CrateDB: Scala example
- Create a new notebook with Scala as the default language
- Add the following code and run the notebook
val crateUsername = "<username>"
val cratePassword = "<password>"
val postgresqlUrl = "jdbc:postgresql://<url-to-server>:5432/?sslmode=require"
val tableName = "<tablename>"

val jdbcDF = spark.read
  .format("jdbc")
  .option("url", postgresqlUrl)
  .option("driver", "org.postgresql.Driver")
  .option("dbtable", tableName)
  .option("user", crateUsername)
  .option("password", cratePassword)
  .option("fetchsize", 100000)
  .load()

jdbcDF.head(10)
- You should see the first rows of your CrateDB table in the output
Connect to CrateDB: Python example
- Create a new notebook with Python as the default language
- Add the following code and run the notebook
crateUsername = "<username>"
cratePassword = "<password>"
postgresqlUrl = "jdbc:postgresql://<url-to-server>:5432/?sslmode=require"
tableName = "<tablename>"

jdbcDF = spark.read \
    .format("jdbc") \
    .option("url", postgresqlUrl) \
    .option("driver", "org.postgresql.Driver") \
    .option("dbtable", tableName) \
    .option("user", crateUsername) \
    .option("password", cratePassword) \
    .option("fetchsize", 100000) \
    .load()

jdbcDF.head(n=10)
- You should see the first rows of your CrateDB table in the output
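When only a subset of a table is needed, Spark's JDBC source also accepts a subquery in place of a plain table name for the dbtable option, so CrateDB filters the rows before they are transferred to Spark. A minimal sketch of the options involved (the helper function and the example query are hypothetical; reuse the SparkSession from the notebook above):

```python
# Hypothetical sketch: collects the options passed to spark.read.format("jdbc")
# above into a dict. Wrapping a SELECT as the "dbtable" value makes CrateDB
# evaluate it server-side, so only matching rows cross the wire.
def jdbc_read_options(url, user, password, dbtable, fetchsize=100000):
    return {
        "url": url,
        "driver": "org.postgresql.Driver",
        "dbtable": dbtable,
        "user": user,
        "password": password,
        "fetchsize": str(fetchsize),
    }

opts = jdbc_read_options(
    "jdbc:postgresql://<url-to-server>:5432/?sslmode=require",
    "<username>", "<password>",
    "(SELECT * FROM <tablename> LIMIT 100) AS subquery",
)
# In the notebook: jdbcDF = spark.read.format("jdbc").options(**opts).load()
```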