Databricks installation

Pathling has been tested on Databricks Runtime 14.3 LTS.

To install the Pathling library on a Databricks cluster, navigate to the "Compute" section and click on the cluster. Click on the "Libraries" tab, and click "Install new".

To install the core Pathling functionality, select "Maven" as the library source and install the Pathling library runtime Maven package.

You can then optionally install the PyPI package for Python support, and/or the R package.

Once the cluster is restarted, Pathling should be available for use within notebooks.
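As a quick sanity check after the restart, a notebook cell along the following lines can confirm that the Python package is visible to the notebook's interpreter. This is a minimal sketch rather than an official verification step: the helper name `installed_version` is hypothetical, and it assumes the PyPI distribution is named `pathling`. It checks only the Python package, not the Maven library.

```python
from importlib import metadata
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of a Python distribution, or None if absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


# Assumption: the PyPI distribution name is "pathling".
version = installed_version("pathling")
print("pathling:", version if version is not None else "not installed")
```

If a version is reported, creating a context in the notebook with `PathlingContext.create(spark)` (from the `pathling` package) is a reasonable next check that the Maven library is also loaded.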

See the Databricks documentation on Libraries for more information.

Environment variables

By default, Databricks uses Java 8 within its clusters, while Pathling requires Java 17. To enable Java 17 support within your cluster, navigate to Advanced Options > Spark > Environment Variables and add the following:

JNAME=zulu17-ca-amd64
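
To confirm that the setting took effect, you can inspect the JVM's `java.version` property after the cluster restarts. The sketch below shows one way to interpret that string. The helper names `java_major_version` and `meets_requirement` are hypothetical, and the function handles both the legacy `1.8.0_x` scheme and the modern `17.0.x` scheme.

```python
def java_major_version(version: str) -> int:
    """Extract the major Java version from a java.version string."""
    parts = version.split(".")
    # Legacy scheme used up to Java 8, e.g. "1.8.0_392" means Java 8.
    if parts[0] == "1" and len(parts) > 1:
        return int(parts[1])
    # Modern scheme, e.g. "17.0.9", "17" or "17-ea": take the leading digits.
    digits = ""
    for ch in parts[0]:
        if not ch.isdigit():
            break
        digits += ch
    return int(digits)


def meets_requirement(version: str, minimum: int = 17) -> bool:
    """Return True if the given java.version string is at least `minimum`."""
    return java_major_version(version) >= minimum


print(meets_requirement("1.8.0_392"))  # Java 8
print(meets_requirement("17.0.9"))     # Java 17
```

In a notebook, the version string might be obtained with `spark.sparkContext._jvm.System.getProperty("java.version")`. This is an assumption about the environment: it relies on the Py4J gateway, which may be restricted in some cluster access modes, and it reports the driver's JVM only.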