Skip to content

Databricks Spark SQL Data Source in Privacera#

Prerequisites#

Have the following details ready to enter into the data source definition in Privacera:

  • A username and password in the target system that has read/write permission.

  • The name of the JDBC driver you need.

  • A JDBC connection string to communicate with the target data source.

Add Databricks Spark SQL Data Source in Privacera#

To add Databricks Spark SQL data source in Privacera Platform:

  1. Navigate to: Settings > Data Source Registration.

  2. Optionally click Add System or modify an existing data source.

  3. Enter a useful name for this data source and a useful description.

  4. Click Save.

  5. Locate the new data source system name and from the wrench icon on the right, select Add Data Source.

  6. In the Add Data Source dialog, on the Choose tab, select Databricks Spark SQL.

  7. On the Configure tab:

  8. Enter a required Application Name of your choice.

  9. Enter a required Application Code of your choice. This is an identifier for your own use.

  10. If you have prepared a properties file in JSON format, click Import Properties and load the file.

  11. Scroll to find the following properties and enter the values you prepared:

    1. jdbc.username

      Enter the Email ID used to login to the Databricks account console.

    2. jdbc.password

      On Databricks account console:

      a. Navigate to Settings -> User Settings -> Access Tokens.

      b. Click Generate New Token.

      c. Use the Token as password.

    3. jdbc.url

      On Databricks account console:

      a. Click Compute and select the Cluster.

      b. Navigate to Advance Options and click JDBC/ODBC tab.

      c. Copy the URL from the JBDC URL section and update as shown in the following example:

      Original URL:
      jdbc:spark://<yourHostname>:443/default;transportMode=http;ssl=1;httpPath=sql/protocolv1/o/6824215520793722/0406-064613-sweet542;**AuthMech=3;UID=token;PWD=<personal-access-token>**
      
      New URL:
      jdbc:hive2://<yourHostname>:443/default;transportMode=http;ssl=true;httpPath=sql/protocolv1/o/6824215520793722/0406-064613-sweet542;
      
  12. Accept the default values for all other properties or modify them if needed.

  13. At the bottom left, to verify the properties, click Test Connection.

    Note

    Your Databricks cluster should be up and running before clicking Test Connection.

  14. At the bottom right, click Next to save the data source or Back to return to the Choose tab.