New Relic Databricks Integration

This integration collects telemetry from Databricks (including Spark on Databricks) and/or Spark telemetry from any Spark deployment. See the Features section for supported telemetry types.

Important Notes

All references within this document to Databricks documentation reference the Databricks on AWS documentation. Use the cloud switcher menu located in the upper right hand corner of the documentation to select corresponding documentation for a different cloud.
On-host deployment is currently the only supported deployment type. For Databricks and non-Databricks Spark deployments, the integration can be deployed on any supported host platform. For Databricks, support is also provided to deploy the integration on the driver node of a Databricks cluster using a cluster-scoped init script.

Getting Started

To get started with the New Relic Databricks integration, deploy the integration using a supported deployment type, configure the integration using supported configuration mechanisms, and then import the sample dashboard.

On-host

The New Relic Databricks integration can be run on any supported host platform. The integration will collect Databricks telemetry (including Spark on Databricks) via the Databricks ReST API using the Databricks SDK for Go and/or Spark telemetry from a non-Databricks Spark deployment via the Spark ReST API.

The New Relic Databricks integration can also be deployed on the driver node of a Databricks cluster using the provided init script to install and configure the integration at cluster startup time.

Deploy the integration on a host

The New Relic Databricks integration provides binaries for the following host platforms.

Linux amd64
Windows amd64

To run the Databricks integration on a host, perform the following steps.

Download the appropriate archive for your platform from the latest release.
Extract the archive to a new or existing directory.
Create a directory named configs in the same directory.
Create a file named config.yml in the configs directory and copy the contents of the file configs/config.template.yml in this repository into it.
Edit the config.yml file to configure the integration appropriately for your environment.
From the directory where the archive was extracted, execute the integration binary using the command ./newrelic-databricks-integration (or .\newrelic-databricks-integration.exe on Windows) with the appropriate Command Line Options.

Deploy the integration on the driver node of a Databricks cluster

The New Relic Databricks integration can be deployed on the driver node of a Databricks cluster using a cluster-scoped init script. The init script uses custom environment variables to specify configuration parameters necessary for the integration configuration.

To install the init script, perform the following steps.

Login to your Databricks account and navigate to the desired workspace.
Follow the recommendations for init scripts to store the cluster_init_integration.sh script within your workspace in the recommended manner. For example, if your workspace is enabled for Unity Catalog, you should store the init script in a Unity Catalog volume.
Navigate to the Compute tab and select the desired all-purpose or job compute to open the compute details UI.
Click the button labeled Edit to edit the compute's configuration.
Follow the steps to use the UI to configure a cluster-scoped init script and point to the location where you stored the init script in step 2 above.
If your cluster is not running, click on the button labeled Confirm to save your changes. Then, restart the cluster. If your cluster is already running, click on the button labeled Confirm and restart to save your changes and restart the cluster.

Additionally, follow the steps to set environment variables to add the following environment variables.

NEW_RELIC_API_KEY - Your New Relic User API Key
NEW_RELIC_LICENSE_KEY - Your New Relic License Key
NEW_RELIC_ACCOUNT_ID - Your New Relic Account ID
NEW_RELIC_REGION - The region of your New Relic account; one of US or EU
NEW_RELIC_DATABRICKS_WORKSPACE_HOST - The instance name of the target Databricks instance
NEW_RELIC_DATABRICKS_ACCESS_TOKEN - To authenticate with a personal access token, your personal access token
NEW_RELIC_DATABRICKS_OAUTH_CLIENT_ID - To use a service principal to authenticate with Databricks (OAuth M2M), the OAuth client ID for the service principal
NEW_RELIC_DATABRICKS_OAUTH_CLIENT_SECRET - To use a service principal to authenticate with Databricks (OAuth M2M), an OAuth client secret associated with the service principal

Note that the NEW_RELIC_API_KEY and NEW_RELIC_ACCOUNT_ID are currently unused but are required by the new-relic-client-go module used by the integration. Additionally, note that only the personal access token or OAuth credentials need to be specified but not both. If both are specified, the OAuth credentials take precedence. Finally, make sure to restart the cluster following the configuration of the environment variables.

Features

The New Relic Databricks integration supports the following capabilities.

Collect Spark telemetry

The New Relic Databricks integration can collect telemetry from Spark running on Databricks. By default, the integration will automatically connect to and collect telemetry from the Spark deployments in all clusters created via the UI or API in the specified workspace.

The New Relic Databricks integration can also collect Spark telemetry from any non-Databricks Spark deployment.
Collect Databricks billable usage and list pricing data

The New Relic Databricks integration can collect Databricks billable usage and list pricing data from the Databricks system tables. This data can be used to show basic Databricks DBU consumption and cost metrics directly within New Relic.

Usage

Command Line Options

Option	Description	Default
--config_path	path to the (#configyml) to use	`configs/config.yml`
--dry_run	flag to enable "dry run" mode	`false`
--env_prefix	prefix to use for environment variable lookup	`''`
--verbose	flag to enable "verbose" mode	`false`
--version	display version information only	N/a

Configuration

The Databricks integration is configured using the config.yml and/or environment variables. For Databricks, authentication related configuration parameters may also be set in a Databricks configuration profile. In all cases, where applicable, environment variables always take precedence.

`config.yml`

All configuration parameters for the Databricks integration can be set using a YAML file named config.yml. The default location for this file is configs/config.yml relative to the current working directory when the integration binary is executed. The supported configuration parameters are listed below. See config.template.yml for a full configuration example.

General configuration

The parameters in this section are configured at the top level of the config.yml.

`licenseKey`

Description	Valid Values	Required	Default
New Relic license key	string	Y	N/a

This parameter specifies the New Relic License Key (INGEST) that should be used to send generated metrics.

The license key can also be specified using the NEW_RELIC_LICENSE_KEY environment variable.

`region`

Description	Valid Values	Required	Default
New Relic region identifier	`US` / `EU`	N	`US`

This parameter specifies which New Relic region that generated metrics should be sent to.

`interval`

Description	Valid Values	Required	Default
Polling interval (in seconds)	numeric	N	60

This parameter specifies the interval (in seconds) at which the integration should poll for data.

This parameter is only used when runAsService is set to true.

`runAsService`

Description	Valid Values	Required	Default
Flag to enable running the integration as a "service"	`true` / `false`	N	`false`

The integration can run either as a "service" or as a simple command line utility which runs once and exits when it is complete.

When set to true, the integration process will run continuously and poll the for data at the recurring interval specified by the interval parameter. The process will only exit if it is explicitly stopped or a fatal error or panic occurs.

When set to false, the integration will run once and exit. This is intended for use with an external scheduling mechanism like cron.

`pipeline`

Description	Valid Values	Required	Default
The root node for the set of pipeline configuration parameters	YAML Mapping	N	N/a

The integration retrieves, processes, and exports data to New Relic using a data pipeline consisting of one or more receivers, a processing chain, and a New Relic exporter. Various aspects of the pipeline are configurable. This element groups together the configuration parameters related to pipeline configuration.

`log`

Description	Valid Values	Required	Default
The root node for the set of log configuration parameters	YAML Mapping	N	N/a

The integration uses the logrus package for application logging. This element groups together the configuration parameters related to log configuration.

`mode`

Description	Valid Values	Required	Default
The integration execution mode	`databricks`	N	`databricks`

The integration execution mode. Currently, the only supported execution mode is databricks.

Deprecated: As of v2.3.0, this configuration parameter is no longer used. The presence (or not) of the databricks top-level node will be used to enable (or disable) the Databricks collector. Likewise, the presence (or not) of the spark top-level node will be used to enable (or disable) the Spark collector separate from Databricks.

`databricks`

Description	Valid Values	Required	Default
The root node for the set of Databricks configuration parameters	YAML Mapping	N	N/a

This element groups together the configuration parameters to configure the Databricks collector. If this element is not specified, the Databricks collector will not be run.

Note that this node is not required. It can be used with or without the spark top-level node.

`spark`

Description	Valid Values	Required	Default
The root node for the set of Spark configuration parameters	YAML Mapping	N	N/a

This element groups together the configuration parameters to configure the Spark collector. If this element is not specified, the Spark collector will not be run.

Note that this node is not required. It can be used with or without the databricks top-level node.

`tags`

Description	Valid Values	Required	Default
The root node for a set of custom tags to add to all telemetry sent to New Relic	YAML Mapping	N	N/a

This element specifies a group of custom tags that will be added to all telemetry sent to New Relic. The tags are specified as a set of key-value pairs.

Pipeline configuration

`receiveBufferSize`

Description	Valid Values	Required	Default
Size of the buffer that holds items before processing	number	N	500

This parameter specifies the size of the buffer that holds received items before being flushed through the processing chain and on to the exporters. When this size is reached, the items in the buffer will be flushed automatically.

`harvestInterval`

Description	Valid Values	Required	Default
Harvest interval (in seconds)	number	N	60

This parameter specifies the interval (in seconds) at which the pipeline should automatically flush received items through the processing chain and on to the exporters. Each time this interval is reached, the pipeline will flush items even if the item buffer has not reached the size specified by the receiveBufferSize parameter.

`instances`

Description	Valid Values	Required	Default
Number of concurrent pipeline instances to run	number	N	3

The integration retrieves, processes, and exports metrics to New Relic using a data pipeline consisting of one or more receivers, a processing chain, and a New Relic exporter. When runAsService is true, the integration can launch one or more "instances" of this pipeline to receive, process, and export data concurrently. Each "instance" will be configured with the same processing chain and exporters and the receivers will be spread across the available instances in a round-robin fashion.

This parameter specifies the number of pipeline instances to launch.

NOTE: When runAsService is false, only a single pipeline instance is used.

Log configuration

`level`

Description	Valid Values	Required	Default
Log level	`panic` / `fatal` / `error` / `warn` / `info` / `debug` / `trace`	N	`warn`

This parameter specifies the maximum severity of log messages to output with trace being the least severe and panic being the most severe. For example, at the default log level (warn), all log messages with severities warn, error, fatal, and panic will be output but info, debug, and trace will not.

`fileName`

Description	Valid Values	Required	Default
Path to a file where log output will be written	string	N	`stderr`

This parameter designates a file path where log output should be written. When no path is specified, log output will be written to the standard error stream (stderr).

Databricks configuration

The Databricks configuration parameters are used to configure the Databricks collector.

`workspaceHost`

Description	Valid Values	Required	Default
Databricks workspace instance name	string	conditional	N/a

This parameter specifies the instance name of the target Databricks instance for which data should be collected. This is used by the integration when constructing the URLs for API calls. Note that the value of this parameter must not include the https:// prefix, e.g. https://my-databricks-instance-name.cloud.databricks.com.

This parameter is required when the collection of Spark telemetry for Spark running on Databricks is enabled. Note that this does not apply when the integration is deployed directly on the driver node via the provided init script. This parameter is unused in that scenario.

The workspace host can also be specified using the DATABRICKS_HOST environment variable.

NOTE: The DATABRICKS_HOST environment variable can not be used to specify both the instance name and the accounts API endpoint. To account for this, the environment variables DATABRICKS_WORKSPACEHOST and DATABRICKS_ACCOUNTHOST environment variables can be alternately used either separately or in combination with the DATABRICKS_HOST environment variable to specify the instance name and the accounts API endpoint, respectively.

`accountHost`

Description	Valid Values	Required	Default
Databricks accounts API endpoint	string	conditional	N/a

This parameter specifies the accounts API endpoint. This is used by the integration when constructing the URLs for account-level ReST API calls. Note that unlike the value of workspaceHost, the value of this parameter must include the https:// prefix, e.g. https://accounts.cloud.databricks.com.

This parameter is required when the collection of Databricks billable usage and list pricing data is enabled.

The account host can also be specified using the DATABRICKS_HOST environment variable.

NOTE: The DATABRICKS_HOST environment variable can not be used to specify both the instance name and the accounts API endpoint. To account for this, the environment variables DATABRICKS_WORKSPACEHOST and DATABRICKS_ACCOUNTHOST environment variables can be alternately used either separately or in combination with the DATABRICKS_HOST environment variable to specify the instance name and the accounts API endpoint, respectively.

`accountId`

Description	Valid Values	Required	Default
Databricks account ID for the accounts API	string	conditional	N/a

This parameter specifies the Databricks account ID. This is used by the integration when constructing the URLs for account-level ReST API calls.

This parameter is required when the collection of Databricks billable usage and list pricing data is enabled.

`accessToken`

Description	Valid Values	Required	Default
Databricks personal access token	string	N	N/a

When set, the integration will use Databricks personal access token authentication to authenticate Databricks API calls with the value of this parameter as the Databricks personal access token.

The personal access token can also be specified using the DATABRICKS_TOKEN environment variable or any other SDK-supported mechanism (e.g. the token field in a Databricks configuration profile).

See the authentication section for more details.

NOTE: Databricks personal access tokens can only be used to collect data at the workspace level. To collect account level data such as billable usage and list pricing data, OAuth authentication must be used instead.

`oauthClientId`

Description	Valid Values	Required	Default
Databricks OAuth M2M client ID	string	N	N/a

When set, the integration will use a service principal to authenticate with Databricks (OAuth M2M) when making Databricks API calls. The value of this parameter will be used as the OAuth client ID.

The OAuth client ID can also be specified using the DATABRICKS_CLIENT_ID environment variable or any other SDK-supported mechanism (e.g. the client_id field in a Databricks configuration profile).

See the authentication section for more details.

`oauthClientSecret`

Description	Valid Values	Required	Default
Databricks OAuth M2M client secret	string	N	N/a

When the oauthClientId is set, this parameter can be set to specify the OAuth secret associated with the service principal.

The OAuth client secret can also be specified using the DATABRICKS_CLIENT_SECRET environment variable or any other SDK-supported mechanism (e.g. the client_secret field in a Databricks configuration profile).

See the authentication section for more details.

`sparkMetrics`

Description	Valid Values	Required	Default
Flag to enable automatic collection of Spark metrics	`true` / `false`	N	`true`

Deprecated This configuration parameter has been deprecated in favor of the configuration parameter databricks.spark.enabled. Use that parameter instead.

`sparkMetricPrefix`

Description	Valid Values	Required	Default
A prefix to prepend to Spark metric names	string	N	N/a

Deprecated This configuration parameter has been deprecated in favor of the configuration parameter databricks.spark.metricPrefix. Use that parameter instead.

`sparkClusterSources`

Description	Valid Values	Required	Default
The root node for the Databricks cluster source configuration	YAML Mapping	N	N/a

Deprecated This configuration parameter has been deprecated in favor of the configuration parameter databricks.spark.clusterSources. Use that parameter instead.

`sqlStatementTimeout`

Description	Valid Values	Required	Default
Timeout (in seconds) to use when executing SQL statements on a SQL warehouse	number	N	30

Certain telemetry and data collected by the Databricks collector requires the collector to run Databricks SQL statements on a SQL warehouse. This configuration parameter specifies the number of seconds to wait before timing out a pending or running SQL query.

`spark`

Description	Valid Values	Required	Default
The root node for the set of Databricks Spark configuration parameters	YAML Mapping	N	N/a

This element groups together the configuration parameters to configure the Databricks collector settings related to the collection of telemetry from Databricks running on Spark. The configuration parameters in this group replace the configuration parameters sparkMetrics, sparkMetricPrefix, and sparkClusterSources.

`usage`

Description	Valid Values	Required	Default
The root node for the set of Databricks Usage configuration parameters	YAML Mapping	N	N/a

This element groups together the configuration parameters to configure the Databricks collector settings related to the collection of billable usage and list pricing data.

Databricks `spark` configuration

Databricks Spark `enabled`

Description	Valid Values	Required	Default
Flag to enable automatic collection of Spark metrics	`true` / `false`	N	`true`

By default, when the Databricks collector is enabled, it will automatically collect Spark telemetry from Spark running on Databricks.

This flag can be used to disable the collection of Spark telemetry by the Databricks collector. This may be useful to control data ingest when business requirements call for the collection of non-Spark related Databricks telemetry and Spark telemetry is not used. This flag is also used by the integration when it is deployed directly on the driver node of a Databricks cluster using the the provided init script since Spark telemetry is collected by the Spark collector in this scenario.

NOTE: This configuration parameter replaces the older sparkMetrics configuration parameter.

Databricks Spark `metricPrefix`

Description	Valid Values	Required	Default
A prefix to prepend to Spark metric names	string	N	N/a

This parameter serves the same purpose as the metricPrefix parameter of the Spark configuration except that it applies to Spark telemetry collected by the Databricks collector. See the metricPrefix parameter of the Spark configuration for more details.

Note that this parameter has no effect on Spark telemetry collected by the Spark collector. This includes the case when the integration is deployed directly on the driver node of a Databricks cluster using the the provided init script since Spark telemetry is collected by the Spark collector in this scenario.

NOTE: This configuration parameter replaces the older sparkMetricPrefix configuration parameter.

Databricks Spark `clusterSources`

Description	Valid Values	Required	Default
The root node for the Databricks cluster source configuration	YAML Mapping	N	N/a

The mechanism used to create a cluster is referred to as a cluster "source". The Databricks collector supports collecting Spark telemetry from all-purpose clusters created via the UI or API and from job clusters created via the Databricks Jobs Scheduler. This element groups together the flags used to individually enable or disable the cluster sources from which the Databricks collector will collect Spark telemetry.

NOTE: This configuration parameter replaces the older sparkClusterSources configuration parameter.

Databricks cluster source configuration

`ui`

Description	Valid Values	Required	Default
Flag to enable automatic collection of Spark telemetry from all-purpose clusters created via the UI	`true` / `false`	N	`true`

By default, when the Databricks collector is enabled, it will automatically collect Spark telemetry from all all-purpose clusters created via the UI.

This flag can be used to disable the collection of Spark telemetry from all-purpose clusters created via the UI.

`job`

Description	Valid Values	Required	Default
Flag to enable automatic collection of Spark telemetry from job clusters created via the Databricks Jobs Scheduler	`true` / `false`	N	`true`

By default, when the Databricks collector is enabled, it will automatically collect Spark telemetry from job clusters created by the Databricks Jobs Scheduler.

This flag can be used to disable the collection of Spark telemetry from job clusters created via the Databricks Jobs Scheduler.

`api`

Description	Valid Values	Required	Default
Flag to enable automatic collection of Spark telemetry from all-purpose clusters created via the Databricks ReST API	`true` / `false`	N	`true`

By default, when the Databricks collector is enabled, it will automatically collect Spark telemetry from all-purpose clusters created via the Databricks ReST API.

This flag can be used to disable the collection of Spark telemetry from all-purpose clusters created via the Databricks ReST API.

Databricks Usage Configuration

The Databricks usage configuration parameters are used to configure Databricks collector settings related to the collection of Databricks billable usage and list pricing data.

Databricks Usage `enabled`

Description	Valid Values	Required	Default
Flag to enable automatic collection of billable usage and list pricing data	`true` / `false`	N	`true`

By default, when the Databricks collector is enabled, it will automatically collect billable usage and list pricing data.

This flag can be used to disable the collection of billable usage and list pricing data by the Databricks collector. This may be useful when running multiple instances of the New Relic Databricks integration. In this scenario, Databricks billable usage and list pricing data collection should only be enabled on a single instance. Otherwise, billable usage data will be recorded more than once in New Relic, affecting consumption and cost calculations.

`warehouseId`

Description	Valid Values	Required	Default
ID of a SQL warehouse on which to run usage-related SQL statements	string	Y	N/a

The ID of a SQL warehouse on which to run the SQL statements used to collect Databricks billable usage and list pricing data.

This parameter is required when the collection of Databricks billable usage and list pricing data is enabled.

`includeIdentityMetadata`

Description	Valid Values	Required	Default
Flag to enable inclusion of identity related metadata in billable usage data	`true` / `false`	N	`false`

When the collection of Databricks billable usage and list pricing data is enabled, the Databricks collector can include several pieces of identifying information along with the billable usage data.

By default, when the collection of Databricks billable usage and list pricing data is enabled, the Databricks collector will not collect such data as it may be personally identifiable. This flag can be used to enable the inclusion of the identifying information.

When enabled, the following values are included.

The identity of the user a serverless billing record is attributed to. This value is included in the identity metadata returned from usage records in the billable usage system table.
The identity of the cluster creator for each usage record for billable usage attributed to all-purpose and job compute.
The single user name for each usage record for billable usage attributed to all-purpose and job compute configured for single-user access mode.
The identity of the warehouse creator for each usage record for billable usage attributed to SQL warehouse compute.

`runTime`

Description	Valid Values	Required	Default
Time of day (as `HH:mm:ss`) at which to run usage data collection	string with format `HH:mm:ss`	N	`02:00:00`

This parameter specifies the time of day at which the collection of billable usage and list pricing data occur. The value must be of the form HH:mm:ss where HH is the 0-padded 24-hour clock hour (00 - 23), mm is the 0-padded minute (00 - 59) and ss is the 0-padded second (00 - 59). For example, 09:00:00 is the time 9:00 AM and 23:30:00 is the time 11:30 PM.

The time will always be interpreted according to the UTC time zone. The time zone can not be configured. For example, to specify that the integration should be run at 2:00 AM EST (-0500), the value 07:00:00 should be specified.

Spark configuration

The Spark configuration parameters are used to configure the Spark collector.

`webUiUrl`

Description	Valid Values	Required	Default
The Web UI URL of an application on the Spark deployment to monitor	string	N	N/a

This parameter can be used to monitor a non-Databricks Spark deployment. It specifes the URL of the Web UI of an application running on the Spark deployment to monitor. The value should be of the form http[s]://<hostname>:<port> where <hostname> is the hostname of the Spark deployment to monitor and <port> is the port number of the Spark application's Web UI (typically 4040 or 4041, 4042, etc if more than one application is running on the same host).

Note that the value must not contain a path. The path of the Spark ReST API endpoints (mounted at /api/v1) will automatically be prepended.

`metricPrefix`

Description	Valid Values	Required	Default
A prefix to prepend to Spark metric names	string	N	N/a

This parameter specifies a prefix that will be prepended to each Spark metric name when the metric is exported to New Relic.

For example, if this parameter is set to spark., then the full name of the metric representing the value of the memory used on application executors (app.executor.memoryUsed) will be spark.app.executor.memoryUsed.

Note that it is not recommended to leave this value empty as the metric names without a prefix may be ambiguous. Additionally, note that this parameter has no effect on Spark telemetry collected by the Databricks collector. In that case, use the sparkMetricPrefix instead.

Authentication

The Databricks integration uses the Databricks SDK for Go to access the Databricks and Spark ReST APIs. The SDK performs authentication on behalf of the integration and provides many options for configuring the authentication type and credentials to be used. See the SDK documentation and the Databricks client unified authentication documentation for details.

For convenience purposes, the following parameters can be used in the Databricks configuration section of the `config.yml file.

accessToken - When set, the integration will instruct the SDK to explicitly use Databricks personal access token authentication. The SDK will not attempt to try other authentication mechanisms and instead will fail immediately if personal access token authentication fails.

NOTE: Databricks personal access tokens can only be used to collect data at the workspace level. To collect account level data such as billable usage and list pricing data, OAuth authentication must be used instead.
oauthClientId - When set, the integration will instruct the SDK to explicitly use a service principal to authenticate with Databricks (OAuth M2M). The SDK will not attempt to try other authentication mechanisms and instead will fail immediately if OAuth M2M authentication fails. The OAuth Client secret can be set using the oauthClientSecret configuration parameter or any of the other mechanisms supported by the SDK (e.g. the client_secret field in a Databricks configuration profile or the DATABRICKS_CLIENT_SECRET environment variable).
oauthClientSecret - The OAuth client secret to use for OAuth M2M authentication. This value is only used when oauthClientId is set in the config.yml. The OAuth client secret can also be set using any of the other mechanisms supported by the SDK (e.g. the client_secret field in a Databricks configuration profile or the DATABRICKS_CLIENT_SECRET environment variable).

Billable Usage & List Pricing Data

The New Relic Databricks integration can collect Databricks billable usage and list pricing data from the Databricks system tables. This data can be used to show basic Databricks DBU consumption and cost metrics directly within New Relic.

When the Databricks usage enabled flag is set to true, the Databricks collector will import billable usage records from the system.billing.usage table and list pricing records from the system.billing.list_prices table once a day at the time specified in the runTime configuration parameter.

NOTE: In order for the New Relic Databricks integration to collect billing usage and list pricing data, OAuth authentication must be used. This is required even when the integration is deployed on the driver node of a Databricks cluster using the provided init script because the billing usage and list pricing data can only be acquired using the account-level ReST API calls.

Billable Usage Data

On each run, billable usage data is collected for the previous day. For each billable usage record, a corresponding record is created as a New Relic event with the event type DatabricksUsage and the following attributes.

NOTE: Not every attribute is included in every event. For example, the cluster_* attributes are only included in events for usage records relevant to all-purpose or job related compute. Similarly, the warehouse_* attributes are only included in events for usage records relevant to SQL warehouse related compute.

NOTE: Descriptions below are sourced from the billable usage system table reference.

Name	Description
`account_id`	ID of the account this usage record was generated for
`workspace_id`	ID of the Workspace this usage record was associated with
`workspace_name`	Name of the Workspace this usage record was associated with
`record_id`	Unique ID for this usage record
`sku_name`	Name of the SKU associated with this usage record
`cloud`	Name of the Cloud this usage record is relevant for
`usage_start_time`	The start time relevant to this usage record
`usage_end_time`	The end time relevant to this usage record
`usage_date`	Date of this usage record
`custom_tags`	Tags applied by the users to this usage record. Includes compute resource tags and jobs tags.
`usage_unit`	Unit this usage record is measured in
`usage_quantity`	Number of units consumed for this usage record
`record_type`	Whether the usage record is original, a retraction, or a restatement. See the section "Analyze Correction Records" in the Databricks documentation for more details.
`ingestion_date`	Date the usage record was ingested into the usage table
`billing_origin_product`	The product that originated this usage reocrd
`usage_type`	The type of usage attributed to the product or workload for billing purposes
`cluster_id`	ID of the cluster associated with this usage record
`cluster_creator`	Creator of the cluster associated with this usage record (only included if `includeIdentityMetadata` is `true`)
`cluster_single_user_name`	Single user name of the cluster associated with this usage record if the access mode of the cluster is single-user access mode (only included if `includeIdentityMetadata` is `true`)
`cluster_source`	Cluster source of the cluster associated with this usage record
`cluster_instance_pool_id`	Instance pool ID of the cluster associated with this usage record
`warehouse_id`	ID of the SQL warehouse associated with this usage record
`warehouse_name`	Name of the SQL warehouse associated with this usage record
`warehouse_creator`	Creator of the SQL warehouse associated with this usage record (only included if `includeIdentityMetadata` is `true`)
`instance_pool_id`	ID of the instance pool associated with this usage record
`node_type`	The instance type of the compute resource associated with this usage record
`job_id`	ID of the job associated with this usage record for serverless compute or jobs compute usage
`job_run_id`	ID of the job run associated with this usage record for serverless compute or jobs compute usage
`job_name`	User-given name of the job associated with this usage record for serverless compute or jobs compute usage
`notebook_id`	ID of the notebook associated with this usage record for serverless compute for notebook usage
`notebook_path`	Workspace storage path of the notebook associated with this usage for serverless compute for notebook usage
`dlt_pipeline_id`	ID of the Delta Live Tables pipeline associated with this usage record
`dlt_update_id`	ID of the Delta Live Tables pipeline update associated with this usage record
`dlt_maintenance_id`	ID of the Delta Live Tables pipeline maintenance tasks associated with this usage record
`run_name`	Unique user-facing identifier of the Mosaic AI Model Training fine-tuning run associated with this usage record
`endpoint_name`	Name of the model serving endpoint or vector search endpoint associated with this usage record
`endpoint_id`	ID of the model serving endpoint or vector search endpoint associated with this usage record
`central_clean_room_id`	ID of the central clean room associated with this usage record
`run_as`	See the section "Analyze Identity Metadata" in the Databricks documentation for more details (only included if `includeIdentityMetadata` is `true`)
`jobs_tier`	Jobs tier product features for this usage record: values include `LIGHT`, `CLASSIC`, or `null`
`sql_tier`	SQL tier product features for this usage record: values include `CLASSIC`, `PRO`, or `null`
`dlt_tier`	DLT tier product features for this usage record: values include `CORE`, `PRO`, `ADVANCED`, or `null`
`is_serverless`	Flag indicating if this usage record is associated with serverless usage: values include `true` or `false`, or `null`
`is_photon`	Flag indicating if this usage record is associated with Photon usage: values include `true` or `false`, or `null`
`serving_type`	Serving type associated with this usage record: values include `MODEL`, `GPU_MODEL`, `FOUNDATION_MODEL`, `FEATURE`, or `null`

List Pricing Data

On each run, list pricing data is gathered into a New Relic lookup table named DatabricksListPrices. The entire lookup table is updated on each run. For each pricing record, this table will contain a corresponding row with the following columns.

NOTE: Descriptions below are sourced from the pricing system table reference.

Name	Description
account_id	ID of the account this pricing record was generated for
price_start_time	The time the price in this pricing record became effective in UTC
price_end_time	The time the price in this pricing record stopped being effective in UTC
sku_name	Name of the SKU associated with this pricing record
cloud	Name of the Cloud this pricing record is relevant for
currency_code	The currency the price in this pricing record is expressed in
usage_unit	The unit of measurement that is monetized in this pricing record
list_price	A single price that can be used for simple long-term estimates
promotional_price	A temporary promotional price that all customers get which could be used for cost estimation during the temporary period
effective_list_price	The effective list price used for calculating the cost

Building

Coding Conventions

Style Guidelines

While not strictly enforced, the basic preferred editor settings are set in the .editorconfig. Other than this, no style guidelines are currently imposed.

Static Analysis

This project uses both go vet and staticcheck to perform static code analysis. These checks are run via precommit on all commits. Though this can be bypassed on local commit, both tasks are also run during the validate workflow and must have no errors in order to be merged.

Commit Messages

Commit messages must follow the conventional commit format. Again, while this can be bypassed on local commit, it is strictly enforced in the validate workflow.

The basic commit message structure is as follows.

<type>[optional scope][!]: <description>

[optional body]

[optional footer(s)]

In addition to providing consistency, the commit message is used by svu during the release workflow. The presence and values of certain elements within the commit message affect auto-versioning. For example, the feat type will bump the minor version. Therefore, it is important to use the guidelines below and carefully consider the content of the commit message.

Please use one of the types below.

feat (bumps minor version)
fix (bumps patch version)
chore
build
docs
test

Any type can be followed by the ! character to indicate a breaking change. Additionally, any commit that has the text BREAKING CHANGE: in the footer will indicate a breaking change.

Local Development

For local development, simply use go build and go run. For example,

go build cmd/databricks/databricks.go

Or

go run cmd/databricks/databricks.go

If you prefer, you can also use goreleaser with the --single-target option to build the binary for the local GOOS and GOARCH only.

goreleaser build --single-target

Releases

Releases are built and packaged using goreleaser. By default, a new release will be built automatically on any push to the main branch. For more details, review the .goreleaser.yaml and the goreleaser documentation.

The svu utility is used to generate the next tag value based on commit messages.

GitHub Workflows

This project utilizes GitHub workflows to perform actions in response to certain GitHub events.

Workflow	Events	Description
validate	`push`, `pull_request` to `main` branch	Runs precommit to perform static analysis and runs commitlint to validate the last commit message
build	`push`, `pull_request`	Builds and tests code
release	`push` to `main` branch	Generates a new tag using svu and runs `goreleaser`
repolinter	`pull_request`	Enforces repository content guidelines

Appendix

The sections below cover topics that are related to Databricks telemetry but that are not specifically part of this integration. In particular, any assets referenced in these sections must be installed and/or managed separately from the integration. For example, the init scripts provided to monitor cluster health are not automatically installed or used by the integration.

Monitoring Cluster Health

New Relic Infrastructure can be used to collect system metrics like CPU and memory usage from the nodes in a Databricks cluster. Additionally, New Relic APM can be used to collect application metrics like JVM heap size and GC cycle count from the Apache Spark driver and executor JVMs. Both are achieved using cluster-scoped init scripts. The sections below cover the installation of these init scripts.

NOTE: Use of one or both init scripts will have a slight impact on cluster startup time. Therefore, consideration should be given when using the init scripts with a job cluster, particularly when using a job cluster scoped to a single task.

Configure the New Relic license key

Both the New Relic Infrastructure Agent init script and the New Relic APM Java Agent init script require a New Relic license key to be specified in a custom environment variable named NEW_RELIC_LICENSE_KEY. While the license key can be specified by hard-coding it in plain text in the compute configuration, this is not recommended. Instead, it is recommended to create a secret. using the Databricks CLI and reference the secret in the environment variable.

To create the secret and set the environment variable, perform the following steps.

Follow the steps to install or update the Databricks CLI.
Use the Databricks CLI to create a Databricks-backed secret scope with the name newrelic. For example,
```
databricks secrets create-scope newrelic
```
NOTE: Be sure to take note of the information in the referenced URL about the MANAGE scope permission and use the correct version of the command.

Use the Databricks CLI to create a secret for the license key in the new scope with the name licenseKey. For example,

databricks secrets put-secret --json '{
   "scope": "newrelic",
   "key": "licenseKey",
   "string_value": "[YOUR_LICENSE_KEY]"
}'

To set the custom environment variable named NEW_RELIC_LICENSE_KEY and reference the value from the secret, follow the steps to configure custom environment variables and add the following line after the last entry in the Environment variables field.

NEW_RELIC_LICENSE_KEY={{secrets/newrelic/licenseKey}}

Install the New Relic Infrastructure Agent init script

The cluster_init_infra.sh script automatically installs the latest version of the New Relic Infrastructure Agent on each node of the cluster.

To install the init script, perform the following steps.

Login to your Databricks account and navigate to the desired workspace.
Follow the recommendations for init scripts to store the cluster_init_infra.sh script within your workspace in the recommended manner. For example, if your workspace is enabled for Unity Catalog, you should store the init script in a Unity Catalog volume.
Navigate to the Compute tab and select the desired all-purpose or job compute to open the compute details UI.
Click the button labeled Edit to edit the compute's configuration.
Follow the steps to use the UI to configure a cluster to run an init script and point to the location where you stored the init script in step 2.
If your cluster is not running, click on the button labeled Confirm to save your changes. Then, restart the cluster. If your cluster is already running, click on the button labeled Confirm and restart to save your changes and restart the cluster.

Install the New Relic APM Java Agent init script

The cluster_init_apm.sh script automatically installs the latest version of the New Relic APM Java Agent on each node of the cluster.

To install the init script, perform the same steps as outlined in the Install the New Relic Infrastructure Agent init script section using the cluster_init_apm.sh script instead of the cluster_init_infra.sh script.

Additionally, perform the following steps.

Login to your Databricks account and navigate to the desired workspace.
Navigate to the Compute tab and select the desired all-purpose or job compute to open the compute details UI.
Click the button labeled Edit to edit the compute's configuration.

Follow the steps to configure custom Spark configuration properties and add the following lines after the last entry in the Spark Config field.

spark.driver.extraJavaOptions -javaagent:/databricks/jars/newrelic-agent.jar
spark.executor.extraJavaOptions -javaagent:/databricks/jars/newrelic-agent.jar -Dnewrelic.tempdir=/tmp

If your cluster is not running, click on the button labeled Confirm to save your changes. Then, restart the cluster. If your cluster is already running, click on the button labeled Confirm and restart to save your changes and restart the cluster.

Viewing your cluster data

With the New Relic Infrastructure Agent init script installed, a host entity will show up for each node in the cluster.

With the New Relic APM Java Agent init script installed, an APM application entity named Databricks Driver will show up for the Spark driver JVM and an APM application entity named Databricks Executor will show up for the executor JVMs. Note that all executor JVMs will report to a single APM application entity. Metrics for a specific executor can be viewed on many pages of the APM UI by selecting the instance from the Instances menu located below the time range selector. On the JVM Metrics page, the JVM metrics for a specific executor can be viewed by selecting an instance from the JVM instances table.

Additionally, both the host entities and the APM entities are tagged with the tags listed below to make it easy to filter down to the entities that make up your cluster using the entity filter bar that is available in many places in the UI.

databricksClusterId - The ID of the Databricks cluster
databricksClusterName - The name of the Databricks cluster
databricksIsDriverNode - true if the entity is on the driver node, otherwise false
databricksIsJobCluster - true if the entity is part of a job cluster, otherwise false

Below is an example of using the databricksClusterName to filter down to the host and APM entities for a single cluster using the entity filter bar on the All entities view.

Support

New Relic has open-sourced this project. This project is provided AS-IS WITHOUT WARRANTY OR DEDICATED SUPPORT. Issues and contributions should be reported to the project here on GitHub.

We encourage you to bring your experiences and questions to the Explorers Hub where our community members collaborate on solutions and new ideas.

Privacy

At New Relic we take your privacy and the security of your information seriously, and are committed to protecting your information. We must emphasize the importance of not sharing personal data in public forums, and ask all users to scrub logs and diagnostic information for sensitive information, whether personal, proprietary, or otherwise.

We define “Personal Data” as any information relating to an identified or identifiable individual, including, for example, your name, phone number, post code or zip code, Device ID, IP address, and email address.

For more information, review New Relic’s General Data Privacy Notice.

Contribute

We encourage your contributions to improve this project! Keep in mind that when you submit your pull request, you'll need to sign the CLA via the click-through using CLA-Assistant. You only have to sign the CLA one time per project.

If you have any questions, or to execute our corporate CLA (which is required if your contribution is on behalf of a company), drop us an email at [email protected].

A note about vulnerabilities

As noted in our security policy, New Relic is committed to the privacy and security of our customers and their data. We believe that providing coordinated disclosure by security researchers and engaging with the security community are important means to achieve our security goals.

If you believe you have found a security vulnerability in this project or any of New Relic's products or websites, we welcome and greatly appreciate you reporting it to New Relic through HackerOne.

If you would like to contribute to this project, review these guidelines.

To all contributors, we thank you! Without your contribution, this project would not be what it is today.

License

The New Relic Databricks Integration project is licensed under the Apache 2.0 License.

Name		Name	Last commit message	Last commit date
Latest commit History 52 Commits
.github		.github
cmd/databricks		cmd/databricks
configs		configs
examples		examples
init		init
internal		internal
.editorconfig		.editorconfig
.gitattributes		.gitattributes
.gitignore		.gitignore
.goreleaser.yaml		.goreleaser.yaml
.pre-commit-config.yaml		.pre-commit-config.yaml
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
cla.md		cla.md
commitlint.config.js		commitlint.config.js
go.mod		go.mod
go.sum		go.sum

License

newrelic-experimental/newrelic-databricks-integration

Folders and files

Latest commit

History

Repository files navigation

New Relic Databricks Integration

Table of Contents

Important Notes

Getting Started

On-host

Deploy the integration on a host

Deploy the integration on the driver node of a Databricks cluster

Features

Usage

Command Line Options

Configuration

config.yml

General configuration

licenseKey

region

interval

runAsService

pipeline

log

mode

databricks

spark

tags

Pipeline configuration

receiveBufferSize

harvestInterval

instances

Log configuration

level

fileName

Databricks configuration

workspaceHost

accountHost

accountId

accessToken

oauthClientId

oauthClientSecret

sparkMetrics

sparkMetricPrefix

sparkClusterSources

sqlStatementTimeout

spark

usage

Databricks spark configuration

Databricks Spark enabled

Databricks Spark metricPrefix

Databricks Spark clusterSources

Databricks cluster source configuration

ui

job

api

Databricks Usage Configuration

Databricks Usage enabled

warehouseId

includeIdentityMetadata

runTime

Spark configuration

webUiUrl

metricPrefix

Authentication

Billable Usage & List Pricing Data

Billable Usage Data

List Pricing Data

Building

Coding Conventions

Style Guidelines

Static Analysis

Commit Messages

Local Development

Releases

GitHub Workflows

Appendix

Monitoring Cluster Health

Configure the New Relic license key

`config.yml`

`licenseKey`

`region`

`interval`

`runAsService`

`pipeline`

`log`

`mode`

`databricks`

`spark`

`tags`

`receiveBufferSize`

`harvestInterval`

`instances`

`level`

`fileName`

`workspaceHost`

`accountHost`

`accountId`

`accessToken`

`oauthClientId`

`oauthClientSecret`

`sparkMetrics`

`sparkMetricPrefix`

`sparkClusterSources`

`sqlStatementTimeout`

`spark`

`usage`

Databricks `spark` configuration

Databricks Spark `enabled`

Databricks Spark `metricPrefix`

Databricks Spark `clusterSources`

`ui`

`job`

`api`

Databricks Usage `enabled`

`warehouseId`

`includeIdentityMetadata`

`runTime`

`webUiUrl`

`metricPrefix`

Packages