num_workers |
Option<i32> |
If num_workers, number of worker nodes that this cluster must have. A cluster has one Spark driver and num_workers executors for a total of num_workers + 1 Spark nodes. Note: When reading the properties of a cluster, this field reflects the desired number of workers rather than the actual number of workers. For instance, if a cluster is resized from 5 to 10 workers, this field is immediately updated to reflect the target size of 10 workers, whereas the workers listed in executors gradually increase from 5 to 10 as the new nodes are provisioned. |
[optional] |
autoscale |
Option<crate::models::AutoScale> |
|
[optional] |
cluster_id |
Option<String> |
Canonical identifier for the cluster. This ID is retained during cluster restarts and resizes, while each new cluster has a globally unique ID. |
[optional] |
creator_user_name |
Option<String> |
Creator user name. The field won’t be included in the response if the user has already been deleted. |
[optional] |
driver |
Option<crate::models::SparkNode> |
|
[optional] |
executors |
Option<Veccrate::models::SparkNode> |
Nodes on which the Spark executors reside. |
[optional] |
spark_context_id |
Option<i64> |
A canonical SparkContext identifier. This value does change when the Spark driver restarts. The pair (cluster_id, spark_context_id) is a globally unique identifier over all Spark contexts. |
[optional] |
jdbc_port |
Option<i32> |
Port on which Spark JDBC server is listening in the driver node. No service listens on this port in executor nodes. |
[optional] |
cluster_name |
Option<String> |
Cluster name requested by the user. This doesn’t have to be unique. If not specified at creation, the cluster name is an empty string. |
[optional] |
spark_version |
Option<String> |
The runtime version of the cluster. You can retrieve a list of available runtime versions by using the Runtime versions API call. |
[optional] |
spark_conf |
Option<::std::collections::HashMap<String, serde_json::Value>> |
An arbitrary object where the object key is a configuration propery name and the value is a configuration property value. |
[optional] |
aws_attributes |
Option<crate::models::AwsAttributes> |
|
[optional] |
node_type_id |
Option<String> |
This field encodes, through a single value, the resources available to each of the Spark nodes in this cluster. For example, the Spark nodes can be provisioned and optimized for memory or compute intensive workloads. A list of available node types can be retrieved by using the List node types API call. |
[optional] |
driver_node_type_id |
Option<String> |
The node type of the Spark driver. This field is optional; if unset, the driver node type is set as the same value as node_type_id defined above. |
[optional] |
ssh_public_keys |
Option<Vec> |
SSH public key contents that are added to each Spark node in this cluster. The corresponding private keys can be used to login with the user name ubuntu on port 2200 . Up to 10 keys can be specified. |
[optional] |
custom_tags |
Option<::std::collections::HashMap<String, serde_json::Value>> |
An object with key value pairs. The key length must be between 1 and 127 UTF-8 characters, inclusive. The value length must be less than or equal to 255 UTF-8 characters. For a list of all restrictions, see AWS Tag Restrictions: https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/Using_Tags.html#tag-restrictions |
[optional] |
cluster_log_conf |
Option<crate::models::ClusterLogConf> |
|
[optional] |
init_scripts |
Option<Veccrate::models::InitScriptInfo> |
The configuration for storing init scripts. Any number of destinations can be specified. The scripts are executed sequentially in the order provided. If cluster_log_conf is specified, init script logs are sent to <destination>/<cluster-ID>/init_scripts . |
[optional] |
docker_image |
Option<crate::models::DockerImage> |
|
[optional] |
spark_env_vars |
Option<::std::collections::HashMap<String, serde_json::Value>> |
An arbitrary object where the object key is an environment variable name and the value is an environment variable value. |
[optional] |
autotermination_minutes |
Option<i32> |
Automatically terminates the cluster after it is inactive for this time in minutes. If not set, this cluster is not be automatically terminated. If specified, the threshold must be between 10 and 10000 minutes. You can also set this value to 0 to explicitly disable automatic termination. |
[optional] |
enable_elastic_disk |
Option<bool> |
Autoscaling Local Storage: when enabled, this cluster dynamically acquires additional disk space when its Spark workers are running low on disk space. This feature requires specific AWS permissions to function correctly - refer to Autoscaling local storage for details. |
[optional] |
instance_pool_id |
Option<String> |
The optional ID of the instance pool to which the cluster belongs. Refer to Pools for details. |
[optional] |
cluster_source |
Option<crate::models::ClusterSource> |
|
[optional] |
state |
Option<crate::models::ClusterState> |
|
[optional] |
state_message |
Option<String> |
A message associated with the most recent state transition (for example, the reason why the cluster entered a TERMINATED state). This field is unstructured, and its exact format is subject to change. |
[optional] |
start_time |
Option<i64> |
Time (in epoch milliseconds) when the cluster creation request was received (when the cluster entered a PENDING state). |
[optional] |
terminated_time |
Option<i64> |
Time (in epoch milliseconds) when the cluster was terminated, if applicable. |
[optional] |
last_state_loss_time |
Option<i64> |
Time when the cluster driver last lost its state (due to a restart or driver failure). |
[optional] |
last_activity_time |
Option<i64> |
Time (in epoch milliseconds) when the cluster was last active. A cluster is active if there is at least one command that has not finished on the cluster. This field is available after the cluster has reached a RUNNING state. Updates to this field are made as best-effort attempts. Certain versions of Spark do not support reporting of cluster activity. Refer to Automatic termination for details. |
[optional] |
cluster_memory_mb |
Option<i64> |
Total amount of cluster memory, in megabytes. |
[optional] |
cluster_cores |
Option<f32> |
Number of CPU cores available for this cluster. This can be fractional since certain node types are configured to share cores between Spark nodes on the same instance. |
[optional] |
default_tags |
Option<::std::collections::HashMap<String, serde_json::Value>> |
An object with key value pairs. The key length must be between 1 and 127 UTF-8 characters, inclusive. The value length must be less than or equal to 255 UTF-8 characters. For a list of all restrictions, see AWS Tag Restrictions: https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/Using_Tags.html#tag-restrictions |
[optional] |
cluster_log_status |
Option<crate::models::LogSyncStatus> |
|
[optional] |
termination_reason |
Option<crate::models::TerminationReason> |
|
[optional] |