google_cloud_pipeline_components.experimental.custom_job.custom_job module

Module for supporting Google Vertex AI Custom Training Job Op.

google_cloud_pipeline_components.experimental.custom_job.custom_job.custom_training_job_op(component_spec: Callable, display_name: Optional[str] = '', replica_count: Optional[int] = 1, machine_type: Optional[str] = 'n1-standard-4', accelerator_type: Optional[str] = '', accelerator_count: Optional[int] = 1, boot_disk_type: Optional[str] = 'pd-ssd', boot_disk_size_gb: Optional[int] = 100, timeout: Optional[str] = '', restart_job_on_worker_restart: Optional[bool] = False, service_account: Optional[str] = '', network: Optional[str] = '', worker_pool_specs: Optional[List[Mapping[str, Any]]] = None, encryption_spec_key_name: Optional[str] = '', tensorboard: Optional[str] = '', base_output_directory: Optional[str] = '', labels: Optional[Dict[str, str]] = None) → Callable

Run a pipeline task using Vertex AI custom training job.

For detailed doc of the service, please refer to https://cloud.google.com/vertex-ai/docs/training/create-custom-job

Args:

component_spec:

The task (ContainerOp) object to run as Vertex AI custom job.

display_name:

Optional. The name of the custom job. If not provided the component_spec.name will be used instead.

replica_count:

Optional. The number of replicas to be split between master workerPoolSpec and worker workerPoolSpec. (master always has 1 replica).

machine_type:

Optional. The type of the machine to run the custom job. The default value is “n1-standard-4”.

For more details about this input config, see https://cloud.google.com/vertex-ai/docs/training/configure-compute#machine-types

accelerator_type:

Optional. The type of accelerator(s) that may be attached to the machine as per accelerator_count.

For more details about this input config, see https://cloud.google.com/vertex-ai/docs/reference/rest/v1/MachineSpec#acceleratortype

accelerator_count:

Optional. The number of accelerators to attach to the machine. Defaults to 1 if accelerator_type is set.

boot_disk_type:

Optional. Type of the boot disk (default is “pd-ssd”). Valid values: “pd-ssd” (Persistent Disk Solid State Drive) or “pd-standard”

(Persistent Disk Hard Disk Drive).

boot_disk_size_gb:

Optional. Size in GB of the boot disk (default is 100GB).

timeout:

Optional. The maximum job running time. The default is 7 days. A duration in seconds with up to nine fractional digits, terminated by ‘s’. Example: “3.5s”.

restart_job_on_worker_restart:

Optional. Restarts the entire CustomJob if a worker gets restarted. This feature can be used by distributed training jobs that are not resilient to workers leaving and joining a job.

service_account:

Optional. Sets the default service account for workload run-as account. Users submitting jobs must have act-as permission on this run-as account. If unspecified, the Vertex AI Custom Code Service Agent(https://cloud.google.com/vertex-ai/docs/general/access-control#service-agents) for the CustomJob’s project is used.

network:

Optional. The full name of the Compute Engine network to which the job should be peered. For example, projects/12345/global/networks/myVPC. Format is of the form projects/{project}/global/networks/{network}. Where {project} is a project number, as in 12345, and {network} is a network name. Private services access must already be configured for the network. If left unspecified, the job is not peered with any network.

worker_pool_specs:

Optional, worker_pool_specs for distributed training. this will overwite all other cluster configurations. For details, please see: https://cloud.google.com/ai-platform-unified/docs/training/distributed-training

encryption_spec_key_name:

Optional, customer-managed encryption key options for the CustomJob. If this is set, then all resources created by the CustomJob will be encrypted with the provided encryption key.

tensorboard:

The name of a Vertex AI Tensorboard resource to which this CustomJob will upload Tensorboard logs.

base_output_directory:

The Cloud Storage location to store the output of this CustomJob or HyperparameterTuningJob. see below for more details: https://cloud.google.com/vertex-ai/docs/reference/rest/v1/GcsDestination

labels:

Optional. The labels with user-defined metadata to organize CustomJobs. See https://goo.gl/xmQnxf for more information.

Returns:

A Custom Job component OP correspoinding to the input component OP.