glue

This provider is a derived work of the Terraform Provider distributed under MPL 2.0. If you encounter a bug or missing feature, first check the pulumi/pulumi-aws repo; however, if that doesn’t turn up anything, please consult the source terraform-providers/terraform-provider-aws repo.

class pulumi_aws.glue.AwaitableGetScriptResult(dag_edges=None, dag_nodes=None, language=None, python_script=None, scala_code=None, id=None)
class pulumi_aws.glue.CatalogDatabase(resource_name, opts=None, catalog_id=None, description=None, location_uri=None, name=None, parameters=None, __props__=None, __name__=None, __opts__=None)

Provides a Glue Catalog Database Resource. You can refer to the Glue Developer Guide for a full explanation of the Glue Data Catalog functionality

Parameters
  • resource_name (str) – The name of the resource.

  • opts (pulumi.ResourceOptions) – Options for the resource.

  • catalog_id (pulumi.Input[str]) – ID of the Glue Catalog to create the database in. If omitted, this defaults to the AWS Account ID.

  • description (pulumi.Input[str]) – Description of the database.

  • location_uri (pulumi.Input[str]) – The location of the database (for example, an HDFS path).

  • name (pulumi.Input[str]) – The name of the database.

  • parameters (pulumi.Input[dict]) – A list of key-value pairs that define parameters and properties of the database.

catalog_id = None

ID of the Glue Catalog to create the database in. If omitted, this defaults to the AWS Account ID.

description = None

Description of the database.

location_uri = None

The location of the database (for example, an HDFS path).

name = None

The name of the database.

parameters = None

A list of key-value pairs that define parameters and properties of the database.

static get(resource_name, id, opts=None, catalog_id=None, description=None, location_uri=None, name=None, parameters=None)

Get an existing CatalogDatabase resource’s state with the given name, id, and optional extra properties used to qualify the lookup.

Parameters
  • resource_name (str) – The unique name of the resulting resource.

  • id (str) – The unique provider ID of the resource to lookup.

  • opts (pulumi.ResourceOptions) – Options for the resource.

  • catalog_id (pulumi.Input[str]) – ID of the Glue Catalog to create the database in. If omitted, this defaults to the AWS Account ID.

  • description (pulumi.Input[str]) – Description of the database.

  • location_uri (pulumi.Input[str]) – The location of the database (for example, an HDFS path).

  • name (pulumi.Input[str]) – The name of the database.

  • parameters (pulumi.Input[dict]) – A list of key-value pairs that define parameters and properties of the database.

translate_output_property(prop)

Provides subclasses of Resource an opportunity to translate names of output properties into a format of their choosing before writing those properties to the resource object.

Parameters

prop (str) – A property name.

Returns

A potentially transformed property name.

Return type

str

translate_input_property(prop)

Provides subclasses of Resource an opportunity to translate names of input properties into a format of their choosing before sending those properties to the Pulumi engine.

Parameters

prop (str) – A property name.

Returns

A potentially transformed property name.

Return type

str

class pulumi_aws.glue.CatalogTable(resource_name, opts=None, catalog_id=None, database_name=None, description=None, name=None, owner=None, parameters=None, partition_keys=None, retention=None, storage_descriptor=None, table_type=None, view_expanded_text=None, view_original_text=None, __props__=None, __name__=None, __opts__=None)

Provides a Glue Catalog Table Resource. You can refer to the Glue Developer Guide for a full explanation of the Glue Data Catalog functionality.

Parameters
  • resource_name (str) – The name of the resource.

  • opts (pulumi.ResourceOptions) – Options for the resource.

  • catalog_id (pulumi.Input[str]) – ID of the Glue Catalog and database to create the table in. If omitted, this defaults to the AWS Account ID plus the database name.

  • database_name (pulumi.Input[str]) – Name of the metadata database where the table metadata resides. For Hive compatibility, this must be all lowercase.

  • description (pulumi.Input[str]) – Description of the table.

  • name (pulumi.Input[str]) – Name of the SerDe.

  • owner (pulumi.Input[str]) – Owner of the table.

  • parameters (pulumi.Input[dict]) – A map of initialization parameters for the SerDe, in key-value form.

  • partition_keys (pulumi.Input[list]) – A list of columns by which the table is partitioned. Only primitive types are supported as partition keys.

  • retention (pulumi.Input[float]) – Retention time for this table.

  • storage_descriptor (pulumi.Input[dict]) –

    A storage descriptor object containing information about the physical storage of this table. You can refer to the Glue Developer Guide for a full explanation of this object.

  • table_type (pulumi.Input[str]) – The type of this table (EXTERNAL_TABLE, VIRTUAL_VIEW, etc.).

  • view_expanded_text (pulumi.Input[str]) – If the table is a view, the expanded text of the view; otherwise null.

  • view_original_text (pulumi.Input[str]) – If the table is a view, the original text of the view; otherwise null.

The partition_keys object supports the following:

  • comment (pulumi.Input[str]) - Free-form text comment.

  • name (pulumi.Input[str]) - Name of the SerDe.

  • type (pulumi.Input[str]) - The datatype of data in the Column.

The storage_descriptor object supports the following:

  • bucketColumns (pulumi.Input[list]) - A list of reducer grouping columns, clustering columns, and bucketing columns in the table.

  • columns (pulumi.Input[list]) - A list of the Columns in the table.

    • comment (pulumi.Input[str]) - Free-form text comment.

    • name (pulumi.Input[str]) - Name of the SerDe.

    • type (pulumi.Input[str]) - The datatype of data in the Column.

  • compressed (pulumi.Input[bool]) - True if the data in the table is compressed, or False if not.

  • inputFormat (pulumi.Input[str]) - The input format: SequenceFileInputFormat (binary), or TextInputFormat, or a custom format.

  • location (pulumi.Input[str]) - The physical location of the table. By default this takes the form of the warehouse location, followed by the database location in the warehouse, followed by the table name.

  • numberOfBuckets (pulumi.Input[float]) - Must be specified if the table contains any dimension columns.

  • outputFormat (pulumi.Input[str]) - The output format: SequenceFileOutputFormat (binary), or IgnoreKeyTextOutputFormat, or a custom format.

  • parameters (pulumi.Input[dict]) - A map of initialization parameters for the SerDe, in key-value form.

  • serDeInfo (pulumi.Input[dict]) - Serialization/deserialization (SerDe) information.

    • name (pulumi.Input[str]) - Name of the SerDe.

    • parameters (pulumi.Input[dict]) - A map of initialization parameters for the SerDe, in key-value form.

    • serializationLibrary (pulumi.Input[str]) - Usually the class that implements the SerDe. An example is: org.apache.hadoop.hive.serde2.columnar.ColumnarSerDe.

  • skewedInfo (pulumi.Input[dict]) - Information about values that appear very frequently in a column (skewed values).

    • skewedColumnNames (pulumi.Input[list]) - A list of names of columns that contain skewed values.

    • skewedColumnValueLocationMaps (pulumi.Input[dict]) - A list of values that appear so frequently as to be considered skewed.

    • skewedColumnValues (pulumi.Input[list]) - A mapping of skewed values to the columns that contain them.

  • sortColumns (pulumi.Input[list]) - A list of Order objects specifying the sort order of each bucket in the table.

    • column (pulumi.Input[str]) - The name of the column.

    • sortOrder (pulumi.Input[float]) - Indicates that the column is sorted in ascending order (== 1), or in descending order (==0).

  • storedAsSubDirectories (pulumi.Input[bool]) - True if the table data is stored in subdirectories, or False if not.

catalog_id = None

ID of the Glue Catalog and database to create the table in. If omitted, this defaults to the AWS Account ID plus the database name.

database_name = None

Name of the metadata database where the table metadata resides. For Hive compatibility, this must be all lowercase.

description = None

Description of the table.

name = None

Name of the SerDe.

owner = None

Owner of the table.

parameters = None

A map of initialization parameters for the SerDe, in key-value form.

partition_keys = None

A list of columns by which the table is partitioned. Only primitive types are supported as partition keys.

  • comment (str) - Free-form text comment.

  • name (str) - Name of the SerDe.

  • type (str) - The datatype of data in the Column.

retention = None

Retention time for this table.

storage_descriptor = None

A storage descriptor object containing information about the physical storage of this table. You can refer to the Glue Developer Guide for a full explanation of this object.

  • bucketColumns (list) - A list of reducer grouping columns, clustering columns, and bucketing columns in the table.

  • columns (list) - A list of the Columns in the table.

    • comment (str) - Free-form text comment.

    • name (str) - Name of the SerDe.

    • type (str) - The datatype of data in the Column.

  • compressed (bool) - True if the data in the table is compressed, or False if not.

  • inputFormat (str) - The input format: SequenceFileInputFormat (binary), or TextInputFormat, or a custom format.

  • location (str) - The physical location of the table. By default this takes the form of the warehouse location, followed by the database location in the warehouse, followed by the table name.

  • numberOfBuckets (float) - Must be specified if the table contains any dimension columns.

  • outputFormat (str) - The output format: SequenceFileOutputFormat (binary), or IgnoreKeyTextOutputFormat, or a custom format.

  • parameters (dict) - A map of initialization parameters for the SerDe, in key-value form.

  • serDeInfo (dict) - Serialization/deserialization (SerDe) information.

    • name (str) - Name of the SerDe.

    • parameters (dict) - A map of initialization parameters for the SerDe, in key-value form.

    • serializationLibrary (str) - Usually the class that implements the SerDe. An example is: org.apache.hadoop.hive.serde2.columnar.ColumnarSerDe.

  • skewedInfo (dict) - Information about values that appear very frequently in a column (skewed values).

    • skewedColumnNames (list) - A list of names of columns that contain skewed values.

    • skewedColumnValueLocationMaps (dict) - A list of values that appear so frequently as to be considered skewed.

    • skewedColumnValues (list) - A mapping of skewed values to the columns that contain them.

  • sortColumns (list) - A list of Order objects specifying the sort order of each bucket in the table.

    • column (str) - The name of the column.

    • sortOrder (float) - Indicates that the column is sorted in ascending order (== 1), or in descending order (==0).

  • storedAsSubDirectories (bool) - True if the table data is stored in subdirectories, or False if not.

table_type = None

The type of this table (EXTERNAL_TABLE, VIRTUAL_VIEW, etc.).

view_expanded_text = None

If the table is a view, the expanded text of the view; otherwise null.

view_original_text = None

If the table is a view, the original text of the view; otherwise null.

static get(resource_name, id, opts=None, catalog_id=None, database_name=None, description=None, name=None, owner=None, parameters=None, partition_keys=None, retention=None, storage_descriptor=None, table_type=None, view_expanded_text=None, view_original_text=None)

Get an existing CatalogTable resource’s state with the given name, id, and optional extra properties used to qualify the lookup.

Parameters
  • resource_name (str) – The unique name of the resulting resource.

  • id (str) – The unique provider ID of the resource to lookup.

  • opts (pulumi.ResourceOptions) – Options for the resource.

  • catalog_id (pulumi.Input[str]) – ID of the Glue Catalog and database to create the table in. If omitted, this defaults to the AWS Account ID plus the database name.

  • database_name (pulumi.Input[str]) – Name of the metadata database where the table metadata resides. For Hive compatibility, this must be all lowercase.

  • description (pulumi.Input[str]) – Description of the table.

  • name (pulumi.Input[str]) – Name of the SerDe.

  • owner (pulumi.Input[str]) – Owner of the table.

  • parameters (pulumi.Input[dict]) – A map of initialization parameters for the SerDe, in key-value form.

  • partition_keys (pulumi.Input[list]) – A list of columns by which the table is partitioned. Only primitive types are supported as partition keys.

  • retention (pulumi.Input[float]) – Retention time for this table.

  • storage_descriptor (pulumi.Input[dict]) –

    A storage descriptor object containing information about the physical storage of this table. You can refer to the Glue Developer Guide for a full explanation of this object.

  • table_type (pulumi.Input[str]) – The type of this table (EXTERNAL_TABLE, VIRTUAL_VIEW, etc.).

  • view_expanded_text (pulumi.Input[str]) – If the table is a view, the expanded text of the view; otherwise null.

  • view_original_text (pulumi.Input[str]) – If the table is a view, the original text of the view; otherwise null.

The partition_keys object supports the following:

  • comment (pulumi.Input[str]) - Free-form text comment.

  • name (pulumi.Input[str]) - Name of the SerDe.

  • type (pulumi.Input[str]) - The datatype of data in the Column.

The storage_descriptor object supports the following:

  • bucketColumns (pulumi.Input[list]) - A list of reducer grouping columns, clustering columns, and bucketing columns in the table.

  • columns (pulumi.Input[list]) - A list of the Columns in the table.

    • comment (pulumi.Input[str]) - Free-form text comment.

    • name (pulumi.Input[str]) - Name of the SerDe.

    • type (pulumi.Input[str]) - The datatype of data in the Column.

  • compressed (pulumi.Input[bool]) - True if the data in the table is compressed, or False if not.

  • inputFormat (pulumi.Input[str]) - The input format: SequenceFileInputFormat (binary), or TextInputFormat, or a custom format.

  • location (pulumi.Input[str]) - The physical location of the table. By default this takes the form of the warehouse location, followed by the database location in the warehouse, followed by the table name.

  • numberOfBuckets (pulumi.Input[float]) - Must be specified if the table contains any dimension columns.

  • outputFormat (pulumi.Input[str]) - The output format: SequenceFileOutputFormat (binary), or IgnoreKeyTextOutputFormat, or a custom format.

  • parameters (pulumi.Input[dict]) - A map of initialization parameters for the SerDe, in key-value form.

  • serDeInfo (pulumi.Input[dict]) - Serialization/deserialization (SerDe) information.

    • name (pulumi.Input[str]) - Name of the SerDe.

    • parameters (pulumi.Input[dict]) - A map of initialization parameters for the SerDe, in key-value form.

    • serializationLibrary (pulumi.Input[str]) - Usually the class that implements the SerDe. An example is: org.apache.hadoop.hive.serde2.columnar.ColumnarSerDe.

  • skewedInfo (pulumi.Input[dict]) - Information about values that appear very frequently in a column (skewed values).

    • skewedColumnNames (pulumi.Input[list]) - A list of names of columns that contain skewed values.

    • skewedColumnValueLocationMaps (pulumi.Input[dict]) - A list of values that appear so frequently as to be considered skewed.

    • skewedColumnValues (pulumi.Input[list]) - A mapping of skewed values to the columns that contain them.

  • sortColumns (pulumi.Input[list]) - A list of Order objects specifying the sort order of each bucket in the table.

    • column (pulumi.Input[str]) - The name of the column.

    • sortOrder (pulumi.Input[float]) - Indicates that the column is sorted in ascending order (== 1), or in descending order (==0).

  • storedAsSubDirectories (pulumi.Input[bool]) - True if the table data is stored in subdirectories, or False if not.

translate_output_property(prop)

Provides subclasses of Resource an opportunity to translate names of output properties into a format of their choosing before writing those properties to the resource object.

Parameters

prop (str) – A property name.

Returns

A potentially transformed property name.

Return type

str

translate_input_property(prop)

Provides subclasses of Resource an opportunity to translate names of input properties into a format of their choosing before sending those properties to the Pulumi engine.

Parameters

prop (str) – A property name.

Returns

A potentially transformed property name.

Return type

str

class pulumi_aws.glue.Classifier(resource_name, opts=None, grok_classifier=None, json_classifier=None, name=None, xml_classifier=None, __props__=None, __name__=None, __opts__=None)

Provides a Glue Classifier resource.

NOTE: It is only valid to create one type of classifier (grok, JSON, or XML). Changing classifier types will recreate the classifier.

Parameters
  • resource_name (str) – The name of the resource.

  • opts (pulumi.ResourceOptions) – Options for the resource.

  • grok_classifier (pulumi.Input[dict]) – A classifier that uses grok patterns. Defined below.

  • json_classifier (pulumi.Input[dict]) – A classifier for JSON content. Defined below.

  • name (pulumi.Input[str]) – The name of the classifier.

  • xml_classifier (pulumi.Input[dict]) – A classifier for XML content. Defined below.

The grok_classifier object supports the following:

  • classification (pulumi.Input[str]) - An identifier of the data format that the classifier matches.

  • customPatterns (pulumi.Input[str]) - Custom grok patterns used by this classifier.

  • grokPattern (pulumi.Input[str]) - The grok pattern used by this classifier.

The json_classifier object supports the following:

  • jsonPath (pulumi.Input[str]) - A JsonPath string defining the JSON data for the classifier to classify. AWS Glue supports a subset of JsonPath, as described in Writing JsonPath Custom Classifiers.

The xml_classifier object supports the following:

  • classification (pulumi.Input[str]) - An identifier of the data format that the classifier matches.

  • rowTag (pulumi.Input[str]) - The XML tag designating the element that contains each record in an XML document being parsed. Note that this cannot identify a self-closing element (closed by />). An empty row element that contains only attributes can be parsed as long as it ends with a closing tag (for example, <row item_a="A" item_b="B"></row> is okay, but <row item_a="A" item_b="B" /> is not).

grok_classifier = None

A classifier that uses grok patterns. Defined below.

  • classification (str) - An identifier of the data format that the classifier matches.

  • customPatterns (str) - Custom grok patterns used by this classifier.

  • grokPattern (str) - The grok pattern used by this classifier.

json_classifier = None

A classifier for JSON content. Defined below.

  • jsonPath (str) - A JsonPath string defining the JSON data for the classifier to classify. AWS Glue supports a subset of JsonPath, as described in Writing JsonPath Custom Classifiers.

name = None

The name of the classifier.

xml_classifier = None

A classifier for XML content. Defined below.

  • classification (str) - An identifier of the data format that the classifier matches.

  • rowTag (str) - The XML tag designating the element that contains each record in an XML document being parsed. Note that this cannot identify a self-closing element (closed by />). An empty row element that contains only attributes can be parsed as long as it ends with a closing tag (for example, <row item_a="A" item_b="B"></row> is okay, but <row item_a="A" item_b="B" /> is not).

static get(resource_name, id, opts=None, grok_classifier=None, json_classifier=None, name=None, xml_classifier=None)

Get an existing Classifier resource’s state with the given name, id, and optional extra properties used to qualify the lookup.

Parameters
  • resource_name (str) – The unique name of the resulting resource.

  • id (str) – The unique provider ID of the resource to lookup.

  • opts (pulumi.ResourceOptions) – Options for the resource.

  • grok_classifier (pulumi.Input[dict]) – A classifier that uses grok patterns. Defined below.

  • json_classifier (pulumi.Input[dict]) – A classifier for JSON content. Defined below.

  • name (pulumi.Input[str]) – The name of the classifier.

  • xml_classifier (pulumi.Input[dict]) – A classifier for XML content. Defined below.

The grok_classifier object supports the following:

  • classification (pulumi.Input[str]) - An identifier of the data format that the classifier matches.

  • customPatterns (pulumi.Input[str]) - Custom grok patterns used by this classifier.

  • grokPattern (pulumi.Input[str]) - The grok pattern used by this classifier.

The json_classifier object supports the following:

  • jsonPath (pulumi.Input[str]) - A JsonPath string defining the JSON data for the classifier to classify. AWS Glue supports a subset of JsonPath, as described in Writing JsonPath Custom Classifiers.

The xml_classifier object supports the following:

  • classification (pulumi.Input[str]) - An identifier of the data format that the classifier matches.

  • rowTag (pulumi.Input[str]) - The XML tag designating the element that contains each record in an XML document being parsed. Note that this cannot identify a self-closing element (closed by />). An empty row element that contains only attributes can be parsed as long as it ends with a closing tag (for example, <row item_a="A" item_b="B"></row> is okay, but <row item_a="A" item_b="B" /> is not).

translate_output_property(prop)

Provides subclasses of Resource an opportunity to translate names of output properties into a format of their choosing before writing those properties to the resource object.

Parameters

prop (str) – A property name.

Returns

A potentially transformed property name.

Return type

str

translate_input_property(prop)

Provides subclasses of Resource an opportunity to translate names of input properties into a format of their choosing before sending those properties to the Pulumi engine.

Parameters

prop (str) – A property name.

Returns

A potentially transformed property name.

Return type

str

class pulumi_aws.glue.Connection(resource_name, opts=None, catalog_id=None, connection_properties=None, connection_type=None, description=None, match_criterias=None, name=None, physical_connection_requirements=None, __props__=None, __name__=None, __opts__=None)

Provides a Glue Connection resource.

Parameters
  • resource_name (str) – The name of the resource.

  • opts (pulumi.ResourceOptions) – Options for the resource.

  • catalog_id (pulumi.Input[str]) – The ID of the Data Catalog in which to create the connection. If none is supplied, the AWS account ID is used by default.

  • connection_properties (pulumi.Input[dict]) – A map of key-value pairs used as parameters for this connection.

  • connection_type (pulumi.Input[str]) – The type of the connection. Defaults to JBDC.

  • description (pulumi.Input[str]) – Description of the connection.

  • match_criterias (pulumi.Input[list]) – A list of criteria that can be used in selecting this connection.

  • name (pulumi.Input[str]) – The name of the connection.

  • physical_connection_requirements (pulumi.Input[dict]) – A map of physical connection requirements, such as VPC and SecurityGroup. Defined below.

The physical_connection_requirements object supports the following:

  • availability_zone (pulumi.Input[str]) - The availability zone of the connection. This field is redundant and implied by subnet_id, but is currently an api requirement.

  • securityGroupIdLists (pulumi.Input[list]) - The security group ID list used by the connection.

  • subnet_id (pulumi.Input[str]) - The subnet ID used by the connection.

catalog_id = None

The ID of the Data Catalog in which to create the connection. If none is supplied, the AWS account ID is used by default.

connection_properties = None

A map of key-value pairs used as parameters for this connection.

connection_type = None

The type of the connection. Defaults to JBDC.

description = None

Description of the connection.

match_criterias = None

A list of criteria that can be used in selecting this connection.

name = None

The name of the connection.

physical_connection_requirements = None

A map of physical connection requirements, such as VPC and SecurityGroup. Defined below.

  • availability_zone (str) - The availability zone of the connection. This field is redundant and implied by subnet_id, but is currently an api requirement.

  • securityGroupIdLists (list) - The security group ID list used by the connection.

  • subnet_id (str) - The subnet ID used by the connection.

static get(resource_name, id, opts=None, catalog_id=None, connection_properties=None, connection_type=None, description=None, match_criterias=None, name=None, physical_connection_requirements=None)

Get an existing Connection resource’s state with the given name, id, and optional extra properties used to qualify the lookup.

Parameters
  • resource_name (str) – The unique name of the resulting resource.

  • id (str) – The unique provider ID of the resource to lookup.

  • opts (pulumi.ResourceOptions) – Options for the resource.

  • catalog_id (pulumi.Input[str]) – The ID of the Data Catalog in which to create the connection. If none is supplied, the AWS account ID is used by default.

  • connection_properties (pulumi.Input[dict]) – A map of key-value pairs used as parameters for this connection.

  • connection_type (pulumi.Input[str]) – The type of the connection. Defaults to JBDC.

  • description (pulumi.Input[str]) – Description of the connection.

  • match_criterias (pulumi.Input[list]) – A list of criteria that can be used in selecting this connection.

  • name (pulumi.Input[str]) – The name of the connection.

  • physical_connection_requirements (pulumi.Input[dict]) – A map of physical connection requirements, such as VPC and SecurityGroup. Defined below.

The physical_connection_requirements object supports the following:

  • availability_zone (pulumi.Input[str]) - The availability zone of the connection. This field is redundant and implied by subnet_id, but is currently an api requirement.

  • securityGroupIdLists (pulumi.Input[list]) - The security group ID list used by the connection.

  • subnet_id (pulumi.Input[str]) - The subnet ID used by the connection.

translate_output_property(prop)

Provides subclasses of Resource an opportunity to translate names of output properties into a format of their choosing before writing those properties to the resource object.

Parameters

prop (str) – A property name.

Returns

A potentially transformed property name.

Return type

str

translate_input_property(prop)

Provides subclasses of Resource an opportunity to translate names of input properties into a format of their choosing before sending those properties to the Pulumi engine.

Parameters

prop (str) – A property name.

Returns

A potentially transformed property name.

Return type

str

class pulumi_aws.glue.Crawler(resource_name, opts=None, catalog_targets=None, classifiers=None, configuration=None, database_name=None, description=None, dynamodb_targets=None, jdbc_targets=None, name=None, role=None, s3_targets=None, schedule=None, schema_change_policy=None, security_configuration=None, table_prefix=None, __props__=None, __name__=None, __opts__=None)

Manages a Glue Crawler. More information can be found in the AWS Glue Developer Guide

Parameters
  • resource_name (str) – The name of the resource.

  • opts (pulumi.ResourceOptions) – Options for the resource.

  • classifiers (pulumi.Input[list]) – List of custom classifiers. By default, all AWS classifiers are included in a crawl, but these custom classifiers always override the default classifiers for a given classification.

  • configuration (pulumi.Input[str]) – JSON string of configuration information.

  • database_name (pulumi.Input[str]) – The name of the Glue database to be synchronized.

  • description (pulumi.Input[str]) – Description of the crawler.

  • dynamodb_targets (pulumi.Input[list]) – List of nested DynamoDB target arguments. See below.

  • jdbc_targets (pulumi.Input[list]) – List of nested JBDC target arguments. See below.

  • name (pulumi.Input[str]) – Name of the crawler.

  • role (pulumi.Input[str]) – The IAM role friendly name (including path without leading slash), or ARN of an IAM role, used by the crawler to access other resources.

  • s3_targets (pulumi.Input[list]) – List nested Amazon S3 target arguments. See below.

  • schedule (pulumi.Input[str]) – A cron expression used to specify the schedule. For more information, see Time-Based Schedules for Jobs and Crawlers. For example, to run something every day at 12:15 UTC, you would specify: cron(15 12 * * ? *).

  • schema_change_policy (pulumi.Input[dict]) – Policy for the crawler’s update and deletion behavior.

  • security_configuration (pulumi.Input[str]) – The name of Security Configuration to be used by the crawler

  • table_prefix (pulumi.Input[str]) – The table prefix used for catalog tables that are created.

The catalog_targets object supports the following:

  • database_name (pulumi.Input[str]) - The name of the Glue database to be synchronized.

  • tables (pulumi.Input[list]) - A list of catalog tables to be synchronized.

The dynamodb_targets object supports the following:

  • path (pulumi.Input[str]) - The path to the Amazon S3 target.

The jdbc_targets object supports the following:

  • connectionName (pulumi.Input[str]) - The name of the connection to use to connect to the JDBC target.

  • exclusions (pulumi.Input[list]) - A list of glob patterns used to exclude from the crawl.

  • path (pulumi.Input[str]) - The path to the Amazon S3 target.

The s3_targets object supports the following:

  • exclusions (pulumi.Input[list]) - A list of glob patterns used to exclude from the crawl.

  • path (pulumi.Input[str]) - The path to the Amazon S3 target.

The schema_change_policy object supports the following:

  • deleteBehavior (pulumi.Input[str]) - The deletion behavior when the crawler finds a deleted object. Valid values: LOG, DELETE_FROM_DATABASE, or DEPRECATE_IN_DATABASE. Defaults to DEPRECATE_IN_DATABASE.

  • updateBehavior (pulumi.Input[str]) - The update behavior when the crawler finds a changed schema. Valid values: LOG or UPDATE_IN_DATABASE. Defaults to UPDATE_IN_DATABASE.

arn = None

The ARN of the crawler

classifiers = None

List of custom classifiers. By default, all AWS classifiers are included in a crawl, but these custom classifiers always override the default classifiers for a given classification.

configuration = None

JSON string of configuration information.

database_name = None

The name of the Glue database to be synchronized.

description = None

Description of the crawler.

dynamodb_targets = None

List of nested DynamoDB target arguments. See below.

  • path (str) - The path to the Amazon S3 target.

jdbc_targets = None

List of nested JBDC target arguments. See below.

  • connectionName (str) - The name of the connection to use to connect to the JDBC target.

  • exclusions (list) - A list of glob patterns used to exclude from the crawl.

  • path (str) - The path to the Amazon S3 target.

name = None

Name of the crawler.

role = None

The IAM role friendly name (including path without leading slash), or ARN of an IAM role, used by the crawler to access other resources.

s3_targets = None

List nested Amazon S3 target arguments. See below.

  • exclusions (list) - A list of glob patterns used to exclude from the crawl.

  • path (str) - The path to the Amazon S3 target.

schedule = None

A cron expression used to specify the schedule. For more information, see Time-Based Schedules for Jobs and Crawlers. For example, to run something every day at 12:15 UTC, you would specify: cron(15 12 * * ? *).

schema_change_policy = None

Policy for the crawler’s update and deletion behavior.

  • deleteBehavior (str) - The deletion behavior when the crawler finds a deleted object. Valid values: LOG, DELETE_FROM_DATABASE, or DEPRECATE_IN_DATABASE. Defaults to DEPRECATE_IN_DATABASE.

  • updateBehavior (str) - The update behavior when the crawler finds a changed schema. Valid values: LOG or UPDATE_IN_DATABASE. Defaults to UPDATE_IN_DATABASE.

security_configuration = None

The name of Security Configuration to be used by the crawler

table_prefix = None

The table prefix used for catalog tables that are created.

static get(resource_name, id, opts=None, arn=None, catalog_targets=None, classifiers=None, configuration=None, database_name=None, description=None, dynamodb_targets=None, jdbc_targets=None, name=None, role=None, s3_targets=None, schedule=None, schema_change_policy=None, security_configuration=None, table_prefix=None)

Get an existing Crawler resource’s state with the given name, id, and optional extra properties used to qualify the lookup.

Parameters
  • resource_name (str) – The unique name of the resulting resource.

  • id (str) – The unique provider ID of the resource to lookup.

  • opts (pulumi.ResourceOptions) – Options for the resource.

  • arn (pulumi.Input[str]) – The ARN of the crawler

  • classifiers (pulumi.Input[list]) – List of custom classifiers. By default, all AWS classifiers are included in a crawl, but these custom classifiers always override the default classifiers for a given classification.

  • configuration (pulumi.Input[str]) – JSON string of configuration information.

  • database_name (pulumi.Input[str]) – The name of the Glue database to be synchronized.

  • description (pulumi.Input[str]) – Description of the crawler.

  • dynamodb_targets (pulumi.Input[list]) – List of nested DynamoDB target arguments. See below.

  • jdbc_targets (pulumi.Input[list]) – List of nested JBDC target arguments. See below.

  • name (pulumi.Input[str]) – Name of the crawler.

  • role (pulumi.Input[str]) – The IAM role friendly name (including path without leading slash), or ARN of an IAM role, used by the crawler to access other resources.

  • s3_targets (pulumi.Input[list]) – List nested Amazon S3 target arguments. See below.

  • schedule (pulumi.Input[str]) –

    A cron expression used to specify the schedule. For more information, see Time-Based Schedules for Jobs and Crawlers. For example, to run something every day at 12:15 UTC, you would specify: cron(15 12 * * ? *).

  • schema_change_policy (pulumi.Input[dict]) – Policy for the crawler’s update and deletion behavior.

  • security_configuration (pulumi.Input[str]) – The name of Security Configuration to be used by the crawler

  • table_prefix (pulumi.Input[str]) – The table prefix used for catalog tables that are created.

The catalog_targets object supports the following:

  • database_name (pulumi.Input[str]) - The name of the Glue database to be synchronized.

  • tables (pulumi.Input[list]) - A list of catalog tables to be synchronized.

The dynamodb_targets object supports the following:

  • path (pulumi.Input[str]) - The path to the Amazon S3 target.

The jdbc_targets object supports the following:

  • connectionName (pulumi.Input[str]) - The name of the connection to use to connect to the JDBC target.

  • exclusions (pulumi.Input[list]) - A list of glob patterns used to exclude from the crawl.

  • path (pulumi.Input[str]) - The path to the Amazon S3 target.

The s3_targets object supports the following:

  • exclusions (pulumi.Input[list]) - A list of glob patterns used to exclude from the crawl.

  • path (pulumi.Input[str]) - The path to the Amazon S3 target.

The schema_change_policy object supports the following:

  • deleteBehavior (pulumi.Input[str]) - The deletion behavior when the crawler finds a deleted object. Valid values: LOG, DELETE_FROM_DATABASE, or DEPRECATE_IN_DATABASE. Defaults to DEPRECATE_IN_DATABASE.

  • updateBehavior (pulumi.Input[str]) - The update behavior when the crawler finds a changed schema. Valid values: LOG or UPDATE_IN_DATABASE. Defaults to UPDATE_IN_DATABASE.

translate_output_property(prop)

Provides subclasses of Resource an opportunity to translate names of output properties into a format of their choosing before writing those properties to the resource object.

Parameters

prop (str) – A property name.

Returns

A potentially transformed property name.

Return type

str

translate_input_property(prop)

Provides subclasses of Resource an opportunity to translate names of input properties into a format of their choosing before sending those properties to the Pulumi engine.

Parameters

prop (str) – A property name.

Returns

A potentially transformed property name.

Return type

str

class pulumi_aws.glue.GetScriptResult(dag_edges=None, dag_nodes=None, language=None, python_script=None, scala_code=None, id=None)

A collection of values returned by getScript.

python_script = None

The Python script generated from the DAG when the language argument is set to PYTHON.

scala_code = None

The Scala code generated from the DAG when the language argument is set to SCALA.

id = None

id is the provider-assigned unique ID for this managed resource.

class pulumi_aws.glue.Job(resource_name, opts=None, allocated_capacity=None, command=None, connections=None, default_arguments=None, description=None, execution_property=None, max_capacity=None, max_retries=None, name=None, role_arn=None, security_configuration=None, timeout=None, __props__=None, __name__=None, __opts__=None)

Provides a Glue Job resource.

Parameters
  • resource_name (str) – The name of the resource.

  • opts (pulumi.ResourceOptions) – Options for the resource.

  • allocated_capacity (pulumi.Input[float]) – DEPRECATED (Optional) The number of AWS Glue data processing units (DPUs) to allocate to this Job. At least 2 DPUs need to be allocated; the default is 10. A DPU is a relative measure of processing power that consists of 4 vCPUs of compute capacity and 16 GB of memory.

  • command (pulumi.Input[dict]) – The command of the job. Defined below.

  • connections (pulumi.Input[list]) – The list of connections used for this job.

  • default_arguments (pulumi.Input[dict]) – The map of default arguments for this job. You can specify arguments here that your own job-execution script consumes, as well as arguments that AWS Glue itself consumes. For information about how to specify and consume your own Job arguments, see the Calling AWS Glue APIs in Python topic in the developer guide. For information about the key-value pairs that AWS Glue consumes to set up your job, see the Special Parameters Used by AWS Glue topic in the developer guide.

  • description (pulumi.Input[str]) – Description of the job.

  • execution_property (pulumi.Input[dict]) – Execution property of the job. Defined below.

  • max_capacity (pulumi.Input[float]) – The maximum number of AWS Glue data processing units (DPUs) that can be allocated when this job runs.

  • max_retries (pulumi.Input[float]) – The maximum number of times to retry this job if it fails.

  • name (pulumi.Input[str]) – The name of the job command. Defaults to glueetl

  • role_arn (pulumi.Input[str]) – The ARN of the IAM role associated with this job.

  • security_configuration (pulumi.Input[str]) – The name of the Security Configuration to be associated with the job.

  • timeout (pulumi.Input[float]) – The job timeout in minutes. The default is 2880 minutes (48 hours).

The command object supports the following:

  • name (pulumi.Input[str]) - The name of the job command. Defaults to glueetl

  • scriptLocation (pulumi.Input[str]) - Specifies the S3 path to a script that executes a job.

The execution_property object supports the following:

  • maxConcurrentRuns (pulumi.Input[float]) - The maximum number of concurrent runs allowed for a job. The default is 1.

allocated_capacity = None

DEPRECATED (Optional) The number of AWS Glue data processing units (DPUs) to allocate to this Job. At least 2 DPUs need to be allocated; the default is 10. A DPU is a relative measure of processing power that consists of 4 vCPUs of compute capacity and 16 GB of memory.

command = None

The command of the job. Defined below.

  • name (str) - The name of the job command. Defaults to glueetl

  • scriptLocation (str) - Specifies the S3 path to a script that executes a job.

connections = None

The list of connections used for this job.

default_arguments = None

The map of default arguments for this job. You can specify arguments here that your own job-execution script consumes, as well as arguments that AWS Glue itself consumes. For information about how to specify and consume your own Job arguments, see the Calling AWS Glue APIs in Python topic in the developer guide. For information about the key-value pairs that AWS Glue consumes to set up your job, see the Special Parameters Used by AWS Glue topic in the developer guide.

description = None

Description of the job.

execution_property = None

Execution property of the job. Defined below.

  • maxConcurrentRuns (float) - The maximum number of concurrent runs allowed for a job. The default is 1.

max_capacity = None

The maximum number of AWS Glue data processing units (DPUs) that can be allocated when this job runs.

max_retries = None

The maximum number of times to retry this job if it fails.

name = None

The name of the job command. Defaults to glueetl

role_arn = None

The ARN of the IAM role associated with this job.

security_configuration = None

The name of the Security Configuration to be associated with the job.

timeout = None

The job timeout in minutes. The default is 2880 minutes (48 hours).

static get(resource_name, id, opts=None, allocated_capacity=None, command=None, connections=None, default_arguments=None, description=None, execution_property=None, max_capacity=None, max_retries=None, name=None, role_arn=None, security_configuration=None, timeout=None)

Get an existing Job resource’s state with the given name, id, and optional extra properties used to qualify the lookup.

Parameters
  • resource_name (str) – The unique name of the resulting resource.

  • id (str) – The unique provider ID of the resource to lookup.

  • opts (pulumi.ResourceOptions) – Options for the resource.

  • allocated_capacity (pulumi.Input[float]) – DEPRECATED (Optional) The number of AWS Glue data processing units (DPUs) to allocate to this Job. At least 2 DPUs need to be allocated; the default is 10. A DPU is a relative measure of processing power that consists of 4 vCPUs of compute capacity and 16 GB of memory.

  • command (pulumi.Input[dict]) – The command of the job. Defined below.

  • connections (pulumi.Input[list]) – The list of connections used for this job.

  • default_arguments (pulumi.Input[dict]) –

    The map of default arguments for this job. You can specify arguments here that your own job-execution script consumes, as well as arguments that AWS Glue itself consumes. For information about how to specify and consume your own Job arguments, see the Calling AWS Glue APIs in Python topic in the developer guide. For information about the key-value pairs that AWS Glue consumes to set up your job, see the Special Parameters Used by AWS Glue topic in the developer guide.

  • description (pulumi.Input[str]) – Description of the job.

  • execution_property (pulumi.Input[dict]) – Execution property of the job. Defined below.

  • max_capacity (pulumi.Input[float]) – The maximum number of AWS Glue data processing units (DPUs) that can be allocated when this job runs.

  • max_retries (pulumi.Input[float]) – The maximum number of times to retry this job if it fails.

  • name (pulumi.Input[str]) – The name of the job command. Defaults to glueetl

  • role_arn (pulumi.Input[str]) – The ARN of the IAM role associated with this job.

  • security_configuration (pulumi.Input[str]) – The name of the Security Configuration to be associated with the job.

  • timeout (pulumi.Input[float]) – The job timeout in minutes. The default is 2880 minutes (48 hours).

The command object supports the following:

  • name (pulumi.Input[str]) - The name of the job command. Defaults to glueetl

  • scriptLocation (pulumi.Input[str]) - Specifies the S3 path to a script that executes a job.

The execution_property object supports the following:

  • maxConcurrentRuns (pulumi.Input[float]) - The maximum number of concurrent runs allowed for a job. The default is 1.

translate_output_property(prop)

Provides subclasses of Resource an opportunity to translate names of output properties into a format of their choosing before writing those properties to the resource object.

Parameters

prop (str) – A property name.

Returns

A potentially transformed property name.

Return type

str

translate_input_property(prop)

Provides subclasses of Resource an opportunity to translate names of input properties into a format of their choosing before sending those properties to the Pulumi engine.

Parameters

prop (str) – A property name.

Returns

A potentially transformed property name.

Return type

str

class pulumi_aws.glue.SecurityConfiguration(resource_name, opts=None, encryption_configuration=None, name=None, __props__=None, __name__=None, __opts__=None)

Manages a Glue Security Configuration.

Parameters
  • resource_name (str) – The name of the resource.

  • opts (pulumi.ResourceOptions) – Options for the resource.

  • encryption_configuration (pulumi.Input[dict]) – Configuration block containing encryption configuration. Detailed below.

  • name (pulumi.Input[str]) – Name of the security configuration.

The encryption_configuration object supports the following:

  • cloudwatchEncryption (pulumi.Input[dict])

    • cloudwatchEncryptionMode (pulumi.Input[str]) - Encryption mode to use for CloudWatch data. Valid values: DISABLED, SSE-KMS. Default value: DISABLED.

    • kms_key_arn (pulumi.Input[str]) - Amazon Resource Name (ARN) of the KMS key to be used to encrypt the data.

  • jobBookmarksEncryption (pulumi.Input[dict])

    • jobBookmarksEncryptionMode (pulumi.Input[str]) - Encryption mode to use for job bookmarks data. Valid values: CSE-KMS, DISABLED. Default value: DISABLED.

    • kms_key_arn (pulumi.Input[str]) - Amazon Resource Name (ARN) of the KMS key to be used to encrypt the data.

  • s3Encryption (pulumi.Input[dict]) - A s3_encryption block as described below, which contains encryption configuration for S3 data.

    • kms_key_arn (pulumi.Input[str]) - Amazon Resource Name (ARN) of the KMS key to be used to encrypt the data.

    • s3EncryptionMode (pulumi.Input[str]) - Encryption mode to use for S3 data. Valid values: DISABLED, SSE-KMS, SSE-S3. Default value: DISABLED.

encryption_configuration = None

Configuration block containing encryption configuration. Detailed below.

  • cloudwatchEncryption (dict)

    • cloudwatchEncryptionMode (str) - Encryption mode to use for CloudWatch data. Valid values: DISABLED, SSE-KMS. Default value: DISABLED.

    • kms_key_arn (str) - Amazon Resource Name (ARN) of the KMS key to be used to encrypt the data.

  • jobBookmarksEncryption (dict)

    • jobBookmarksEncryptionMode (str) - Encryption mode to use for job bookmarks data. Valid values: CSE-KMS, DISABLED. Default value: DISABLED.

    • kms_key_arn (str) - Amazon Resource Name (ARN) of the KMS key to be used to encrypt the data.

  • s3Encryption (dict) - A s3_encryption block as described below, which contains encryption configuration for S3 data.

    • kms_key_arn (str) - Amazon Resource Name (ARN) of the KMS key to be used to encrypt the data.

    • s3EncryptionMode (str) - Encryption mode to use for S3 data. Valid values: DISABLED, SSE-KMS, SSE-S3. Default value: DISABLED.

name = None

Name of the security configuration.

static get(resource_name, id, opts=None, encryption_configuration=None, name=None)

Get an existing SecurityConfiguration resource’s state with the given name, id, and optional extra properties used to qualify the lookup.

Parameters
  • resource_name (str) – The unique name of the resulting resource.

  • id (str) – The unique provider ID of the resource to lookup.

  • opts (pulumi.ResourceOptions) – Options for the resource.

  • encryption_configuration (pulumi.Input[dict]) – Configuration block containing encryption configuration. Detailed below.

  • name (pulumi.Input[str]) – Name of the security configuration.

The encryption_configuration object supports the following:

  • cloudwatchEncryption (pulumi.Input[dict])

    • cloudwatchEncryptionMode (pulumi.Input[str]) - Encryption mode to use for CloudWatch data. Valid values: DISABLED, SSE-KMS. Default value: DISABLED.

    • kms_key_arn (pulumi.Input[str]) - Amazon Resource Name (ARN) of the KMS key to be used to encrypt the data.

  • jobBookmarksEncryption (pulumi.Input[dict])

    • jobBookmarksEncryptionMode (pulumi.Input[str]) - Encryption mode to use for job bookmarks data. Valid values: CSE-KMS, DISABLED. Default value: DISABLED.

    • kms_key_arn (pulumi.Input[str]) - Amazon Resource Name (ARN) of the KMS key to be used to encrypt the data.

  • s3Encryption (pulumi.Input[dict]) - A s3_encryption block as described below, which contains encryption configuration for S3 data.

    • kms_key_arn (pulumi.Input[str]) - Amazon Resource Name (ARN) of the KMS key to be used to encrypt the data.

    • s3EncryptionMode (pulumi.Input[str]) - Encryption mode to use for S3 data. Valid values: DISABLED, SSE-KMS, SSE-S3. Default value: DISABLED.

translate_output_property(prop)

Provides subclasses of Resource an opportunity to translate names of output properties into a format of their choosing before writing those properties to the resource object.

Parameters

prop (str) – A property name.

Returns

A potentially transformed property name.

Return type

str

translate_input_property(prop)

Provides subclasses of Resource an opportunity to translate names of input properties into a format of their choosing before sending those properties to the Pulumi engine.

Parameters

prop (str) – A property name.

Returns

A potentially transformed property name.

Return type

str

class pulumi_aws.glue.Trigger(resource_name, opts=None, actions=None, description=None, enabled=None, name=None, predicate=None, schedule=None, type=None, __props__=None, __name__=None, __opts__=None)

Manages a Glue Trigger resource.

Parameters
  • resource_name (str) – The name of the resource.

  • opts (pulumi.ResourceOptions) – Options for the resource.

  • actions (pulumi.Input[list]) – List of actions initiated by this trigger when it fires. Defined below.

  • description (pulumi.Input[str]) – A description of the new trigger.

  • enabled (pulumi.Input[bool]) – Start the trigger. Defaults to true. Not valid to disable for ON_DEMAND type.

  • name (pulumi.Input[str]) – The name of the trigger.

  • predicate (pulumi.Input[dict]) – A predicate to specify when the new trigger should fire. Required when trigger type is CONDITIONAL. Defined below.

  • schedule (pulumi.Input[str]) –

    A cron expression used to specify the schedule. Time-Based Schedules for Jobs and Crawlers

  • type (pulumi.Input[str]) – The type of trigger. Valid values are CONDITIONAL, ON_DEMAND, and SCHEDULED.

The actions object supports the following:

  • arguments (pulumi.Input[dict]) - Arguments to be passed to the job. You can specify arguments here that your own job-execution script consumes, as well as arguments that AWS Glue itself consumes.

  • jobName (pulumi.Input[str]) - The name of the job to watch.

  • timeout (pulumi.Input[float]) - The job run timeout in minutes. It overrides the timeout value of the job.

The predicate object supports the following:

  • conditions (pulumi.Input[list]) - A list of the conditions that determine when the trigger will fire. Defined below.

    • jobName (pulumi.Input[str]) - The name of the job to watch.

    • logicalOperator (pulumi.Input[str]) - A logical operator. Defaults to EQUALS.

    • state (pulumi.Input[str]) - The condition state. Currently, the values supported are SUCCEEDED, STOPPED, TIMEOUT and FAILED.

  • logical (pulumi.Input[str]) - How to handle multiple conditions. Defaults to AND. Valid values are AND or ANY.

actions = None

List of actions initiated by this trigger when it fires. Defined below.

  • arguments (dict) - Arguments to be passed to the job. You can specify arguments here that your own job-execution script consumes, as well as arguments that AWS Glue itself consumes.

  • jobName (str) - The name of the job to watch.

  • timeout (float) - The job run timeout in minutes. It overrides the timeout value of the job.

description = None

A description of the new trigger.

enabled = None

Start the trigger. Defaults to true. Not valid to disable for ON_DEMAND type.

name = None

The name of the trigger.

predicate = None

A predicate to specify when the new trigger should fire. Required when trigger type is CONDITIONAL. Defined below.

  • conditions (list) - A list of the conditions that determine when the trigger will fire. Defined below.

    • jobName (str) - The name of the job to watch.

    • logicalOperator (str) - A logical operator. Defaults to EQUALS.

    • state (str) - The condition state. Currently, the values supported are SUCCEEDED, STOPPED, TIMEOUT and FAILED.

  • logical (str) - How to handle multiple conditions. Defaults to AND. Valid values are AND or ANY.

schedule = None

A cron expression used to specify the schedule. Time-Based Schedules for Jobs and Crawlers

type = None

The type of trigger. Valid values are CONDITIONAL, ON_DEMAND, and SCHEDULED.

static get(resource_name, id, opts=None, actions=None, description=None, enabled=None, name=None, predicate=None, schedule=None, type=None)

Get an existing Trigger resource’s state with the given name, id, and optional extra properties used to qualify the lookup.

Parameters
  • resource_name (str) – The unique name of the resulting resource.

  • id (str) – The unique provider ID of the resource to lookup.

  • opts (pulumi.ResourceOptions) – Options for the resource.

  • actions (pulumi.Input[list]) – List of actions initiated by this trigger when it fires. Defined below.

  • description (pulumi.Input[str]) – A description of the new trigger.

  • enabled (pulumi.Input[bool]) – Start the trigger. Defaults to true. Not valid to disable for ON_DEMAND type.

  • name (pulumi.Input[str]) – The name of the trigger.

  • predicate (pulumi.Input[dict]) – A predicate to specify when the new trigger should fire. Required when trigger type is CONDITIONAL. Defined below.

  • schedule (pulumi.Input[str]) –

    A cron expression used to specify the schedule. Time-Based Schedules for Jobs and Crawlers

  • type (pulumi.Input[str]) – The type of trigger. Valid values are CONDITIONAL, ON_DEMAND, and SCHEDULED.

The actions object supports the following:

  • arguments (pulumi.Input[dict]) - Arguments to be passed to the job. You can specify arguments here that your own job-execution script consumes, as well as arguments that AWS Glue itself consumes.

  • jobName (pulumi.Input[str]) - The name of the job to watch.

  • timeout (pulumi.Input[float]) - The job run timeout in minutes. It overrides the timeout value of the job.

The predicate object supports the following:

  • conditions (pulumi.Input[list]) - A list of the conditions that determine when the trigger will fire. Defined below.

    • jobName (pulumi.Input[str]) - The name of the job to watch.

    • logicalOperator (pulumi.Input[str]) - A logical operator. Defaults to EQUALS.

    • state (pulumi.Input[str]) - The condition state. Currently, the values supported are SUCCEEDED, STOPPED, TIMEOUT and FAILED.

  • logical (pulumi.Input[str]) - How to handle multiple conditions. Defaults to AND. Valid values are AND or ANY.

translate_output_property(prop)

Provides subclasses of Resource an opportunity to translate names of output properties into a format of their choosing before writing those properties to the resource object.

Parameters

prop (str) – A property name.

Returns

A potentially transformed property name.

Return type

str

translate_input_property(prop)

Provides subclasses of Resource an opportunity to translate names of input properties into a format of their choosing before sending those properties to the Pulumi engine.

Parameters

prop (str) – A property name.

Returns

A potentially transformed property name.

Return type

str

pulumi_aws.glue.get_script(dag_edges=None, dag_nodes=None, language=None, opts=None)

Use this data source to generate a Glue script from a Directed Acyclic Graph (DAG).

Parameters
  • dag_edges (list) – A list of the edges in the DAG. Defined below.

  • dag_nodes (list) – A list of the nodes in the DAG. Defined below.

  • language (str) – The programming language of the resulting code from the DAG. Defaults to PYTHON. Valid values are PYTHON and SCALA.

The dag_edges object supports the following:

  • source (str) - The ID of the node at which the edge starts.

  • target (str) - The ID of the node at which the edge ends.

  • targetParameter (str) - The target of the edge.

The dag_nodes object supports the following:

  • args (list) - Nested configuration an argument or property of a node. Defined below.

    • name (str) - The name of the argument or property.

    • param (bool) - Boolean if the value is used as a parameter. Defaults to false.

    • value (str) - The value of the argument or property.

  • id (str) - A node identifier that is unique within the node’s graph.

  • lineNumber (float) - The line number of the node.

  • node_type (str) - The type of node this is.