We recommend new projects start with resources from the AWS provider.
aws-native.databrew.getJob
Explore with Pulumi AI
We recommend new projects start with resources from the AWS provider.
Resource schema for AWS::DataBrew::Job.
Using getJob
Two invocation forms are available. The direct form accepts plain arguments and either blocks until the result value is available, or returns a Promise-wrapped result. The output form accepts Input-wrapped arguments and returns an Output-wrapped result.
function getJob(args: GetJobArgs, opts?: InvokeOptions): Promise<GetJobResult>
function getJobOutput(args: GetJobOutputArgs, opts?: InvokeOptions): Output<GetJobResult>
def get_job(name: Optional[str] = None,
opts: Optional[InvokeOptions] = None) -> GetJobResult
def get_job_output(name: Optional[pulumi.Input[str]] = None,
opts: Optional[InvokeOptions] = None) -> Output[GetJobResult]
func LookupJob(ctx *Context, args *LookupJobArgs, opts ...InvokeOption) (*LookupJobResult, error)
func LookupJobOutput(ctx *Context, args *LookupJobOutputArgs, opts ...InvokeOption) LookupJobResultOutput
> Note: This function is named LookupJob
in the Go SDK.
public static class GetJob
{
public static Task<GetJobResult> InvokeAsync(GetJobArgs args, InvokeOptions? opts = null)
public static Output<GetJobResult> Invoke(GetJobInvokeArgs args, InvokeOptions? opts = null)
}
public static CompletableFuture<GetJobResult> getJob(GetJobArgs args, InvokeOptions options)
// Output-based functions aren't available in Java yet
fn::invoke:
function: aws-native:databrew:getJob
arguments:
# arguments dictionary
The following arguments are supported:
- Name string
- Job name
- Name string
- Job name
- name String
- Job name
- name string
- Job name
- name str
- Job name
- name String
- Job name
getJob Result
The following output properties are available:
- Data
Catalog List<Pulumi.Outputs Aws Native. Data Brew. Outputs. Job Data Catalog Output> - One or more artifacts that represent the AWS Glue Data Catalog output from running the job.
- Database
Outputs List<Pulumi.Aws Native. Data Brew. Outputs. Job Database Output> - Represents a list of JDBC database output objects which defines the output destination for a DataBrew recipe job to write into.
- Dataset
Name string - Dataset name
- Encryption
Key stringArn - Encryption Key Arn
- Encryption
Mode Pulumi.Aws Native. Data Brew. Job Encryption Mode - Encryption mode
- Job
Sample Pulumi.Aws Native. Data Brew. Outputs. Job Sample - Job Sample
- Log
Subscription Pulumi.Aws Native. Data Brew. Job Log Subscription - Log subscription
- Max
Capacity int - Max capacity
- Max
Retries int - Max retries
- Output
Location Pulumi.Aws Native. Data Brew. Outputs. Job Output Location - Output location
- Outputs
List<Pulumi.
Aws Native. Data Brew. Outputs. Job Output> - One or more artifacts that represent output from running the job.
- Profile
Configuration Pulumi.Aws Native. Data Brew. Outputs. Job Profile Configuration - Profile Job configuration
- Project
Name string - Project name
- Recipe
Pulumi.
Aws Native. Data Brew. Outputs. Job Recipe - A series of data transformation steps that the job runs.
- Role
Arn string - Role arn
- Timeout int
- Timeout
- Validation
Configurations List<Pulumi.Aws Native. Data Brew. Outputs. Job Validation Configuration> - Data quality rules configuration
- Data
Catalog []JobOutputs Data Catalog Output - One or more artifacts that represent the AWS Glue Data Catalog output from running the job.
- Database
Outputs []JobDatabase Output - Represents a list of JDBC database output objects which defines the output destination for a DataBrew recipe job to write into.
- Dataset
Name string - Dataset name
- Encryption
Key stringArn - Encryption Key Arn
- Encryption
Mode JobEncryption Mode - Encryption mode
- Job
Sample JobSample - Job Sample
- Log
Subscription JobLog Subscription - Log subscription
- Max
Capacity int - Max capacity
- Max
Retries int - Max retries
- Output
Location JobOutput Location - Output location
- Outputs
[]Job
Output Type - One or more artifacts that represent output from running the job.
- Profile
Configuration JobProfile Configuration - Profile Job configuration
- Project
Name string - Project name
- Recipe
Job
Recipe - A series of data transformation steps that the job runs.
- Role
Arn string - Role arn
- Timeout int
- Timeout
- Validation
Configurations []JobValidation Configuration - Data quality rules configuration
- data
Catalog List<JobOutputs Data Catalog Output> - One or more artifacts that represent the AWS Glue Data Catalog output from running the job.
- database
Outputs List<JobDatabase Output> - Represents a list of JDBC database output objects which defines the output destination for a DataBrew recipe job to write into.
- dataset
Name String - Dataset name
- encryption
Key StringArn - Encryption Key Arn
- encryption
Mode JobEncryption Mode - Encryption mode
- job
Sample JobSample - Job Sample
- log
Subscription JobLog Subscription - Log subscription
- max
Capacity Integer - Max capacity
- max
Retries Integer - Max retries
- output
Location JobOutput Location - Output location
- outputs
List<Job
Output> - One or more artifacts that represent output from running the job.
- profile
Configuration JobProfile Configuration - Profile Job configuration
- project
Name String - Project name
- recipe
Job
Recipe - A series of data transformation steps that the job runs.
- role
Arn String - Role arn
- timeout Integer
- Timeout
- validation
Configurations List<JobValidation Configuration> - Data quality rules configuration
- data
Catalog JobOutputs Data Catalog Output[] - One or more artifacts that represent the AWS Glue Data Catalog output from running the job.
- database
Outputs JobDatabase Output[] - Represents a list of JDBC database output objects which defines the output destination for a DataBrew recipe job to write into.
- dataset
Name string - Dataset name
- encryption
Key stringArn - Encryption Key Arn
- encryption
Mode JobEncryption Mode - Encryption mode
- job
Sample JobSample - Job Sample
- log
Subscription JobLog Subscription - Log subscription
- max
Capacity number - Max capacity
- max
Retries number - Max retries
- output
Location JobOutput Location - Output location
- outputs
Job
Output[] - One or more artifacts that represent output from running the job.
- profile
Configuration JobProfile Configuration - Profile Job configuration
- project
Name string - Project name
- recipe
Job
Recipe - A series of data transformation steps that the job runs.
- role
Arn string - Role arn
- timeout number
- Timeout
- validation
Configurations JobValidation Configuration[] - Data quality rules configuration
- data_
catalog_ Sequence[Joboutputs Data Catalog Output] - One or more artifacts that represent the AWS Glue Data Catalog output from running the job.
- database_
outputs Sequence[JobDatabase Output] - Represents a list of JDBC database output objects which defines the output destination for a DataBrew recipe job to write into.
- dataset_
name str - Dataset name
- encryption_
key_ strarn - Encryption Key Arn
- encryption_
mode JobEncryption Mode - Encryption mode
- job_
sample JobSample - Job Sample
- log_
subscription JobLog Subscription - Log subscription
- max_
capacity int - Max capacity
- max_
retries int - Max retries
- output_
location JobOutput Location - Output location
- outputs
Sequence[Job
Output] - One or more artifacts that represent output from running the job.
- profile_
configuration JobProfile Configuration - Profile Job configuration
- project_
name str - Project name
- recipe
Job
Recipe - A series of data transformation steps that the job runs.
- role_
arn str - Role arn
- timeout int
- Timeout
- validation_
configurations Sequence[JobValidation Configuration] - Data quality rules configuration
- data
Catalog List<Property Map>Outputs - One or more artifacts that represent the AWS Glue Data Catalog output from running the job.
- database
Outputs List<Property Map> - Represents a list of JDBC database output objects which defines the output destination for a DataBrew recipe job to write into.
- dataset
Name String - Dataset name
- encryption
Key StringArn - Encryption Key Arn
- encryption
Mode "SSE-KMS" | "SSE-S3" - Encryption mode
- job
Sample Property Map - Job Sample
- log
Subscription "ENABLE" | "DISABLE" - Log subscription
- max
Capacity Number - Max capacity
- max
Retries Number - Max retries
- output
Location Property Map - Output location
- outputs List<Property Map>
- One or more artifacts that represent output from running the job.
- profile
Configuration Property Map - Profile Job configuration
- project
Name String - Project name
- recipe Property Map
- A series of data transformation steps that the job runs.
- role
Arn String - Role arn
- timeout Number
- Timeout
- validation
Configurations List<Property Map> - Data quality rules configuration
Supporting Types
JobAllowedStatistics
- Statistics List<string>
- One or more column statistics to allow for columns that contain detected entities.
- Statistics []string
- One or more column statistics to allow for columns that contain detected entities.
- statistics List<String>
- One or more column statistics to allow for columns that contain detected entities.
- statistics string[]
- One or more column statistics to allow for columns that contain detected entities.
- statistics Sequence[str]
- One or more column statistics to allow for columns that contain detected entities.
- statistics List<String>
- One or more column statistics to allow for columns that contain detected entities.
JobColumnSelector
JobColumnStatisticsConfiguration
- Statistics
Pulumi.
Aws Native. Data Brew. Inputs. Job Statistics Configuration - Configuration for evaluations. Statistics can be used to select evaluations and override parameters of evaluations.
- Selectors
List<Pulumi.
Aws Native. Data Brew. Inputs. Job Column Selector> - List of column selectors. Selectors can be used to select columns from the dataset. When selectors are undefined, configuration will be applied to all supported columns.
- Statistics
Job
Statistics Configuration - Configuration for evaluations. Statistics can be used to select evaluations and override parameters of evaluations.
- Selectors
[]Job
Column Selector - List of column selectors. Selectors can be used to select columns from the dataset. When selectors are undefined, configuration will be applied to all supported columns.
- statistics
Job
Statistics Configuration - Configuration for evaluations. Statistics can be used to select evaluations and override parameters of evaluations.
- selectors
List<Job
Column Selector> - List of column selectors. Selectors can be used to select columns from the dataset. When selectors are undefined, configuration will be applied to all supported columns.
- statistics
Job
Statistics Configuration - Configuration for evaluations. Statistics can be used to select evaluations and override parameters of evaluations.
- selectors
Job
Column Selector[] - List of column selectors. Selectors can be used to select columns from the dataset. When selectors are undefined, configuration will be applied to all supported columns.
- statistics
Job
Statistics Configuration - Configuration for evaluations. Statistics can be used to select evaluations and override parameters of evaluations.
- selectors
Sequence[Job
Column Selector] - List of column selectors. Selectors can be used to select columns from the dataset. When selectors are undefined, configuration will be applied to all supported columns.
- statistics Property Map
- Configuration for evaluations. Statistics can be used to select evaluations and override parameters of evaluations.
- selectors List<Property Map>
- List of column selectors. Selectors can be used to select columns from the dataset. When selectors are undefined, configuration will be applied to all supported columns.
JobCsvOutputOptions
- Delimiter string
- A single character that specifies the delimiter used to create CSV job output.
- Delimiter string
- A single character that specifies the delimiter used to create CSV job output.
- delimiter String
- A single character that specifies the delimiter used to create CSV job output.
- delimiter string
- A single character that specifies the delimiter used to create CSV job output.
- delimiter str
- A single character that specifies the delimiter used to create CSV job output.
- delimiter String
- A single character that specifies the delimiter used to create CSV job output.
JobDataCatalogOutput
- Database
Name string - The name of a database in the Data Catalog.
- Table
Name string - The name of a table in the Data Catalog.
- Catalog
Id string - The unique identifier of the AWS account that holds the Data Catalog that stores the data.
- Database
Options Pulumi.Aws Native. Data Brew. Inputs. Job Database Table Output Options - Represents options that specify how and where DataBrew writes the database output generated by recipe jobs.
- Overwrite bool
- A value that, if true, means that any data in the location specified for output is overwritten with new output. Not supported with DatabaseOptions.
- S3Options
Pulumi.
Aws Native. Data Brew. Inputs. Job S3Table Output Options - Represents options that specify how and where DataBrew writes the Amazon S3 output generated by recipe jobs.
- Database
Name string - The name of a database in the Data Catalog.
- Table
Name string - The name of a table in the Data Catalog.
- Catalog
Id string - The unique identifier of the AWS account that holds the Data Catalog that stores the data.
- Database
Options JobDatabase Table Output Options - Represents options that specify how and where DataBrew writes the database output generated by recipe jobs.
- Overwrite bool
- A value that, if true, means that any data in the location specified for output is overwritten with new output. Not supported with DatabaseOptions.
- S3Options
Job
S3Table Output Options - Represents options that specify how and where DataBrew writes the Amazon S3 output generated by recipe jobs.
- database
Name String - The name of a database in the Data Catalog.
- table
Name String - The name of a table in the Data Catalog.
- catalog
Id String - The unique identifier of the AWS account that holds the Data Catalog that stores the data.
- database
Options JobDatabase Table Output Options - Represents options that specify how and where DataBrew writes the database output generated by recipe jobs.
- overwrite Boolean
- A value that, if true, means that any data in the location specified for output is overwritten with new output. Not supported with DatabaseOptions.
- s3Options
Job
S3Table Output Options - Represents options that specify how and where DataBrew writes the Amazon S3 output generated by recipe jobs.
- database
Name string - The name of a database in the Data Catalog.
- table
Name string - The name of a table in the Data Catalog.
- catalog
Id string - The unique identifier of the AWS account that holds the Data Catalog that stores the data.
- database
Options JobDatabase Table Output Options - Represents options that specify how and where DataBrew writes the database output generated by recipe jobs.
- overwrite boolean
- A value that, if true, means that any data in the location specified for output is overwritten with new output. Not supported with DatabaseOptions.
- s3Options
Job
S3Table Output Options - Represents options that specify how and where DataBrew writes the Amazon S3 output generated by recipe jobs.
- database_
name str - The name of a database in the Data Catalog.
- table_
name str - The name of a table in the Data Catalog.
- catalog_
id str - The unique identifier of the AWS account that holds the Data Catalog that stores the data.
- database_
options JobDatabase Table Output Options - Represents options that specify how and where DataBrew writes the database output generated by recipe jobs.
- overwrite bool
- A value that, if true, means that any data in the location specified for output is overwritten with new output. Not supported with DatabaseOptions.
- s3_
options JobS3Table Output Options - Represents options that specify how and where DataBrew writes the Amazon S3 output generated by recipe jobs.
- database
Name String - The name of a database in the Data Catalog.
- table
Name String - The name of a table in the Data Catalog.
- catalog
Id String - The unique identifier of the AWS account that holds the Data Catalog that stores the data.
- database
Options Property Map - Represents options that specify how and where DataBrew writes the database output generated by recipe jobs.
- overwrite Boolean
- A value that, if true, means that any data in the location specified for output is overwritten with new output. Not supported with DatabaseOptions.
- s3Options Property Map
- Represents options that specify how and where DataBrew writes the Amazon S3 output generated by recipe jobs.
JobDatabaseOutput
- Database
Options Pulumi.Aws Native. Data Brew. Inputs. Job Database Table Output Options - Represents options that specify how and where DataBrew writes the database output generated by recipe jobs.
- Glue
Connection stringName - Glue connection name
- Database
Output Pulumi.Mode Aws Native. Data Brew. Job Database Output Database Output Mode - Database table name
- Database
Options JobDatabase Table Output Options - Represents options that specify how and where DataBrew writes the database output generated by recipe jobs.
- Glue
Connection stringName - Glue connection name
- Database
Output JobMode Database Output Database Output Mode - Database table name
- database
Options JobDatabase Table Output Options - Represents options that specify how and where DataBrew writes the database output generated by recipe jobs.
- glue
Connection StringName - Glue connection name
- database
Output JobMode Database Output Database Output Mode - Database table name
- database
Options JobDatabase Table Output Options - Represents options that specify how and where DataBrew writes the database output generated by recipe jobs.
- glue
Connection stringName - Glue connection name
- database
Output JobMode Database Output Database Output Mode - Database table name
- database_
options JobDatabase Table Output Options - Represents options that specify how and where DataBrew writes the database output generated by recipe jobs.
- glue_
connection_ strname - Glue connection name
- database_
output_ Jobmode Database Output Database Output Mode - Database table name
- database
Options Property Map - Represents options that specify how and where DataBrew writes the database output generated by recipe jobs.
- glue
Connection StringName - Glue connection name
- database
Output "NEW_TABLE"Mode - Database table name
JobDatabaseOutputDatabaseOutputMode
JobDatabaseTableOutputOptions
- Table
Name string - A prefix for the name of a table DataBrew will create in the database.
- Temp
Directory Pulumi.Aws Native. Data Brew. Inputs. Job S3Location - Represents an Amazon S3 location (bucket name and object key) where DataBrew can store intermediate results.
- Table
Name string - A prefix for the name of a table DataBrew will create in the database.
- Temp
Directory JobS3Location - Represents an Amazon S3 location (bucket name and object key) where DataBrew can store intermediate results.
- table
Name String - A prefix for the name of a table DataBrew will create in the database.
- temp
Directory JobS3Location - Represents an Amazon S3 location (bucket name and object key) where DataBrew can store intermediate results.
- table
Name string - A prefix for the name of a table DataBrew will create in the database.
- temp
Directory JobS3Location - Represents an Amazon S3 location (bucket name and object key) where DataBrew can store intermediate results.
- table_
name str - A prefix for the name of a table DataBrew will create in the database.
- temp_
directory JobS3Location - Represents an Amazon S3 location (bucket name and object key) where DataBrew can store intermediate results.
- table
Name String - A prefix for the name of a table DataBrew will create in the database.
- temp
Directory Property Map - Represents an Amazon S3 location (bucket name and object key) where DataBrew can store intermediate results.
JobEncryptionMode
JobEntityDetectorConfiguration
- Entity
Types List<string> Entity types to detect. Can be any of the following:
- USA_SSN
- USA_ITIN
- USA_PASSPORT_NUMBER
- PHONE_NUMBER
- USA_DRIVING_LICENSE
- BANK_ACCOUNT
- CREDIT_CARD
- IP_ADDRESS
- MAC_ADDRESS
- USA_DEA_NUMBER
- USA_HCPCS_CODE
- USA_NATIONAL_PROVIDER_IDENTIFIER
- USA_NATIONAL_DRUG_CODE
- USA_HEALTH_INSURANCE_CLAIM_NUMBER
- USA_MEDICARE_BENEFICIARY_IDENTIFIER
- USA_CPT_CODE
- PERSON_NAME
- DATE
The Entity type group USA_ALL is also supported, and includes all of the above entity types except PERSON_NAME and DATE.
- Allowed
Statistics Pulumi.Aws Native. Data Brew. Inputs. Job Allowed Statistics - Configuration of statistics that are allowed to be run on columns that contain detected entities. When undefined, no statistics will be computed on columns that contain detected entities.
- Entity
Types []string Entity types to detect. Can be any of the following:
- USA_SSN
- USA_ITIN
- USA_PASSPORT_NUMBER
- PHONE_NUMBER
- USA_DRIVING_LICENSE
- BANK_ACCOUNT
- CREDIT_CARD
- IP_ADDRESS
- MAC_ADDRESS
- USA_DEA_NUMBER
- USA_HCPCS_CODE
- USA_NATIONAL_PROVIDER_IDENTIFIER
- USA_NATIONAL_DRUG_CODE
- USA_HEALTH_INSURANCE_CLAIM_NUMBER
- USA_MEDICARE_BENEFICIARY_IDENTIFIER
- USA_CPT_CODE
- PERSON_NAME
- DATE
The Entity type group USA_ALL is also supported, and includes all of the above entity types except PERSON_NAME and DATE.
- Allowed
Statistics JobAllowed Statistics - Configuration of statistics that are allowed to be run on columns that contain detected entities. When undefined, no statistics will be computed on columns that contain detected entities.
- entity
Types List<String> Entity types to detect. Can be any of the following:
- USA_SSN
- USA_ITIN
- USA_PASSPORT_NUMBER
- PHONE_NUMBER
- USA_DRIVING_LICENSE
- BANK_ACCOUNT
- CREDIT_CARD
- IP_ADDRESS
- MAC_ADDRESS
- USA_DEA_NUMBER
- USA_HCPCS_CODE
- USA_NATIONAL_PROVIDER_IDENTIFIER
- USA_NATIONAL_DRUG_CODE
- USA_HEALTH_INSURANCE_CLAIM_NUMBER
- USA_MEDICARE_BENEFICIARY_IDENTIFIER
- USA_CPT_CODE
- PERSON_NAME
- DATE
The Entity type group USA_ALL is also supported, and includes all of the above entity types except PERSON_NAME and DATE.
- allowed
Statistics JobAllowed Statistics - Configuration of statistics that are allowed to be run on columns that contain detected entities. When undefined, no statistics will be computed on columns that contain detected entities.
- entity
Types string[] Entity types to detect. Can be any of the following:
- USA_SSN
- USA_ITIN
- USA_PASSPORT_NUMBER
- PHONE_NUMBER
- USA_DRIVING_LICENSE
- BANK_ACCOUNT
- CREDIT_CARD
- IP_ADDRESS
- MAC_ADDRESS
- USA_DEA_NUMBER
- USA_HCPCS_CODE
- USA_NATIONAL_PROVIDER_IDENTIFIER
- USA_NATIONAL_DRUG_CODE
- USA_HEALTH_INSURANCE_CLAIM_NUMBER
- USA_MEDICARE_BENEFICIARY_IDENTIFIER
- USA_CPT_CODE
- PERSON_NAME
- DATE
The Entity type group USA_ALL is also supported, and includes all of the above entity types except PERSON_NAME and DATE.
- allowed
Statistics JobAllowed Statistics - Configuration of statistics that are allowed to be run on columns that contain detected entities. When undefined, no statistics will be computed on columns that contain detected entities.
- entity_
types Sequence[str] Entity types to detect. Can be any of the following:
- USA_SSN
- USA_ITIN
- USA_PASSPORT_NUMBER
- PHONE_NUMBER
- USA_DRIVING_LICENSE
- BANK_ACCOUNT
- CREDIT_CARD
- IP_ADDRESS
- MAC_ADDRESS
- USA_DEA_NUMBER
- USA_HCPCS_CODE
- USA_NATIONAL_PROVIDER_IDENTIFIER
- USA_NATIONAL_DRUG_CODE
- USA_HEALTH_INSURANCE_CLAIM_NUMBER
- USA_MEDICARE_BENEFICIARY_IDENTIFIER
- USA_CPT_CODE
- PERSON_NAME
- DATE
The Entity type group USA_ALL is also supported, and includes all of the above entity types except PERSON_NAME and DATE.
- allowed_
statistics JobAllowed Statistics - Configuration of statistics that are allowed to be run on columns that contain detected entities. When undefined, no statistics will be computed on columns that contain detected entities.
- entity
Types List<String> Entity types to detect. Can be any of the following:
- USA_SSN
- USA_ITIN
- USA_PASSPORT_NUMBER
- PHONE_NUMBER
- USA_DRIVING_LICENSE
- BANK_ACCOUNT
- CREDIT_CARD
- IP_ADDRESS
- MAC_ADDRESS
- USA_DEA_NUMBER
- USA_HCPCS_CODE
- USA_NATIONAL_PROVIDER_IDENTIFIER
- USA_NATIONAL_DRUG_CODE
- USA_HEALTH_INSURANCE_CLAIM_NUMBER
- USA_MEDICARE_BENEFICIARY_IDENTIFIER
- USA_CPT_CODE
- PERSON_NAME
- DATE
The Entity type group USA_ALL is also supported, and includes all of the above entity types except PERSON_NAME and DATE.
- allowed
Statistics Property Map - Configuration of statistics that are allowed to be run on columns that contain detected entities. When undefined, no statistics will be computed on columns that contain detected entities.
JobLogSubscription
JobOutput
- Location
Pulumi.
Aws Native. Data Brew. Inputs. Job S3Location - The location in Amazon S3 where the job writes its output.
- Compression
Format Pulumi.Aws Native. Data Brew. Job Output Compression Format - The compression algorithm used to compress the output text of the job.
- Format
Pulumi.
Aws Native. Data Brew. Job Output Format - The data format of the output of the job.
- Format
Options Pulumi.Aws Native. Data Brew. Inputs. Job Output Format Options - Represents options that define how DataBrew formats job output files.
- Max
Output intFiles - The maximum number of files to be generated by the job and written to the output folder.
- Overwrite bool
- A value that, if true, means that any data in the location specified for output is overwritten with new output.
- Partition
Columns List<string> - The names of one or more partition columns for the output of the job.
- Location
Job
S3Location - The location in Amazon S3 where the job writes its output.
- Compression
Format JobOutput Compression Format - The compression algorithm used to compress the output text of the job.
- Format
Job
Output Format - The data format of the output of the job.
- Format
Options JobOutput Format Options - Represents options that define how DataBrew formats job output files.
- Max
Output intFiles - The maximum number of files to be generated by the job and written to the output folder.
- Overwrite bool
- A value that, if true, means that any data in the location specified for output is overwritten with new output.
- Partition
Columns []string - The names of one or more partition columns for the output of the job.
- location
Job
S3Location - The location in Amazon S3 where the job writes its output.
- compression
Format JobOutput Compression Format - The compression algorithm used to compress the output text of the job.
- format
Job
Output Format - The data format of the output of the job.
- format
Options JobOutput Format Options - Represents options that define how DataBrew formats job output files.
- max
Output IntegerFiles - The maximum number of files to be generated by the job and written to the output folder.
- overwrite Boolean
- A value that, if true, means that any data in the location specified for output is overwritten with new output.
- partition
Columns List<String> - The names of one or more partition columns for the output of the job.
- location
Job
S3Location - The location in Amazon S3 where the job writes its output.
- compression
Format JobOutput Compression Format - The compression algorithm used to compress the output text of the job.
- format
Job
Output Format - The data format of the output of the job.
- format
Options JobOutput Format Options - Represents options that define how DataBrew formats job output files.
- max
Output numberFiles - The maximum number of files to be generated by the job and written to the output folder.
- overwrite boolean
- A value that, if true, means that any data in the location specified for output is overwritten with new output.
- partition
Columns string[] - The names of one or more partition columns for the output of the job.
- location
Job
S3Location - The location in Amazon S3 where the job writes its output.
- compression_
format JobOutput Compression Format - The compression algorithm used to compress the output text of the job.
- format
Job
Output Format - The data format of the output of the job.
- format_
options JobOutput Format Options - Represents options that define how DataBrew formats job output files.
- max_
output_ intfiles - The maximum number of files to be generated by the job and written to the output folder.
- overwrite bool
- A value that, if true, means that any data in the location specified for output is overwritten with new output.
- partition_
columns Sequence[str] - The names of one or more partition columns for the output of the job.
- location Property Map
- The location in Amazon S3 where the job writes its output.
- compression
Format "GZIP" | "LZ4" | "SNAPPY" | "BZIP2" | "DEFLATE" | "LZO" | "BROTLI" | "ZSTD" | "ZLIB" - The compression algorithm used to compress the output text of the job.
- format "CSV" | "JSON" | "PARQUET" | "GLUEPARQUET" | "AVRO" | "ORC" | "XML" | "TABLEAUHYPER"
- The data format of the output of the job.
- format
Options Property Map - Represents options that define how DataBrew formats job output files.
- max
Output NumberFiles - The maximum number of files to be generated by the job and written to the output folder.
- overwrite Boolean
- A value that, if true, means that any data in the location specified for output is overwritten with new output.
- partition
Columns List<String> - The names of one or more partition columns for the output of the job.
JobOutputCompressionFormat
JobOutputFormat
JobOutputFormatOptions
- Csv
Pulumi.
Aws Native. Data Brew. Inputs. Job Csv Output Options - Represents a set of options that define the structure of comma-separated value (CSV) job output.
- Csv
Job
Csv Output Options - Represents a set of options that define the structure of comma-separated value (CSV) job output.
- csv
Job
Csv Output Options - Represents a set of options that define the structure of comma-separated value (CSV) job output.
- csv
Job
Csv Output Options - Represents a set of options that define the structure of comma-separated value (CSV) job output.
- csv
Job
Csv Output Options - Represents a set of options that define the structure of comma-separated value (CSV) job output.
- csv Property Map
- Represents a set of options that define the structure of comma-separated value (CSV) job output.
JobOutputLocation
- Bucket string
- The Amazon S3 bucket name.
- Bucket
Owner string - Key string
- The unique name of the object in the bucket.
- Bucket string
- The Amazon S3 bucket name.
- Bucket
Owner string - Key string
- The unique name of the object in the bucket.
- bucket String
- The Amazon S3 bucket name.
- bucket
Owner String - key String
- The unique name of the object in the bucket.
- bucket string
- The Amazon S3 bucket name.
- bucket
Owner string - key string
- The unique name of the object in the bucket.
- bucket str
- The Amazon S3 bucket name.
- bucket_
owner str - key str
- The unique name of the object in the bucket.
- bucket String
- The Amazon S3 bucket name.
- bucket
Owner String - key String
- The unique name of the object in the bucket.
JobProfileConfiguration
- Column
Statistics List<Pulumi.Configurations Aws Native. Data Brew. Inputs. Job Column Statistics Configuration> - List of configurations for column evaluations. ColumnStatisticsConfigurations are used to select evaluations and override parameters of evaluations for particular columns. When ColumnStatisticsConfigurations is undefined, the profile job will profile all supported columns and run all supported evaluations.
- Dataset
Statistics Pulumi.Configuration Aws Native. Data Brew. Inputs. Job Statistics Configuration - Configuration for inter-column evaluations. Configuration can be used to select evaluations and override parameters of evaluations. When configuration is undefined, the profile job will run all supported inter-column evaluations.
- Entity
Detector Pulumi.Configuration Aws Native. Data Brew. Inputs. Job Entity Detector Configuration - Configuration of entity detection for a profile job. When undefined, entity detection is disabled.
- Profile
Columns List<Pulumi.Aws Native. Data Brew. Inputs. Job Column Selector> - List of column selectors. ProfileColumns can be used to select columns from the dataset. When ProfileColumns is undefined, the profile job will profile all supported columns.
- Column
Statistics []JobConfigurations Column Statistics Configuration - List of configurations for column evaluations. ColumnStatisticsConfigurations are used to select evaluations and override parameters of evaluations for particular columns. When ColumnStatisticsConfigurations is undefined, the profile job will profile all supported columns and run all supported evaluations.
- Dataset
Statistics JobConfiguration Statistics Configuration - Configuration for inter-column evaluations. Configuration can be used to select evaluations and override parameters of evaluations. When configuration is undefined, the profile job will run all supported inter-column evaluations.
- Entity
Detector JobConfiguration Entity Detector Configuration - Configuration of entity detection for a profile job. When undefined, entity detection is disabled.
- Profile
Columns []JobColumn Selector - List of column selectors. ProfileColumns can be used to select columns from the dataset. When ProfileColumns is undefined, the profile job will profile all supported columns.
- column
Statistics List<JobConfigurations Column Statistics Configuration> - List of configurations for column evaluations. ColumnStatisticsConfigurations are used to select evaluations and override parameters of evaluations for particular columns. When ColumnStatisticsConfigurations is undefined, the profile job will profile all supported columns and run all supported evaluations.
- dataset
Statistics JobConfiguration Statistics Configuration - Configuration for inter-column evaluations. Configuration can be used to select evaluations and override parameters of evaluations. When configuration is undefined, the profile job will run all supported inter-column evaluations.
- entity
Detector JobConfiguration Entity Detector Configuration - Configuration of entity detection for a profile job. When undefined, entity detection is disabled.
- profile
Columns List<JobColumn Selector> - List of column selectors. ProfileColumns can be used to select columns from the dataset. When ProfileColumns is undefined, the profile job will profile all supported columns.
- column
Statistics JobConfigurations Column Statistics Configuration[] - List of configurations for column evaluations. ColumnStatisticsConfigurations are used to select evaluations and override parameters of evaluations for particular columns. When ColumnStatisticsConfigurations is undefined, the profile job will profile all supported columns and run all supported evaluations.
- dataset
Statistics JobConfiguration Statistics Configuration - Configuration for inter-column evaluations. Configuration can be used to select evaluations and override parameters of evaluations. When configuration is undefined, the profile job will run all supported inter-column evaluations.
- entity
Detector JobConfiguration Entity Detector Configuration - Configuration of entity detection for a profile job. When undefined, entity detection is disabled.
- profile
Columns JobColumn Selector[] - List of column selectors. ProfileColumns can be used to select columns from the dataset. When ProfileColumns is undefined, the profile job will profile all supported columns.
- column_
statistics_ Sequence[Jobconfigurations Column Statistics Configuration] - List of configurations for column evaluations. ColumnStatisticsConfigurations are used to select evaluations and override parameters of evaluations for particular columns. When ColumnStatisticsConfigurations is undefined, the profile job will profile all supported columns and run all supported evaluations.
- dataset_
statistics_ Jobconfiguration Statistics Configuration - Configuration for inter-column evaluations. Configuration can be used to select evaluations and override parameters of evaluations. When configuration is undefined, the profile job will run all supported inter-column evaluations.
- entity_
detector_ Jobconfiguration Entity Detector Configuration - Configuration of entity detection for a profile job. When undefined, entity detection is disabled.
- profile_
columns Sequence[JobColumn Selector] - List of column selectors. ProfileColumns can be used to select columns from the dataset. When ProfileColumns is undefined, the profile job will profile all supported columns.
- column
Statistics List<Property Map>Configurations - List of configurations for column evaluations. ColumnStatisticsConfigurations are used to select evaluations and override parameters of evaluations for particular columns. When ColumnStatisticsConfigurations is undefined, the profile job will profile all supported columns and run all supported evaluations.
- dataset
Statistics Property MapConfiguration - Configuration for inter-column evaluations. Configuration can be used to select evaluations and override parameters of evaluations. When configuration is undefined, the profile job will run all supported inter-column evaluations.
- entity
Detector Property MapConfiguration - Configuration of entity detection for a profile job. When undefined, entity detection is disabled.
- profile
Columns List<Property Map> - List of column selectors. ProfileColumns can be used to select columns from the dataset. When ProfileColumns is undefined, the profile job will profile all supported columns.
JobRecipe
JobS3Location
- Bucket string
- The Amazon S3 bucket name.
- Bucket
Owner string - The AWS account ID of the bucket owner.
- Key string
- The unique name of the object in the bucket.
- Bucket string
- The Amazon S3 bucket name.
- Bucket
Owner string - The AWS account ID of the bucket owner.
- Key string
- The unique name of the object in the bucket.
- bucket String
- The Amazon S3 bucket name.
- bucket
Owner String - The AWS account ID of the bucket owner.
- key String
- The unique name of the object in the bucket.
- bucket string
- The Amazon S3 bucket name.
- bucket
Owner string - The AWS account ID of the bucket owner.
- key string
- The unique name of the object in the bucket.
- bucket str
- The Amazon S3 bucket name.
- bucket_
owner str - The AWS account ID of the bucket owner.
- key str
- The unique name of the object in the bucket.
- bucket String
- The Amazon S3 bucket name.
- bucket
Owner String - The AWS account ID of the bucket owner.
- key String
- The unique name of the object in the bucket.
JobS3TableOutputOptions
- Location
Pulumi.
Aws Native. Data Brew. Inputs. Job S3Location - Represents an Amazon S3 location (bucket name and object key) where DataBrew can write output from a job.
- Location
Job
S3Location - Represents an Amazon S3 location (bucket name and object key) where DataBrew can write output from a job.
- location
Job
S3Location - Represents an Amazon S3 location (bucket name and object key) where DataBrew can write output from a job.
- location
Job
S3Location - Represents an Amazon S3 location (bucket name and object key) where DataBrew can write output from a job.
- location
Job
S3Location - Represents an Amazon S3 location (bucket name and object key) where DataBrew can write output from a job.
- location Property Map
- Represents an Amazon S3 location (bucket name and object key) where DataBrew can write output from a job.
JobSample
- Mode
Pulumi.
Aws Native. Data Brew. Job Sample Mode - A value that determines whether the profile job is run on the entire dataset or a specified number of rows. This value must be one of the following:
- FULL_DATASET - The profile job is run on the entire dataset.
- CUSTOM_ROWS - The profile job is run on the number of rows specified in the
Size
parameter.
- Size int
The
Size
parameter is only required when the mode is CUSTOM_ROWS. The profile job is run on the specified number of rows. The maximum value for size is Long.MAX_VALUE.Long.MAX_VALUE = 9223372036854775807
- Mode
Job
Sample Mode - A value that determines whether the profile job is run on the entire dataset or a specified number of rows. This value must be one of the following:
- FULL_DATASET - The profile job is run on the entire dataset.
- CUSTOM_ROWS - The profile job is run on the number of rows specified in the
Size
parameter.
- Size int
The
Size
parameter is only required when the mode is CUSTOM_ROWS. The profile job is run on the specified number of rows. The maximum value for size is Long.MAX_VALUE.Long.MAX_VALUE = 9223372036854775807
- mode
Job
Sample Mode - A value that determines whether the profile job is run on the entire dataset or a specified number of rows. This value must be one of the following:
- FULL_DATASET - The profile job is run on the entire dataset.
- CUSTOM_ROWS - The profile job is run on the number of rows specified in the
Size
parameter.
- size Integer
The
Size
parameter is only required when the mode is CUSTOM_ROWS. The profile job is run on the specified number of rows. The maximum value for size is Long.MAX_VALUE.Long.MAX_VALUE = 9223372036854775807
- mode
Job
Sample Mode - A value that determines whether the profile job is run on the entire dataset or a specified number of rows. This value must be one of the following:
- FULL_DATASET - The profile job is run on the entire dataset.
- CUSTOM_ROWS - The profile job is run on the number of rows specified in the
Size
parameter.
- size number
The
Size
parameter is only required when the mode is CUSTOM_ROWS. The profile job is run on the specified number of rows. The maximum value for size is Long.MAX_VALUE.Long.MAX_VALUE = 9223372036854775807
- mode
Job
Sample Mode - A value that determines whether the profile job is run on the entire dataset or a specified number of rows. This value must be one of the following:
- FULL_DATASET - The profile job is run on the entire dataset.
- CUSTOM_ROWS - The profile job is run on the number of rows specified in the
Size
parameter.
- size int
The
Size
parameter is only required when the mode is CUSTOM_ROWS. The profile job is run on the specified number of rows. The maximum value for size is Long.MAX_VALUE.Long.MAX_VALUE = 9223372036854775807
- mode "FULL_DATASET" | "CUSTOM_ROWS"
- A value that determines whether the profile job is run on the entire dataset or a specified number of rows. This value must be one of the following:
- FULL_DATASET - The profile job is run on the entire dataset.
- CUSTOM_ROWS - The profile job is run on the number of rows specified in the
Size
parameter.
- size Number
The
Size
parameter is only required when the mode is CUSTOM_ROWS. The profile job is run on the specified number of rows. The maximum value for size is Long.MAX_VALUE.Long.MAX_VALUE = 9223372036854775807
JobSampleMode
JobStatisticOverride
- Parameters Dictionary<string, string>
- A map that includes overrides of an evaluation’s parameters.
- Statistic string
- The name of an evaluation
- Parameters map[string]string
- A map that includes overrides of an evaluation’s parameters.
- Statistic string
- The name of an evaluation
- parameters Map<String,String>
- A map that includes overrides of an evaluation’s parameters.
- statistic String
- The name of an evaluation
- parameters {[key: string]: string}
- A map that includes overrides of an evaluation’s parameters.
- statistic string
- The name of an evaluation
- parameters Mapping[str, str]
- A map that includes overrides of an evaluation’s parameters.
- statistic str
- The name of an evaluation
- parameters Map<String>
- A map that includes overrides of an evaluation’s parameters.
- statistic String
- The name of an evaluation
JobStatisticsConfiguration
- Included
Statistics List<string> - List of included evaluations. When the list is undefined, all supported evaluations will be included.
- Overrides
List<Pulumi.
Aws Native. Data Brew. Inputs. Job Statistic Override> - List of overrides for evaluations.
- Included
Statistics []string - List of included evaluations. When the list is undefined, all supported evaluations will be included.
- Overrides
[]Job
Statistic Override - List of overrides for evaluations.
- included
Statistics List<String> - List of included evaluations. When the list is undefined, all supported evaluations will be included.
- overrides
List<Job
Statistic Override> - List of overrides for evaluations.
- included
Statistics string[] - List of included evaluations. When the list is undefined, all supported evaluations will be included.
- overrides
Job
Statistic Override[] - List of overrides for evaluations.
- included_
statistics Sequence[str] - List of included evaluations. When the list is undefined, all supported evaluations will be included.
- overrides
Sequence[Job
Statistic Override] - List of overrides for evaluations.
- included
Statistics List<String> - List of included evaluations. When the list is undefined, all supported evaluations will be included.
- overrides List<Property Map>
- List of overrides for evaluations.
JobValidationConfiguration
- Ruleset
Arn string - Arn of the Ruleset
- Validation
Mode Pulumi.Aws Native. Data Brew. Job Validation Mode - Mode of data quality validation. Default mode is "CHECK_ALL" which verifies all rules defined in the selected ruleset.
- Ruleset
Arn string - Arn of the Ruleset
- Validation
Mode JobValidation Mode - Mode of data quality validation. Default mode is "CHECK_ALL" which verifies all rules defined in the selected ruleset.
- ruleset
Arn String - Arn of the Ruleset
- validation
Mode JobValidation Mode - Mode of data quality validation. Default mode is "CHECK_ALL" which verifies all rules defined in the selected ruleset.
- ruleset
Arn string - Arn of the Ruleset
- validation
Mode JobValidation Mode - Mode of data quality validation. Default mode is "CHECK_ALL" which verifies all rules defined in the selected ruleset.
- ruleset_
arn str - Arn of the Ruleset
- validation_
mode JobValidation Mode - Mode of data quality validation. Default mode is "CHECK_ALL" which verifies all rules defined in the selected ruleset.
- ruleset
Arn String - Arn of the Ruleset
- validation
Mode "CHECK_ALL" - Mode of data quality validation. Default mode is "CHECK_ALL" which verifies all rules defined in the selected ruleset.
JobValidationMode
Package Details
- Repository
- AWS Native pulumi/pulumi-aws-native
- License
- Apache-2.0
We recommend new projects start with resources from the AWS provider.