Class RedshiftDataSpec

java.lang.Object
com.amazonaws.services.machinelearning.model.RedshiftDataSpec
All Implemented Interfaces:
Serializable, Cloneable

public class RedshiftDataSpec extends Object implements Serializable, Cloneable

Describes the data specification of an Amazon Redshift DataSource.

  • Constructor Details

    • RedshiftDataSpec

      public RedshiftDataSpec()
  • Method Details

    • setDatabaseInformation

      public void setDatabaseInformation(RedshiftDatabase databaseInformation)

      Describes the DatabaseName and ClusterIdentifier for an Amazon Redshift DataSource.

      Parameters:
      databaseInformation - Describes the DatabaseName and ClusterIdentifier for an Amazon Redshift DataSource.
    • getDatabaseInformation

      public RedshiftDatabase getDatabaseInformation()

      Describes the DatabaseName and ClusterIdentifier for an Amazon Redshift DataSource.

      Returns:
      The DatabaseName and ClusterIdentifier for an Amazon Redshift DataSource.
    • withDatabaseInformation

      public RedshiftDataSpec withDatabaseInformation(RedshiftDatabase databaseInformation)

      Describes the DatabaseName and ClusterIdentifier for an Amazon Redshift DataSource.

      Parameters:
      databaseInformation - Describes the DatabaseName and ClusterIdentifier for an Amazon Redshift DataSource.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • setSelectSqlQuery

      public void setSelectSqlQuery(String selectSqlQuery)

      Describes the SQL Query to execute on an Amazon Redshift database for an Amazon Redshift DataSource.

      Parameters:
      selectSqlQuery - Describes the SQL Query to execute on an Amazon Redshift database for an Amazon Redshift DataSource.
    • getSelectSqlQuery

      public String getSelectSqlQuery()

      Describes the SQL Query to execute on an Amazon Redshift database for an Amazon Redshift DataSource.

      Returns:
      The SQL query to execute on an Amazon Redshift database for an Amazon Redshift DataSource.
    • withSelectSqlQuery

      public RedshiftDataSpec withSelectSqlQuery(String selectSqlQuery)

      Describes the SQL Query to execute on an Amazon Redshift database for an Amazon Redshift DataSource.

      Parameters:
      selectSqlQuery - Describes the SQL Query to execute on an Amazon Redshift database for an Amazon Redshift DataSource.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • setDatabaseCredentials

      public void setDatabaseCredentials(RedshiftDatabaseCredentials databaseCredentials)

      Describes AWS Identity and Access Management (IAM) credentials that are used to connect to the Amazon Redshift database.

      Parameters:
      databaseCredentials - Describes AWS Identity and Access Management (IAM) credentials that are used to connect to the Amazon Redshift database.
    • getDatabaseCredentials

      public RedshiftDatabaseCredentials getDatabaseCredentials()

      Describes AWS Identity and Access Management (IAM) credentials that are used to connect to the Amazon Redshift database.

      Returns:
      The AWS Identity and Access Management (IAM) credentials that are used to connect to the Amazon Redshift database.
    • withDatabaseCredentials

      public RedshiftDataSpec withDatabaseCredentials(RedshiftDatabaseCredentials databaseCredentials)

      Describes AWS Identity and Access Management (IAM) credentials that are used to connect to the Amazon Redshift database.

      Parameters:
      databaseCredentials - Describes AWS Identity and Access Management (IAM) credentials that are used to connect to the Amazon Redshift database.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • setS3StagingLocation

      public void setS3StagingLocation(String s3StagingLocation)

      Describes an Amazon S3 location to store the result set of the SelectSqlQuery query.

      Parameters:
      s3StagingLocation - Describes an Amazon S3 location to store the result set of the SelectSqlQuery query.
    • getS3StagingLocation

      public String getS3StagingLocation()

      Describes an Amazon S3 location to store the result set of the SelectSqlQuery query.

      Returns:
      The Amazon S3 location that stores the result set of the SelectSqlQuery query.
    • withS3StagingLocation

      public RedshiftDataSpec withS3StagingLocation(String s3StagingLocation)

      Describes an Amazon S3 location to store the result set of the SelectSqlQuery query.

      Parameters:
      s3StagingLocation - Describes an Amazon S3 location to store the result set of the SelectSqlQuery query.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • setDataRearrangement

      public void setDataRearrangement(String dataRearrangement)

      Describes the splitting specifications for a DataSource.

      Parameters:
      dataRearrangement - Describes the splitting specifications for a DataSource.
    • getDataRearrangement

      public String getDataRearrangement()

      Describes the splitting specifications for a DataSource.

      Returns:
      The splitting specifications for a DataSource.
    • withDataRearrangement

      public RedshiftDataSpec withDataRearrangement(String dataRearrangement)

      Describes the splitting specifications for a DataSource.

      Parameters:
      dataRearrangement - Describes the splitting specifications for a DataSource.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
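
The dataRearrangement value is passed as a plain String, so it can help to build it with a small helper. The sketch below assumes the JSON form {"splitting":{"percentBegin":...,"percentEnd":...}} described in the Amazon Machine Learning documentation; this page only says the value holds "splitting specifications". The class name DataRearrangementSketch, the method splitting, and the range check are hypothetical and not part of the SDK.

```java
import java.util.Locale;

/** Illustrative helper (not part of the SDK) that builds a percentage-based split string. */
final class DataRearrangementSketch {

    // Builds the JSON text for one split; the 0..100 range check is an assumption of this sketch.
    static String splitting(int percentBegin, int percentEnd) {
        if (percentBegin < 0 || percentEnd > 100 || percentBegin >= percentEnd) {
            throw new IllegalArgumentException("expected 0 <= percentBegin < percentEnd <= 100");
        }
        return String.format(Locale.ROOT,
                "{\"splitting\":{\"percentBegin\":%d,\"percentEnd\":%d}}",
                percentBegin, percentEnd);
    }

    public static void main(String[] args) {
        // A 70/30 split: one string per DataSource built from the same query.
        String training = splitting(0, 70);
        String evaluation = splitting(70, 100);
        System.out.println(training);
        System.out.println(evaluation);
    }
}
```

Each resulting string could then be handed to setDataRearrangement or withDataRearrangement on a separate RedshiftDataSpec, one for the training DataSource and one for the evaluation DataSource.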
    • setDataSchema

      public void setDataSchema(String dataSchema)

      A JSON string that represents the schema for an Amazon Redshift DataSource. The DataSchema defines the structure of the observation data in the data file(s) referenced in the DataSource.

      A DataSchema is not required if you specify a DataSchemaUri.

      Define your DataSchema as a series of key-value pairs. The value of attributes is an array of key-value objects, and the value of excludedVariableNames is an array of field names. Use the following format to define your DataSchema.

      {
        "version": "1.0",
        "recordAnnotationFieldName": "F1",
        "recordWeightFieldName": "F2",
        "targetFieldName": "F3",
        "dataFormat": "CSV",
        "dataFileContainsHeader": true,
        "attributes": [
          { "fieldName": "F1", "fieldType": "TEXT" },
          { "fieldName": "F2", "fieldType": "NUMERIC" },
          { "fieldName": "F3", "fieldType": "CATEGORICAL" },
          { "fieldName": "F4", "fieldType": "NUMERIC" },
          { "fieldName": "F5", "fieldType": "CATEGORICAL" },
          { "fieldName": "F6", "fieldType": "TEXT" },
          { "fieldName": "F7", "fieldType": "WEIGHTED_INT_SEQUENCE" },
          { "fieldName": "F8", "fieldType": "WEIGHTED_STRING_SEQUENCE" }
        ],
        "excludedVariableNames": [ "F6" ]
      }

      Parameters:
      dataSchema - A JSON string that represents the schema for an Amazon Redshift DataSource. The DataSchema defines the structure of the observation data in the data file(s) referenced in the DataSource.

    • getDataSchema

      public String getDataSchema()

      A JSON string that represents the schema for an Amazon Redshift DataSource. The DataSchema defines the structure of the observation data in the data file(s) referenced in the DataSource.

      A DataSchema is not required if you specify a DataSchemaUri.

      Define your DataSchema as a series of key-value pairs. The value of attributes is an array of key-value objects, and the value of excludedVariableNames is an array of field names. Use the following format to define your DataSchema.

      {
        "version": "1.0",
        "recordAnnotationFieldName": "F1",
        "recordWeightFieldName": "F2",
        "targetFieldName": "F3",
        "dataFormat": "CSV",
        "dataFileContainsHeader": true,
        "attributes": [
          { "fieldName": "F1", "fieldType": "TEXT" },
          { "fieldName": "F2", "fieldType": "NUMERIC" },
          { "fieldName": "F3", "fieldType": "CATEGORICAL" },
          { "fieldName": "F4", "fieldType": "NUMERIC" },
          { "fieldName": "F5", "fieldType": "CATEGORICAL" },
          { "fieldName": "F6", "fieldType": "TEXT" },
          { "fieldName": "F7", "fieldType": "WEIGHTED_INT_SEQUENCE" },
          { "fieldName": "F8", "fieldType": "WEIGHTED_STRING_SEQUENCE" }
        ],
        "excludedVariableNames": [ "F6" ]
      }

      Returns:
      A JSON string that represents the schema for an Amazon Redshift DataSource. The DataSchema defines the structure of the observation data in the data file(s) referenced in the DataSource.

    • withDataSchema

      public RedshiftDataSpec withDataSchema(String dataSchema)

      A JSON string that represents the schema for an Amazon Redshift DataSource. The DataSchema defines the structure of the observation data in the data file(s) referenced in the DataSource.

      A DataSchema is not required if you specify a DataSchemaUri.

      Define your DataSchema as a series of key-value pairs. The value of attributes is an array of key-value objects, and the value of excludedVariableNames is an array of field names. Use the following format to define your DataSchema.

      {
        "version": "1.0",
        "recordAnnotationFieldName": "F1",
        "recordWeightFieldName": "F2",
        "targetFieldName": "F3",
        "dataFormat": "CSV",
        "dataFileContainsHeader": true,
        "attributes": [
          { "fieldName": "F1", "fieldType": "TEXT" },
          { "fieldName": "F2", "fieldType": "NUMERIC" },
          { "fieldName": "F3", "fieldType": "CATEGORICAL" },
          { "fieldName": "F4", "fieldType": "NUMERIC" },
          { "fieldName": "F5", "fieldType": "CATEGORICAL" },
          { "fieldName": "F6", "fieldType": "TEXT" },
          { "fieldName": "F7", "fieldType": "WEIGHTED_INT_SEQUENCE" },
          { "fieldName": "F8", "fieldType": "WEIGHTED_STRING_SEQUENCE" }
        ],
        "excludedVariableNames": [ "F6" ]
      }

      Parameters:
      dataSchema - A JSON string that represents the schema for an Amazon Redshift DataSource. The DataSchema defines the structure of the observation data in the data file(s) referenced in the DataSource.

      Returns:
      Returns a reference to this object so that method calls can be chained together.
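
Because dataSchema is a JSON document carried in a String, one way to avoid hand-escaping it is to assemble it from a map of field names to field types. The following is a minimal sketch under stated assumptions: the class DataSchemaSketch and its methods are hypothetical, it emits only a subset of the keys from the format shown above (whether the service accepts a schema without the remaining keys is not stated on this page), and it assumes field names contain no characters that need JSON escaping.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.StringJoiner;

/** Illustrative helper (not part of the SDK) that assembles a DataSchema JSON string. */
final class DataSchemaSketch {

    // Renders the "attributes" array; iteration order follows the map's order.
    static String attributesJson(Map<String, String> fieldTypes) {
        StringJoiner joiner = new StringJoiner(",", "[", "]");
        for (Map.Entry<String, String> entry : fieldTypes.entrySet()) {
            joiner.add("{\"fieldName\":\"" + entry.getKey()
                    + "\",\"fieldType\":\"" + entry.getValue() + "\"}");
        }
        return joiner.toString();
    }

    // Wraps the attributes in a CSV schema with a header row, mirroring the documented example.
    static String dataSchema(String targetFieldName, Map<String, String> fieldTypes) {
        return "{\"version\":\"1.0\","
                + "\"targetFieldName\":\"" + targetFieldName + "\","
                + "\"dataFormat\":\"CSV\","
                + "\"dataFileContainsHeader\":true,"
                + "\"attributes\":" + attributesJson(fieldTypes) + "}";
    }

    public static void main(String[] args) {
        // LinkedHashMap keeps the fields in insertion order.
        Map<String, String> fieldTypes = new LinkedHashMap<>();
        fieldTypes.put("F1", "TEXT");
        fieldTypes.put("F2", "NUMERIC");
        fieldTypes.put("F3", "CATEGORICAL");
        System.out.println(dataSchema("F3", fieldTypes));
    }
}
```

The returned string is the kind of value that setDataSchema and withDataSchema accept. For anything beyond simple field names, a JSON library would be a safer choice than string concatenation.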
    • setDataSchemaUri

      public void setDataSchemaUri(String dataSchemaUri)

      Describes the schema location for an Amazon Redshift DataSource.

      Parameters:
      dataSchemaUri - Describes the schema location for an Amazon Redshift DataSource.
    • getDataSchemaUri

      public String getDataSchemaUri()

      Describes the schema location for an Amazon Redshift DataSource.

      Returns:
      The schema location for an Amazon Redshift DataSource.
    • withDataSchemaUri

      public RedshiftDataSpec withDataSchemaUri(String dataSchemaUri)

      Describes the schema location for an Amazon Redshift DataSource.

      Parameters:
      dataSchemaUri - Describes the schema location for an Amazon Redshift DataSource.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • toString

      public String toString()
      Returns a string representation of this object; useful for testing and debugging.
      Overrides:
      toString in class Object
      Returns:
      A string representation of this object.
    • equals

      public boolean equals(Object obj)
      Overrides:
      equals in class Object
    • hashCode

      public int hashCode()
      Overrides:
      hashCode in class Object
    • clone

      public RedshiftDataSpec clone()
      Overrides:
      clone in class Object
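
Each property above has a set method that returns void and a with method that returns the object itself for chaining. The sketch below illustrates that pattern with a hypothetical stand-in class, SpecSketch, that copies two of the documented signatures so it compiles without the SDK on the classpath. It is not the SDK's implementation, and the query text and bucket name are placeholders.

```java
/** Stand-in for illustration only; mirrors two properties of the documented class. */
final class SpecSketch {
    private String selectSqlQuery;
    private String s3StagingLocation;

    // Plain setter: mutates the object and returns nothing.
    void setSelectSqlQuery(String selectSqlQuery) {
        this.selectSqlQuery = selectSqlQuery;
    }

    String getSelectSqlQuery() {
        return selectSqlQuery;
    }

    // Fluent variant: same mutation, but returns this so calls can be chained.
    SpecSketch withSelectSqlQuery(String selectSqlQuery) {
        setSelectSqlQuery(selectSqlQuery);
        return this;
    }

    void setS3StagingLocation(String s3StagingLocation) {
        this.s3StagingLocation = s3StagingLocation;
    }

    String getS3StagingLocation() {
        return s3StagingLocation;
    }

    SpecSketch withS3StagingLocation(String s3StagingLocation) {
        setS3StagingLocation(s3StagingLocation);
        return this;
    }

    public static void main(String[] args) {
        // Chained configuration in a single expression.
        SpecSketch spec = new SpecSketch()
                .withSelectSqlQuery("SELECT * FROM observations")
                .withS3StagingLocation("s3://example-bucket/staging/");
        System.out.println(spec.getSelectSqlQuery());
        System.out.println(spec.getS3StagingLocation());
    }
}
```

With the real class, the same chain would presumably start from new RedshiftDataSpec() and continue with the with methods listed on this page.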