Simple Storage Service (S3) is an online data storage service provided by Amazon, which provides low-latency access through web services. You can create connections to specific S3 buckets through the Trifacta application. These connections to S3 enable workspace users to read from and write to S3 buckets.
Pre-requisites
Before you begin, please verify that your Trifacta® environment meets the following requirements:
Integration: Your Trifacta instance is connected to a running environment supported by your product edition.
Verify that
Enable S3 Connectivity
has been enabled in the Workspace Settings Page.
Acquire the Access Key ID and Secret Key for the S3 bucket or buckets to which you are connecting. For more information on acquiring your key/secret combination, contact your S3 administrator.
Create S3 Connection
3. In the Connections page, click on Create Connection
4. In the Create Connection page, click the External Amazon S3 card.
5. Specify the connection properties as follows-
DefaultBucket | The default S3 bucket to which to connect. When the connection is first accessed for browsing, the contents of this bucket are displayed. If this value is not provided, then the list of available buckets based on the key/secret combination is displayed when browsing through the connection. NOTE: To see the list of available buckets, the connecting user must have the getBucketList permission. If that permission is not present and no default bucket is listed, then the user cannot browse S3. |
Access Key ID * | Access Key ID for the S3 connection. |
Secret Key * | Secret Key for the S3 connection. |
Server Side Encryption | If server-side encryption has been enabled on your bucket, you can select the server-side encryption policy to use when writing to the bucket. SSE-S3 and SSE-KMS methods are supported. For more information, see http://docs.aws.amazon.com/AmazonS3/latest/dev/serv-side-encryption.html. |
Server Side KMS Key Id | When KMS encryption is enabled, you must specify the AWS KMS key ID to use for the server-side encryption. |
Fields with * are mandatory.
6. Click on Test Connection to verify your connection.
For more information on the other options, see Create Connection Window.
7. Click Create
Verify the Connection
Import a dataset from External Amazon S3.
Add it to a flow and run a job, publishing results back to S3.
For more information, see Verify Operations.
Publish to External S3 Bucket
You can publish results to your external S3 buckets. Configure an output destination to write to your external S3 bucket.
In Flow View, create or edit an output object.
To edit, right-click an output object. The object details are displayed in the Details panel.
In the Details panel, click Edit.
3. Modify the output destination to use the External S3 buckets connection.
4. Navigate the bucket to select the appropriate location for the output. Specify the file as needed.
5. To save your changes, click Update.
For more information, see Create Outputs.
More Info
For more info on External S3 Connections, read this detailed documentation guide.
See the full configuration guide here
Learn more about Using S3 here
Learn more about the S3 Browser here