Install Amazon Redshift Applier
Replication into Redshift requires two separate replicator installations, one that extracts information from the source database, and a second that generates the CSV files, loads those files into S3 and then executes the statements on the Redshift database to import the CSV data and apply the transformations to build the final tables.
The two replication services can operate on the same machine, (See "deployment-multiple-replicators") or they can be installed on two different machines.
Once you have completed the configuration of the Amazon Redshift database, you can configure and install the applier as described using the steps below.
Before installing the applier, the following additions need adding to the extractor configuration. Apply the following parameter to the extractor configuration before installing the applier
Add the following to the
/etc/tungsten/tungsten.ini[alpha]...Existing Replicator Config...enable-heterogeneous-service=trueshell> tpm updateNoteThe above step is only applicable for standalone extractors. If you are configuring replications from an existing Tungsten Cluster (Cluster-Extractor), follow the steps outlined here to ensure the cluster is configured correctly: "Replicating Data Out of a Cluster"
The applier can now be configured. Unpack the Tungsten Replicator distribution in staging directory:
shell> tar zxf tungsten-replicator-8.0.4-132.tar.gzChange into the staging directory:
shell> cd tungsten-replicator-8.0.4-132Create the
/etc/tungsten/tungsten.iniusing the example below as a template.Example tungsten.ini[defaults]user=tungsteninstall-directory=/opt/continuentprofile-script=~/.bash_profilerest-api-admin-user=apiuserrest-api-admin-password=secretreplicator-rest-api-address=0.0.0.0[alpha]topology=master-slavemaster=sourcehostmembers=localhostdatasource-type=redshiftreplication-host=redshift.us-east-1.redshift.amazonaws.comreplication-user=awsRedshiftUserreplication-password=awsRedshiftPassredshift-dbname=devbatch-enabled=truebatch-load-template=redshiftsvc-applier-filters=dropstatementdatasvc-applier-block-commit-interval=30ssvc-applier-block-commit-size=250000Show argument definitions
user=tungstenOS System User, for example tungsten. DO NOT use root.install-directory=/opt/continuentInstallation directory.profile-script=~/.bash_profileAppend commands to include env.sh in this profile script.rest-api-admin-user=apiuserSpecify the initial Admin Username for API access.Available from v7.0.0rest-api-admin-password=secretSpecify the initial Admin User Password for API access.rest-api-admin-passwordalias only available from version 7.1.2 onwards.Available from v7.0.0replicator-rest-api-address=0.0.0.0Address for the API to bind too.Available from v7.0.0If your MySQL source is a Tungsten Cluster, ensure the additional steps below are also included in your applier configuration
First, prepare the required filter configuration file as follows on the Redshift applier host(s) only:
shell> mkdir -p /opt/continuent/share/shell> cp tungsten-replicator/support/filters-config/convertstringfrommysql.json /opt/continuent/share/Then, include the following parameters in the configuration
property=replicator.stage.remote-to-thl.filters=convertstringfrommysqlproperty=replicator.filter.convertstringfrommysql.definitionsFile=/opt/continuent/share/convertstringfrommysql.jsonOnce the prerequisites and configuring of the installation has been completed, the software can be installed:
shell> ./tools/tpm install
If the installation process fails, check the output of the /tmp/tungsten-configure.log file for more information about
the root cause.
On the host that is loading data into Redshift, create the s3-config-servicename.json file and then copy that file into
the share directory within the installed directory on that host. For example:
shell> cp s3-config-servicename.json /opt/continuent/share/
Now the services can be started:
shell> replicator start
Once the service is configured and running, the service can be monitored as normal using the trepctl command. See "Management and Monitoring of Amazon Redshift Deployments" for
more information.