5.6. Deploying Oracle to Vertica Replication
Hewlett-Packard's Vertica provides support for BigData, SQL-based analysis
and processing. Integration with Oracle enables data to be replicated live
from the Oracle database directly into Vertica without the need to manually
export and import the data.
Figure 5.5. Topologies: Oracle to HP Vertica with Redo Reader
Replication to Vertica operates as follows:
Data is extracted from the source database into THL.
When extracting the data from the THL, the Vertica replicator writes the
data into CSV files according to the name of the source tables. The
files contain all of the row-based data, including the global
transaction ID generated by Tungsten Replication during replication, and the
operation type (insert, delete, etc) as part of the CSV data.
The CSV data is then loaded into Vertica into staging tables.
SQL statements are then executed to perform updates on the live version
of the tables, using the CSV, batch loaded, information, deleting old
rows, and inserting the new data when performing updates to work
effectively within the confines of Vertica operation.
Setting up replication requires setting up both the master and slave
components as two different configurations, one for Oracle and the other for
Vertica. Replication also requires some additional steps to ensure that the
Vertica host is ready to accept the replicated data that has been extracted.
Tungsten Replication uses all the tools required to perform these operations
during the installation and setup.