SetToString Filter
The SetToString converts the SET column type from the internal representation to a string-based representation in the THL.
This is achieved by accessing the extractor database, obtaining the table definitions, and modifying the THL data before it is written into the THL file.
| Pre-configured filter name | settostring |
| Classname | com.continuent.tungsten.replicator.filter.SetToStringFilter |
| Property prefix | replicator.filter.settostring |
| Stage compatibility | binlog-to-q |
tpm Option compatibility | svc-extractor-filters |
| Data compatibility | Row events |
Parameters
| Parameter | Type | Default | Description |
|---|---|---|---|
user | string | ${replicator.global.extract.db.user} | The username for the connection to the database for looking up column definitions |
password | string | ${replicator.global.extract.db.password} | The password for the connection to the database for looking up column definitions |
url | string | jdbc:mysql:thin://${replicator.global.extract.db.host}:${replicator.global.extract.db.port}/${replicator.schema}?createDB=true | JDBC URL of the database connection to use for looking up column definitions |
The SetToString filter should be used with heterogeneous replication to ensure that the data is represented as the string value, not the
internal numerical representation.
In the THL output below, the table has a SET column, salesman:
mysql> describe salesadv;
+----------+--------------------------------------+------+-----+---------+----------------+
| Field | Type | Null | Key | Default | Extra |
+----------+--------------------------------------+------+-----+---------+----------------+
| id | int(11) | NO | PRI | NULL | auto_increment |
| country | enum('US','UK','France','Australia') | YES | | NULL | |
| city | int(11) | YES | | NULL | |
| salesman | set('Alan','Zachary') | YES | | NULL | |
| value | decimal(10,2) | YES | | NULL | |
+----------+--------------------------------------+------+-----+---------+----------------+
When extracted in the THL, the representation uses the internal value (for example, 1 for the first element of the set description). This can be seen in the THL output below.
SEQ# = 138 / FRAG# = 0 (last frag)
- TIME = 2013-08-01 19:09:35.0
- EPOCH# = 122
- EVENTID = mysql-bin.000012:0000000000021434;0
- SOURCEID = host31
- METADATA = [mysql_server_id=1;dbms_type=mysql;service=alpha;shard=test]
- TYPE = com.continuent.tungsten.replicator.event.ReplDBMSEvent
- OPTIONS = [foreign_key_checks = 1, unique_checks = 1]
- SQL(0) =
- ACTION = INSERT
- SCHEMA = test
- TABLE = salesadv
- ROW# = 0
- COL(1: id) = 2
- COL(2: country) = 1
- COL(3: city) = 8374
- COL(4: salesman) = 1
- COL(5: value) = 35000.00
For the salesman column, the corresponding value in the THL is 1. With the SetToString filter enabled, the value
is expanded to the corresponding string value:
SEQ# = 121 / FRAG# = 0 (last frag)
- TIME = 2013-08-01 19:05:14.0
- EPOCH# = 102
- EVENTID = mysql-bin.000012:0000000000018866;0
- SOURCEID = host31
- METADATA = [mysql_server_id=1;dbms_type=mysql;service=alpha;shard=test]
- TYPE = com.continuent.tungsten.replicator.event.ReplDBMSEvent
- OPTIONS = [foreign_key_checks = 1, unique_checks = 1]
- SQL(0) =
- ACTION = INSERT
- SCHEMA = test
- TABLE = salesadv
- ROW# = 0
- COL(1: id) = 1
- COL(2: country) = US
- COL(3: city) = 8374
- COL(4: salesman) = Alan
- COL(5: value) = 35000.00
The examples here also show the enumtostring and columnname filters.