Ghost inputs/outputs, generic implementation

DataStage has an interesting feature called "runtime column propagation". This means that a DS stage (like transfomer) takes whatever is thrown at it, say the result of an SQL query, reads its metadata and propagate it through the stage element. It is possible to use the actual data in the stage like in a transfomer.


You configure it be ticking a checkbox in the output tab. It might be a bit hidden though.

It is, however a double-edged sword. The up is that one can easily implement generic etl jobs. The down is that especially the undiscerning user can get puzzled by ghostly inputs/outputs. Connected to that you need to have memorised the structure getting propagated.

 

Kommentare

Beliebte Posts aus diesem Blog

Error when checking operator: Could not find input field 'field_name'

Missing record delimiter "\n", saw EOF instead