I want to implement increment load in pentaho. I have two tables in my OLTP and I want to apply left join them and drop them as single table in OLAP. OlTP and OLAP are in different database connection in mysql means there are two different database connection in pentaho.
I want to implement this etl in pentaho and also include incremental load use case that once all data is dumped in olap so in future jobs it only inserts new entries found in olap. First table in oltp is order which have id field, orderdate and amount. Second table in oltp is orderdet which have id, orderid, prodname fields where orderid is foreign key to first table. Need to join them as single table and drop in olap.
How to do this task in pentaho?
Confused in incremental load part like how to design etl for this?
To do incremential loads, you have to understand that Pentaho is realy fast in reading data, but (relatively) slower in storing data. Therefore for incremential loads the functional idea is to load data from system A (source), and from system B (target) and to determine what action should be taken for each record. So a lot needs to be read, but only a little needs to be written.
We can either do
There are some steps in pentaho to set this up pretty easily. But in general most people start the same.
More info here