Data loading is slow while using "Insert/Update" step in pentaho 4.4.0
I am using pentaho 4.4.0. While using the "Insert/Update" step in kettle the speed of the data load is too slow compared to mysql. This step will scan through the entire records in table before inserting. If the record exist it will do a update. So what shall be done to optimize the performance while doing "Insert/Update" . and the process speed is 4 r/s, so totally my records will be above 1 lakh... The process takes 2 and half hours to complete the entire process.
Try to set a big value on the field Transaction Size (Commit).
Depending on the number of rows you will upsert, set more or less on that field.
This improve so much the performance in my case.
500 would be a little value if you will upsert, in example, 100.000 rows, because it will have to be commited 200 times.
Less commits, faster execution.