I need to filter the rows of a table according to a list DOM_LABEL that contains the correct values that should match the ones present in the column label.
DOM_LABEL = ['Microsoft', 'Apple', 'Lenovo']; this list is stored in a CSV file.
The problem is that some rows have incorrect values such as 'Samsung', 'HP', etc. I don't know which incorrect values are present, so filtering for the rows whose label is not contained in DOM_LABEL would help me analyze the problem better.
However, the Filter rows step does not allow me to load a list of values, and adding them to the step manually is tedious because DOM_LABEL contains more than 100 values. Is there a way to load those values into the Filter rows step?
I think the best solution in this case, if I'm understanding the issue correctly, is to set up a field beforehand that indicates whether the value is in the list, and then filter rows on that field.
How do I do this?
Load the data from the CSV in a separate step (CSV input). Then, in your main flow BEFORE the Filter rows step, use a 'Stream lookup' step to look up the value of your main-flow column in the CSV data, and retrieve the column in which you look up the value. Then, in the Filter rows step, just check whether the looked-up value is empty or not: rows where it is empty are the ones whose label is not in DOM_LABEL.
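To make the logic of that lookup-then-filter approach concrete, here is a minimal sketch in plain Python. It is not the tool's own configuration, only an illustration of what the three steps do. The sample rows, the `matched_label` field name, and the helper functions are all hypothetical, and the CSV is assumed to have a single `label` column with a header row.

```python
import csv
import io

# Hypothetical stand-in for the CSV file that holds DOM_LABEL (one value per row).
DOM_LABEL_CSV = "label\nMicrosoft\nApple\nLenovo\n"

# Hypothetical main-flow rows; 'Samsung' and 'HP' play the incorrect labels.
main_rows = [
    {"id": 1, "label": "Microsoft"},
    {"id": 2, "label": "Samsung"},
    {"id": 3, "label": "Apple"},
    {"id": 4, "label": "HP"},
]


def load_lookup(csv_text):
    """CSV input: read the allowed labels into a dictionary keyed by label."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return {row["label"]: row["label"] for row in reader}


def stream_lookup(rows, lookup):
    """Stream lookup: add a field that is None when the label is not found."""
    for row in rows:
        yield {**row, "matched_label": lookup.get(row["label"])}


def filter_rows(rows):
    """Filter rows: split the stream on whether the looked-up field is empty."""
    valid, invalid = [], []
    for row in rows:
        (invalid if row["matched_label"] is None else valid).append(row)
    return valid, invalid


lookup = load_lookup(DOM_LABEL_CSV)
valid, invalid = filter_rows(stream_lookup(main_rows, lookup))
print([row["label"] for row in invalid])  # prints ['Samsung', 'HP']
```

Because the allowed values are read from the file at run time, the list can grow past 100 entries without anything being typed into the filter condition by hand.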