I have two files, file 1 contains 2 columns, file 2 contains 5 columns. I want to remove the lines from file 2 that dont contain common strings with file 1:
-file 1, if this is a list, each line contains [0] and [1]
gene-3 +
gene-2 -
gene-1 -
-file 2, compare [0] and [1] from file 1, against [0] and [4] of this file. If noline from file1 matches in any line of file2, must be removed.
gene-1 mga CDF 1 + # this line contains + instead - although gane-1 is the same. rm
gene-2 mga CDS 1 - # [0][1] from file 1 = [0][4] from file 2: (gene-2, - ) keep it!
gene-3 mga CDH 1 + # "" "" ""
gene-4 mga CDS 1 + # no gene-4 in file 1, remove.
-Desired output:
gene-3 mga CDH 1 +
gene-2 mga CDS 1 -
any ideas?