I am trying to find the similar genes between two columns that I can later work with just the similar genes. Below is my code:
top100_1Beta <- data.frame(grp1_Beta$my_data.SYMBOL[1:100])
top100_2Beta<- data.frame(grp2_Beta$my_data.SYMBOL[1:100])
common100_Beta <- semi_join(top100_1Beta,top100_2Beta)`
When I run the code I get the following error:
Error:
by
required, because the data sources have no common variables
This is wrong since when I open top100_1Beta
and top100_2Beta
I can see at least the first few list the exact same genes: ATP2A1, SLMAP, MEOX2,...
I am confused on why then it's returning that no commonalities. Any help would be greatly appreciated. Thanks!
I don't think you need any form of
*_join
here; instead it seems you're looking forintersect
This returns a
vector
of common entries amongst the first 100 entries ofgrp1_Beta$my_data.SYMBOL
andgrp1_Beta$my_data.SYMBOL
.