I want to find the number of SNPS that have FDR adjusted p-values of p<.05. However, my for loop
and if
statement did not effectively find the # of SNPs with p<.05.
My dataset has a P
column which indicates p-value and 1422 observations.
> dput(dat[1:5,])
structure(list(CHR = c(6L, 6L, 6L, 6L, 6L), SNP = c("rs9257319",
"rs2269553", "rs2894066", "rs3763338", "rs1233508"), BP = c(28959616L,
28984488L, 29001906L, 29002290L, 29005612L), A1 = c(2L, 1L, 1L,
1L, 2L), A2 = c(1L, 2L, 2L, 2L, 1L), T = c(6L, 9L, 13L, 4L, 8L
), U = c(7L, 9L, 9L, 3L, 13L), OR = c(0.8571, 1, 1.444, 1.333,
0.6154), L95 = c(0.2881, 0.397, 0.6174, 0.2984, 0.2551), U95 = c(2.55,
2.519, 3.379, 5.957, 1.485), CHISQ = c(0.07692, 0, 0.7273, 0.1429,
1.19), P = c(0.7815, 1, 0.3938, 0.7055, 0.2752)), row.names = c(NA,
5L), class = "data.frame")
I calculated the q-values using the qvalue
library.
library(qvalue)
library(dplyr)
fdr <- qvalue(dat$P, fdr.level=0.05)
Finally, I want to find the number of SNPs with FDR adjusted p-values of p<.05.
# SNPs that have FDR adjusted p-values of p<.05
for(i in fdr$qvalues){
if(i>0.05){
fdr[!fdr$qvalues %in% i]
}
}
And found that there is one q-value > 0.05 and removed it. However, as shown below, the length of fdr$qvalues
remain the same, meaning that I did not remove the q-value > 0.05 element.
length(fdr$qvalues)
[1] 1422
Created on 2022-02-10 by the reprex package (v2.0.0)