I have several .txt files, each consisting of different comma-separated values, e.g.:
TFF,BAP,VAP,DNAAF5,CDKN2B,PDE2D,SLC22A19,RBPJ,STAT1,TAP2,HLA-
I have probably done a wrong split earlier in the code, so the text was split on '-'; when I double-click one value, it selects the whole line up to the '-'. This mistake did not affect the code up to this step. Now I need to count the occurrences of each value with "Counter", and the count is wrong.
My code:
    import glob
    import os
    from collections import Counter

    gene_calc = r'C:\Users\MrD\Top'
    new_dir = r'C:\\Users\\MrD\\Br_Count\\Frequency\\'

    # create the output folder once if it does not exist yet
    if not os.path.exists(new_dir):
        os.mkdir(new_dir)

    os.chdir(gene_calc)
    for files in glob.glob(os.path.join('*.txt*')):
        #print(files) # iterating over files to check if prints
        with open(files) as f:
            content = (line for line in f.read().splitlines())
            list = Counter(Vol for Vol in content).most_common()
        with open(new_dir + files, "w") as output:
            output.write(str(list))
The files in the gene_calc folder contain values as shown in the example above.
I couldn't re-split the result (I tried "if ',' in gene_list:" and reversing it with .reverse(), but it is already a list of tuples).
At the moment you are counting lines, because each element produced by splitlines() is a whole line.
To count individual items, you need a second split on ',' within each line.
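A minimal sketch of that second split, assuming each file holds one or more lines of comma-separated values like the example in the question. The helper name `count_values` is hypothetical and only for illustration; it strips surrounding whitespace and skips empty entries, but otherwise leaves the values (including any '-') untouched.

```python
from collections import Counter


def count_values(text):
    """Count comma-separated values across all lines of text."""
    return Counter(
        item.strip()                   # drop whitespace around each value
        for line in text.splitlines()  # first split: lines
        for item in line.split(",")    # second split: values within a line
        if item.strip()                # ignore empty entries, e.g. from ",,"
    ).most_common()


sample = "TFF,BAP,VAP\nBAP,TFF,BAP"
print(count_values(sample))  # [('BAP', 3), ('TFF', 2), ('VAP', 1)]
```

In the loop from the question, this would replace the two lines that build `content` and the counter, e.g. `counts = count_values(f.read())`, which also avoids shadowing the built-in `list`.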