|
|
马上注册,结交更多好友,享用更多功能^_^
您需要 登录 才可以下载或查看,没有账号?立即注册
x
文件“测试.txt”中的内容如下:
GWHAOPM00000765 EVM gene 45928 49103 . + 0 ID=OeCYP51G64a;Accession=GWHGAOPM050481; ;transl_table=1
GWHAOPM00000765 EVM mRNA 45928 49103 . + 0 ID=OeCYP51G64a.1;Accession=GWHTAOPM050481;Parent=OeCYP51G64a;Parent_Accession=GWHGAOPM050481; ;transl_table=1
GWHAOPM00000765 EVM exon 45928 46049 . + 0 ID=Contig001288_ERROPOS3153804__exon_45928_46049;Parent=OeCYP51G64a.1;Parent_Accession=GWHTAOPM050481;;transl_table=1
GWHAOPM00000765 EVM CDS 45928 46049 . + 0 ID=Contig001288_ERROPOS3153804__CDS_45928_46049;Parent=OeCYP51G64a.1;Parent_Accession=GWHTAOPM050481;Protein_Accession=GWHPAOPM050481;;transl_table=1
GWHAOPM00000765 EVM exon 46433 46542 . + 0 ID=Contig001288_ERROPOS3153804__exon_46433_46542;Parent=OeCYP51G64a.1;Parent_Accession=GWHTAOPM050481;;transl_table=1
GWHAOPM00000765 EVM CDS 46433 46542 . + 1 ID=Contig001288_ERROPOS3153804__CDS_46433_46542;Parent=OeCYP51G64a.1;Parent_Accession=GWHTAOPM050481;Protein_Accession=GWHPAOPM050481;;transl_table=1
GWHAOPM00000765 EVM exon 46575 46801 . + 0 ID=Contig001288_ERROPOS3153804__exon_46575_46801;Parent=OeCYP51G64a.1;Parent_Accession=GWHTAOPM050481;;transl_table=1
GWHAOPM00000765 EVM CDS 46575 46801 . + 2 ID=Contig001288_ERROPOS3153804__CDS_46575_46801;Parent=OeCYP51G64a.1;Parent_Accession=GWHTAOPM050481;Protein_Accession=GWHPAOPM050481;;transl_table=1
GWHAOPM00000765 EVM exon 48105 49103 . + 0 ID=Contig001288_ERROPOS3153804__exon_48105_49103;Parent=OeCYP51G64a.1;Parent_Accession=GWHTAOPM050481;;transl_table=1
GWHAOPM00000765 EVM CDS 48105 49103 . + 0 ID=Contig001288_ERROPOS3153804__CDS_48105_49103;Parent=OeCYP51G64a.1;Parent_Accession=GWHTAOPM050481;Protein_Accession=GWHPAOPM050481;;transl_table=1
文件“测试1.txt”中的内容如下:
OeCYP51G64a
代码如下:
a = open('.\\测试.txt')
b = open('.\\测试1.txt')
c = open('.\\数量1.txt','w')
gff = a.readlines()
cyp = b.readlines()
shul = ['CYPname\texon\tintron\n']
for i in cyp:
d = ''
st = ''
p450 = i.strip()
for m in gff:
if p450 in m:
st += m
d = p450 + '\t' + str(st.count('exon')/2) + '\t' + str(st.count('exon')/2-1) + '\n'
print(st)
print(st.count('exon'))
print(st.count('exon')/2)
assert 3>4
shul.append(d)
c.writelines(shul)
a.close()
b.close()
c.close()
问题:文件“测试.txt”中明明‘exon’出现的次数只有4次,为什么会被count成8次呢
你这里单查exon 肯定是8个啊 你要查前后都有空格的 就是4个了
>>> len(re.findall("exon",ts))
8
>>> len(re.findall(" exon ",ts))
4
>>>
|
-
|