- pw = "pz ZcG pkHMcG, eY zgm 1950G, H rQcIq cx QmGmHQWgmQG kmu iB AmzQcqckeG, eYWkIueYr hcgY JcY vmIMHYY HYu tzHYeGkHl PkHM, umJmkcqmu zgm AcYzm yHQkc Mmzgcu. KmYmQHkkB GqmHXeYr, zgm AcYzm yHQkc Mmzgcu eG H GzHzeGzeWHk HqqQcHWg zc GckJm umzmQMeYeGzeW MHYB-icuB qQcikmMG. VY 1953 AmzQcqckeG Wc-HIzgcQmu zgm xeQGz qHqmQ cY H zmWgYeaIm zgHz lHG WmYzQHk zc zgm Mmzgcu Ycl XYclY HG GeMIkHzmu HYYmHkeYr. LgeG kHYuMHQX qHqmQ Ggclmu zgm xeQGz YIMmQeWHk GeMIkHzecYG cx H keaIeu. Lgm HkrcQezgM xcQ rmYmQHzeYr GHMqkmG xQcM zgm CckznMHYY ueGzQeiIzecY lHG kHzmQ rmYmQHkenmu iB j.O. EHGzeYrG zc imWcMm zgm AmzQcqckeG-EHGzeYrG HkrcQezgM. Em eG WQmuezmu HG qHQz cx zgm zmHM zgHz WHMm Iq lezg zgm YHMm AcYzm yHQkc Mmzgcu eY QmxmQmYWm zc H WckkmHrIm'G QmkHzeJm'G kcJm xcQ zgm WHGeYcG cx AcYzm yHQkc. AcYzm yHQkc MmzgcuG HQm H WkHGG cx WcMqIzHzecYHk HkrcQezgMG zgHz QmkB cY QmqmHzmu QHYucM GHMqkeYr zc WcMqIzm zgmeQ QmGIkzG."
- from collections import Counter
- cp = Counter(pw).most_common()
- print(cp)
This gives:
- [(' ', 140), ('m', 82), ('z', 70), ('H', 67), ('c', 65), ('Y', 47), ('G', 46), ('Q', 45), ('e', 44), ('k', 42), ('g', 32), ('M', 29), ('u', 23), ('q', 20), ('W', 18), ('I', 15), ('r', 14), ('x', 11), ('.', 9), ('A', 8), ('l', 7), ('i', 6), ('B', 6), (',', 5), ('J', 5), ('y', 5), ('X', 3), ('-', 3), ('E', 3), ('p', 2), ('1', 2), ('9', 2), ('5', 2), ('a', 2), ('L', 2), ('n', 2), ("'", 2), ('Z', 1), ('0', 1), ('h', 1), ('v', 1), ('t', 1), ('P', 1), ('K', 1), ('V', 1), ('3', 1), ('C', 1), ('j', 1), ('O', 1)]
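Leaving out the space, the most frequent characters are, in order, "mzHc". These four occur far more often than any other character, which fits the letter-frequency ranking from Wikipedia. "m" is clearly the most frequent, so it almost certainly stands for "e". The counts for "z", "H", and "c" are too close to rank reliably, so each assignment of them to "t", "a", and "o" has to be tried in turn.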
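The trial-and-error step can be sketched roughly as follows. This is only a minimal illustration, not necessarily how the original attempt was carried out: `sample` is a short excerpt copied from the ciphertext above, `partial_decrypt` is a hypothetical helper name, and the mapping "m" to "e" is taken as given. Unmapped letters are shown as "_" so that word shapes are easy to judge by eye.

```python
from itertools import permutations

# Short excerpt of the ciphertext (assumption: any slice of pw would work here).
sample = "eY zgm 1950G, H rQcIq cx QmGmHQWgmQG"

def partial_decrypt(text, mapping):
    """Substitute the letters we have guessed; mask other letters with '_'."""
    out = []
    for ch in text:
        if ch in mapping:
            out.append(mapping[ch])
        elif ch.isalpha():
            out.append("_")      # letter not guessed yet
        else:
            out.append(ch)       # keep digits, spaces, punctuation
    return "".join(out)

# Try all 3! = 6 ways of assigning "t", "a", "o" to the cipher letters "z", "H", "c".
candidates = []
for perm in permutations("tao"):
    mapping = {"m": "e"}                 # taken as fixed from the frequency count
    mapping.update(zip("zHc", perm))
    candidates.append((perm, partial_decrypt(sample, mapping)))

for perm, text in candidates:
    print("zHc ->", "".join(perm), "|", text)
```

Reading through the six lines, the assignment that produces plausible fragments such as "t_e" (likely "the") and a standalone "a" is the one to keep; further letters can then be guessed from the partially revealed words.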