[Solved] Why does .isalpha() return True for a string made up entirely of Chinese characters?

哈n0 · Posted on 2018-2-18 01:05:21

Why do I get True when I use .isalpha() to check a string that consists entirely of Chinese characters?

Best answer

solomonxian

2018-2-18 11:41:49

I tried it, and it really does return True.

I never knew that before.
So I checked the documentation:

str.isalpha()
Return true if all characters in the string are alphabetic and there is at least one character, false otherwise. Alphabetic characters are those characters defined in the Unicode character database as “Letter”, i.e., those with general category property being one of “Lm”, “Lt”, “Lu”, “Ll”, or “Lo”. Note that this is different from the “Alphabetic” property defined in the Unicode Standard.

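In other words, for Unicode strings (str), isalpha() is based on the characters that Unicode defines as "Letter", not on plain alphabet letters in the everyday sense. Chinese characters fall into the "Lo" (Letter, other) category, so they count as alphabetic.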
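To see the categories the documentation refers to, the standard unicodedata module can be used. This is only a small sketch; the helper name letter_categories is made up for illustration and is not from the thread.

```python
import unicodedata


def letter_categories(text):
    """Return the Unicode general category of each character in text.

    Hypothetical helper for illustration only.
    """
    return [unicodedata.category(ch) for ch in text]


# Chinese characters are "Lo" (Letter, other), which str.isalpha() accepts.
print(letter_categories("中文"))   # ['Lo', 'Lo']
# Latin letters are "Ll" (lowercase) or "Lu" (uppercase).
print(letter_categories("aB"))    # ['Ll', 'Lu']

print("中文".isalpha())    # True
print("中文1".isalpha())   # False, because "1" is a digit (category "Nd")
```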

There is also a separate documentation entry:

bytes.isalpha()
bytearray.isalpha()
Return true if all bytes in the sequence are alphabetic ASCII characters and the sequence is not empty, false otherwise. Alphabetic ASCII characters are those byte values in the sequence b'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ'

The isalpha() method of byte strings, by contrast, appears to test only for plain ASCII letters.

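So I'd suggest using the isalpha() method of a b string (a bytes object) instead, that is, encoding the string first and calling isalpha() on the result.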
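The example that originally followed this post was not preserved, so the following is only a sketch of what such a check might look like. The function name is_ascii_letters and the choice of UTF-8 are assumptions, not part of the original answer.

```python
def is_ascii_letters(text):
    """Return True only if text is non-empty and contains just A-Z / a-z.

    Hypothetical helper: encodes to UTF-8 (an assumed encoding) and then
    relies on bytes.isalpha(), which only accepts ASCII letters.
    """
    return text.encode("utf-8").isalpha()


print(is_ascii_letters("hello"))    # True
print(is_ascii_letters("中文"))      # False: the UTF-8 bytes are not ASCII letters
print(is_ascii_letters("hello中文"))  # False: mixed content fails as well
print(is_ascii_letters(""))         # False: an empty sequence is never alphabetic
```

UTF-8 is used here because ASCII letters keep their single-byte values in it; an encoding such as UTF-16 would add extra bytes and make even "hello" fail the check. On Python 3.7 and later, text.isascii() and text.isalpha() should give the same answer without encoding.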

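哈n0 · Posted on 2018-2-18 13:15:05

solomonxian posted on 2018-2-18 11:41
I tried it, and it really does return True.
I never knew that before.
So I checked the documentation:

I'm a beginner and don't quite understand your last sentence.
What is the isalpha of a b string?
And why can Chinese be checked with ASCII codes?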

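solomonxian · Posted on 2018-2-18 16:46:35

哈n0 posted on 2018-2-18 13:15
I'm a beginner and don't quite understand your last sentence.
What is the isalpha of a b string?
And why can Chinese be checked with ASCII codes?

I'm using Python 3.5.
Strings mainly come in two kinds, str and bytes (there is also bytearray, a mutable version of bytes, which you don't need to worry about for now).
You convert between them with encode and decode.

If any of the terms above are unfamiliar, you'll need to look them up (for example on Baidu).

Both classes, str and bytes, have an isalpha() method; that is what the two documentation excerpts I posted above describe.
It is not that ASCII can check Chinese.
Rather, you encode the Chinese text with encode, and the resulting bytes are then recognized as not being plain letters.
Have a careful look at what the documentation says.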
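A short sketch of the str/bytes round trip described above. The UTF-8 encoding and the variable names are assumptions chosen for illustration.

```python
text = "中文"                  # a str: a sequence of Unicode characters
data = text.encode("utf-8")   # a bytes object: the UTF-8 encoding of text

print(type(text).__name__, type(data).__name__)  # str bytes
print(data)                                      # b'\xe4\xb8\xad\xe6\x96\x87'
print(data.decode("utf-8") == text)              # True: decode reverses encode

# The same method name behaves differently on the two types.
print(text.isalpha())   # True: both characters are Unicode letters
print(data.isalpha())   # False: none of the bytes is an ASCII letter
```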

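哈n0 · Posted on 2018-2-19 20:35:58

solomonxian posted on 2018-2-18 16:46
I'm using Python 3.5.
Strings mainly come in two kinds, str and bytes (there is also bytearray, a mutable version of bytes, which you don't need to ...

Thanks a lot!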
