python_48_Character Encoding and Transcoding in Python 3

Time: 2023-03-10 08:37:28
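In Python 3, strings (str) are Unicode by default and source files are read as UTF-8 by default, so the # -*- coding:utf-8 -*- declaration is not required; if it is declared, the source file is still treated as UTF-8.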
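unicode='你好'
print('utf-8:',unicode.encode())  # encode() defaults to UTF-8; the result is a bytes object
unicode_to_gbk=unicode.encode('gbk')  # str is already Unicode, so it can be encoded to GBK directly, with no decode first
print(unicode_to_gbk)  # print the GBK bytes
gbk_to_utf8=unicode_to_gbk.decode('gbk').encode('utf-8')  # GBK to UTF-8: decode to str, then encode, the same as in Python 2
print('utf-8:',gbk_to_utf8)
print(gbk_to_utf8.decode('utf-8'))  # decoding the bytes turns them back into the Chinese text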
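These characters encode to the same bytes in GBK and GB2312, because GB18030 is backward compatible with GBK, and GBK is backward compatible with GB2312.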
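A minimal sketch of that compatibility, using only the built-in codecs: it encodes the same string with gb2312, gbk, and gb18030 and compares the results. The character '龍' is an assumption chosen for illustration, as an example of a traditional character that GBK covers but GB2312 does not.

```python
text = '你好'

# Encode the same str with the three related Chinese codecs.
gb2312_bytes = text.encode('gb2312')
gbk_bytes = text.encode('gbk')
gb18030_bytes = text.encode('gb18030')

# For characters that exist in GB2312, all three codecs give identical bytes.
same_bytes = gb2312_bytes == gbk_bytes == gb18030_bytes
print(same_bytes)

# '龍' is used here as an illustrative character outside GB2312.
extra = '龍'
try:
    extra.encode('gb2312')
    in_gb2312 = True
except UnicodeEncodeError:
    in_gb2312 = False
print(in_gb2312)

# Bytes produced by the smaller GBK set can be decoded by the larger GB18030 set.
print(extra.encode('gbk').decode('gb18030') == extra)
```

The compatibility only goes one way: bytes written with the smaller character set can be read with the larger one, but not every character in the larger set can be encoded with the smaller one.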

unicode='你好'.encode('utf-8')  # in Python 3, encode a Unicode str to UTF-8 bytes
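The decode-then-encode step used for the GBK to UTF-8 conversion can be wrapped in a small helper. This is only a sketch: the function name transcode and its parameters are hypothetical and not part of the original post.

```python
def transcode(data: bytes, source: str, target: str) -> bytes:
    """Re-encode bytes from one codec to another (hypothetical helper).

    Bytes cannot be converted between encodings directly: they are first
    decoded to a Unicode str, then encoded with the target codec.
    """
    return data.decode(source).encode(target)


gbk_bytes = '你好'.encode('gbk')
utf8_bytes = transcode(gbk_bytes, 'gbk', 'utf-8')

print(utf8_bytes)                  # the UTF-8 bytes
print(utf8_bytes.decode('utf-8'))  # back to the original text
```

Passing the wrong source codec raises UnicodeDecodeError or silently produces garbled text, so the caller needs to know how the bytes were originally encoded.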