当前位置: 首页 > 知识库问答 >
问题:

将看起来像utf-8的字符转换为真正的utf-8

翟浩穰
2023-03-14

我将python3和Nmap用于smb-os发现。nse脚本获取有关网络中endpoint的详细信息。特别是主机名。当主机名包含非ascii字符时,我会收到类似utf-8的字符串,如下所示:

"\\xD7\\x95\\xD7\\x95\\xD7\\x99\\xD7\\xA0\\xD7\\x93\\xD7\\x95\\xD7\\xA17"

我想要的是将此字符串转换为真正的utf-8(编码或解码,没关系),如下所示:

b"\xD7\x95\xD7\x95\xD7\x99\xD7\xA0\xD7\x93\xD7\x95\xD7\xA17" or "ווינדוס7"

我尝试了以下方法,但没有成功:

 - .replace("\\", "\")
 - .encode('latin1').decode()
 - .decode()

据我所知,问题在于脚本的输出以解码python字符串的形式给我utf-8编码字符。

可以自动转换吗?

注意:nmap 命令(请注意,它是 xml 输出,它是使用子进程计算的,并使用元素树进行解析):

nmap --script smb-os-discovery.nse -oX - 192.168.200.100

全输出示例:

<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE nmaprun>
<?xml-stylesheet href="file:///usr/bin/../share/nmap/nmap.xsl" type="text/xsl"?>
<!-- Nmap 7.12 scan initiated Sun May 27 07:12:27 2018 as: nmap -&#45;script smb-os-discovery.nse -oX - 192.168.200.100 -->
<nmaprun scanner="nmap" args="nmap -&#45;script smb-os-discovery.nse -oX - 192.168.200.100" start="1527405147" startstr="Sun May 27 07:12:27 2018" version="7.12" xmloutputversion="1.04">
<scaninfo type="syn" protocol="tcp" numservices="1000" services="1,3-4,6-7,9,13,17,19-26,30,32-33,37,42-43,49,53,70,79-85,88-90,99-100,106,109-111,113,119,125,135,139,143-144,146,161,163,179,199,211-212,222,254-256,259,264,280,301,306,311,340,366,389,406-407,416-417,425,427,443-445,458,464-465,481,497,500,512-515,524,541,543-545,548,554-555,563,587,593,616-617,625,631,636,646,648,666-668,683,687,691,700,705,711,714,720,722,726,749,765,777,783,787,800-801,808,843,873,880,888,898,900-903,911-912,981,987,990,992-993,995,999-1002,1007,1009-1011,1021-1100,1102,1104-1108,1110-1114,1117,1119,1121-1124,1126,1130-1132,1137-1138,1141,1145,1147-1149,1151-1152,1154,1163-1166,1169,1174-1175,1183,1185-1187,1192,1198-1199,1201,1213,1216-1218,1233-1234,1236,1244,1247-1248,1259,1271-1272,1277,1287,1296,1300-1301,1309-1311,1322,1328,1334,1352,1417,1433-1434,1443,1455,1461,1494,1500-1501,1503,1521,1524,1533,1556,1580,1583,1594,1600,1641,1658,1666,1687-1688,1700,1717-1721,1723,1755,1761,1782-1783,1801,1805,1812,1839-1840,1862-1864,1875,1900,1914,1935,1947,1971-1972,1974,1984,1998-2010,2013,2020-2022,2030,2033-2035,2038,2040-2043,2045-2049,2065,2068,2099-2100,2103,2105-2107,2111,2119,2121,2126,2135,2144,2160-2161,2170,2179,2190-2191,2196,2200,2222,2251,2260,2288,2301,2323,2366,2381-2383,2393-2394,2399,2401,2492,2500,2522,2525,2557,2601-2602,2604-2605,2607-2608,2638,2701-2702,2710,2717-2718,2725,2800,2809,2811,2869,2875,2909-2910,2920,2967-2968,2998,3000-3001,3003,3005-3007,3011,3013,3017,3030-3031,3052,3071,3077,3128,3168,3211,3221,3260-3261,3268-3269,3283,3300-3301,3306,3322-3325,3333,3351,3367,3369-3372,3389-3390,3404,3476,3493,3517,3527,3546,3551,3580,3659,3689-3690,3703,3737,3766,3784,3800-3801,3809,3814,3826-3828,3851,3869,3871,3878,3880,3889,3905,3914,3918,3920,3945,3971,3986,3995,3998,4000-4006,4045,4111,4125-4126,4129,4224,4242,4279,4321,4343,4443-4446,4449,4550,4567,4662,4848,4899-4900,4998,5000-5004,5009,5030,5033,5050-5051,5054,5060-5061,5080,5087,5100-5102,5120,5190,5200,5214,5221-5222,5225-5226,5269,5280,5298,5357,5405,5414,5431-5432,5440,5500,5510,5544,5550,5555,5560,5566,5631,5633,5666,5678-5679,5718,5730,5800-5802,5810-5811,5815,5822,5825,5850,5859,5862,5877,5900-5904,5906-5907,5910-5911,5915,5922,5925,5950,5952,5959-5963,5987-5989,5998-6007,6009,6025,6059,6100-6101,6106,6112,6123,6129,6156,6346,6389,6502,6510,6543,6547,6565-6567,6580,6646,6666-6669,6689,6692,6699,6779,6788-6789,6792,6839,6881,6901,6969,7000-7002,7004,7007,7019,7025,7070,7100,7103,7106,7200-7201,7402,7435,7443,7496,7512,7625,7627,7676,7741,7777-7778,7800,7911,7920-7921,7937-7938,7999-8002,8007-8011,8021-8022,8031,8042,8045,8080-8090,8093,8099-8100,8180-8181,8192-8194,8200,8222,8254,8290-8292,8300,8333,8383,8400,8402,8443,8500,8600,8649,8651-8652,8654,8701,8800,8873,8888,8899,8994,9000-9003,9009-9011,9040,9050,9071,9080-9081,9090-9091,9099-9103,9110-9111,9200,9207,9220,9290,9415,9418,9485,9500,9502-9503,9535,9575,9593-9595,9618,9666,9876-9878,9898,9900,9917,9929,9943-9944,9968,9998-10004,10009-10010,10012,10024-10025,10082,10180,10215,10243,10566,10616-10617,10621,10626,10628-10629,10778,11110-11111,11967,12000,12174,12265,12345,13456,13722,13782-13783,14000,14238,14441-14442,15000,15002-15004,15660,15742,16000-16001,16012,16016,16018,16080,16113,16992-16993,17877,17988,18040,18101,18988,19101,19283,19315,19350,19780,19801,19842,20000,20005,20031,20221-20222,20828,21571,22939,23502,24444,24800,25734-25735,26214,27000,27352-27353,27355-27356,27715,28201,30000,30718,30951,31038,31337,32768-32785,33354,33899,34571-34573,35500,38292,40193,40911,41511,42510,44176,44442-44443,44501,45100,48080,49152-49161,49163,49165,49167,49175-49176,49400,49999-50003,50006,50300,50389,50500,50636,50800,51103,51493,52673,52822,52848,52869,54045,54328,55055-55056,55555,55600,56737-56738,57294,57797,58080,60020,60443,61532,61900,62078,63331,64623,64680,65000,65129,65389"/>
<verbose level="0"/>
<debugging level="0"/>
<host starttime="1527405147" endtime="1527405149"><status state="up" reason="arp-response" reason_ttl="0"/>
<address addr="192.168.200.100" addrtype="ipv4"/>
<address addr="xx:xx:xx:xx:xx:xx" addrtype="mac" vendor="VMware"/>
<hostnames>
<hostname name="**************7.testlab.local" type="PTR"/>
</hostnames>
<ports><extraports state="closed" count="990">
<extrareasons reason="resets" count="990"/>
</extraports>
<port protocol="tcp" portid="135"><state state="open" reason="syn-ack" reason_ttl="128"/><service name="msrpc" method="table" conf="3"/></port>
<port protocol="tcp" portid="139"><state state="open" reason="syn-ack" reason_ttl="128"/><service name="netbios-ssn" method="table" conf="3"/></port>
<port protocol="tcp" portid="445"><state state="open" reason="syn-ack" reason_ttl="128"/><service name="microsoft-ds" method="table" conf="3"/></port>
<port protocol="tcp" portid="2030"><state state="open" reason="syn-ack" reason_ttl="128"/><service name="device2" method="table" conf="3"/></port>
<port protocol="tcp" portid="3389"><state state="open" reason="syn-ack" reason_ttl="128"/><service name="ms-wbt-server" method="table" conf="3"/></port>
<port protocol="tcp" portid="5357"><state state="open" reason="syn-ack" reason_ttl="128"/><service name="wsdapi" method="table" conf="3"/></port>
<port protocol="tcp" portid="5500"><state state="open" reason="syn-ack" reason_ttl="128"/><service name="hotline" method="table" conf="3"/></port>
<port protocol="tcp" portid="49152"><state state="open" reason="syn-ack" reason_ttl="128"/><service name="unknown" method="table" conf="3"/></port>
<port protocol="tcp" portid="49153"><state state="open" reason="syn-ack" reason_ttl="128"/><service name="unknown" method="table" conf="3"/></port>
<port protocol="tcp" portid="49154"><state state="open" reason="syn-ack" reason_ttl="128"/><service name="unknown" method="table" conf="3"/></port>
</ports>
<hostscript><script id="smb-os-discovery" output="&#xa;  OS: Windows 7 Professional 7601 Service Pack 1 (Windows 7 Professional 6.1)&#xa;  OS CPE: cpe:/o:microsoft:windows_7::sp1:professional&#xa;  Computer name: \xD7\x95\xD7\x95\xD7\x99\xD7\xA0\xD7\x93\xD7\x95\xD7\xA17&#xa;  NetBIOS computer name: \xE2\x80\xA6\xE2\x80\xA6\xE2\x80\xB0\xC2\x90\xC6\x91\xE2\x80\xA6\xE2\x80\x987&#xa;  Domain name: TestLab.Local&#xa;  Forest name: TestLab.Local&#xa;  FQDN: \xD7\x95\xD7\x95\xD7\x99\xD7\xA0\xD7\x93\xD7\x95\xD7\xA17.TestLab.Local&#xa;  System time: 2018-05-27T10:24:04+03:00&#xa;"><elem key="os">Windows 7 Professional 7601 Service Pack 1</elem>
<elem key="lanmanager">Windows 7 Professional 6.1</elem>
<elem key="server">\xE2\x80\xA6\xE2\x80\xA6\xE2\x80\xB0\xC2\x90\xC6\x91\xE2\x80\xA6\xE2\x80\x987\x00</elem>
<elem key="date">2018-05-27T10:24:04+03:00</elem>
<elem key="fqdn">\xD7\x95\xD7\x95\xD7\x99\xD7\xA0\xD7\x93\xD7\x95\xD7\xA17.TestLab.Local</elem>
<elem key="domain_dns">TestLab.Local</elem>
<elem key="forest_dns">TestLab.Local</elem>
<elem key="workgroup">TESTLAB\x00</elem>
<elem key="cpe">cpe:/o:microsoft:windows_7::sp1:professional</elem>
</script></hostscript><times srtt="214" rttvar="58" to="100000"/>
</host>
<runstats><finished time="1527405149" timestr="Sun May 27 07:12:29 2018" elapsed="2.24" summary="Nmap done at Sun May 27 07:12:29 2018; 1 IP address (1 host up) scanned in 2.24 seconds" exit="success"/><hosts up="1" down="0" total="1"/>
</runstats>
</nmaprun>

共有1个答案

裴宏壮
2023-03-14

溶液:

正如@tripleee所指出的,我最终使用了一个反向repr函数,如下所示:我有这个字符串:

a="\\xD7\\x95\\xD7\\x95\\xD7\\x99\\xD7\\xA0\\xD7\\x93\\xD7\\x95\\xD7\\xA17"

想要这个字符串:

a="ווינדוס7"

首先,我创建了一个格式化字符串,如下所示:

b="b'{}'".format(a)

然后,我使用literal_evalfunc,如下所示:

c=literal_eval(b)

正如eval所期望的那样,我收到了这个“c”格式的输出:

c=b'\xd7\x95\xd7\x95\xd7\x99\xd7\xa0\xd7\x93\xd7\x95\xd7\xa17'

然后,我将这个字节字符串解码为 utf-8 字符串::

d=c.decode('utf-8')

并得到了我想要的结果:

d="ווינדוס7"
 类似资料:
  • 问题内容: 从我的数据库即时通讯获取以下字符串: 它应该说的是: 我试图像这样将其转换为utf-8: 但是仍然是同样的问题。 有任何想法吗? 问题答案: 使用npm中的utf8模块对字符串进行编码/解码。 安装: 在浏览器中: 在Node.js中: API: 编码: 将任何给定的JavaScript字符串(字符串)编码为UTF-8,并返回该字符串的UTF-8编码版本。如果输入字符串包含非标量值(即

  • 问题内容: 我需要将unicode字符串转换为具有以unicode编码的非ascii字符的字符串。例如,字符串“汉字Max”应显示为“ \ u6F22 \ u5B57 Max”。 我尝试过的 的不同组合 新的String(sourceString.getBytes(encoding1),encoding2) Apache StringEscapeUtils也会转义ascii字符,例如双引号 Str

  • 问题内容: 我们有一个数据输入人员,他在Windows上使用UTF-16编码,并且希望拥有utf-8并删除BOM。utf-8转换有效,但BOM仍然存在。我将如何删除?这是我目前拥有的: 如果我使用hexdump -CI,请参阅: 在结果文件中。如何删除物料清单? 谢谢 问题答案: 只需使用和: 将为您摆脱BOM表(并推断出字节序)。

  • 我正在使用下面的代码从SVG文件创建PNG,并将其写入Servlet输出流。SVG文件的轴名中包含UTF-8(日语)字符。 上面的代码在我的本地程序中运行良好,但是当我上传到服务器时,它显示的是正方形而不是字符。当有英文字符而不是日文字符时,它在本地和服务器上都能正常工作。 你能告诉我你是否面临同样的问题。会不会和Tomcat有关? 我已经从服务器上检查了SVG文件,UTF-8字符在其中可见。我还

  • 我试图保存一个字符串在希伯来文文件,同时有文件ANSI编码。恐怕所有的尝试都失败了。 PHP文件本身是UTF-8 这是我正在尝试的代码: 由于某种原因,返回false。 另一次尝试是: 这返回一个空字符串。当这不起作用时,将输出字符集更改为Windows-1255的工作。所以函数本身工作,但由于某种原因,它不转换为1252。 我运行这个函数之前和之后的和打印的结果 在图标之前编码是UTF-8,在图

  • 我有一个从第三方应用程序收到的字符串,我希望在我的Windows界面上使用C#以任何语言正确显示它。 由于编码不正确,我的字符串中有一段在西班牙语中如下所示: AcciÃn 然而它应该看起来像这样: 阿克松 根据这个问题的答案:如何知道C#中的字符串编码,我接收到的编码应该已经在UTF-8中出现了,但它是在编码时读取的。默认值(可能是ANSI?)。 我试图将这个字符串转换成真正的UTF-8,但问题