A problem when using fontutils.py to take the union of my fonts

Question

daixiangzi opened this issue 4 years ago · 3 comments

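I used fontutils.py to take the union of the characters in my fonts and got a file similar to your word.txt. However, when looking up class indices with idx = [chars[c] for c in text], I get a KeyError for digits. I looked into it and found that the digits in my saved word.txt are encoded as windows-1252, while my system uses UTF-8, which is why this happens. Have you run into this before?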

Answer 1 · 2020-06-05T06:44:32.000Z

Here is my test code.
import sys
import codecs

# Read the character list (one entry per line) as UTF-8.
with codecs.open(sys.argv[1], mode='r', encoding='utf-8') as f:
    lines = f.readlines()
words = [l.strip() for l in lines]

# Map each character to its line index.
dicts = {}
for i, char in enumerate(words):
    print(char)
    dicts[char] = i
print(dicts['4'])
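
When a lookup like the one above raises KeyError, it can help to print the code point of each character that is missing from the dictionary, since characters that look identical on screen may differ underneath. The following is a minimal sketch; the helper name describe_missing and the fullwidth-digit dictionary are hypothetical illustrations, not something taken from this repository and not the cause reported in this thread.

```python
import unicodedata


def describe_missing(text, chars):
    """Return (char, code point, Unicode name) for each character of text not in chars."""
    report = []
    for c in text:
        if c not in chars:
            name = unicodedata.name(c, "<unnamed>")
            report.append((c, "U+%04X" % ord(c), name))
    return report


# Hypothetical dictionary whose "digit" is the fullwidth form U+FF14, not ASCII "4".
chars = {"\uff14": 0, "a": 1}
print(describe_missing("a4", chars))
# [('4', 'U+0034', 'DIGIT FOUR')]
```

Comparing these code points with those of the keys stored in word.txt shows whether the two sides really hold the same characters.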

Answer 2 · 2020-06-05T09:37:42.000Z

Hello, please use UTF-8 encoding when writing word.txt as well, then try again.
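
As a rough illustration of this suggestion, the sketch below writes a character list with an explicit UTF-8 encoding and reads it back the same way before building the lookup dictionary. The function names write_charset and load_charset, the temporary path, and the sample characters are assumptions made for the example; how fontutils.py actually writes the file is not shown in this thread.

```python
import os
import tempfile


def write_charset(path, charset):
    """Write one character per line, always encoded as UTF-8."""
    with open(path, "w", encoding="utf-8") as f:
        for c in charset:
            f.write(c + "\n")


def load_charset(path):
    """Read the file back as UTF-8 and map each character to its line index."""
    with open(path, "r", encoding="utf-8") as f:
        # Strip only the line terminator so a space character stays a valid entry.
        words = [line.rstrip("\r\n") for line in f]
    return {c: i for i, c in enumerate(words)}


# Hypothetical character set; a real one would come from the font union step.
path = os.path.join(tempfile.mkdtemp(), "word.txt")
write_charset(path, ["0", "4", "中", "a"])
chars = load_charset(path)
idx = [chars[c] for c in "a4中"]
print(idx)  # [3, 1, 2]
```

Passing encoding="utf-8" on both the write and the read side avoids depending on the platform default, which can differ between Windows and Linux.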

Answer 3 · 2020-06-05T11:11:57.000Z

OK, I found the cause. It was a bug in my own code.
