how to run this project.

Question

how to run this project.

narvin opened this issue 8 years ago · 11 comments

hello sir.
you have did great work to post this code.
but i am getting this error.
please help me!
File "model.py", line 123, in
train(parser.parse_args())
File "model.py", line 81, in train
train_inp, train_out = get_train_data()
File "/home/narvin123/ner-lstm-master/input.py", line 6, in get_train_data
emb = pickle.load(open('embeddings/train_embed.pkl', 'rb'))
IOError: [Errno 2] No such file or directory: 'embeddings/train_embed.pkl'

Answer 1 · 2017-04-22T13:47:16.000Z

you need to generate the embeddings from the training dataset first. Example if it is the conll dataset. Run the get_conll_embeddings.py to generate the embeddings and use them as input to train the model.

Answer 2 · 2018-05-29T06:17:27.000Z

when ihave tried to run resize_input .py file as following im getting error
python resize_input.py --input /home/kusum/Desktop/codes/ner-lstm-master/data/input/ --output /home/kusum/Desktop/codes/ner-lstm-master/data/output/ --trim 70
Traceback (most recent call last):
File "resize_input.py", line 55, in
remove_crap(args.input)
File "resize_input.py", line 7, in remove_crap
f = open(input_file)
IOError: [Errno 21] Is a directory: '/home/kusum/Desktop/codes/ner-lstm-master/data/input/'

Answer 3 · 2018-05-29T06:24:07.000Z

Use the resize input on files and not on directories

…

On Tue, May 29, 2018, 11:51 AM kusumlata123 ***@***.***> wrote: why i am getting this errror python resize_input.py --input /home/kusum/Desktop/codes/ner-lstm-master/data/input/ --output /home/kusum/Desktop/codes/ner-lstm-master/data/output/ --trim 70 Traceback (most recent call last): File "resize_input.py", line 55, in remove_crap(args.input) File "resize_input.py", line 7, in remove_crap f = open(input_file) IOError: [Errno 21] Is a directory: '/home/kusum/Desktop/codes/ner-lstm-master/data/input/' — You are receiving this because you modified the open/close state. Reply to this email directly, view it on GitHub <#6 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/ALSRmyOE_LtAF7-9aMUXD0RVgaMyfXYlks5t3Oj3gaJpZM4MEA-1> .

Answer 4 · 2018-05-29T06:29:05.000Z

sir this resize_input file , i can apply on utf -8 hindi text data. na On Tue, May 29, 2018 at 11:54 AM, Shreenivas Bharadwaj < notifications@github.com> wrote:

…

Use the resize input on files and not on directories On Tue, May 29, 2018, 11:51 AM kusumlata123 ***@***.***> wrote: > why i am getting this errror > python resize_input.py --input > /home/kusum/Desktop/codes/ner-lstm-master/data/input/ --output > /home/kusum/Desktop/codes/ner-lstm-master/data/output/ --trim 70 > Traceback (most recent call last): > File "resize_input.py", line 55, in > remove_crap(args.input) > File "resize_input.py", line 7, in remove_crap > f = open(input_file) > IOError: [Errno 21] Is a directory: > '/home/kusum/Desktop/codes/ner-lstm-master/data/input/' > > — > You are receiving this because you modified the open/close state. > Reply to this email directly, view it on GitHub > <#6 (comment) >, > or mute the thread > <https://github.com/notifications/unsubscribe-auth/ALSRmyOE_LtAF7- 9aMUXD0RVgaMyfXYlks5t3Oj3gaJpZM4MEA-1> > . > — You are receiving this because you commented. Reply to this email directly, view it on GitHub <#6 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AlRARPtQh295VLWnXGc-BbNWN0mSL-BBks5t3OmKgaJpZM4MEA-1> .

Answer 5 · 2018-05-29T06:49:56.000Z

There is a sample corpus provided, you can check whether it is utf8 or unicode On Tue, May 29, 2018, 11:59 AM kusumlata123 <notifications@github.com> wrote:

…

sir this resize_input file , i can apply on utf -8 hindi text data. na On Tue, May 29, 2018 at 11:54 AM, Shreenivas Bharadwaj < ***@***.***> wrote: > Use the resize input on files and not on directories > > On Tue, May 29, 2018, 11:51 AM kusumlata123 ***@***.***> > wrote: > > > why i am getting this errror > > python resize_input.py --input > > /home/kusum/Desktop/codes/ner-lstm-master/data/input/ --output > > /home/kusum/Desktop/codes/ner-lstm-master/data/output/ --trim 70 > > Traceback (most recent call last): > > File "resize_input.py", line 55, in > > remove_crap(args.input) > > File "resize_input.py", line 7, in remove_crap > > f = open(input_file) > > IOError: [Errno 21] Is a directory: > > '/home/kusum/Desktop/codes/ner-lstm-master/data/input/' > > > > — > > You are receiving this because you modified the open/close state. > > Reply to this email directly, view it on GitHub > > < #6 (comment) > >, > > or mute the thread > > <https://github.com/notifications/unsubscribe-auth/ALSRmyOE_LtAF7- > 9aMUXD0RVgaMyfXYlks5t3Oj3gaJpZM4MEA-1> > > . > > > > — > You are receiving this because you commented. > Reply to this email directly, view it on GitHub > <#6 (comment) >, > or mute the thread > < https://github.com/notifications/unsubscribe-auth/AlRARPtQh295VLWnXGc-BbNWN0mSL-BBks5t3OmKgaJpZM4MEA-1 > > . > — You are receiving this because you modified the open/close state. Reply to this email directly, view it on GitHub <#6 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/ALSRmxGoN2Pt0P66yG8HGMBjy5XdRhXGks5t3OqzgaJpZM4MEA-1> .

Answer 6 · 2018-05-29T07:15:03.000Z

Use hindi_util.py to convert to conll format then proceed On Tue, May 29, 2018, 12:19 PM V.Shreenivas Bharadwaj < vshreenivasbharadwaj@gmail.com> wrote:

…

There is a sample corpus provided, you can check whether it is utf8 or unicode On Tue, May 29, 2018, 11:59 AM kusumlata123 ***@***.***> wrote: > sir this resize_input file , i can apply on utf -8 hindi text data. na > > On Tue, May 29, 2018 at 11:54 AM, Shreenivas Bharadwaj < > ***@***.***> wrote: > > > Use the resize input on files and not on directories > > > > On Tue, May 29, 2018, 11:51 AM kusumlata123 ***@***.***> > > wrote: > > > > > why i am getting this errror > > > python resize_input.py --input > > > /home/kusum/Desktop/codes/ner-lstm-master/data/input/ --output > > > /home/kusum/Desktop/codes/ner-lstm-master/data/output/ --trim 70 > > > Traceback (most recent call last): > > > File "resize_input.py", line 55, in > > > remove_crap(args.input) > > > File "resize_input.py", line 7, in remove_crap > > > f = open(input_file) > > > IOError: [Errno 21] Is a directory: > > > '/home/kusum/Desktop/codes/ner-lstm-master/data/input/' > > > > > > — > > > You are receiving this because you modified the open/close state. > > > Reply to this email directly, view it on GitHub > > > < > #6 (comment) > > >, > > > or mute the thread > > > <https://github.com/notifications/unsubscribe-auth/ALSRmyOE_LtAF7- > > 9aMUXD0RVgaMyfXYlks5t3Oj3gaJpZM4MEA-1> > > > . > > > > > > > — > > You are receiving this because you commented. > > Reply to this email directly, view it on GitHub > > <#6 (comment) > >, > > or mute the thread > > < > https://github.com/notifications/unsubscribe-auth/AlRARPtQh295VLWnXGc-BbNWN0mSL-BBks5t3OmKgaJpZM4MEA-1 > > > > . > > > > — > You are receiving this because you modified the open/close state. > Reply to this email directly, view it on GitHub > <#6 (comment)>, > or mute the thread > <https://github.com/notifications/unsubscribe-auth/ALSRmxGoN2Pt0P66yG8HGMBjy5XdRhXGks5t3OqzgaJpZM4MEA-1> > . >

Answer 7 · 2018-05-31T04:24:26.000Z

I didn't able to run that file. On Tue, 29 May 2018, 12:45 Shreenivas Bharadwaj, <notifications@github.com> wrote:

…

Use hindi_util.py to convert to conll format then proceed On Tue, May 29, 2018, 12:19 PM V.Shreenivas Bharadwaj < ***@***.***> wrote: > There is a sample corpus provided, you can check whether it is utf8 or > unicode > > On Tue, May 29, 2018, 11:59 AM kusumlata123 ***@***.***> > wrote: > >> sir this resize_input file , i can apply on utf -8 hindi text data. na >> >> On Tue, May 29, 2018 at 11:54 AM, Shreenivas Bharadwaj < >> ***@***.***> wrote: >> >> > Use the resize input on files and not on directories >> > >> > On Tue, May 29, 2018, 11:51 AM kusumlata123 ***@***.*** > >> > wrote: >> > >> > > why i am getting this errror >> > > python resize_input.py --input >> > > /home/kusum/Desktop/codes/ner-lstm-master/data/input/ --output >> > > /home/kusum/Desktop/codes/ner-lstm-master/data/output/ --trim 70 >> > > Traceback (most recent call last): >> > > File "resize_input.py", line 55, in >> > > remove_crap(args.input) >> > > File "resize_input.py", line 7, in remove_crap >> > > f = open(input_file) >> > > IOError: [Errno 21] Is a directory: >> > > '/home/kusum/Desktop/codes/ner-lstm-master/data/input/' >> > > >> > > — >> > > You are receiving this because you modified the open/close state. >> > > Reply to this email directly, view it on GitHub >> > > < >> #6 (comment) >> > >, >> > > or mute the thread >> > > <https://github.com/notifications/unsubscribe-auth/ALSRmyOE_LtAF7- >> > 9aMUXD0RVgaMyfXYlks5t3Oj3gaJpZM4MEA-1> >> > > . >> > > >> > >> > — >> > You are receiving this because you commented. >> > Reply to this email directly, view it on GitHub >> > < #6 (comment) >> >, >> > or mute the thread >> > < >> https://github.com/notifications/unsubscribe-auth/AlRARPtQh295VLWnXGc-BbNWN0mSL-BBks5t3OmKgaJpZM4MEA-1 >> > >> > . >> > >> >> — >> You are receiving this because you modified the open/close state. >> Reply to this email directly, view it on GitHub >> <#6 (comment) >, >> or mute the thread >> < https://github.com/notifications/unsubscribe-auth/ALSRmxGoN2Pt0P66yG8HGMBjy5XdRhXGks5t3OqzgaJpZM4MEA-1 > >> . >> > — You are receiving this because you commented. Reply to this email directly, view it on GitHub <#6 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AlRARFx6vNjU2OqNOaD9rKNX0snrtPnkks5t3PV-gaJpZM4MEA-1> .

Answer 8 · 2018-08-05T15:55:00.000Z

when i run this python resize_input.py --input JAMSHEDPUR00.text --output output1 --trim 50
after this output is like this why it is so
0 sentences trimmed out of 0 total sentences

Answer 9 · 2018-08-05T16:43:20.000Z

Iam also getting this error
python hindi_util.py --format ssf --input hindi_ssf_corpus.txt --dist 3
Traceback (most recent call last):
File "hindi_util.py", line 14, in
assert len(args.dist) == 3
AssertionError

Answer 10 · 2018-08-05T17:21:44.000Z

when i run this command python hindi_util.py --format ssf --input hindi_ssf_corpus.txt
then it will not give any error but
giving file which has no data

Answer 11 · 2018-08-05T18:32:06.000Z

python2 hindi_util.py --format text --input fullnews_id_
Traceback (most recent call last):
File "hindi_util.py", line 12, in
open('hin.text', 'w').write(wxc.convert(open(args.input).read()))
File "/home/kusum/.local/lib/python2.7/site-packages/wxconv/wx_format.py", line 226, in convert
return self.transform(line)
File "/home/kusum/.local/lib/python2.7/site-packages/wxconv/wx.py", line 2922, in utf2wx
unicode_ = self.normalize(unicode_)
File "/home/kusum/.local/lib/python2.7/site-packages/wxconv/wx.py", line 1798, in normalize
text = text.replace('\uFEFF', '') # BYTE_ORDER_MARK
UnicodeDecodeError: 'ascii' codec can't decode byte 0xe0 in position 20: ordinal not in range(128)