wikipedia2vec/wikipedia2vec
A tool for learning vector representations of words and entities from Wikipedia
PythonNOASSERTION
Issues
- 2
BufferError after training embeddings
#79 opened by ScholliYT - 0
Mapping with entities
#83 opened by giuspillo - 1
API?
#82 opened by thistlillo - 0
- 0
- 0
How to use build-mention-db output
#75 opened by lucidviews - 3
lmdb.Error: mdb_txn_commit: Input/output error when trying to train on dump file.
#74 opened by lucidviews - 5
Exploring the database - what type of database
#70 opened by Filco306 - 2
Hello all i am getting this unicode error surrogate thing. i m a new user and have no idea what this is. i use this code everything month and never had this issue before. plz help!!! this is a client deliverable and i am stressing!!!
#73 opened by skhurram3108 - 5
- 2
Pip installation failed
#69 opened by tonywang531 - 1
UnicodeEncodeError: 'utf-8' codec can't encode characters in position 2-3: surrogates not allowed
#68 opened by Funsom - 5
Error for training with --case-sensitive True
#67 opened by debraj135 - 4
- 1
Fuzzy Matching
#65 opened by g-luo - 4
Wikipedia2Vec.load() KeyError
#64 opened by nihaldsouza - 2
Training on custom dataset
#63 opened by vivekkalyan - 4
wiki2vec.get_entity_vector
#62 opened by Phelan164 - 3
How to add another tokenizer?
#61 opened by Kyubyong - 1
Embedding for <unknown> token.
#60 opened by sinhprous - 1
progressively updating model with new text?
#59 opened by katlap - 1
Entity detector model on custom dataset
#58 opened by anjalibhavan - 1
New dump for 2019/2020
#56 opened by lbozarth - 2
Code for Named Entity Disambiguation
#55 opened by vardaan123 - 5
Is it possible to do something like 'infer vector' given a document not in the data?
#53 opened by youssefavx - 1
- 2
- 1
- 1
Parsing disambiguation page
#44 opened by EternalMoment - 1
Out-of-vocabulary words
#45 opened by DeepInEvil - 4
can not build dump database
#49 opened by Lavine24 - 1
What Japanese text pre-processing method is used?
#46 opened by emadg - 0
- 2
- 1
training the same model multiple times?
#43 opened by nickcastro - 2
how can the result file saved by command save_text --format word2vec loaded by word2vec?
#40 opened by SnowPi - 8
- 2
- 4
wikipedia id other than title
#35 opened by EternalMoment - 2
suspicious / in title containing '
#37 opened by EternalMoment - 2
- 6
Loading pretrained model failed
#34 opened by MathildaSu - 1
Command not found after installation
#33 opened by pwecar - 1
- 1
Entity Extraction using Wikipedia2Vec
#31 opened by iknoorjobs - 6
IndexError when training
#30 opened by pazzo83 - 3
- 7
- 1
Where are the anchor context model words?
#28 opened by andrejzg - 1
wiki id for wiki entities
#25 opened by wadhwasahil