wikipedia2vec/wikipedia2vec

A tool for learning vector representations of words and entities from Wikipedia

PythonNOASSERTION

Issues

BufferError after training embeddings
#79 opened 6 months ago by ScholliYT
2
Mapping with entities
#83 opened 7 months ago by giuspillo
0
API?
#82 opened 8 months ago by thistlillo
1
Wikipedia2Vec embedding training RAM requirements ?
#80 opened a year ago by deter3
0
There was a problem loading the word vector binary file trained by myself
#78 opened 2 years ago by TytTest
0
How to use build-mention-db output
#75 opened 2 years ago by lucidviews
0
lmdb.Error: mdb_txn_commit: Input/output error when trying to train on dump file.
#74 opened 2 years ago by lucidviews
3
Exploring the database - what type of database
#70 opened 3 years ago by Filco306
5
Hello all i am getting this unicode error surrogate thing. i m a new user and have no idea what this is. i use this code everything month and never had this issue before. plz help!!! this is a client deliverable and i am stressing!!!
#73 opened 2 years ago by skhurram3108
2
Embeddings exist for entities that do not have pages in Wikipedia
#57 opened 4 years ago by katlap
5
Pip installation failed
#69 opened 3 years ago by tonywang531
2
UnicodeEncodeError: 'utf-8' codec can't encode characters in position 2-3: surrogates not allowed
#68 opened 3 years ago by Funsom
1
Error for training with --case-sensitive True
#67 opened 3 years ago by debraj135
5
Throws binary incompatability error when importing on python 3.8
#66 opened 3 years ago by sorenmulli
4
Fuzzy Matching
#65 opened 3 years ago by g-luo
1
Wikipedia2Vec.load() KeyError
#64 opened 3 years ago by nihaldsouza
4
Training on custom dataset
#63 opened 3 years ago by vivekkalyan
2
wiki2vec.get_entity_vector
#62 opened 4 years ago by Phelan164
4
How to add another tokenizer?
#61 opened 4 years ago by Kyubyong
3
Embedding for <unknown> token.
#60 opened 4 years ago by sinhprous
1
progressively updating model with new text?
#59 opened 4 years ago by katlap
1
Entity detector model on custom dataset
#58 opened 4 years ago by anjalibhavan
1
New dump for 2019/2020
#56 opened 4 years ago by lbozarth
1
Code for Named Entity Disambiguation
#55 opened 4 years ago by vardaan123
2
Is it possible to do something like 'infer vector' given a document not in the data?
#53 opened 4 years ago by youssefavx
5
KeyError when trying get_entity_vector on some Wikipedia titles
#54 opened 4 years ago by katlap
1
How to get most similar items to added/subtracted vectors?
#51 opened 4 years ago by youssefavx
2
Availability Wiki dump 20-04-2018 (dd-mm-yyyy)
#50 opened 4 years ago by mickvanhulst
1
Parsing disambiguation page
#44 opened 5 years ago by EternalMoment
1
Out-of-vocabulary words
#45 opened 5 years ago by DeepInEvil
1
can not build dump database
#49 opened 5 years ago by Lavine24
4
What Japanese text pre-processing method is used?
#46 opened 5 years ago by emadg
1
"crosslingual-map" branch, seems to be missing "langlink.txt" file
#48 opened 5 years ago by Lavine24
0
Cant not download the Chinese pretrained model for 300-dim text model.
#47 opened 5 years ago by Lavine24
2
training the same model multiple times?
#43 opened 5 years ago by nickcastro
1
how can the result file saved by command save_text --format word2vec loaded by word2vec?
#40 opened 5 years ago by SnowPi
2
ModuleNotFoundError: No module named 'wikipedia2vec.dictionary'
#41 opened 5 years ago by AMParanoid
8
category flag fails to filter all category pages
#36 opened 5 years ago by EternalMoment
2
wikipedia id other than title
#35 opened 5 years ago by EternalMoment
4
suspicious / in title containing '
#37 opened 5 years ago by EternalMoment
2
suspicious leading and trailing space in title
#38 opened 5 years ago by EternalMoment
2
Loading pretrained model failed
#34 opened 5 years ago by MathildaSu
6
Command not found after installation
#33 opened 5 years ago by pwecar
1
Difference between get_word_vector() and get_entity_vector()
#32 opened 5 years ago by iknoorjobs
1
Entity Extraction using Wikipedia2Vec
#31 opened 5 years ago by iknoorjobs
1
IndexError when training
#30 opened 5 years ago by pazzo83
6
How to extract entities from text using Wikipedia2Vec
#29 opened 5 years ago by iknoorjobs
3
Delete the token '\n' (and '\r' if any) in the txt format.
#27 opened 5 years ago by cloudyyyyy
7
Where are the anchor context model words?
#28 opened 5 years ago by andrejzg
1
wiki id for wiki entities
#25 opened 5 years ago by wadhwasahil
1