cbaziotis/ekphrasis

Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).

PythonMIT

Issues

Please add a LICENSE to this repo
#21 opened 2 years ago by napsternxg
0
MUISTI!
#39 opened 2 years ago by NMDMaria
0
tokenizing '20th' to '2','0','th'
#30 opened 3 years ago by KavishBhatia
1
she's --> she ' s
#8 opened 6 years ago by CarloSegat
3
Word Statistics File not Found. | Receiving 404 error while dowloading the file.
#28 opened 3 years ago by imVParashar
17
Word statistics not found.....How can I solve this error?
#29 opened 3 years ago by masterbo98
3
how to get the word statistics?
#31 opened 3 years ago by UGUESS-lzx
2
Updation of url : https://www.dropbox.com/s/a84otqrg6u1c5je/stats.zip?dl=1 required
#11 opened 3 years ago by devamanyu
9
How can the text_processor be parelize?
#27 opened 4 years ago by danielafe7-usp
0
"maximum recursion depth exceeded" Error
#26 opened 4 years ago by mjag7682
1
Can Ekphrasis be used in other languages?
#25 opened 4 years ago by shuningge
1
Remove one character entities on slang dictionary
#22 opened 5 years ago by daviddias99
0
spelling correction mostly is not working
#20 opened 5 years ago by stas00
0
what's wrong with "Word statistics files not found!"
#4 opened 7 years ago by lvjianwei123
5
Segmentation: Preserve case?
#19 opened 5 years ago by davidbernat
0
Do you exposure your underlying language model for uni/bigrams?
#18 opened 5 years ago by davidbernat
0
urllib.error.HTTPError: HTTP Error 429: Too Many Requests
#16 opened 5 years ago by ab4all
2
The TextPreProcessor class only supports segmenting text with hastags. Required support for normal text segmenter.
#15 opened 5 years ago by aman5319
0
Log messages print to stdout
#2 opened 6 years ago by ckingdev
1
Add tests for regexes
#1 opened 6 years ago by cbaziotis
0
Getting URLError: <urlopen error [Errno 60] Operation timed out>
#13 opened 6 years ago by BlaBlaPer
2
Memory usage
#9 opened 6 years ago by xro7
1
Failed during generate_stats.py
#12 opened 6 years ago by JingLiJJ
0
Spell corrector in other languages
#10 opened 6 years ago by al-jwarizmi
0
Ekphrasis downloads statistics in /usr/local
#3 opened 6 years ago by georgepar
2
Installing from pypi doesn't pull in deps
#5 opened 6 years ago by ckingdev
2
Warning regarding using TextPreProcessor as a preprocessing for torchtext.data.Field()
#7 opened 6 years ago by davidalbertonogueira
1
extracting url
#6 opened 6 years ago by kishore0905
0