PaschalisAg

Quantifying language, plotting it, looking at my creations, and shouting "it's alive!"

Donostia International Physics Center (DIPC)Donostia-San Sebastian, Basque Country

PaschalisAg's Stars

brightmart/text_classification
all kinds of text classification models and more with deep learning
Language:Python7.9k2.6k
kk7nc/Text_Classification
Text Classification Algorithms: A Survey
Language:Python1.8k544
peng-yiwen/WiKC
A cleaned version of Wikidata taxonomy - Refined using Large Language Models
Language:HTML5
jwngr/sdow
Six Degrees of Wikipedia
Language:TypeScript1.8k92
kavgan/nlp-in-practice
Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.
Language:Jupyter Notebook1.2k793
Grasia/wiki-scripts
Miscellaneous scripts to gather and process data of wikis.
Language:Jupyter Notebook2211
optuna/optuna
A hyperparameter optimization framework
Language:Python11.1k1.1k
mantasu/cs231n
Shortest solutions for CS231n 2021-2024
Language:Jupyter Notebook27663
diffbot/knowledge-net
KnowledgeNet: A Benchmark Dataset for Knowledge Base Population
Language:Python26335
ericmjl/Network-Analysis-Made-Simple
An introduction to network analysis and applied graph theory using Python and NetworkX
Language:Jupyter Notebook1k401
alonnir/snacks
Snack size awesome list for Social Network Analysis resources
Language:Dockerfile274
maxpumperla/hyperas
Keras + Hyperopt: A very simple wrapper for convenient hyperparameter optimization
Language:Python2.2k318
stanford-oval/WikiChat
WikiChat is an improved RAG. It stops the hallucination of large language models by retrieving data from a corpus.
Language:Python1.2k108
PieterBeullens/medtrans_stylo
Files for stylometric analysis of medieval translators
Language:Jupyter Notebook2
AlexMoreo/diff-vectors
Diff-Vectors for Authorship Analysis
Language:Python5
SupervisedStylometry/SuperStyl
Supervised Stylometry
Language:Python215
urchade/GLiNER
Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024
Language:Python1.6k141
attardi/wikiextractor
A tool for extracting plain text from Wikipedia dumps
Language:Python3.8k967
sknetwork-team/scikit-network
Graph Algorithms
Language:Python61365
aditya-grover/node2vec
Language:Scala2.6k914
josh-ashkinaze/Normalized-Google-Distance
A python script to calculate normalized google distance (NGD). This is a semantic similarity metric based on Google search results
Language:Python166
qcrit/DSH-2018-LatinProseVerse
Replication code for Chaudhuri et al., "A small set of stylometric features differentiates Latin prose and verse," Digital Scholarship in the Humanities 2018
Language:JavaScript32
tesserae/tesserae
The Tesserae project aims to provide a flexible and robust web interface for exploring intertextual parallels. Select two poems below to see a list of lines sharing two or more words (regardless of inflectional changes).
Language:PHP2923
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
Language:Python9.3k874
stanford-oval/wikidata-emnlp23
WikiSP, a semantic parser for Wikidata. WikiWebQuestions, a SPARQL-annotated dataset on Wikidata
Language:Python848
MaartenGr/PolyFuzz
Fuzzy string matching, grouping, and evaluation.
Language:Python75268
keithmcnulty/ona_book
Handbook of Graphs and Networks in People Analytics
Language:TeX11430
CambridgeUniversityPress/FirstCourseNetworkScience
Tutorials, datasets, and other material associated with textbook "A First Course in Network Science" by Menczer, Fortunato & Davis
Language:Jupyter Notebook371180
practical-nlp/practical-nlp-code
Official Repository for Code associated with 'Practical Natural Language Processing' book by O'Reilly Media
Language:Jupyter Notebook1.3k614
kasparvonbeelen/ghi_python
Programming for Historians
Language:Jupyter Notebook151

PaschalisAg

PaschalisAg's Stars

brightmart/text_classification

kk7nc/Text_Classification

peng-yiwen/WiKC

jwngr/sdow

kavgan/nlp-in-practice

Grasia/wiki-scripts

optuna/optuna

mantasu/cs231n

diffbot/knowledge-net

ericmjl/Network-Analysis-Made-Simple

alonnir/snacks

maxpumperla/hyperas

stanford-oval/WikiChat

PieterBeullens/medtrans_stylo

AlexMoreo/diff-vectors

SupervisedStylometry/SuperStyl

urchade/GLiNER

attardi/wikiextractor

sknetwork-team/scikit-network

aditya-grover/node2vec

josh-ashkinaze/Normalized-Google-Distance

qcrit/DSH-2018-LatinProseVerse

tesserae/tesserae

karpathy/minbpe

stanford-oval/wikidata-emnlp23

MaartenGr/PolyFuzz

keithmcnulty/ona_book

CambridgeUniversityPress/FirstCourseNetworkScience

practical-nlp/practical-nlp-code

kasparvonbeelen/ghi_python