Pinned Repositories
allennlp
An open-source NLP research library, built on PyTorch.
cv
Basic image filtering to backpropagation using the MNIST dataset (LeCun)
neural_nets
personalized-gen
Code, models, and data for "Personalized Text Generation with Fine-Grained Linguistic Control". EACL 2024, Personalization of Generative AI.
rllab
rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.
arabic-gec
Code, models, and data for "Advancements in Arabic Grammatical Error Detection and Correction: An Empirical Investigation". EMNLP 2023.
camel_tools
A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.
CAMeLBERT
Code and models for "The Interplay of Variant, Size, and Task Type in Arabic Pre-trained Language Models". EACL 2021, WANLP.
gender-reinflection
Code, models, and data for "Gender-Aware Reinflection using Linguistically Enhanced Neural Models". COLING 2020, GeBNLP.
gender-rewriting
Code, models, and data for "User-Centric Gender Rewriting". NAACL 2022.
balhafni's Repositories
balhafni/personalized-gen
Code, models, and data for "Personalized Text Generation with Fine-Grained Linguistic Control". EACL 2024, Personalization of Generative AI.
balhafni/neural_nets
balhafni/rllab
rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.
balhafni/allennlp
An open-source NLP research library, built on PyTorch.
balhafni/awkg
awkg is an awk-like text-processing tool powered by Python
balhafni/bert
TensorFlow code and pre-trained models for BERT
balhafni/camel_tools
A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.
balhafni/char-rnn
Multi-layer Recurrent Neural Networks (LSTM, GRU, RNN) for character-level language models in Torch
balhafni/csci_544_fa_18_slides
Slides from the fall 2018 version of CSCI 544
balhafni/data_mining
Spark/Scala code for processing huge datasets. This repo includes implementations of a recommendation system, k-means clustering, frequent itemsets, and streaming
balhafni/dlbook_notation
LaTeX files for the Deep Learning book notation
balhafni/drone_controller
Parrot ar-drone 2.0 controller with live streaming capabilities
balhafni/emnlp-2022
balhafni/EMNLP_2022
balhafni/errant
ERRor ANnotation Toolkit: Automatically extract and classify grammatical errors in parallel original and corrected sentences.
balhafni/etk
Extraction Toolkit
balhafni/EventStoryLine
Materials for the StoryLine extraction task - annotated data, baselines and evaluation scripts, evaluation data.
balhafni/m2scorer
MaxMatch (M^2) Scorer - Evaluation program for grammatical error correction systems.
balhafni/naacl_transfer_learning_tutorial
Repository of code for the NAACL tutorial on Transfer Learning in NLP
balhafni/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
balhafni/rltk
Record Linkage ToolKit
balhafni/sacreBLEU
Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons
balhafni/sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
balhafni/tech-interview-handbook
💯 Algorithms study materials, behavioral content and tips for rocking your coding interview
balhafni/tensor2tensor
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
balhafni/Theano
Theano is a Python library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. It can use GPUs and perform efficient symbolic differentiation.
balhafni/transformers
🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
balhafni/ulf-tokenizer
Tokenizer developed by Ulf Hermjakob @ USC ISI
balhafni/Web-Karma
Information Integration Tool
balhafni/xlnet
XLNet: Generalized Autoregressive Pretraining for Language Understanding