keyboardAnt

AI Researcher & Engineer | LLMs

NYC

Pinned Repositories

bigcode-evaluation-harness
A framework for the evaluation of autoregressive code generation language models.
Language: Python 0 0 01
Decoupling-Gating-From-Linearity
Language: Jupyter Notebook 1 0 00
deep-learning-research-lab
A distributed computing infrastructure for simulations, model training & fine-tuning.
Language: Python 3 1 02
distributed-speculative-inference
The fastest off-the-shelf inference algorithm for LLMs (ICLR’25)
Language: Python 4 3 91
embedding-projector-standalone
Language: HTML 1 0 00
keyboardAnt.github.io
Personal blog about research, machine learning, deep learning, data science and software engineering.
Language: CSS 2 0 00
nn-mem-vision
An infrastructure for empirical research: Studying the limits of neural networks for computer vision tasks.
Language: Jupyter Notebook 1 1 00
nnlib
PyTorch-based NN modules frequently used in various projects.
Language: Python 1 0 00
wis-glucose-challenge
This is the 1st Place Winner (1 out of 45) in a data science competition led by Prof. Eran Segal on time-series forecasting via neural networks (Kaggle-like).
Language: Python 1 1 00

keyboardAnt's Repositories

keyboardAnt/distributed-speculative-inference
The fastest off-the-shelf inference algorithm for LLMs (ICLR’25)
Language: Python 4 3 91
keyboardAnt/deep-learning-research-lab
A distributed computing infrastructure for simulations, model training & fine-tuning.
Language: Python 3 1 02
keyboardAnt/keyboardAnt.github.io
Personal blog about research, machine learning, deep learning, data science and software engineering.
Language: CSS 2 0 00
keyboardAnt/Decoupling-Gating-From-Linearity
Language: Jupyter Notebook 1 0 00
keyboardAnt/nn-mem-vision
An infrastructure for empirical research: Studying the limits of neural networks for computer vision tasks.
Language: Jupyter Notebook 1 1 00
keyboardAnt/optimal-stopping-game
A game of probability & optimization simulating Optimal Stopping Problem variants. Supports multiple agents with different policies.
Language: Python 1 1 01
keyboardAnt/pilev2
Language: Python 1 0 01
keyboardAnt/simpleaichat
Python package for easily interfacing with chat apps, with robust features and minimal code complexity.
Language: Python 1 0 00
keyboardAnt/wis-glucose-challenge
This is the 1st Place Winner (1 out of 45) in a data science competition led by Prof. Eran Segal on time-series forecasting via neural networks (Kaggle-like).
Language: Python 1 1 00
keyboardAnt/bigcode-evaluation-harness
A framework for the evaluation of autoregressive code generation language models.
Language: Python 0 0 01
keyboardAnt/ai4code_repair_workshop
AI4Code syntax repair tutorial for IAP 2023
Language: Jupyter Notebook 0 0
keyboardAnt/blog
Public repo for HF blog posts
keyboardAnt/containerapps-albumapi-python
Container Apps Quickstart: Python Album API
Language: Python 0 0
keyboardAnt/datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Language: Python 0 0
keyboardAnt/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Language: Python 0 0
keyboardAnt/detecting-fake-text
Giant Language Model Test Room
Language: TypeScript 0 0
keyboardAnt/FinRL
FinRL: Financial Reinforcement Learning. 🔥
keyboardAnt/FinRL-Tutorials
Tutorials. Please star.
keyboardAnt/GPTQ-for-SantaCoder
4-bit quantization of SantaCoder using GPTQ
Language: Python 0 0
keyboardAnt/hf-bench
Benchmark TTFT, TPOT, T/s, Speedup
Language: Python
keyboardAnt/huggingface-demos
Personal demos using Hugging Face 🤗 tools
keyboardAnt/Implicit-Regularization-Towards-Rank-Minimization-in-ReLU-Networks--presentation
Slides to present our research paper on the foundations of deep learning, “Implicit Regularization Towards Rank Minimization in ReLU Networks”.
Language: TeX 1 0
keyboardAnt/keyboardAnt
1 0
keyboardAnt/openai-cookbook
Examples and guides for using the OpenAI API
Language: MDX 0 0
keyboardAnt/optimum-benchmark
A repository for benchmarking HF Optimum's optimizations for inference and training.
Language: Python 0 0
keyboardAnt/picoGPT
An unnecessarily tiny implementation of GPT-2 in NumPy.
Language: Python 0 0
keyboardAnt/smartbugs-curated
SB Curated is a curated dataset of Solidity smart contracts annotated with tagged vulnerabilities. The dataset was created to evaluate the accuracy of automated analysis tools.
Language: Solidity 0 0
keyboardAnt/speculative-sampling
Simple implementation of Speculative Sampling in NumPy for GPT-2.
Language: Jupyter Notebook 0 0
keyboardAnt/TheAgentCompany
An agent benchmark with tasks in a simulated software company.
keyboardAnt/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Language: Python