Pinned Repositories
2022SegmentationST
SIGMORPHON 2022 Shared Task on Morpheme Segmentation
academic
Jekyll theme with a focus on simplicity, typography and flexibility
adapter-transformers
Huggingface Transformers + Adapters = ❤️
Adversarial_Video_Generation
A TensorFlow Implementation of "Deep Multi-Scale Video Prediction Beyond Mean Square Error" by Mathieu, Couprie & LeCun.
agency-jekyll-theme
Agency Theme for Jekyll
amber-data-prep
Data preparation code for Amber 7B LLM
data2vec
goemotions
minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
MaveriQ's Repositories
MaveriQ/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
MaveriQ/agency-jekyll-theme
Agency Theme for Jekyll
MaveriQ/amber-data-prep
Data preparation code for Amber 7B LLM
MaveriQ/goemotions
MaveriQ/MicroLlama
This is a 300M MicroLlama version of TinyLlama
MaveriQ/benchmark
MaveriQ/creative-jekyll-theme
MaveriQ/dolma
Data and tools for generating and inspecting OLMo pre-training data.
MaveriQ/flota
MaveriQ/jekyll-theme-neumorphism
Neumorphism designed Jekyll theme for personal websites, portfolios and resumes.
MaveriQ/langchain-chatbot-demo
Examples of chatbot implementations with Langchain and Streamlit
MaveriQ/linggpt
Pretrain, finetune, deploy 20+ LLMs on your own data. Uses state-of-the-art techniques: flash attention, FSDP, 4-bit, LoRA, and more.
MaveriQ/LLaMA-Efficient-Tuning
Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, ChatGLM2)
MaveriQ/lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
MaveriQ/minbpe_spark_gcp
Implementing MinBPE training on GCP DataProc (serverless spark on GCP)
MaveriQ/MobiLlama
MobiLlama : Small Language Model tailored for edge devices
MaveriQ/OLMo
Modeling, training, eval, and inference code for OLMo
MaveriQ/pandas-ai
PandasAI is the Python library that integrates Gen AI into pandas, making data analysis conversational
MaveriQ/paralegal
Streamit app with langchain and huggingface
MaveriQ/promptbench
A unified evaluation framework for large language models
MaveriQ/promptsource
Toolkit for creating, sharing and using natural language prompts.
MaveriQ/python-package-template
A template repo for Python packages from AllenAI
MaveriQ/sentence-transformers
Multilingual Sentence & Image Embeddings with BERT
MaveriQ/spacyface
Align the token outputs from Spacy and Huggingface to help understand what language structures transformers see
MaveriQ/sql-eval
Evaluate the accuracy of LLM generated outputs
MaveriQ/Streamlit-Authenticator
A secure authentication module to validate user credentials in a Streamlit application.
MaveriQ/tiktokenizer
Online playground for OpenAPI tokenizers
MaveriQ/useb
Heterogenous, Task- and Domain-Specific Benchmark for Unsupervised Sentence Embeddings used in the TSDAE paper: https://arxiv.org/abs/2104.06979.
MaveriQ/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
MaveriQ/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.