aisyahrzk's Stars
ItzCrazyKns/Perplexica
Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI
mcinglis/c-style
My favorite C programming practices.
loganwatchorn/notes-pmpp
Notes on "Programming Massively Parallel Processors" by Hwu, Kirk, and Hajj (4th ed.)
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
visual-layer/fastdup
fastdup is a powerful, free tool designed to rapidly generate valuable insights from image and video datasets. It helps enhance the quality of both images and labels, while significantly reducing data operation costs, all with unmatched scalability.
amazon-science/chronos-forecasting
Chronos: Pretrained (Language) Models for Probabilistic Time Series Forecasting
riley-x/SankeyFlow
Python package for creating Sankey flow diagrams in Matplotlib
godkingjay/selenium-twitter-scraper
This is a Twitter Scraper which uses Selenium for scraping tweets. It is capable of scraping tweets from home, user profile, hashtag, query or search, and advanced searches.
HazyResearch/ThunderKittens
Tile primitives for speedy kernels
machinelearningzuu/RAG-University
Welcome to the RAG University repository! This repository contains code implementations for Retrieval-Augmented Generation (RAG) models, specifically designed for Language Model (LM) tasks. RAG models combine the strengths of both retrieval and generation approaches, enhancing the capabilities of LLMs.
mlfoundations/open_clip
An open source implementation of CLIP.
mesolitica/whisper-static-cache
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
sucv/paperCrawler
This is a Scrapy-based web-spider. It scrapes papers from TOP conferences and journals.
awesome-selfhosted/awesome-selfhosted
A list of Free Software network services and web applications which can be hosted on your own servers
VikParuchuri/surya
OCR, layout analysis, reading order, table recognition in 90+ languages
unitaryai/detoxify
Trained models & code to predict toxic comments on all 3 Jigsaw Toxic Comment Challenges. Built using ⚡ Pytorch Lightning and 🤗 Transformers. For access to our API, please email us at contact@unitary.ai.
snguyenthanh/better_profanity
Blazingly fast cleaning swear words (and their leetspeak) in strings
IST-DASLab/marlin
FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.
mlabonne/llm-datasets
High-quality datasets, tools, and concepts for LLM fine-tuning.
apple/corenet
CoreNet: A library for training deep neural networks
unslothai/unsloth
Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
adam-maj/tiny-gpu
A minimal GPU design in Verilog to learn how GPUs work from the ground up
andyzoujm/representation-engineering
Representation Engineering: A Top-Down Approach to AI Transparency
BobMcDear/attorch
A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.
tinygrad/open-gpu-kernel-modules
NVIDIA Linux open GPU with P2P support
gkamradt/LLMTest_NeedleInAHaystack
Doing simple retrieval from LLM models at various context lengths to measure accuracy
hewliyang/waze-traffic-api
Unofficial Python SDK for reverse engineered Waze APIs
huggingface/parler-tts
Inference and training library for high-quality TTS models.
AlbughdadiM/sentinel2-explorer
A repo gathering tools to search, download and process Sentinel-2 images using GCP services
McGill-NLP/llm2vec
Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'