Pinned Repositories
accelerate-deepspeed-test
Testing DeepSpeed integration in 🤗 Accelerate
accelerate-megatron-test
Testing the accelerate megatron integration
Extractive-Text-Summarization-Using-Neural-Networks
Extractive Text Summarization Using Neural Networks approaches: CNN, RNN and Linear-SVM
LLM-Workshop
LLM Workshop by Sourab Mangrulkar
Machine_Learning
Machine Learning models for Regression Tasks and Classification Tasks.
mlc-llm
Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.
openhathi_instruct
This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resulting model is meant to follow instructions and chat in Hindi and Hinglish.
peft-codegen-25
Registration_App
This is an android app for events registration which was used for the NIT-GOA's cultural fest Raag.
pacman100's Repositories
pacman100/LLM-Workshop
LLM Workshop by Sourab Mangrulkar
pacman100/peft-codegen-25
pacman100/openhathi_instruct
This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resulting model is meant to follow instructions and chat in Hindi and Hinglish.
pacman100/mlc-llm
Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.
pacman100/peft-sd-webui-additional-networks
pacman100/acm_winter_school_gen_ai
Code for the hands-on projects for the students attending the ACM India Winter School 2023
pacman100/devika
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI.
pacman100/accelerate
🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
pacman100/blog
Public repo for HF blog posts
pacman100/gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
pacman100/llama-recipes
Examples and recipes for Llama 2 model
pacman100/pacman100
pacman100/pacman100.github.io
pacman100/peft-dreambooth
pacman100/peft-dreambooth-ui
An UI for Dreambooth training and inference using PEFT - Minimal, Easy to use and Adaptable.
pacman100/peft-whisper-large
pacman100/ram_efficient_fsdp
pacman100/twinny
The most no-nonsense locally hosted (or API hosted) AI code completion plugin for Visual Studio Code, like GitHub Copilot but 100% free and 100% private.
pacman100/bitsandbytes
8-bit CUDA functions for PyTorch
pacman100/chat-ui
Open source codebase powering the HuggingChat app
pacman100/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
pacman100/easyllm
pacman100/hf_fsdp
A very basic fsdp wrapper for HF.
pacman100/huggingface-vscode
Code completion VSCode extension for OSS models
pacman100/llama-mistral
Inference code for Mistral and Mixtral hacked up into original Llama implementation
pacman100/llama.cpp
LLM inference in C/C++
pacman100/llama2.c
Inference Llama 2 in one file of pure C
pacman100/llm.c
LLM training in simple, raw C/CUDA
pacman100/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
pacman100/trl
Train transformer language models with reinforcement learning.