rin2401's Stars
massgravel/Microsoft-Activation-Scripts
Open-source Windows and Office activator featuring HWID, Ohook, KMS38, and Online KMS activation methods, along with advanced troubleshooting.
hwchase17/langchain
⚡ Building applications with LLMs through composability ⚡
geekan/HowToLiveLonger
程序员延寿指南 | A programmer's guide to live longer
openai/chatgpt-retrieval-plugin
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
karpathy/minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
databrickslabs/dolly
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
google-research/text-to-text-transfer-transformer
Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
togethercomputer/RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
Instruction-Tuning-with-GPT-4/GPT-4-LLM
Instruction Tuning with GPT-4
project-baize/baize-chatbot
Let ChatGPT teach your own chatbot in hours with a single GPU!
LDNOOBW/List-of-Dirty-Naughty-Obscene-and-Otherwise-Bad-Words
List of Dirty, Naughty, Obscene, and Otherwise Bad Words
emeryberger/CSrankings
A web app for ranking computer science departments according to their research output in selective venues, and for finding active faculty across a wide range of areas.
saffsd/langid.py
Stand-alone language identification system
chiphuyen/lazynlp
Library to scrape and clean web pages to create massive datasets.
agiresearch/OpenAGI
OpenAGI: When LLM Meets Domain Experts
GoogleCloudPlatform/ml-design-patterns
Source code accompanying O'Reilly book: Machine Learning Design Patterns
domeccleston/sharegpt
Easily share permanent links to ChatGPT conversations with your friends
mckaywrigley/clarity-ai
A simple Perplexity AI clone.
kakaobrain/kogpt
KakaoBrain KoGPT (Korean Generative Pre-trained Transformer)
bigscience-workshop/bigscience
Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.
miso-belica/jusText
Heuristic based boilerplate removal tool
commoncrawl/cc-pyspark
Process Common Crawl data with Python and Spark
bigcode-project/bigcode-dataset
bigscience-workshop/data-preparation
Code used for sourcing and cleaning the BigScience ROOTS corpus
alexandres/terashuf
terashuf shuffles multi-terabyte text files using limited memory
commoncrawl/cc-index-table
Index Common Crawl archives in tabular format
EleutherAI/openwebtext2
bigscience-workshop/data_tooling
Tools for managing datasets for governance and training.
bigscience-workshop/catalogue_data
Scripts to prepare catalogue data