jialin-yu's Stars
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
hiyouga/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
uclanlp/awesome-fairness-papers
Papers on fairness in NLP
opendilab/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
OAI/OpenAPI-Specification
The OpenAPI Specification Repository
fastapi/fastapi
FastAPI framework, high performance, easy to learn, fast to code, ready for production
bradyneal/realcause
Realistic benchmark for different causal inference methods. The realism comes from fitting generative models to data with an assumed causal structure.
microsoft/LightGBM
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
AliciaCurth/CATENets
Sklearn-style implementations of Neural Network-based Conditional Average Treatment Effect (CATE) Estimators.
kotartemiy/pygooglenews
If Google News had a Python library
rpryzant/causal-bert-pytorch
PyTorch implementation of "Adapting Text Embeddings for Causal Inference"
singlasahil14/salient_imagenet
Code for the ICLR 2022 paper "Salient ImageNet: How to discover spurious features in deep learning?"
TransformerLensOrg/CircuitsVis
Mechanistic Interpretability Visualizations using React
statsmodels/statsmodels
Statsmodels: statistical modeling and econometrics in Python
scipy/scipy
SciPy library main repository
bigscience-workshop/petals
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
bigscience-workshop/promptsource
Toolkit for creating, sharing and using natural language prompts.
rspeer/python-ftfy
Fixes mojibake and other glitches in Unicode text, after the fact.
streamlit/streamlit
Streamlit — A faster way to build and share data apps.
QuivrHQ/quivr
Open-source RAG Framework for building GenAI Second Brains 🧠 Build a productivity assistant (RAG) ⚡️🤖 Chat with your docs (PDF, CSV, ...) & apps using Langchain, GPT 3.5 / 4 turbo, Private, Anthropic, VertexAI, Ollama, LLMs, Groq that you can share with users! Efficient retrieval augmented generation framework
pathwaycom/llm-app
Dynamic RAG for enterprise. Ready to run with Docker, ⚡ in sync with Sharepoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, and more.
Mintplex-Labs/anything-llm
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.
AakashKumarNain/annotated_research_papers
This repo contains annotated research papers that I found really good and useful
deepfakes/faceswap
Deepfakes Software For All
microsoft/ML-For-Beginners
12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
fiddler-labs/fiddler-auditor
Fiddler Auditor is a tool to evaluate language models.
togethercomputer/RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
huggingface/trl
Train transformer language models with reinforcement learning.
BasisResearch/chirho
An experimental language for causal reasoning
anthropics/hh-rlhf
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"