mayank-soni's Stars
pytorch/captum
Model interpretability and understanding for PyTorch
stanford-crfm/helm
Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image models in HEIM (https://arxiv.org/abs/2311.04287) and vision-language models in VHELM (https://arxiv.org/abs/2410.07112).
aiverify-foundation/LLM-Evals-Catalogue
This repository stems from our paper, “Cataloguing LLM Evaluations”, and serves as a living, collaborative catalogue of LLM evaluation frameworks, benchmarks and papers.
sarthfrey/leetcode-course
A guide to crushing tech interviews.
smgstudio/risk-dice
Dice code used in RISK: Global Domination
matbahasa/TALPCo
TUFS Asian Language Parallel Corpus
Farama-Foundation/Minigrid
Simple and easily configurable grid world environments for reinforcement learning
podondra/gym-gridworlds
Gridworld environments for OpenAI gym.
dair-ai/ML-Papers-of-the-Week
🔥Highlighting the top ML papers every week.
sarnthil/unify-emotion-datasets
A Survey and Experiments on Annotated Corpora for Emotion Classification in Text
openai/openai-cookbook
Examples and guides for using the OpenAI API
dair-ai/Prompt-Engineering-Guide
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
ggerganov/llama.cpp
LLM inference in C/C++
nomic-ai/gpt4all
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
declare-lab/conv-emotion
This repo contains implementation of different architectures for emotion recognition in conversations.
ray-project/llm-numbers
Numbers every LLM developer should know
mayank-soni/GEMBA
GEMBA — GPT Estimation Metric Based Assessment
RUCAIBox/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".