miguel-kjh
Software Engineer - Deep Learning Researcher - PhD student
SIANI
Las Palmas de Gran Canarias, Spain
miguel-kjh's Stars
rasbt/LLM-workshop-2024
A 4-hour coding workshop to understand how LLMs are implemented and used
Unity-Technologies/ml-agents
The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.
vwxyzjn/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
allenai/RL4LMs
A modular RL library to fine-tune language models to human preferences
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
stanford-oval/storm
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
lucasxlu/HMTNet
Official PyTorch implementation of paper <Hierarchical Multi-task Network For Race, Gender and Facial Attractiveness Recognition> (IEEE International Conference on Image Processing (ICIP) 2019)
frederikme/TinderBotz
Automated Tinder bot and scraper using Selenium in Python.
yt-dlp/yt-dlp
A feature-rich command-line audio/video downloader
glgh/awesome-llm-human-preference-datasets
A curated list of Human Preference Datasets for LLM fine-tuning, RLHF, and eval.
jayroxis/CKA-similarity
A NumPy and PyTorch implementation of CKA-similarity with CUDA support
santacml/nn_pruning_uniqueness
Prune a model while finetuning or training.
xai-org/grok-1
Grok open release
GoodAI/goodai-ltm-benchmark
A library for benchmarking the long-term memory and continual learning capabilities of LLM-based agents, with all the tests and code you need to evaluate your own agents. See more in the blogpost:
rasbt/dora-from-scratch
LoRA and DoRA from Scratch Implementations
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
THUDM/AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
penghao-wu/vstar
PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"
TransformerLensOrg/TransformerLens
A library for mechanistic interpretability of GPT-style language models
callummcdougall/ARENA_3.0
TransformerLensOrg/CircuitsVis
Mechanistic Interpretability Visualizations using React
redwoodresearch/rust_circuit_public
yule-BUAA/MergeLM
Codebase for Merging Language Models (ICML 2024)
hannamw/gpt2-greater-than
Code release for the NeurIPS 2023 paper "How does GPT-2 compute greater-than?: Interpreting mathematical abilities in a pre-trained language model"
HoagyC/sparse_coding
Using sparse coding to find distributed representations used by neural networks.
dvmazur/mixtral-offloading
Run Mixtral-8x7B models in Colab or consumer desktops
state-spaces/mamba
Mamba SSM architecture
JShollaj/awesome-llm-interpretability
A curated list of Large Language Model (LLM) Interpretability resources.
rasbt/LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step