miguel-kjh

Software Engineer - Deep Learning Researcher - PhD student

SIANI, Las Palmas de Gran Canaria, Spain

miguel-kjh's Stars

rasbt/LLM-workshop-2024
A 4-hour coding workshop to understand how LLMs are implemented and used
12922
Unity-Technologies/ml-agents
The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.
Language: C# · 16.6k stars · 4.1k forks
vwxyzjn/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Language: Python · 4.8k stars · 559 forks
allenai/RL4LMs
A modular RL library to fine-tune language models to human preferences
Language: Python · 2.1k stars · 191 forks
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
Language: Python · 4.2k stars · 354 forks
stanford-oval/storm
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
Language: Python · 4.8k stars · 478 forks
lucasxlu/HMTNet
Official PyTorch implementation of paper <Hierarchical Multi-task Network For Race, Gender and Facial Attractiveness Recognition> (IEEE International Conference on Image Processing (ICIP) 2019)
Language: Python · 466
frederikme/TinderBotz
Automated Tinder bot and scraper using selenium in python.
Language: Python · 540145
yt-dlp/yt-dlp
A feature-rich command-line audio/video downloader
Language: Python · 76.1k stars · 6k forks
glgh/awesome-llm-human-preference-datasets
A curated list of Human Preference Datasets for LLM fine-tuning, RLHF, and eval.
27512
jayroxis/CKA-similarity
A NumPy and PyTorch Implementation of CKA-similarity with CUDA support
Language: Jupyter Notebook · 7711
santacml/nn_pruning_uniqueness
Prune a model while finetuning or training.
Language: Python · 4
xai-org/grok-1
Grok open release
Language: Python · 49.1k stars · 8.3k forks
GoodAI/goodai-ltm-benchmark
A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you need to evaluate your own agents. See more in the blogpost:
Language: HTML · 488
rasbt/dora-from-scratch
LoRA and DoRA from Scratch Implementations
Language: Jupyter Notebook · 16611
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
Language: Python · 8.7k stars · 789 forks
jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Language: Python · 7.3k stars · 423 forks
THUDM/AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Language: Python · 2k stars · 134 forks
penghao-wu/vstar
PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"
Language: Python · 47130
TransformerLensOrg/TransformerLens
A library for mechanistic interpretability of GPT-style language models
Language: Python · 1.2k stars · 239 forks
callummcdougall/ARENA_3.0
Language: HTML · 170118
TransformerLensOrg/CircuitsVis
Mechanistic Interpretability Visualizations using React
Language: Jupyter Notebook · 15624
redwoodresearch/rust_circuit_public
Language: Rust · 562
yule-BUAA/MergeLM
Codebase for Merging Language Models (ICML 2024)
Language: Python · 68540
hannamw/gpt2-greater-than
Code Release for the 2023 NeurIPS Paper "How does GPT-2 compute greater-than?: Interpreting mathematical abilities in a pre-trained language model"
Language: Python · 41
HoagyC/sparse_coding
Using sparse coding to find distributed representations used by neural networks.
Language: Jupyter Notebook · 13125
dvmazur/mixtral-offloading
Run Mixtral-8x7B models in Colab or consumer desktops
Language: Python · 2.3k stars · 224 forks
state-spaces/mamba
Mamba SSM architecture
Language: Python · 11.5k stars · 940 forks
JShollaj/awesome-llm-interpretability
A curated list of Large Language Model (LLM) Interpretability resources.
97281
rasbt/LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
Language: Jupyter Notebook · 21.3k stars · 2.2k forks
