a514514772
Interested in ML/DL/CV domains. A PhD student at CISPA, Germany.
CISPA – Helmholtz Center for Information SecuritySaarbrucken, Germany
a514514772's Stars
3b1b/manim
Animation engine for explanatory math videos
dair-ai/Prompt-Engineering-Guide
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
hiyouga/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
karpathy/LLM101n
LLM101n: Let's build a Storyteller
rasbt/LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
ShiArthur03/ShiArthur03
dair-ai/ML-Papers-of-the-Week
🔥Highlighting the top ML papers every week.
openlm-research/open_llama
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
cxli233/FriendsDontLetFriends
Friends don't let friends make certain types of data visualization - What are they and why are they bad.
youssefHosni/Data-Science-Interview-Questions-Answers
Curated list of data science interview questions and answers
facebookresearch/schedule_free
Schedule-Free Optimization in PyTorch
zou-group/textgrad
TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.
yoshitomo-matsubara/torchdistill
A coding-free framework built on PyTorch for reproducible deep learning studies. 🏆25 knowledge distillation methods presented at CVPR, ICLR, ECCV, NeurIPS, ICCV, etc are implemented so far. 🎁 Trained models, training logs and configurations are available for ensuring the reproducibiliy and benchmark.
horseee/Awesome-Efficient-LLM
A curated list for Efficient Large Language Models
JShollaj/awesome-llm-interpretability
A curated list of Large Language Model (LLM) Interpretability resources.
matanki-saito/EU4dll
Europa Universalis IV double byte language patch; master:1.34.2, dev:1.37.4.0
NUS-HPC-AI-Lab/Neural-Network-Parameter-Diffusion
We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters
LTH14/mar
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
ngruver/llmtime
google-deepmind/opro
official code for "Large Language Models as Optimizers"
test-time-training/ttt-lm-jax
Official JAX implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
lucidrains/q-transformer
Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Autoregressive Q-Functions, out of Google Deepmind
buggyyang/CDC_compression
codebase for Lossy Image Compression with Conditional Diffusion Models
microsoft/DPSDA
[ICLR 2024] Generating DP Synthetic Data without Training
FareedKhan-dev/create-stable-diffusion-from-scratch
Implemented a stable diffusion architecture using PyTorch.
nctu-eva-lab/AntifakePrompt
This is the official implementation of AntifakePrompt.
NVlabs/MCPNet
[CVPR 2024] Official Repository for MCPNet: An Interpretable Classifier via Multi-Level Concept Prototypes
LemonATsu/CUDA-kNN-Aniso-Gaussian-Feature-Aggregation
uds-lsv/multilingual-icl-analysis
Code for the paper 'The Impact of Demonstrations on Multilingual In-Context Learning: A Multidimensional Analysis' (Findings of ACL 2024)
uds-lsv/AAdaM
Code for the paper 'AAdaM at SemEval-2024 Task 1: Augmentation and Adaptation for Multilingual Semantic Textual Relatedness'