GeorgeTzannetos
PhD Student in Reinforcement Learning @MPI-SWS @machine-teaching-group
Max Planck Institute for Software Systems, Saarbrücken
GeorgeTzannetos's Stars
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
pytorch/vision
Datasets, Transforms and Models specific to Computer Vision
google/gemma_pytorch
The official PyTorch implementation of Google's Gemma models
CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
allenai/RL4LMs
A modular RL library to fine-tune language models to human preferences
allenai/reward-bench
RewardBench: the first evaluation tool for reward models.
ReproModel/repromodel
Boosting AI research efficiency
Cornell-RL/tril
Orion-AI-Lab/Hephaestus
Hephaestus: A large scale multitask dataset towards InSAR understanding
Orion-AI-Lab/EfficientBigEarthNet
Code and models for efficient training on the BigEarthNet dataset for Land Use Land Cover classification
inquire-benchmark/INQUIRE
This repo contains the evaluation code for the INQUIRE benchmark
Orion-AI-Lab/igarss23_DL4NH
Deep Learning for monitoring and forecasting natural hazards with earth observation data
omipan/camera_traps_self_supervised
This repository contains the code for reproducing the results of our ICCV 2021 paper: "Focus on the Positives: Self-Supervised Learning for Biodiversity Monitoring".
ngbountos/ConvolutionalsSeq2Seq
PyTorch implementation of convolutional seq2seq (Convolutional Sequence to Sequence Learning, Jonas Gehring et al.)
machine-teaching-group/neurips2020_synthesizing-tasks