Pinned Repositories
ARAMARL
Experiments for the paper RL under Threats
awesome-rlaif
A curated and updated list of relevant articles and repositories on Reinforcement Learning from AI Feedback (RLAIF)
distilled-self-critique
distilled Self-Critique refines the outputs of a LLM with only synthetic data
gpt-j-api
API for the GPT-J language model 🦜. Including a FastAPI backend and a streamlit frontend
machine-learning
Projects for the Computational Geometry and Machine Learning course developed in Python
refined-dpo
Refined Direct Preference Optimization with Synthetic Data for Behavioral Alignment of LLMs
samsi-deep-learning
Practical materials for the Deep Learning course at SAMSI/Duke Uni
sgmcmc-force
Samplers from the paper "Stochastic Gradient MCMC with Repulsive Forces"
stable-diffusion-aesthetic-gradients
Personalization for Stable Diffusion via Aesthetic Gradients 🎨
zero-shot-reward-models
ZYN: Zero-Shot Reward Models with Yes-No Questions
vicgalle's Repositories
vicgalle/stable-diffusion-aesthetic-gradients
Personalization for Stable Diffusion via Aesthetic Gradients 🎨
vicgalle/gpt-j-api
API for the GPT-J language model 🦜. Including a FastAPI backend and a streamlit frontend
vicgalle/zero-shot-reward-models
ZYN: Zero-Shot Reward Models with Yes-No Questions
vicgalle/distilled-self-critique
distilled Self-Critique refines the outputs of a LLM with only synthetic data
vicgalle/refined-dpo
Refined Direct Preference Optimization with Synthetic Data for Behavioral Alignment of LLMs
vicgalle/awesome-rlaif
A curated and updated list of relevant articles and repositories on Reinforcement Learning from AI Feedback (RLAIF)
vicgalle/autocrit-likert-gpt
Automatic and zero-shot critique of outputs using the OpenAI API with json outputs
vicgalle/random-thoughts
A personal blog and wiki about language models, reinforcement learning and what not.
vicgalle/ai-ml-course
Materials for the course: AI ML & Analytics
vicgalle/art-explorer
Semantic search over paintings databases using deep learning
vicgalle/curso-ml-avanzado-21
vicgalle/data-sharing
vicgalle/zero-shot-api
vicgalle/ARAMARL
Experiments for the paper RL under Threats
vicgalle/configurable-safety-tuning
Data and models for the paper "Configurable Safety Tuning of Language Models with Synthetic Preference Data"
vicgalle/neural-classifier
Neural text classifier using pytorch for legal sentences
vicgalle/nn-review
Code for the paper "Current advances in neural networks"
vicgalle/optimal-reward-design
vicgalle/phd-thesis
vicgalle/train-text2text
vicgalle/vicgalle
vicgalle/vicgalle.github.io
My personal webpage
vicgalle/wiki-example
vicgalle/Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models
vicgalle/d3-carto-map
A mapping API that uses D3 geospatial functionality
vicgalle/dlatscale_draft
This is an early version of Deep Learning at Scale course for Yandex School of Data Analysis
vicgalle/library-of-phi
vicgalle/model_card
vicgalle/stable-diffusion-webui-aesthetic-gradients
Aesthetic gradients extension for web ui
vicgalle/transformers
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.