vicgalle

Research Scientist

Komorebi AI & ICMAT-CSICMadrid

Pinned Repositories

ARAMARL
Experiments for the paper RL under Threats
Language:Jupyter Notebook2 5 02
awesome-rlaif
A curated and updated list of relevant articles and repositories on Reinforcement Learning from AI Feedback (RLAIF)
8 2 00
configurable-safety-tuning
Data and models for the paper "Configurable Safety Tuning of Language Models with Synthetic Preference Data"
Language:Python7 2 01
distilled-self-critique
distilled Self-Critique refines the outputs of a LLM with only synthetic data
Language:Jupyter Notebook10 2 00
gpt-j-api
API for the GPT-J language model 🦜. Including a FastAPI backend and a streamlit frontend
Language:Python334 9 4357
refined-dpo
Refined Direct Preference Optimization with Synthetic Data for Behavioral Alignment of LLMs
Language:Jupyter Notebook11 2 00
samsi-deep-learning
Practical materials for the Deep Learning course at SAMSI/Duke Uni
Language:Jupyter Notebook6 4 02
sgmcmc-force
Samplers from the paper "Stochastic Gradient MCMC with Repulsive Forces"
Language:Jupyter Notebook9 4 00
stable-diffusion-aesthetic-gradients
Personalization for Stable Diffusion via Aesthetic Gradients 🎨
Language:Jupyter Notebook708 18 1865
zero-shot-reward-models
ZYN: Zero-Shot Reward Models with Yes-No Questions
Language:Python31 2 17

vicgalle's Repositories

vicgalle/stable-diffusion-aesthetic-gradients
Personalization for Stable Diffusion via Aesthetic Gradients 🎨
Language:Jupyter Notebook708 18 1865
vicgalle/gpt-j-api
API for the GPT-J language model 🦜. Including a FastAPI backend and a streamlit frontend
Language:Python334 9 4357
vicgalle/zero-shot-reward-models
ZYN: Zero-Shot Reward Models with Yes-No Questions
Language:Python31 2 17
vicgalle/refined-dpo
Refined Direct Preference Optimization with Synthetic Data for Behavioral Alignment of LLMs
Language:Jupyter Notebook11 2 00
vicgalle/distilled-self-critique
distilled Self-Critique refines the outputs of a LLM with only synthetic data
Language:Jupyter Notebook10 2 00
vicgalle/awesome-rlaif
A curated and updated list of relevant articles and repositories on Reinforcement Learning from AI Feedback (RLAIF)
8 2 00
vicgalle/configurable-safety-tuning
Data and models for the paper "Configurable Safety Tuning of Language Models with Synthetic Preference Data"
Language:Python7 2 01
vicgalle/ai-ml-course
Materials for the course: AI ML & Analytics
Language:Jupyter Notebook5 2 01
vicgalle/autocrit-likert-gpt
Automatic and zero-shot critique of outputs using the OpenAI API with json outputs
Language:Python4 2 01
vicgalle/random-thoughts
A personal blog and wiki about language models, reinforcement learning and what not.
4 3 00
vicgalle/art-explorer
Semantic search over paintings databases using deep learning
Language:Python3 3 00
vicgalle/curso-ml-avanzado-21
Language:Jupyter Notebook3 3 10
vicgalle/data-sharing
Language:Jupyter Notebook3 4 0
vicgalle/zero-shot-api
Language:Python3 3 3
vicgalle/ARAMARL
Experiments for the paper RL under Threats
Language:Jupyter Notebook2 5 02
vicgalle/neural-classifier
Neural text classifier using pytorch for legal sentences
Language:Python2 3 01
vicgalle/nn-review
Code for the paper "Current advances in neural networks"
1 3 0
vicgalle/optimal-reward-design
Language:Jupyter Notebook1 3 0
vicgalle/phd-thesis
Language:TeX1 3 0
vicgalle/train-text2text
Language:Python1 3 1
vicgalle/vicgalle
1 3 01
vicgalle/vicgalle.github.io
My personal webpage
Language:CSS1 3 1
vicgalle/wiki-example
1 3 0
vicgalle/Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models
1 0
vicgalle/d3-carto-map
A mapping API that uses D3 geospatial functionality
Language:JavaScript2 0
vicgalle/dlatscale_draft
This is an early version of Deep Learning at Scale course for Yandex School of Data Analysis
Language:Jupyter Notebook2 0
vicgalle/library-of-phi
vicgalle/model_card
2 0
vicgalle/stable-diffusion-webui-aesthetic-gradients
Aesthetic gradients extension for web ui
Language:Python1 0
vicgalle/transformers
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
Language:Python2 0