pmineiro

Pinned Repositories

cca
canonical correlation analysis: routines and demos
Language:MATLAB4 3 00
elfcb
Empirical Likelihood for Contextual Bandits
Language:Jupyter Notebook12 3 01
fastapprox
Approximate and vectorized versions of common mathematical functions
Language:Mathematica12 1 02
hashpca
Scalable PCA via Hashing
Language:C++5 3 00
ldlmd2016
slides and other artifacts from http://letsdiscussnips2016.weebly.com/
3 2 03
linrepcb
SpannerIGW for linearly representable infinite action contextual bandits
Language:Jupyter Notebook3 1 00
memoryrl
combining memory and rl
Language:Python3 5 02
randembed
Randomized embeddings for extreme learning
Language:MATLAB24 1 05
smoothcb
Smoothed IGW for infinite action contextual bandits
Language:ReScript3 1 00
xlst
eXtreme Learning Spectral Trees
Language:C++4 1 01

pmineiro's Repositories

pmineiro/randembed
Randomized embeddings for extreme learning
Language:MATLAB24 1 05
pmineiro/elfcb
Empirical Likelihood for Contextual Bandits
Language:Jupyter Notebook12 3 01
pmineiro/fastapprox
Approximate and vectorized versions of common mathematical functions
Language:Mathematica12 1 02
pmineiro/hashpca
Scalable PCA via Hashing
Language:C++5 3 00
pmineiro/cca
canonical correlation analysis: routines and demos
Language:MATLAB4 3 00
pmineiro/xlst
eXtreme Learning Spectral Trees
Language:C++4 1 01
pmineiro/ldlmd2016
slides and other artifacts from http://letsdiscussnips2016.weebly.com/
3 2 03
pmineiro/linrepcb
SpannerIGW for linearly representable infinite action contextual bandits
Language:Jupyter Notebook3 1 00
pmineiro/memoryrl
combining memory and rl
Language:Python3 5 02
pmineiro/smoothcb
Smoothed IGW for infinite action contextual bandits
Language:ReScript3 1 00
pmineiro/bearsmovie
someplace to put my xtranormal video
1 1 00
pmineiro/cb_bakeoff
scripts for evaluation of contextual bandit algorithms
Language:Python1 2 01
pmineiro/csrobust
Robust confidence sequences
Language:Jupyter Notebook1 1 0
pmineiro/vowpal_wabbit
John Langford's original release of Vowpal Wabbit -- a fast online learning algorithm
Language:C++1 1 0
pmineiro/aums
Alternative Universe Show
0 0
pmineiro/batch_rl
Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games
Language:Python1 0
pmineiro/CNTK
Computational Network Toolkit (CNTK)
Language:C++1 0
pmineiro/coba
Contextual bandit benchmarking
Language:Python0 0
pmineiro/dopamine
Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
Language:Jupyter Notebook1 0
pmineiro/DrQA
Reading Wikipedia to Answer Open-Domain Questions
Language:Python1 0
pmineiro/estimators
Estimators to perform off-policy evaluation
Language:Python2 0
pmineiro/grlcaffe
Caffe: a fast open framework for deep learning.
Language:C++2 0
pmineiro/lampstuff
Language:Jupyter Notebook2 0
pmineiro/LLF-Bench
A benchmark for evaluating learning agents based on just language feedback
Language:Python1 0
pmineiro/mwt-ds
Umbrella repository for projects related to the MWT Decision Service
Language:JavaScript0 0
pmineiro/mycaffe
me messing around with caffe
Language:C++2 0
pmineiro/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Language:Python0 0
pmineiro/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Language:Python
pmineiro/trajectory-transformer
Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"
Language:Python1 0
pmineiro/ubuntu-ranking-dataset-creator
A script that creates train, valid and test datasets for the ranking task from Ubuntu corpus dialogs.
Language:Jupyter Notebook1 0