microsoft-fevieira

@microsoft

Pinned Repositories

academy
Ray tutorials from Anyscale
Language:Jupyter Notebook0 0 00
agents
TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.
Language:Python0 0 00
allenact
An open source framework for research in Embodied-AI from AI2.
Language:Python0 0 00
ArtML
example codes for the Art and ML class
Language:Jupyter Notebook0 0 00
ATAC
Code accompanying the paper Adversarially Trained Actor Critic for Offline Reinforcement Learning by Ching-An Cheng*, Tengyang Xie*, Nan Jiang, and Alekh Agarwal.
Language:Python0 0 00
Autoencoders
Implementation of simple autoencoders networks with Keras
Language:Jupyter Notebook0 0 00
autogen
Enable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ
Language:Jupyter Notebook00
awesome-isaac-gym
A curated list of awesome NVIDIA Issac Gym frameworks, papers, software, and resources
0 0 00
awesome-offline-rl
An index of algorithms for offline reinforcement learning (offline-rl)
0 0 00
azure-search-openai-demo
Demonstration of how to leverage Azure OpenAI and Cognitive Search to enable Information Search and Discovery over organizational content
Language:Python00

microsoft-fevieira's Repositories

microsoft-fevieira/autogen
Enable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ
Language:Jupyter Notebook00
microsoft-fevieira/azure-search-openai-demo
Demonstration of how to leverage Azure OpenAI and Cognitive Search to enable Information Search and Discovery over organizational content
Language:Python00
microsoft-fevieira/Bard
Reverse engineering of Google's Bard API
Language:Python00
microsoft-fevieira/coba
Contextual bandit benchmarking
Language:Python0 0 00
microsoft-fevieira/D4RL
A benchmark for offline reinforcement learning.
Language:Python0 0 00
microsoft-fevieira/Bard-API
The python package that returns response of Google Bard through API.
Language:Python
microsoft-fevieira/conversation-visualizer
Language:JavaScript
microsoft-fevieira/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
Language:Python0 0
microsoft-fevieira/envpool
C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.
Language:C++0 0
microsoft-fevieira/gdc
Code for the ICLR 2021 paper "A Distributional Approach to Controlled Text Generation"
Language:Python0 0
microsoft-fevieira/Griddly
A grid-world game engine for game AI research
Language:C++0 0
microsoft-fevieira/hanabi-learning-environment
hanabi_learning_environment is a research platform for Hanabi experiments.
microsoft-fevieira/holoassist.github.io
microsoft-fevieira/k9
Self-Taught Data Science
Language:HTML0 0
microsoft-fevieira/langchain
⚡ Building applications with LLMs through composability ⚡
Language:Python
microsoft-fevieira/offline_rl
Pytorch implementation of state-of-the-art offline reinforcement learning algorithms.
microsoft-fevieira/openplayground
An LLM playground you can run on your laptop
microsoft-fevieira/openplayground-api
A reverse engineered Python API wrapper for OpenPlayground (nat.dev)
Language:Python
microsoft-fevieira/point-e
Point cloud diffusion for 3D model synthesis
microsoft-fevieira/rank-game
SOTA algorithms for imitation learning (LfD and LfO) - Ranking algorithms for imitation learning (TMLR 2023)
microsoft-fevieira/reflexion
Reflexion: an autonomous agent with dynamic memory and self-reflection
microsoft-fevieira/rembg
Rembg is a tool to remove images background.
Language:Python0 0
microsoft-fevieira/river
🌊 Online machine learning in Python
Language:Python0 0
microsoft-fevieira/RL4LMs
A modular RL library to fine-tune language models to human preferences
Language:Python
microsoft-fevieira/roman
Python library for real-time control of a robotic manipulator
microsoft-fevieira/spinningup-simple-install
Modified OpenAI's spinningup to use environments exploiting revealed randomness
microsoft-fevieira/transformer
Implementation of the transformer architecture in PyTorch, just for fun & educational purposes.
Language:Jupyter Notebook0 0
microsoft-fevieira/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Language:Python0 0
microsoft-fevieira/VQ-Diffusion
Official implementation of VQ-Diffusion
Language:Python0 0
microsoft-fevieira/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language:Jupyter Notebook0 0