Pinned Repositories
academy
Ray tutorials from Anyscale
agents
TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.
allenact
An open source framework for research in Embodied-AI from AI2.
ArtML
Example code for the Art and ML class
ATAC
Code accompanying the paper Adversarially Trained Actor Critic for Offline Reinforcement Learning by Ching-An Cheng*, Tengyang Xie*, Nan Jiang, and Alekh Agarwal.
Autoencoders
Implementation of simple autoencoder networks with Keras
autogen
Enable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ
awesome-isaac-gym
A curated list of awesome NVIDIA Isaac Gym frameworks, papers, software, and resources
awesome-offline-rl
An index of algorithms for offline reinforcement learning (offline-rl)
azure-search-openai-demo
Demonstration of how to leverage Azure OpenAI and Cognitive Search to enable Information Search and Discovery over organizational content
microsoft-fevieira's Repositories
microsoft-fevieira/autogen
Enable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ
microsoft-fevieira/azure-search-openai-demo
Demonstration of how to leverage Azure OpenAI and Cognitive Search to enable Information Search and Discovery over organizational content
microsoft-fevieira/Bard
Reverse engineering of Google's Bard API
microsoft-fevieira/coba
Contextual bandit benchmarking
microsoft-fevieira/D4RL
A benchmark for offline reinforcement learning.
microsoft-fevieira/Bard-API
A Python package that returns Google Bard responses through an API.
microsoft-fevieira/conversation-visualizer
microsoft-fevieira/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
microsoft-fevieira/envpool
C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.
microsoft-fevieira/gdc
Code for the ICLR 2021 paper "A Distributional Approach to Controlled Text Generation"
microsoft-fevieira/Griddly
A grid-world game engine for game AI research
microsoft-fevieira/hanabi-learning-environment
hanabi_learning_environment is a research platform for Hanabi experiments.
microsoft-fevieira/holoassist.github.io
microsoft-fevieira/k9
Self-Taught Data Science
microsoft-fevieira/langchain
⚡ Building applications with LLMs through composability ⚡
microsoft-fevieira/offline_rl
PyTorch implementation of state-of-the-art offline reinforcement learning algorithms.
microsoft-fevieira/openplayground
An LLM playground you can run on your laptop
microsoft-fevieira/openplayground-api
A reverse engineered Python API wrapper for OpenPlayground (nat.dev)
microsoft-fevieira/point-e
Point cloud diffusion for 3D model synthesis
microsoft-fevieira/rank-game
SOTA algorithms for imitation learning (LfD and LfO) - Ranking algorithms for imitation learning (TMLR 2023)
microsoft-fevieira/reflexion
Reflexion: an autonomous agent with dynamic memory and self-reflection
microsoft-fevieira/rembg
Rembg is a tool to remove image backgrounds.
microsoft-fevieira/river
🌊 Online machine learning in Python
microsoft-fevieira/RL4LMs
A modular RL library to fine-tune language models to human preferences
microsoft-fevieira/roman
Python library for real-time control of a robotic manipulator
microsoft-fevieira/spinningup-simple-install
Modified version of OpenAI's spinningup that uses environments exploiting revealed randomness
microsoft-fevieira/transformer
Implementation of the transformer architecture in PyTorch, just for fun & educational purposes.
microsoft-fevieira/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
microsoft-fevieira/VQ-Diffusion
Official implementation of VQ-Diffusion
microsoft-fevieira/whisper
Robust Speech Recognition via Large-Scale Weak Supervision