rafapi

AI Alignment and Human-in-the-loop Machine Learning, focusing on the interplay between artificial intelligence and human-guided learning processes.

@ServiceNow ResearchLondon, UK

Pinned Repositories

baal
Bayesian active learning library for research and industrial usecases.
Language:Python905 16 11686
AI-Researcher
AI researcher - with a single query determine focus areas to investigate, searching the web and scraping content from relevant websites to do research autonomously.
Language:Python0 0 00
alternative_article
Language:Python0 1 00
contrastive_rl
Language:Python3 1 01
fastapi-prophet
Stock Market predictions with Prophet and FastAPI
Language:Python17 0 03
pomolux
Python API for Luxafor combined with a Pomodoro timer
Language:Python1 2 10
PipelineRL
A scalable asynchronous reinforcement learning implementation with in-flight weight updates.
Language:Python14012
TapeAgents
TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycle
Language:Python296 12 3637

rafapi's Repositories

rafapi/fastapi-prophet
Stock Market predictions with Prophet and FastAPI
Language:Python17 0 03
rafapi/contrastive_rl
Language:Python3 1 01
rafapi/AI-Researcher
AI researcher - with a single query determine focus areas to investigate, searching the web and scraping content from relevant websites to do research autonomously.
Language:Python0 0 00
rafapi/alternative_article
Language:Python0 1 00
rafapi/backend-microservices
Microservices using RabbitMQ as message broker
Language:Python0 2 01
rafapi/blog
Source code of my personal blog
Language:TypeScript0 1 00
rafapi/fastapi_text_sum
Text Summarisation using FastAPI
Language:Python0 1 00
rafapi/mvenv
A Python 3 Virtual Environment Management Tool
Language:Shell0 2 00
rafapi/summariser_client
A multi-platform client to consume the Summariser API
Language:Dart0 2 00
rafapi/custom-py-docker
Dockerfile with custom python indtall
Language:Dockerfile1 0
rafapi/d4rl
A benchmark for offline reinforcement learning.
Language:Python0 0
rafapi/faas-fns
FaaS functions
Language:HTML2 0
rafapi/finetuner
Finetuning any DNN for better embedding on neural search tasks
Language:Python0 0
rafapi/frontend-microservices
Language:TypeScript2 01
rafapi/google-research
Google Research
Language:Jupyter Notebook0 0
rafapi/jax-rl
Language:Python1 0
rafapi/Kalman-and-Bayesian-Filters-in-Python
Kalman Filter book using Jupyter Notebook. Focuses on building intuition and experience, not formal proofs. Includes Kalman filters,extended Kalman filters, unscented Kalman filters, particle filters, and more. All exercises include solutions.
Language:Jupyter Notebook0 0
rafapi/mjrl
Reinforcement learning algorithms for MuJoCo tasks
Language:Python0 0
rafapi/nano-aha-moment
Single GPU, From Scratch (No RL Library), Efficient, Full Parameter Tuning Implementation of DeepSeek R1-Zero style training.
rafapi/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Language:Jupyter Notebook0 0
rafapi/PipelineRL
A scalable asynchronous reinforcement learning implementation with in-flight weight updates.
Language:Python
rafapi/PMTG
Language:Python2 0
rafapi/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Language:Python0 0
rafapi/question-answering
Language:Jupyter Notebook2 0
rafapi/rafapi
1 0
rafapi/rafapi.github.io
Language:HTML1 0
rafapi/stable-diffusion-webui
Stable Diffusion web UI
Language:Python0 0
rafapi/thompson
Thompson Sampling Tutorial
Language:Jupyter Notebook0 0
rafapi/trl
Train transformer language models with reinforcement learning.
rafapi/whatsapp-mcp