ai-nikolai
CS.PhD in LLM Agents @ Imperial College London || ex tech-founder || LLMs, Agent AI, NLP, RL
London
Pinned Repositories
AdaPlanner
AdaPlanner: Language Models for Decision Making via Adaptive Planning from Feedback
ai-nikolai.github.io
Nikolai Rozanov's personal homepage.
annotation_analysis
Anotation Analysis Package
barl
Bayesian Approximate Reinforcement Learning (BARL)
lida
LIDA: Lightweight Interactive Dialogue Annotator (in EMNLP 2019)
LLamp
Larage Language Model Planning (LLAMP)
rl-environments
RLENVS: Reinforcement Learning Environments
StateAct
StateAct
matilda
MATILDA: Multi-AnnoTator multi-language Interactive Lightweight Dialogue Annotator
Retrograph
Common Sense or World Knowledge? Investigating Adapter-Based Knowledge Injection into Pretrained Transformers
ai-nikolai's Repositories
ai-nikolai/annotation_analysis
Anotation Analysis Package
ai-nikolai/rl-environments
RLENVS: Reinforcement Learning Environments
ai-nikolai/barl
Bayesian Approximate Reinforcement Learning (BARL)
ai-nikolai/lida
LIDA: Lightweight Interactive Dialogue Annotator (in EMNLP 2019)
ai-nikolai/re-arc
Extension of the ARC dataset with Explanations
ai-nikolai/AdaPlanner
AdaPlanner: Language Models for Decision Making via Adaptive Planning from Feedback
ai-nikolai/ai-nikolai.github.io
Nikolai Rozanov's personal homepage.
ai-nikolai/ai-nikolai.github.io-archive
Nikolai's Homepage
ai-nikolai/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
ai-nikolai/bsuite
bsuite is a collection of carefully-designed experiments that investigate core capabilities of a reinforcement learning (RL) agent
ai-nikolai/cs-video-courses
List of Computer Science courses with video lectures.
ai-nikolai/kernel-example-matlab
Small library in Matlab for Kernel Methods and non-parametric functional data analysis
ai-nikolai/LLamp
Larage Language Model Planning (LLAMP)
ai-nikolai/StateAct
StateAct
ai-nikolai/ai-nikolai
ai-nikolai/ARC-AGI
The Abstraction and Reasoning Corpus
ai-nikolai/awesome-o1
A bibliography and survey of the papers surrounding o1
ai-nikolai/cuda-101
Cuda practicals based on Prof. Mike Giles course in Oxford University in July 2024
ai-nikolai/dspy
DSPy: The framework for programming—not prompting—language models
ai-nikolai/kactl
KTH Algorithm Competition Template Library (... eller KTHs AC-tillverkande lapp)
ai-nikolai/kernel-methods
Kernel Library Built on top of Tensorflow (in progress)
ai-nikolai/L1B3RT45
JAILBREAK PROMPTS FOR ALL MAJOR AI MODELS
ai-nikolai/ReAct
[ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models
ai-nikolai/reflexion
[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning
ai-nikolai/Retrograph-1
Common Sense or World Knowledge? Investigating Adapter-Based Knowledge Injection into Pretrained Transformers
ai-nikolai/snfs-primes
A list of SNFS Primes and the corresponding SAGE worksheet
ai-nikolai/TensorFlowASR
:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
ai-nikolai/WebShop
[NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents