ai-nikolai

CS.PhD in LLM Agents @ Imperial College London || ex tech-founder || LLMs, Agent AI, NLP, RL

London

Pinned Repositories

AdaPlanner
AdaPlanner: Language Models for Decision Making via Adaptive Planning from Feedback
Language:HTML0 0 00
ai-nikolai.github.io
Nikolai Rozanov's personal homepage.
Language:HTML0 0 00
annotation_analysis
Anotation Analysis Package
Language:Python2 1 00
barl
Bayesian Approximate Reinforcement Learning (BARL)
Language:Python1 2 00
lida
LIDA: Lightweight Interactive Dialogue Annotator (in EMNLP 2019)
Language:JavaScript1 1 00
LLamp
Larage Language Model Planning (LLAMP)
Language:Jupyter Notebook0 1 00
rl-environments
RLENVS: Reinforcement Learning Environments
Language:Python2 2 00
StateAct
StateAct
0 1 00
matilda
MATILDA: Multi-AnnoTator multi-language Interactive Lightweight Dialogue Annotator
Language:JavaScript149 8 1128
Retrograph
Common Sense or World Knowledge? Investigating Adapter-Based Knowledge Injection into Pretrained Transformers
Language:Python20 4 95

ai-nikolai's Repositories

ai-nikolai/annotation_analysis
Anotation Analysis Package
Language:Python2 1 00
ai-nikolai/rl-environments
RLENVS: Reinforcement Learning Environments
Language:Python2 2 00
ai-nikolai/barl
Bayesian Approximate Reinforcement Learning (BARL)
Language:Python1 2 00
ai-nikolai/lida
LIDA: Lightweight Interactive Dialogue Annotator (in EMNLP 2019)
Language:JavaScript1 1 00
ai-nikolai/re-arc
Extension of the ARC dataset with Explanations
Language:Jupyter Notebook1
ai-nikolai/AdaPlanner
AdaPlanner: Language Models for Decision Making via Adaptive Planning from Feedback
Language:HTML0 0 00
ai-nikolai/ai-nikolai.github.io
Nikolai Rozanov's personal homepage.
Language:HTML0 0 00
ai-nikolai/ai-nikolai.github.io-archive
Nikolai's Homepage
Language:HTML0 1 00
ai-nikolai/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Language:Python0 1 00
ai-nikolai/bsuite
bsuite is a collection of carefully-designed experiments that investigate core capabilities of a reinforcement learning (RL) agent
Language:Python0 1 00
ai-nikolai/cs-video-courses
List of Computer Science courses with video lectures.
0 1 00
ai-nikolai/kernel-example-matlab
Small library in Matlab for Kernel Methods and non-parametric functional data analysis
Language:Matlab0 2 00
ai-nikolai/LLamp
Larage Language Model Planning (LLAMP)
Language:Jupyter Notebook0 1 00
ai-nikolai/StateAct
StateAct
0 1 00
ai-nikolai/ai-nikolai
1 0
ai-nikolai/ARC-AGI
The Abstraction and Reasoning Corpus
Language:JavaScript0 0
ai-nikolai/awesome-o1
A bibliography and survey of the papers surrounding o1
Language:TeX0 0
ai-nikolai/cuda-101
Cuda practicals based on Prof. Mike Giles course in Oxford University in July 2024
Language:C++1 0
ai-nikolai/dspy
DSPy: The framework for programming—not prompting—language models
ai-nikolai/kactl
KTH Algorithm Competition Template Library (... eller KTHs AC-tillverkande lapp)
Language:C++0 0
ai-nikolai/kernel-methods
Kernel Library Built on top of Tensorflow (in progress)
Language:Python2 0
ai-nikolai/L1B3RT45
JAILBREAK PROMPTS FOR ALL MAJOR AI MODELS
0 0
ai-nikolai/ReAct
[ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models
Language:Jupyter Notebook0 0
ai-nikolai/reflexion
[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning
Language:Python0 0
ai-nikolai/Retrograph-1
Common Sense or World Knowledge? Investigating Adapter-Based Knowledge Injection into Pretrained Transformers
Language:Python1 0
ai-nikolai/snfs-primes
A list of SNFS Primes and the corresponding SAGE worksheet
Language:TeX2 0
ai-nikolai/TensorFlowASR
:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
Language:Python1 0
ai-nikolai/WebShop
[NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents
Language:Python0 0