Pinned Repositories
AutoObjectRemoval
Automated object removal from videos based on instance segmentation and flow-guided video completion.
bitllama
Initial implementation of 1.58-bit Llama Model
cerebras-lora
Instruct-tune Cerebras-GPT on consumer hardware
GermanBenchmark
A repository containing the code for translating popular LLM benchmarks to German.
KOSMOS_reimplementation
A reimplementation of KOSMOS-1 from "Language Is Not All You Need: Aligning Perception with Language Models"
llama_gradio_interface
Inference code for LLaMA models with Gradio Interface and rolling generation like ChatGPT
lm-evaluation-harness-de
A framework for few-shot evaluation of autoregressive language models.
OFA_Explain
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
prismer_gradio_demo
The implementation of "Prismer: A Vision-Language Model with An Ensemble of Experts".
tagesschau
bjoernpl's Repositories
bjoernpl/llama_gradio_interface
Inference code for LLaMA models with Gradio Interface and rolling generation like ChatGPT
bjoernpl/KOSMOS_reimplementation
A reimplementation of KOSMOS-1 from "Language Is Not All You Need: Aligning Perception with Language Models"
bjoernpl/GermanBenchmark
A repository containing the code for translating popular LLM benchmarks to German.
bjoernpl/lm-evaluation-harness-de
A framework for few-shot evaluation of autoregressive language models.
bjoernpl/cerebras-lora
Instruct-tune Cerebras-GPT on consumer hardware
bjoernpl/tagesschau
bjoernpl/prismer_gradio_demo
The implementation of "Prismer: A Vision-Language Model with An Ensemble of Experts".
bjoernpl/bitllama
Initial implementation of 1.58-bit Llama Model
bjoernpl/handball_referee_eval
bjoernpl/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
bjoernpl/FastEval
Fast evaluation of chat language models. Includes leaderboard.
bjoernpl/llm_unterhaltung
bjoernpl/bjoernpl
bjoernpl/axolotl
Go ahead and axolotl questions
bjoernpl/de_instruct
bjoernpl/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
bjoernpl/distilabel
Distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency
bjoernpl/epfl-megatron
distributed trainer for LLMs
bjoernpl/exllama
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
bjoernpl/inspect_ai
Inspect: A framework for large language model evaluations
bjoernpl/llama-pipeline-parallel
A prototype repo for hybrid training of pipeline parallel and distributed data parallel with comments on core code snippets. Feel free to copy code and launch discussions about the problems you have encoured.
bjoernpl/llama_index
LlamaIndex is a data framework for your LLM applications
bjoernpl/LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Model for All.
bjoernpl/NeedleInAHaystack_DE
Doing simple retrieval from LLM models at various context lengths to measure accuracy
bjoernpl/Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
bjoernpl/qlora_oasst
QLoRA: Efficient Finetuning of Quantized LLMs
bjoernpl/stk
bjoernpl/text-dedup-oscar2023
All-in-one text de-duplication
bjoernpl/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
bjoernpl/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs