bjoernpl

@ellamind @DiscoResearch

Pinned Repositories

AutoObjectRemoval
Automated object removal from videos based on instance segmentation and flow-guided video completion.
Language:Jupyter Notebook6 1 00
bitllama
Initial implementation of 1.58-bit Llama Model
Language:Python2 2 10
cerebras-lora
Instruct-tune Cerebras-GPT on consumer hardware
Language:Jupyter Notebook8 0 00
GermanBenchmark
A repository containing the code for translating popular LLM benchmarks to German.
Language:Python24 1 31
KOSMOS_reimplementation
A reimplementation of KOSMOS-1 from "Language Is Not All You Need: Aligning Perception with Language Models"
Language:Python27 1 21
llama_gradio_interface
Inference code for LLaMA models with Gradio Interface and rolling generation like ChatGPT
Language:Python48 1 010
lm-evaluation-harness-de
A framework for few-shot evaluation of autoregressive language models.
Language:Python13 0 04
OFA_Explain
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Language:Python3 0 00
prismer_gradio_demo
The implementation of "Prismer: A Vision-Language Model with An Ensemble of Experts".
Language:Python3 0 00
tagesschau
Language:Python4 1 02

bjoernpl's Repositories

bjoernpl/llama_gradio_interface
Inference code for LLaMA models with Gradio Interface and rolling generation like ChatGPT
Language:Python48 1 010
bjoernpl/KOSMOS_reimplementation
A reimplementation of KOSMOS-1 from "Language Is Not All You Need: Aligning Perception with Language Models"
Language:Python27 1 21
bjoernpl/GermanBenchmark
A repository containing the code for translating popular LLM benchmarks to German.
Language:Python24 1 31
bjoernpl/lm-evaluation-harness-de
A framework for few-shot evaluation of autoregressive language models.
Language:Python13 0 04
bjoernpl/cerebras-lora
Instruct-tune Cerebras-GPT on consumer hardware
Language:Jupyter Notebook8 0 00
bjoernpl/tagesschau
Language:Python4 1 02
bjoernpl/prismer_gradio_demo
The implementation of "Prismer: A Vision-Language Model with An Ensemble of Experts".
Language:Python3 0 00
bjoernpl/bitllama
Initial implementation of 1.58-bit Llama Model
Language:Python2 2 10
bjoernpl/handball_referee_eval
Language:Python2 1 0
bjoernpl/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Language:Python1 0 0
bjoernpl/FastEval
Fast evaluation of chat language models. Includes leaderboard.
Language:Python1 0 01
bjoernpl/llm_unterhaltung
Language:Python1 1 00
bjoernpl/bjoernpl
0 0 00
bjoernpl/axolotl
Go ahead and axolotl questions
Language:Python0 0
bjoernpl/de_instruct
1 0
bjoernpl/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Language:Python0 0
bjoernpl/distilabel
Distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency
Language:Python0 0
bjoernpl/epfl-megatron
distributed trainer for LLMs
Language:Python0 0
bjoernpl/exllama
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
Language:Python0 0
bjoernpl/inspect_ai
Inspect: A framework for large language model evaluations
Language:Python
bjoernpl/llama-pipeline-parallel
A prototype repo for hybrid training of pipeline parallel and distributed data parallel with comments on core code snippets. Feel free to copy code and launch discussions about the problems you have encoured.
Language:Python0 0
bjoernpl/llama_index
LlamaIndex is a data framework for your LLM applications
bjoernpl/LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Model for All.
Language:Python0 0
bjoernpl/NeedleInAHaystack_DE
Doing simple retrieval from LLM models at various context lengths to measure accuracy
Language:Jupyter Notebook
bjoernpl/Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
Language:Python0 0
bjoernpl/qlora_oasst
QLoRA: Efficient Finetuning of Quantized LLMs
Language:Jupyter Notebook0 0
bjoernpl/stk
Language:Python0 0
bjoernpl/text-dedup-oscar2023
All-in-one text de-duplication
Language:Jupyter Notebook0 0
bjoernpl/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language:Python0 0
bjoernpl/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python0 0