Blaizzy

MLOps | LLMs | RAG | Ex - ML Research Engineer @arcee-ai

kulissiwa.comPoland

Pinned Repositories

fastmlx
FastMLX is a high performance production ready API to host MLX models.
Language:Python25230
60daysUdacity
Language:Jupyter Notebook1 2 00
BiSeNet-Implementation
Here is a tutorial on how to implement a research Paper with Keras
Language:Jupyter Notebook43 3 331
Boring_weekends
This where I put my crazy projects that I do when I'm bored or inspired by the boredom
Language:Jupyter Notebook2 2 02
Cancer_classifier
Data science, AI and Machine Learning
Language:Jupyter Notebook1 1 03
Coding-LLMs-from-scratch
Language:Jupyter Notebook29 1 01
fastmlx
FastMLX is a high performance production ready API to host MLX models.
Language:Python12 0 02
LLMOps
Deploy and scale Large Language Models (LLMs) in production.
Language:Jupyter Notebook36 2 04
mlx-embeddings
MLX-Embeddings is the best package for running Vision and Language Embedding models locally on your Mac using MLX.
Language:Python90 2 17
mlx-vlm
MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.
Language:Python706 8 9460

Blaizzy's Repositories

Blaizzy/mlx-vlm
MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.
Language:Python706 8 9460
Blaizzy/mlx-embeddings
MLX-Embeddings is the best package for running Vision and Language Embedding models locally on your Mac using MLX.
Language:Python90 2 17
Blaizzy/LLMOps
Deploy and scale Large Language Models (LLMs) in production.
Language:Jupyter Notebook36 2 04
Blaizzy/Coding-LLMs-from-scratch
Language:Jupyter Notebook29 1 01
Blaizzy/fastmlx
FastMLX is a high performance production ready API to host MLX models.
Language:Python12 0 02
Blaizzy/cuda-learning
This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mastering CUDA programming. Whether you're just starting or looking to optimize and scale your GPU-accelerated applications.
51
Blaizzy/infini-transformer
PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" (https://arxiv.org/abs/2404.07143)
Language:Python1 0 00
Blaizzy/InfiniTransformer
Unofficial PyTorch/🤗Transformers(Gemma/Llama3) implementation of Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention
Language:Python1 0 0
Blaizzy/LayerSkip
"LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", Accepted to ACL 2024
Language:Python1 0 0
Blaizzy/lm-evaluation-harness
A framework for few-shot evaluation of language models.
Language:Python1 0 0
Blaizzy/VoCoT
VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models
1
Blaizzy/WizardLM
LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath
Language:Python1 1 0
Blaizzy/axolotl
Go ahead and axolotl questions
Language:Python0 0
Blaizzy/claude-engineer
Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks. This tool combines the capabilities of a large language model with practical file system operations and web search functionality.
Language:Python0 0
Blaizzy/cohere-toolkit
Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.
Language:TypeScript0 0
Blaizzy/distilabel
⚗️ distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency.
Language:Python0 0
Blaizzy/gemma-2B-10M
Gemma 2B with 10M context length using Infini-attention.
Language:Python0 0
Blaizzy/gpt-prompt-engineer
Language:Jupyter Notebook0 0
Blaizzy/langchain
🦜🔗 Build context-aware reasoning applications
Language:Python0 0
Blaizzy/laserRMT
This is our own implementation of 'Layer Selective Rank Reduction'
Blaizzy/llama-3-8b-self-align
StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation applied to llama 3 8b
Language:Python0 0
Blaizzy/Mixtral-Model-Expert-Extractor
Language:Python0 0
Blaizzy/mlx
MLX: An array framework for Apple silicon
Language:C++0 0
Blaizzy/mlx-examples
Examples in the MLX framework
Language:Python0 0
Blaizzy/MMAudio
[arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
Blaizzy/moonshine
Fast and accurate automatic speech recognition (ASR) for edge devices
Language:Python0 0
Blaizzy/moshi
Language:Python0 0
Blaizzy/PruneMe
Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models
Language:Python0 0
Blaizzy/Qwen2-Audio
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
Language:Python0 0
Blaizzy/transformers
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
Language:Python1 0