Pinned Repositories
fastmlx
FastMLX is a high performance production ready API to host MLX models.
60daysUdacity
BiSeNet-Implementation
Here is a tutorial on how to implement a research Paper with Keras
Boring_weekends
This where I put my crazy projects that I do when I'm bored or inspired by the boredom
Cancer_classifier
Data science, AI and Machine Learning
Coding-LLMs-from-scratch
fastmlx
FastMLX is a high performance production ready API to host MLX models.
LLMOps
Deploy and scale Large Language Models (LLMs) in production.
mlx-embeddings
MLX-Embeddings is the best package for running Vision and Language Embedding models locally on your Mac using MLX.
mlx-vlm
MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.
Blaizzy's Repositories
Blaizzy/mlx-vlm
MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.
Blaizzy/mlx-embeddings
MLX-Embeddings is the best package for running Vision and Language Embedding models locally on your Mac using MLX.
Blaizzy/LLMOps
Deploy and scale Large Language Models (LLMs) in production.
Blaizzy/Coding-LLMs-from-scratch
Blaizzy/fastmlx
FastMLX is a high performance production ready API to host MLX models.
Blaizzy/cuda-learning
This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mastering CUDA programming. Whether you're just starting or looking to optimize and scale your GPU-accelerated applications.
Blaizzy/infini-transformer
PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" (https://arxiv.org/abs/2404.07143)
Blaizzy/InfiniTransformer
Unofficial PyTorch/🤗Transformers(Gemma/Llama3) implementation of Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention
Blaizzy/LayerSkip
"LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", Accepted to ACL 2024
Blaizzy/lm-evaluation-harness
A framework for few-shot evaluation of language models.
Blaizzy/VoCoT
VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models
Blaizzy/WizardLM
LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath
Blaizzy/axolotl
Go ahead and axolotl questions
Blaizzy/claude-engineer
Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks. This tool combines the capabilities of a large language model with practical file system operations and web search functionality.
Blaizzy/cohere-toolkit
Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.
Blaizzy/distilabel
⚗️ distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency.
Blaizzy/gemma-2B-10M
Gemma 2B with 10M context length using Infini-attention.
Blaizzy/gpt-prompt-engineer
Blaizzy/langchain
🦜🔗 Build context-aware reasoning applications
Blaizzy/laserRMT
This is our own implementation of 'Layer Selective Rank Reduction'
Blaizzy/llama-3-8b-self-align
StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation applied to llama 3 8b
Blaizzy/Mixtral-Model-Expert-Extractor
Blaizzy/mlx
MLX: An array framework for Apple silicon
Blaizzy/mlx-examples
Examples in the MLX framework
Blaizzy/MMAudio
[arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
Blaizzy/moonshine
Fast and accurate automatic speech recognition (ASR) for edge devices
Blaizzy/moshi
Blaizzy/PruneMe
Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models
Blaizzy/Qwen2-Audio
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
Blaizzy/transformers
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.