Mihaiii's Stars
2U1/Phi3-Vision-Finetune
An open-source implementaion for fine-tuning Phi3-Vision and Phi3.5-Vision by Microsoft.
GaiZhenbiao/Phi3V-Finetuning
Parameter-efficient finetuning script for Phi-3-vision, the strong multimodal language model by Microsoft.
hacksider/Deep-Live-Cam
real time face swap and one-click video deepfake with only a single image
evanrichards/json-schema-logits-processor
AnswerDotAI/fastsql
AnswerDotAI/FastHTML-Gallery
tabulapdf/tabula
Tabula is a tool for liberating data tables trapped inside PDF files
unitaryai/detoxify
Trained models & code to predict toxic comments on all 3 Jigsaw Toxic Comment Challenges. Built using ⚡ Pytorch Lightning and 🤗 Transformers. For access to our API, please email us at contact@unitary.ai.
cleanlab/cleanlab
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Hironsan/HateSonar
Hate Speech Detection Library for Python.
cognitivecomputations/grokadamw
asg017/sqlite-vec
A vector search SQLite extension that runs anywhere!
aiola-lab/whisper-medusa
Whisper with Medusa heads
arcee-ai/DistillKit
An Open Source Toolkit For LLM Distillation
Mihaiii/trivia
A live multiplayer trivia game where users can bid for the subject of the next question
Metaspectral/Hyperspectral-Starter
Meriegg/node-eFactura-generator
stanford-oval/storm
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
X-PLUG/mPLUG-DocOwl
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
andimarafioti/florence2-finetuning
Quick exploration into fine tuning florence 2
camelot-dev/camelot
A Python library to extract tabular data from PDFs
AnswerDotAI/fastlite
A bit of extra usability for sqlite
KwaiVGI/LivePortrait
Bring portraits to life!
Maximilian-Winter/llama-cpp-agent
The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM models, execute structured function calls and get structured output. Works also with models not fine-tuned to JSON output and function calls.
huggingface/local-gemma
Gemma 2 optimized for your local machine.
AnswerDotAI/fasthtml
The fastest way to create an HTML app
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
google-gemini/cookbook
Examples and guides for using the Gemini API
zou-group/textgrad
TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.