Mihaiii

Romania

Mihaiii's Stars

opendatalab/MinerU
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具，将PDF转换成Markdown和JSON格式。
Language:Python20.2k 108 6721.4k
microsoft/graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
Language:Python20.1k 142 5342k
flipperdevices/flipperzero-firmware
Flipper Zero firmware source code
Language:C13k 296 1.1k2.7k
feder-cr/linkedIn_auto_jobs_applier_with_AI
LinkedIn_AIHawk is a tool that automates the jobs application process on LinkedIn. Utilizing artificial intelligence, it enables users to apply for multiple job offers in an automated and personalized way.
Language:Python12.6k 82 2632k
NexaAI/nexa-sdk
Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (ASR), and text-to-speech (TTS) capabilities.
Language:Python4.6k 352 37675
modelscope/ms-swift
Use PEFT or Full-parameter to finetune 400+ LLMs or 100+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL, Phi3.5-Vision, ...)
Language:Python4.5k 22 1.4k394
argmaxinc/WhisperKit
On-device Speech Recognition for Apple Silicon
Language:Swift4k 39 148338
linkedin/Liger-Kernel
Efficient Triton Kernels for LLM Training
Language:Python3.6k 41 123215
PatrickJS/awesome-cursorrules
📄 A curated list of awesome .cursorrules files
3.2k 24 6173
ictnlp/LLaMA-Omni
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
Language:Python2.6k 28 49178
comet-ml/opik
Open-source end-to-end LLM Development Platform
Language:Java2.4k 31 82151
bytedance/GiantMIDI-Piano
Language:Python1.7k 24 11177
zml/zml
High performance AI inference stack. Built for production. @ziglang / @openxla / MLIR / @bazelbuild
Language:Zig1.7k 24 1660
feizc/FluxMusic
Text-to-Music Generation with Rectified Flow Transformers
Language:Python1.6k 20 23123
merveenoyan/smol-vision
Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜
Language:Jupyter Notebook915 13 1388
PABannier/bark.cpp
Suno AI's Bark model in C/C++ for fast text-to-speech generation
Language:C++740 38 8861
sandrohanea/whisper.net
Whisper.net. Speech to text made simple using Whisper Models
Language:C#590 24 14192
JUSTSUJAY/nlp-zero-to-hero
NLP Zero to Hero in just 10 Kernels
Language:Jupyter Notebook526 6 069
RayFernando1337/MLX-Auto-Subtitled-Video-Generator
Generate accurate transcripts using Apple's MLX framework
Language:Python330 1 030
PragmaticMachineLearning/docai
Structured information extraction from documents
Language:Python286 4 427
njucckevin/SeeClick
The model, data and code for the visual GUI Agent SeeClick
Language:HTML235 2 4712
zhangfaen/finetune-Qwen2-VL
Language:Jupyter Notebook226 2 1921
1mrat/cursor
Repo of cursor prompts
213 11 117
luckyrobots/luckyrobots
We are on a mission to make robotics available to the regular software engineers, by decoupling it from ROS and physical hardware.
Language:Python91 8 107
2U1/Phi3-Vision-Finetune
An open-source implementaion for fine-tuning Phi3-Vision and Phi3.5-Vision by Microsoft.
Language:Python75 2 3112
GaiZhenbiao/Phi3V-Finetuning
Parameter-efficient finetuning script for Phi-3-vision, the strong multimodal language model by Microsoft.
Language:Python54 1 318
AMAAI-Lab/MidiCaps
A large-scale dataset of caption-annotated MIDI files.
Language:Python49 2 51
microsoft/UICaption
We release the UICaption dataset. The dataset consists of UI images (icons and screenshots) and associated text descriptions. This dataset was used to pre-train the Lexi model which provides a generic representation of UI screens and their components.
Language:Python35 4 34
showlab/GUI-Narrator
Repository of GUI Action Narrator
Language:JavaScript4 2 00
mihaidobrescu1111/guess_the_word
Language:Python1

Mihaiii

Mihaiii's Stars

opendatalab/MinerU

microsoft/graphrag

flipperdevices/flipperzero-firmware

feder-cr/linkedIn_auto_jobs_applier_with_AI

NexaAI/nexa-sdk

modelscope/ms-swift

argmaxinc/WhisperKit

linkedin/Liger-Kernel

PatrickJS/awesome-cursorrules

ictnlp/LLaMA-Omni

comet-ml/opik

bytedance/GiantMIDI-Piano

zml/zml

feizc/FluxMusic

merveenoyan/smol-vision

PABannier/bark.cpp

sandrohanea/whisper.net

JUSTSUJAY/nlp-zero-to-hero

RayFernando1337/MLX-Auto-Subtitled-Video-Generator

PragmaticMachineLearning/docai

njucckevin/SeeClick

zhangfaen/finetune-Qwen2-VL

1mrat/cursor

luckyrobots/luckyrobots

2U1/Phi3-Vision-Finetune

GaiZhenbiao/Phi3V-Finetuning

AMAAI-Lab/MidiCaps

microsoft/UICaption

showlab/GUI-Narrator

mihaidobrescu1111/guess_the_word