achrafs758's Stars
modelscope/ms-swift
Use PEFT or Full-parameter to finetune 400+ LLMs (Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, ...) or 100+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2.5, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL2, Phi3.5-Vision, GOT-OCR2, ...).
edshee/mlserver-example
An example showing how to use multi model serving with MLServer
MUZAMMILPERVAIZ/Arabic-Handwritten-OCR
A deep learning model (DCNNs+Bi LSTMs+CTC Loss) for identification of Handwritten Arabic Text
ibug-group/face_detection
ORB-HD/deface
Video anonymization by face detection
MaryamBoneh/Vehicle-Detection
Vehicle Detection Using Deep Learning and YOLO Algorithm
navinfoeurope/anonymizer
Detection and blurring of human faces and license plates in images.
varungupta31/dashcam_anonymizer
Code to Blur Human Faces and Vehicle License Plates in Video and Images using a SoTA Object Detection model YOLOv8
daniel1uno/Anonymize_panoramic_images
Blur faces out of panoramic 360 images obtained from mobile mapping
camenduru/Video-LLaMA-colab
InternLM/xtuner
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
Lafifi-24/Build-dataset-using-LLM-from-pdfs
CVEProject/cvelistV5
CVE cache of the official CVE List in CVE JSON 5 format
pppoe/Fooocus-SAM
Added SAM to FOOOCUS for to automatically generate masks for Inpainting - See Post For Details
lllyasviel/Fooocus
Focus on prompting and generating
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
xai-org/grok-1
Grok open release
OpenDriveLab/DriveLM
[ECCV 2024 Oral] DriveLM: Driving with Graph Visual Question Answering
vztu/BVQA_Benchmark
A resource list and performance benchmark for blind video quality assessment (BVQA) models on user-generated content (UGC) datasets. [IEEE TIP'2021] "UGC-VQA: Benchmarking Blind Video Quality Assessment for User Generated Content", Zhengzhong Tu, Yilin Wang, Neil Birkbeck, Balu Adsumilli, Alan C. Bovik
chenfei-wu/TaskMatrix
Zjh-819/LLMDataHub
A quick guide (especially) for trending instruction finetuning datasets
jshilong/GPT4RoI
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest
BAAI-DCAI/Visual-Instruction-Tuning
SVIT: Scaling up Visual Instruction Tuning
vis-nlp/ChartQA
InternLM/InternLM-XComposer
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
lupantech/MathVista
MathVista: data, code, and evaluation for Mathematical Reasoning in Visual Contexts
trsvchn/coco-viewer
Minimalistic COCO Dataset Viewer in Tkinter
zhilin007/FFA-Net
FFA-Net: Feature Fusion Attention Network for Single Image Dehazing
HumanSignal/label-studio-ml-backend
Configs and boilerplates for Label Studio's Machine Learning backend
Dicklesworthstone/llm_aided_ocr
Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.