Chevolier's Stars
aws-samples/creating-ml-models-with-pyspark-on-amazon-emr
meta-soul/MetaSpore
A unified end-to-end machine intelligence platform
reczoo/FuxiCTR
A configurable, tunable, and reproducible library for CTR prediction https://fuxictr.github.io
bytedance/LargeBatchCTR
Large batch training of CTR models based on DeepCTR with CowClip.
shenweichen/DeepCTR-Torch
【PyTorch】Easy-to-use,Modular and Extendible package of deep-learning based CTR models.
mindalpha/MindAlpha
shenweichen/DeepCTR
Easy-to-use,Modular and Extendible package of deep-learning based CTR models .
ultralytics/ultralytics
Ultralytics YOLO11 🚀
peak/s5cmd
Parallel S3 and local filesystem execution tool.
bentoml/OpenLLM
Run any open-source LLMs, such as Llama 3.1, Gemma, as OpenAI compatible API endpoint in the cloud.
AILab-CVC/YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
THU-MIG/yolov10
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
ufal/whisper_streaming
Whisper realtime streaming for long speech-to-text transcription and translation
ictnlp/StreamSpeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
TMElyralab/MusePose
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
openvpi/audio-slicer
Python script that slices audio with silence detection
baaivision/EVA
EVA Series: Visual Representation Fantasies from BAAI
openai/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
PaddlePaddle/PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
facebookresearch/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
LLaVA-VL/LLaVA-NeXT
aws-samples/amazon-sagemaker-host-and-inference-whisper-model
ollama/ollama
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
whn09/amazon-sagemaker-visual-search
This repository is part of a blog post that guides users through creating a visual search application using Amazon SageMaker and Amazon Elasticsearch service
OpenNMT/CTranslate2
Fast inference engine for Transformer models
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
microsoft/Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
HLTCHKUST/cantonese-asr
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.