LyuLumos's Stars
pigeoner/A_novel_bit-level_image_encryption_algorithm_based_on_chaotic_maps
一种新的基于混沌映射的比特级图像加密算法的 python 实现(原论文题目:A novel bit-level image encryption algorithm based on chaotic maps,链接:https://doi.org/10.1016/j.optlaseng.2015.09.007)
protectai/vulnhuntr
Zero shot vulnerability discovery using LLMs
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)
hijkzzz/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
enoche/MMRec
A Toolbox for MultiModal Recommendation. Integrating 10+ Models...
Open-Source-O1/Open-O1
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
QwenLM/AutoIF
gpt-omni/mini-omni
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
SeunghyunSEO/optimized_hf_llama_class_for_training
pcg-mlp/KsanaLLM
lucidrains/transfusion-pytorch
Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
huggingface/autotrain-advanced
🤗 AutoTrain Advanced
AILab-CVC/YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
catalyst-team/catalyst
Accelerated deep learning R&D
LowinLi/transformers-stream-generator
This is a text generation method which returns a generator, streaming out each token in real-time during inference, based on Huggingface/Transformers.
microsoft/MInference
[NeurIPS'24 Spotlight] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which reduces inference latency by up to 10x for pre-filling on an A100 while maintaining accuracy.
mendableai/firecrawl
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
microsoft/onnxruntime-genai
Generative AI extensions for onnxruntime
microsoft/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
bytedance/SALMONN
SALMONN: Speech Audio Language Music Open Neural Network
YiyanXu/DiFashion
Diffusion Models for Generative Outfit Recommendation
Tencent/HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone