Pishtiko's Stars
QwenLM/Qwen2.5-Coder
Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by the Qwen team, Alibaba Cloud.
exo-explore/exo
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
microsoft/MoGe
MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision
instantX-research/InstantIR
InstantIR: Blind Image Restoration with Instant Generative Reference 🔥
RoaringBitmap/RoaringBitmap
A better compressed bitset in Java: used by Apache Spark, Netflix Atlas, Apache Pinot, Tablesaw, and many others
microsoft/OmniParser
A simple screen parsing tool toward a pure vision-based GUI agent
VectorSpaceLab/OmniGen
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
soimort/you-get
⏬ Dumb downloader that scrapes the web
kijai/ComfyUI-MochiWrapper
ucbepic/docetl
A system for agentic LLM-powered data processing and ETL
deepseek-ai/Janus
Janus-Series: Unified Multimodal Understanding and Generation Models
DPDK/dpdk
Data Plane Development Kit
fudan-generative-vision/hallo2
Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
coder/coder
Provision remote development environments via Terraform
LituRout/RF-Inversion
Rectified Flow Inversion (RF-Inversion)
genmoai/mochi
The best OSS video generation models
Zyphra/Zamba2
PyTorch implementation of models from the Zamba2 series.
facebookresearch/LayerSkip
Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024
facebookresearch/LWE-benchmarking
This repository contains code to generate and preprocess Learning with Errors (LWE) data, along with implementations of four LWE attacks: uSVP, SALSA, Cool&Cruel, and Dual Hybrid Meet-in-the-Middle (MitM). We invite contributors to reproduce our results, improve on these methods, and/or suggest new concrete attacks on LWE.
alimama-creative/FLUX-Controlnet-Inpainting
VRSEN/agency-swarm
The only reliable agent framework built on top of the latest OpenAI Assistants API.
HKUDS/LightRAG
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
SWivid/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
a-r-r-o-w/finetrainers
Memory-optimized training scripts for video models based on Diffusers
openai/swarm
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by the OpenAI Solution team.
rhymes-ai/Aria
Codebase for Aria - an Open Multimodal Native MoE
huggingface/evaluation-guidebook
Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!
lapce/lapce
Lightning-fast and Powerful Code Editor written in Rust
NVlabs/EAGLE
Eagle Family: Exploring Model Designs, Data Recipes and Training Strategies for Frontier-Class Multimodal LLMs
chaidiscovery/chai-lab
Chai-1, SOTA model for biomolecular structure prediction