yuymf's Stars
public-apis/public-apis
A collective list of free APIs
Anduin2017/HowToCook
程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).
dair-ai/Prompt-Engineering-Guide
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
KillianLucas/open-interpreter
A natural language interface for computers
testerSunshine/12306
12306智能刷票,订票
KRTirtho/spotube
🎧 Open source Spotify client that doesn't require Premium nor uses Electron! Available for both desktop & mobile!
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell.
rasbt/LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
joaomdmoura/crewAI
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
facebookresearch/codellama
Inference code for CodeLlama models
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
InstantID/InstantID
InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥
jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
gaomingqi/Track-Anything
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
langchain-ai/langgraph
Build resilient language agents as graphs.
xorbitsai/inference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
guoqincode/Open-AnimateAnyone
Unofficial Implementation of Animate Anyone
bclavie/RAGatouille
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
google-research/kubric
A data generation pipeline for creating semi-realistic synthetic multi-object videos with rich annotations such as instance segmentation masks, depth maps, and optical flow.
OpenLLMAI/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
3DTopia/LGM
[ECCV 2024 Oral] LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.
stanford-oval/WikiChat
WikiChat is an improved RAG. It stops the hallucination of large language models by retrieving data from a corpus.
researchmm/LightTrack
[CVPR21] LightTrack: Finding Lightweight Neural Network for Object Tracking via One-Shot Architecture Search
david-rhodes/GSOPs
Gaussian Splat Operators for SideFX Houdini
yc9701/pansori
Tools for ASR Corpus Generation from Online Video
neu-vi/ezflow
A modular PyTorch library for optical flow estimation using neural networks
vision4robotics/SCT
This is the official code for the paper "Tracker Meets Night: A Transformer Enhancer for UAV Tracking".
HUSTDML/CTTrack
difhnp/MAT
code for 'Representation Learning for Visual Object Tracking by Masked Appearance Transfer'
vision4robotics/SGDViT