emilyjiayaoli's Stars
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
openai/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
karpathy/llm.c
LLM training in simple, raw C/CUDA
mem0ai/mem0
The Memory layer for your AI apps
mendableai/firecrawl
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
stanfordnlp/dspy
DSPy: The framework for programming—not prompting—language models
openai/evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
THU-MIG/yolov10
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
vikhyat/moondream
tiny vision language model
THUDM/CogVLM
a state-of-the-art-level open visual language model | 多模态预训练模型
novicezk/midjourney-proxy
代理 MidJourney 的discord频道,实现api形式调用AI绘图
googleworkspace/md2googleslides
Generate Google Slides from markdown
ragapp/ragapp
The easiest way to use Agentic RAG in any enterprise
smirnov-am/awesome-saas-boilerplates
apptension/saas-boilerplate
SaaS Boilerplate - Open Source and free SaaS stack that lets you build SaaS products faster in React, Django and AWS. Focus on essential business logic instead of coding repeatable features!
huggingface/datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
PKU-YuanGroup/MoE-LLaVA
Mixture-of-Experts for Large Vision-Language Models
jacobhilton/deep_learning_curriculum
Language model alignment-focused deep learning curriculum
daochenzha/data-centric-AI
A curated, but incomplete, list of data-centric AI resources.
allenai/dolma
Data and tools for generating and inspecting OLMo pre-training data.
carlini/yet-another-applied-llm-benchmark
A benchmark to evaluate language models on questions I've previously asked them to solve.
web-arena-x/webarena
Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"
cvdfoundation/kinetics-dataset
ai-dock/comfyui
ComfyUI docker images for use in GPU cloud and local environments. Includes AI-Dock base for authentication and improved user experience.
ArtVentureX/comfyui-animatediff
AnimateDiff for ComfyUI
google-research/syn-rep-learn
Learning from synthetic data - code and models
implerhq/impler.io
Powerful CSV & Excel Import experience for SaaS 🚀 Save months building data import experience from scratch 💰
reka-ai/reka-vibe-eval
Multimodal language model benchmark, featuring challenging examples
Nano1337/GraFT
GraFT: Gradual Fusion Transformer for Multimodal Re-Identification