geraldthewes's Stars
Huang2002200/PlantNet-and-PSegNet
The official code of PlantNet and PSegNet
Xianjun-Yang/PLLaMa
PLLaMa: an Open-source Large Language Model for Plants
Doriandarko/claude-engineer
Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks. This tool combines the capabilities of a large language model with practical file system operations and web search functionality.
TencentARC/PhotoMaker
PhotoMaker [CVPR 2024]
dsgt-kaggle-clef/plantclef-2024
pratikkayal/PlantDoc-Dataset
Dataset used in "PlantDoc: A Dataset for Visual Plant Disease Detection" accepted in CODS-COMAD 2020
Aider-AI/aider
aider is AI pair programming in your terminal
kijai/ComfyUI-LivePortraitKJ
ComfyUI nodes for LivePortrait
xinsir6/ControlNetPlus
ControlNet++: All-in-one ControlNet for image generation and editing!
huggingface/nanotron
Minimalistic large language model 3D-parallelism training
TheMistoAI/MistoLine
A Versatile and Robust SDXL-ControlNet Model for Adaptable Line Art Conditioning
BatouResearch/controlnet-tile-upscale
stanford-oval/storm
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
mit-han-lab/llm-awq
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
ml6team/fondant
Production-ready data processing made easy and shareable
tencent-ailab/persona-hub
Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"
ozgrozer/chatgpt-artifacts
Bring Claude's Artifacts feature to ChatGPT
fudan-generative-vision/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
togethercomputer/MoA
Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models
DachunKai/EvTexture
[ICML 2024] EvTexture: Event-driven Texture Enhancement for Video Super-Resolution
Doriandarko/maestro
A framework for Claude Opus to intelligently orchestrate subagents.
nilsherzig/LLocalSearch
LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progress of the agents and the final answer. No OpenAI or Google API keys are needed.
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
wandb/openui
OpenUI lets you describe UI using your imagination, then see it rendered live.
OceannTwT/ra-isf
[ACL 2024] RA-ISF: Learning to Answer and Understand from Retrieval Augmentation via Iterative Self-Feedback.
erew123/alltalk_tts
AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, but supports a variety of advanced features, such as a settings page, low VRAM support, DeepSpeed, narrator, model finetuning, custom models, and wav file maintenance. It can also be used with third-party software via JSON calls.
vanna-ai/vanna
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.
microsoft/kernel-memory
RAG architecture: index and query any data using LLM and natural language, track sources, show citations, asynchronous memory patterns.
Kent0n-Li/ChatDoctor
All-Hands-AI/OpenHands
🙌 OpenHands: Code Less, Make More