tommasocalo's Stars
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Cinnamon/kotaemon
An open-source RAG-based tool for chatting with your documents.
phidatahq/phidata
Build multi-modal Agents with memory, knowledge, tools and reasoning. Chat with them using a beautiful Agent UI.
SawyerHood/draw-a-ui
Draw a mockup and generate html for it
NirDiamant/GenAI_Agents
This repository provides tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a comprehensive guide for building intelligent, interactive AI systems.
ToTheBeginning/PuLID
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
ant-research/MagicQuill
[CVPR'25] Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System
PacktPublishing/LLM-Engineers-Handbook
The LLM's practical guide: From the fundamentals to deploying advanced LLM and RAG apps to AWS using LLMOps best practices
SylphAI-Inc/LLM-engineer-handbook
A curated list of Large Language Model resources, covering model training, serving, fine-tuning, and building LLM applications.
langgptai/awesome-claude-prompts
This repo includes Claude prompt curation to use Claude better.
hendurhance/ui-ux
📚 This guide is designed to help you learn UI/UX design, and is divided into three levels: Beginner, Intermediate, and Expert. It includes learning resource, guides and tools that cover all aspects of designing user interfaces and user experiences.
StreetLamb/tribe
Low code tool to rapidly build and coordinate multi-agent teams
nv-tlabs/LLaMA-Mesh
Unifying 3D Mesh Generation with Language Models
showlab/DragAnything
[ECCV 2024] DragAnything: Motion Control for Anything using Entity Representation
PKU-YuanGroup/Machine-Mindset
An MBTI Exploration of Large Language Models
revdotcom/reverb
Open source inference code for Rev's model
cosmicman-cvpr2024/CosmicMan
CosmicMan: A Text-to-Image Foundation Model for Humans (CVPR 2024)
waterhorse1/LLM_Tree_Search
(ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and training
wtybest/HairCLIPv2
[ICCV 2023] HairCLIPv2: Unifying Hair Editing via Proxy Feature Blending
rgreenblatt/arc_draw_more_samples_pub
Draw more samples
OpenGVLab/GUI-Odyssey
GUI Odyssey is a comprehensive dataset for training and evaluating cross-app navigation agents. GUI Odyssey consists of 7,735 episodes from 6 mobile devices, spanning 6 types of cross-app tasks, 201 apps, and 1.4K app combos.
hila-chefer/Conceptor
Official implementation of the paper The Hidden Language of Diffusion Models
euanong/image-hijacks
Official codebase for Image Hijacks: Adversarial Images can Control Generative Models at Runtime
poloclub/FairVis
FairVis: Visual Analytics for Discovering Intersectional Bias in Machine Learning
fredhohman/fredhohman.github.io
showlab/videogui
[NeurIPS2024] VideoGUI: A Benchmark for GUI Automation from Instructional Videos
eyalev/awesome-reading-lists
Books Reading Lists
andreaprotopapa/cheatsheets
Repo for guides publicly available
aalto-ui/SemanticCollage
SemanticCollage: A semantically enriched digital mood board tool
williamShuppert/CookNook
(In Progress) 🍽️ CookNook: Share and organize your favorite recipes with friends and enthusiasts. Discover new flavors, collaborate on culinary creations, and turn your kitchen into a culinary haven!