piperino11's Stars
huggingface/smolagents
🤗 smolagents: a barebones library for agents. Agents write python code to call tools and orchestrate other agents.
Pythagora-io/gpt-pilot
The first real AI developer
asiff00/On-Device-Speech-to-Speech-Conversational-AI
This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming architecture for fluid conversations with immediate responses and natural interruption handling.
hexgrad/kokoro
https://hf.co/hexgrad/Kokoro-82M
SakanaAI/self-adaptive-llms
A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!
huggingface/open-r1
Fully open reproduction of DeepSeek-R1
bytedance/UI-TARS
bytedance/UI-TARS-desktop
A GUI Agent application based on UI-TARS (Vision-Language Model) that allows you to control your computer using natural language.
OpenSPG/KAG
KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge bases. It can effectively overcome the shortcomings of the traditional RAG vector similarity calculation model.
crux82/MM-IGLU-IT
menloresearch/ichigo
Local realtime voice AI
crux82/u-deppllama
Dependency parsing with Large Language Models
crux82/FEVER-it
This repository contains the Italian dataset for Fact Verification
crux82/gqa-it
Italian Question Answering on Image Scene Graphs
KindXiaoming/pykan
Kolmogorov Arnold Networks
YuanGongND/ast
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
urchade/GLiNER
Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024
FusionBrainLab/OmniFusion
OmniFusion — a multimodal model to communicate using text and images
FoundationVision/VAR
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
huggingface/parler-tts
Inference and training library for high-quality TTS models.
jasonppy/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
facebookresearch/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
artidoro/qlora
QLoRA: Efficient Finetuning of Quantized LLMs
bitsandbytes-foundation/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
state-spaces/mamba
Mamba SSM architecture
mistralai/mistral-inference
Official inference library for Mistral models
microsoft/X-Decoder
[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language
h2oai/h2ogpt
Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/
QuivrHQ/quivr
Opinionated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: PGVector, Faiss. Any Files. Any way you want.