lucferreira-27's Stars
notDavidsGit/animesocialnetworks
Rasters and GEXF network files for the Not David power of friendship video
huridocs/pdf-document-layout-analysis
A Docker-powered service for PDF document layout analysis. It segments and classifies the parts of PDF pages, identifying elements such as text, titles, pictures, and tables.
ManimCommunity/manim
A community-maintained Python framework for creating mathematical animations.
RapidAI/RapidOCR
📄 Awesome OCR toolkits for multiple programming languages, based on ONNXRuntime, OpenVINO and PaddlePaddle.
mc-bench/orchestrator
Worker to orchestrate and manage running an arbitrary number of LLM-generated builds concurrently using containerized Minecraft Servers.
livekit/agents
Build real-time multimodal AI applications 🤖🎙️📹
Nutlope/llamacoder
Open source Claude Artifacts – built with Llama 3.1 405B
Future-House/paper-qa
High accuracy RAG for answering questions from scientific documents with citations
cline/cline
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
YassKhazzan/openperplex_backend_os
openperplex is an open-source AI search engine
FunAudioLLM/CosyVoice
Multi-lingual large voice generation model, providing full-stack inference, training and deployment capabilities.
zakkor/lluminous
A fast, light, open chat UI with full tool use support across many models
e2b-dev/e2b-cookbook
Examples of using E2B
cpldcpu/MisguidedAttention
A collection of prompts to challenge the reasoning abilities of large language models in the presence of misguiding information
redimp/otterwiki
A minimalistic wiki powered by python, markdown and git.
Dahie/caramelize
Tool to migrate legacy wikis/documentation to markdown git-repository retaining history and syntax.
huggingface/parler-tts
Inference and training library for high-quality TTS models.
jasonppy/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
ragavsachdeva/magi
Generate a transcript for your favourite Manga: Detect manga characters, text blocks and panels. Order panels. Cluster characters. Match texts to their speakers. Perform OCR.
SleeeepyZhou/moondream
tiny vision language model
adenzu/Manga-Panel-Extractor
A simple program that takes manga pages and outputs the panels on them. Website: https://adenzu.github.io/Manga-Panel-Extractor/
stanford-oval/WikiChat
WikiChat is an improved RAG. It stops the hallucination of large language models by retrieving data from a corpus.
mckaywrigley/chatbot-ui
AI chat for any model.
MiAO-AI-Lab/LARP
Cerlancism/chatgpt-subtitle-translator
Efficient translation tool based on ChatGPT API
magic-research/magic-animate
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
EricGuo5513/momask-codes
Official implementation of "MoMask: Generative Masked Modeling of 3D Human Motions" (CVPR 2024)
Audio-AGI/WavJourney
WavJourney: Compositional Audio Creation with LLMs
SysCV/sam-hq
Segment Anything in High Quality [NeurIPS 2023]
Luodian/Otter
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.