bauerem's Stars
VikParuchuri/surya
OCR, layout analysis, reading order, table recognition in 90+ languages
deepfakes/faceswap
Deepfakes Software For All
ottoweiss/pdf-to-audiobook
Uses OpenAI API to clean pdf then converts it to professional grade audiobook with text to speech.
X-PLUG/MobileAgent
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
Emiliankapp/Tetris
OnedocLabs/react-print-pdf
Build and generate PDF using React 📄 UI kit for PDFs and print documents. Simple, reusable components and templates to create great invoices, docs, brochures. Use your favorite front-end framework React to build your next PDF.
nianlonggu/MemSum-DQA
Adapting an Efficient Long Document Extractive Summarizer for Document Question Answering
NoneJou072/robochain
A simulation framework based on ROS2 and LLMs(like GPT) for robot interaction tasks in the era of large models
goncayilmaz/Seven-Wonders-Saga-Continues
lucidrains/video-diffusion-pytorch
Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch
siriuswapnil/NLP-Matchmaking
Using TF-IDF and Keyword Extraction techniques to perform matchmaking between multiple parties in multi event system. (Python, Flask, NLTK, SQLite, Bootstrap)
rudolfwilliam/higher_order_adversarial_robustness
Exploring higher order effects in derivative regularization for adversarial robustness. Deep Learning project at ETH Zürich, Autumn Semester 2021
e2b-dev/E2B
Secure open source cloud runtime for AI apps & AI agents
bibinprathap/ERP
ERP Software This software can be described as a complete business software solution.It has module such as sales , purchase ,inventory,Accounts.
nateraw/stable-diffusion-videos
Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts
diff-usion/Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models
nerfstudio-project/nerfacc
A General NeRF Acceleration Toolbox in PyTorch.
albumentations-team/albumentations
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
DLR-RM/rl-baselines3-zoo
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
DLR-RM/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
lucidrains/make-a-video-pytorch
Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch
eps696/aphantasia
CLIP + FFT/DWT/RGB = text to image/video
sczhou/LEDNet
[ECCV 2022] LEDNet: Joint Low-light Enhancement and Deblurring in the Dark
THUDM/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
mozilla/DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
limitless-af10/YT-Sentiment-Classifier-Chrome-Extension-v2