bauerem

bauerem's Stars

VikParuchuri/surya
OCR, layout analysis, reading order, table recognition in 90+ languages
Language: Python · 13.8k · 863
deepfakes/faceswap
Deepfakes Software For All
Language: Python · 52.4k · 13.2k
ottoweiss/pdf-to-audiobook
Uses OpenAI API to clean pdf then converts it to professional grade audiobook with text to speech.
Language: Python · 321
X-PLUG/MobileAgent
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
Language: Python · 3k · 274
Emiliankapp/Tetris
Language: JavaScript · 1
OnedocLabs/react-print-pdf
Build and generate PDF using React 📄 UI kit for PDFs and print documents. Simple, reusable components and templates to create great invoices, docs, brochures. Use your favorite front-end framework React to build your next PDF.
Language: TypeScript · 2.3k · 84
nianlonggu/MemSum-DQA
Adapting an Efficient Long Document Extractive Summarizer for Document Question Answering
Language: Jupyter Notebook · 101
NoneJou072/robochain
A simulation framework based on ROS2 and LLMs (like GPT) for robot interaction tasks in the era of large models
Language: Python · 10912
goncayilmaz/Seven-Wonders-Saga-Continues
Language: Java · 2
lucidrains/video-diffusion-pytorch
Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch
Language: Python · 1.3k · 129
siriuswapnil/NLP-Matchmaking
Using TF-IDF and Keyword Extraction techniques to perform matchmaking between multiple parties in multi event system. (Python, Flask, NLTK, SQLite, Bootstrap)
Language: HTML · 1
rudolfwilliam/higher_order_adversarial_robustness
Exploring higher order effects in derivative regularization for adversarial robustness. Deep Learning project at ETH Zürich, Autumn Semester 2021
Language: Python · 21
e2b-dev/E2B
Secure open source cloud runtime for AI apps & AI agents
Language: TypeScript · 7k · 455
bibinprathap/ERP
ERP software: a complete business software solution with modules such as sales, purchase, inventory, and accounts.
Language: JavaScript · 4930
nateraw/stable-diffusion-videos
Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts
Language: Python · 4.4k · 422
diff-usion/Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models
Language: HTML · 11k · 945
nerfstudio-project/nerfacc
A General NeRF Acceleration Toolbox in PyTorch.
Language: Python · 1.4k · 115
albumentations-team/albumentations
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Language: Python · 14.2k · 1.6k
DLR-RM/rl-baselines3-zoo
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
Language: Python · 2.1k · 515
DLR-RM/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Language: Python · 9.1k · 1.7k
lucidrains/make-a-video-pytorch
Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch
Language: Python · 1.9k · 183
eps696/aphantasia
CLIP + FFT/DWT/RGB = text to image/video
Language: Python · 775102
sczhou/LEDNet
[ECCV 2022] LEDNet: Joint Low-light Enhancement and Deblurring in the Dark
Language: Python · 21428
THUDM/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Language: Python · 8.5k · 811
mozilla/DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Language: C++ · 25.3k · 4k
limitless-af10/YT-Sentiment-Classifier-Chrome-Extension-v2
Language: JavaScript · 1