Drwaish

I work in Gen AI and am trying to enhance my skills with the help of computer science communities, building on the knowledge I have.

Pakistan

Drwaish's Stars

divelab/AIRS
Artificial Intelligence Research for Science (AIRS)
Language:Python54463
FoundationVision/VAR
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
Language: Jupyter Notebook, Stars: 6.4k, Forks: 428
InternLM/InternLM-XComposer
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
Language: Python, Stars: 2.7k, Forks: 159
KwaiVGI/SynCamMaster
[ARXIV'24] SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints
Language:Python2987
ictnlp/Auto-RAG
This is the official repository for Auto-RAG.
Language:Python16914
YesianRohn/TextSSR
code for TextSSR paper
Language:Python791
PKU-YuanGroup/ConsisID
Identity-Preserving Text-to-Video Generation by Frequency Decomposition
Language:Python53027
IDEA-Research/ChatRex
Code for ChatRex: Taming Multimodal LLM for Joint Perception and Understanding
Language:Python1173
ChenHoy/DROID-Splat
End-to-End SLAM with camera calibration, monocular prior integration and dense Rendering
Language:Python28014
MIC-DKFZ/nnUNet
Language: Python, Stars: 6.1k, Forks: 1.8k
basf/mamba-tabular
Mambular is a Python package that simplifies tabular deep learning by providing a suite of models for regression, classification, and distributional regression tasks. It includes models such as Mambular, TabM, FT-Transformer, TabulaRNN, TabTransformer, and tabular ResNets.
Language:Python1629
jdh-algo/JoyVASA
Language:Python56248
lewis081/CCL-Net
two-stage framework based method with cascaded contrastive learning for UIE
Language:Python4
AlonzoLeeeooo/StableV2V
The official implementation of the paper titled "StableV2V: Stablizing Shape Consistency in Video-to-Video Editing".
Language:Python1215
plageon/HtmlRAG
HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieval Results in RAG Systems
Language:Python27922
DS4SD/docling
Get your documents ready for gen AI
Language: Python, Stars: 16.6k, Forks: 845
getmaxun/maxun
🔥 Open-source no-code web data extraction platform. Turn websites to APIs and spreadsheets with no-code robots in minutes! [In Beta]
Language: TypeScript, Stars: 6.2k, Forks: 452
cvlab-kaist/PF3plat
Official Implementation of "PF3plat: Pose-Free Feed-Forward 3D Gaussian Splatting"
Language:Python1735
HelloVision/HelloMeme
The official HelloMeme GitHub site
Language:Python53337
Lakonik/MVEdit
3D-Adapter: Geometry-Consistent Multi-View Diffusion for High-Quality 3D Generation
Language:JavaScript30615
rhymes-ai/Allegro
Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.
Language:Python95548
MCG-NJU/EMA-VFI
[CVPR 2023] Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolation
Language:Python41742
ptrvilya/blendify
Lightweight Python framework that provides a high-level API for creating and rendering scenes with Blender.
Language:Python78520
ai4colonoscopy/IntelliScope
Frontiers in Intelligent Colonoscopy [ColonSurvey | ColonINST | ColonGPT]
Language:Python463
microsoft/BitNet
Official inference framework for 1-bit LLMs
Language: C++, Stars: 12.5k, Forks: 873
amjadraza/pandasai-app-gradio
Language:Python556
adarshb3/Virtual-Try-On-Application-using-Flask-Twilio-and-Gradio
This repository contains the code for a virtual try-on application built using Flask, Twilio's WhatsApp API, and Gradio's virtual try-on model. Users can send images via WhatsApp to try on garments virtually, and the results are sent back to them.
Language:Python31736
hubertsiuzdak/snac
Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate
Language:Python46026
cvg/depthsplat
DepthSplat: Connecting Gaussian Splatting and Depth
Language:Python61827
mit-han-lab/efficientvit
Efficient vision foundation models for high-resolution generation and perception.
Language: Python, Stars: 2.5k, Forks: 199
