Pinned Repositories
IllusionVQA
This repository contains the data and code of the paper titled "IllusionVQA: A Challenging Optical Illusion Dataset for Vision Language Models"
AsymmetricAttack
Official implementation of Asymmetric Bias in Text-to-Image Generation with Adversarial Attacks
awesome-vlm-inference-strategies
A curated list of inference strategies and algorithms that boost Vision Language Model (VLM) performance
BdSL40_Dataset_AI_for_Bangla_2.0_Honorable_Mention
Bangla Sign Language Dataset (BdSL40) comprises 611 videos covering 40 BdSL words, with 8 to 22 video clips per word.
cs221stanford
Follow-along for the course homework of Stanford CS221
DLSprint2022-Champion
Winning solution for DLSprint2022. The task was Automatic Speech Recognition on the Common Voice BN dataset. The model we used was wav2vec2.
EnsembleT5
A simple class override that allows ensembling T5 models from huggingface/transformers during inference. Works with trainer().
GPU-Puzzles
Solve puzzles. Learn CUDA.
reflection_removal_using_unet
Uses a U-Net to remove reflections from images. A pedagogical model, not intended to be competitive.
Bangla-Complex-Named-Entity-Recognition-Challenge
Winning Solution for the Bangla Complex Named Entity Recognition Challenge - BDOSN NLP Hackathon 2023
Patchwork53's Repositories
Patchwork53/DLSprint2022-Champion
Winning solution for DLSprint2022. The task was Automatic Speech Recognition on the Common Voice BN dataset. The model we used was wav2vec2.
Patchwork53/AsymmetricAttack
Official implementation of Asymmetric Bias in Text-to-Image Generation with Adversarial Attacks
Patchwork53/reflection_removal_using_unet
Uses a U-Net to remove reflections from images. A pedagogical model, not intended to be competitive.
Patchwork53/awesome-vlm-inference-strategies
A curated list of inference strategies and algorithms that boost Vision Language Model (VLM) performance
Patchwork53/BdSL40_Dataset_AI_for_Bangla_2.0_Honorable_Mention
Bangla Sign Language Dataset (BdSL40) comprises 611 videos covering 40 BdSL words, with 8 to 22 video clips per word.
Patchwork53/cs221stanford
Follow-along for the course homework of Stanford CS221
Patchwork53/GPU-Puzzles
Solve puzzles. Learn CUDA.
Patchwork53/AsymmetricBias
Patchwork53/BUET_CSE310_Compiler
A compiler built with flex and bison, including a symbol table, lexical analysis, syntax and semantic analysis, and assembly code generation for the Intel 8086
Patchwork53/EEEDAY-Dathathon2023-1st-Runners-Up
Bangla grammatical error detection with a T5 transformer
Patchwork53/EnsembleT5
A simple class override that allows ensembling T5 models from huggingface/transformers during inference. Works with trainer().
Patchwork53/BUET_CSE_3_2
Offlines, Onlines and Projects of Level 3 Term 2
Patchwork53/Challenging-Image-Generation-Prompts
Some challenging cases for image generation, tried across different image generation models
Patchwork53/CSE316_Microcontroller
CSE316_Term_Project
Patchwork53/CYB102-Prework-Starter
Patchwork53/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
Patchwork53/dspy
DSPy: The framework for programming—not prompting—foundation models
Patchwork53/GenshinQuickMafs
Patchwork53/llm-foundry
LLM training code for MosaicML foundation models
Patchwork53/makemore
An autoregressive character-level language model for making more things
Patchwork53/pace_2021_mu_solver
Efficient heuristic solver for the Cluster Editing problem
Patchwork53/Patchwork53
About Me
Patchwork53/Patchwork53.github.io
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Patchwork53/RedDot_Internship
Patchwork53/VIPCUP2023_OLIVES
This repository provides starter code and recommendations for starting the VIP CUP 2023 competition.
Patchwork53/VIPCUP2023_OLIVES_test
This repository provides starter code and recommendations for starting the VIP CUP 2023 competition.
Patchwork53/VL-T5
PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021)