shamima19's Stars
shuhanmirza/Bengali-Poem-Dataset
🗄️ Stylometric Dataset of Bengali Poems
build-on-aws/llm-rag-vectordb-python
Explore sample applications and tutorials demonstrating the prowess of Amazon Bedrock with Python. Learn to integrate Bedrock with databases, use RAG techniques, and showcase experiments with langchain and streamlit.
RapidAI/RapidOCR
📄 Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVINO and PaddlePaddle.
aws-samples/sagemaker-studio-foundation-models
aws-samples/fsi-genai-bootcamp
BobMcDear/attorch
A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.
hyeonsangjeon/AWS-LLM-SageMaker
SageMaker Ployglot based RAG opensearch
aws-samples/generative-ai-cdk-constructs-samples
This repo provides sample generative AI stacks built atop the AWS Generative AI CDK Constructs.
stanford-oval/storm
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
aws-samples/foundation-model-benchmarking-tool
Foundation model benchmarking tool. Run any model on any AWS platform and benchmark for performance across instance type and serving stack options.
cvlab-stonybrook/DewarpNet
Code for the paper "DewarpNet: Single-Image Document Unwarping With Stacked 3D and 2D Regression Networks" (ICCV '19)
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
cs-chan/Total-Text-Dataset
Total Text Dataset. It consists of 1555 images with more than 3 different text orientations: Horizontal, Multi-Oriented, and Curved, one of a kind.
jasonppy/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
aws-samples/sagemaker-genai-hosting-examples
phamquiluan/jdeskew
ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation
RizhaoCai/Awesome-FAS
Paper collection of about the face anti-spoofing
machine-intelligence-laboratory/DDI-100
Distorted Document Images dataset (DDI-100).
ibm-aur-nlp/PubLayNet
xuebinqin/DIS
This is the repo for our new project Highly Accurate Dichotomous Image Segmentation
srush/annotated-mamba
Annotated version of the Mamba paper
Yuliang-Liu/MultimodalOCR
On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)
AlibabaResearch/AdvancedLiterateMachinery
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
clovaai/donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
google-deepmind/funsearch
federico-busato/Modern-CPP-Programming
Modern C++ Programming Course (C++03/11/14/17/20/23/26)
TalalWasim/Video-FocalNets
Official repository for "Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition" [ICCV 2023]
hhsinping/few_shot_fas