huggingface-datasets
There are 88 repositories under huggingface-datasets topic.
grok-ai/nn-template
Generic template to bootstrap your PyTorch project.
xlang-ai/UnifiedSKG
[EMNLP 2022] Unifying and multi-tasking structured knowledge grounding with language models
AI-Northstar-Tech/vector-io
The only Vector tooling you'll need. Star the repo and look out for an email to try out a brand new Vector Data Exploration demo! Use the universal VDF format for vector datasets to easily export and import data from all vector databases, and re-embed it using any model
BirkhoffG/jax-dataloader
Pytorch-like dataloaders in JAX.
vTuanpham/Large_dataset_translator
Translate large dataset to any language with google translation api and multithreads processing, no key required!
onesuper/HuggingFace-Datasets-Text-Quality-Analysis
Retrieves parquet files from Hugging Face, identifies and quantifies junky data, duplication, contamination, and biased content in dataset using pandas
SmithaUpadhyaya/fashion_image_caption
Automate Fashion Image Captioning using BLIP-2. Automatic generating descriptions of clothes on shopping websites, which can help customers without fashion knowledge to better understand the features (attributes, style, functionality etc.) of the items and increase online sales by enticing more customers.
BUAADreamer/Chinese-LLaVA-Med
中文医学多模态大模型 Large Chinese Language-and-Vision Assistant for BioMedicine
xieincz/huggingface-go
huggingface-go : 加速下载 huggingface 的模型和数据集
daspartho/predict-subreddit
NLP model that predicts subreddit based on the title of a post
TirendazAcademy/Hugging-Face-Tutorials
Getting started with Hugging Face
raidionics/AeroPath
🫁 AeroPath: An airway segmentation benchmark dataset with challenging pathology
batmanscode/Talk2Book
Use AI to personify books, so that you can talk to them 🙊
mrcabbage972/simple-toolformer
A Python implementation of Toolformer using Huggingface Transformers
npuichigo/tarzan
High-level API for tar-based dataset
PRITHIVSAKTHIUR/EHRM-Models
EHRM [ Electronic Health Record Management ] introduces a centralized platform for analyzing patient records, offering insights into billing amounts, demographics, prevalent diagnoses, medical conditions, consulted doctors, admission types, and medication usage.
shunk031/huggingface-datasets_JGLUE
JGLUE: Japanese General Language Understanding Evaluation for huggingface datasets
daspartho/bored-ape-diffusion
diffusion model for unconditional image generation of Bored Apes
anujsahani01/English-Marathi-Translation
Fine-tuned and compared 3 🤗 pre-trained Multilingual LLMs
hearmeneigh/e621-rising-configs
Configuration files for building E621-Rising v3 SDXL model and dataset
aaaastark/Pretrain_Finetune_Transformers_Pytorch
Pre-Training and Fine-Tuning transformer models using PyTorch and the Hugging Face Transformers library. Whether you're delving into pre-training with custom datasets or fine-tuning for specific classification tasks, these notebooks offer explanations and code for implementation.
balnarendrasapa/road-detection
This is a course project for DSCI-6011 - Deep Learning. deals with Drivable Area and lane segmentation for self driving cars
BUAADreamer/MLLM-Finetuning-Demo
使用LLaMA-Factory微调多模态大语言模型的示例代码 Demo of Finetuning Multimodal LLM with LLaMA-Factory
shunk031/cookiecutter-huggingface-datasets
cookiecutter for huggingface datasets
sileod/metaeval
Collection of tasks for meta-learning and extreme multitask learning
wsobanski/scraper-tvp
Scraping large amount of articles for transformer training.
michelecafagna26/HL-dataset
[INLG2023] The High-Level (HL) dataset is a Vision and Language (V&L) resource aligning object-centric descriptions from COCO with high-level descriptions crowdsourced along 3 axes: scene, action, rationale.
shunk031/huggingface-datasets_COCOA
COCOA: Semantic Amodal Segmentation for huggingface datasets
shunk031/huggingface-datasets_wrime
WRIME for huggingface datasets
creative-graphic-design/huggingface-datasets_Magazine
Magazine dataset from Content-aware Generative Modeling of Graphic Design Layouts for huggingface datasets
creative-graphic-design/huggingface-datasets_Rico
Rico: A Mobile App Dataset for Building Data-Driven Design Applications for huggingface datasets
dnth/postgresql-multimodal-retrieval
Vector/Hybrid Search & Retrieval on PostgreSQL database using Vision Language Model.
ksgr5566/AutoTuneNLP
A comprehensive toolkit for seamless data generation and fine-tuning of NLP models, all conveniently packed into a single block.
shunk031/huggingface-datasets_MSCOCO
Microsoft COCO: Common Objects in Context for huggingface datasets
ItzCrazyKns/Dataset-Converter
A Python script for converting URL-based datasets into image datasets.
morikaglobal/finetune_bert_model
Fine-tuning pretrained BERT model for sentiment analysis (text classification)