datacuration
There are 9 repositories under datacuration topic.
NVIDIA/NeMo-Curator
Scalable data pre processing and curation toolkit for LLMs
data-prep-kit/data-prep-kit
Open source project for data preparation for GenAI applications
chapmanjacobd/library
99+ CLI tools to build, browse, and blend your media library
WDscholia/scholia
Wikidata-based scholarly profiles
purvasingh96/Data-Collection-for-CarZam
An image + data web scraper build to crawl the CarMax website and store relevant information for vehicle identification projects.
benjaminocampo/DataCuration
Exploration and data curation of a dataset given by a Kaggle competition (https://www.kaggle.com/dansbecker/melbourne-housing-snapshot) related to properties that were sold in Melbourne in 2016 and 2017. The meaning of this project is to prepare a well-structured matrix, so it can be used to run a model in order to estimate their prices.
GaloRomero/pepadbPosgreScript
PostgreSQL code for archaeological data management
SudeepSinha09/IITM-Business_Data_Management_Project-Uttam_Supermarket
This is a capstone project for the course of Business Analytics and Business Data Management at IIT Madras. The project involves analyzing sales data of Uttam Supermarket in Indore, which has 5 franchises, collected over a year. The analysis includes store-wise and monthly sales, the effect of holidays on sales, and weekly sales analysis.
kosson/sva21
Acest repo conține materiale, seturi de date și soluții care au fost folosite în cadrul Școlii de vară Astra, prima ediție, 2021