datacuration

There are 9 repositories under the datacuration topic.

  • NVIDIA/NeMo-Curator

    Scalable data preprocessing and curation toolkit for LLMs

    Language: Jupyter Notebook
  • data-prep-kit/data-prep-kit

    Open source project for data preparation for GenAI applications

    Language: HTML
  • chapmanjacobd/library

    99+ CLI tools to build, browse, and blend your media library

    Language: Python
  • WDscholia/scholia

    Wikidata-based scholarly profiles

    Language: JavaScript
  • purvasingh96/Data-Collection-for-CarZam

    An image and data web scraper built to crawl the CarMax website and store relevant information for vehicle identification projects.

    Language: Python
  • benjaminocampo/DataCuration

    Exploration and data curation of a dataset from a Kaggle competition (https://www.kaggle.com/dansbecker/melbourne-housing-snapshot) covering properties sold in Melbourne in 2016 and 2017. The goal of this project is to prepare a well-structured matrix that can be used to run a model that estimates their prices.

    Language: Jupyter Notebook
  • GaloRomero/pepadbPosgreScript

    PostgreSQL code for archaeological data management

    Language: SQL
  • SudeepSinha09/IITM-Business_Data_Management_Project-Uttam_Supermarket

    This is a capstone project for the Business Analytics and Business Data Management course at IIT Madras. The project analyzes one year of sales data from Uttam Supermarket in Indore, which has 5 franchises. The analysis covers store-wise and monthly sales, the effect of holidays on sales, and weekly sales.

  • kosson/sva21

    This repo contains materials, datasets, and solutions that were used during the Astra Summer School, first edition, 2021