Rajshekhar-Reddy1's Stars
CLUEbenchmark/CLUE
中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
ieee8023/covid-chestxray-dataset
We are building an open database of COVID-19 cases with chest X-ray or CT images.
pytorch/text
Models, data loaders and abstractions for language processing, powered by PyTorch
wainshine/Chinese-Names-Corpus
中文人名语料库。人名生成器。中文姓名,姓氏,名字,称呼,日本人名,翻译人名,英文人名。可用于中文分词、人名实体识别。
tensorflow/datasets
TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
mdn/browser-compat-data
This repository contains compatibility data for Web technologies as displayed on MDN
googlecreativelab/quickdraw-dataset
Documentation on how to access and use the Quick, Draw! Dataset.
doccano/doccano
Open source annotation tool for machine learning practitioners.
NirantK/awesome-project-ideas
Curated list of Machine Learning, NLP, Vision, Recommender Systems Project Ideas
brightmart/nlp_chinese_corpus
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
HumanSignal/label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
zalandoresearch/fashion-mnist
A MNIST-like fashion product database. Benchmark :point_down:
joke2k/faker
Faker is a Python package that generates fake data for you.
sayaliwalke30/Kaggle-Projects
This repo contains 4 different projects. Built various machine learning models for Kaggle competitions. Also carried out Exploratory Data Analysis, Data Cleaning, Data Visualization, Data Munging, Feature Selection etc
automl/auto-sklearn
Automated Machine Learning with scikit-learn
rhiever/datacleaner
A Python tool that automatically cleans data sets and readies them for analysis.
rasbt/mlxtend
A library of extension and helper modules for Python's data analysis and machine learning libraries.
zomux/deepy
A highly extensible deep learning framework
cernopendata/opendata.cern.ch
Source code for the CERN Open Data portal
benbalter/congressional-districts
Historic and current US Congressional districts as GeoJSON, versioned within Git
GSA/data
Assorted data from the General Services Administration.
Chicago/food-inspections-evaluation
This repository contains the code to generate predictions of critical violations at food establishments in Chicago. It also contains the results of an evaluation of the effectiveness of those predictions.
FDA/openfda
openFDA is an FDA project to provide open APIs, raw data downloads, documentation and examples, and a developer community for an important collection of FDA public datasets.
uscensusbureau/citysdk
Convenient JavaScript utilities for working with Census APIs: Statistics, Cartographic GeoJSON, lat/lng -> FIPS, and other niceties (written in ClojureScript)
OpenExoplanetCatalogue/open_exoplanet_catalogue
The main data repository for the Open Exoplanet Catalogue
unitedstates/congress-legislators
Members of the United States Congress, 1789-Present, in YAML/JSON/CSV, as well as committees, presidents, and vice presidents.
openaddresses/openaddresses
A global repository of open address, building, and parcel data.
awesomedata/awesome-public-datasets
A topic-centric list of HQ open datasets.
Sven-Bo/portfolio-tracking-excel-python
Sven-Bo/python-word-automation