yrstartrain's Stars
lancedb/lance
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, with more integrations coming..
DAGWorks-Inc/hamilton
Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage and metadata. Runs and scales everywhere python does.
stitchfix/hamilton
A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton
flyteorg/flyte
Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.
cleanlab/cleanlab
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
sudy-super/AutoMATA
Tools to implement active inferring and pseudo-consciousness in LLM
CASIA-IVA-Lab/AnomalyGPT
[AAAI 2024 Oral] AnomalyGPT: Detecting Industrial Anomalies Using Large Vision-Language Models
Plachtaa/VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io
smly/mjai.app
Mahjong game simulator for RiichiLab https://mjai.app
spotipy-dev/spotipy
A light weight Python library for the Spotify Web API
deepchecks/deepchecks
Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling to thoroughly test your data and models from research to production.
aws-samples/aws-ml-jp
SageMakerで機械学習モデルを構築、学習、デプロイする方法が学べるNotebookと教材集
oreilly-japan/data-science-on-aws-jp
DequanWang/tent
ICLR21 Tent: Fully Test-Time Adaptation by Entropy Minimization
The-Japan-DataScientist-Society/100knocks-preprocess
データサイエンス100本ノック(構造化データ加工編)
Valkyrja3607/tuning_playbook_ja
ディープラーニングモデルの性能を体系的に最大化するためのプレイブック
f/awesome-chatgpt-prompts
This repo includes ChatGPT prompt curation to use ChatGPT better.
skillup-ai/tettei-engineer
徹底攻略ディープラーニングE資格エンジニア問題集
sony/nnabla-rl
Deep reinforcement learning library built on top of Neural Network Libraries
MubertAI/Mubert-Text-to-Music
A simple notebook demonstrating prompt-based music generation via Mubert API
Jeff-sjtu/HybrIK
Official code of "HybrIK: A Hybrid Analytical-Neural Inverse Kinematics Solution for 3D Human Pose and Shape Estimation", CVPR 2021
firmai/deltapy
DeltaPy - Tabular Data Augmentation (by @firmai)
JunlinHan/YOCO
Code for You Only Cut Once: Boosting Data Augmentation with a Single Cut, ICML 2022.
nvedant07/STIR
Implementation of ICML (Oral) 2022 paper "Measuring Representational Robustness of Neural Networks Through Shared Invariances"
mingyuan-zhang/MotionDiffuse
MotionDiffuse: Text-Driven Human Motion Generation with Diffusion Model