Riaz123's Stars
apache/superset
Apache Superset is a Data Visualization and Data Exploration Platform
binhnguyennus/awesome-scalability
The Patterns of Scalable, Reliable, and Performant Large-Scale Systems
faif/python-patterns
A collection of design patterns/idioms in Python
tensorflow/tensor2tensor
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
pymc-devs/pymc
Bayesian Modeling and Probabilistic Programming in Python
google/trax
Trax — Deep Learning with Clear Code and Speed
carpedm20/DCGAN-tensorflow
A tensorflow implementation of "Deep Convolutional Generative Adversarial Networks"
intel-analytics/ipex-llm
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, GraphRAG, DeepSpeed, Axolotl, etc
probml/pml-book
"Probabilistic Machine Learning" - a book series by Kevin Murphy
hwalsuklee/tensorflow-generative-model-collections
Collection of generative models in Tensorflow
christianversloot/machine-learning-articles
🧠💬 Articles I wrote about machine learning, archived from MachineCurve.com.
chris-chris/ml-engineer-roadmap
WIP: Roadmap to becoming a machine learning engineer in 2020
uber/petastorm
Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.
aws-samples/aws-glue-samples
AWS Glue code samples
jakobrunge/tigramite
Tigramite is a python package for causal inference with a focus on time series data. The Tigramite documentation is at
neo4j/NaLLM
Repository for the NaLLM project
metabrainz/listenbrainz-server
Server for the ListenBrainz project, including the front-end (javascript/react) code that it serves and all of the data processing components that LB uses.
blue-yonder/turbodbc
Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with the Python Database API Specification 2.0.
ankonzoid/LearningX
Deep & Classical Reinforcement Learning + Machine Learning Examples in Python
abhishekrana/DeepFashion
Apparel detection using deep learning
Minyus/pipelinex
PipelineX: Python package to build ML pipelines for experimentation with Kedro, MLflow, and more
Frank-qlu/recruit
recruit 招聘爬虫+数据分析 1.爬虫: 采用Scrapy 分布式爬虫技术,使用mongodb作为数据存储,爬取的网站Demo为51job,数据我目前爬了有几千条 2.数据处理: 采用pandas对爬取的数据进行清洗和处理 2.数据分析: 采用flask后端获取mongodb数据,前端使用bootstrap3.echarts以及D3的词云图,如果喜欢请star or Fork,预览详见
chamkank/hone
Convert CSV to automatically nested JSON
oracle/analytical-sql-examples
NO LONGER MAINTAINED. Code samples for Oracle's analytical SQL features
MHaringa/insurancerating
R-package for actuarial pricing
bvanaken/explain-BERT-QA
Code for the CIKM 2019 Paper: How Does BERT Answer Questions? A Layer-Wise Analysis of Transformer Representations
harrystech/arthur-redshift-etl
ELT Code for your Data Warehouse
jamesbyars/apache-spark-etl-pipeline-example
Demonstration of using Apache Spark to build robust ETL pipelines while taking advantage of open source, general purpose cluster computing.
minerva-ml/steppy-toolkit
Curated set of transformers that make your work with steppy faster and more effective :telescope:
Riaz123/SparkALS_AWS_SageMaker
ALS based recommendation Engine Build on Apache spark & served on AWS Sagemaker