mohamedyd
I serve as a Senior Research Scientist and Project Leader at Software AG located in Darmstadt, Germany. My research focuses on democratizing data quality tools.
Software AGGermany
Pinned Repositories
active-learning
AI-ML-Driven-Companies-In-Egypt
A list of AI/ML driven companies in Egypt
ai-research-data-valuation-repository
This repository hosts the Data Valuation framework designed for Iterative Self-Learning purposes.
augur-code
Augur is a toolset that helps simulate and detect drift in different types of datasets, to define the best metrics that can be used to predict drift before it happens.
AutoCure
Source code of the paper AutoCure: Automated Tabular Data Curation for ML Pipelines
awesome_resources
A collection of resources in different domains
Energy_efficient-WSNs
Time-series prediction for energy-efficient wireless sensor networks: Case study of detectinng plants' diseases in greenhouses
rein-benchmark
A comprehensive benchmark for data cleaning methods and their impact of ML models
RTClean
SAGED
Source code of the paper: SAGED: Few-Shot Meta Learning for Tabular Data Error Detection
mohamedyd's Repositories
mohamedyd/awesome_resources
A collection of resources in different domains
mohamedyd/rein-benchmark
A comprehensive benchmark for data cleaning methods and their impact of ML models
mohamedyd/Energy_efficient-WSNs
Time-series prediction for energy-efficient wireless sensor networks: Case study of detectinng plants' diseases in greenhouses
mohamedyd/SAGED
Source code of the paper: SAGED: Few-Shot Meta Learning for Tabular Data Error Detection
mohamedyd/AI-ML-Driven-Companies-In-Egypt
A list of AI/ML driven companies in Egypt
mohamedyd/augur-code
Augur is a toolset that helps simulate and detect drift in different types of datasets, to define the best metrics that can be used to predict drift before it happens.
mohamedyd/AutoCure
Source code of the paper AutoCure: Automated Tabular Data Curation for ML Pipelines
mohamedyd/RTClean
mohamedyd/awesome-production-machine-learning
A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning
mohamedyd/Code-AI-Tree-Search
mohamedyd/D3Bench
Source code of the benchmark of open-source drift detection tools
mohamedyd/data-centric-ai
Resources for Data Centric AI
mohamedyd/data-valuation
mohamedyd/data_cleaning_with_latent_operators
Code repository for the LOP paper on data cleaning.
mohamedyd/DatasetCondensation
Dataset Condensation (ICLR21 and ICML21)
mohamedyd/DistributionalShapley
Distributional Shapley: A Distributional Framework for Data Valuation
mohamedyd/egyptians-in-ai
A website dedicated to showcasing the profiles of prominent Egyptian researchers in the field of AI.
mohamedyd/Failed-ML
Compilation of high-profile real-world examples of failed machine learning projects
mohamedyd/google-research
Google Research
mohamedyd/Haipipe
mohamedyd/LLM4GCode
mohamedyd/mohamedyd.github.io
mohamedyd/opendataval
OpenDataVal: a Unified Benchmark for Data Valuation in Python (NeurIPS 2023)
mohamedyd/ReClean
Source code of the paper "ReClean: Reinforcement Learning for Automated Data Cleaning in ML Pipelines"
mohamedyd/robocon_egypt_2008
Design and implementation of two mobile robots and a manual robot
mohamedyd/sato
Code and data for Sato https://arxiv.org/abs/1911.06311.
mohamedyd/Scalable-Data-Valuation-Health-Care-Shapley-Value
mohamedyd/SEED
SEED: Domain-Specific Data Curation With Large Language Models
mohamedyd/sherlock-project
This repository provides data and scripts to use Sherlock, a neural-network based model to detect semantic data types. https://sherlock.media.mit.edu
mohamedyd/thin-ML-deployment
Template for quickly deploying any machine learning model to an endpoint