baris-yazici
Interested in ethical, efficient, data-centric machine learning.
Lafayette College / London School of EconomicsLondon, UK
baris-yazici's Stars
neuml/txtai
đź’ˇ All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
NirDiamant/RAG_Techniques
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and contextually rich responses.
langflow-ai/langflow
Langflow is a low-code app builder for RAG and multi-agent AI applications. It’s Python-based and agnostic to any model, API, or database.
PeiJieSun/diffnet
Graph Neural Network based Social Recommendation Model. SIGIR2019.
WLiK/LLM4Rec-Awesome-Papers
A list of awesome papers and resources of recommender system on large language model (LLM).
Lancelot39/Causal-Copilot
microsoft/causica
eshnich/transformers-learn-causal-structure
PAIR-code/what-if-tool
Source code/webpage/demos for the What-If Tool
ustunb/actionable-recourse
python tools to check recourse in linear classification
Piyushi-0/ACE
Code for our ICML '19 paper: Neural Network Attributions: A Causal Perspective.
FenTechSolutions/CausalDiscoveryToolbox
Package for causal inference in graphs and in the pairwise settings. Tools for graph structure recovery and dependencies are included.
eqasim-org/ile-de-france
An open synthetic population of ĂŽle-de-France for agent-based transport simulation
propublica/compas-analysis
Data and analysis for 'Machine Bias'
fairlearn/fairlearn
A Python package to assess and improve fairness of machine learning models.
vanderschaarlab/mlforhealthlabpub
Machine Learning and Artificial Intelligence for Medicine.
vanderschaarlab/Data-IQ
Data-IQ: Characterizing subgroups with heterogeneous outcomes in tabular data
vanderschaarlab/hyperimpute
A framework for prototyping and benchmarking imputation methods
vanderschaarlab/synthcity
A library for generating and evaluating synthetic tabular data for privacy, fairness and data augmentation.
vanderschaarlab/autoprognosis
A system for automating the design of predictive modeling pipelines tailored for clinical prognosis.
vanderschaarlab/Interpretability
Resources for Machine Learning Explainability
shadcn-ui/ui
Beautifully designed components that you can copy and paste into your apps. Accessible. Customizable. Open Source.
awslabs/deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
awslabs/python-deequ
Python API for Deequ
kuleshov/cornell-cs5785-2024-applied-ml
Lecture materials for Cornell CS5785 Applied Machine Learning (Fall 2024)
apache/superset
Apache Superset is a Data Visualization and Data Exploration Platform
databricks/LearningSparkV2
This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]
stanford-futuredata/ColBERT
ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)
stanfordnlp/dspy
DSPy: The framework for programming—not prompting—language models
SakanaAI/AI-Scientist
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬