ck37
Psychiatry faculty; biostatistics PhD. {Targeted, deep, machine} learning, NLP, IRT, computer vision, exposure mixtures, EHRs.
Harvard Medical School, Mass General Hospital
Boston, MA
ck37's Stars
nomic-ai/gpt4all
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles: Latest Advances on Multimodal Large Language Models
ydataai/ydata-profiling
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
openai/tiktoken
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
autogluon/autogluon
Fast and Accurate ML in 3 Lines of Code
meta-llama/llama-models
Utilities intended for use with Llama models.
geopy/geopy
Geocoding library for Python.
posit-dev/positron
Positron, a next-generation data science IDE
ddsjoberg/gtsummary
Presentation-Ready Data Summary and Analytic Result Tables
ModelOriented/DrWhy
DrWhy is the collection of tools for eXplainable AI (XAI). It's based on shared principles and simple grammar for exploration, explanation and visualisation of predictive models.
linkedin/FastTreeSHAP
Fast SHAP value computation for interpreting tree-based models
suinleelab/treeexplainer-study
Code and documentation for experiments in the TreeExplainer paper
ThomasBury/arfs
All Relevant Feature Selection
ModelOriented/DALEXtra
Extensions for the DALEX package
kozodoi/fairness
R package for computing and visualizing fair ML metrics
ModelOriented/drifter
Concept Drift and Concept Shift Detection for Predictive Models
IyarLin/survXgboost
A small wrapper package that enables full survival curve estimation using xgboost
simonpcouch/syrup
Measure Memory and CPU Usage of R Code
wlandau/posit2024
posit::conf(2024) presentation about {mirai} and {crew}
canagnos/mcp
Tools for Measuring Classification Performance for R, Python and Spark
alexluedtke12/pydimple
OHDSI/ParallelLogger
An R package for easy parallel computing, logging, and function call automation.
seonhee99/EHR-SeqSQL
Official repository of "EHR-SeqSQL : A Sequential Text-to-SQL Dataset For Interactively Exploring Electronic Health Records" (ACL 2024 Findings)
PheWAS/PhecodeX
OHDSI/RiskStratifiedEstimation
An R package for evaluating treatment effect heterogeneity using a risk-based approach.
canagnos/hmeasure
Measuring Classification Performance: the hmeasure package for R
cran/CalibratR
:exclamation: This is a read-only mirror of the CRAN R package repository. CalibratR — Mapping ML Scores to Calibrated Predictions
nt-williams/covid-RCT-covar
Simulation study evaluating what variables should be included as covariates in analyses for COVID-19 RCTs
bbj-lab/PD_Progression
jmendelson256/samplingNR
R package for stratified sample allocation under anticipated nonresponse