data-harmonization
There are 25 repositories under data-harmonization topic.
pha4ge/hAMRonization
Parse multiple Antimicrobial Resistance Analysis Reports into a common data structure
harmonydata/harmony
The Harmony Python library: a research tool for psychologists to harmonise data and questionnaire items. Open source.
NPLinker/nplinker
A python framework for microbial natural products data mining by integrating genomics and metabolomics data
VIDA-NYU/bdi-kit
A Python toolkit for biomedical data integration and harmonization
CoAxLab/pycombat
Python implementation of Combat for data harmonisation, allowing also to remove unwanted effects
cidgoh/pathogen-genomics-package
This is the DataHarmonizer spreadsheet web application bundled with pathogen genomics data entry and validation templates
datasnack/datahub
Self-hostable, open-source engine for reproducible data harmonization, dataset building & exploration
ncsuSEAL/McGregor-et-al-2024
Code and sample data for McGregor et al., 2024
SCAI-BIO/datastew
Python library for intelligent data stewardship using Large Language Model (LLM) embeddings
SCAI-BIO/index
Intelligent data steward toolbox using Large Language Model embeddings for automated Data-Harmonization
SCAI-BIO/tsnepad
AD & PD cohort variable distributions
harmonydata/harmony_examples
Example Jupyter notebook and R scripts using Harmony in real research problems
harmonydata/harmonyapi
This is the source code for the Harmony project REST API
maelstrom-research/ipaq
International Physical Activity Questionnaire (IPAQ) variables
lodi-m/piccard
Visualizing demographic evolution using geographically inconsistent census data
harmonydata/harmonydata.github.io
Blog for NLP data harmonisation project Harmony, open source solution using Python for psychologists
NajiaAhmadi/VisualisationWithPython
Graphics for the article "Methods used in the development of Common Data Models for health data – a Scoping Review"
syedmfuad/geospatial_misc
Miscellaneous codes for harmonizing agricultural output and other agri-related data raster files and shapefiles. Extracts from raster files the grid-cell data by shapefile boundary.
dfornika/amrhike
Proof-of-concept for storing and querying harmonized AMR Genomic Analysis Results in datahike
jcaperella29/clinical-text-mining_R_SCRIPT
A lightweight R script for text mining and harmonizing medical phenotype data. Cleans, standardizes, and maps diagnoses to ICD-10 codes, with clinical annotations for enhanced data usability.
syedmfuad/spatial_miscellaneous
Miscellaneous codes for spatial mapping and raster data manipulation