wyp1125
Consultant - Data science, Machine learning, Biostatistics and Python. Accepting new projects.
BDX Research & ConsultingHerndon, Virginia
Pinned Repositories
AntEpiSeeker
A C++ tool for detecting epistatic interactions for case-control studies using a two-stage ant colony optimization algorithm
aws-obesity-risk-prediction
aws-serverless-fintech
Programs for a serverless model based fintech app
comm_datasci_procedures
A toolkit for common data science procedures which can be applied to different domains such as marketing, fraud detection and e-commerce.
MCScanX
MCScanX: Multiple Collinearity Scan toolkit X version. The most popular synteny analysis tool in the world!
MCScanX-transposed
Source code for my Bioinformatics paper "MCScanX-transposed: detecting transposed gene duplications based on multiple colinearity scans".
NGS_Pipeline_Utilities
A python toolkit to facilitate development, regression testing and deployment of NGS pipelines
SAS-Biostat-Tools
SAS scripts for various biostatistics applications. A complete pipeline for analyzing microarray data using linear models is included.
SAS-Clinical-Trials-Toolkit
SAS scripts for clinical trials applications including generating SDTM domains, ADaM datasets, and Define.xml files
SeqEnhDL
Multiple machine learning and deep learning models for sequence-based enhancer prediction
wyp1125's Repositories
wyp1125/MCScanX
MCScanX: Multiple Collinearity Scan toolkit X version. The most popular synteny analysis tool in the world!
wyp1125/SAS-Clinical-Trials-Toolkit
SAS scripts for clinical trials applications including generating SDTM domains, ADaM datasets, and Define.xml files
wyp1125/MCScanX-transposed
Source code for my Bioinformatics paper "MCScanX-transposed: detecting transposed gene duplications based on multiple colinearity scans".
wyp1125/SAS-Biostat-Tools
SAS scripts for various biostatistics applications. A complete pipeline for analyzing microarray data using linear models is included.
wyp1125/AntEpiSeeker
A C++ tool for detecting epistatic interactions for case-control studies using a two-stage ant colony optimization algorithm
wyp1125/Biostatistics
R programs for biostatistics models and TLF
wyp1125/compute-grs
A small Perl script for computing genetic risk scores for obesity from a VCF file
wyp1125/DNN-Precision-Medicine
Deep neural network (DNN) models for a precision medicine problem using Python and TensorFlow.
wyp1125/PHP-Market-Data-Visualization
PHP codes to visualize realtime stock data via candlestick and RSI chart.
wyp1125/SeqEnhDL
Multiple machine learning and deep learning models for sequence-based enhancer prediction
wyp1125/AntEpiSeeker2
Extended AntEpiSeeker algorithm relating epistasis detection to biological pathway analysis using ant colony optimization
wyp1125/Sklearn-Bank-Marketing
Build multiple machine learning models (Nearest Neighbors, SVMs, Decision Tree, Random Forest, Naive Bayes, etc.) for bank marketing data using sklearn and pandas.
wyp1125/additive_epistasis
Collection of R, Python, Perl and Tensorflow programs to simulate and detect the additive epistasis genetic model
wyp1125/LSOSS
An R package for detection of cancer outlier differential gene expression
wyp1125/NGS_Pipeline_Utilities
A python toolkit to facilitate development, regression testing and deployment of NGS pipelines
wyp1125/aws-obesity-risk-prediction
wyp1125/aws-serverless-fintech
Programs for a serverless model based fintech app
wyp1125/comm_datasci_procedures
A toolkit for common data science procedures which can be applied to different domains such as marketing, fraud detection and e-commerce.
wyp1125/cCNN-Image-Classifier
Configurable Convolutional Neural Networks for Image Classification using Python and TensorFlow. Wrapper scripts are also provided.
wyp1125/emr-pyspark-life-styling-analysis
wyp1125/lung_cancer_risk
A full-stack deep learning solution for evaluating lung cancer risks
wyp1125/Machine-Learning-libSVM-LASSO
A compile of Perl and R scripts for applying machine learning techniques including SVM and LASSO
wyp1125/NGS-clinical-pipelines
A compile of Perl scripts and Linux commands for NGS clinical pipelines
wyp1125/PHP-MySQL-SNPxGE2
PHP code for my Bioinformatics paper "SNPxGE2: a database for human SNP-coexpression associations".
wyp1125/PySpark-MR-SVM-BioMed
PySpark scripts for applying MapReduce and SVM to high-throughput biomedical datasets.
wyp1125/serverless-ml-models
wyp1125/xSyn
xSyn: a software tool for identifying sophisticated three-way interactions from cancer expression data