Pinned Repositories
Data-Reduction-and-Octree-based-Clustering-of-Ligand-Conformations-in-Hadoop
This repository contains a linear clustering approach for large datasets of molecular geometries produced by high-throughput molecular dynamics simulations (e.g., protein folding and protein-ligand docking simulations), as described in our papers in Comp. Biol. Med. 2012 and at the HPCC 2012 conference. The clustering is adapted for MapReduce and implemented in Hadoop.
DockingAtHome-Website
Source code for the Docking@Home BOINC project web page.
Mimir
Mimir is a new implementation of MapReduce over MPI. Mimir inherits the core principles of existing MapReduce frameworks, such as MR-MPI, while redesigning the execution model to incorporate a number of sophisticated optimization techniques that achieve similar or better performance with significant reduction in the amount of memory used.
NASMo-TiAM
Workflow for Generating North America Soil Moisture at 250m Dataset Derived From Time-specific Adaptable Machine Learning Models
NHANES-Analytics
This repository contains code for analysis of the NHANES dataset. Specifically, it contains code that examines the unique food items in the NHANES dietary data. The food items are clustered into new food groups based on nutrient similarities. These food groups are the result of a data-driven approach to developing food groups for use in dietary analysis studies.
QCN-Explorer
QCN Explorer is an education-focused web interface for QCN-sim. Users can create earthquake scenarios and visualize the effects.
Reproducibility_EHT
SOMOSPIE
SOMOSPIE (Soil Moisture Spatial Inference Engine) consists of a Jupyter Notebook and a suite of machine learning methods to process inputs of available coarse-grained soil moisture data at its native spatial resolution. Features include the selection of a geographic region of interest, prediction of missing values across the entire region of interest (i.e., gap-filling), analysis of generated fine-grained predictions, and visualization of both predictions and analyses.
Src_FDS
Weather_Data_Analytics
This repository contains the MATLAB software for the frequency-based analysis framework developed for our paper at the eScience 2015 conference. The framework is an adaptation of a cluster tool previously proposed to predict idle resources in non-dedicated clusters. The framework employs the empirical cumulative distribution function (ECDF) to benchmark and model occurrences of extreme climate events, specifically extreme temperature and precipitation. The framework is broken into two phases: a learning phase and a prediction phase. The learning phase uses ECDF-based analysis to generate modeling and forecasting windows. The prediction phase applies the modeling window to the most recent weather data to estimate the likelihood that given proportions of the region can experience extreme temperature and precipitation events. This repository also contains a sample input in the INPUT folder to test the functionality of the framework.
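The ECDF that the Weather_Data_Analytics description relies on can be illustrated with a minimal sketch. This is Python rather than the repository's MATLAB, and it is not the repository's code: the function names `ecdf` and `exceedance_probability`, the sample temperatures, and the 34.0 degree threshold are all hypothetical, chosen only to show how an empirical distribution yields an estimate of how often a value exceeds a threshold.

```python
from bisect import bisect_right


def ecdf(samples):
    """Return a function F where F(x) is the fraction of samples <= x."""
    ordered = sorted(samples)
    n = len(ordered)
    if n == 0:
        raise ValueError("ecdf needs at least one sample")

    def F(x):
        # bisect_right counts how many sorted samples are <= x.
        return bisect_right(ordered, x) / n

    return F


def exceedance_probability(samples, threshold):
    """Empirical probability that an observation exceeds the threshold."""
    return 1.0 - ecdf(samples)(threshold)


# Hypothetical daily maximum temperatures (degrees C) for one location.
# These values are made up for illustration, not taken from the repository.
temps = [28.0, 31.5, 30.2, 35.1, 29.4, 36.8, 33.0, 27.9, 34.2, 30.0]

F = ecdf(temps)
# Assumed "extreme" threshold of 34.0 degrees C (an illustrative choice).
p_extreme = exceedance_probability(temps, 34.0)
print(f"F(30.0) = {F(30.0):.2f}, P(T > 34.0) = {p_extreme:.2f}")
```

In this toy setting, the ECDF built from past observations plays the role of the learned model, and the exceedance probability is the kind of quantity a prediction step could report. The actual modeling and forecasting windows used by the framework are not reproduced here.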
Global Computing Lab's Repositories
TauferLab/SOMOSPIE
SOMOSPIE (Soil Moisture Spatial Inference Engine) consists of a Jupyter Notebook and a suite of machine learning methods to process inputs of available coarse-grained soil moisture data at its native spatial resolution. Features include the selection of a geographic region of interest, prediction of missing values across the entire region of interest (i.e., gap-filling), analysis of generated fine-grained predictions, and visualization of both predictions and analyses.
TauferLab/Reproducibility_EHT
TauferLab/NASMo-TiAM
Workflow for Generating North America Soil Moisture at 250m Dataset Derived From Time-specific Adaptable Machine Learning Models
TauferLab/ANACIN-X
This project advances the reproducibility study of HPC applications by proposing an open-source modular framework for automatic measurement, analysis, and visualization of non-determinism and root causes of non-determinism in MPI applications.
TauferLab/DockingAtHome-Website
Source code for the Docking@Home BOINC project web page.
TauferLab/hatchet
Tree- or Graph-indexed Pandas DataFrames for analyzing performance data
TauferLab/Reproducibility_A4NN_ICPP23
TauferLab/tauferlab.github.io
TauferLab/benchpark
An open collaborative repository for reproducible specifications of HPC benchmarks and cross-site benchmarking environments
TauferLab/CSMPI
TauferLab/dspaces
Margo Based DataSpaces
TauferLab/dumpi_to_graph
TauferLab/dyad
DYAD: DYnamic and Asynchronous Data Streamliner
TauferLab/flux-core
core services for the Flux resource management framework
TauferLab/flux-docs
Documentation for the Flux-Framework
TauferLab/flux-framework-tutorials
Tutorial slides and materials
TauferLab/flux-radiuss-tutorial-2023
Files for the Flux RADIUSS Tutorials
TauferLab/GEOtiled
TauferLab/hatchet-tutorial
TauferLab/llnl-hatchet
Graph-indexed Pandas DataFrames for analyzing hierarchical performance data
TauferLab/MTCheck
Merkle Tree-based Checkpointing
TauferLab/PerfFlowAspect
An Aspect Oriented Programming (AOP)-based tool to analyze cross-cutting performance concerns of composite science workflows.
TauferLab/Pluto
TauferLab/Reproducibility_ICPP23_ORANGES
TauferLab/Reproducibility_ICPP23_Scalable_GPU_Deduplication
TauferLab/Reproducibility_Scalar_GPU_Dedup_ICPP23_Results
TauferLab/Src_DYAD_UCX_Perftest
TauferLab/thicket
TauferLab/thicket-tutorial
TauferLab/XPSI
Framework for identifying protein structural properties from diffraction patterns.