User-friendly implementation and extension of common data streaming applications using Apache Kafka, written in Python
Available on GitHub at https://github.com/openmsi/openmsipython
Developed for Open MSI (NSF DMREF award #1921959)
Programs use the Python implementation of the Apache Kafka API, and are designed to run on Windows machines connected to laboratory instruments. The only base requirements are Python >=3.7, git, and pip.
The quickest way to get started is to use Miniconda3. Miniconda3 installers can be downloaded here, and installation instructions can be found here.
With Miniconda installed, next create and switch to a new environment for Open MSI. In a terminal window (or Anaconda Prompt in admin mode on Windows) type:
conda create -n openmsi python=3
conda activate openmsi
This environment needs a special variable set to allow the Kafka Python code to find its dependencies on Windows (see here for more details), so after you've done the above, type the following commands to set the variable and then refresh the environment:
conda env config vars set CONDA_DLL_SEARCH_MODIFICATION_ENABLE=1
conda deactivate #this command will give a warning, that's normal
conda activate openmsi
You'll need to use that second "activate" command every time you open a Terminal window or Anaconda Prompt to switch to the openmsi environment.
Miniconda installs pip, and if you need to install Git you can do so with
conda install -c anaconda git
(or use the instructions on the website here.)
While in the openmsi environment, navigate to wherever you'd like to store this code, and type:
git clone https://github.com/openmsi/openmsipython.git
cd openmsipython
pip install .
cd ..
This will give you access to all of the console commands discussed below, as well as any of the other modules in the openmsipython package. If you'd like to be able to make changes to the openmsipython code without reinstalling, you can include the --editable flag in the pip install command.
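For example, the editable version of the install sequence above would be:
git clone https://github.com/openmsi/openmsipython.git
cd openmsipython
pip install --editable .
cd ..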
If you like, you can check that the installation worked with:
python
>>> import openmsipython
And if that line runs without any problems then the package was installed correctly.
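You can also ask pip itself for the installed package's metadata:
pip show openmsipython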
Installing the code provides access to several programs that share a basic scheme for user interaction. These programs all have the following attributes:
- Their names correspond to the names of Python classes within the code base
- They can be run from the command line by typing their names
  - i.e. they are provided as "console script entry points"; see the sketch after this list
  - check the relevant section of the setup.py file for a list of all that are available
- They provide helpful logging output when run, and the most relevant of those logging messages are written to files called "[ClassName].log" in the directories relevant to the running programs
- They can be installed as Windows Services instead of being run from the bare command line
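For orientation, a "console script entry point" is declared in setup.py roughly as in the sketch below; the command and module names here are illustrative placeholders, not the package's actual list (setup.py itself is the authoritative source):

# sketch of a console-script declaration in setup.py;
# "MyProgram" and its module path are hypothetical examples
from setuptools import setup, find_packages

setup(
    name="openmsipython",
    packages=find_packages(),
    entry_points={
        "console_scripts": [
            "MyProgram=openmsipython.my_module:main",
        ],
    },
)

With a declaration like this in place, pip install . puts a MyProgram executable on the PATH that calls the main function in openmsipython/my_module.py.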
The documentation for specific programs can be found in a few locations within the repo.
The readme file here explains the programs used to upload entire arbitrary files by breaking them into chunks and producing those chunks as messages to a Kafka topic, or to download entire files by reading those messages back from the topic and writing the data to disk.
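To make the chunking scheme concrete, here is a minimal sketch of the upload direction, written against the confluent_kafka client; it is illustrative only (the broker address, topic name, chunk size, and keying scheme are all made up here), not the package's actual implementation:

from confluent_kafka import Producer

CHUNK_SIZE = 16384  # bytes per message (illustrative value)
producer = Producer({"bootstrap.servers": "localhost:9092"})  # example broker
with open("my_data_file.bin", "rb") as fp:  # hypothetical file
    index = 0
    while True:
        chunk = fp.read(CHUNK_SIZE)
        if not chunk:
            break
        # the key records which file, and which piece of it, each message holds,
        # so a consumer can reassemble the chunks in order on the other side
        producer.produce("my_topic", key=f"my_data_file.bin_chunk_{index}", value=chunk)
        index += 1
producer.flush()  # block until all queued messages have been delivered

The download direction is the mirror image: a consumer reads the messages back, groups them by key, and writes the reassembled bytes to disk.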
The readme file here explains programs used to upload specific portions of data in Lecroy Oscilloscope files and produce sheets of plots for PDV spall or velocity analyses.
The readme file here gives more details about options for the configuration files used to define which Kafka cluster(s) the programs interact with, and how data are produced to/consumed from the topics within them.
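As a rough illustration of what such a configuration controls, connecting to a secured cluster comes down to a handful of key-value options like the following (a hypothetical example using standard librdkafka option names; see that readme for the actual file format and the full set of recognized options):

# hypothetical connection options for a SASL-secured cluster
cluster_config = {
    "bootstrap.servers": "my-cluster.example.com:9092",
    "security.protocol": "SASL_SSL",
    "sasl.mechanism": "PLAIN",
    "sasl.username": "MY_USERNAME",
    "sasl.password": "MY_PASSWORD",
}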
The readme file here details procedures for installing any available command-line program as a Windows Service and working with it.
The readme file here describes the automatic testing and CI/CD setup for the project, including how to run tests interactively and add additional tests.
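Assuming the tests follow the standard unittest discovery conventions (an assumption on our part; that readme has the authoritative instructions), running the suite interactively from the repository root would look something like:
python -m unittest discover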
The following items are currently planned to be implemented ASAP:
- Adding a safer and more graceful shutdown when stopping Services so that no external lag time needs to be considered
- Allowing watching directories where large files are in the process of being created/saved instead of just directories where fully-created files are being added
- Implementing other data types and serialization schemas, likely using Avro
- Create PyPI and conda installations. The PyPI method using twine is described here: https://github.com/bast/pypi-howto. Putting the package on conda-forge is a heavier lift; we need to decide whether it's worth it, and it probably isn't for such an immature package.
- Re-implement PDV plots from a submodule

Open questions include:
- What are best practices for topic creation and naming? Should we have a new topic for each student, for each instrument, for each “kind” of data, ...?
- Would it be possible to have an environment and dependency definition? YAML??
- How do I know (and trust!) my data made it and is safe?
- What if I forget and write my data to some “wrong” place? What if I write my data to the directory twice?
- Should I clear my data out of the streaming directory once it’s been produced to Kafka?