Perform numpy
-like analysis on data
that remains in someone else's
server
Quickstart
-
Install our handy 🛵 cli tool which makes deploying a Domain or Network server a one-liner:
pip install -U hagrid
-
Then run our interactive jupyter Install 🧙🏽♂️ WizardBETA:
hagrid quickstart
-
In the tutorial you will learn how to install and deploy:
PySyft
= ournumpy
-like 🐍 Python library for computing onprivate data
in someone else'sDomain
PyGrid
= our 🐳docker
/k8s
/ 🐧vm
Domain
&Network
Servers whereprivate data
lives -
During quickstart we will deploy
PyGrid
to localhost with 🐳docker
, however 🛵 HAGrid can deploy tok8s
or a 🐧ubuntu
VM onazure
/gcp
/ANY_IP_ADDRESS
by using 🔨ansible
†
Install Notes
- HAGrid Requires: 🐍
python
🐙git
- Run:pip install -U hagrid
- Interactive Install 🧙🏽♂️ WizardBETA Requires 🛵
hagrid
: - Run:hagrid quickstart
†Windows
does not supportansible
, preventing some remote deployment targets - PySyft Requires: 🐍
python 3.7+
- Run:pip install -U syft --pre
*macOS
Apple Silicon users need cmake:brew install cmake
‡Windows
users must run this first:pip install jaxlib==0.3.14 -f https://whls.blob.core.windows.net/unstable/index.html
- PyGrid Requires: 🐳
docker
/k8s
or 🐧ubuntu
VM - Run:hagrid launch ...
Versions
0.7.0 beta
- dev
branch 👈🏽
0.6.0
- Course 3
0.5.1
- Course 2 + M1 Hotfix
0.2.0
- 0.5.0
Deprecated
PySyft and PyGrid use the same version
and its best to match them up where possible. We release weekly betas which can be used in each context:
PySyft: pip install -U syft --pre
PyGrid: hagrid launch ... tag=latest
HAGrid is a cli / deployment tool so the latest version of hagrid
is usually the best.
What is Syft?
Syft
is OpenMined's open source
stack that provides secure
and private
Data Science in Python. Syft decouples private data
from model training, using techniques like Federated Learning, Differential Privacy, and Encrypted Computation. This is done with a numpy
-like interface and integration with Deep Learning
frameworks, so that you as a Data Scientist
can maintain your current workflow while using these new privacy-enhancing techniques
.
Why should I use Syft?
Syft
allows a Data Scientist
to ask questions
about a dataset
and, within privacy limits
set by the data owner
, get answers
to those questions
, all without obtaining a copy
of the data itself. We call this process Remote Data Science
. It means in a wide variety of domains
across society, the current risks
of sharing information (copying
data) with someone such as, privacy invasion, IP theft and blackmail will no longer prevent the vast benefits
such as innovation, insights and scientific discovery which secure access will provide.
No more cold calls to get access
to a dataset. No more weeks of wait times
to get a result
on your query
. It also means 1000x more data
in every domain. PySyft opens the doors to a streamlined Data Scientist workflow
, all with the individual's privacy
at its heart.
Tutorials
|
|
|
Terminology
Community
Courses
Contributors
OpenMined and Syft appreciates all contributors, if you would like to fix a bug or suggest a new feature, please see our guidelines.
Supporters
Open Collective
OpenMined
is a registered 501(c)(3)
in the USA. We are funded by our gracious supporters on Open Collective.
Disclaimer
Syft is under active development and is not yet ready for pilots on private data without our assistance. As early access participants, please contact us via Slack or email if you would like to ask a question or have a use case that you would like to discuss.
License
Apache License 2.0
Person icons created by Freepik - Flaticon