A simulacrum of the NHS Secure Data Environment, to facilitate education, exploration, external code development and testing.
NB: this is unofficial and is not endorsed by NHS England or the HDR UK / BHF Data Science Centre.
- 🧑‍💻 Environment: Google Colab based, for ease of use.
- 💿 Synthetic data
- 🏥 Hospital Episode Statistics (HES) Admitted Patient Care (APC): see the NHS Digital Artificial data pilot.
- Databricks Community Edition ❌
- Doesn't support Git.
- Google Colab ✅
- Supports Spark
- Poor GitHub integration: can open and save notebooks, but can't easily sync.
- Runtime doesn't persist: files and installed packages are lost when the session ends.
- GitHub Codespaces ❌
- Flexible but too complex for beginners
- https://aka.ms/configure-codespace
- https://github.com/education/codespaces-project-template-py
- Local ❌
- Way too complex for beginners!
- Environment: pip, conda, possibly poetry.
- Containers: Docker, devcontainer.
- https://github.com/jplane/pyspark-devcontainer
- Should be manageable in exactly the same way as Codespaces?
- Other HES datasets:
- HES OP (Outpatients)
- HES A&E (Accident and Emergency)
- HES CC (Critical Care; NB not part of Artificial Data Pilot)
- HES MAT (Maternity; NB not part of Artificial Data Pilot)
- Synthetic ONS Deaths.
- Synthetic GDPPR.
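
On the Google Colab option above: Colab can open a notebook directly from a public GitHub repository through a URL of the form `https://colab.research.google.com/github/<owner>/<repo>/blob/<branch>/<path>`. The sketch below builds such a link, which could serve as an "Open in Colab" entry point. The helper name `colab_url` and the repository and notebook names are hypothetical and used only for illustration. It does not address the sync limitation: edits made in Colab still have to be saved back to GitHub manually.

```python
from urllib.parse import quote

# Base of Colab's "open from GitHub" URL scheme.
COLAB_GITHUB_PREFIX = "https://colab.research.google.com/github"


def colab_url(owner: str, repo: str, path: str, branch: str = "main") -> str:
    """Return a URL that opens a notebook from a public GitHub repo in Colab.

    `colab_url` is a hypothetical helper, not part of any library.
    """
    # Percent-encode each component so spaces in names survive,
    # while keeping "/" separators inside the branch and file path.
    parts = [
        quote(owner, safe=""),
        quote(repo, safe=""),
        "blob",
        quote(branch, safe="/"),
        quote(path.lstrip("/"), safe="/"),
    ]
    return "/".join([COLAB_GITHUB_PREFIX] + parts)


# Hypothetical repository and notebook names, for illustration only.
url = colab_url("example-user", "sde-simulacrum", "notebooks/hes_apc_intro.ipynb")
print(url)
```

A link built this way can be placed in the README so that beginners reach a working notebook without any local setup.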