Medicines Discovery Catapult
Multidisciplinary expertise in Bioinformatics, Cheminformatics, Bioimaging informatics, Machine Learning, Natural Language Processing and Software Engineering
Pinned Repositories
aurac-web-plugin
Browser plugin that can find, highlight and link to interesting life science things in web pages and PDFs.
doclib-common
Common utilities and components used by the document library
image-model-runner
json-combiner
Utility to scan and parse files by format and convert them into json output that can be ingested by mongoimport, Currently supports parsing of json and xml formats
prefetch
A Scala based microservice that fetches and processes remote files. Part of the Document Library eco-system
py-mongo
Wrapper for pymongo to allow simple instantiation of MongoDB from configuration.
scala-queue
Use Scala & Pekko to send and receive typed messages with RabbitMQ
scala-utils
A handful of convenience utilities in Scala for things like hashing and concurrency
spreadsheet-to-text-converter
Part of the Document Library micro service ecosystem. Converts spreadsheets to text.
supervisor
Part of the Document Library micro service ecosystem. Co-ordinates all the consumer micro services.
Medicines Discovery Catapult's Repositories
mdcatapult/aurac-web-plugin
Browser plugin that can find, highlight and link to interesting life science things in web pages and PDFs.
mdcatapult/py-mongo
Wrapper for pymongo to allow simple instantiation of MongoDB from configuration.
mdcatapult/doclib-common
Common utilities and components used by the document library
mdcatapult/gitlab-ci-cd-versioning
Code versioning and other continuous integration helpers for Gitlab CI/CD pipelines
mdcatapult/openseadragon
MDC clone of openseadragon
mdcatapult/PDF-Converter
Webservice to convert PDFs to HTML
mdcatapult/prefetch
A Scala based microservice that fetches and processes remote files. Part of the Document Library eco-system
mdcatapult/py-postgres
Methods to simplify configuring and connecting to Postgresql databases
mdcatapult/py-queue
Module to abstract queues. Currently implements RabbitMQ.
mdcatapult/py-util
Utility methods to help work with lists & dicts
mdcatapult/scala-mongo
Scala wrapper for MongoDB
mdcatapult/scala-queue
Use Scala & Pekko to send and receive typed messages with RabbitMQ
mdcatapult/scala-utils
A handful of convenience utilities in Scala for things like hashing and concurrency
mdcatapult/spreadsheet-to-text-converter
Part of the Document Library micro service ecosystem. Converts spreadsheets to text.
mdcatapult/supervisor
Part of the Document Library micro service ecosystem. Co-ordinates all the consumer micro services.
mdcatapult/viawebgl
MDC clone of viawebgl
mdcatapult/ai-tool-experiments
Lots and lots of LLM agent and RAG demos using crew-ai, langchain & llama-index.
mdcatapult/bio-aurac
Coordinating repository allowing users to run the bio-aurac application.
mdcatapult/biobert
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
mdcatapult/COSHH-API
Go API complementing the COSHH UI web app
mdcatapult/COSHH-UI
Angular web app for tracking the locations and expiry dates of laboratory chemicals
mdcatapult/document-to-text-converter
Part of the Document Library micro service ecosystem. Converts PDFs and other document types to raw text. Uses Apache Tika.
mdcatapult/entity-recognition
Web service to recognise entities in html using a redis based dictionary
mdcatapult/go-astilectron-bundler
Bundle your Astilectron app with ease
mdcatapult/image-converter
Web service for cropping .ome.tiff files and combining .tiff files into a single pyramid .ome.tiff file
mdcatapult/pdffigures2
Given a scholarly PDF, extract figures, tables, captions, and section titles.
mdcatapult/py-config
Python lib that allows environment variables to override hocon style yaml or json config
mdcatapult/regex-entity-resolver
Scala web service for identifying terms within text and resolving them to a value
mdcatapult/scala-source-reader
Utility methods to read text from input streams like zip files. Uses Apache Tika
mdcatapult/unarchiver
Part of the Document Library micro service ecosystem. Uncompresses files that have been archived using zip, tar, gzip, 7zip & more