DGRPool is a web application designed to serve as a comprehensive repository for Drosophila Genetic Reference Panel (DGRP) phenotyping datasets. It not only provides a centralized hub for accessing existing literature on DGRP phenotyping data but also offers tools for conducting basic systems genetics-inspired analyses. The primary aim of DGRPool is to make DGRP phenotyping data findable, accessible, interoperable, and reusable (FAIR) to facilitate research in the field of Drosophila genetics.
- Dataset Repository: DGRPool acts as a repository for DGRP phenotyping datasets, indexed with relevant keywords and categorized for easy search and access.
- Community Curated: Users can become curators, contributing to the maintenance and curation of the datasets, ensuring accuracy and relevance.
- Interactive Data Analysis: Users can perform various analyses including phenotype correlation, Genome-Wide Association Studies (GWAS), and Phenome-Wide Association Studies (PheWAS) directly on the platform.
- User-Submitted Phenotypes: Researchers can upload their own phenotypic data to the platform for correlation analyses and GWAS, expanding the utility of DGRPool.
- Proof-of-Concept Studies: DGRPool showcases its potential through proof-of-concept studies, providing insights into biological discoveries facilitated by the platform.
Users can explore the repository by searching for keywords or browsing through curated categories such as "ageing", "metabolism", or "olfactory". Each study is meticulously curated to ensure data accuracy and relevance.
DGRPool encourages community participation by allowing users to become curators. Curators help maintain the platform by formatting and validating submitted datasets, ensuring the quality of the data.
Users can perform various analyses including phenotype correlation and GWAS. Pre-calculated GWAS analyses are available for browsing, and users can also upload their own phenotypic data for analysis.
DGRPool provides proof-of-concept studies to showcase its potential in facilitating biological discoveries. These studies highlight associations between phenotypes, providing valuable insights for further research.
To set up the development server for DGRPool, follow these steps:
-
Obtain the Database Dump: Get the dump file from the database (
pg_dump dgrpool >dgrpool.dump
) and place it in thestartdb
folder. If the folder doesn't exist, create one. -
Create Data Folder: Ensure there is a
/data
folder in the root directory of the project. If it doesn't exist, create one. -
Copy Files: Copy all files present in
/data/dgrpool
on the DGRPool server to the local/data
folder. You can use tools likersync
for this purpose. -
Build Files: If the files are not built automatically, navigate to the container and run the following commands:
npm run build & npm run build:css
-
Create services: There are several services that we create and enable for running at the server start. You can find them in ./services
On a linux system, they should be put in
/etc/systemd/system/
Then, you can reboot the service daemon:systemctl daemon-reload
After that, you can start all services with the commands:
systemctl start solr systemctl start rails systemctl start jobdemon systemctl start postgresql
And check if they are running properly using:
systemctl status solr systemctl status rails systemctl status jobdemon systemctl status postgresql
The service will automatically restart the programs if they fail. We will also enable them so that they automatically start when the server boots:
systemctl enable solr systemctl enable rails systemctl enable jobdemon systemctl enable postgresql
DGRPool welcomes contributions from the community. Users interested in becoming curators or contributing to the development of the platform can reach out through the provided contact information on the website.
For inquiries or support, please contact the DGRPool team at bioinfo.epfl@gmail.com
See LICENSE file
By leveraging the power of community-driven curation and interactive data analysis, DGRPool aims to accelerate research in Drosophila genetics and provide a valuable resource for the scientific community.