sablokgaurav
Gaurav Sablok, Academic Staff Member, Bioinformatics, Institute for Biochemistry and Biology, Universität Potsdam, Potsdam, Germany
Universität Potsdam, Germany
Pinned Repositories
bacterial-keras
A Keras implementation of machine learning for bacterial genomes. It takes a FASTA file, the annotation features, and the genes on which you want to train the Keras model.
codingplotter
A coding plotter for protein annotations produced by annotating a genome with protein hints. It extracts and plots the specific length estimates.
domain-analyzer
This repository contains a faster, data-science-based implementation for processing domain predictions from InterProScan. It gives you complete domain information, coordinates, and other associated information. To make it faster, I used a dataframe mapping approach rather than looping over the data repeatedly.
domain-directed-graphs
This repository contains a function that prepares the domain graph analysis. If you specify a domain or an InterPro entry, it gives you all the parent and child graphs for directed and undirected graph modelling.
expression-neural-network
A deep neural expression-based classifier, demonstrated on an unbalanced dataset of expression data across samples.
genome-shell-utility
A genome shell utility to help with project management and organization. It automatically creates the folders, transfers the files to the server, and moves them around.
pacbio-nanopore-polyATGC-trimmer
A regular-expression-based polyATGC trimmer for long reads or FASTQ reads. It is extremely fast and returns a FASTA file and a dataframe for sequence classification.
pacbiohifi-analyzer
A PacBio HiFi analyzer that gives all the information for PacBio HiFi reads, from raw reads to graph alignments.
pangraphs-genome-assembly
A complete workflow for long-read assembly that can be dockerized. It supports updating an existing genome assembly as well as assembling from the start. If you have Illumina reads, it also supports genome mapping.
python-algorithms-datastructures
This repository contains the code I have posted on LinkedIn as solutions to LeetCode, Interview Query, and Codewars questions. I used a different approach from the ones commonly mentioned elsewhere.
sablokgaurav's Repositories
sablokgaurav/codingplotter
A coding plotter for protein annotations produced by annotating a genome with protein hints. It extracts and plots the specific length estimates.
sablokgaurav/nextflow-pacbiohifi
A Nextflow workflow for genome assembly from PacBio HiFi reads. It also includes support for visualization and genome assessment.
sablokgaurav/protein-annotator
A Python package to analyze the protein-coding regions for genome annotation. It uses miniprot for the alignment and gives you all the protein-predicted mRNA, coding regions, and other exon positions.
sablokgaurav/pacbiohifi-analyzer
A PacBio HiFi analyzer that gives all the information for PacBio HiFi reads, from raw reads to graph alignments.
sablokgaurav/arabidopsis-maf-cap-accessions
arabidopsis-maf-cap-accessions: genome extraction, alignments, visualization, phylogenomics, ancestral tree.
sablokgaurav/coding-stitcher-pangenome
A coding stitcher for genome annotations that stitches together all the coding regions coming out of the exon alignments and produces the gene visualization for the pangenome.
sablokgaurav/data-driven-web-apps-with-flask
Course demo code and other hand-out materials for our data-driven web apps in Flask course
sablokgaurav/deeplearning4nlp-tutorial
Hands-on tutorial on deep learning with a special focus on Natural Language Processing (NLP)
sablokgaurav/evoseq-genome-informatics
An R package for going from genome annotations to phylogeny, and for the analysis of specific genes from sequenced genomes.
sablokgaurav/flux-models-ruby
Implementation of mathematical models in Ruby bindings and also as shards for Crystal. This repository will be updated regularly for the complete integration.
sablokgaurav/genome-annotation-multivisual
A dplyr version of the visualization of all the coding regions for specific IDs from the protein alignment.
sablokgaurav/genome-annotation-visualizer
An R function to visualize the genes coming from the genome alignment proteome annotations. This is part of the evoseq R package.
sablokgaurav/GenomeAnnotation
Best practices and workflow for genome annotation
sablokgaurav/genomehifi-contiguity
A conda YAML file for genomehifi-contiguity that lets you create the environment for all the analyses.
sablokgaurav/intergenic-extractor
Extracts all the intergenic regions from the genome annotation using the protein alignments.
sablokgaurav/miniprot-protein-annotator
A protein-coding region annotator that takes an alignment file in PAF/GFF format, extracts the complete coding regions, and prepares them for deep learning.
sablokgaurav/ML-For-Beginners
12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
sablokgaurav/mRNAplotter
Plotting tools for the mRNA from the proteome to the genome annotation. Produces a tab-delimited file with the start and stop of the mRNAs.
sablokgaurav/ontologies-semanticweb
An ontology-to-relation patcher that prepares the distance implementation. It also prepares the files for NetworkX.
sablokgaurav/ontologyanalyzer
A Python package for genome annotations, the semantic web, and graph implementation on ontologies.
sablokgaurav/pacbiohifi-meryl
Plotting tools for PacBio HiFi Meryl hapmers and yak k-mers.
sablokgaurav/r-package-backup
An R function for R package installation.
sablokgaurav/ruby-gems-bioinformatics
A collection of Ruby gems that I use frequently in my bioinformatics analysis and also for nominal tasks.
sablokgaurav/sablokgaurav
sablokgaurav/scala-sbt-class-mapper
A Scala sbt class for building a class-based mapper for genome alignments and parsing them for visual inference.
sablokgaurav/scaling-kmers-neural
from word segmentation to neural kmers
sablokgaurav/semanticweb-ontologies-prepare
Generates the ontology graphs and the system relationships.
sablokgaurav/smile
Statistical Machine Intelligence & Learning Engine
sablokgaurav/snp-extract
Extracts all the SNP sites for Panache and then estimates an identity matrix.
sablokgaurav/visualfreq
Alignment visualization and phylogeny plotting for genome-aligned regions.