Since I am using data from de novo assembly, I filtered the genes using the script

  • Used kallisto to map and count reads;
  • Used trinity script to get top 90% genes expressed in each sample;
  • Selected the genes that were in at least one of these lists.