Results for TCGA samples having all "NA" values
boseb opened this issue · 1 comments
Hi there,
I was following the vignette and found that TCGA samples do not have any data points. Please help.
The code is below:
Find a project of interest
project_info <- abstract_search("TCGA")
Download the gene-level RangedSummarizedExperiment data
download_study(project_info$project[7])
Load the data
load(file.path(project_info$project[7], "rse_gene.Rdata"))
Delete it if you don't need it anymore
#unlink(project_info$project[7], recursive = TRUE)
Before adding predictions
dim(colData(rse_gene))##11284 864
Add the predictions
rse_gene <- add_predictions(rse_gene)
After adding the predictions
dim(colData(rse_gene))#11284 876
Explore the variables
x<-colData(rse_gene)[, 865:ncol(colData(rse_gene))]
x[1:50,1:5]
session_info()
─ Session info ───────────────────────────────────────────────────────────────────────────────────────────────
setting value
version R version 4.0.3 (2020-10-10)
os Red Hat Enterprise Linux Server 7.5 (Maipo)
system x86_64, linux-gnu
ui RStudio
language (EN)
collate en_US.UTF-8
ctype en_US.UTF-8
tz America/Chicago
date 2021-07-06
─ Packages ───────────────────────────────────────────────────────────────────────────────────────────────────
! package * version date lib source
annotate 1.68.0 2020-10-27 [1] Bioconductor
AnnotationDbi * 1.52.0 2020-10-27 [1] Bioconductor
AnnotationFilter 1.14.0 2020-10-27 [1] Bioconductor
AnnotationHub * 2.22.1 2021-04-16 [1] Bioconductor
askpass 1.1 2019-01-13 [1] CRAN (R 4.0.3)
assertthat 0.2.1 2019-03-21 [1] CRAN (R 4.0.3)
backports 1.2.1 2020-12-09 [1] CRAN (R 4.0.3)
base64enc 0.1-3 2015-07-28 [1] CRAN (R 4.0.3)
Biobase * 2.50.0 2020-10-27 [1] Bioconductor
BiocFileCache * 1.14.0 2020-10-27 [1] Bioconductor
BiocGenerics * 0.36.1 2021-04-16 [1] Bioconductor
BiocManager 1.30.15 2021-05-11 [1] CRAN (R 4.0.3)
BiocParallel 1.24.1 2020-11-06 [1] Bioconductor
BiocStyle 2.18.1 2020-11-24 [1] Bioconductor
BiocVersion 3.12.0 2020-04-27 [1] Bioconductor
biomaRt 2.46.3 2021-02-09 [1] Bioconductor
Biostrings 2.58.0 2020-10-27 [1] Bioconductor
biovizBase 1.38.0 2020-10-27 [1] Bioconductor
bit 4.0.4 2020-08-04 [1] CRAN (R 4.0.3)
bit64 4.0.5 2020-08-30 [1] CRAN (R 4.0.3)
bitops 1.0-7 2021-04-24 [1] CRAN (R 4.0.3)
blob 1.2.1 2020-01-20 [1] CRAN (R 4.0.3)
broom 0.7.6 2021-04-05 [1] CRAN (R 4.0.3)
BSgenome 1.58.0 2020-10-27 [1] Bioconductor
bumphunter * 1.32.0 2020-10-27 [1] Bioconductor
cachem 1.0.5 2021-05-15 [1] CRAN (R 4.0.3)
callr 3.7.0 2021-04-20 [1] CRAN (R 4.0.3)
caTools 1.18.2 2021-03-28 [1] CRAN (R 4.0.3)
CCA * 1.2.1 2021-03-01 [1] CRAN (R 4.0.3)
cellranger 1.1.0 2016-07-27 [1] CRAN (R 4.0.3)
checkmate 2.0.0 2020-02-06 [1] CRAN (R 4.0.3)
chron 2.3-56 2020-08-18 [1] CRAN (R 4.0.3)
cli 2.5.0 2021-04-26 [1] CRAN (R 4.0.3)
cluster 2.1.0 2019-06-19 [2] CRAN (R 4.0.3)
clusterProfiler * 3.18.1 2021-02-09 [1] Bioconductor
codetools 0.2-16 2018-12-24 [2] CRAN (R 4.0.3)
colorspace 2.0-1 2021-05-04 [1] CRAN (R 4.0.3)
cowplot 1.1.1 2020-12-30 [1] CRAN (R 4.0.3)
crayon 1.4.1 2021-02-08 [1] CRAN (R 4.0.3)
curl 4.3.1 2021-04-30 [1] CRAN (R 4.0.3)
data.table * 1.14.0 2021-02-21 [1] CRAN (R 4.0.3)
DBI 1.1.1 2021-01-15 [1] CRAN (R 4.0.3)
dbplyr * 2.1.1 2021-04-06 [1] CRAN (R 4.0.3)
DEFormats 1.18.0 2020-10-27 [1] Bioconductor
DelayedArray 0.16.3 2021-03-24 [1] Bioconductor
derfinder * 1.24.2 2020-12-18 [1] Bioconductor
derfinderHelper 1.24.1 2020-12-18 [1] Bioconductor
derfinderPlot * 1.24.1 2020-12-18 [1] Bioconductor
desc 1.3.0 2021-03-05 [1] CRAN (R 4.0.3)
DESeq2 * 1.30.1 2021-02-19 [1] Bioconductor
devtools 2.4.2 2021-06-07 [1] CRAN (R 4.0.3)
dichromat 2.0-0 2013-01-24 [1] CRAN (R 4.0.3)
digest 0.6.27 2020-10-24 [1] CRAN (R 4.0.3)
DO.db 2.9 2021-07-06 [1] Bioconductor
doRNG 1.8.2 2020-01-27 [1] CRAN (R 4.0.3)
DOSE 3.16.0 2020-10-27 [1] Bioconductor
dotCall64 * 1.0-1 2021-02-11 [1] CRAN (R 4.0.3)
downloader 0.4 2015-07-09 [1] CRAN (R 4.0.3)
dplyr * 1.0.6 2021-05-05 [1] CRAN (R 4.0.3)
edgeR * 3.32.1 2021-01-14 [1] Bioconductor
ellipsis 0.3.2 2021-04-29 [1] CRAN (R 4.0.3)
enrichplot 1.10.2 2021-01-28 [1] Bioconductor
ensembldb 2.14.1 2021-04-19 [1] Bioconductor
evaluate 0.14 2019-05-28 [1] CRAN (R 4.0.3)
fansi 0.5.0 2021-05-25 [1] CRAN (R 4.0.3)
farver 2.1.0 2021-02-28 [1] CRAN (R 4.0.3)
fastmap 1.1.0 2021-01-25 [1] CRAN (R 4.0.3)
fastmatch 1.1-0 2017-01-28 [1] CRAN (R 4.0.3)
fda * 5.1.9 2020-12-16 [1] CRAN (R 4.0.3)
fds * 1.8 2018-10-31 [1] CRAN (R 4.0.3)
fgsea 1.16.0 2020-10-27 [1] Bioconductor
fields * 12.3 2021-05-17 [1] CRAN (R 4.0.3)
forcats * 0.5.1 2021-01-27 [1] CRAN (R 4.0.3)
foreach * 1.5.1 2020-10-15 [1] CRAN (R 4.0.3)
foreign 0.8-80 2020-05-24 [2] CRAN (R 4.0.3)
Formula 1.2-4 2020-10-16 [1] CRAN (R 4.0.3)
fs 1.5.0 2020-07-31 [1] CRAN (R 4.0.3)
genefilter 1.72.1 2021-01-21 [1] Bioconductor
geneplotter 1.68.0 2020-10-27 [1] Bioconductor
generics 0.1.0 2020-10-31 [1] CRAN (R 4.0.3)
GenomeInfoDb * 1.26.7 2021-04-08 [1] Bioconductor
GenomeInfoDbData 1.2.4 2021-06-08 [1] Bioconductor
GenomicAlignments 1.26.0 2020-10-27 [1] Bioconductor
GenomicFeatures 1.42.3 2021-04-01 [1] Bioconductor
GenomicFiles 1.26.0 2020-10-27 [1] Bioconductor
GenomicRanges * 1.42.0 2020-10-27 [1] Bioconductor
GenomicState * 0.99.9 2021-07-06 [1] Bioconductor
GEOquery 2.58.0 2020-10-27 [1] Bioconductor
GGally 2.1.2 2021-06-21 [1] CRAN (R 4.0.3)
ggbio 1.38.0 2020-10-27 [1] Bioconductor
ggforce 0.3.3 2021-03-05 [1] CRAN (R 4.0.3)
V ggplot2 * 3.3.3 2021-06-25 [1] CRAN (R 4.0.3)
ggraph 2.0.5 2021-02-23 [1] CRAN (R 4.0.3)
ggrepel 0.9.1 2021-01-15 [1] CRAN (R 4.0.3)
glue 1.4.2 2020-08-27 [1] CRAN (R 4.0.3)
GO.db 3.12.1 2021-06-18 [1] Bioconductor
GOSemSim 2.16.1 2020-10-29 [1] Bioconductor
gplots * 3.1.1 2020-11-28 [1] CRAN (R 4.0.3)
graph 1.68.0 2020-10-27 [1] Bioconductor
graphlayouts 0.7.1 2020-10-26 [1] CRAN (R 4.0.3)
gridExtra 2.3 2017-09-09 [1] CRAN (R 4.0.3)
gsubfn * 0.7 2018-03-16 [1] CRAN (R 4.0.3)
gtable 0.3.0 2019-03-25 [1] CRAN (R 4.0.3)
gtools 3.9.2 2021-06-06 [1] CRAN (R 4.0.3)
haven 2.4.1 2021-04-23 [1] CRAN (R 4.0.3)
hdrcde 3.4 2021-01-18 [1] CRAN (R 4.0.3)
Hmisc 4.5-0 2021-02-28 [1] CRAN (R 4.0.3)
hms 1.1.0 2021-05-17 [1] CRAN (R 4.0.3)
htmlTable 2.2.1 2021-05-18 [1] CRAN (R 4.0.3)
htmltools 0.5.1.1 2021-01-22 [1] CRAN (R 4.0.3)
htmlwidgets 1.5.3 2020-12-10 [1] CRAN (R 4.0.3)
httpuv 1.6.1 2021-05-07 [1] CRAN (R 4.0.3)
httr 1.4.2 2020-07-20 [1] CRAN (R 4.0.3)
igraph 1.2.6 2020-10-06 [1] CRAN (R 4.0.3)
interactiveDisplayBase 1.28.0 2020-10-27 [1] Bioconductor
IRanges * 2.24.1 2020-12-12 [1] Bioconductor
iterators * 1.0.13 2020-10-15 [1] CRAN (R 4.0.3)
janitor * 2.1.0 2021-01-05 [1] CRAN (R 4.0.3)
jpeg 0.1-8.1 2019-10-24 [1] CRAN (R 4.0.3)
jsonlite 1.7.2 2020-12-09 [1] CRAN (R 4.0.3)
KernSmooth 2.23-17 2020-04-26 [2] CRAN (R 4.0.3)
knitr 1.33 2021-04-24 [1] CRAN (R 4.0.3)
knitrBootstrap 1.0.2 2018-05-24 [1] CRAN (R 4.0.3)
ks 1.13.1 2021-06-01 [1] CRAN (R 4.0.3)
later 1.2.0 2021-04-23 [1] CRAN (R 4.0.3)
lattice 0.20-41 2020-04-02 [2] CRAN (R 4.0.3)
latticeExtra 0.6-29 2019-12-19 [1] CRAN (R 4.0.3)
lazyeval 0.2.2 2019-03-15 [1] CRAN (R 4.0.3)
lifecycle 1.0.0 2021-02-15 [1] CRAN (R 4.0.3)
limma * 3.46.0 2020-10-27 [1] Bioconductor
locfit * 1.5-9.4 2020-03-25 [1] CRAN (R 4.0.3)
lubridate 1.7.10 2021-02-26 [1] CRAN (R 4.0.3)
magrittr 2.0.1 2020-11-17 [1] CRAN (R 4.0.3)
maps 3.3.0 2018-04-03 [1] CRAN (R 4.0.3)
markdown 1.1 2019-08-07 [1] CRAN (R 4.0.3)
MASS * 7.3-53 2020-09-09 [2] CRAN (R 4.0.3)
Matrix * 1.3-4 2021-06-01 [1] CRAN (R 4.0.3)
matrixcalc * 1.0-4 2021-06-03 [1] CRAN (R 4.0.3)
MatrixGenerics * 1.2.1 2021-01-30 [1] Bioconductor
matrixStats * 0.59.0 2021-06-01 [1] CRAN (R 4.0.3)
mclust 5.4.7 2020-11-20 [1] CRAN (R 4.0.3)
memoise 2.0.0 2021-01-26 [1] CRAN (R 4.0.3)
mime 0.10 2021-02-13 [1] CRAN (R 4.0.3)
modelr 0.1.8 2020-05-19 [1] CRAN (R 4.0.3)
munsell 0.5.0 2018-06-12 [1] CRAN (R 4.0.3)
mvtnorm 1.1-2 2021-06-07 [1] CRAN (R 4.0.3)
nnet 7.3-14 2020-04-26 [2] CRAN (R 4.0.3)
openssl 1.4.4 2021-04-30 [1] CRAN (R 4.0.3)
org.Hs.eg.db * 3.12.0 2021-06-29 [1] Bioconductor
OrganismDbi 1.32.0 2020-10-27 [1] Bioconductor
pcaPP * 1.9-74 2021-04-23 [1] CRAN (R 4.0.3)
pillar 1.6.1 2021-05-16 [1] CRAN (R 4.0.3)
pkgbuild 1.2.0 2020-12-15 [1] CRAN (R 4.0.3)
pkgconfig 2.0.3 2019-09-22 [1] CRAN (R 4.0.3)
pkgload 1.2.1 2021-04-06 [1] CRAN (R 4.0.3)
plyr * 1.8.6 2020-03-03 [1] CRAN (R 4.0.3)
PMA * 1.2.1 2020-02-03 [1] CRAN (R 4.0.3)
png 0.1-7 2013-12-03 [1] CRAN (R 4.0.3)
polyclip 1.10-0 2019-03-14 [1] CRAN (R 4.0.3)
pracma 2.3.3 2021-01-23 [1] CRAN (R 4.0.3)
prettyunits 1.1.1 2020-01-24 [1] CRAN (R 4.0.3)
processx 3.5.2 2021-04-30 [1] CRAN (R 4.0.3)
progress 1.2.2 2019-05-16 [1] CRAN (R 4.0.3)
promises 1.2.0.1 2021-02-11 [1] CRAN (R 4.0.3)
ProtGenerics 1.22.0 2020-10-27 [1] Bioconductor
proto * 1.0.0 2016-10-29 [1] CRAN (R 4.0.3)
ps 1.6.0 2021-02-28 [1] CRAN (R 4.0.3)
purrr * 0.3.4 2020-04-17 [1] CRAN (R 4.0.3)
qvalue 2.22.0 2020-10-27 [1] Bioconductor
R.methodsS3 1.8.1 2020-08-26 [1] CRAN (R 4.0.3)
R.oo 1.24.0 2020-08-26 [1] CRAN (R 4.0.3)
R.utils 2.10.1 2020-08-26 [1] CRAN (R 4.0.3)
R6 2.5.0 2020-10-28 [1] CRAN (R 4.0.3)
rainbow * 3.6 2019-01-29 [1] CRAN (R 4.0.3)
rappdirs 0.3.3 2021-01-31 [1] CRAN (R 4.0.3)
RBGL 1.66.0 2020-10-27 [1] Bioconductor
RColorBrewer 1.1-2 2014-12-07 [1] CRAN (R 4.0.3)
Rcpp 1.0.6 2021-01-15 [1] CRAN (R 4.0.3)
RCurl * 1.98-1.3 2021-03-16 [1] CRAN (R 4.0.3)
readr * 1.4.0 2020-10-05 [1] CRAN (R 4.0.3)
readxl 1.3.1 2019-03-13 [1] CRAN (R 4.0.3)
recount * 1.16.1 2020-12-18 [1] Bioconductor
RefManageR 1.3.0 2020-11-13 [1] CRAN (R 4.0.3)
regionReport * 1.24.2 2020-12-18 [1] Bioconductor
remotes 2.4.0 2021-06-02 [1] CRAN (R 4.0.3)
rentrez 1.2.3 2020-11-10 [1] CRAN (R 4.0.3)
reprex 2.0.0 2021-04-02 [1] CRAN (R 4.0.3)
reshape * 0.8.8 2018-10-23 [1] CRAN (R 4.0.3)
reshape2 * 1.4.4 2020-04-09 [1] CRAN (R 4.0.3)
rlang * 0.4.11 2021-04-30 [1] CRAN (R 4.0.3)
rmarkdown 2.8 2021-05-07 [1] CRAN (R 4.0.3)
rngtools 1.5 2020-01-23 [1] CRAN (R 4.0.3)
rpart 4.1-15 2019-04-12 [2] CRAN (R 4.0.3)
rprojroot 2.0.2 2020-11-15 [1] CRAN (R 4.0.3)
Rsamtools 2.6.0 2020-10-27 [1] Bioconductor
RSQLite * 2.2.7 2021-04-22 [1] CRAN (R 4.0.3)
rstudioapi 0.13 2020-11-12 [1] CRAN (R 4.0.3)
rtracklayer 1.50.0 2020-10-27 [1] Bioconductor
rvcheck 0.1.8 2020-03-01 [1] CRAN (R 4.0.3)
rvest 1.0.0 2021-03-09 [1] CRAN (R 4.0.3)
S4Vectors * 0.28.1 2020-12-09 [1] Bioconductor
scales 1.1.1 2020-05-11 [1] CRAN (R 4.0.3)
scatterpie 0.1.6 2021-04-23 [1] CRAN (R 4.0.3)
sessioninfo * 1.1.1 2018-11-05 [1] CRAN (R 4.0.3)
shadowtext 0.0.8 2021-04-23 [1] CRAN (R 4.0.3)
shiny 1.6.0 2021-01-25 [1] CRAN (R 4.0.3)
snakecase 0.11.0 2019-05-25 [1] CRAN (R 4.0.3)
spam * 2.6-0 2020-12-14 [1] CRAN (R 4.0.3)
sqldf * 0.4-11 2017-06-28 [1] CRAN (R 4.0.3)
statmod * 1.4.36 2021-05-10 [1] CRAN (R 4.0.3)
stringi 1.6.2 2021-05-17 [1] CRAN (R 4.0.3)
stringr * 1.4.0 2019-02-10 [1] CRAN (R 4.0.3)
SummarizedExperiment * 1.20.0 2020-10-27 [1] Bioconductor
survival 3.2-7 2020-09-28 [2] CRAN (R 4.0.3)
TCGAbiolinks * 2.18.0 2020-10-27 [1] Bioconductor
TCGAbiolinksGUI.data 1.10.0 2020-10-29 [1] Bioconductor
testthat 3.0.2 2021-02-14 [1] CRAN (R 4.0.3)
tibble * 3.1.2 2021-05-16 [1] CRAN (R 4.0.3)
tidygraph 1.2.0 2020-05-12 [1] CRAN (R 4.0.3)
tidyr * 1.1.3 2021-03-03 [1] CRAN (R 4.0.3)
tidyselect 1.1.1 2021-04-30 [1] CRAN (R 4.0.3)
tidyverse * 1.3.1 2021-04-15 [1] CRAN (R 4.0.3)
tweenr 1.0.2 2021-03-23 [1] CRAN (R 4.0.3)
usethis 2.0.1 2021-02-10 [1] CRAN (R 4.0.3)
utf8 1.2.1 2021-03-12 [1] CRAN (R 4.0.3)
VariantAnnotation 1.36.0 2020-10-27 [1] Bioconductor
vctrs * 0.3.8 2021-04-29 [1] CRAN (R 4.0.3)
viridis * 0.6.1 2021-05-11 [1] CRAN (R 4.0.3)
viridisLite * 0.4.0 2021-04-13 [1] CRAN (R 4.0.3)
withr 2.4.2 2021-04-18 [1] CRAN (R 4.0.3)
xfun 0.23 2021-05-15 [1] CRAN (R 4.0.3)
XML 3.99-0.6 2021-03-16 [1] CRAN (R 4.0.3)
xml2 1.3.2 2020-04-23 [1] CRAN (R 4.0.3)
xtable 1.8-4 2019-04-21 [1] CRAN (R 4.0.3)
XVector 0.30.0 2020-10-27 [1] Bioconductor
yaml 2.2.1 2020-02-01 [1] CRAN (R 4.0.3)
zlibbioc 1.36.0 2020-10-27 [1] Bioconductor
Hi,
Thanks for the report. I can see what you are talking about. However, there's a small detail you missed at rse_gene <- add_predictions(rse_gene)
. It should have been rse_gene <- add_predictions(rse_gene, is_tcga = TRUE)
as you can see below:
> rse_gene_extra <- add_predictions(rse_gene, is_tcga = TRUE)
2021-08-02 23:01:18 downloading the predictions to /var/folders/cx/n9s558kx6fb7jf5z_pgszgb80000gn/T//RtmpGOfOsK/PredictedPhenotypes_v0.0.06.rda
trying URL 'https://github.com/leekgroup/recount-website/blob/master/predictions/PredictedPhenotypes_v0.0.06.rda?raw=true'
Content type 'application/octet-stream' length 548129 bytes (535 KB)
==================================================
downloaded 535 KB
Loading objects:
PredictedPhenotypes
> dim(colData(rse_gene_extra))
[1] 11284 876
> x2 <- colData(rse_gene_extra)[, 865:ncol(colData(rse_gene_extra))]
> head(x2[, 1:5])
DataFrame with 6 rows and 5 columns
reported_sex predicted_sex accuracy_sex
<factor> <factor> <numeric>
3DFF72D2-F292-497E-ACE3-6FAA9C884205 male male 0.989506
B1E54366-42B9-463C-8615-B34D52BD14DC male male 0.989506
473713F7-EB41-4F20-A37F-ACD209E3CB75 male male 0.989506
11F18F54-9B33-4C33-BDF9-0F093F4F3336 male male 0.989506
136B7576-1108-4FA3-8254-6069F0CA879A male male 0.989506
E81FA8B7-3FFE-4F73-94AF-0B5257D7F81A male male 0.989506
reported_samplesource
<factor>
3DFF72D2-F292-497E-ACE3-6FAA9C884205 tissue
B1E54366-42B9-463C-8615-B34D52BD14DC tissue
473713F7-EB41-4F20-A37F-ACD209E3CB75 tissue
11F18F54-9B33-4C33-BDF9-0F093F4F3336 tissue
136B7576-1108-4FA3-8254-6069F0CA879A tissue
E81FA8B7-3FFE-4F73-94AF-0B5257D7F81A tissue
predicted_samplesource
<factor>
3DFF72D2-F292-497E-ACE3-6FAA9C884205 tissue
B1E54366-42B9-463C-8615-B34D52BD14DC tissue
473713F7-EB41-4F20-A37F-ACD209E3CB75 tissue
11F18F54-9B33-4C33-BDF9-0F093F4F3336 tissue
136B7576-1108-4FA3-8254-6069F0CA879A cell_line
E81FA8B7-3FFE-4F73-94AF-0B5257D7F81A tissue
> packageVersion("recount")
[1] ‘1.18.0’
Best,
Leo