- Caching calls removed from
cBioPortalData
. API requests will always be performed.cBioDataPack
will continue to cache data files.
- Improvements to links in documentation and citation section in the vignette
- Remove error when
studyId
build success is unknown (@vlaufer, #69) - Parse tokens with different formats, e.g. genie token (@ZWael, #70)
- Clean up duplicate rows created in the
sampleMap
generated from the data - Fixed warning trigger when empty
molecularProfileId
datasets are found
- Update
successrate
thresholds and fix long tests
- Add
cgdsr
tocBioPortalData
migration vignette (@kmezhoud, #54) - Unmapped experiments are now added to the metadata in
cBioDataPack
- Set default
api. = api/v2/api-docs
incBioPortal
to access the API protocol's new location - The
fetchData
developer function added to handle bothmolecularData
andmutationData
requests as deduced from themolecularProfileIds
vector
- Check for valid
studyId
s withgetStudies
incBioDataPack
ask
argument correctly passed down to caching mechanism incBioDataPack
check_build
option available incBioDataPack
particularly for new studies that have not been checked against.
- Auth
token
string or file can now be included in thecBioPortal
function. - The
check_build
argument can be set toFALSE
for alternative APIs, e.g., KidsFirst, when usingcBioPortalData
queryGeneTable
translates gene IDs ('hugoGeneSymbols' <> 'entrezGeneIds') via the API servicegetDataByGenes
supersedesgetDataByGenePanel
getStudies()
replacesdata('studiesTable')
to discover study IDs
- Fixed issue where the
by
argument was not passed togetDataByGenes
in internal calls - Add names to metadata elements that originate from GISTIC datasets.
- A study's build status can be obtained from
getStudies()
, which has replaceddata('studiesTable')
. - Partial loading of data files supported. A warning is emitted when a
data file is not able to be loaded in
cBioDataPack
. cBioPortalData
checks thedata(studiesTable)
to verify that study datasets are building, otherwise provide a message in interactive sessions.
- Vignettes include additional information (#38, @lwaldron)
getDataByGenePanel
deprecated forgetDataByGenes
which handles input of both gene panels and genescBioPortalData
now allows for gene inputs as either Entrez IDs or Hugo symbols (#24, @jucor) andsampleIds
input- When
gene
inputs are provided, theby
argument has to agree with the type of genes provided (either beentrezGeneId
orhugoGeneSymbol
).
- Fixed an issue where the labels in the
metadata
fromcBioDataPack
were missing ('LICENSE' and 'Fusion'; #37) loadStudy
allowscleanup=TRUE
for removing files afteruntar
-ing- Published article now available with
citation("cBioPortalData")
studiesTable
includes additional columnspack_build
andapi_build
to indicate to the user which datasets have been successfully built asMultiAssayExperiment
objects. Users will be notified when a dataset, reported as not building, is requested from thecBioDataPack
function.- Add
sampleIds
argument togetDataByGenePanel
as part of cache re-work - Allow more flexibility in the hostname when accessing the API with
cBioPortal
(@inodb, #16) cBioDataPack
downloads from a more robust repository (AWS S3; @inodb, #22)removePackCache
andremoveDataCache
now remove data from the user's cache based on inputs to respective functions (cBioDataPack
andcBioPortalData
)
- Attempt to merge additional clinical data files from tarballs in
cBioDataPack
. - Switch to using
read.delim
instead ofread_tsv
internally to avoid assigningNA
to chromosome column - Use 'PATIENT_ID' when available to determine if experiment data is provided in the tarball files.
- Add tests using
testthat
- Update and include percentages of studies successfully imported using
cBioDataPack
andcBioPortalData
in the documentation - Fix read-in when identifiers are numeric instead of character (@jucor, #27)
- Include pagination parameters in
geneTable
function (@xinwei-sher, #29)
- Bioconductor release!
- Updated the
README.md
file from R Markdown file. - Uses the latest version of
rapiclient
on CRAN - Prepare package for Bioconductor submission
- Include protein metadata as a
RaggedExperiment
from mutation molecular profiles (TCGA only)
- API authentication option removed and not needed
- Package supports nearly all study identifiers based on recent tests
- Only a handful of study identifiers are unsuccessful (create an issue to prioritize).
- Make better use of the API return values to craft the sample map for
MultiAssayExperiment
creation - Additional data included in the metadata slot of the
MultiAssayExperiment
object. Future revisions will include this data asrowData
. - Change vignette titles for build
cBioDataPack
allows users to download packaged data objects from download.cbioportal.org/- Data packs are cached using
BiocFileCache
to avoid re-downloading cBioPortalData
lets users query the cbioportal.org API and retrieve slices of data according to gene, molecular profile identifiers, etc.- Queries through
cBioPortalData
use a caching mechanism to avoid repeat downloads of data and improve load times - Both functions return a
MultiAssayExperiment
as the primary data representation - Only a number of study datasets are currently possible to load. Issues can arise with mismatched or munged identifiers
- The cBioPortal API representation is handled by the
AnVIL
package which makes use ofrapiclient
to provide an automatic R interface to the API
- Data pack downloads use an alternative method for download when a download fails