Changes:
-
Added function to make deployment to posit connect servers easier
-
Added ability to use CohortGenerator sample functionality to executeDiagnostics which speeds up execution for very large cohort definitions
-
Requires use of FeatureExtraction 3.4.0 to support new API
-
Removed annotation tables from result schema script
Bug Fixes:
-
fixed broken migrations using "ADD COLUMN "
-
Fixed broken links in manaual
-
Fixed issue with Andromeda calls in characterization on R 4.3.x
-
Fix for Vroom issue on mac
Bug Fix:
-
Added support for newer API to OhdsiShinyModules (while maintaining support for old version)
-
Resolved issue with incidence rates sometimes exporting as null/infinite by setting to 0 in all error cases
- Resolved issue where concept sets were only exported when diagnostics for them were executed. All concept sets are now always exported into results csv and imported into databases
- Fixes to unit tests breaking with R 4.3 due to change of Sys.time function
-
Added requirement for Andromeda 0.6.0 (which is implicitly required by DatabaseConnector)
-
Fixed issue saving cohort subsets to csv file for viewing and analysis in shiny app
-
Do not run orphan concepts checks for any subset cohorts
-
Remove use of lookback period for IR calculations - this is now a setting of the call to the package
-
Added data migration to support subsets in database schema (allow future functionality to take care of them)
-
Added functionality to
launchDiagnosticsExplorer
to make publishing to poist connect/shinyapps.io more straightforward (still requires removal of ggiraph) -
Moved most shiny code to
OHDSI/OhdsiShinyModules
Bug Fixes:
- Removed package dependency snapshot capture as it was breaking on newer versions of R
Changes:
-
Removed CohortExplorer app as it's now part of a new package.
-
Added support for custom FeatureExtraction features
Bug Fixes:
-
Fixed error when checking for cdm_source table
-
Removal of
.data$
usage across package to fix tidyselect warning
Changes:
-
Major refactoring of shiny app to use modular code for ease of maintenance
-
Some tests for shiny modules in
inst/shiny/DiagnosticsExplorer/tests
-
Added support for table prefixes in Diagnostics Explorer databases (e.g. cg_cohort_definition)
-
Enabled annotation on local instances of DiagnosticsExplorer
-
Added yaml configuration for diagnostics explorer app, including docs on usage
-
Storage of version number in database results file to allow future migrations
-
Slight optimization of cohort characterization queries in diagnostics explorer
-
Support for database migrations by adding
migrateDataModel
functionality. Versions of data generated with CohortDiagnostics 3.0.0 are intended to be future compatible. i.e. if you have an sqlite results file or postgres database generated with version 3.0.0 of cohort diagnostics new shiny app functionality will be supported if you runmigrateDataModel
on it. -
Changed type to dataType and fieldName to columnName to align with strategus modules
-
Refactored export of characterization results to use common export
-
Ensure that tests cases always use a continuous covariate
-
Batch operations for executing cohort relationship, time series, and feature extraction based diagnostics.
-
New parameter minCharacterizationMean. This introduces a cut off for the output of FeatureExtraction. In the absence of the parameter the output would have atleast one row for every covariateId in the datasource - most having very low count to be useful for diagnostics.
Bug fixes:
-
Fixed issue uploading results to postgres db caused by null values in primary key field. Removed constraint to fix issue.
-
Fix for
index_event_breakdown
having duplicate entries where concept is observed in the same domain multiple times -
Many other issues resolved in shiny codebase
Changes:
-
Changed default batch size for characterization feature extraction from 100 to 5 as it was causing performance issues on redshift clusters.
-
Allow setting of batch size for feature extraction with
options("CohortDiagnostics-FE-batch-size" = batchSize)
Bug fixes:
-
Fixed issue with writing csvs caused by update to SqlRender camelCaseToSnakeCase function check that caused execution to crash if parameters were null.
-
Fixed issue with observation period overflowing sql integer on BigQuery causing execution to crash
Bug fixes:
-
Updated old/incorrect documentation on package usage
-
Fixed bug with new versions of CohortGenerator v0.5.0 causing cohort definition sets in package to not load
-
Fixed bug in shiny app where multiple runs on the same database would cause the app to crash when selecting database
Changes:
-
Time series diagnostics removed
-
Removed runCohortDiagnostics function - this has now been completely replaced with executeDiagnostics
-
Removed
loadCohortsFromPackage
function as this is now replaced withCohortGenerator::getCohortDefinitionSet
-
Removed instantiate cohort functionality,
instantiateCohortSet
should now be used with theCohortGenerator
package -
Removed optional
inclusionStatisticsFolder
parameter, this is now all exported directly fromCohortGenerator
without the need to generate this first. -
Removed usage of Rdata files in DiagnosticsExplorer shiny app and function to create them
preMergeDiagnosticsFiles
-
Added function
createMergedResultsFile
which outputs a shiny app -
Added support for any
SqlRender/DatabaseConnector
compatible database (note, this is experimental. Postgres and sqlite are the only backends recommended for use in production environments) -
Improved metadata collection and storage from runs of cohort diagnostics.
-
Removed phenotype_id field from data ddl
-
Additional checks to the output of cohort diagnostics to ensure it conforms to its own results data model. The new function (internal) is makeDataExportable. Results data model csv file has been enhanced with new fields, including a field to specify if the value is to be subjected to privacy protection (i.e. min cell count, eg. person count). Note a bug was discovered in the orphan concepts and included source concepts that was leading to duplication of row records by primary key. This bug has been fixed by calculating its max value grouped by primary keys. It will be fixed in another commit.
-
New optional diagnostics computes temporal relationship between any two cohorts. The settings for the temporal relationship between cohorts defaults to be the same as temporalCovariateSettings. This diagnostics will be integrated into the characterization output of diagnostics explorer, where cohorts will be covariates.
-
New optional diagnostics called time series diagnostics. Time series diagnostics takes as input a calendar period range, and in that calendar period range for calendar units (year, quarter, month) computes the approximate new occurrence (approximates incidence) and observations (approximates prevalence) of the cohort start and cohort end dates during the calendar period.
Bug fixes:
- Added support for users to include non-standard columns in their CDM preventing crashes
Bug fixes:
- Fixed a bug that was causing generation of premerged and upload files to fail when the output had fields that were not in the results data model.
Bug fixes:
- Replace use dplyr across() for bug introduced by tidyr v1.2.0
Bug fixes:
- Fixed syntax error causing empty description field in inclusion rule stats
Bug fixes:
-
Added new lines in NEWS.md to fix package site.
-
Updated vignette to fix minor issues
-
Cohorts with zero counts are now stored in results and display in shiny app
Changes:
-
Added
executeDiagnostics
function which aims to replacerunCohortDiagnostics
as the main interface to the package. -
Updated vignette on "Running Cohort Diagnostics" to give clearer instructions
-
Removed vignettes on usage that are no longer required
-
Improved testing across database platforms
-
Moved some not particularly useful warnings to
logInfo
Bug Fixes:
-
User code removed from CohortExplorer to fix issue #618
-
Fixed bug with
runBreakdownIndexEvents = TRUE
failed for drug_era table with SQL error - Issue #695 -
Fixed error when computing incidence rates on BigQuery.
-
Fixed error when
cdm_source
table is empty (warning remains). -
Fixed error when instantiating cohorts on BigQuery.
Changes:
-
Minor cosmetic changes to diagnostics explorer shiny app. Typo fix
-
Fix for warning from type-convert #661
-
Use Sex instead of Gender in Diagnostics Explorer #676
Bug Fixes:
- Privacy protecting feature bug fix. In prior version covariate_value and covariate_value_dist failed privacy protection. Thank you @msuchard for reporting the issue and @schuemie for fixing #658
Changes:
- On starting diagnostics explorer using launchDiagnosticsExplorer - checks were added for remotes and CirceR #595
Bug Fixes:
- Diagnostics explorer - characterization plot would show a warning message when one of the cohorts selected has no data. Added check for this issue.
Bug fixes:
- DiagnosticsExplorer fixes a bug of app failure when runIndexEventBreakdown, runOrphanConcepts, includedSourceConcept is set to FALSE
Bug fixes:
- DiagnosticsExplorer shiny app bug fixes: inclusion rule display.
- Handle situation where cdm_source may have more than one row record. warning + use max value for vocabularyVersion.
- Switch off runTimeSeries for BigQuery because of SQL translation issue, to be addressed in version 2.3. #503 Also runTimeSeries is set to FALSE by default.
- Fixed a bug in DDL - temporal_covariate_value_dist #490 See issue #497
Changes:
- Diagnostics explorer Shiny app enhancements:
-
Improved tool tip
-
Various improvements to plots for consistent color, axis labels and labels
-
Visit context table addition
-
Diagnostic explorer is now a distinct shiny application from phenotype library. PhenotypeExplorer is a stand alone shiny app in package PhenotypeLibrarian.
-
Lot of UX changes. Reactivity deferred on drop down menus.
-
Changes to improve app stability.
-
Index event breakdown now has subject count
-
Index event breakdown calculates _source_concept_id from source fields in CDM tables.
-
Vocabulary database schema is now supported.
-
Metadata (vocabulary version information from data source) is now collected.
-
OracleTempSchema use deprecated in favor of tempEmulationSchema.
-
Run against external concept count has been removed, as concept counts data is not available. Function 'runCohortDiagnosticsUsingExternalCounts' is removed.
-
Removed code related to referentConceptId = phenotypeId/1000 as it does not always hold true.
-
Create cohort table function is now private. Please use instantiate cohort.
-
checkInputFileEncoding is not exported as a public function (as not the scope of CohortDiagnostics).
-
Updated results data model to include new tables (resolved_concepts).
-
Cohort Diagnostics results data model now compliant with standard characterization output.
-
Support for cohort_censor_stats table in webapi 2.8.1 #387
-
Add time series diagnostics computation. Output is not in Diagnostics explorer in this version.
-
Any improvements to help with usability and stability. Informative messages to help with debugbing as needed.
-
phenotypeDescription is no longer supported as input for cohort diagnostics.
Bug fixes:
-
databaseName and databaseDescription should be non NULL
-
Fixed computation of standard deviation and standard difference of mean for binary covariates.
Changes:
-
Many improvements in performance when computing diagnostics. Now allows computation of diagnostics for many (>100) cohorts at once.
-
The Diagnostics Explorer Shiny app can now also run against results data in a database. Added functions for uploading the diagnostics data to a database. This becomes necessary when dealing with very large data (e.g. more than 100 cohort definitions).
-
Added ability to group cohort definitions by phenotypes, and add descriptions of phenotypes. Currently the only consequence of grouping by phenotype is that cohort overlap is only computed within a phenotype. Grouping by phenotypes is done by adding a
phenotypeId
field to thecohortsToCreate
file, and by adding aphenotypeDescription
file. See theexamplePhenotypeLibraryPackage
in the repo for an example. -
The cohort overlap plot now uses stacked bar charts instead of a Venn diagram to allow showing multiple comparisons across multiple databases at once.
-
The cohort characteristics comparison plot now colors by domain, and allows filtering by domain.
-
Switching from 'plotly' to 'ggiraph' for interactive plotting.
-
Added a Visit Context diagnostic.
Bug fixes:
- Fixing numerous bugs introduced in v1.2.x.
Bug fixes:
- Additional bug fixes for characterization/temporal characterization.
Bug fixes:
- Additional bug fixes for characterization/temporal characterization.
Bug fixes:
- Added details log when characterization/temporal characterization does not return results or returns result below threshold value. By default we filter out results in from Characterization and Temporal Characterization where the value is less than 0.001. This was leading to empty results for some cohorts - causing errors.
Bug fixes:
-
Fixed error when many concept sets have to be instantiated.
-
Removed ohdsi/SqlRender from Remotes #189
-
Fixed Digit precision for RJSONIO::toJson and fromJSON #161 This is an important fix. If digit precision is not explicitly specified in RJSONIO, then scientific notation is used. This issue seems to only happen when an integer id (conceptId, conceptSetId, cohortId etc) >= 10,000,000 (which is rare). Please use this update if you have id's > 10,000,000.
New features:
-
Minor UI changes to Diagnostics explorer. Added missing sort.
-
Added better labels for plots.
-
Download plots.
Bug fixes:
-
Changes dependency to ROhdsiWebApi (>= 1.1.0)
-
DiagnosticsExplorer display bug fixes
New features:
-
All objects in DiagnosticsExplorer are sorted by default #173
-
Multi select for concepts #199
Bug fixes:
- Ensure concept sets across cohort definitions are unique #174 (changes dependency to ROhdsiWebApi (>= 1.1.0))
Changes:
-
New function to retrieve concept set json from cohort json \code{extractConceptSetsJsonFromCohortJson}
-
New function to retrieve concept set sql from cohort sql \code{extractConceptSetsSqlFromCohortSql}
-
DiagnosticsExplorer shiny app - DataTable now rendered using server side processing. Bug fixes and UI improvements.
-
DiagnosticsExplorer shiny app - Phenotype library mode (released)
-
DiagnosticsExplorer shiny app - Combine included source concepts and orphan concepts into one submenu #129
Bug fixes:
Changes:
-
Shiny app UI improvements
-
Link out to Atlas and Athena from cohortId. Supports baseUrl.
-
(beta - unreleased) support for Phenotype library. Shiny app will look for two additional csv files phenotypeDescription and cohortDescription that put the DiagnosticExplorer in Phenotype Library Mode. Plan to release in future version >= 1.2
-
Changed default selections for temporal characterization
-
Added minimum threshold value to covariate_value and temporal_covariate_value with default value = 0 (future release, we plan to make this 0.005 i.e. 0.5%)
Bug fixes:
- Minor bug fixes.
Changes:
-
Added temporal characterization
-
UI changes to Shiny app diagnostic explorer
Bug fixes:
-
Circe-be update introduced bug in parsing concept sets in cohort definition.
-
Handling of empty cohorts
Changes:
-
Error handling: Use checkmate R-package to provide more informative error messages.
-
Refactor runCohortDiagnostics: added new function by refactoring existing private functions. This new function get the JSON and parameterized OHDSI SQL for the cohorts for which diagnostics has been requested \code{getCohortsJsonAndSql}
Note: The code has been partially refactored to depend on tidyverse.
Changes:
-
Adapting to new ROhdsiWebApi (>= 1.0.0) interface.
-
Added log info that provides addition run time details #72
Bugfixes:
-
Fixing broken getTimeDistributions function.
-
Fixing broken instantiateCohort function.
Changes:
-
Added database information tab to Diagnostics Explorer Shiny app.
-
Using Andromeda instead of ff to store large data objects (used for characterization).
Bugfixes:
-
Fixed error when cohort definition has no concept sets.
-
Fixed error in index event breakdown when entry event contained multiple criteria for the same domain.
-
Fixed error related to tempEmulationSchema argument (needed on Oracle and BigQuery).
-
Fixed use of deprecated function, causing warnings and causing older versions of ParallelLogger to error.
Changes
- Adding option to run cohort instantiation and diagnostics incrementally, so only running those tasks that have changed since the last ru.
Changes
- Orphan and included concepts tables now show concept ID.
Bugfixes:
-
Fixed error in orphan code SQL (hard-coded database schema).
-
Fixed bug when concept counts become too big for a regular INT.
-
Fixed counts of standard concepts shown when using external concept count table.
Changes
-
Orphan code check now resolves concept sets instead of using verbatim concepts. Should include specificity.
-
Added option to use the same y-axis for all databases in incidence rate plot.
-
Added runCohortDiagnosticsUsingExternalCounts function.
-
Added option to use cohort definitions in WebAPI instead of those stored in study package.
Bugfixes:
-
Fixed bug causing weird false positives in orphan codes (caused by transforming concept IDs to scientific notation).
-
Better handling in viewer when some analyses are not executed.
-
Now also showing cohorts that had zero entries.
-
Fixed error when there was only 1 cohort.
Initial version