-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Being able to identify media not uploaded #102
Comments
Discussion point for later implementation:
|
…s for each organisation: Sync and MediaAnalysis
- Consolidate all Create Job requests validation - Truncate jobDetail description
…g for UUID and excluding Mobile and Adhoc entries
Additionally introudced isHavingDuplicates column to mediaAnalysis table, which specifies if duplicates exist in the media table, the count isn't stored as a separate column but can be determined by joining with media table. select * from media m join media_analysis ma on m.uuid = ma.uuid and m.image_url = ma.image_url;
-- where m.uuid = '60b99f84-37cf-4873-b1a6-5e3f7f88af3b'; |
Sql query to use in setup alert on metabase when there are new anomalies.-- set role goonj;
with counts as (select count(*) AS Total,
sum(case when is_valid_url = false then 1 else 0 end) AS InvalidURLsCount,
sum(case when is_present_in_storage = false then 1 else 0 end) AS MissingMediaCount,
sum(case when is_thumbnail_generated = false then 1 else 0 end) AS MissingThumbnailCount,
sum(case when is_having_duplicates = true then 1 else 0 end) AS MediaWhichHasAtleastOneDuplicateCount
from media_analysis) select * from counts
where Total > 0 and (InvalidURLsCount > 0 or MissingMediaCount > 138 or MissingThumbnailCount > 372 or MediaWhichHasAtleastOneDuplicateCount > 291); |
For reference and dev testing, use postman collection for triggering ETL Jobs for Sync and/or MediaAnalysis. |
Add sensible values for staging/prerelease/prod for AVNI_MEDIA_ANALYSIS_JOB_REPEAT_INTERVAL in avni-infra. Also isn't the default of 2 minutes too low (if env var not configured)? MediaAnalysisTableRegenerateAction.process
|
@1t5j0y Absorbed the review comments |
https://avni.freshdesk.com/a/tickets/3932
Need:
Currently we don't have a mechanism to know if media URLs synced have been successfully uploaded to s3. This is important considering the frequent media anomalies we have encountered so far like media URLs present but media not present in S3, thumbnails not generated, etc., This will help us to establish a monitoring mechanism before user reports it. When there is no automatic way to determine we miss to check it.
We are not doing the same to make sure data(other than images) are synced because of the below reasons:
Context:
Media anomaly: One of the below:
AC:
, audit_infoPost deployment steps
Tech details if needed for reference: Ignore if not relevant
Old:
AC:
Out of scope:
To check if there are media anomalies for the existing data in the media table
Added above based on suggestions from here
Old: Ignore
Use cases
Definitions
Media anomaly: One of the below:
AC: (Based on the suggestions from here)
If never run for org,
If previously run for org,
To get s3 contents after a particular date:
Input:
Output:
Input:
Input issues:
The text was updated successfully, but these errors were encountered: