With the introduction of #627, the ClinVar control refresh job will be invoked per score set, with a single month-and-year version to refresh. We should enhance the job so that it can be invoked with a list of versions to refresh. Since each variant in ClinGen will only have one associated ClinVar control, we should only need to make one request to the service per variant to fetch it. The current workflow, though, will make a request for each version.
An updated workflow might look like:
- Fetch each variant-summary TSV from ClinVar
- Make a request to ClinGen to fetch the associated ClinVar accession for a variant
- Associate this accession with our mapped variant based on each TSV
This would significantly reduce the job time by eliminating N-1 ClinGen requests per variant, where N is the number of versions being refreshed.
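A rough sketch of the proposed workflow, to make the request savings concrete. Everything named here is hypothetical and not taken from the existing job: `refresh_clinvar_controls`, the `fetch_tsv` and `fetch_accession` callables (stand-ins for the ClinVar download and the ClinGen lookup), and the representation of a parsed TSV as a dict keyed by ClinVar accession are all assumptions made for illustration.

```python
from typing import Callable, Dict, Iterable, Optional, Tuple

# Assumed shapes (not from the existing code base):
# a version is a (year, month) pair, and a parsed variant-summary TSV
# is a dict mapping a ClinVar accession to its row.
Version = Tuple[int, int]
ClinvarTsv = Dict[str, dict]


def refresh_clinvar_controls(
    variant_ids: Iterable[str],
    versions: Iterable[Version],
    fetch_tsv: Callable[[Version], ClinvarTsv],
    fetch_accession: Callable[[str], Optional[str]],
) -> Dict[str, Dict[Version, dict]]:
    """Return, per variant, the ClinVar row found in each requested version."""
    # Step 1: fetch each variant-summary TSV once, up front.
    tsvs = {version: fetch_tsv(version) for version in versions}

    controls: Dict[str, Dict[Version, dict]] = {}
    for variant_id in variant_ids:
        # Step 2: a single ClinGen request per variant, independent of
        # how many versions are being refreshed.
        accession = fetch_accession(variant_id)
        if accession is None:
            continue  # no ClinVar accession is associated with this variant

        # Step 3: look the same accession up in every version's TSV.
        controls[variant_id] = {
            version: tsv[accession]
            for version, tsv in tsvs.items()
            if accession in tsv
        }
    return controls
```

With this shape, refreshing N versions costs N TSV downloads plus one ClinGen request per variant, rather than N ClinGen requests per variant.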