Hi,
Is it possible to run DPclust3p on INDELs that are present in a VCF file (from mutect2 output)?
From looking through the code, I think the issue is that the runGetDirichletProcessInfo function requires the count input as per-base A,T,C,G counts rather than REF vs. ALT counts. INDEL counts fit naturally into a REF vs. ALT format but can't be coerced into A,T,C,G counts.
I imagine the dumpcountsFromVCF function would have to be rewritten to output REF vs. ALT counts, and runGetDirichletProcessInfo would then have to be re-coded to accept this input format.
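To illustrate the REF vs. ALT format I have in mind, here is a minimal sketch in Python (not DPclust3p code). It assumes biallelic records and that the sample column carries an AD field with the REF depth first and the ALT depth second, as Mutect2 writes it. The function names `ref_alt_counts` and `is_indel`, the `sample_index` parameter, and the example record are hypothetical and only for illustration.

```python
def ref_alt_counts(vcf_line, sample_index=0):
    """Return (chrom, pos, ref, alt, ref_count, alt_count) for one VCF data line.

    Assumption: the record is biallelic and the FORMAT column contains AD,
    with the REF depth first and the ALT depth second.
    """
    fields = vcf_line.rstrip("\n").split("\t")
    chrom, pos, _id, ref, alt = fields[:5]
    format_keys = fields[8].split(":")
    # Sample columns start at the 10th VCF column (index 9).
    sample_values = fields[9 + sample_index].split(":")
    ad = sample_values[format_keys.index("AD")].split(",")
    return chrom, int(pos), ref, alt, int(ad[0]), int(ad[1])


def is_indel(ref, alt):
    """Treat any record whose REF and ALT lengths differ as an INDEL."""
    return len(ref) != len(alt)


# Made-up example record: a 1 bp deletion with 30 REF and 12 ALT reads.
example = "chr1\t12345\t.\tAT\tA\t.\tPASS\t.\tGT:AD:AF:DP\t0/1:30,12:0.286:42"
chrom, pos, ref, alt, ref_count, alt_count = ref_alt_counts(example)
if is_indel(ref, alt):
    print(chrom, pos, ref_count, alt_count)
```

In a tumour-normal Mutect2 VCF there are two sample columns, so `sample_index` would have to point at the tumour sample; which column that is depends on the file header.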
Do you know of anyone who has done this? Do you have any suggestions for how to get DPclust3p information for INDELs? I don't need to run all of DPclust on the INDELs, just the DPclust preprocessing, as I am only interested in the site copy number and the multiplicity of the INDELs.
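For reference, this is roughly the calculation I would want to apply once REF and ALT counts are available. It is a minimal sketch of the commonly used relation between VAF, purity and total copy number, not a reimplementation of DPclust3p. The function names, the default normal copy number of 2, and the choice to round and floor the multiplicity at 1 are my assumptions.

```python
def mutation_copy_number(ref_count, alt_count, purity,
                         total_cn_tumour, total_cn_normal=2):
    """Estimate the mutation copy number from read counts.

    Assumption: VAF is alt / (ref + alt), and the expected VAF scales with
    the purity-weighted average copy number of tumour and normal cells.
    """
    vaf = alt_count / (ref_count + alt_count)
    average_cn = purity * total_cn_tumour + (1 - purity) * total_cn_normal
    return vaf / purity * average_cn


def multiplicity(mcn):
    """Round the mutation copy number to a whole number of copies, at least 1."""
    return max(1, round(mcn))


# Made-up example: 10 REF and 30 ALT reads in a pure tumour with 4 copies.
mcn = mutation_copy_number(ref_count=10, alt_count=30,
                           purity=1.0, total_cn_tumour=4)
print(mcn, multiplicity(mcn))
```

The site copy number itself would still have to come from the copy number profile (for example, the segment overlapping the INDEL position), which this sketch takes as given.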
Thanks