Hi, the purpose of my analysis is to get the paternal specific kmer, and to increase the specificity of kmer I chosed 62mer to perform analysis, because 72mer would lead to an error of meryl.
At first, I generated the 62mer of progeny, maternal and paternal with HiFi data for progeny and maternal, Illumina data for paternal.
meryl count k=62 progeny.fq output progeny.meryl
meryl count k=62 maternal.fq output maternal.meryl
meryl count k=62 paternal.fq output paternal.meryl
Then, the maternal/paternal count of common kmer between progeny and maternal/paternal were outputed.
meryl intersect maternal.meryl/ progeny.meryl/ output mat-progeny_commonk62.meryl
meryl intersect paternal.meryl/ progeny.meryl/ output pat-progeny_commonk62.meryl
and printed the kmer count:
meryl print mat-progeny_commonk62.meryl > mat-progeny.kmerCount
meryl print pat-progeny_commonk62.meryl > pat-progeny.kmerCount
Then, I merged the mat-progeny.kmerCount and pat-progeny.kmerCount, and I thought the maternal kmers whose count equals to 0 were paternal specific kmer. I extract the sequences of paternal specifc kmers and aligned them to the maternal genome (assembled by the same HiFi file with kmer count) with bowtie2, however, a large proportion of them could be aligned to maternal genome without mismatch or indel (CIGAR=62M), which means the paternal specific kmers could be find in maternal.
Here is my confusion, the paternal specific kmer should not be existed in maternal, why they could be aligned in maternal genome.
Hope for your reply, thanks very much.
Hi, the purpose of my analysis is to get the paternal specific kmer, and to increase the specificity of kmer I chosed 62mer to perform analysis, because 72mer would lead to an error of
meryl.At first, I generated the 62mer of
progeny,maternalandpaternalwith HiFi data forprogenyandmaternal, Illumina data forpaternal.meryl count k=62 progeny.fq output progeny.merylmeryl count k=62 maternal.fq output maternal.merylmeryl count k=62 paternal.fq output paternal.merylThen, the maternal/paternal count of common kmer between progeny and maternal/paternal were outputed.
meryl intersect maternal.meryl/ progeny.meryl/ output mat-progeny_commonk62.merylmeryl intersect paternal.meryl/ progeny.meryl/ output pat-progeny_commonk62.meryland printed the kmer count:
meryl print mat-progeny_commonk62.meryl > mat-progeny.kmerCountmeryl print pat-progeny_commonk62.meryl > pat-progeny.kmerCountThen, I merged the
mat-progeny.kmerCountandpat-progeny.kmerCount, and I thought the maternal kmers whose count equals to 0 were paternal specific kmer. I extract the sequences of paternal specifc kmers and aligned them to the maternal genome (assembled by the same HiFi file with kmer count) withbowtie2, however, a large proportion of them could be aligned to maternal genome without mismatch or indel (CIGAR=62M), which means the paternal specific kmers could be find in maternal.Here is my confusion, the paternal specific kmer should not be existed in maternal, why they could be aligned in maternal genome.
Hope for your reply, thanks very much.