Skip to content

individual specific kmer could be find in other individual #53

@leon945945

Description

@leon945945

Hi, the purpose of my analysis is to get the paternal specific kmer, and to increase the specificity of kmer I chosed 62mer to perform analysis, because 72mer would lead to an error of meryl.

At first, I generated the 62mer of progeny, maternal and paternal with HiFi data for progeny and maternal, Illumina data for paternal.
meryl count k=62 progeny.fq output progeny.meryl
meryl count k=62 maternal.fq output maternal.meryl
meryl count k=62 paternal.fq output paternal.meryl

Then, the maternal/paternal count of common kmer between progeny and maternal/paternal were outputed.
meryl intersect maternal.meryl/ progeny.meryl/ output mat-progeny_commonk62.meryl
meryl intersect paternal.meryl/ progeny.meryl/ output pat-progeny_commonk62.meryl
and printed the kmer count:
meryl print mat-progeny_commonk62.meryl > mat-progeny.kmerCount
meryl print pat-progeny_commonk62.meryl > pat-progeny.kmerCount

Then, I merged the mat-progeny.kmerCount and pat-progeny.kmerCount, and I thought the maternal kmers whose count equals to 0 were paternal specific kmer. I extract the sequences of paternal specifc kmers and aligned them to the maternal genome (assembled by the same HiFi file with kmer count) with bowtie2, however, a large proportion of them could be aligned to maternal genome without mismatch or indel (CIGAR=62M), which means the paternal specific kmers could be find in maternal.

Here is my confusion, the paternal specific kmer should not be existed in maternal, why they could be aligned in maternal genome.
Hope for your reply, thanks very much.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions