The personalized diploid genome of the KOLF iPSC cell line. KOLF2.1J is the parental iPSC line and serves as a reference cell line for distributed iPSC derivatives. It has a relatively stable karyotype, maintains pluripotency, exhibits consistent growth characteristics, and harbors fewer problematic structural abnormalities compared to other iPSC lines. Therefore, generating a complete telomere-to-telomere (T2T) diploid assembly for this valuable resource is warranted, as it would enable broad community use and facilitate the characterization of its personalized variants and genomic features.
For more data analysis and assembly procedures, please see the original paper:
- Jerez, P. A., Rhie, A., Kim, J., Hebbar, P., Nag, S., Antipov, D., ... & Blauwendraat, C. (2026). The complete genome of the KOLF2. 1J reference iPSC line. bioRxiv, 2026-03.
KOLF2.1Jv1.1.noY.fa: T2T reconstruction of 45 chromosomes of KOLF iPSC line, except chrY and, include chrM, you can download this from here.- This genome is viewable in the UCSC browser.
- 1:1 Liftover T2T-CHM13v2.0 <-> KOLF2.1Jv1.1, see here.
- KOLF2.1Jv1.1 primary <-> T2T-CHM13v2.0 :
KOLF2.1Jv1.1.pri_vs_chm13v2.chainKOLF2.1Jv1.1.pri_vs_chm13v2.inverted.chain
- KOLF2.1Jv1.1 alternative <-> T2T-CHM13v2.0 :
KOLF2.1Jv1.1.alt_vs_chm13v2.chainKOLF2.1Jv1.1.alt_vs_chm13v2.inverted.chain
- KOLF2.1Jv1.1 primary <-> T2T-CHM13v2.0 :
- 1:1 Liftover KOLF2.1Jv1.1 primary <-> alternative, see here.
- KOLF2.1Jv1.1 primary <-> KOLF2.1Jv1.1 alternative :
KOLF2.1Jv1.1.pri_vs_alt.chainKOLF2.1Jv1.1.pri_vs_alt.inverted.chain
- KOLF2.1Jv1.1 primary <-> KOLF2.1Jv1.1 alternative :
- All alignments on the diploid assembly, excluding chrY, have been stored here.
- Illumina
KOLF_IPS.noY.bam
- ONT
KOLF_ASTRO_IND12.sorted.noY.bamKOLF_ASTRO_NYSCF.sorted.noY.bamKOLF_IPS.mapped.sorted.noY.bamKOLF_IPSC_UL.noY.bamKOLF_MGL.mapped.sorted.noY.bamKOLF_NEURON.mapped.sorted.noY.bamKOLF_NGN2.mapped.sorted.noY.bam
- PacBio
KOLF_IPSC_PACBIO.noY.bam
- Methylation calls from ONT data are provided in BED file format by cell type and can be downloaded here.
ASTRO_IND12.mrg.modkit.pri.bed.gzASTRO_NYSCF.mrg.modkit.pri.bed.gzIPS.mrg.modkit.pri.bed.gzMGL.mrg.modkit.pri.bed.gzNEURON.mrg.modkit.pri.bed.gzNGN2.mrg.modkit.pri.bed.gz
- HiC aligned on the primary haplotype, you can download this from here.
iMicroglia.kolf2.1Jv1.1pri.autosomes_X_M.bamiPSc.kolf2.1Jv1.1pri.autosomes_X_M.bam
- Hi-C data aligned to the primary haplotype are available for download here.
KOLF_MAT.gff3KOLF_PAT_noY.gff3
All transcriptomic sequencing data are available for download [here].(https://zenodo.org/records/18777224).
- ONT direct sequencing data
ASTRO_IND12.ONT.noY.bamASTRO_NYSCF.ONT.noY.bamIPS.ONT.noY.bamMGL.ONT.noY.bamNEURON.ONT.noY.bamNGN2.ONT.noY.bamOLIGO_NYSCF.ONT.noY.bam
- PacBio ISOSEQ sequencing data
MGL_WT_1.noY.bamMGL_WT_2.noY.bamMGL_WT_3.noY.bam
- Illumina bulk RNA sequencing data
ipsc_illumina_hg38.noY.bam
SV calls and heterozygosity sites are also available for download here.
- SV calling
hapdiff_phased.sv.noY.bed
- The heterozygosity site between primary and alternative haplotype
hetsites.hetsites.fix.bed.gz
- CAGE sequencing data from the KOLF2.1J reference line for iPSC, cortical neuron, and microglia. This data was generated by DNAform. Average length from all three cell types is 75bp at around 3Gb per cell. Please visit here.
KOLF.IPS.noY.bamKOLF.Neuron.noY.bamKOLF.iMGL.noY.bam
Please raise issues on this Github repository concerning this dataset.
Jerez, P. A., Rhie, A., Kim, J., Hebbar, P., Nag, S., Antipov, D., ... & Blauwendraat, C. (2026). The complete genome of the KOLF2. 1J reference iPSC line. bioRxiv, 2026-03.
Feb 25, 2026 V1.1 assembly released.
Apr 7, 2026 Additional alignment data listed.
