The data in dbRIP is provided as the following flat files for downloading


Supplementary data file for manuscript by Tang et al, 2018
(header: dbRIP_ID|Original_ID|Type|Position|strand|Subfamily|Class|family| Fragment_count|length|chrSs|chrEs|cons_len|conSts|conEs|conLeft| TSD|TSD_length|5TSD|3TSD|Pre_motif|5TR|5TR_len|3TR|3TR_len|RIMD|RIMD_len)

sequences for each entry are presented in 9 lines in the following order, an empty line indicates the value being null:
5' flanking
5' TSD
5' TR
5' RMID
ME
3' RMID
3' TR
3' TSD
3' flanking

Release 2h (hg18)
RIP_type.txt contains all dbRIP data in a nohead tabular txt format.
The 23 fields are bin chrom chromStart chromEnd name score strand originalId forwardPrimer reversePrimer polyClass polyFamily polySubfamily polySeq polySource reference ascertainingMethod remarks tm fillsize emptysize disease genoRegion
RIP_*_genotype.txt contains the following columns: dbRIP ID, ethnic group name, number of samples with the insertion in the genotype of +/+, +/-, -/-, respectively.


Release 2h (hg19)
RIP_type.txt contains all dbRIP data in a nohead tabular txt format
The 23 fields are bin chrom chromStart chromEnd name score strand originalId forwardPrimer reversePrimer polyClass polyFamily polySubfamily polySeq polySource reference ascertainingMethod remarks tm fillsize emptysize disease genoRegion
Release 2 (hg19) share same genotype files with Release 2 (hg18)

Reference genome containing non-reference L1 insertions (201 L1s for a total of 817,696 bp inserted into hg38 (no alternative assemblies and unassigned assemblies)). Other related resource files are provided.

Back to dbRIP