Demonstrated Utility
The ease-of-use, consistency, and proven performances of Arima-HiC
workflow, and, the open-source Bioinformatics strategy have made the
Arima-HiC kits and services as a highly popular choice for generating
chromosome-spanning assemblies. Indeed, Arima-HiC has been selected
by the Genome10K consortium to generate high-quality assemblies and
base-polishing of >260 vertebrate species
6
.
An additional benefit is that the same Arima-HiC data generated for genome
scaffolding can then be used to investigate the 3D conformation of the
genome. Chromatin conformation information is useful for many studies
including cell development, gene regulation, and pathogen resistance.
Additional Details
Please refer to the Genome Conformation and Low Input Application Notes,
available by contacting info@arimagenomics.com.
Learn more online at arimagenomics.com
crosslink
DNA
digest with
Arima RE Cocktail
fill ends &
mark with biotin
purify, shear DNA &
pull down biotin
ligate
ligate adapters
PCR amplify
library
paired-end
sequencing
6-hour workflow using Arima-HiC kit
1-day workflow using commercial library
prep kits (e.g. KAPA Hyper Prep)
Figure 1: The Arima-HiC workflow results in ligated and biotinylated DNA that is prepared as a library and amplified using
pre-validated library prep kits, and then subject to paired-end Illumina sequencing.
Figure 2: The Arima-HiC workflow results in ligated and biotinylated DNA that is prepared as a library and amplified using pre-validated library prep kits,
and then subject to paired-end Illumina sequencing.
Figure 4: 125 Mb NG50 Human chromosome-spanning assemblies,
generated by Arima-HiC data 4Mb contigs from Oxford Nanopore converted to
125Mb assemblies via Arima-HiC data, using SALSA open-source tool. Charge in
color in the ideogram represents a contig gap or error. Post Arima-HiC assemblies
have minimal color change, suggesting long (NG50 125Mb and chromosome-span-
ning) & accurate assemblies. Figure from Ref (3), shared with permission.
References
1. Lieberman-Aiden E, et al “Comprehensive Mapping of Long-Range Interactions Reveals Folding
Principles of the Human Genome” Science 326, 289-293 (2009)
2. Rao SP, et al “A 3D Map of the Human Genome at Kilobase Resolution Reveals Principles of
Chromatin Looping” Cell 159, 1665-1680 (2014)
3. Ghurye J, et al “Integrating Hi-C links with assembly graphs for chromosome-scale assembly”
BioRxiv, doi: http://dx.doi.org/10.1101/261149 (2018)
4. SALSA Bioinformatics Tool for Arima-HiC. https://github.com/machinegun/SALSA
5. Dudchenko O, et al "De novo assembly of the Aedes aegypti genome using Hi-C yields chromo-
some-length scaffolds" Science 356, 92-95 (2017)
6. Genome10K Press Release (1). https://www.businesswire.com/news/home/20180117006561/en
Table 1: Arima-HiC Specifications
Total Time 6 hours
Hands-on Time 1 hour
Number of Steps 8
Automation Capability Single–tube, 96-well plate compatible
Restriction Enzymes (RE) RE cutting at GATC and GANTC
Per-base Genome Uniformity “Spread”
(fraction of genome with avg. seq. depth)
~90%, as estimated from Human genome. Similar
performance expected in all species.
Sample Types Seeds, Tissue, blood, cell lines, whole insects
Sample Storage Conditions Fresh/frozen, Cross-linked, Ethanol
Input Quantity Amount of sample manifesting ~1ug DNA, low input
protocols also available
Species Plants, Invertebrates, Vertebrates
Library Prep Compatibility KAPA Hyperprep, Swift Accel NGS 2S, Illumina TruSeq,
NEBNext Ultra II, others
NGS Compatibility Illumina NGS paired end reads
Library Complexity 1 reaction provides libraries complex enough for 600M
reads
Sequencing Depth (X), recommended Arima-HiC: 15-60X, depending on the contig NG50
Data Analysis SALSA tool (Ref 3) and other open source tools
recommended
Figure 3: Arima-HiC data generates higher inter-contig signal strength
and spread, critical features that enable chromosome-spanning
assemblies at reduced sequencing. (A) Arima-HiC data and Competitor’s Hi-C
(data generated by Company D) is mapped to Hummingbird and Zebrafinch
contigs generated by Pacific Biosciences Sequencers. Regardless of the species,
Arima-HiC consistently generates higher inter-contig signal. (B) Hummingbird
Hi-C datasets from Arima Genomics and Competition analyzed in the context of
insert-sizes. That is, when Hi-C reads are categorized by insert-sizes, Arima-HiC
maintains high signal (2-3 fold) & coverage regardless of insert-size. In the
context of assembly, this “spread” of signal can enable contigs of all gap-sizes to
be well-assembled, to generate accurate assemblies at reduced sequencing cost.
To enable an unbiased analyses, these were performed with 2M Hi-C reads.
Analyses performed by Arang Rhie (NIH). Shared with permission.
A
Inter-contig signal
0
20
40
60
Hummingbird Zebrafinch
B
Per-base coverage
5
10
15
20
0
>100 >10,000 >1,000,000
Inserts or Gap-Size
Company D
Arima-HiC