biology/hisat2/hisat2-2.2.1

# Graph-Based Genome Alignment and Genotyping with HISAT2 and HISAT-genotype

## Contact

[Daehwan Kim](https://kim-lab.org) (infphilo@gmail.com) and [Chanhee Park](https://www.linkedin.com/in/chanhee-park-97677297/) (parkchanhee@gmail.com)

## Abstract

Rapid advances in next-generation sequencing technologies have dramatically changed our ability to perform genome-scale analyses. The human reference genome used for most genomic analyses represents only a small number of individuals, limiting its usefulness for genotyping. We designed a novel method, HISAT2, for representing and searching an expanded model of the human reference genome, in which a large catalogue of known genomic variants and haplotypes is incorporated into the data structure used for searching and alignment. This strategy for representing a population of genomes, along with a fast and memory-efficient search algorithm, enables more detailed and accurate variant analyses than previous methods. We demonstrate two initial applications of HISAT2: HLA typing, a critical need in human organ transplantation, and DNA fingerprinting, widely used in forensics. These applications are part of HISAT-genotype, with performance not only surpassing earlier computational methods, but matching or exceeding the accuracy of laboratory-based assays.

![](HISAT2-genotype.png)

For more information, see the following websites:
* [HISAT2 website](http://ccb.jhu.edu/software/hisat2)
* [HISAT-genotype website](http://ccb.jhu.edu/software/hisat-genotype)

## HISAT2
HISAT2 is a fast and sensitive alignment program for mapping next-generation sequencing reads (whole-genome, transcriptome, and exome sequencing data) to a population of human genomes (as well as to a single reference genome). Based on an extension of BWT for a graph [1], we designed and implemented a graph FM index (GFM), an original approach and its first implementation to the best of our knowledge. In addition to using one global GFM index that represents general population, HISAT2 uses a large set of small GFM indexes that collectively cover the whole genome (each index representing a genomic region of 56 Kbp, with 55,000 indexes needed to cover human population). These small indexes (called local indexes) combined with several alignment strategies enable effective alignment of sequencing reads. This new indexing scheme is called Hierarchical Graph FM index (HGFM). We have developed HISAT2 based on the HISAT [2] and Bowtie 2 [3] implementations.  See the [HISAT2 website](http://ccb.jhu.edu/software/hisat2/index.shtml) for
more information.

A few notes:

1) HISAT2's index (HGFM) size for the human reference genome and 12.3 million common SNPs is 6.2GB. The SNPs consist of 11 million single nucleotide polymorphisms, 728,000 deletions, and 555,000 insertions. Insertions and deletions used in this index are small (usually <20bp). We plan to incorporate structural variations (SV) into this index.

2) The memory footprint of HISAT2 is relatively low, 6.7GB.

3) The runtime of HISAT2 is estimated to be slightly slower than HISAT (30–100% slower for some data sets).

4) HISAT2 provides greater accuracy for alignment of reads containing SNPs.

5) We released a first (beta) version of HISAT2 in September 8, 2015.

## License

[GPL-3.0](LICENSE)

# For reviwers, follow the instructions below to reproduce some of the results in the manuscript.

## Code

A specific version of [HISAT2 and HISAT-genotype](http://github.com/infphilo/hisat2) at GitHub is used (a branch name: hisat2_v2.2.0_beta).

## Initial setup

HISAT-genotype requires a 64-bit computer running either Linux or Mac OS X and at least 8 GB of RAM (16 GB of RAM is preferred). All the commands used should be run from the Unix shell prompt within a terminal window and are prefixed with a '$' character.

We refer to <b>hisat-genotype-top</b> as our top directory where all of our programs are located. <b>hisat-genotype-top</b> is a place holder that can be changed to another name according to user preference.
Run the following commands to install HISAT2 and HISAT-genotype.

    $ git clone https://github.com/infphilo/hisat2 hisat-genotype-top
    $ cd hisat-genotype-top
    hisat-genotype-top$ git checkout hisat2_v2.2.0_beta
    hisat-genotype-top$ make hisat2-align-s hisat2-build-s hisat2-inspect-s

To make the binaries built above and other python scripts available everywhere, add the hisat-genotype-top directory to the PATH environment variable (e.g. ~/.bashrc)

    export PATH=hisat-genotype-top:hisat-genotype-top/hisatgenotype_scripts:$PATH
    export PYTHONPATH=hisat-genotype-top/hisatgenotype_modules:$PYTHONPATH

To reflect the change, run the following command:

    $ source ~/.bashrc

Download real reads, simulated reads, and HISAT2 indexes, then move them into appropriate directories:

    hisat-genotype-top$ cd evaluation
    hisat-genotype-top/evaluation$ wget ftp://ftp.ccb.jhu.edu/pub/infphilo/hisat2/data/hisat2_20181025.tar.gz
    hisat-genotype-top/evaluation$ tar xvzf hisat2_20181025.tar.gz
    hisat-genotype-top/evaluation$ mkdir aligners aligners/bin; cd aligners/bin; ln -s ../../../hisat2* .; cd ../..
    hisat-genotype-top/evaluation$ mv hisat2/* .
    hisat-genotype-top/evaluation$ cd simulation; ./init.py; cd ../real; ./init.py; cd ..

## Run HISAT2 on the following simulated and real data sets.
###	10 million simulated read pairs with SNPs and with sequencing errors

    hisat-genotype-top/evaluation$ cd simulation/10M_DNA_mismatch_snp_reads_genome
    hisat-genotype-top/evaluation/simulation/10M_DNA_mismatch_snp_reads_genome$ ./calculate_read_cost.py --aligner-list hisat2 --paired-end --fresh

### 10 million simulated read pairs with SNPs and without sequencing errors

    hisat-genotype-top/evaluation$ cd simulation/10M_DNA_snp_reads_genome
    hisat-genotype-top/evaluation/simulation/10M_DNA_snp_reads_genome$ ./calculate_read_cost.py --aligner-list hisat2 --paired-end --fresh

### 10 million simulated read pairs without SNPs and with sequencing errors
    hisat-genotype-top/evaluation$ cd simulation/10M_DNA_mismatch_reads_genome
    hisat-genotype-top/evaluation/simulation/10M_DNA_mismatch_reads_genome$ ./calculate_read_cost.py --aligner-list hisat2 --paired-end --fresh

### 10 million simulated read pairs without SNPs and without sequencing errors
    hisat-genotype-top/evaluation$ cd simulation/10M_DNA_reads_genome
    hisat-genotype-top/evaluation/simulation/10M_DNA_reads_genome$ ./calculate_read_cost.py --aligner-list hisat2 --paired-end --fresh

### 10 million real read pairs
    hisat-genotype-top/evaluation$ cd real/DNA/10M
    hisat-genotype-top/evaluation/real/DNA/10M$ ./calculate_read_cost.py --aligner-list hisat2 --paired-end --fresh

### Interpreting output
    Example alignment output for simulated reads
    aligned: 1000000, multi aligned: 2654390
		    correctly mapped: 999963 (100.00%)
		    uniquely and correctly mapped: 967631 (96.76%)
			    54694 reads per sec (all)
			    Memory Usage: 86MB

The above lines show that 1,000,000 read pairs are aligned and the total number of alignments is 2,654,390. 999,963 pairs (100.00%) are correctly aligned (e.g. one of the alignments is correct). 967,631 (96.76%) pairs are uniquely and correctly aligned. HISAT2 aligns 54,594 reads with a peak memory usage of 86 MB of RAM.

Each run is expected to take up to several hours mostly due to the comparison of HISAT2’s reported alignments and true alignments and the expansion of repeat alignments.

## Details on HISAT-genotype run for HLA typing and assembly

To create a directory where we perform our analysis for HLA typing and assembly, which here is referred to as hla-analysis but can be changed by the user, execute the following command.

    hisat-genotype-top/evaluation$ mkdir hla-analysis

The current directory can be changed to hla-analysis as follows:

    hisat-genotype-top/evaluation$ cd hla-analysis

Additional program requirements: SAMtools (version 1.3 or later)

### Downloading a Graph Reference and Index
The graph reference we are going to build incorporates variants of numerous HLA alleles into the linear reference using a graph. The graph reference also includes some known variants of other regions of the genome (e.g. common small variants). To copy the graph reference, type:

    hisat-genotype-top/evaluation/hla-analysis$ mv ../hisat2-genotype/* .

### Typing and Assembly
Since whole genome sequencing (WGS) data includes reads that are from the whole genome, the first step is to extract the reads that belong to the HLA genes by aligning them to the graph reference with HISAT2. We provide these extracted reads in hisat-genotype-top/evaluation/hla-analysis/ILMN_20181025.

HISAT-genotype performs both HLA typing and assembly as follows.
You can perform HLA typing and assembly for HLA-A gene on sequencing reads from the genome NA12892 (Illumina's HiSeq 2000 platform).

    hisat-genotype-top/evaluation/hla-analysis$ hisatgenotype_locus.py --base hla --locus-list A --assembly -1 ILMN_20181025/NA12892.hla.extracted.1.fq.gz -2 ILMN_20181025/NA12892.hla.extracted.2.fq.gz

### DNA Fingerprinting
This function can be performed with the same commands used for “Typing and Assembly” and just replacing --base hla with --base codis.

### Interpreting Output
    Typing Output
    Number of reads aligned: 1507
      1 A*02:01:01:02L (count: 571)
      2 A*02:01:31 (count: 557)
      3 A*02:20:02 (count: 557)
      4 A*02:29 (count: 557)
      5 A*02:321N (count: 556)
      6 A*02:372 (count: 556)
      7 A*02:610:02 (count: 556)
      8 A*02:249 (count: 555)
      9 A*02:479 (count: 555)
      10 A*02:11:01 (count: 554)

The above lines show the top ten alleles that the most number of reads are mapped to or compatible with. For example, the allele first ranked, A\*02:01:01:02L, is compatible with 571 reads. This raw estimate based on the number of reads should not be used to determine the two true alleles because the alleles that resemble both but are not true alleles often tend to be compatible with more reads than either of the true alleles. Thus, we apply a statistical model to identify the two true alleles as described in the main text.

    Abundance of alleles
      1 ranked A*02:01:01:01 (abundance: 54.32%)
      2 ranked A*11:01:01:01 (abundance: 45.20%)
      3 ranked A*24:33 (abundance: 0.48%)

The above rankings show the top three alleles that are most abundant in the sample. Normally, the top two alleles in this estimate (e.g. A\*02:01:01:01 and A\*11:01:01:01) are considered as the two alleles that best match a given sequencing data.

Additional tutorials and details are available at the HISAT-genotype website: https://ccb.jhu.edu/hisat-genotype


## Data

The Data directory (`/data`) contains all input files for reproducing some of our results such as from the
evaluation of HISAT2 and other programs using both simulated and real reads, from typing and assembling
HLA genes of Illumina Platinum Genomes using HISAT-genotype, and from building a HISAT2 graph index.

* **Simulated read pairs**

| Type | Number of pairs | Path |
| - | - | - |
| SNPs and sequencing errors included | 10,000,000 | hisat-genotype-top/evaluation/reads/simulation/10M_DNA_mismatch_snp_reads_genome |
| SNPs included | 10,000,000 | hisat-genotype-top/evaluation/reads/simulation/10M_DNA_snp_reads_genome |
| Sequencing errors included | 10,000,000 | hisat-genotype-top/evaluation/reads/simulation/10M_DNA_mismatch_reads_genome |
| No SNPs nor sequencing errors included | 10,000,000 | hisat-genotype-top/evaluation/reads/simulation/10M_DNA_reads_genome |

Each directory comes with a true alignment file in SAM format so that users know where the reads were generated in the human reference genome.

* **Real read pairs**

| Number of read pairs | Path |
| - | - |
| 10,000,000 | hisat-genotype-top/evaluation/reads/real/DNA |

* **Human reference genome, SNPs, haplotypes, and HISAT2's indexes**

| Type | Path |
| - | - |
| GRCh38 reference | hisat-genotype-top/evaluation/data/genome.fa |
| SNPs | hisat-genotype-top/evaluation/data/genome.snp |
| Haplotypes | hisat-genotype-top/evaluation/data/genome.haplotype |
| HISAT2's prebuilt graph index for comparison with other aligners | hisat-genotype-top/evaluation/indexes/HISAT2/genome.[1-8].ht2 |
| HISAT2's prebuilt graph index for genotyping | hisat-genotype-top/evaluation/hla-analysis/genotype_genome.[1-8].ht2 |


## References

[1] Sirén J, Välimäki N, Mäkinen V (2014) Indexing graphs for path queries with applications in genome research. IEEE/ACM Transactions on Computational Biology and Bioinformatics 11: 375–388. doi: 10.1109/tcbb.2013.2297101

[2] Kim D, Langmead B, and Salzberg SL  HISAT: a fast spliced aligner with low memory requirements, Nature methods, 2015

[3] Langmead B, Salzberg SL: Fast gapped-read alignment with Bowtie 2. Nat Methods 2012, 9:357-359
Name		Date	Size	#Lines	LOC
..		03-May-2022	-
docs/	H	03-May-2022	-	3,321	2,764
docs_jhu/	H	03-May-2022	-	1,065	915
evaluation/	H	03-May-2022	-	10,832	8,841
example/	H	24-Jul-2020	-	24,174	24,170
hisat2.xcodeproj/	H	24-Jul-2020	-	1,308	1,297
hisat2lib/	H	24-Jul-2020	-	1,809	1,047
li_hla/	H	24-Jul-2020	-	799	648
msvcc/	H	24-Jul-2020	-	4,047	3,424
scripts/	H	03-May-2022	-	13,072	9,241
third_party/	H	24-Jul-2020	-	188	122
.gitattributes	H A D	24-Jul-2020	29	2	1
.gitignore	H A D	24-Jul-2020	713	44	39
AUTHORS	H A D	24-Jul-2020	1.3 KiB	30	21
LICENSE	H A D	24-Jul-2020	34.3 KiB	675	553
MANUAL	H A D	24-Jul-2020	63.8 KiB	1,468	1,031
MANUAL.markdown	H A D	24-Jul-2020	77.4 KiB	2,438	1,564
Makefile	H A D	03-May-2022	15.2 KiB	565	424
NEWS	H A D	24-Jul-2020	584	17	12
README.md	H A D	24-Jul-2020	12.7 KiB	203	132
TUTORIAL	H A D	24-Jul-2020	202	5	3
VERSION	H A D	24-Jul-2020	6	2	1
_config.yml	H A D	24-Jul-2020	32	1	1
aligner_bt.cpp	H A D	24-Jul-2020	52.7 KiB	1,773	1,474
aligner_bt.h	H A D	24-Jul-2020	30.2 KiB	948	521
aligner_cache.cpp	H A D	24-Jul-2020	4.1 KiB	182	140
aligner_cache.h	H A D	24-Jul-2020	26.3 KiB	1,014	542
aligner_driver.cpp	H A D	24-Jul-2020	2.2 KiB	81	55
aligner_driver.h	H A D	24-Jul-2020	8.2 KiB	248	115
aligner_metrics.h	H A D	24-Jul-2020	11 KiB	353	247
aligner_report.h	H A D	24-Jul-2020	1,006	36	10
aligner_result.cpp	H A D	24-Jul-2020	55.7 KiB	2,130	1,750
aligner_result.h	H A D	24-Jul-2020	64.8 KiB	2,319	1,436
aligner_seed.cpp	H A D	24-Jul-2020	16.1 KiB	531	382
aligner_seed.h	H A D	24-Jul-2020	84 KiB	2,923	2,113
aligner_seed2.cpp	H A D	24-Jul-2020	40.7 KiB	1,246	890
aligner_seed2.h	H A D	24-Jul-2020	127.6 KiB	4,292	3,061
aligner_seed_policy.cpp	H A D	24-Jul-2020	29.6 KiB	917	615
aligner_seed_policy.h	H A D	24-Jul-2020	8.2 KiB	235	42
aligner_sw.cpp	H A D	24-Jul-2020	101.5 KiB	3,215	2,868
aligner_sw.h	H A D	03-May-2022	24.9 KiB	651	321
aligner_sw_common.h	H A D	24-Jul-2020	8.5 KiB	306	211
aligner_sw_driver.cpp	H A D	24-Jul-2020	736	21	0
aligner_sw_driver.h	H A D	24-Jul-2020	105.9 KiB	2,939	2,312
aligner_sw_nuc.h	H A D	24-Jul-2020	7 KiB	263	161
aligner_swsse.cpp	H A D	24-Jul-2020	2.6 KiB	89	50
aligner_swsse.h	H A D	24-Jul-2020	14.8 KiB	501	274
aligner_swsse_ee_i16.cpp	H A D	24-Jul-2020	62.1 KiB	1,912	1,401
aligner_swsse_ee_u8.cpp	H A D	24-Jul-2020	61.2 KiB	1,903	1,391
aligner_swsse_loc_i16.cpp	H A D	24-Jul-2020	74.5 KiB	2,273	1,599
aligner_swsse_loc_u8.cpp	H A D	24-Jul-2020	73.3 KiB	2,267	1,608
aln_sink.cpp	H A D	24-Jul-2020	25.4 KiB	786	648
aln_sink.h	H A D	24-Jul-2020	106 KiB	3,253	2,257
alphabet.cpp	H A D	24-Jul-2020	18.9 KiB	441	297
alphabet.h	H A D	24-Jul-2020	5.5 KiB	200	94
alt.h	H A D	24-Jul-2020	8.3 KiB	295	234
assert_helpers.h	H A D	24-Jul-2020	9.3 KiB	280	234
banded.cpp	H A D	24-Jul-2020	823	28	6
banded.h	H A D	24-Jul-2020	1.2 KiB	53	20
binary_sa_search.h	H A D	24-Jul-2020	3.5 KiB	103	58
bit_packed_array.cpp	H A D	24-Jul-2020	7.1 KiB	316	217
bit_packed_array.h	H A D	24-Jul-2020	2.8 KiB	106	56
bitpack.h	H A D	24-Jul-2020	2.2 KiB	81	41
blockwise_sa.h	H A D	24-Jul-2020	39.5 KiB	1,114	824
bp_aligner.h	H A D	24-Jul-2020	61.7 KiB	1,238	1,104
btypes.h	H A D	24-Jul-2020	1.3 KiB	49	21
ccnt_lut.cpp	H A D	24-Jul-2020	1.9 KiB	81	50
diff_sample.cpp	H A D	24-Jul-2020	4.5 KiB	118	70
diff_sample.h	H A D	24-Jul-2020	30.5 KiB	1,001	753
dp_framer.cpp	H A D	24-Jul-2020	36.4 KiB	911	542
dp_framer.h	H A D	24-Jul-2020	9.1 KiB	262	134
ds.cpp	H A D	24-Jul-2020	3.1 KiB	156	116
ds.h	H A D	24-Jul-2020	89.3 KiB	4,398	2,621
edit.cpp	H A D	24-Jul-2020	12.3 KiB	502	384
edit.h	H A D	24-Jul-2020	9.9 KiB	402	227
endian_swap.h	H A D	24-Jul-2020	4.1 KiB	161	90
extract_exons.py	H A D	03-May-2022	5.4 KiB	160	115
extract_splice_sites.py	H A D	03-May-2022	4.9 KiB	138	96
fast_mutex.h	H A D	24-Jul-2020	8.4 KiB	295	199
filebuf.h	H A D	24-Jul-2020	15.5 KiB	719	451
formats.h	H A D	24-Jul-2020	1.2 KiB	58	30
gbwt_graph.h	H A D	24-Jul-2020	102.9 KiB	2,798	2,328
gfm.cpp	H A D	24-Jul-2020	2.3 KiB	73	38
gfm.h	H A D	24-Jul-2020	252.3 KiB	6,976	5,498
gp.h	H A D	24-Jul-2020	2 KiB	84	41
group_walk.cpp	H A D	24-Jul-2020	759	21	1
group_walk.h	H A D	24-Jul-2020	55.6 KiB	1,625	1,154
hgfm.h	H A D	24-Jul-2020	102.2 KiB	2,654	2,199
hi_aligner.h	H A D	24-Jul-2020	288.1 KiB	6,925	6,082
hier_idx_common.h	H A D	24-Jul-2020	1.6 KiB	44	11
hisat2	H A D	03-May-2022	19.4 KiB	666	542
hisat2-build	H A D	03-May-2022	2.7 KiB	96	74
hisat2-build-new	H A D	24-Jul-2020	3 KiB	101	77
hisat2-inspect	H A D	03-May-2022	2.6 KiB	74	56
hisat2.cpp	H A D	24-Jul-2020	176 KiB	4,371	3,792
hisat2.sln	H A D	24-Jul-2020	5.3 KiB	83	81
hisat2_build.cpp	H A D	24-Jul-2020	32.3 KiB	850	749
hisat2_build_main.cpp	H A D	24-Jul-2020	2 KiB	71	39
hisat2_extract_exons.py	H A D	03-May-2022	5.4 KiB	160	115
hisat2_extract_snps_haplotypes_UCSC.py	H A D	03-May-2022	18.9 KiB	579	476
hisat2_extract_snps_haplotypes_VCF.py	H A D	03-May-2022	34.4 KiB	924	778
hisat2_extract_splice_sites.py	H A D	03-May-2022	4.9 KiB	138	96
hisat2_inspect.cpp	H A D	24-Jul-2020	28 KiB	792	688
hisat2_main.cpp	H A D	24-Jul-2020	2 KiB	70	38
hisat2_read_statistics.py	H A D	03-May-2022	5.5 KiB	237	156
hisat2_repeat.cpp	H A D	24-Jul-2020	31 KiB	884	750
hisat2_repeat_main.cpp	H A D	24-Jul-2020	2 KiB	71	39
hisat2_simulate_reads.py	H A D	03-May-2022	34.4 KiB	972	831
hisat_bp.cpp	H A D	24-Jul-2020	151.3 KiB	3,886	3,354
ival_list.cpp	H A D	24-Jul-2020	5.1 KiB	166	136
ival_list.h	H A D	24-Jul-2020	6.5 KiB	300	171
limit.cpp	H A D	24-Jul-2020	1.8 KiB	44	22
limit.h	H A D	24-Jul-2020	1.3 KiB	49	25
ls.cpp	H A D	24-Jul-2020	3.6 KiB	143	114
ls.h	H A D	24-Jul-2020	11.7 KiB	334	232
mask.cpp	H A D	24-Jul-2020	1 KiB	37	13
mask.h	H A D	24-Jul-2020	2.1 KiB	80	36
mem_ids.h	H A D	24-Jul-2020	1.2 KiB	36	9
mm.h	H A D	24-Jul-2020	1.5 KiB	52	20
multikey_qsort.cpp	H A D	24-Jul-2020	763	21	1
multikey_qsort.h	H A D	24-Jul-2020	38 KiB	1,238	938
opts.h	H A D	24-Jul-2020	7.2 KiB	195	172
outq.cpp	H A D	24-Jul-2020	5.4 KiB	202	159
outq.h	H A D	24-Jul-2020	3.2 KiB	150	83
pat.cpp	H A D	24-Jul-2020	46.3 KiB	1,800	1,478
pat.h	H A D	24-Jul-2020	43.6 KiB	1,789	1,175
pe.cpp	H A D	24-Jul-2020	30.9 KiB	942	692
pe.h	H A D	24-Jul-2020	9.5 KiB	322	161
presets.cpp	H A D	24-Jul-2020	2.6 KiB	88	55
presets.h	H A D	24-Jul-2020	1.5 KiB	68	26
processor_support.h	H A D	03-May-2022	2.3 KiB	73	41
qual.cpp	H A D	24-Jul-2020	3.9 KiB	86	58
qual.h	H A D	24-Jul-2020	6.5 KiB	237	159
radix_sort.h	H A D	24-Jul-2020	10.5 KiB	298	262
random_source.cpp	H A D	24-Jul-2020	3.2 KiB	129	99
random_source.h	H A D	24-Jul-2020	5.2 KiB	240	145
random_util.cpp	H A D	24-Jul-2020	907	25	4
random_util.h	H A D	24-Jul-2020	5.8 KiB	222	121
read.h	H A D	24-Jul-2020	14 KiB	534	375
read_qseq.cpp	H A D	24-Jul-2020	8.2 KiB	305	226
ref_coord.cpp	H A D	24-Jul-2020	1,014	34	11
ref_coord.h	H A D	24-Jul-2020	10.2 KiB	430	234
ref_read.cpp	H A D	24-Jul-2020	12.1 KiB	455	338
ref_read.h	H A D	24-Jul-2020	8.4 KiB	325	231
reference.cpp	H A D	24-Jul-2020	22.6 KiB	715	593
reference.h	H A D	24-Jul-2020	5.8 KiB	192	89
repeat.h	H A D	24-Jul-2020	24.5 KiB	628	530
repeat_builder.cpp	H A D	24-Jul-2020	160 KiB	4,756	4,027
repeat_builder.h	H A D	24-Jul-2020	26.8 KiB	963	747
repeat_kmer.h	H A D	24-Jul-2020	20.3 KiB	607	533
rfm.h	H A D	24-Jul-2020	41 KiB	1,137	970
sam.h	H A D	24-Jul-2020	38.9 KiB	1,255	1,033
scoring.cpp	H A D	24-Jul-2020	9.3 KiB	287	200
scoring.h	H A D	24-Jul-2020	17.6 KiB	547	318
search_globals.h	H A D	24-Jul-2020	1.4 KiB	49	25
sequence_io.h	H A D	24-Jul-2020	3.6 KiB	126	86
shmem.cpp	H A D	24-Jul-2020	1.3 KiB	50	17
shmem.h	H A D	24-Jul-2020	4.9 KiB	162	121
simple_func.cpp	H A D	24-Jul-2020	2.3 KiB	94	71
simple_func.h	H A D	24-Jul-2020	3.4 KiB	126	77
splice_site.cpp	H A D	24-Jul-2020	32.8 KiB	851	754
splice_site.h	H A D	24-Jul-2020	17.6 KiB	616	380
splice_site_mem.h	H A D	24-Jul-2020	1 MiB	6,225	6,212
splice_site_new.cpp	H A D	24-Jul-2020	45.6 KiB	1,158	1,028
spliced_aligner.h	H A D	24-Jul-2020	113.1 KiB	2,055	1,914
sse_util.cpp	H A D	24-Jul-2020	979	34	10
sse_util.h	H A D	03-May-2022	14 KiB	577	345
sstring.cpp	H A D	24-Jul-2020	5.3 KiB	203	168
sstring.h	H A D	24-Jul-2020	75.3 KiB	3,451	2,149
str_util.h	H A D	24-Jul-2020	1.1 KiB	48	22
threading.h	H A D	24-Jul-2020	1.3 KiB	58	28
timer.h	H A D	24-Jul-2020	2.4 KiB	88	50
tinythread.cpp	H A D	24-Jul-2020	8.9 KiB	321	203
tinythread.h	H A D	24-Jul-2020	20.7 KiB	715	376
tokenize.h	H A D	24-Jul-2020	1.7 KiB	63	33
tp.h	H A D	24-Jul-2020	3.7 KiB	119	73
unique.cpp	H A D	24-Jul-2020	2.3 KiB	67	23
unique.h	H A D	24-Jul-2020	14.4 KiB	532	376
util.h	H A D	24-Jul-2020	1.5 KiB	54	24
word_io.h	H A D	24-Jul-2020	8.1 KiB	394	240
zbox.h	H A D	24-Jul-2020	2.7 KiB	98	62