• Home
  • History
  • Annotate
Name Date Size #Lines LOC

..03-May-2022-

.github/H16-Aug-2020-2114

cmake/H03-May-2022-554471

data/H07-May-2022-84,03582,994

lib/H16-Aug-2020-73,75063,348

scripts/H03-May-2022-43,55333,152

src/H03-May-2022-35,65524,695

.travis.ymlH A D16-Aug-20204.1 KiB151147

DockerfileH A D16-Aug-20201.8 KiB4939

LICENSEH A D16-Aug-202034.3 KiB675553

README.mdH A D16-Aug-20204.4 KiB8158

README.md

1# HH-suite3 for sensitive sequence searching
2
3(C) Johannes Soeding, Markus Meier, Martin Steinegger, Milot Mirdita, Michael Remmert, Andreas Hauser, Andreas Biegert
4
5[![BioConda Install](https://img.shields.io/conda/dn/bioconda/hhsuite.svg?style=flag&label=BioConda%20install)](https://anaconda.org/bioconda/hhsuite)
6[![Github All Releases](https://img.shields.io/github/downloads/soedinglab/hh-suite/total.svg)](https://github.com/soedinglab/hh-suite/releases/latest)
7[![Docker Pulls](https://img.shields.io/docker/pulls/soedinglab/hh-suite.svg)](https://hub.docker.com/r/soedinglab/hh-suite)
8[![Travis Build Status](https://travis-ci.org/soedinglab/hh-suite.svg?branch=master)](https://travis-ci.org/soedinglab/hh-suite)
9[![Codeship Status](https://codeship.com/projects/0936c290-2248-0133-bcb4-52bb0fef976f/status?branch=master)](https://codeship.com/projects/96085)
10
11The HH-suite is an open-source software package for sensitive protein sequence searching based on the pairwise alignment of hidden Markov models (HMMs).
12
13## Documentation
14
15We provide an extensive [user guide](https://github.com/soedinglab/hh-suite/wiki) with many usage examples, frequently asked questions and guides to build your own databases.
16
17### Installation
18
19HH-suite3 can also be installed by downloading a statically compiled version, [conda](https://github.com/conda/conda) or [Docker](https://github.com/moby/moby). HH-suite3 requires a 64-bit system (check with `uname -a | grep x86_64`) with at least the SSE2 instruction set (check by executing `cat /proc/cpuinfo | grep sse2` on Linux or `sysctl -a | grep machdep.cpu.features | grep SSE2` on macOS). `AVX2` is roughly 2x faster compared to SSE2.
20
21```
22# install via conda
23conda install -c conda-forge -c bioconda hhsuite
24# install docker
25docker pull soedinglab/hh-suite
26# static SSE2 build
27wget https://github.com/soedinglab/hh-suite/releases/download/v3.2.0/hhsuite-3.2.0-SSE2-Linux.tar.gz; tar xvfz hhsuite-3.2.0-SSE2-Linux.tar.gz; export PATH="$(pwd)/bin:$(pwd)/scripts:$PATH"
28# static AVX2 build
29wget https://github.com/soedinglab/hh-suite/releases/download/v3.2.0/hhsuite-3.2.0-AVX2-Linux.tar.gz; tar xvfz hhsuite-3.2.0-AVX2-Linux.tar.gz; export PATH="$(pwd)/bin:$(pwd)/scripts:$PATH"
30```
31:exclamation: Only the self-compiled HH-suite3 version includes MPI support, since MPI configuration is specific to the local environment.
32
33### Available Databases
34List of available database for HH-suite3:
35  - [Uniclust30](https://uniclust.mmseqs.com) [[pub]](https://doi.org/10.1093/nar/gkw1081)
36  - [BFD](https://bfd.mmseqs.com) (consists of 2.5 billion, mostly enviromental, protein sequences) [[pub]](https://doi.org/10.1038/s41592-019-0437-4)
37  - [Pfam/SCOP/PDB70/dbCAN](http://wwwuser.gwdg.de/~compbiol/data/hhsuite/databases/hhsuite_dbs/)
38
39
40### Compilation
41To compile from source, you will need a recent C/C++ compiler (at least GCC 4.8 or Clang 3.6) and [CMake](http://cmake.org/) 2.8.12 or later.
42
43To download the source code and compile the HH-suite execute the following commands:
44```
45git clone https://github.com/soedinglab/hh-suite.git
46mkdir -p hh-suite/build && cd hh-suite/build
47cmake -DCMAKE_INSTALL_PREFIX=. ..
48make -j 4 && make install
49export PATH="$(pwd)/bin:$(pwd)/scripts:$PATH"
50```
51
52:exclamation: To compile HH-suite3 on macOS, first install the `gcc` compiler from [Homebrew](https://brew.sh). The default macOS `clang` compiler does not support OpenMP and HH-suite3 will only be able to use a single thread. Then replace the `cmake` call above with the following one:
53
54```
55CC="$(brew --prefix)/bin/gcc-8" CXX="$(brew --prefix)/bin/g++-8" cmake -DCMAKE_INSTALL_PREFIX=. ..
56```
57
58
59## Usage
60For performing a single search iteration of HHblits, run HHblits with the following command:
61```
62hhblits -i <input-file> -o <result-file> -n 1 -d <database-basename>
63```
64
65For generating an alignment of homologous sequences:
66```
67hhblits -i <input-file> -o <result-file> -oa3m <result-alignment> -d <database-basename>
68```
69
70A detailed list of options for HHblits is available by running HHblits with the `-h` parameter.
71
72## Reference
73
74Steinegger M, Meier M, Mirdita M, Vöhringer H, Haunsberger S J, and Söding J (2019)
75HH-suite3 for fast remote homology detection and deep protein annotation, *BMC Bioinformatics*, 473. [doi: 10.1186/s12859-019-3019-7](https://doi.org/10.1186/s12859-019-3019-7)
76
77## Links
78
79* [Soeding lab](http://www.mpibpc.mpg.de/soeding)
80* [Databases for the HH-suite](http://wwwuser.gwdg.de/~compbiol/data/hhsuite/)
81