• Home
  • History
  • Annotate
Name Date Size #Lines LOC

..03-May-2022-

.github/workflows/H23-Jan-2021-2015

data/H03-May-2022-1,6581,656

docs/H03-May-2022-11,0377,981

genomes/H23-Jan-2021-747738

scripts/H23-Jan-2021-7563

src/H23-Jan-2021-103,19471,102

test/H23-Jan-2021-1,115,4711,111,405

tutorial/H03-May-2022-5,3435,067

.gitignoreH A D23-Jan-2021617 3433

LICENSEH A D23-Jan-20211.1 KiB2419

MakefileH A D03-May-20229.7 KiB305206

README.mdH A D23-Jan-20213.2 KiB5234

README.md

1![shamalamadingdong](https://github.com/arq5x/bedtools2/blob/master/docs/bedtools.swiss.png "foo")
2
3bedtools - the swiss army knife for genome arithmetic
4=====================================================
5
6[Download current version](https://github.com/arq5x/bedtools2/releases/latest)
7
8[Documentation](http://bedtools.readthedocs.org)
9
10Summary
11-------
12Collectively, the bedtools utilities are a swiss-army knife of tools for a wide-range of genomics analysis tasks. The most widely-used tools enable genome arithmetic: that is, set theory on the genome. For example, bedtools allows one to intersect, merge, count, complement, and shuffle genomic intervals from multiple files in widely-used genomic file formats such as BAM, BED, GFF/GTF, VCF.
13
14While each individual tool is designed to do a relatively simple task (e.g., intersect two interval files), quite sophisticated analyses can be conducted by combining multiple bedtools operations on the UNIX command line.
15
16Performance
17-----------
18As of version 2.18, ``bedtools`` is substantially more scalable thanks to improvements we have made in the algorithm used to process datasets that are pre-sorted
19by chromosome and start position. As you can see in the plots below, the speed and memory consumption scale nicely
20with sorted data as compared to the poor scaling for unsorted data. The current version of bedtools intersect is as fast as (or slightly faster) than the ``bedops`` package's ``bedmap`` which uses a similar algorithm for sorted data.  The plots below represent counting the number of intersecting alignments from exome capture BAM files against CCDS exons.
21The alignments have been converted to BED to facilitate comparisons to ``bedops``. We compare to the bedmap ``--ec`` option because similar error checking is enforced by ``bedtools``.
22
23
24**Note:** bedtools could not complete when using 100 million alignments and the R-Tree algorithm used for unsorted data.
25
26
27![Speed Comparison](http://bedtools.readthedocs.org/en/latest/_images/speed-comparo.png)
28![Memory Comparison](http://bedtools.readthedocs.org/en/latest/_images/memory-comparo.png)
29
30
31Details
32-------
33First created through urgency and adrenaline by Aaron Quinlan Spring 2009.
34Maintained by the Quinlan Laboratory at the University of Virginia.
35
361. **Lead developers**:           Aaron Quinlan, Hao Hoou, Brent Pedersen, Neil Kindlon
372. **Significant contributions**: Hao Hou, John Marshall, Assaf Gordon, Royden Clark,  Brent Pedersen, Ryan Dale
383. **Repository**:                https://github.com/arq5x/bedtools2
394. **Stable releases**:           https://github.com/arq5x/bedtools2/releases
405. **Documentation**:             http://bedtools.readthedocs.org
416. **License**:                   Released under MIT license
42
43
44Citation
45--------
46*Please cite the following article if you use BEDTools in your research*:
47  * Quinlan AR and Hall IM, 2010. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 26, 6, pp. 841–842.
48
49Also, if you use *pybedtools*, please cite the following.
50  * Dale RK, Pedersen BS, and Quinlan AR. Pybedtools: a flexible Python library for manipulating genomic datasets and annotations. Bioinformatics (2011). doi:10.1093/bioinformatics/btr539
51
52