• Home
  • History
  • Annotate
Name Date Size #Lines LOC

..03-May-2022-

cufflinks.xcodeproj/H06-Jul-2019-6,8586,845

doc/H03-May-2022-9,1318,157

external_tests/tiny_tests/2x75/H06-Jul-2019-163161

src/H03-May-2022-103,58673,772

tests/unit_tests/H06-Jul-2019-546311

.gitignoreH A D06-Jul-2019230 2423

AUTHORSH A D06-Jul-2019795 1713

LICENSEH A D06-Jul-20191.5 KiB3023

Makefile.amH A D06-Jul-2019115 95

README.mdH A D06-Jul-201911.8 KiB189138

autogen.shH A D06-Jul-2019382 2218

ax_bam.m4H A D06-Jul-20194.3 KiB156138

ax_boost_base.m4H A D06-Jul-201913.4 KiB302274

ax_boost_filesystem.m4H A D06-Jul-20194.3 KiB119110

ax_boost_serialization.m4H A D06-Jul-20194.7 KiB118109

ax_boost_system.m4H A D06-Jul-20194.1 KiB122110

ax_boost_thread.m4H A D06-Jul-20195.6 KiB164151

ax_check_eigen.m4H A D06-Jul-20192.3 KiB9989

ax_check_zlib.m4H A D06-Jul-20194.5 KiB127121

ax_openmp.m4H A D06-Jul-20193.4 KiB102100

boost.m4H A D06-Jul-20192.2 KiB8879

configure.acH A D03-May-20224.5 KiB156123

index.mdH A D06-Jul-20191.6 KiB2316

make_bin.shH A D06-Jul-20191.5 KiB5741

version.m4H A D06-Jul-201937 21

README.md

1
2# Cufflinks
3
4The main *website* for cufflinks is [here](http://cole-trapnell-lab.github.io/cufflinks/)
5
6*NOTE*: If you're looking for old releases of Cufflinks, including source, you can find them [here](http://cole-trapnell-lab.github.io/cufflinks/install/).
7
8Cufflinks assembles transcripts, estimates their abundances, and tests for differential expression and regulation in RNA-Seq samples. It accepts aligned RNA-Seq reads and assembles the alignments into a parsimonious set of transcripts. Cufflinks then estimates the relative abundances of these transcripts based on how many reads support each one, taking into account biases in library preparation protocols.
9
10Cufflinks was originally developed as part of a collaborative effort between the [Laboratory for Mathematical and Computational Biology](http://bio.math.berkeley.edu/), led by Lior Pachter at UC Berkeley, Steven Salzberg's [computational genomics group](http://ccb.jhu.edu/people/salzberg/) at the Institute of Genetic Medicine at Johns Hopkins University, and [Barbara Wold's lab](http://woldlab.caltech.edu/) at Caltech. The project is now maintained by [Cole Trapnell's](http://cole-trapnell-lab.github.io/) lab at the University of Washington.
11
12Cufflinks is provided under the OSI-approved [Boost License](http://en.wikipedia.org/wiki/Boost_Software_License)
13
14# News
15
16*To get the latest updates on the Cufflinks project and the rest of the "Tuxedo tools", please subscribe to our [**mailing list**](https://lists.sourceforge.net/lists/listinfo/bowtie-bio-announce)*
17
18# Install quick-start
19
20## Installing a pre-compiled binary release
21
22In order to make it easy to install Cufflinks, we provide a few binary packages [here](http://cole-trapnell-lab.github.io/cufflinks/install/) to save users from the occasionally frustrating process of building Cufflinks, which requires that you install the Boost libraries. To use the binary packages, simply download the appropriate one for your machine, untar it, and make sure the cufflinks,cuffdiff and cuffcompare binaries are in a directory in your PATH environment variable.
23
24# Building Cufflinks from source
25
26In order to build Cufflinks, you must have the [Boost C++ libraries](http://www.boost.org/) (version 1.47 or higher) installed on your system. See below for instructions on installing Boost.
27
28## Installing Boost
29
301. Download Boost and the bjam build engine. **WARNING:** Due to a serious issue with Boost Serlialization library introduced in version 1.56, Cufflinks currently can only be built with Boost version 1.55 or lower.  The issue is expected to be fixed in the upcoming Boost v1.59.
312. Unpack bjam and add it to your PATH.
323. Unpack the Boost tarball and cd to the Boost source directory. This directory is called the BOOST_ROOT in some Boost installation instructions.
334. Build Boost. Note that you can specify where to put Boost with the --prefix option. The default Boost installation directory is /usr/local. Take note of the boost installation directory, because you will need to tell the Cufflinks installer where to find Boost later on.
34
35- If you are on Mac OS X, type (all on one line):
36```bash
37bjam --prefix=<YOUR_BOOST_INSTALL_DIRECTORY> --toolset=darwin architecture=x86 address_model=32_64 link=static runtime-link=static --layout=versioned stage install
38```
39
40- If you are on a 32-bit Linux system, type (all on one line):
41```bash
42bjam --prefix=<YOUR_BOOST_INSTALL_DIRECTORY> --toolset=gcc architecture=x86 address_model=32 link=static runtime-link=static stage install
43```
44
45- If you are on a 64-bit Linux system, type (all on one line):
46```bash
47bjam --prefix=<YOUR_BOOST_INSTALL_DIRECTORY> --toolset=gcc architecture=x86 address_model=64 link=static runtime-link=static stage install
48```
49
50## Installing the SAM tools
51
521. [Download the SAM tools](http://samtools.sourceforge.net/)
532. Unpack the SAM tools tarball and cd to the SAM tools source directory.
543. Build the SAM tools by typing make at the command line.
554. Choose a directory into which you wish to copy the SAM tools binary, the included library <tt>libbam.a</tt>, and the library headers. A common choice is <tt>/usr/local/</tt>.
565. Copy libbam.a to the lib/ directory in the folder you've chosen above (e.g. <tt>/usr/local/lib/</tt>)
576. Create a directory called "bam" in the <tt>include/</tt> directory (e.g. <tt>/usr/local/include/bam</tt>)
587. Copy the headers (files ending in <tt>.h</tt>) to the include/bam directory you've created above (e.g. <tt>/usr/local/include/</tt>bam)
598. Copy the samtools binary to some directory in your <tt>PATH</tt>.
60
61## Installing the Eigen libraries
62
631. [Download Eigen](http://eigen.tuxfamily.org/)
642. Unpack the Eigen tarball and cd to the Eigen source directory.
653. Copy the Eigen/ subdirectory someplace on your system where you keep header files (e.g. /usr/local/include)
66
67## Building Cufflinks
68
69### If you are starting from a source tarball downloaded from [here](http://cole-trapnell-lab.github.io/cufflinks/install/):
70
71Unpack the Cufflinks source tarball (in this example for version 2.2.1):
72```bash
73tar zxvf cufflinks-2.2.1.tar.gz
74```
75Change to the Cufflinks directory:
76```bash
77cd cufflinks-2.2.1
78```
79
80### If you want to clone the Cufflinks github repo:
81```bash
82git clone https://github.com/cole-trapnell-lab/cufflinks.git
83cd cufflinks
84autoreconf --install
85```
86The above will generate the configure script.
87
88### To configure Cufflinks prior to the build
89
90If Boost is installed somewhere other than /usr/local, you will need to tell the installer where to find it using the --with-boost option. Specify where to install Cufflinks using the --prefix option.
91```bash
92./configure --prefix=/path/to/cufflinks/install --with-boost=/path/to/boost --with-eigen=/path/to/eigen
93```
94
95If you see any errors during configuration, verify that you are using Boost version 1.47 or higher, and that the directory you specified via --with-boost contains the boost header files and libraries. See the Boost Getting started page for more details. If you copied the SAM tools binaries to someplace other than /usr/local/, you may need to supply the --with-bam configuration option.
96Finally, make and install Cufflinks.
97```bash
98make
99make install
100```
101
102## Testing the installation
103
1041. [Download](http://cufflinks.cbcb.umd.edu/downloads/test_data.sam) the test data
1052. In the directory where you placed the test file, type:
106
107```bash
108cufflinks ./test_data.sam
109```
110
111You should see the following output:
112
113<pre>
114[bam_header_read] EOF marker is absent. The input is probably truncated.
115[bam_header_read] invalid BAM binary header (this is not a BAM file).
116File ./test_data.sam doesn't appear to be a valid BAM file, trying SAM...
117[13:23:15] Inspecting reads and determining fragment length distribution.
118> Processed 1 loci.                            [*************************] 100%
119Warning: Using default Gaussian distribution due to insufficient paired-end reads in open ranges.
120It is recommended that correct paramaters (--frag-len-mean and --frag-len-std-dev) be provided.
121> Map Properties:
122>       Total Map Mass: 102.50
123>       Read Type: 75bp x 75bp
124>       Fragment Length Distribution: Truncated Gaussian (default)
125>                     Estimated Mean: 200
126>                  Estimated Std Dev: 80
127[13:23:15] Assembling transcripts and estimating abundances.
128> Processed 1 loci.                            [*************************] 100%
129</pre>
130
131Verify that the file transcripts.gtf is in the current directory and looks like this (your file will have GTF attributes, omitted here for clarity)
132
133<pre>
134test_chromosome Cufflinks       exon    53      250     1000    +       .
135test_chromosome Cufflinks       exon    351     400     1000    +       .
136test_chromosome Cufflinks       exon    501     550     1000    +       .
137</pre>
138
139# Common uses of the Cufflinks package
140
141Cufflinks includes a number of tools for analyzing RNA-Seq experiments. Some of these tools can be run on their own, while others are pieces of a larger workflow. The complexity of your workflow depends on what you want to achieve with your analysis. For a complete discussion of how Cufflinks can help you with your analysis, please [see our protocol paper](http://www.nature.com/nprot/journal/v7/n3/full/nprot.2012.016.html). The paper includes a diagram (Figure 2) describing how the various parts of the Cufflinks package (and its companion tool TopHat) fit together. As of version 2.2.0, you can also run Cuffquant and Cuffnorm to make large scale analyses easier to handle. The figure below is an updated version of Figure 2 showing how the two utilities released after the protocol paper appeared fit into the workflow:
142
143<div style="text-align:center">
144![Workflow]({{ site.url }}/images/tuxedo_workflow.png)
145</div>
146
147You can use Cuffquant to pre-compute gene expression levels for each of your samples, which can save time if you have to re-run part of your analysis. Using Cuffquant also makes it easier to spread the load of computation for lots of samples across multiple computers. If you don't want to perform differential expression analysis, you can run Cuffnorm instead of Cuffdiff. Cuffnorm produces simple tables of expression values that you can look at in R (for example) to cluster samples and perform other follow up analysis.
148
149# Using pre-built annotation packages
150
151A number of steps in the Tuxedo package work better if you have pre-existing gene annotations. How you can use these annotations is detailed in our [protocol paper](http://www.nature.com/nprot/journal/v7/n3/full/nprot.2012.016.html). Illumina has kindly provided a large number of annotation packages for commonly used model organisms and humans. You can find these packages [here]({{ site.url }}/igenome_table/index.html).
152
153# References
154
155Cufflinks is an ongoing research project as well as a suite of tools. Here are the papers that describe the science behind the programs. If you use Cufflinks, please cite these papers in your work!
156
157**Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation**
158Cole Trapnell, Brian Williams, Geo Pertea, Ali Mortazavi, Gordon Kwan, Jeltje van Baren, Steven Salzberg, Barbara Wold, Lior Pachter.
159*Nature Biotechnology*, 2010, doi:10.1038/nbt.1621
160*Note: This is the original Cufflinks paper. Please cite this paper if you use Cufflinks in your work.*
161
162**Improving RNA-Seq expression estimates by correcting for fragment bias**
163Adam Roberts, Cole Trapnell, Julie Donaghey, John L. Rinn, Lior Pachter.
164*Genome Biology*, 2011, doi:10.1186/gb-2011-12-3-r22
165*Note: This paper describes improvements made to Cufflinks to handle bias in RNA-Seq read coverage. Please cite this paper if you use Cufflinks with the <tt>-b</tt> option in your work.*
166
167**Identification of novel transcripts in annotated genomes using RNA-Seq**
168Adam Roberts, Harold Pimentel, Cole Trapnell, Lior Pachter.
169*Bioinformatics*, 2011, doi:10.1093/bioinformatics/btr355
170*Note: This paper describes the RABT assembly algorithm. Please cite this paper if you use Cufflinks in RABT mode in your work.*
171
172**Differential analysis of gene regulation at transcript resolution with RNA-seq**
173Cole Trapnell, David Hendrickson, Martin Sauvageau, Loyal Goff, John L. Rinn, Lior Pachter
174*Nature Biotechnology*, 2012, doi:10.1038/nbt.2450
175*Note: This paper describes Cuffdiff 2. Please cite this paper if you use Cuffdiff in your work.*
176
177Cufflinks builds on many ideas, including some
178proposed in the following papers:
179
1801. Ali Mortazavi, Brian A Williams, Kenneth McCue, Lorian Schaeffer and Barbara
181Wold, "Mapping and quantifying mammalian transcriptomes by RNA-Seq",Nature
182Methods, volume 5, 621 - 628 (2008)
1832. Hui Jiang and Wing Hung Wong, "Statistical Inferences for isoform expression",
184Bioinformatics, 2009 25(8):1026-1032=
1853.Nicholas Eriksson, Lior Pachter, Yumi Mitsuya, Soo-Yon Rhee, Chunlin Wang,
186Baback Gharizadeh, Mostafa Ronaghi, Robert W. Shafer, Niko Beerenwinkel, "Viral
187population estimation using pyrosequencing", PLoS Computational Biology,
1884(5):e1000074
189