• Home
  • History
  • Annotate
Name Date Size #Lines LOC

..03-Jun-2021-

READMEH A D01-Jun-20212.4 KiB5145

aster.proH A D01-Jun-2021145 33

aster_no_wrap.proH A D01-Jun-2021145 32

centaurea.nuH A D01-Jun-20211.1 KiB1916

dups.fastaH A D01-Jun-2021129 1510

elderberry.nuH A D01-Jun-20212.2 KiB3231

f001H A D01-Jun-2021179 43

f002H A D01-Jun-20211.7 KiB2626

f003H A D01-Jun-2021392 1110

fa01H A D01-Jun-2021809 1716

lavender.nuH A D01-Jun-2021683 99

loveliesbleeding.proH A D01-Jun-2021401 86

lupine.nuH A D01-Jun-2021922 1211

output001.m10H A D01-Jun-20218.1 KiB241227

output002.m10H A D01-Jun-202116.6 KiB485470

output003.m10H A D01-Jun-20216.4 KiB209190

output004.m10H A D01-Jun-20213.8 KiB110100

output005.m10H A D01-Jun-20213.8 KiB10090

output006.m10H A D01-Jun-20212.8 KiB8678

output007.m10H A D01-Jun-202113.4 KiB438424

output008.m10H A D01-Jun-202118.4 KiB605592

output009.m10H A D01-Jun-20218.3 KiB299286

output010.m10H A D01-Jun-2021982 2924

output011.m10H A D01-Jun-20215.8 KiB205197

output012.m10H A D01-Jun-20218.6 KiB340332

output013.m10H A D01-Jun-20217.3 KiB255242

output014.m10H A D01-Jun-2021975 2924

output015.m10H A D01-Jun-20213.3 KiB114106

output016.m10H A D01-Jun-202122.3 KiB909901

phlox.nuH A D01-Jun-2021801 1010

rose.proH A D01-Jun-2021701 1210

rosemary.proH A D01-Jun-2021535 88

sweetpea.nuH A D01-Jun-2021412 86

wisteria.nuH A D01-Jun-20212.7 KiB3938

README

1FASTA TEST DATA FILES
2=====================
3
4This directory contains various data files for testing the
5Fasta-related code in Biopython.
6
7The following are the common sequence file format, originally
8introduced as the input file format for Bill Pearson's FASTA
9tools.  These are for tested in Bio.SeqIO and Bio.AlignIO
10(where the format is called "fasta") as well as other older
11parts of Biopython such as the Bio.Fasta module.
12
13ID     Description
14f001   1 protein sequence
15f002   3 DNA sequences
16f003   2 proteins, with comments
17fa01   fasta alignment
18
19The following are example "machine readable" pairwise alignment
20output files from the FASTA tools when using the -m 10 command
21line option.  These are for testing the Bio.AlignIO and Bio.SearchIO
22code where the format is called "fasta-m10".
23
24output001.m10 - fasta35 protein-protein, 3 query sequences,
25                no histogram, with expectation threshold
26output002.m10 - fasta34 protein-protein, 3 query sequences,
27                with offsets and word size, max 2 hits per query
28output003.m10 - fasta34 protein-protein, 5 query sequences,
29                very strict threshold so not all have hits.
30output004.m10 - fasta35 nucleotide-nucleotide, 3 queries where
31                only the middle one has a single hit.
32output005.m10 - ssearch35 protein-protein, 3 queries where
33                only the middle one has a single hit.
34output006.m10 - fasta35 nucleotide-nucleotide, 1 query, in the
35                alignment the query has been reversed.
36output007.m10 - recreation of output001.m10 using fasta-36.3.4 (note that
37                histogram is now off by default, -H now turns it on, and the
38                -Q quiet flag no longer exists). Get more hits due to revised
39                e-value calculations, also ">>><<<" marks end of a query, not
40                just end of the file!
41output008.m10 - tfastx36 protein-nucleotide, 4 queries, some with no hits,
42                some matches with multiple HSPs (new feature in FASTA v36)
43output009.m10 - fasta36, multiple dna queries
44output010.m10 - fasta36, single dna query, no hit
45output011.m10 - fasta36, single dna query, each hit contains a single hsp
46output012.m10 - fasta36, single dna query, some hits contain multiple hsps
47output013.m10 - fasta36, multiple protein queries
48output014.m10 - fasta36, single protein query, no hit
49output015.m10 - fasta36, single protein query, each hit contains a single hsp
50output016.m10 - fasta36, single protein query, some hits contain multiple hsps
51