ClustalW2 is a general purpose multiple sequence alignment program for DNA or proteins. The online version is hosted by EMBL. It produces biologically meaningful multiple sequence alignments of divergent sequences. It calculates the best match for the selected sequences, and lines them up so that the identities, similarities and differences can be seen.
Minimus is one of several assembly pipelines included in the AMOS software package. It is designed specifically for small data-sets, such as the set of reads covering a specific gene. Note that the code will work for larger assemblies (we have used it to assemble bacterial genomes), however, due to its stringency, the resulting assembly will be highly fragmented. For large and/or complex assemblies the execution of Minimus should be followed by additional processing steps, such as scaffolding.
Elvira is a set of tools and procedures designed for high throughput assembly of small genomes such as viruses. The name Elvira stands for Executive for Large-scale VIRal Assembly. Elvira is particularly suitable in situations where any of the following occur:
1. Sequences must be individually trimmed against specific primers.
2. Sequences must be trimmed against degenerate primers.
3. Assemblies need to be explicitly partitioned based on sequence origin.
The Elvira package includes s
Celera Assembler is scientific software for DNA research. It can reconstruct long sequences of genomic DNA given the fragmentary data produced by whole-genome shotgun sequencing. The Celera Assembler has enabled discovery in microbial genomes, large eukaryotic genomes, diploid genomes, and genomes from environmental samples. Celera Assembler contributed the first diploid sequence of an individual human, and metagenomics assemblies of the Global Ocean Sampling
The Celera Assembler is a member of
SOAP has been in evolution from a single alignment tool to a tool package that provides full solution to next generation sequencing data analysis. Currently, it consists of a new alignment tool (SOAPaligner/soap2), a re-sequencing consensus sequence builder (SOAPsnp), an indel finder ( SOAPindel ), a structural variation scanner ( SOAPsv ) and a de novo short reads assembler ( SOAPdenovo ).
A new approach to assembling large, random shotgun sequencing projects has been developed. The TIGR Assembler overcomes several major obstacles to assembling such projects: the large number of pairwise comparisons required, the presence of repeat regions, chimeras introduced in the cloning process, and sequencing errors. A fast initial comparison of fragments based on oligonucleotide content is used to eliminate the need for a more sensitive comparison between most fragment pairs, thus greatly r
GATK is a structured programming framework designed to enable rapid development of efficient and robust analysis tools for next-generation DNA sequencers and second it's a suite of tools for working with human medical resequencing projects such as 1000 Genomes and The Cancer Genome Atlas. The GATK s a Java-based tool intended for use on Linux (and other POSIX compatible) platforms.
TopHat is a Linux or OS X program that aligns RNA-Seq reads to a genome in order to identify exon-exon splice junctions. It is designed to work with reads produced by the Illumina Genome Analyzer, although it was successful used with reads from other technologies. TopHat aligns RNA-Seq reads to mammalian-sized genomes using the ultra high-throughput short read aligner Bowtie, and then analyzes the mapping results to identify splice junctions between exons. It requires Bowtie, SAMtools an Python
This is the high throughput PCR primer design software that is used at the J. Craig Venter Institute. See: Li K, et al. "Novel computational methods for increasing PCR primer design effectiveness in directed sequencing". BMC Bioinformatics 2008, 9:191.
Lucy was designed and written at The Institute for Genomic Research (TIGR, now the J. Craig Venter Institute), and it has been used here for several years to clean sequence data from automated DNA sequencers prior to sequence assembly and other downstream uses. The quality trimming portion of lucy makes use of phred quality scores, such as those produced by many automated sequencers based on the Sanger sequencing method. As such, lucy’s quality trimming may not be appropriate for sequence da
Bowtie is an ultrafast, memory-efficient short read aligner. It aligns short DNA sequences (reads) to the human genome at a rate of over 25 million 35-bp reads per hour. Bowtie indexes the genome with a Burrows-Wheeler index to keep its memory footprint small: typically about 2.2 GB for the human genome (2.9 GB for paired-end).
Edena (Exact DE Novo Assembler) is an assembler dedicated to process the millions of very short reads produced by the Illumina Genome Analyzer. Edena is based on the traditional overlap layout paradigm. All exact overlaps between any pair of reads are computed and structured in a graph (overlap step). Basically, the reads are indexed in a prefix array and overlaps are revealed by dichotomic search in the arrays. The graph is then analyzed to remove transitive and spurious edges (layout step). Fi
Next generation parallel sequencing technologies made chromatin immunoprecipitation followed by sequencing (ChIP-Seq) a popular strategy to study genome-wide protein-DNA interactions, while creating challenges for analysis algorithms. We present Model-based Analysis of ChIP-Seq (MACS) on short reads sequencers such as Genome Analyzer (Illumina / Solexa). MACS empirically models the length of the sequenced ChIP fragments, which tends to be shorter than sonication or library construction size esti