FAST: FAST Analysis of Sequences Toolbox

Travis J. Lawrence, Kyle T. Kauffman, Katherine C.H. Amrine, Dana L. Carper, Raymond S. Lee, Peter J. Becich, Claudia J. Canales, David H. Ardell

Research output: Contribution to journal › Article › peer-review

25 Scopus citations

Abstract

FAST (FAST Analysis of Sequences Toolbox) provides simple, powerful open source command-line tools to filter, transform, annotate and analyze biological sequence data. Modeled after the GNU (GNU's Not Unix) Textutils such as grep, cut, and tr, FAST tools such as fasgrep, fascut, and fastr make it easy to rapidly prototype expressive bioinformatic workflows in a compact and generic command vocabulary. Compact combinatorial encoding of data workflows with FAST commands can simplify the documentation and reproducibility of bioinformatic protocols, supporting better transparency in biological data science. Interface self-consistency and conformity with conventions of GNU, Matlab, Perl, BioPerl, R, and GenBank help make FAST easy and rewarding to learn. FAST automates numerical, taxonomic, and text-based sorting, selection and transformation of sequence records and alignment sites based on content, index ranges, descriptive tags, annotated features, and in-line calculated analytics, including composition and codon usage. Automated content- and feature-based extraction of sites and support for molecular population genetic statistics make FAST useful for molecular evolutionary analysis. FAST is portable, easy to install and secure thanks to the relative maturity of its Perl and BioPerl foundations, with stable releases posted to CPAN. Development as well as a publicly accessible Cookbook and Wiki are available on the FAST GitHub repository at https://github.com/tlawrence3/FAST. The default data exchange format in FAST is Multi-FastA (specifically, a restriction of BioPerl FastA format). Sanger and Illumina 1.8+ FastQ formatted files are also supported. FAST makes it easier for non-programmer biologists to interactively investigate and control biological data at the speed of thought.
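The abstract describes composing grep-like and cut-like filters over Multi-FastA records. As a rough illustration of that idea only, the Python sketch below parses records, selects those whose description matches a regular expression, and then trims each sequence to an index range. It is not FAST's Perl/BioPerl implementation and does not reproduce the command-line interface of fasgrep or fascut; the function names `read_fasta`, `grep_records`, and `cut_sequence` and the sample records are hypothetical and introduced here for illustration.

```python
import re
from typing import Iterable, Iterator, Tuple

# A record is (description line without the leading '>', sequence string).
Record = Tuple[str, str]


def read_fasta(lines: Iterable[str]) -> Iterator[Record]:
    """Yield (description, sequence) pairs from Multi-FastA text lines."""
    header = None
    chunks = []
    for line in lines:
        line = line.rstrip("\n")
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None and line.strip():
            # Sequence data may span several lines; join them later.
            chunks.append(line.strip())
    if header is not None:
        yield header, "".join(chunks)


def grep_records(records: Iterable[Record], pattern: str,
                 on_sequence: bool = False) -> Iterator[Record]:
    """Keep records whose description (or sequence) matches the regex."""
    regex = re.compile(pattern)
    for header, seq in records:
        target = seq if on_sequence else header
        if regex.search(target):
            yield header, seq


def cut_sequence(records: Iterable[Record], start: int,
                 end: int) -> Iterator[Record]:
    """Trim each sequence to the 1-based inclusive range start..end."""
    for header, seq in records:
        yield header, seq[start - 1:end]


# Hypothetical sample data, made up for this sketch.
example = """>seq1 Homo sapiens tRNA
ACGTACGTAC
GGTT
>seq2 Mus musculus rRNA
TTTTGGGGCC
>seq3 Homo sapiens rRNA
CCCCAAAATT
""".splitlines()

# Chain the two steps the way a shell pipeline would chain two commands.
human = grep_records(read_fasta(example), r"Homo sapiens")
result = list(cut_sequence(human, 1, 4))

for header, seq in result:
    print(f">{header}\n{seq}")
```

Because each step consumes and yields the same record type, steps can be chained in any order, which loosely mirrors how the abstract describes combining FAST commands into compact workflows.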

Original language: English
Article number: 172
Journal: Frontiers in Genetics
Volume: 6
Issue number: MAY
State: Published - 2015
Externally published: Yes

Keywords

  • BioPerl
  • Bioinformatic workflow
  • MultiFASTA
  • NCBI taxonomy
  • Open source
  • Pipeline
  • Regular expression
  • Unix philosophy
