<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[A primer on microbial bioinformatics for nonbioinformaticians]]></title><description><![CDATA[<p dir="auto"><a href="https://www.clinicalmicrobiologyandinfection.com/article/S1198-743X(17)30709-7/fulltext" rel="nofollow ugc">https://www.clinicalmicrobiologyandinfection.com/article/S1198-743X(17)30709-7/fulltext</a><br />
Table 1List of bioinformatics software used for microbial bioinformatics data analysis<br />
Usage	Software name	Description	URL<br />
Quality measures and read preprocessing	FASTQC	Toolbox for displaying sequence statistics for next-generation sequencing reads	<a href="http://www.bioinformatics.babraham.ac.uk/projects/fastqc/" rel="nofollow ugc">http://www.bioinformatics.babraham.ac.uk/projects/fastqc/</a><br />
TRIMMOMATIC	Command-line based tool for trimming of short-read paired-end and single-ended data	<a href="http://www.usadellab.org/cms/?page=trimmomatic" rel="nofollow ugc">http://www.usadellab.org/cms/?page=trimmomatic</a><br />
FASTX-Toolkit	A collection of command line tools for preprocessing of short-read FASTA/FASTQ files	<a href="http://hannonlab.cshl.edu/fastx_toolkit/" rel="nofollow ugc">http://hannonlab.cshl.edu/fastx_toolkit/</a><br />
PRINSEQ	Command-line and web-based tool for filtering, reformatting, or trimming genomic and metagenomic sequence data, generates summary statistics in graphical and tabular format	<a href="http://prinseq.sourceforge.net/" rel="nofollow ugc">http://prinseq.sourceforge.net/</a>, <a href="http://edwards.sdsu.edu/cgibin/prinseq/prinseq.cgi" rel="nofollow ugc">http://edwards.sdsu.edu/cgibin/prinseq/prinseq.cgi</a><br />
Contamination detection	Kraken	Taxonomic assignment of reads, useful for metagenomics analysis or detection of contamination in pure culture samples	<a href="https://ccb.jhu.edu/software/kraken/" rel="nofollow ugc">https://ccb.jhu.edu/software/kraken/</a><br />
MIDAS	Taxonomic assignment of reads, useful for metagenomics analysis or detection of contamination in pure culture samples	<a href="https://github.com/snayfach/MIDAS" rel="nofollow ugc">https://github.com/snayfach/MIDAS</a><br />
Assembly software and pipelines	Velvet	De novo genomic assembler specially designed for short reads	<a href="http://github.com/dzerbino/velvet/tree/master" rel="nofollow ugc">http://github.com/dzerbino/velvet/tree/master</a><br />
SPAdes	De novo genomic assembler for short reads; it can also provide hybrid assemblies using long-read data together with short-read data	<a href="http://cab.spbu.ru/software/spades/" rel="nofollow ugc">http://cab.spbu.ru/software/spades/</a><br />
Canu	De novo genomic assembler designed for high-noise single-molecule sequencing such as long reads	<a href="http://github.com/marbl/canu" rel="nofollow ugc">http://github.com/marbl/canu</a><br />
INNUca	A standardized, fully automated, flexible, portable and pathogen-independent pipeline for bacterial genome assembly and quality control starting from short reads	<a href="http://github.com/INNUENDOCON/INNUca" rel="nofollow ugc">http://github.com/INNUENDOCON/INNUca</a><br />
shovill	A pipeline for bacterial genome assembly which improves SPAdes speed and accuracy	<a href="https://github.com/tseemann/shovill" rel="nofollow ugc">https://github.com/tseemann/shovill</a><br />
In silico typing	ReMatCh	Software for variant calling based on a read-mapping strategy to selected target sequences; also interacts with European Nucleotide Archive (ENA) repository, easily mining publicly available data	<a href="http://github.com/B-UMMI/ReMatCh" rel="nofollow ugc">http://github.com/B-UMMI/ReMatCh</a><br />
Short Read Sequence Typing for Bacterial Pathogens (SRST2)	It uses short-read data, MLST database and/or database of gene sequences (e.g. resistance genes, virulence genes) and reports the presence of STs and/or reference genes	<a href="http://github.com/katholt/srst2" rel="nofollow ugc">http://github.com/katholt/srst2</a><br />
Microbial InSilico Typer (MIST)	Rapid generation of in silico typing data (e.g. MLST, MLVA) from draft bacterial genome assemblies	<a href="http://bitbucket.org/peterk87/microbialinsilicotyper" rel="nofollow ugc">http://bitbucket.org/peterk87/microbialinsilicotyper</a><br />
SISTR	A web- and command line–accessible tool for Salmonella typing using draft genome assemblies	<a href="http://lfz.corefacility.ca/sistr-app/" rel="nofollow ugc">http://lfz.corefacility.ca/sistr-app/</a><br />
SeqSero	A web-accessible tool for Salmonella typing using raw reads or draft genome assemblies	<a href="http://www.denglab.info/SeqSero" rel="nofollow ugc">http://www.denglab.info/SeqSero</a><br />
RGI-CARD	Curated collection of antimicrobial resistance gene and mutation sequences, bioinformatics models and tools for their detection in bacterial genomes	<a href="http://www.card.mcmaster.ca/analyze/rgi" rel="nofollow ugc">http://www.card.mcmaster.ca/analyze/rgi</a><br />
ResFinder	A web-accessible tool for the detection of acquired antimicrobial resistance genes in bacterial genomes using raw reads or draft genome assemblies	<a href="https://cge.cbs.dtu.dk/services/ResFinder/" rel="nofollow ugc">https://cge.cbs.dtu.dk/services/ResFinder/</a><br />
VirulenceFinder	A web-accessible tool for the detection of virulence associated genes in Escherichia coli, Listeria spp., Staphylococcus aureus, Enterococcus spp. using raw reads or draft genome assemblies	<a href="https://cge.cbs.dtu.dk/services/VirulenceFinder/" rel="nofollow ugc">https://cge.cbs.dtu.dk/services/VirulenceFinder/</a><br />
MLST1.8	A web-accessible tool for the determination of MLST types from bacterial genomes using publicly available MLST schemas	<a href="https://cge.cbs.dtu.dk/services/MLST" rel="nofollow ugc">https://cge.cbs.dtu.dk/services/MLST</a><br />
Mlst2.9	Command line–based software which can extract MLST from bacterial genomes using publicly available MLST schemas	<a href="https://github.com/tseemann/mlstCFSANSNP" rel="nofollow ugc">https://github.com/tseemann/mlstCFSANSNP</a><br />
CFSAN SNP Pipeline	Pipeline for extracting high quality SNV matrices for sequences from closely related pathogens	<a href="http://snppipeline.readthedocs.io/en/latest/" rel="nofollow ugc">http://snppipeline.readthedocs.io/en/latest/</a><br />
Snippy	A pipeline for rapid identification of haploid variants and construction of phylogeny using core genome SNPs	<a href="http://github.com/tseemann/snippy" rel="nofollow ugc">http://github.com/tseemann/snippy</a><br />
SNVPhyl (Single Nucleotide Variant PHYLogenomics)	Pipeline for identifying SNV within a collection of microbial genomes and constructing a phylogenetic tree	<a href="http://snvphyl.readthedocs.io/en/latest/" rel="nofollow ugc">http://snvphyl.readthedocs.io/en/latest/</a><br />
Lyve-SET	A pipeline for using high-quality SNPs to create a phylogeny, especially for outbreak investigations	<a href="https://github.com/lskatz/lyve-SET" rel="nofollow ugc">https://github.com/lskatz/lyve-SET</a><br />
Gene-by-gene approaches	BIGSdb	Web-accessible database system designed to store and analyse linked phenotypic and genotypic information, including allele calling engine for gene-by-gene approach; it is the database system for both PubMLST and PasteurMLST	<a href="https://github.com/kjolley/BIGSdb" rel="nofollow ugc">https://github.com/kjolley/BIGSdb</a>, <a href="http://pubmlst.org" rel="nofollow ugc">http://pubmlst.org</a><br />
<a href="http://bigsdb.pasteur.fr/index.html" rel="nofollow ugc">http://bigsdb.pasteur.fr/index.html</a><br />
Enterobase	Curated database and online resource for molecular typing of Salmonella, Escherichia coli, Yersinia spp. and Moraxella spp. using gene-by-gene approach	<a href="http://enterobase.warwick.ac.uk/" rel="nofollow ugc">http://enterobase.warwick.ac.uk/</a><br />
Genome Profiler	Stand-alone gene-by-gene allele calling algorithm which uses conserved gene neighbourhoods to resolve gene paralogy	<a href="http://sourceforge.net/projects/genomeprofiler/" rel="nofollow ugc">http://sourceforge.net/projects/genomeprofiler/</a><br />
chewBBACA	A comprehensive and highly efficient stand-alone gene-by-gene allele calling algorithm based on coding DNA sequences, including suite of tools for providing overview of schema performance	<a href="https://github.com/B-UMMI/chewBBACA" rel="nofollow ugc">https://github.com/B-UMMI/chewBBACA</a><br />
Gene annotation	Prodigal	Protein-coding gene prediction software tool for bacterial and archaeal genomes	<a href="http://github.com/hyattpd/prodigal/wiki" rel="nofollow ugc">http://github.com/hyattpd/prodigal/wiki</a><br />
Prokka	Quick functional annotation of bacterial genomes producing standards-compliant output file	<a href="http://github.com/tseemann/prokka" rel="nofollow ugc">http://github.com/tseemann/prokka</a><br />
RAST	Fully automated service for annotating bacterial and archaeal genomes	<a href="http://rast.nmpdr.org/" rel="nofollow ugc">http://rast.nmpdr.org/</a><br />
MicroScope	Comprehensive analytical platform for genome annotation and analysis of bacterial genomes	<a href="http://www.genoscope.cns.fr/agc/microscope/home/index.php" rel="nofollow ugc">http://www.genoscope.cns.fr/agc/microscope/home/index.php</a><br />
NCBI prokaryotic genome annotation pipeline (PGAP)	Automatic prokaryotic genome annotation pipeline that combines ab initio gene prediction algorithms with homology-based methods	<a href="https://www.ncbi.nlm.nih.gov/genome/annotation_prok/" rel="nofollow ugc">https://www.ncbi.nlm.nih.gov/genome/annotation_prok/</a><br />
NCBI Pathogen Detection	An online platform for sharing and comparing data on outbreak strains; currently contains databases for 20 bacterial species, focusing on food-borne pathogens and healthcare-associated infections	<a href="https://www.ncbi.nlm.nih.gov/pathogens/" rel="nofollow ugc">https://www.ncbi.nlm.nih.gov/pathogens/</a><br />
Genome alignments	Harvest	A suite of core genome alignment and visualization tools for quick and high-throughput analysis of intraspecific bacterial genomes	<a href="http://harvest.readthedocs.io/en/latest/" rel="nofollow ugc">http://harvest.readthedocs.io/en/latest/</a><br />
Mauve	Aligner for comparative analysis of full bacterial genomes	<a href="http://darlinglab.org/mauve/mauve.html" rel="nofollow ugc">http://darlinglab.org/mauve/mauve.html</a><br />
Homology clustering and Association studies	Roary	High speed stand-alone pan-genome pipeline for bacterial genomes	<a href="http://sanger-pathogens.github.io/Roary/" rel="nofollow ugc">http://sanger-pathogens.github.io/Roary/</a><br />
Scoary	Pan-genome–wide association studies using Roary output	<a href="https://github.com/AdmiralenOla/Scoary" rel="nofollow ugc">https://github.com/AdmiralenOla/Scoary</a><br />
Neptune	Software designed for detecting genomic signatures within bacterial populations	<a href="https://github.com/phac-nml/neptune" rel="nofollow ugc">https://github.com/phac-nml/neptune</a><br />
Phylogenetic inference	RAxML	Sequential and parallel maximum-likelihood phylogeny estimation that operates on nucleotide and protein sequence alignments	<a href="https://sco.h-its.org/exelixis/software.html" rel="nofollow ugc">https://sco.h-its.org/exelixis/software.html</a><br />
FastTree	Compute approximately maximum likelihood phylogenetic trees from large nucleotide or protein multiple sequence alignments	<a href="http://www.microbesonline.org/fasttree/" rel="nofollow ugc">http://www.microbesonline.org/fasttree/</a><br />
Gubbins	Compute maximum likelihood from alignment after removing regions containing elevated densities of base substitutions	<a href="https://github.com/sangerpathogens/gubbins" rel="nofollow ugc">https://github.com/sangerpathogens/gubbins</a><br />
ClonalFrameML	A maximum likelihood implementation of ClonalFrame designed for genomes sequences	<a href="https://github.com/xavierdidelot/ClonalFrameML" rel="nofollow ugc">https://github.com/xavierdidelot/ClonalFrameML</a><br />
PHYLOViZ	Online Web-based tool for phylogenetic inference, visualization, analysis and sharing of sequence-based typing methods that generate allelic profiles and associated epidemiologic data	<a href="http://online.phyloviz.net" rel="nofollow ugc">http://online.phyloviz.net</a><br />
PHYLOViZ 2.0	Stand-alone Java software for phylogenetic inference, visualization and analysis of sequence-based typing methods that generate allelic profiles and their associated epidemiologic data	<a href="http://www.phyloviz.net/" rel="nofollow ugc">http://www.phyloviz.net/</a><br />
Visualization tools	Microreact	A web-based tool for genomic epidemiology data visualization and sharing	<a href="http://microreact.org" rel="nofollow ugc">http://microreact.org</a><br />
Phandango	Interactive web-based tool for fast exploration of large-scale population genomics data sets combining output from multiple genomic analysis methods	<a href="https://github.com/jameshadfield/phandango" rel="nofollow ugc">https://github.com/jameshadfield/phandango</a><br />
iTOL	Web-based tool for display, annotation and management of phylogenetic trees	<a href="http://itol.embl.de/" rel="nofollow ugc">http://itol.embl.de/</a><br />
GenGIS 2	Application including 3-D graphical and Python interfaces allowing users to combine digital map data and sequences	<a href="http://kiwi.cs.dal.ca/GenGIS/Main_Page" rel="nofollow ugc">http://kiwi.cs.dal.ca/GenGIS/Main_Page</a><br />
Multipurpose analytical platforms and pipelines	Centre for Genomic Epidemiology Toolbox	A suite of web-based tools and service for pathogen molecular typing, genome assembly, phenotypic prediction (e.g. resistance prediction) and phylogeny construction	<a href="http://cge.cbs.dtu.dk/services/" rel="nofollow ugc">http://cge.cbs.dtu.dk/services/</a><br />
Integrated Rapid Infectious Disease Analysis (IRIDA) Platform	A Galaxy-based platform for real-time infectious disease outbreak investigation using genomic data including a sequence data management module and workflows, ontology framework (GenEpiO) and data visualization tools	<a href="https://irida.corefacility.ca/documentation/downloads/index.html" rel="nofollow ugc">https://irida.corefacility.ca/documentation/downloads/index.html</a>, <a href="http://irida.ca/" rel="nofollow ugc">http://irida.ca/</a><br />
Integration genomics in surveillance of food-borne pathogens (INNUENDO) platform	A platform for real-time disease outbreak investigation and surveillance of food-borne pathogens using genomic data including sequence-data management module, assembly modules with QA/QC measures, gene-by-gene analytical pipeline, ontology framework (GenEpiO) and visualization tools	<a href="https://github.com/INNUENDOCON/INNUENDO_platform" rel="nofollow ugc">https://github.com/INNUENDOCON/INNUENDO_platform</a><br />
Nullarbor	A pipeline for generating public health microbiology reports from sequenced isolates including sequencing specifics, species ID, subtypes and core SNP	<a href="http://github.com/tseemann/nullarbor" rel="nofollow ugc">http://github.com/tseemann/nullarbor</a></p>
]]></description><link>http://an.forum.genostack.com/topic/689/a-primer-on-microbial-bioinformatics-for-nonbioinformaticians</link><generator>RSS for Node</generator><lastBuildDate>Sat, 13 Jun 2026 10:43:30 GMT</lastBuildDate><atom:link href="http://an.forum.genostack.com/topic/689.rss" rel="self" type="application/rss+xml"/><pubDate>Mon, 13 Jun 2022 06:27:09 GMT</pubDate><ttl>60</ttl></channel></rss>