Welcome to the MaxBin download page.

MaxBin is software for binning assembled metagenomic sequences based on an Expectation-Maximization algorithm. Users can understand the underlying bins (genomes) of the microbes in their metagenomes by simply providing assembled metagenomic sequences and the reads coverage information or sequencing reads. For users' convenience MaxBin will report genome-related statistics, including estimated completeness, GC content and genome size in the binning summary page.

Users can use MEGAN or similar software on MaxBin bins to find the taxonomy of each bin after the binning process is finished.

A newer version of MaxBin, MaxBin 2.0, was released. MaxBin 2.0 supports multiple samples at the same time. You can download MaxBIn 2.0 from the sourceforge website. (Note: now two versions of MaxBin, MaxBin and MaxBin 2.0, are updated concurrently. This is because there are still some features (e.g. reassembly) not supported by MaxBin 2.0. If you are working on one metagenome, MaxBin is good for you; however if you would like to try binning coassembly of multiple metagenomes, please download MaxBin 2.0.)
For instructions on how to run MaxBin, please read the README and look at the FAQ.

MaxBin is developed under Linux platform and should be able to run in most Linux system. Currently it does not support Mac system.

MaxBin is released under the BSD license.

MaxBin can also be downloaded from Sourceforge!

MaxBin publication
MaxBin citation: Wu YW, Tang YH, Tringe SG, Simmons BA, and Singer SW, "MaxBin: an automated binning method to recover individual genomes from metagenomes using an expectation-maximization algorithm", Microbiome, 2:26, 2014.
See below for the MaxBin manuscript supporting materials.

Current Version

NEW! MaxBin 1.4.5 (6/12/2015)
  • Try harder finding marker genes when the number of genomes is very low in target dataset
  • Put reassembled genomes into another directory for easy comparison
  • Users can now change the minimum probability cutoff to whatever values (even 0) if they prefer
  • Fixed a bug that print EM log twice

Older Versions

MaxBin 1.4.4 (3/25/2015)
  • Reassembly function now supports fastq format. Users can now input interleaved fastq file for reassembly purpose.

MaxBin 1.4.3 (3/12/2015)
  • MaxBin will now try even harder to extract at least one bin from very shallow metagenome; however it is still recommended to use deeply-sequenced metagenome as MaxBin input.

MaxBin 1.4.2 (12/8/2014)
  • Fixed a bug that cannot recruit reads for reassembly.
  • Added sections instructing users to download and install auxiliary software packages if autobuild_auxiliary fails.

MaxBin 1.4.1 (11/21/2014)
  • Fixed a bug that crashes MaxBin when sequences with all 'N's are encountered.

MaxBin 1.4 (10/8/2014)
  • Added a small script "autobuild_auxiliary" to automatically download, build, and setup auxiliary software packages from mirror sites. This should reduce most of the effort from building and setting auxiliary software.
  • Added -marketset option to choose between 107 marker genes and 40 marker genes sets. 40 marker genes are more suitale for extreme environments that dominated by Archaea while 107 marker genes are suitable for more general cases. Default is 107 marker genes.
  • Added -reassembly parameters for recruiting reads and then reassemble individual bins. This feature is still highly experimental. Feedbacks are more than welcomed.
  • Added Velvet as another mandatory auxiliary software package.
  • Due to the newly-released FragGeneScan 1.19 (which I helped to develop) that supports multi-thread function, MaxBin will now attempt to run FragGeneScan in multi-thread mode.

MaxBin 1.3 (5/12/2014)
  • Fixed bugs relating having 100+ bins.
  • Cleaned up the log file.
  • Added the -max_iteration option to adjust the maximum number of EM runs (default 50).
  • Unclassified sequences and sequences shorter than minimum length threshold are now stored in different files (.tooshort and .noclass).

MaxBin 1.2.1 (3/14/2014)
  • Added support for the FASTQ format when the -reads option is specified.

MaxBin 1.2 (1/27/2014)
  • Stabalized the connection between MaxBin and auxiliary software packages.
  • Added the function to check the output of auxiliary software packages. MaxBin will now output warning messages and stop running when there are errors.
  • Bug fixes.

MaxBin 1.1 (10/15/2013)
  • Fixed the bug that crashes the program when a FASTA file with a long single line is encountered.
  • Implemented recursive binning, in which binned FASTA files will be checked separately for further binning possibilities.
  • Make the summary file more informative by adding Completeness (i.e. how complete the binned genomes are), Genome sizes and GC content.
  • Sort the bins by descending order of their abundance levels.
  • Add total number of marker genes and the number of unique marker genes into 2nd and 3rd column of the .marker file.
  • Add a function to automatically "switch" to more powerful binning mode once MaxBin finds that it cannot find enough marker genes to determine bin number.

MaxBin 1.0.1 (6/25/2013)
  • Added -min_contig_len parameter for users to decide the minimum contig length acceptable in MaxBin. Default value of minimum contig length is 1000.
  • Added -plotmarker option and integrated it with R (Rscript) to plot the marker gene heatmap as PDF file.
  • Added -verbose option for debug usage.
  • Fix bug when the coverage of certain contig is 0. This should not happen very often, but codes are added to avoid program crash on such circumstance.

MaxBin 1.0 (not available for download) (4/29/2013)
  • Integrated core MaxBin algorithm into perl script to connect with 3rd-party software.
  • Added compile_MaxBin script for users to compile MaxBin core program and all 3rd-party software.

MaxBin Manuscript Supporting Materials

MaxBin FAQ

Q: How do I compile MaxBin from scratch?
A: Please run make under src directory to build the core program. You also need to download and install auxiliary software packages, including Bowtie2, FragGeneScan, and HMMER3 to run MaxBin correctly. Run the script autobuild_auxiliary to automatically download, build, and set all auxiliary software packages.

Q: I already have my own auxiliary software packages installed in my computer. Do I need to install them again?
A: Technically you don't need to install the software packages again; however, if your software packages are older than that used in MaxBin test, we recommend you update or download and install new software packages again (or simply run autobuild_auxiliary script). This will ensure MaxBin running process is free of errors.

Q: I don't have abundance information for my assembled scaffolds. What can I do?
A: MaxBin can find the abundance information from the sequencing reads automatically. Simply provide your reads file (in either FASTQ or FASTA format) using the -reads option. Currently MaxBin can only deal with one reads file at a time. Please concatenate your reads files if you have more than one of them.

Q: I ran MaxBin with -reads option. Where can I find the scaffold abundance information that MaxBin generated for me?
A: The abundance information is stored at (output file header).abund. You can use this file for re-run MaxBin for any other purpose.

Q: I cannot run MaxBin in Mac.
A: Currently MaxBin does not support Mac system. The developer is still working on porting MaxBin onto Mac.

Q: I have a question about MaxBin. Who should I contact?
A: Please contact the developer Yu-Wei Wu at ywwei@lbl.gov.