Job 195325c3 - Saccharomyces strains

Finished at
2021/06/07 16:04
Run time
111.63 min

Comparative Genomics results

Your prediction results are ready and at least available until 2021/20/07 16:04.

Click on a button to receive the related results.

Completeness (HTML) Missing (HTML)
Missing BUSCOs

Raw file Visual file Heatmap (HTML)

Query genomeReference genomeANI valueBidirectional fragmentsTotal fragments
Saccharomyces paradoxusY1288.184435314023
Saccharomyces paradoxusSK188.228336564023
Saccharomyces paradoxussake00188.189636594023
Saccharomyces paradoxusS288C88.231537004023
Saccharomyces paradoxusHLJ16788.194236374023
Y12Saccharomyces paradoxus88.191335393709
SK1Saccharomyces paradoxus88.157737124040
sake001Saccharomyces paradoxus88.186436483918
S288CSaccharomyces paradoxus88.222537054045
HLJ167Saccharomyces paradoxus88.163436393850

Table file

Genomes HLJ167 Y12 sake001 S288C Saccharomyces paradoxus SK1
HLJ167 103.13 100.81 102.45 104.42 55.1 104.87
Y12 101.61 102.02 102.59 103.32 54.66 104.61
sake001 102.1 101.49 106.2 106.54 58.25 108.43
S288C 101.47 99.65 103.89 115.5 60.82 108.86
Saccharomyces paradoxus 53.07 52.27 56.33 60.19 122.0 63.66
SK1 101.8 100.76 105.54 108.66 64.14 126.28

Please cite always:
Roman Martin, Thomas Hackl, Georges Hattab, Matthias G Fischer, Dominik Heider (2020). MOSGA: Modular Open-Source Genome Annotator. Bioinformatics. 36(22-23). 5514–5515. doi: 10.1093/bioinformatics/btaa1003.

Do you have some questions, issues or just would like to give us feedback? Please don't hesitate to write us or feel free to open a new issue on Gitlab.

Snakemake configuration Snakemake log What to cite

Single outputs
BUSCO genes BUSCOs category (HTML) BUSCOs missing (HTML) FastANI results FastANI visual file FastANI values (HTML) GC results GC Similarity SVG trimAl output MAFFT output

What to cite
Capella-Gutiérrez S, Silla-Martínez JM, Gabaldón T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics. 2009 1;25(15):1972-3. doi: 10.1093/bioinformatics/btp348.
Yu G, Smith D, Zhu H, Guan Y, Lam T T. ggtree: an R package for visualization and annotation of phylogenetic trees with their covariates and other associated data (2017). Methods in Ecology and Evolution, 8(1):28-36. doi:10.1111/2041-210X.12628
Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 2013 ;30(4):772-80. doi: 10.1093/molbev/mst010.
Jain C, Rodriguez-R LM, Phillippy AM, Konstantinidis KT, Aluru S. High throughput ANI analysis of 90K prokaryotic genomes reveals clear species boundaries. Nat Commun. 2018 30;9(1):5114. doi: 10.1038/s41467-018-07641-9.
Waterhouse RM, Seppey M, Simão FA, Manni M, Ioannidis P, Klioutchnikov G, Kriventseva EV, Zdobnov EM. BUSCO Applications from Quality Assessments to Gene Prediction and Phylogenomics. Mol Biol Evol. 2018 1;35(3):543-548. doi: 10.1093/molbev/msx319.
Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014 1;30(9):1312-3. doi: 10.1093/bioinformatics/btu033.
Katoh K, Misawa K, Kuma K, Miyata T. MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 2002 15;30(14):3059-66. doi: 10.1093/nar/gkf436.
Martin R, Hackl T, Hattab G, Fischer MG, Heider D. MOSGA: Modular Open-Source Genome Annotator. Bioinformatics. 2021;36(22-23):5514-5515. doi: 10.1093/bioinformatics/btaa1003
Simão FA, Waterhouse RM, Ioannidis P, Kriventseva EV, Zdobnov EM. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics. 2015 1;31(19):3210-2. doi: 10.1093/bioinformatics/btv351.
Stamatakis A. RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics. 2006 1;22(21):2688-90. doi: 10.1093/bioinformatics/btl446.

Upload your assembled FASTA genome file.

Priority (highest priority first)

    Functional Enrichment Analysis

    Protein-Protein Interactions Analysis

    Protein-Protein Interactions Analysis

    Choose your tools:


    Gene prediction tool

    Prediction of gene locations and splicing sites.

    Mode of work

    Evidence-based or ab initio prediction

    Functional prediction

    Functional gene prediction by comparison of protein databases.



    Detection of repeating sequences.



    Prediction of tRNA sequences.



    Search for rRNA sequence matches.

    Assembly Validation

    Genome Completeness

    Validate genome completeness.


    Contamination Detection and Assembly Quality

    UID Name FASTA files Submission Date Start date End date Mode Status

    The Modular Open-Source Genome Annotator (MOSGA) is a pipeline that easily creates draft genome annotation by a graphical user interface. It combines several specific prediction tools and generates a submission-ready annotation file.

    The source code is freely available on Gitlab.com. We recommend building a new docker container from the available Dockerfile in the linked Gitlab repository. MOSGA is written modular and allows easy integration of new prediction tools or even including whole third-party pipelines.

    For any questions or comments, please contact us: roman.martin@uni-marburg.de. We are happy to receive new suggestions or even merge requests for a pipeline extension. To provide an overview of the operation principle, we recommend reading our Gitlab wiki page.

    We are providing an example data set of the draft genome annotation of Cafeteria roenbergensis BVI strain. Initially, we used an early version of MOSGA to annotate this genome (Hackl et al., 2020). Hackl, T., Martin, R., Barenhoff, K. et al. Four high-quality draft genome assemblies of the marine heterotrophic nanoflagellate Cafeteria roenbergensis. Sci Data 7, 29 (2020).

    We provide two examples for the comparative genomics workflow: The Saccharomyces species phylogenetics and the Saccharomyces gene comparison. An exemplary annotation job for the organelle scanner based on the Nannochloropsis oceanica genome is here available.

    Please take care about the licenses of the selected tools.

    Whenever you use MOSGA please cite us:
    Roman Martin orcid, Thomas Hackl orcid, Georges Hattab orcid, Matthias Fischer orcid, Dominik Heider orcid (2020). MOSGA: Modular Open-Source Genome Annotator. Bioinformatics. 36(22-23). 5514–5515. doi: 10.1093/bioinformatics/btaa1003.

    This MOSGA instance is hosted by the Philipps University of Marburg for demonstration purposes.
    It runs on an AMD Zen processor with 16 threads and 32 GB of memory.

    Processed jobs that are older than 14 days will be deleted automatically.

    Incoming jobs are queued and processed as soon as possible. Jobs that stress our hardware longer than 48 hours could be terminated.

    Uploading jobs could be aborted depending on your bandwidth and the upload duration. Therefore we recommend not to upload files that are larger than 200 MiB.

    We reserve the right to analyze failed jobs to determine errors and to provide bug fixes and quality improvements. Your results will still not be shared and regularly delete.

    If you provide a notification email address we may contact you then we could detect that could be avoided or fixed.

    Server usage