BacDive

BacDive
Content
Description	The Bacterial Diversity Database
Contact
Research center	Leibniz Institute DSMZ - German Collection of Microorganisms and Cell Cultures GmbH
Primary citation	PMID 39470737
Release date	2012
Access
Website	http://bacdive.dsmz.de/
Web service URL	https://api.bacdive.dsmz.de/
Sparql endpoint	https://sparql.dsmz.de/bacdive/

BacDive (The Bacterial Diversity Database) is the worldwide largest database for standardized bacterial and archaeal strain-level information.

BacDive is a comprehensive resource containing diverse data on bacterial and archaeal strains, including taxonomy, morphology, physiology, sampling and environmental data and sequence information.^[1]^[2] The database is built on a base of curated data from culture collections. In 2025 BacDive contains information on 99,392 strains, including 21,168 type strains. The database is hosted by the Leibniz Institute DSMZ - German Collection of Microorganisms and Cell Cultures GmbH and is part of the integrated DSMZ Digital Diversity infrastructure. BacDive is a member of de.NBI - the German Network for Bioinformatics Infrastructure, as well as ELIXIR. The Global Biodata Coalition designated BacDive a Global Core Biodata Resource (GCBR) in 2022.^[3] In 2023, BacDive was additionally named as an ELIXIR Core Data Resource.^[4]

Database content

BacDive was initially released in April 2012 to standardize and make publicly available strain data of culture collections, other compendia, and publications. The first release, contained 89,758 entries for 18,157 strains and 179 different used data fields.^[5]

Today (as of December 2024), the database encompassed over 1000 different data fields. The database now comprises 2,709,516 entries for 99,392 strains. Each entry is linked to a reference.^[6] Data for each strain is divided into the categories "Name and taxonomic classification", "Morphology", "Culture and growth conditions, "Physiology and metabolism", "Isolation, sampling and environmental information." "Safety information", "Sequence information".^[7]

Since 2023 high-quality predicted data produced using machine learning models trained on curated BacDive data can be found in an additional section titled "Genome-based predictions".^[8]

Data access

Data can be accessed either via a GUI, via the RESTful web service.^[9], or via a SPARQL endpoint.

The GUI offers a simple search featuring auto-completion for searching strains by name, culture collection number, NCBI Tax ID or INSDC sequence accession number. Additionally, the user can use the advanced search, which enables the search in 130 data fields and gives the opportunity of complex queries by combining several fields. Data can be downloaded in CSV format for one or multiple strains.^[10]

Via the RESTful web service portal BacDive content can be accessed automatically (a free registration is needed). To support the use of the API, software clients in Python and R are available.

Other databases

For data that are outside the focus of BacDive, links to other databases are provided.

Other databases within the DSMZ Digital Diversity infrastructure:

External databases:

References

^ Abu-Jamous, Basel; Fa, Rui; Nandi, Asoke K. (2015). Integrative Cluster Analysis in Bioinformatics. John Wiley & Sons. p. 448. ISBN 9781118906552.
^ Reimer, LC; Sardà Carbasse, J; Koblitz, J; Ebeling, C; Podstawka, A; Overmann, J (January 7, 2022). "BacDive in 2022: the knowledge base for standardized bacterial and archaeal data". Nucleic Acids Research. 50 (Database issue): D741 – D746. doi:10.1093/nar/gkab961. PMC 8728306. PMID 34718743.
^ "Database from Braunschweig is essential for global bacteria research". dsmz.de. Retrieved 14 November 2024.
^ "ELIXIR announces new Core Data Resources and Recommended Interoperability Resources". elixir-europe.org. 14 December 2023. Retrieved 14 November 2024.
^ Söhngen, C; Boyke, B; Podstawka, A; Gleim, D; Overmann, J (October 13, 2013). "BacDive - The Bacterial Diversity Metadatabase". Nucleic Acids Research. 42 (Database issue): D592 – D599. doi:10.1093/nar/gkt1058. PMC 3965005. PMID 24214959.
^ "BacDive News". December 19, 2024.
^ Reimer, LC; Vetcininova, A; Sardà Carbasse, J; Söhngen, C; Gleim, D; Ebeling, C; Overmann, J (September 17, 2018). "BacDive in 2019: bacterial phenotypic data for High-throughput biodiversity analysis". Nucleic Acids Research. 47 (Database issue): D631 – D636. doi:10.1093/nar/gky879. PMC 6323973. PMID 30256983.
^ Schober, I; Koblitz, J; Sardà Carbasse, J; Ebeling, C; Schmidt, ML; Podstawka, A; Gupta, R; Ilangovan, V; Chamanara, J; Overmann, J; Reimer, LC (29 October 2024). "BacDive in 2025: the core database for prokaryotic strain data". Nucleic Acids Research. doi:10.1093/nar/gkae959. PMC 11701647. PMID 39470737.
^ Söhngen, C; Podstawka, A; Boyke, B; Gleim, D; Vetcininova, A; Reimer, LC; Ebeling, C; Pendarovski, C; Overmann, J (September 30, 2015). "BacDive - The Bacterial Diversity Metadatabase in 2016". Nucleic Acids Research. 44 (Database issue): D581 – D585. doi:10.1093/nar/gkv983. PMC 4702946. PMID 26424852.
^ Schober, I; Koblitz, J; Sardà Carbasse, J; Ebeling, C; Schmidt, ML; Podstawka, A; Gupta, R; Ilangovan, V; Chamanara, J; Overmann, J; Reimer, LC (29 October 2024). "BacDive in 2025: the core database for prokaryotic strain data". Nucleic Acids Research. doi:10.1093/nar/gkae959. PMC 11701647. PMID 39470737.

External links

Simple search at BacDive
Advanced search at BacDive
Web services at BacDive Archived 2016-09-17 at the Wayback Machine

[1] Abu-Jamous, Basel; Fa, Rui; Nandi, Asoke K. (2015). Integrative Cluster Analysis in Bioinformatics. John Wiley & Sons. p. 448. ISBN 9781118906552.

[2] Reimer, LC; Sardà Carbasse, J; Koblitz, J; Ebeling, C; Podstawka, A; Overmann, J (January 7, 2022). "BacDive in 2022: the knowledge base for standardized bacterial and archaeal data". Nucleic Acids Research. 50 (Database issue): D741 – D746. doi:10.1093/nar/gkab961. PMC 8728306. PMID 34718743.

[3] "Database from Braunschweig is essential for global bacteria research". dsmz.de. Retrieved 14 November 2024.

[4] "ELIXIR announces new Core Data Resources and Recommended Interoperability Resources". elixir-europe.org. 14 December 2023. Retrieved 14 November 2024.

[5] Söhngen, C; Boyke, B; Podstawka, A; Gleim, D; Overmann, J (October 13, 2013). "BacDive - The Bacterial Diversity Metadatabase". Nucleic Acids Research. 42 (Database issue): D592 – D599. doi:10.1093/nar/gkt1058. PMC 3965005. PMID 24214959.

[6] "BacDive News". December 19, 2024.

[7] Reimer, LC; Vetcininova, A; Sardà Carbasse, J; Söhngen, C; Gleim, D; Ebeling, C; Overmann, J (September 17, 2018). "BacDive in 2019: bacterial phenotypic data for High-throughput biodiversity analysis". Nucleic Acids Research. 47 (Database issue): D631 – D636. doi:10.1093/nar/gky879. PMC 6323973. PMID 30256983.

[8] Schober, I; Koblitz, J; Sardà Carbasse, J; Ebeling, C; Schmidt, ML; Podstawka, A; Gupta, R; Ilangovan, V; Chamanara, J; Overmann, J; Reimer, LC (29 October 2024). "BacDive in 2025: the core database for prokaryotic strain data". Nucleic Acids Research. doi:10.1093/nar/gkae959. PMC 11701647. PMID 39470737.

[9] Söhngen, C; Podstawka, A; Boyke, B; Gleim, D; Vetcininova, A; Reimer, LC; Ebeling, C; Pendarovski, C; Overmann, J (September 30, 2015). "BacDive - The Bacterial Diversity Metadatabase in 2016". Nucleic Acids Research. 44 (Database issue): D581 – D585. doi:10.1093/nar/gkv983. PMC 4702946. PMID 26424852.

[10] Schober, I; Koblitz, J; Sardà Carbasse, J; Ebeling, C; Schmidt, ML; Podstawka, A; Gupta, R; Ilangovan, V; Chamanara, J; Overmann, J; Reimer, LC (29 October 2024). "BacDive in 2025: the core database for prokaryotic strain data". Nucleic Acids Research. doi:10.1093/nar/gkae959. PMC 11701647. PMID 39470737.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]