PubMLST forum on data curation

PubMLST hosts a collection of curated databases that integrate population sequence data with provenance and phenotype information for over 100 different microbial species and genera. Many of the databases are overseen by an international community of curators whose role is to ensure that data match basic quality criteria and are entered consistently.

On February 5, 2021 we hosted a forum to bring together existing curators and anybody who has a potential interest in joining the team. Topics discussed were:

  • The role of curators
    • Ensuring alleles are correctly defined.
    • Ensuring isolate provenance (meta) data are reliable and as complete as possible.
  • The process of handing submissions
  • How curation can be improved with automation
    • Field validation using allowed values and range constraints
    • Required, optional, or expected fields
    • Defining and applying field rules

You can watch the event on our YouTube channel.


  • 14:00 Welcome and introduction - Martin Maiden, University of Oxford, UK
  • 14:05 PubMLST data curation - Keith Jolley, University of Oxford, UK
  • 14:50 Question and answer session - Moderated by Martin Maiden
  • 15:20 Future curation events - Holly Bratcher, University of Oxford, UK
  • 15:30 Close