Talk:Main Page: Difference between revisions

From Protist-Prokaryote Symbiosis Database
Jump to navigation Jump to search
Line 8: Line 8:


* [[Item:Q388]] -- reference 10.1038/ismej.2010.40 describes more symbionts, only one is in DB?
* [[Item:Q388]] -- reference 10.1038/ismej.2010.40 describes more symbionts, only one is in DB?
==Taxonomic groups that need updates==
* Euplotes symbionts
* Ciliate symbionts generally


==Project maintenance to-dos==
==Project maintenance to-dos==

Revision as of 14:10, 21 March 2024

NCBI taxon IDs requiring attention from NCBI Taxonomy team

Items that need some attention

  • Item:Q388 -- reference 10.1038/ismej.2010.40 describes more symbionts, only one is in DB?

Taxonomic groups that need updates

  • Euplotes symbionts
  • Ciliate symbionts generally

Project maintenance to-dos

Chores (have to periodically clear them):

  • Process backlog of new references
  • Add higher taxonomy for newly added taxa that are not yet represented (PR2 for eukaryotes)
  • Add parent taxa for all taxon items semi-automatically - try to match first word in taxon name to a genus name, remaining items have to be added manually
  • Ensure that all items have a class, check that class "placeholder taxon" is consistently used
  • Find references that have only DOIs and link them to reference items, or create new reference items for them

Ideas:

  • Link prokaryote names to LPSN?
  • Create reference items from DOIs in references and reconcile with Wikidata?
  • Add SILVA taxonomy for prokaryotes by mapping to NCBI taxon ID. Prefer SILVA to GTDB because more spp. represented by 16S, not just genomes
  • Better documentation of the data modeling, example workflow for adding a new entry based on information in a publication
  • Reference items should always have DOI as an alias, so it that users can easily check with the search bar if it is already used somewhere in the database

NB: The taxonomy version (e.g. PR2 v5.0.0 for eukaryotes) is tagged by linking to a reference item for that particular taxonomy version to the 'parent taxon' statement in a 'stated in' reference. Ideally we should be able to specify which taxonomy version we want to use, if branches get moved around in the future...

Modeling:

  • Sane way to model placeholder taxa and non-specific taxon statements? E.g. general statements like "all members of this family are associated with methanogens", without creating items for each individual?
  • Modeling interactions: RO is insufficient?
  • Add statements about metabolism ("phototroph", "nitrogen fixer") to symbiont items?
  • Dummy taxon for "microbiome" to enable us to add references for microbiome studies?