Skip to content

Standardize PAINT data production and use in the pipeline #413

@kltm

Description

@kltm

Currently, some resources have their PAINT annotations merged into their main output file (e.g. the paint_pombase.gaf file gets merged into the pombase.gaf file that we provide), while others do not (e.g. there is no paint_japonicus.gaf file, japonicus IBAs are found in the paint_other.gaf file instead).

This easily leads to confusion and makes it hard to diagnose issues.

As a solution, we propose a simple rule: if the primary gene ID space is not uniprot, then always include the paint_xxx file.

I believe this would mean the creation and merging of:

  • genedb_lmajor
  • genedb_tbrucei
  • genedb_pfalciparum
  • japonicus
  • pseudocap
  • sgn
  • the filtering of the above from the paint_other.gaf

(I assume that rnacentral is a special case here.)

This stems from the discussion of around a drop in japonicus annotations geneontology/helpdesk#526 .

Tagging @ValWood @cmungall @pgaudet @dustine32

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    Status

    TODO

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions