cocoatree.datasets.load_rhomboid_proteases

cocoatree.datasets.load_rhomboid_proteases()[source]

Load the rhomboid protease dataset

This dataset comes from Mihaljevic & Urban, Cell, 2020 (DOI: https://doi.org/10.1016/j.str.2020.07.015).

Returns

a dictionnary containing :
  • sequence_ids: a list of strings corresponding to sequence names

  • alignment: a list of strings corresponding to sequences. Because it is an MSA, all the strings are of same length.

  • sector_positions: a dictionnary of arrays containing the residue

positions associated to each sector as published in the original paper.

  • pdb_sequence: sequence extracted from E. coli’s PDB structure

  • pdb_positions: positions extracted from E. coli’s PDB structure

Examples using cocoatree.datasets.load_rhomboid_proteases

Rhomboid proteases

Rhomboid proteases