cocoatree.datasets.load_rhomboid_proteases¶
- cocoatree.datasets.load_rhomboid_proteases()[source]¶
Load the rhomboid protease dataset
This dataset comes from Mihaljevic & Urban, Cell, 2020 (DOI: https://doi.org/10.1016/j.str.2020.07.015).
Returns¶
- a dictionnary containing :
sequence_ids: a list of strings corresponding to sequence names
alignment: a list of strings corresponding to sequences. Because it is an MSA, all the strings are of same length.
sector_positions: a dictionnary of arrays containing the residue
positions associated to each sector as published in the original paper.
pdb_sequence: sequence extracted from E. coli’s PDB structure
pdb_positions: positions extracted from E. coli’s PDB structure