mmtfPyspark.datasets.swissModelDataset module

swissModelDataset

This module provides access to SWISS-MODEL datasets containing homology models.

References

get_swiss_models(uniProtIds)[source]

Downloads metadata for SWISS-MODEL homology models for alist of UniProtIds. The original data schema is flatterened into a row-based schema.

Parameters:

uniProtIds : list

list of UniProt Ids

Returns:

dataset

SwissModel dataset

Examples

>>> uniProtIds = ["P36575", "P24539", "O00244"]
>>> ds = swissProtDataset.get_swiss_models(uniProtIds)
>>> ds.show()
+------+--------+----+---+-----+----------+----+--------+-----------+--------+--------+--------+----------+-----------+
|    ac|sequence|from| to|qmean|qmean_norm|gmqe|coverage|oligo-state|  method|template|identity|similarity|coordinates|
+------+--------+----+---+-----+----------+----+--------+-----------+--------+--------+-- -----+----------+-----------+
|P36575|MSKVF...|   2|371|-3.06|0.66345522|0.75|0.953608|    monomer|Homology|1suj.1.A|68.66484|0.50463312|https://...|
|P24539|MLSRV...|  76|249|-2.51|0.67113881|0.65|0.679687|    monomer|Homology|5ara.1.S|84.48275|0.54788881|https://...|
|O00244|MPKHE...|   1| 68| 1.04|0.84233218|0.98|     1.0| homo-2-mer|Homology|1fe4.1.A|   100.0|0.60686457|https://...|
+------+--------+----+---+-----+----------+----+--------+-----------+--------+--------+--------+----------+-----------+
get_swiss_models_raw_data(uniProtIds)[source]

Downloads the raw metadata for SWISS-MODEL homology models. This dataset is in the original data schema as downloaded from SWISS-MODEL.

Parameters:

uniProtIds : list

list of UniProt Ids

Returns:

dataset

SwissModel dataset in original data schema