secondaryStructureExtractor.py
Creates a dataset of DSSP secondary structure assignments. The dataset includes protein sequence, the DSSP 3-state (Q3) and 8-state (Q8) assignments, and the fraction of alpha, beta, and coil within a chain. The input to this class must be a single protein chain.
get dataset of secondary structure assignments:
>>> pdb.flatMapToPair(new StructureToPolymerChains())
... .filter(new ContainsLProteinChain())
>>> secStruct = SecondaryStructureExtractor.getDataset(pdb)
>>> secStruct.show(10)